Patent application title: METHODS FOR GENERATING DATABASES AND DATABASES FOR IDENTIFYING POLYMORPHIC GENETIC MARKERS
Inventors:
IPC8 Class: AC12Q168FI
USPC Class:
Class name:
Publication date: 2015-01-01
Patent application number: 20150005194
Abstract:
Processes and methods for creating a database of genomic samples from
healthy human donors, methods that use the database to identify and
correlate polymorphic genetic markers and other markers with diseases and
conditions are provided.Claims:
1. A method for determining whether a polymorphism correlates with a gene
or pathway involved in the onset and progression of disease in males or
females, comprising: a) obtaining samples from healthy male and female
individuals; b) pooling the samples from healthy male individuals within
an age range to obtain pools from younger and older male populations; c)
pooling the samples from healthy female individuals within an age range
to obtain pools from younger and older female populations; d) determining
the frequency of the polymorphism in said pooled samples using a mass
spectrometer; e) determining a difference in frequency of the
polymorphism in the pooled samples obtained from younger and older female
populations or younger and older male populations; f) determining a
difference in the frequency of the polymorphism of step e) between male
and female populations; and g) associating the polymorphism with a
disease or a biochemical pathway that occurs with high frequency in
younger or older male or female populations.
2. The method of claim 1, wherein the mass spectrometric format is selected from the group consisting of Matrix Assisted Laser Desorption/Ionization, Time of Flight (MALDI TOF), Electrospray (ES); IR-MALDI, Ion Cyclotron Resonance (ICR), Fourier Transform, and combinations thereof.
3. The method of claim 1, wherein the frequency of the polymorphism in said samples is in a database, and said database is sorted to identify correlations between the frequency of said polymorphism in older and younger male and female populations.
4. The method of claim 1, wherein the polymorphism comprises a SNP.
5. The method of claim 1, further comprising identifying the locus of said polymorphism and assessing or deducing the function of a gene at said locus.
6. The method of claim 1, wherein said sample comprises body tissue or fluid from said individual.
7. The method of claim 1, wherein said sample comprises DNA.
8. The method of claim 1, wherein said method comprises obtaining genomic nucleic acid from a sample from a healthy individual.
9. The method of claim 1, further comprising amplifying a portion of the genomic nucleic acid to produce amplified fragments thereof.
10. The method of claim 1, wherein the polymorphism is identified by a method comprising primer oligo base extension.
11. The method of claim 10, wherein the primer oligo base extension comprises hybridizing a nucleic acid molecule from a sample from a healthy individual with a primer oligonucleotide that is complementary to the nucleic acid molecule at a site adjacent to the polymorphic marker.
12. The method of claim 11, further comprising optionally immobilizing the nucleic acid molecule onto a solid support, to produce an immobilized nucleic acid molecule; contacting the optionally-immobilized nucleic acid molecule with a composition comprising a dideoxynucleoside triphosphate or a 3'-deoxynucleoside triphosphate and a polymerase, so that only a dideoxynucleoside or 3'-deoxynucleoside triphosphate that is complementary to the polymorphic marker is extended onto the primer; and detecting the extended primer, thereby identifying the polymorphism.
13. The method of claim 1, wherein the polymorphism is identified by a method comprising: identifying samples by sorting a database comprising datapoints representative of a plurality of individuals from whom biological samples are obtained, wherein each datapoint is associated with data representative of the organism type and other identifying information, wherein said database is sorted according to a selected parameter to identify samples that match the selected parameter; isolating a nucleic acid molecule from each identified sample; pooling each isolated nucleic acid molecule; and identifying the polymorphism in the nucleic acid molecule by a method comprising primer oligo base extension.
14. The method of claim 1, wherein the polymorphism is identified by a method comprising: identifying samples by sorting a database comprising datapoints representative of a plurality of individuals from whom biological samples are obtained, wherein each datapoint is associated with data representative of the organism type and other identifying information, wherein said database is sorted according to a selected parameter to identify samples that match the selected parameter; isolating a biopolymer from each identified sample; pooling each isolated biopolymer; cleaving the pooled biopolymers to produce fragments thereof; obtaining a mass spectrum of the resulting fragments and comparing the mass spectrum with a control mass spectrum to identify differences between the spectra and thereby identifying any polymorphisms; wherein: the control mass spectrum is obtained from either samples represented by datapoints in said database that were not selected by sorting said database; or samples identified by sorting said database according to a different selected parameter.
15. The method of claim 1, wherein the polymorphism is identified by a method comprising: isolating a biopolymer from samples of body tissue or fluid from older and younger male and female populations; pooling each isolated biopolymer; cleaving the pooled biopolymers to produce fragments thereof; obtaining a mass spectrum of the resulting fragments; determining a frequency of each fragment, whereby an average frequency is calculated; and comparing the frequency of each fragment to identify fragments present in amounts lower than the average frequency, thereby identifying the polymorphism.
16. The method of claim 14, wherein the biopolymers comprise genomic nucleic acid molecules.
Description:
RELATED APPLICATIONS
[0001] This application is a continuation application of U.S. patent application Ser. No. 13/536,807, filed Jun. 28, 2012, to Andreas Braun, Hubert Koster, Dirk Van Den Boom, Ping Yip, Charles Rodi, Liyan He, Norman Chiu, and Christiane Jurinke, and entitled "METHODS FOR GENERATING DATABASES AND DATABASES FOR IDENTIFYING POLYMORPHIC GENETIC MARKERS," which is a continuation application of U.S. patent application Ser. No. 12/643,933, filed Dec. 21, 2009, to Andreas Braun, Hubert Koster, Dirk Van Den Boom, Ping Yip, Charles Rodi, Liyan He, Norman Chiu, and Christiane Jurinke, and entitled "METHODS FOR GENERATING DATABASES AND DATABASES FOR IDENTIFYING POLYMORPHIC GENETIC MARKERS," which is a continuation application of U.S. patent application Ser. No. 10/273,321, filed Oct. 15, 2002, to Andreas Braun, Hubert Koster, Dirk Van den Boom, Yip Ping, Charles Rodi, Liyan He, Norman Chiu and Christian Jurinke and entitled "METHODS FOR GENERATING DATABASES AND DATABASES FOR IDENTIFYING POLYMORPHIC GENETIC MARKERS;" which issued as U.S. Pat. No. 7,668,658, which is a divisional application of U.S. patent application Ser. No. 09/687,483, filed Oct. 13, 2000, to Andreas Braun, Hubert Koster, Dirk Van den Boom, Yip Ping, Charles Rodi, Liyan He, Norman Chiu and Christian Jurinke, entitled "METHODS FOR GENERATING DATABASES AND DATABASES FOR IDENTIFYING POLYMORPHIC GENETIC MARKERS;" which is a continuation-in-part of U.S. application Ser. No. 09/663,968, to Ping Yip, filed Sep. 19, 2000, entitled "METHOD AND DEVICE FOR IDENTIFYING A BIOLOGICAL SAMPLE," which issued as U.S. Pat. No. 7,917,301.
[0002] Benefit of priority under 35 U.S.C. ยง119(e) to the following provisional applications is claimed herein:
U.S. provisional application Ser. No. 60/217,658 to Andreas Braun, Hubert Koster; Dirk Van den Boom, filed Jul. 10, 2000, entitled "METHODS FOR GENERATING DATABASES AND DATABASES FOR IDENTIFYING POLYMORPHIC GENETIC MARKERS"; U.S. provisional application Ser. No. 60/159,176 to Andreas Braun, Hubert Koster, Dirk Van den Boom, filed Oct. 13, 1999, entitled "METHODS FOR GENERATING DATABASES AND DATABASES FOR IDENTIFYING POLYMORPHIC GENETIC MARKERS"; U.S. provisional application Ser. No. 60/217,251, filed Jul. 10, 2000, to Andreas Braun, entitled "POLYMORPHIC KINASE ANCHOR PROTEIN GENE SEQUENCES, POLYMORPHIC KINASE ANCHOR PROTEINS AND METHODS OF DETECTING POLYMORPHIC KINASE ANCHOR PROTEINS AND NUCLEIC ACIDS ENCODING THE SAME."
[0003] The above-noted applications and provisional applications are incorporated by reference in their entirety.
FIELD OF THE INVENTION
[0004] Process and methods for creating a database of genomic samples from healthy human donors. Methods that use the database to identify and correlate with polymorphic genetic markers and other markers with diseases and conditions are provided.
BACKGROUND
[0005] Diseases in all organisms have a genetic component, whether inherited or resulting from the body's response to environmental stresses, such as viruses and toxins. The ultimate goal of ongoing genomic research is to use this information to develop new ways to identify, treat and potentially cure these diseases. The first step has been to screen disease tissue and identify genomic changes at the level of individual samples. The identification of these "disease" markers has then fueled the development and commercialization of diagnostic tests that detect these errant genes or polymorphisms. With the increasing numbers of genetic markers, including single nucleotide polymorphisms (SNPs), microsatellites, tandem repeats, newly mapped introns and exons, the challenge to the medical and pharmaceutical communities is to identify genotypes which not only identify the disease but also follow the progression of the disease and are predictive of an organism's response to treatment.
[0006] Currently the pharmaceutical and biotechnology industries find a disease and then attempt to determine the genomic basis for the disease. This approach is time consuming and expensive and in many cases involves the investigator guessing as to what pathways might be involved in the disease.
[0007] Genomics
[0008] Presently the two main strategies employed in analyzing the available genomic information are the technology driven reverse genetics brute force strategy and the knowledge-based pathway oriented forward genetics strategy. The brute force approach yields large databases of sequence information but little information about the medical or other uses of the sequence information. Hence this strategy yields intangible products of questionable value. The knowledge-based strategy yields small databases that contain a lot of information about medical uses of particular DNA sequences and other products in the pathway and yield tangible products with a high value.
[0009] Polymorphisms
[0010] Polymorphisms have been known since 1901 with the identification of blood types. In the 1950's they were identified on the level of proteins using large population genetic studies. In the 1980's and 1990's many of the known protein polymorphisms were correlated with genetic loci on genomic DNA. For example, the gene dose of the apolipoprotein E type 4 allele was correlated with the risk of Alzheimer's disease in late onset families (see, e.g., Corder et al. (1993) Science 261: 921-923; mutation in blood coagulation factor V was associated with resistance to activated protein C (see, e.g., Bertina et al. (1994) Nature 369:64-67); resistance to HIV-1 infection has been shown in caucasian individuals bearing mutant alleles of the CCR-5 chemokine receptor gene (see, e.g., Samson et al. (1996) Nature 382:722-725); and a hypermutable tract in antigen presenting cells (APC, such as macrophages), has been identified in familial colorectal cancer in individuals of Ashkenzi jewish background (see, e.g., Laken et al. (1997) Nature Genet. 17:79-83). There can be more than three million polymorphic sites in the human genome. Many have been identified, but not yet characterized or mapped or associated with a marker.
[0011] Single Nucleotide Polymorphisms (SNPs)
[0012] Much of the focus of genomics has been in the identification of SNPs, which are important for a variety of reasons. They allow indirect testing (association of haplotypes) and direct testing (functional variants). They are the most abundant and stable genetic markers. Common diseases are best explained by common genetic alterations, and the natural variation in the human population aids in understanding disease, therapy and environmental interactions.
[0013] Currently, the only available method to identify SNPs in DNA is by sequencing, which is expensive, difficult and laborious. Furthermore, once a SNP is discovered it must be validated to determine if it is a real polymorphism and not a sequencing error. Also, discovered SNPs must then be evaluated to determine if they are associated with a particular phenotype. Thus, there is a need to develop new paradigms for identifying the genomic basis for disease and markers thereof. Therefore, it is an object herein to provide methods for identifying the genomic basis of disease and markers thereof.
SUMMARY
[0014] Databases and methods using the databases are provided herein. The databases comprise sets of parameters associated with subjects in populations selected only on the basis of being healthy (i.e., where the subjects are mammals, such as humans, they are selected based upon apparent health and no detectable infections). The databases can be sorted based upon one or more of the selected parameters.
[0015] The databases, for example, can be relational databases, in which an index that represents each subject serves to relate parameters, which are the data, such as age, ethnicity, sex, medical history, etc. and ultimately genotypic information, that was inputted into and stored in the database. The database can then be sorted according to these parameters. Initially, the parameter information is obtained from a questionnaire answered by each subject from whom a body tissue or body fluid sample is obtained. As additional information about each sample is obtained, this information can be entered into the database and can serve as a sorting parameter.
[0016] The databases obtained from healthy individuals have numerous uses, such as correlating known polymorphisms with a phenotype or disease. The databases can be used to identify alleles that are deleterious, that are beneficial, and that are correlated with diseases.
[0017] For purposes herein, genotypic information can be obtained by any method known to those of skill in the art, but is generally obtained using mass spectrometry.
[0018] Also provided herein, is a new use for existing databases of subjects and genotypic and other parameters, such as age, ethnicity, race, and gender. Any database can be sorted according to the methods herein, and alleles that exhibit statistically significant correlations with any of the sorting parameters can be identified. It is noted, however, is noted, that the databases provided herein and randomly selected databases will perform better in these methods, since disease-based databases suffer numerous limitations, including their relatively small size, the homogeneity of the selected disease population, and the masking effect of the polymorphism associated with the markers for which the database was selected. Hence, the healthy database provided herein, provides advantages not heretofore recognized or exploited. The methods provided herein can be used with a selected database, including disease-based databases, with or without sorting for the discovery and correlation of polymorphisms. In addition, the databases provided herein represent a greater genetic diversity than the unselected databases typically utilized for the discovery of polymorphisms and thus allow for the enhanced discovery and correlation of polymorphisms.
[0019] The databases provided herein can be used for taking an identified polymorphism and ascertaining whether it changes in frequency when the data are sorted according to a selected parameter.
[0020] One use of these methods is correlating a selected marker with a particular parameter by following the occurrence of known genetic markers and then, having made this correlation, determining or identifying correlations with diseases. Examples of this use are p53 and Lipoprotein Lipase polymorphism. As exemplified herein, known markers are shown to have particular correlation with certain groups, such as a particular ethnicity or race or one sex. Such correlations will then permit development of better diagnostic tests and treatment regimens.
[0021] These methods are valuable for identifying one or more genetic markers whose frequency changes within the population as a function of age, ethnic group, sex or some other criteria. This can allow the identification of previously unknown polymorphisms and ultimately a gene or pathway involved in the onset and progression of disease.
[0022] The databases and methods provided herein permit, among other things, identification of components, particularly key components, of a disease process by understanding its genetic underpinnings and also permit an understanding of processes, such as individual drug responses. The databases and methods provided herein also can be used in methods involving elucidation of pathological pathways, in developing new diagnostic assays, identifying new potential drug targets, and in identifying new drug candidates.
[0023] The methods and databases can be used with experimental procedures, including, but are not limited to, in silico SNP identification, in vitro SNP identification/verification, genetic profiling of large populations, and in biostatistical analyses and interpretations.
[0024] Also provided herein, are combinations that contain a database provided herein and a biological sample from a subject in the database, and typically biological samples from all subjects or a plurality of subjects in the database. Collections of the tissue and body fluid samples are also provided.
[0025] Also, provided herein, are methods for determining a genetic marker that correlates with age, comprising identifying a polymorphism and determining the frequency of the polymorphism with increasing age in a healthy population.
[0026] Further provided herein are methods for determining whether a genetic marker correlates with susceptibility to morbidity, early mortality, or morbidity and early mortality, comprising identifying a polymorphism and determining the frequency of the polymorphism with increasing age in a healthy population.
[0027] Any of the methods herein described can be used out in a multiplex format.
[0028] Also provided are an apparatus and process for accurately identifying genetic information. It is another object herein that genetic information be extracted from genetic data in a highly automated manner. Therefore, to overcome the deficiencies in the known conventional systems, methods and apparatus for identifying a biological sample are provided.
[0029] Briefly, the method and system for identifying a biological sample generates a data set indicative of the composition of the biological sample. In a particular example, the data set is DNA spectrometry data received from a mass spectrometer. The data set is denoised, and a baseline is deleted. Since possible compositions of the biological sample can be known, expected peak areas can be determined. Using the expected peak areas, a residual baseline is generated to further correct the data set. Probable peaks are then identifiable in the corrected data set, which are used to identify the composition of the biological sample. In a disclosed example, statistical methods are employed to determine the probability that a probable peak is an actual peak, not an actual peak, or that the data too inconclusive to call.
[0030] Advantageously, the method and system for identifying a biological sample accurately makes composition calls in a highly automated manner. In such a manner, complete SNP profile information, for example, can be collected efficiently. More importantly, the collected data are analyzed with highly accurate results. For example, when a particular composition is called, the result can be relied upon with great confidence. Such confidence is provided by the robust computational process employed.
DESCRIPTION OF THE DRAWINGS
[0031] FIG. 1 depicts an exemplary sample bank. FIG. 1A shows the samples as a function of sex and ethnicity. FIG. 1B shows the caucasians as a function of age. FIG. 1C shows the Hispanics as a function of age.
[0032] FIGS. 2A and 2C show an age- and sex-distribution of the 291S allele of the lipoprotein lipase gene in which a total of 436 males and 589 females were investigated. FIG. 2B shows an age distribution for the 436 males.
[0033] FIG. 3 is an exemplary questionnaire for population-based sample banking.
[0034] FIG. 4 depicts processing and tracking of blood sample components.
[0035] FIG. 5 depicts the allelic frequency of "sick" alleles and "healthy" alleles as a function of age. It is noted that the relative frequency of healthy alleles increases in a population with increasing age.
[0036] FIG. 6 depicts the age-dependent distribution of ApoE genotypes (see, Schachter et al. (1994) Nature Genetics 6:29-32).
[0037] FIG. 7AA-7D depicts age-related and genotype frequency of the p53 (tumor suppressor) codon 72 among the caucasian population in the database. *R72 and *P72 represent the frequency of the allele in the database population. R72, R72P, and P72 represent the genotypes of the individuals in the population. The frequency of the homozygous P72 allele drops from 6.7% to 3.7% with age.
[0038] FIG. 8 depicts the allele and genotype frequencies of the p21 S31R allele as a function of age.
[0039] FIG. 9 depicts the frequency of the FVII Allele 353Q in pooled versus individual samples.
[0040] FIG. 10 depicts the frequency of the CETP (cholesterol ester transfer protein) allele in pooled versus individual samples.
[0041] FIG. 11 depicts the frequency of the plasminogen activator inhibitor-1 (PAI-1) 5G in pooled versus individual samples.
[0042] FIG. 12 shows mass spectra of the samples and the ethnic diversity of the PAI-1 alleles.
[0043] FIGS. 12A-12D show mass spectra of the samples and the ethnic diversity of the PAI-1 alleles.
[0044] FIGS. 13A-13D show mass spectra of the samples and the ethnic diversity of the CETP 405 alleles.
[0045] FIGS. 14A-14D show mass spectra of the samples and the ethnic diversity of the Factor VII 353 alleles.
[0046] FIG. 16 shows the p53-Rb pathway and the relationships among the various factors in the pathway.
[0047] FIG. 17, which is a block diagram of a computer constructed to provide and process the databases described herein, depicts a typical computer system for storing and sorting the databases provided herein and practicing the methods provided herein.
[0048] FIG. 18 is a flow diagram that illustrates the processing steps performed using the computer illustrated in FIG. 17, to maintain and provide access to the databases for identifying polymorphic genetic markers.
[0049] FIG. 19 is a histogram showing the allele and genotype distribution in the age and sex stratified Caucasian population for the AKAP10-1 locus. Bright green bars show frequencies in individuals younger than 40 years. Dark green bars show frequencies in individuals older than 60 years.
[0050] FIG. 20 is a histogram showing the allele and genotype distribution in the age and sex stratified Caucasian population for the AKAP10-5 locus. Bright green bars show frequencies in individuals younger than 40 years; dark green bars show frequencies in individuals older than 60 years.
[0051] FIG. 21 is a histogram showing the allele and genotype distribution in the age and sex stratified Caucasian population for the h-msrA locus. Genotype difference between male age groups is significant. Bright green bars show frequencies in individuals younger than 40 years. Dark green bars show frequencies in individuals older than 60 years.
[0052] FIG. 22A-D is a sample data collection questionnaire used for the healthy database.
[0053] FIG. 23 is a flowchart showing processing performed by the computing device of FIG. 24 when performing genotyping of sense strands and antisense strands from assay fragments.
[0054] FIG. 24 is a block diagram showing a system provided herein;
[0055] FIG. 25 is a flowchart of a method of identifying a biological sample provided herein;
[0056] FIG. 26 is a graphical representation of data from a mass spectrometer;
[0057] FIG. 27 is a diagram of wavelet transformation of mass spectrometry data;
[0058] FIG. 28 is a graphical representation of wavelet stage 0 hi data;
[0059] FIG. 29 is a graphical representation of stage 0 noise profile;
[0060] FIG. 30 is a graphical representation of generating stage noise standard deviations;
[0061] FIG. 31 is a graphical representation of applying a threshold to data stages;
[0062] FIG. 32 is a graphical representation of a sparse data set;
[0063] FIG. 33 is a formula for signal shifting;
[0064] FIG. 34 is a graphical representation of a wavelet transformation of a denoised and shifted signal;
[0065] FIG. 35 is a graphical representation of a denoised and shifted signal;
[0066] FIG. 36 is a graphical representation of removing peak sections;
[0067] FIG. 37 is a graphical representation of generating a peak free signal;
[0068] FIG. 38 is a block diagram of a method of generating a baseline correction;
[0069] FIG. 39 is a graphical representation of a baseline and signal;
[0070] FIG. 40 is a graphical representation of a signal with baseline removed;
[0071] FIG. 41 is a table showing compressed data;
[0072] FIG. 42 is a flowchart of method for compressing data;
[0073] FIG. 43 is a graphical representation of mass shifting;
[0074] FIG. 44 is a graphical representation of determining peak width;
[0075] FIG. 45 is a graphical representation of removing peaks;
[0076] FIG. 46 is a graphical representation of a signal with peaks removed;
[0077] FIG. 47 is a graphical representation of a residual baseline;
[0078] FIG. 48 is a graphical representation of a signal with residual baseline removed;
[0079] FIG. 49 is a graphical representation of determining peak height;
[0080] FIG. 50 is a graphical representation of determining signal-to-noise for each peak;
[0081] FIG. 51 is a graphical representation of determining a residual error for each peak;
[0082] FIG. 52 is a graphical representation of peak probabilities;
[0083] FIG. 53 is a graphical representation of applying an allelic ratio to peak probability;
[0084] FIG. 54 is a graphical representation of determining peak probability;
[0085] FIG. 55 is a graphical representation of calling a genotype;
[0086] FIG. 56 is a flowchart showing a statistical procedure for calling a genotype;
[0087] FIG. 57 is a flowchart showing processing performed by the computing device of FIG. 1 when performing standardless genotyping; and
[0088] FIG. 58 is graphical representation of applying an allelic ratio to peak probability for standardless genotype processing.
DETAILED DESCRIPTION
Definitions
[0089] Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of ordinary skill in the art to which this invention belongs. All patents, applications, published applications and other publications and sequences from GenBank and other databases referred to herein throughout the disclosure are incorporated by reference in their entirety.
[0090] As used herein, a biopolymer includes, but is not limited to, nucleic acid, proteins, polysaccharides, lipids and other macromolecules. Nucleic acids include DNA, RNA, and fragments thereof. Nucleic acids can be derived from genomic DNA, RNA, mitochondrial nucleic acid, chloroplast nucleic acid and other organelles with separate genetic material.
[0091] As used herein, morbidity refers to conditions, such as diseases or disorders, that compromise the health and well-being of an organism, such as an animal. Morbidity susceptibility or morbidity-associated genes are genes that, when altered, for example, by a variation in nucleotide sequence, facilitate the expression of a specific disease clinical phenotype. Thus, morbidity susceptibility genes have the potential, upon alteration, of increasing the likelihood or general risk that an organism will develop a specific disease.
[0092] As used herein, mortality refers to the statistical likelihood that an organism, particularly an animal, will not survive a full predicted lifespan. Hence, a trait or a marker, such as a polymorphism, associated with increased mortality is observed at a lower frequency in older than younger segments of a population.
[0093] As used herein, a polymorphism, e.g. genetic variation, refers to a variation in the sequence of a gene in the genome amongst a population, such as allelic variations and other variations that arise or are observed. Thus, a polymorphism refers to the occurrence of two or more genetically determined alternative sequences or alleles in a population. These differences can occur in coding and non-coding portions of the genome, and can be manifested or detected as differences in nucleic acid sequences, gene expression, including, for example transcription, processing, translation, transport, protein processing, trafficking, DNA synthesis, expressed proteins, other gene products or products of biochemical pathways or in post-translational modifications and any other differences manifested amongst members of a population. A single nucleotide polymorphism (SNP) refers to a polymorphism that arises as the result of a single base change, such as an insertion, deletion or change in a base.
[0094] A polymorphic marker or site is the locus at which divergence occurs. Such site can be as small as one base pair (an SNP). Polymorphic markers include, but are not limited to, restriction fragment length polymorphisms, variable number of tandem repeats (VNTR's), hypervariable regions, minisatellites, dinucleotide repeats, trinucleotide repeats, tetranucleotide repeats and other repeating patterns, simple sequence repeats and insertional elements, such as Alu. Polymorphic forms also are manifested as different mendelian alleles for a gene. Polymorphisms can be observed by differences in proteins, protein modifications, RNA expression modification, DNA and RNA methylation, regulatory factors that alter gene expression and DNA replication, and any other manifestation of alterations in genomic nucleic acid or organelle nucleic acids.
[0095] As used herein, a healthy population refers to a population of organisms, including but are not limited to, animals, bacteria, viruses, parasites, plants, eubacteria, and others, that are disease free. The concept of disease-free is a function of the selected organism. For example, for mammals it refers to a subject not manifesting any disease state. Practically a healthy subject, when human, is defined as human donor who passes blood bank criteria to donate blood for eventual use in the general population. These criteria are as follows: free of detectable viral, bacterial, mycoplasma, and parasitic infections; not anemic; and then further selected based upon a questionnaire regarding history (see FIG. 3). Thus, a healthy population represents an unbiased population of sufficient health to donate blood according to blood bank criteria, and not further selected for any disease state. Typically such individuals are not taking any medications. For plants, for example, it is a plant population that does not manifest diseases pathology associated with plants. For bacteria it is a bacterial population replicating without environmental stress, such as selective agents, heat and other pathogens.
[0096] As used herein, a healthy database (or healthy patient database) refers to a database of profiles of subjects that have not been pre-selected for any particular disease. Hence, the subjects that serve as the source of data for the database are selected, according to predetermined criteria, to be healthy. In contrast to other such databases that have been pre-selected for subjects with a particular disease or other characteristic, the subjects for the database provided herein are not so-selected. Also, if the subjects do manifest a disease or other condition, any polymorphism discovered or characterized should be related to an independent disease or condition. In a one embodiment, where the subjects are human, a healthy subject manifests no disease symptoms and meets criteria, such as those set by blood banks for blood donors.
[0097] Thus, the subjects for the database are a population of any organism, including, but are not limited to, animals, plants, bacteria, viruses, parasites and any other organism or entity that has nucleic acid. Among subjects are mammals, such as, although not necessarily, humans. Such a database can capture the diversity of a population, thus providing for discovery of rare polymorphisms.
[0098] As used herein, a profile refers to information relating to, but not limited to and not necessarily including all of, age, sex, ethnicity, disease history, family history, phenotypic characteristics, such as height and weight and other relevant parameters. A sample collect information form is shown in FIG. 22, which illustrates profile intent.
[0099] As used herein, a disease state is a condition or abnormality or disorder that can be inherited or result from environmental stresses, such as toxins, bacterial, fungal and viral infections.
[0100] As used herein, set of non-selected subjects means that the subjects have not been pre-selected to share a common disease or other characteristic. They can be selected to be healthy as defined herein.
[0101] As used herein, a phenotype refers to a set of parameters that includes any distinguishable trait of an organism. A phenotype can be physical traits and can be, in instances in which the subject is an animal, a mental trait, such as emotional traits. Some phenotypes can be determined by observation elicited by questionnaires (see, e.g., FIGS. 3 and 22) or by referring to prior medical and other records. For purposes herein, a phenotype is a parameter around which the database can be sorted.
[0102] As used herein, a parameter is any input data that will serve as a basis for sorting the database. These parameters will include phenotypic traits, medical histories, family histories and any other such information elicited from a subject or observed about the subject. A parameter can describe the subject, some historical or current environmental or social influence experienced by the subject, or a condition or environmental influence on someone related to the subject. Paramaters include, but are not limited to, any of those described herein, and known to those of skill in the art.
[0103] As used herein, haplotype refers to two or polymorphism located on a single DNA strand. Hence, haplotyping refers to identification of two or more polymorphisms on a single DNA strand. Haplotypes can be indicative of a phenotype. For some disorders a single polymorphism can suffice to indicate a trait; for others a plurality (i.e., a haplotype) can be needed. Haplotyping can be performed by isolating nucleic acid and separating the strands. In addition, when using enzymes such a certain nucleases, that produce, different size fragments from each strand, strand separation is not needed for haplotyping.
[0104] As used herein, pattern with reference to a mass spectrum or mass spectrometric analyses, refers to a characteristic distribution and number of signals (such peaks or digital representations thereof).
[0105] As used herein, signal in the context of a mass spectrum and analysis thereof refers to the output data, which the number or relative number of molecules having a particular mass. Signals include "peaks" and digital representations thereof.
[0106] As used herein, adaptor, when used with reference to haplotyping using Fen ligase, refers to a nucleic acid that specifically hybridizes to a polymorphism of interest. An adaptor can be partially double-stranded. An adaptor complex is formed when an adaptor hybridizes to its target.
[0107] As used herein, a target nucleic acid refers to any nucleic acid of interest in a sample. It can contain one or more nucleotides.
[0108] As used herein, standardless analysis refers to a determination based upon an internal standard. For example, the frequency of a polymorphism can be determined herein by comparing signals within a single mass spectrum.
[0109] As used herein, amplifying refers to methods for increasing the amount of a bipolymer, especially nucleic acids. Based on the 5' and 3' primers that are chosen, amplification also serves to restrict and define the region of the genome which is subject to analysis. Amplification can be performed by any method known to those skilled in the art, including use of the polymerase chain reaction (PCR) etc. Amplification, e.g., PCR must be done quantitatively when the frequency of polymorphism is required to be determined.
[0110] As used herein, cleaving refers to non-specific and specific fragmentation of a biopolymer.
[0111] As used herein, multiplexing refers to the simultaneous detection of more than one polymorphism. Methods for performing multiplexed reactions, particularly in conjunction with mass spectrometry are known (see, e.g., U.S. Pat. Nos. 6,043,031, 5,547,835 and International PCT application No. WO 97/37041).
[0112] As used herein, reference to mass spectrometry encompasses any suitable mass spectrometric format known to those of skill in the art. Such formats include, but are not limited to, Matrix-Assisted Laser Desorption/Ionization, Time-of-Flight (MALDI-TOF), Electrospray (ES), IR-MALDI (see, e.g., published International PCT application No. 99/57318 and U.S. Pat. No. 5,118,937), Ion Cyclotron Resonance (ICR), Fourier Transform and combinations thereof. MALDI, particular UV and IR, are among the formats contemplated.
[0113] As used herein, mass spectrum refers to the presentation of data obtained from analyzing a biopolymer or fragment thereof by mass spectrometry either graphically or encoded numerically.
[0114] As used herein, a blood component is a component that is separated from blood and includes, but is not limited to red blood cells and platelets, blood clotting factors, plasma, enzymes, plasminogen, immunoglobulins. A cellular blood component is a component of blood, such as a red blood cell, that is a cell. A blood protein is a protein that is normally found in blood. Examples of such proteins are blood factors VII and VIII. Such proteins and components are well-known to those of skill in the art.
[0115] As used herein, plasma can be prepared by any method known to those of skill in the art. For example, it can be prepared by centrifuging blood at a force that pellets the red cells and forms an interface between the red cells and the buffy coat, which contains leukocytes, above which is the plasma. For example, typical platelet concentrates contain at least about 10% plasma.
[0116] Blood can be separated into its components, including, but not limited to, plasma, platelets and red blood cells by any method known to those of skill in the art. For example, blood can be centrifuged for a sufficient time and at a sufficient acceleration to form a pellet containing the red blood cells. Leukocytes collect primarily at the interface of the pellet and supernatant in the buffy coat region. The supernatant, which contains plasma, platelets, and other blood components, can then be removed and centrifuged at a higher acceleration, whereby the platelets pellet.
[0117] As used herein, p53 is a cell cycle control protein that assesses DNA damage and acts as a transcription factor regulation gene which control cell growth, DNA repair and apoptosis. The p53 mutations have been found in a wide variety of different cancers, including all of the different types of leukemia, with varying frequency. The loss of normal p53 functions results in genomic instability and uncontrolled growth of the host cell.
[0118] As used herein, p21 is a cyclin-dependent kinase inhibitor, associated with G1 phase arrest of normal cells. Expression triggers apoptosis or programmed cell death and has been associated with Wilms' tumor, a pediatric kidney cancer.
[0119] As used herein, Factor VII is a serine protease involved the extrinsic blood coagulation cascade. This factor is activated by thrombin and works with tissue factor (Factor III) in the processing of Factor X to Factor Xa. Evidence has supported an association between polymorphisms in the gene and increase Factor VII activity which can result in an elevated risk of ischemic cardiovascular disease including myocardial infarction.
[0120] As used herein, a relational database stores information in a form representative of matrices, such as two-dimensional tables, including rows and columns of data, or higher dimensional matrices. For example, in one embodiment, the relational database has separate tables each with a parameter. The tables are linked with a record number, which also acts as an index. The database can be searched or sorted by using data in the tables and is stored in any suitable storage medium, such as floppy disk, CD rom disk, hard drive or other suitable medium.
[0121] As used herein, a bar codes refers any array of optically readable marks of any desired size and shape that are arranged in a reference context or frame of, typically, although not necessarily, one or more columns and one or more rows. For purposes herein, the bar code refers to any symbology, not necessary "bar" but can include dots, characters or any symbol or symbols.
[0122] As used herein, symbology refers to an identifier code or symbol, such as a bar code, that is linked to a sample. The index will reference each such symbology. The symbology is any code known or designed by the user. The symbols are associated with information stored in the database. For example, each sample can be uniquely identified with an encoded symbology. The parameters, such as the answers to the questions and subsequent genotypic and other information obtained upon analysis of the samples is included in the database and associated with the symbology. The database is stored on any suitable recording medium, such as a hard drive, a floppy disk, a tape, a CD ROM, a DVD disk and any other suitable medium.
Databases
[0123] Human genotyping is currently dependent on collaborations with hospitals, tissues banks and research institutions that provide samples of disease tissue. This approach is based on the concept that the onset and/or progression of diseases can be correlated with the presence of a polymorphisms or other genetic markers. This approach does not consider that disease correlated with the presence of specific markers and the absence of specific markers. It is shown herein that identification and scoring of the appearance and disappearance of markers is possible only if these markers are measured in the background of healthy subjects where the onset of disease does not mask the change in polymorphism occurrence. Databases of information from disease populations suffer from small sample size, selection bias and heterogeneity. The databases provided herein from healthy populations solve these problems by permitting large sample bands, simple selection methods and diluted heterogeneity.
[0124] Provided herein are first databases of parameters, associated with non-selected, particularly healthy, subjects. Also provided are combinations of the databases with indexed samples obtained from each of the subjects. Further provided are databases produced from the first databases. These contain, in addition to the original parameters, information, such as genotypic information, including, but are not limited to, genomic sequence information, derived from the samples.
[0125] The databases, which are herein designated healthy databases, are so-designated because they are not obtained from subjects pre-selected for a particular disease. Hence, although individual members can have a disease, the collection of individuals is not selected to have a particular disease.
[0126] The subjects from whom the parameters are obtained comprise either a set of subjects who are randomly selected across, typically, all populations, or are pre-selected to be disease-free or healthy. As a result, the database is not selected to be representative of any pre-selected phenotype, genotype, disease or other characteristic. Typically the number of subjects from which the database is prepared is selected to produce statistically significant results when used in the methods provided herein. Generally, the number of subjects will be greater than 100, 200, and typically than 1000. The precise number can be empirically determined based upon the frequency of the parameter(s) that can be used to sort the database. Generally the population can have at least 50, at least 100, at least 200, at least 500, at least 1000, at least 5000 or at least 10,000 or more subjects.
[0127] Upon identification of a collection of subjects, information about each subject is recorded and associated with each subject as a database. The information associated with each of the subjects, includes, but is not limited to, information related to historical characteristics of the subjects, phenotypic characteristics and also genotypic characteristics, medical characteristics and any other traits and characteristics about the subject that can be determined. This information will serve as the basis for sorting the database.
[0128] In an exemplary embodiment, the subjects are mammals, such as humans, and the information relates to one or more of parameters, such as age, sex, medical history, ethnicity and any other factor. Such information, when the animals are humans, for example, can be obtained by a questionnaire and by observations about the individual, such as hair color, eye color and other characteristics. Genotypic information can be obtained from tissue or other body and body fluid samples from the subject.
[0129] The healthy genomic database can include profiles and polymorphisms from healthy individuals from a library of blood samples where each sample in the library is an individual and separate blood or other tissue sample. Each sample in the database is profiled as to the sex, age, ethnic group, and disease history of the donor.
[0130] The databases are generated by first identifying healthy populations of subjects and obtaining information about each subject that will serve as the sorting parameters for the database. This information can be entered into a storage medium, such as the memory of a computer.
[0131] The information obtained about each subject in a population used for generating the database is stored in a computer memory or other suitable storage medium. The information is linked to an identifier associated with each subject. Hence the database will identify a subject, for example by a datapoint representative of a bar code, and then all information, such as the information from a questionnaire, regarding the individual is associated with the datapoint. As the information is collected the database is generated.
[0132] Thus, for example, profile information, such as subject histories obtained from questionnaires, is collected in the database. The resulting database can be sorted as desired, using standard software, such as by age, sex and/or ethnicity. An exemplary questionnaire for subjects from whom samples are to be obtained is shown in FIGS. 22A-D. Each questionnaire, for example, can be identified by a bar code, particularly a machine readable bar code for entry into the database. After a subject provides data and is deemed to be healthy (i.e., meets standards for blood donation), the data in the questionnaire is entered into the database and is associated with the bar code. A tissue, cell or blood sample is obtained from the subject.
[0133] FIG. 4 exemplifies processing and tracking of blood sample components. Each component is tracked with a bar code, dated, is entered into the database and associated with the subject and the profile of the subject. Typically, the whole blood is centrifuged to produce plasma, red blood cells (which pellet) and leukocytes found in the buffy coat which layers in between. Various samples are obtained and coded with a bar code and stored for use as needed.
[0134] Samples are collected from the subjects. The samples include, but are not limited to, tissues, cells, and fluids, such as nucleic acid, blood, plasma, amniotic fluid, synovial fluid, urine, saliva, aqueous humor, sweat, sperm samples and cerebral spinal fluid. It is understood that the particular set of samples depends upon the organisms in the population.
[0135] Once samples are obtained the collection can be stored and, in some embodiments, each sample is indexed with an identifier, particularly a machine readable code, such as a bar code. For analyses, the samples or components of the samples, particularly biopolymers and small molecules, such as nucleic acids and/or proteins and metabolites, are isolated.
[0136] After samples are analyzed, this information is entered into the database in the memory of the storage medium and associated with each subject. This information includes, but is not limited to, genotypic information. Particularly, nucleic acid sequence information and other information indicative of polymorphisms, such as masses of PCR fragments, peptide fragment sequences or masses, spectra of biopolymers and small molecules and other indicia of the structure or function of a gene, gene product or other marker from which the existence of a polymorphism within the population can be inferred.
[0137] In an exemplary embodiment, a database can be derived from a collection of blood samples. For example, FIG. 1 (see, also FIG. 10) shows the status of a collection of over 5000 individual samples. The samples were processed in the laboratory following SOP (standard operating procedure) guidelines. Any standard blood processing protocol can be used.
[0138] For the exemplary database described herein, the following criteria were used to select subjects:
[0139] No testing is done for infectious agents.
[0140] Age: At least 17 years old
[0141] Weight: Minimum of 110 pounds
[0142] Permanently Disqualified:
[0143] History of hepatitis (after age 11)
[0144] Leukemia Lymphoma
[0145] Human immunodeficiency virus (HIV), AIDS
[0146] Chronic kidney disease
[0147] Temporarily Disqualified:
[0148] Pregnancy--until six weeks after delivery, miscarriage or abortion
[0149] Major surgery or transfusions--for one year
[0150] Mononucleosis--until complete recovery
[0151] Prior whole blood donation--for eight weeks
[0152] Antibiotics by injection for one week; by mouth, for forty-eight hours, except antibiotics for skin complexion;
[0153] 5 year Deferment:
[0154] Internal cancer and skin cancer if it has been removed, is healed and there is no recurrence These correspond to blood bank criteria for donating blood and represent a healthy population as defined herein for a human healthy database.
[0155] Structure of the Database
[0156] Any suitable database structure and format known to those of skill in the art can be employed. For example, a relational database is a an exemplary format in which data are stored as matrices or tables of the parameters linked by an indexer that identifies each subject. Software for preparing and manipulating, including sorting the database, can be readily developed or adapted from commercially available software, such as Microsoft Access.
[0157] Quality Control
[0158] Quality control procedures can be implemented. For example, after collection of samples, the quality of the collection in the bank can be assessed. For example, mix-up of samples can be checked by testing for known markers, such as sex. After samples are separated by ethnicity, samples are randomly tested for a marker associated with a particular ethnicity, such as HLA DQA1 group specific component, to assess whether the samples have been properly sorted by ethnic group. An exemplary sample bank is depicted in FIG. 4.
Obtaining Genotypic Data and Other Parameters for the Database
[0159] After informational and historical parameters are entered into the database, material from samples obtained from each subject, is analyzed. Analyzed material include proteins, metabolites, nucleic acids, lipids and any other desired constituent of the material. For example, nucleic acids, such as genomic DNA, can be analyzed by sequencing.
[0160] Sequencing can be performed using any method known to those of skill in the art. For example, if a polymorphism is identified or known, and it is desired to assess its frequency or presence among the subjects in the database, the region of interest from each sample can be isolated, such as by PCR or restriction fragments, hybridization or other suitable method known to those of skill in the art and sequenced. For purposes herein, sequencing analysis can be effected using mass spectrometry (see, e.g., U.S. Pat. Nos. 5,547,835, 5,622,824, 5,851,765, and 5,928,906). Nucleic acids also can be sequenced by hybridization (see, e.g., U.S. Pat. Nos. 5,503,980, 5,631,134, 5,795,714) and including analysis by mass spectrometry (see, U.S. application Ser. Nos. 08/419,994 and 09/395,409).
[0161] In other detection methods, it is necessary to first amplify prior to identifying the allelic variant. Amplification can be performed, e.g., by PCR and/or LCR, according to methods known in the art. In one embodiment, genomic DNA of a cell is exposed to two PCR primers and amplification for a number of cycles sufficient to produce the required amount of amplified DNA. In some embodiments, the primers are located between 150 and 350 base pairs apart.
[0162] Alternative amplification methods include: self sustained sequence replication (Guatelli, J. C. et al., 1990, Proc. Natl. Acad. Sci. U.S.A. 87:1874-1878), transcriptional amplification system (Kwoh, D. Y. et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:1173-1177), Q-Beta Replicase (Lizardi, P. M. et al., 1988, Bio/Technology 6:1197), or any other nucleic acid amplification method, followed by the detection of the amplified molecules using techniques well known to those of skill in the art. These detection schemes are especially useful for the detection of nucleic acid molecules if such molecules are present in very low numbers.
[0163] Nucleic acids also can be analyzed by detection methods and protocols, particularly those that rely on mass spectrometry (see, e.g., U.S. Pat. Nos. 5,605,798, 6,043,031, allowed copending U.S. application Ser. No. 08/744,481, U.S. application Ser. No. 08/990,851 and International PCT application No. WO 99/31278, International PCT application No. WO 98/20019). These methods can be automated (see, e.g., copending U.S. application Ser. No. 09/285,481 and published International PCT application No. PCT/US00/08111, which describes an automated process line). Among the methods of analysis herein are those involving the primer oligo base extension (PROBE) reaction with mass spectrometry for detection (described herein and elsewhere, see e.g., U.S. Pat. No. 6,043,031; see, also U.S. application Ser. Nos. 09/287,681, 09/287,682, 09/287,141 and 09/287,679, allowed copending U.S. application Ser. No. 08/744,481, International PCT application No. PCT/US97/20444, published as International PCT application No. WO 98/20019, and based upon U.S. application Ser. Nos. 08/744,481, 08/744,590, 08/746,036, 08/746,055, 08/786,988, 08/787,639, 08/933,792, 08/746,055, 08/786,988 and 08/787,639; see, also U.S. application Ser. No. 09/074,936, U.S. Pat. No. 6,024,925, and U.S. application Ser. Nos. 08/746,055 and 08/786,988, and published International PCT application No. WO 98/20020).
[0164] A chip based format in which the biopolymer is linked to a solid support, such as a silicon or silicon-coated substrate, such as in the form of an array, is among the formats for performing the analyses is. Generally, when analyses are performed using mass spectrometry, particularly MALDI, small nanoliter volumes of sample are loaded on, such that the resulting spot is about, or smaller than, the size of the laser spot. It has been found that when this is achieved, the results from the mass spectrometric analysis are quantitative. The area under the signals in the resulting mass spectra are proportional to concentration (when normalized and corrected for background). Methods for preparing and using such chips are described in U.S. Pat. No. 6,024,925, co-pending U.S. application Ser. Nos. 08/786,988, 09/364,774, 09/371,150 and 09/297,575; see, also U.S. application Serial No. PCT/US97/20195, which published as WO 98/20020. Chips and kits for performing these analyses are commercially available from SEQUENOM under the trademark MassARRAY. MassArray relies on the fidelity of the enzymatic primer extension reactions combined with the miniaturized array and MALDI-TOF (Matrix-Assisted Laser Desorption Ionization-Time of Flight) mass spectrometry to deliver results rapidly. It accurately distinguishes single base changes in the size of DNA fragments associated with genetic variants without tags.
[0165] The methods provided herein permit quantitative determination of alleles. The areas under the signals in the mass spectra can be used for quantitative determinations. The frequency is determined from the ratio of the signal to the total area of all of the spectrum and corrected for background. This is possible because of the PROBE technology as described in the above applications incorporated by reference herein.
[0166] Additional methods of analyzing nucleic acids include amplification-based methods including polymerase chain reaction (PCR), ligase chain reaction (LCR), mini-PCR, rolling circle amplification, autocatalytic methods, such as those using Qฮฒ replicase, TAS, 3SR, and any other suitable method known to those of skill in the art.
[0167] Other methods for analysis and identification and detection of polymorphisms, include but are not limited to, allele specific probes, Southern analyses, and other such analyses.
[0168] The methods described below provide ways to fragment given amplified or non-amplified nucleotide sequences thereby producing a set of mass signals when mass spectrometry is used to analyze the fragment mixtures.
Amplified fragments are yielded by standard polymerase chain methods (U.S. Pat. Nos. 4,683,195 and 4,683,202). The fragmentation method involves the use of enzymes that cleave single or double strands of DNA and enzymes that ligate DNA. The cleavage enzymes can be glycosylases, nickases, and site-specific and non site-specific nucleases, such as, but are not limited to, glycosylases, nickases and site-specific nucleases.
Glycosylase Fragmentation Method
[0169] DNA glycosylases specifically remove a certain type of nucleobase from a given DNA fragment. These enzymes can thereby produce abasic sites, which can be recognized either by another cleavage enzyme, cleaving the exposed phosphate backbone specifically at the abasic site and producing a set of nucleobase specific fragments indicative of the sequence, or by chemical means, such as alkaline solutions and or heat. The use of one combination of a DNA glycosylase and its targeted nucleotide would be sufficient to generate a base specific signature pattern of any given target region.
[0170] Numerous DNA glcosylases are known, For example, a DNA glycosylase can be uracil-DNA glycolsylase (UDG), 3-methyladenine DNA glycosylase, 3-methyladenine DNA glycosylase II, pyrimidine hydrate-DNA glycosylase, FaPy-DNA glycosylase, thymine mismatch-DNA glycosylase, hypoxanthine-DNA glycosylase, 5-Hydroxymethyluracil DNA glycosylase (HmUDG), 5-Hydroxymethylcytosine DNA glycosylase, or 1,N6-etheno-adenine DNA glycosylase (see, e.g., U.S. Pat. Nos. 5,536,649, 5,888, 795, 5,952,176 and 6,099,553, International PCT application Nos. WO 97/03210, WO 99/54501; see, also, Eftedal et al. (1993) Nucleic Acids Res 21:2095-2101, Bjelland and Seeberg (1987) Nucleic Acids Res. 15:2787-2801, Saparbaev et al. (1995) Nucleic Acids Res. 23:3750-3755, Bessho (1999) Nucleic Acids Res. 27:979-983) corresponding to the enzyme's modified nucleotide or nucleotide analog target. uracil-DNA glycolsylase (UDG) is an exemplary glycosylase.
[0171] Uracil, for example, can be incorporated into an amplified DNA molecule by amplifying the DNA in the presence of normal DNA precursor nucleotides (e.g. dCTP, dATP, and dGTP) and dUTP. When the amplified product is treated with UDG, uracil residues are cleaved. Subsequent chemical treatment of the products from the UDG reaction results in the cleavage of the phosphate backbone and the generation of nucleobase specific fragments. Moreover, the separation of the complementary strands of the amplified product prior to glycosylase treatment allows complementary patterns of fragmentation to be generated. Thus, the use of dUTP and Uracil DNA glycosylase allows the generation of T specific fragments for the complementary strands, thus providing information on the T as well as the A positions within a given sequence. Similar to this, a C-specific reaction on both (complementary) strands (i.e. with a C-specific glycosylase) yields information on C as well as G positions within a given sequence if the fragmentation patterns of both amplification strands are analyzed separately. Thus, with the glycosylase method and mass spectrometry, a full series of A, C, G and T specific fragmentation patterns can be analyzed.
Nickase Fragmentation Method
[0172] A DNA nickase, or DNase, can be used to recognize and cleave one strand of a DNA duplex. Numerous nickases are known. Among these, for example, are nickase NY2A nickase and NYS1 nickase (Megabase) with the following cleavage sites:
TABLE-US-00001 NY2A: 5' . . . R AG . . . 3' 3' . . . Y TC . . . 5' where R = A or G and Y = C or T NYS1: 5' . . . CC[A/G/T] . . . 3' 3' . . . GG[T/C/A] . . . 5'.
Fen-Ligase Fragmentation Method
[0173] The Fen-ligase method involves two enzymes: Fen-1 enzyme and a ligase. The Fen-1 enzyme is a site-specific nuclease known as a "flap" endonuclease (U.S. Pat. Nos. 5,843,669, 5,874,283, and 6,090,606). This enzyme recognizes and cleaves DNA "flaps" created by the overlap of two oligonucleotides hybridized to a target DNA strand. This cleavage is highly specific and can recognize single base pair mutations, permitting detection of a single homologue from an individual heterozygous at one SNP of interest and then genotyping that homologue at other SNPs occurring within the fragment. Fen-1 enzymes can be Fen-1 like nucleases e.g. human, murine, and Xenopus XPG enzymes and yeast RAD2 nucleases or Fen-1 endonucleases from, for example, M. jannaschii, P. furiosus, and P. woesei. Among such enzymes are the Fen-1 enzymes.
[0174] The ligase enzyme forms a phosphodiester bond between two double stranded nucleic acid fragments. The ligase can be DNA Ligase I or DNA Ligase III (see, e.g., U.S. Pat. Nos. 5,506,137, 5,700,672, 5,858,705 and 5,976,806; see, also, Waga, et al. (1994) J. Biol. Chem. 269:10923-10934, Li et al. (1994) Nucleic Acids Res. 22:632-638, Arrand et al. (1986) J. Biol. Chem. 261:9079-9082, Lehman (1974) Science 186:790-797, Higgins and Cozzarelli (1979) Methods Enzymol. 68:50-71, Lasko et al. (1990) Mutation Res. 236:277-287, and Lindahl and Barnes (1992) Ann. Rev. Biochem. 61:251-281).
[0175] Thermostable ligase (Epicenter Technologies), where "thermostable" denotes that the ligase retains activity even after exposure to temperatures necessary to separate two strands of DNA, are among the ligases for use herein.
Type IIS Enzyme Fragmentation Method
[0176] Restriction enzymes bind specifically to and cleave double-stranded DNA at specific sites within or adjacent to a particular recognition sequence. These enzymes have been classified into three groups (e.g. Types I, II, and III) as known to those of skill in the art. Because of the properties of type I and type III enzymes, they have not been widely used in molecular biological applications. Thus, for purposes herein type II enzymes are among those contemplated. Of the thousands of restriction enzymes known in the art, there are 179 different type II specificities. Of the 179 unique type II restriction endonucleases, 31 have a 4-base recognition sequence, 11 have a 5-base recognition sequence, 127 have a 6-base recognition sequence, and 10 have recognition sequences of greater than six bases (U.S. Pat. No. 5,604,098). Of category type II enzymes, type IIS is exemplified herein.
[0177] Type IIS enzymes can be Alw XI, Bbv I, Bce 83, Bpm I, Bsg I, Bsm AI, Bsm FI, Bsa I, Bcc I, Bcg I, Ear I, Eco 57I, Esp 3I, Fau I, Fok I, Gsu I, Hga I, Mme I, Mbo II, Sap I, and the otheres.
[0178] The Fok I enzyme endonuclease is an exemplary well characterized member of the Type IIS class (see, e.g., U.S. Pat. Nos. 5,714,330, 5,604,098, 5,436,150, 6,054,276 and 5,871,911; see, also, Szybalski et al. (1991) Gene 100:13-26, Wilson and Murray (1991) Ann. Rev. Genet. 25:585-627, Sugisaki et al. (1981) Gene 16:73-78, Podhajska and Szalski (1985) Gene 40:175-182. Fok I recognizes the sequence 5'GGATG-3' and cleaves DNA accordingly. Type IIS restriction sites can be introduced into DNA targets by incorporating the sites into primers used to amplify such targets. Fragments produced by digestion with Fok I are site specific and can be analyzed by mass spectrometry methods such as MALDI-TOF mass spectrometry, ESI-TOF mass spectrometry, and any other type of mass spectrometry well known to those of skill in the art.
[0179] Once a polymorphism has been found to correlate with a parameter such as age, age groups can be screened for polymorphisms. The possibility of false results due to allelic dropout is examined by doing comparative PCR in an adjacent region of the genome.
[0180] Analyses
[0181] In using the database, allelic frequencies can be determined across the population by analyzing each sample in the population individually, determining the presence or absence of allele or marker of interest in each individual sample, and then determining the frequency of the marker in the population. The database can then be sorted (stratified) to identify any correlations between the allele and a selected parameter using standard statistical analysis. If a correlation is observed, such as a decrease in a particular marker with age or correlation with sex or other parameter, then the marker is a candidate for further study, such as genetic mapping to identify a gene or pathway in which it is involved. The marker can then be correlated, for example, with a disease. Haplotying also can be carried out. Genetic mapping can be effected using standard methods and can also require use of databases of others, such as databases previously determined to be associated with a disorder.
[0182] Exemplary analyses have been performed and these are shown in the figures, and discussed herein.
[0183] Sample Pooling
[0184] It has been found that using the databases provided herein, or any other database of such information, substantially the same frequencies that were obtained by examining each sample separately can be obtained by pooling samples, such as in batches of 10, 20, 50, 100, 200, 500, 1000 or any other number. A precise number can be determined empirically if necessary, and can be as low as 3.
[0185] In one embodiment, the frequency of genotypic and other markers can be obtained by pooling samples. To do this a target population and a genetic variation to be assessed is selected, a plurality of samples of biopolymers are obtained from members of the population, and the biopolymer from which the marker or genotype can be inferred is determined or detected. A comparison of samples tested in pools and individually and the sorted results therefrom are shown in FIG. 9, which shows frequency of the factor VII Allele 353Q. FIG. 10 depicts the frequency of the CETP Allele in pooled versus individual samples. FIG. 15 shows ethnic diversity among various ethnic groups in the database using pooled DNA samples to obtain the data. FIGS. 12-14 show mass spectra for these samples.
[0186] Pooling of test samples has application not only to the healthy databases provided herein, but also to use in gathering data for entry into any database of subjects and genotypic information, including typical databases derived from diseased populations. What is demonstrated herein, is the finding that the results achieved are statistically the same as the results that would be achieved if each sample is analyzed separately. Analysis of pooled samples by a method, such as the mass spectrometric methods provided herein, permits resolution of such data and quantitation of the results.
[0187] For factor VII the R53Q acid polymorphism was assessed. In FIG. 9, the "individual" data represent allelic frequency observed in 92 individuals reactions. The pooled data represent the allelic frequency of the same 92 individuals pooled into a single probe reaction. The concentration of DNA in the samples of individual donors is 250 nanograms. The total concentration of DNA in the pooled samples is also 250 nanograms, where the concentration of any individual DNA is 2.7 nanograms.
[0188] It also was shown that it is possible to reduce the DNA concentration of individuals in a pooled samples from 2.7 nanograms to 0.27 nanograms without any change in the quality of the spectrum or the ability to quantitate the amount of sample detected. Hence low concentrations of sample can be used in the pooling methods.
Use of the Databases and Markers Identified Thereby
[0189] The successful use of genomics requires a scientific hypothesis (i.e., common genetic variation, such as a SNP), a study design (i.e., complex disorders), samples and technology, such as the chip-based mass spectrometric analyses (see, e.g., U.S. Pat. No. 5,605,798, U.S. Pat. No. 5,777,324, U.S. Pat. No. 6,043,031, allowed copending U.S. application Ser. No. 08/744,481, U.S. application Ser. No. 08/990,851, International PCT application No. WO 98/20019, copending U.S. application Ser. No. 09/285,481, which describes an automated process line for analyses; see, also, U.S. application Ser. Nos. 08/617,256, 09/287,681, 09/287,682, 09/287,141 and 09/287,679, allowed copending U.S. application Ser. No. 08/744,481, International PCT application No. PCT/US97/20444, published as International PCT application No. WO 98/20019, and based upon U.S. application Ser. Nos. 08/744,481, 08/744,590, 08/746,036, 08/746,055, 08/786,988, 08/787,639, 08/933,792, 08/746,055, 09/266,409, 08/786,988 and 08/787,639; see, also U.S. application Ser. No. 09/074,936). All of these aspects can be used in conjunction with the databases provided herein and samples in the collection.
[0190] The databases and markers identified thereby can be used, for example, for identification of previously unidentified or unknown genetic markers and to identify new uses for known markers. As markers are identified, these can be entered into the database to use as sorting parameters from which additional correlations can be determined.
[0191] Previously Unidentified or Unknown Genetic Markers
[0192] The samples in the healthy databases can be used to identify new polymorphisms and genetic markers, using any mapping, sequencing, amplification and other methodologies, and in looking for polymorphisms among the population in the database. The thus-identified polymorphism can then be entered into the database for each sample, and the database sorted (stratified) using that polymorphism as a sorting parameter to identify any patterns and correlations that emerge, such as age correlated changes in the frequency of the identified marker. If a correlation is identified, the locus of the marker can be mapped and its function or effect assessed or deduced.
[0193] Thus, the databases here provide means for:
[0194] identification of significantly different allelic frequencies of genetic factors by comparing the occurrence or disappearance of the markers with increasing age in population and then associating the markers with a disease or a biochemical pathway;
[0195] identification of significantly different allelic frequencies of disease causing genetic factors by comparing the male with the female population or comparing other selected stratified populations and associating the markers with a disease or a biochemical pathway;
[0196] identification of significantly different allelic frequencies of disease causing genetic factors by comparing different ethnic groups and associating the markers with a disease or a biochemical pathway that is known to occur in high frequency in the ethnic group;
[0197] profiling potentially functional variants of genes through the general panmixed population stratified according to age, sex, and ethnic origin and thereby demonstrating the contribution of the variant genes to the physical condition of the investigated population;
[0198] identification of functionally relevant gene variants by gene disequilibrium analysis performed within the general panmixed population stratified according to age, sex, and ethnic origin and thereby demonstrating their contribution to the physical condition of investigated population;
[0199] identification of potentially functional variants of chromosomes or parts of chromosomes by linkage disequilibrium analysis performed within the general panmixed population stratified according to age, sex, and ethnic origin and thereby demonstrating their contribution to the physical condition of investigated population.
[0200] Uses of the Identified Markers and Known Markers
[0201] The databases can also be used in conjunction with known markers and sorted to identify any correlations. For example, the databases can be used for:
[0202] determination and evaluation of the penetrance of medically relevant polymorphic markers;
[0203] determination and evaluation of the diagnostic specificity of medically relevant genetic factors;
[0204] determination and evaluation of the positive predictive value of medically relevant genetic factors;
[0205] determination and evaluation of the onset of complex diseases, such as, but are not limited to, diabetes, hypertension, autoimmune diseases, arteriosclerosis, cancer and other diseases within the general population with respect to their causative genetic factors;
[0206] delineation of the appropriate strategies for preventive disease treatment;
[0207] delineation of appropriate timelines for primary disease intervention;
[0208] validation of medically relevant genetic factors identified in isolated populations regarding their general applicability;
[0209] validation of disease pathways including all potential target structures identified in isolated populations regarding their general applicability; and
[0210] validation of appropriate drug targets identified in isolated populations regarding their general applicability.
[0211] Among the diseases and disorders for which polymorphisms can be linked include, those linked to inborn errors of metabolism, acquired metabolic disorders, intermediary metabolism, oncogenesis pathways, blood clotting pathways, and DNA synthetic and repair pathways, DNA repair/replication/transcription factors and activities, e.g., such as genes related to oncogenesis, aging and genes involved in blood clotting and the related biochemical pathways that are related to thrombosis, embolism, stroke, myocardial infarction, angiogenesis and oncogenesis.
[0212] For example, a number of diseases are caused by or involve deficient or defective enzymes in intermediary metabolism (see, e.g., Tables 1 and 2, below) that result, upon ingestion of the enzyme substrates, in accumulation of harmful metabolites that damage organs and tissues, particularly an infant's developing brain and other organs, resulting in mental retardation and other developmental disorders.
Identification of Markers and Genes for Such Disorders is of Great Interest.
[0213] Model Systems
[0214] Several gene systems, p21, p53 and Lipoprotein Lipase polymorphism (N291S), were selected. The p53 gene is a tumor suppressor gene that is mutated in diverse tumor types. One common allelic variant occurs at codon 72. A polymorphism that has been identified in the p53 gene, i.e., the R72P allele, results in an amino acid exchange, arginine to proline, at codon 72 of the gene.
[0215] Using diseased populations, it has been shown that there are ethnic differences in the allelic distribution of these alleles among African-Americans and Caucasians in the U.S. The results here support this finding and also demonstrate that the results obtained with a healthy database are meaningful (see, FIG. 7B).
[0216] The 291S allele leads to reduced levels of high density lipoprotein cholesterol (HDL-C) that is associated with an increased risk of males for arteriosclerosis and in particular myocardial infarction (see, Reymer et al. (1995) Nature Genetics 10:28-34).
[0217] Both genetic polymorphisms were profiled within a part of the Caucasian population-based sample bank. For the polymorphism located in the lipoprotein lipase gene a total of 1025 unselected individuals (436 males and 589 females) were tested. Genomic DNA was isolated from blood samples obtained from the individuals.
[0218] As shown in the Examples and figures, an exemplary database containing about 5000 subjects, answers to the questionnaire (see FIG. 3), and genotypic information has been stratified. A particular known allele has been selected, and the samples tested for the marker using mass spectrometric analyses, particularly PROBE (see the EXAMPLES) to identify polymorphisms in each sample. The population in the database has been sorted according to various parameters and correlations have been observed. For example, FIGS. 2A-C, show sorting of the data by age and sex for the Lipoprotein Lipase gene in the Caucasian population in the database. The results show a decrease in the frequency of the allele with age in males but no such decrease in females. Other alleles that have been tested against the database, include, alleles of p53, p21 and factor VII. Results when sorted by age are shown in the figures.
[0219] These examples demonstrate an effect of altered frequency of disease causing genetic factors within the general population. The scientific interpretation of those results allows prediction of medical relevance of polymorphic genetic alterations. In addition, conclusions can be drawn with regard to their penetrance, diagnostic specificity, positive predictive value, onset of disease, most appropriate onset of preventive strategies, and the general applicability of genetic alterations identified in isolated populations to panmixed populations.
[0220] Therefore, an age- and sex-stratified population-based sample bank that is ethnically homogenous is a suitable tool for rapid identification and validation of genetic factors regarding their potential medical utility.
Exemplary Computer System for Creating, Storing and Processing the Databases
[0221] Systems
[0222] Systems, including computers, containing the databases are provided herein. The computers and databases can be used in conjunction, for example, with the APL system (see, copending U.S. application Ser. No. 09/285,481), which is an automated system for analyzing biopolymers, particularly nucleic acids. Results from the APL system can be entered into the database.
[0223] Any suitable computer system can be used. The computer system can be integrated into systems for sample analysis, such as the automated process line described herein (see, e.g., copending U.S. application Ser. No. 09/285,481).
[0224] FIG. 17 is a block diagram of a computer constructed to provide and process the databases described herein. The processing that maintains the database and performs the methods and procedures can be performed on multiple computers all having a similar construction, or can be performed by a single, integrated computer. For example, the computer through which data are added to the database can be separate from the computer through which the database is sorted, or can be integrated with it. In either arrangement, the computers performing the processing can have a construction as illustrated in FIG. 17.
[0225] FIG. 17 is a block diagram of an exemplary computer 1700 that maintains the database described above and performs the methods and procedures. Each computer 1700 operates under control of a central processor unit (CPU) 1702, such as a "Pentium" microprocessor and associated integrated circuit chips, available from Intel Corporation of Santa Clara, Calif., USA. A computer user can input commands and data from a keyboard and display mouse 1704 and can view inputs and computer output at a display 1706. The display is typically a video monitor or flat panel display device. The computer 1700 also includes a direct access storage device (DASD) 1707, such as a fixed hard disk drive. The memory 1708 typically comprises volatile semiconductor random access memory (RAM). Each computer can include a program product reader 1710 that accepts a program product storage device 1712, from which the program product reader can read data (and to which it can optionally write data). The program product reader can comprise, for example, a disk drive, and the program product storage device can comprise removable storage media such as a magnetic floppy disk, an optical CD-ROM disc, a CD-R disc, a CD-RW disc, or a DVD data disc. If desired, the computers can be connected so they can communicate with each other, and with other connected computers, over a network 1713. Each computer 1700 can communicate with the other connected computers over the network 1713 through a network interface 1714 that enables communication over a connection 1716 between the network and the computer.
[0226] The computer 1700 operates under control of programming steps that are temporarily stored in the memory 1708 in accordance with conventional computer construction. When the programming steps are executed by the CPU 1702, the pertinent system components perform their respective functions. Thus, the programming steps implement the functionality of the system as described above. The programming steps can be received from the DASD 1707, through the program product reader 1712, or through the network connection 1716. The storage drive 1710 can receive a program product, read programming steps recorded thereon and transfer the programming steps into the memory 1708 for execution by the CPU 1702. As noted above, the program product storage device 1710 can comprise any one of multiple removable media having recorded computer-readable instructions, including magnetic floppy disks and CD-ROM storage discs. Other suitable program product storage devices can include magnetic tape and semiconductor memory chips. In this way, the processing steps necessary for operation can be embodied on a program product.
[0227] Alternatively, the program steps can be received into the operating memory 1708 over the network 1713. In the network method, the computer receives data including program steps into the memory 1708 through the network interface 1714 after network communication has been established over the network connection 1716 by well-known methods that will be understood by those skilled in the art without further explanation. The program steps are then executed by the CPU 1702 to implement the processing of the Garment Database system.
[0228] It should be understood that all of the computers of the system and can have a construction similar to that shown in FIG. 17. Details described with respect to the FIG. 17 computer 1700 will be understood to apply to all computers of the system 1700. This is indicated by multiple computers 1700 shown connected to the network 1713. Any one of the computers 1700 can have an alternative construction, so long as they can communicate with the other computers and support the functionality described herein.
[0229] FIG. 18 is a flow diagram that illustrates the processing steps performed using the computer illustrated in FIG. 17, to maintain and provide access to the databases, such as for identifying polymorphic genetic markers. In particular, the information contained in the database is stored in computers having a construction similar to that illustrated in FIG. 17. The first step for maintaining the database, as indicated in FIG. 18, is to identify healthy members of a population. As noted above, the population members are subjects that are selected only on the basis of being healthy, and where the subjects are mammals, such as humans, they can be selected based upon apparent health and the absence of detectable infections. The step of identifying is represented by the flow diagram box numbered 1802.
[0230] The next step, represented by the flow diagram box numbered 1804, is to obtain identifying and historical information and data relating to the identified members of the population. The information and data comprise parameters for each of the population members, such as member age, ethnicity, sex, medical history, and ultimately genotypic information. Initially, the parameter information is obtained from a questionnaire answered by each member, from whom a body tissue or body fluid sample also is obtained. The step of entering and storing these parameters into the database of the computer is represented by the flow diagram box numbered 1806. As additional information about each population member and corresponding sample is obtained, this information can be inputted into the database and can serve as a sorting parameter.
[0231] In the next step, represented by the flow diagram box numbered 1808, the parameters of the members are associated with an indexer. This step can be executed as part of the database storage operation, such as when a new data record is stored according to the relational database structure and is automatically linked with other records according to that structure. The step 1806 also can be executed as part of a conventional data sorting or retrieval process, in which the database entries are searched according to an input search or indexing key value to determine attributes of the data. For example, such search and sort techniques can be used to follow the occurrence of known genetic markers and then determine if there is a correlation with diseases for which they have been implicated. Examples of this use are for assessing the frequencies of the p53 and Lipoprotein Lipase polymorphisms.
[0232] Such searching of the database also can be valuable for identifying one or more genetic markers whose frequency changes within the population as a function of age, ethnic group, sex, or some other criteria. This can allow the identification of previously unknown polymorphisms and, ultimately, identification of a gene or pathway involved in the onset and progression of disease.
[0233] In addition, the database can be used for taking an identified polymorphism and ascertaining whether it changes in frequency when the data are sorted according to a selected parameter.
[0234] In this way, the databases and methods provided herein permit, among other things, identification of components, particularly key components, of a disease process by understanding its genetic underpinnings, and also an understanding of processes, such as individual drug responses. The databases and methods provided herein also can be used in methods involving elucidation of pathological pathways, in developing new diagnostic assays, identifying new potential drug targets, and in identifying new drug candidates.
Morbidity and/or Early Mortality Associated Polymorphisms
[0235] A database containing information provided by a population of healthy blood donors who were not selected for any particular disease to can be used to identify polymorphisms and the alleles in which they are present, whose frequency decreases with age. These can represent morbidity susceptibility markers and genes.
[0236] Polymorphisms of the genome can lead to altered gene function, protein function or genome instability. To identify those polymorphisms which have a clinical relevance/utility is the goal of a world-wide scientific effort. It can be expected that the discovery of such polymorphisms will have a fundamental impact on the identification and development of novel drug compounds to cure diseases. The strategy to identify valuable polymorphisms is cumbersome and dependent upon the availability of many large patient and control cohorts to show disease association. In particular, genes that cause a general risk of the population to suffer from any disease (morbidity susceptibility genes) will escape these case/control studies entirely.
[0237] Here described is a screening strategy to identify morbidity susceptibility genes underlying a variety of different diseases. The definition of a morbidity susceptibility gene is a gene that is expressed in many different cell types or tissues (housekeeping gene) and its altered function can facilitate the expression of a clinical phenotype caused by disease-specific susceptibility genes that are involved in a pathway specific for this disorder. In other words, these morbidity susceptibility genes predispose people to develop a distinct disease according to their genetic make-up for this disease.
[0238] Candidates for morbidity susceptibility genes can be found at the bottom level of pathways involving transcription, translation, heat-shock proteins, protein trafficking, DNA repair, assembly systems for subcellular structures (e.g. mitochondria, peroxysomes and other cellular microbodies), receptor signaling cascades, immunology, etc. Those pathways control the quality of life at the cellular level as well as for the entire organism. Mutations/polymorphisms located in genes encoding proteins for those pathways can reduce the fitness of cells and make the organism more susceptible to express the clinical phenotype caused by the action of a disease-specific susceptibility gene. Therefore, these morbidity susceptibility genes can be potentially involved in a whole variety of different complex diseases if not in all. Disease-specific susceptibility genes are involved in pathways that can be considered as disease-specific pathways like glucose-, lipid, hormone metabolism, etc.
[0239] The exemplified method permit, among other things, identification of genes and/or gene products involved in a man's general susceptibility to morbidity and/or mortality; use of these genes and/or gene products in studies to elucidate the genetic underpinnings of human diseases; use of these genes and/or gene products in combinatorial statistical analyses without or together with disease-specific susceptibility genes; use of these genes and/or gene products to predict penetrance of disease susceptibility genes; use of these genes and/or gene products in predisposition and/or acute medical diagnostics and use of these genes and/or gene products to develop drugs to cure diseases and/or to extend the life span of humans.
Screening Process
[0240] The healthy population stratified by age, gender and ethnicity, etc. is a very efficient and a universal screening tool for morbidity associated genes. Changes of allelic frequencies in the young compared to the old population are expected to indicate putative morbidity susceptibility genes. Individual samples of this healthy population base can be pooled to further increase the throughput. In an experiment, pools of young and old Caucasian females and males were applied to screen more than 400 randomly chosen single nucleotide polymorphisms located in many different genes. Candidate polymorphisms were identified if the allelic difference was greater than 8% between young and old for both or only one of the genders. The initial results were assayed again in at least one independent subsequent experiments. Repeated experiments are necessary to recognize unstable biochemical reactions, which occur with a frequency of about 2-3% and can mimic age-related allelic frequency differences. Average frequency differences and standard deviations are calculated after successful reproducibility of initial results. The final allelic frequency is then compared to a reference population of Caucasian CEPH sample pool. The result should show similar allelic frequencies in the young Caucasian population. Subsequently, the exact allele frequencies of candidates including genotype information were obtained by analyzing all individual samples. This procedure is straight forward with regard to time and cost. It enables the screening of an enormous number of SNPs. So far, several markers with a highly significant association to age were identified and described below.
[0241] In general at least 5 individuals in a stratified population should to be screened to produce statistically significant results. The frequency of the allele is determined for an age stratified population. Chi square analysis is then performed on the allelic frequencies to determine if the difference between age groups is statistically significant. A p value less than of 0.1 is considered to represent a statistically significant difference. Typically the p value should be less than 0.05.
Clinical Trials
[0242] The identification of markers whose frequency in a population decreases with age also allows for better designed and balanced clinical trials. Currently, if a clinical trial utilizes a marker as a significant endpoint in a study and the marker disappears with age, then the results of the study can be inaccurate. By using methods provided herein, it can be ascertained that if a marker decreases in frequency with age. This information can be considered and controlled when designing the study. For, example, an age independent marker could be substituted in its place.
[0243] The following examples are included for illustrative purposes only and are not intended to limit the scope of the invention.
Example 1
[0244] This example describes the use of a database containing information provided by a population of healthy blood donors who were not selected for any particular disease to determine the distribution of allelic frequencies of known genetic markers with age and by sex in a Caucasian subpopulation of the database. The results described in this example demonstrate that a disease-related genetic marker or polymorphism can be identified by sorting a healthy database by a parameter or parameters, such as age, sex and ethnicity.
[0245] Generating a Database
[0246] Blood was obtained by venous puncture from human subjects who met blood bank criteria for donating blood. The blood samples were preserved with EDTA at pH 8.0 and labeled. Each donor provided information such as age, sex, ethnicity, medical history and family medical history. Each sample was labeled with a barcode representing identifying information. A database was generated by entering, for each donor, the subject identifier and information corresponding to that subject into the memory of a computer storage medium using commercially available software, e.g., Microsoft Access.
[0247] Model Genetic Markers
[0248] The frequencies of polymorphisms known to be associated at some level with disease were determined in a subpopulation of the subjects represented in the database. These known polymporphisms occur in the p21, p53 and Lipoprotein Lipase genes. Specifically, the N291S polymorphism (N291S) of the Lipoprotein Lipase gene, which results in a substitution of a serine for an asparagine at amino acid codon 291, leads to reduced levels of high density lipoprotein cholesterol (HDL-C) that is associated with an increased risk of males for arteriosclerosis and in particular myocardial infarction (see, Reymer et al. (1995) Nature Genetics 10:28-34).
[0249] The p53 gene encodes a cell cycle control protein that assesses DNA damage and acts as a transcription factor regulating genes that control cell growth, DNA repair and apoptosis (programmed cell death). Mutations in the p53 gene have been found in a wide variety of different cancers, including different types of leukemia, with varying frequency. The loss of normal p53 function results in genomic instability an uncontrolled cell growth. A polymorphism that has been identified in the p53 gene, i.e., the R72P allele, results in the substitution of a proline for an arginine at amino acid codon 72 of the gene.
[0250] The p21 gene encodes a cyclin-dependent kinase inhibitor associated with G1 phase arrest of normal cells. Expression of the p21 gene triggers apoptosis. Polymorphisms of the p21 gene have been associated with Wilms' tumor, a pediatric kidney cancer. One polymorphism of the p21 gene, the S31R polymorphism, results in a substitution of an arginine for a serine at amino acid codon 31.
[0251] Database Analysis
[0252] Sorting of Subjects According to Specific Parameters
[0253] The genetic polymorphisms were profiled within segments of the Caucasian subpopulation of the sample bank. For p53 profiling, the genomic DNA isolated from blood from a total of 1277 Caucasian subjects age 18-59 years and 457 Caucasian subjects age 60-79 years was analyzed. For p21 profiling, the genomic DNA isolated from blood from a total of 910 Caucasian subjects age 18-49 years and 824 Caucasian subjects age 50-79 years was analyzed. For lipoprotein lipase gene profiling, the genomic DNA from a total of 1464 Caucasian females and 1470 Caucasian males under 60 years of age and a total of 478 Caucasian females and 560 Caucasian males over 60 years of age was analyzed.
[0254] Isolation and Analysis of Genomic DNA
[0255] Genomic DNA was isolated from blood samples obtained from the individuals. Ten milliliters of whole blood from each individual was centrifuged at 2000รg. One milliliter of the buffy coat was added to 9 ml of 155 mM NH4Cl, 10 mM KHCO3, and 0.1 mM Na2EDTA, incubated 10 min at room temperature and centrifuged for 10 min at 2000รg. The supernatant was removed, and the white cell pellet was washed in 155 mM NH4Cl, 10 mM KHCO3 and 0.1 mM Na2EDTA and resuspended in 4.5 ml of 50 mM Tris, 5 mM EDTA and 1% SDS. Proteins were precipitated from the cell lysate by 6 mM ammonium acetate, pH 7.3, and then separated from the nucleic acids by centrifugation at 3000รg. The nucleic acid was recovered from the supernatant by the addition of an equal volume of 100% isopropanol and centrifugation at 2000รg. The dried nucleic acid pellet was hydrated in 10 mM Tris, pH 7.6, and 1 mM Na2EDTA and stored at 4ยฐ C.
[0256] Assays of the genomic DNA to determine the presence or absence of the known genetic markers were developed using the BiomassPROBEยฎ detection method (primer oligo base extension) reaction. This method uses a single detection primer followed by an oligonucleotide extension step to give products, which can be readily resolved by mass spectrometry, and, in particular, MALDI-TOF mass spectrometry. The products differ in length depending on the presence or absence of a polymorphism. In this method, a detection primer anneals adjacent to the site of a variable nucleotide or sequence of nucleotides, and the primer is extended using a DNA polymerase in the presence of one or more dideoxyNTPs and, optionally, one or more deoxyNTPs. The resulting products are resolved by MALDI-TOF mass spectrometry. The mass of the products as measured by MALDI-TOF mass spectrometry makes possible the determination of the nucleotide(s) present at the variable site.
[0257] First, each of the Caucasian genomic DNA samples was subjected to nucleic acid amplification using primers corresponding to sites 5' and 3' of the polymorphic sites of the p21 (S31R allele), p53 (R72P allele) and Lipoprotein Lipase (N291S allele) genes. One primer in each primer pair was biotinylated to permit immobilization of the amplification product to a solid support. Specifically, the polymerase chain reaction primers used for amplification of the relevant segments of the p21, p53 and lipoprotein lipase genes are shown below: US4p21c31-2F (SEQ ID NO: 9) and US5p21-2R (SEQ ID NO: 10) for p21 gene amplification; US4-p53-ex4-F (also shown as p53-ex4US4 (SEQ ID NO: 2)) and US5-p53/2-4R (also shown as US5P53/4R (SEQ ID NO: 3)) for p53 gene amplification; and US4-LPL-F2 (SEQ ID NO: 16) and US5-LPL-R2 (SEQ ID NO: 17) for lipoprotein lipase gene amplification.
[0258] Amplification of the respective DNA sequences was conducted according to standard protocols. For example, primers can be used in a concentration of 8 pmol. The reaction mixture (e.g., total volume 50 ฮผl) can contain Taq-polymerase including 10ร buffer and dTNPs. Cycling conditions for polymerase chain reaction amplification can typically be initially 5 min. at 95ยฐ C., followed by 1 min. at 94ยฐ C., 45 sec at 53ยฐ C., and 30 sec at 72ยฐ C. for 40 cycles with a final extension time of 5 min at 72ยฐ C. Amplification products can be purified by using Qiagen's PCR purification kit (No. 28106) according to manufacturer's instructions. The elution of the purified products from the column can be done in 50 ฮผl TE-buffer (10 mM Tris, 1 mM EDTA, pH 7.5).
[0259] The purified amplification products were immobilized via a biotin-avidin linkage to streptavidin-coated beads and the double-stranded DNA was denatured. A detection primer was then annealed to the immobilized DNA using conditions such as, for example, the following: 50 ฮผl annealing buffer (20 mM Tris, 10 mM KCl, 10 mM (NH4)2SO4, 2 mM MgSO2, 1% Triton X-100, pH 8) at 50ยฐ C. for 10 min, followed by washing of the beads three times with 200 ฮผl washing buffer (40 mM Tris, 1 mM EDTA, 50 mM NaCl, 0.1% Tween 20, pH 8.8) and once in 200 ฮผl TE buffer.
[0260] The PROBE extension reaction was performed, for example, by using some components of the DNA sequencing kit from USB (No. 70770) and dNTPs or ddNTPs from Pharmacia. An exemplary protocol could include a total reaction volume of 45 ฮผl, containing of 21 ฮผl water, 6 ฮผl Sequenase-buffer, 3 ฮผl 10 mM DTT solution, 4.5 ฮผl, 0.5 mM of three dNTPs, 4.5 ฮผl, 2 mM the missing one ddNTP, 5.5 ฮผl glycerol enzyme dilution buffer, 0.25 ฮผl Sequenase 2.0, and 0.25 pyrophosphatase. The reaction can then by pipetted on ice and incubated for 15 min at room temperature and for 5 min at 37ยฐ C. The beads can be washed three times with 200 ฮผl washing buffer and once with 60 ฮผl of a 70 mM NH4-Citrate solution.
[0261] The DNA was denatured to release the extended primers from the immobilized template. Each of the resulting extension products was separately analyzed by MALDI-TOF mass spectrometry using 3-hydroxypicolinic acid (3-HPA) as matrix and a UV laser.
[0262] Specifically, the primers used in the PROBE reactions are as shown below: P21/31-3 (SEQ ID NO: 12) for PROBE analysis of the p21 polymorphic site; P53/72 (SEQ ID NO: 4) for PROBE analysis of the p53 polymorphic site; and LPL-2 for PROBE analysis of the lipoprotein lipase gene polymorphic site. In the PROBE analysis of the p21 polymorphic site, the extension reaction was performed using dideoxy-C. The products resulting from the reaction conducted on a "wild-type" allele template (wherein codon 31 encodes a serine) and from the reaction conducted on a polymorphic S31R allele template (wherein codon 31 encodes an arginine) are shown below and designated as P21/31-3 Ser (wt) (SEQ ID NO: 13) and P21/31-3 Arg (SEQ ID NO: 14), respectively. The masses for each product as can be measured by MALDI-TOF mass spectrometry are also provided (i.e., 4900.2 Da for the wild-type product and 5213.4 Da for the polymorphic product).
[0263] In the PROBE analysis of the p53 polymorphic site, the extension reaction was performed using dideoxy-C. The products resulting from the reaction conducted on a "wild-type" allele template (wherein codon 72 encodes an arginine) and from the reaction conducted on a polymorphic R72P allele template (wherein codon 72 encodes a proline) are shown below and designated as Cod72 G Arg (wt) and Cod72 C Pro, respectively. The masses for each product as can be measured by MALDI-TOF mass spectrometry are also provided (i.e., 5734.8 Da for the wild-type product and 5405.6 Da for the polymorphic product).
[0264] In the PROBE analysis of the lipoprotein lipase gene polymorphic site, the extension reaction was performed using a mixture of ddA and ddT. The products resulting from the reaction conducted on a "wild-type" allele template (wherein codon 291 encodes an asparagine) and from the reaction conducted on a polymorphic N291S allele template (wherein codon 291 encodes a serine) are shown below and designated as 291Asn and 291Ser, respectively. The masses for each product as can be measured by MALDI-TOF mass spectrometry are also provided (i.e., 6438.2 Da for the wild-type product and 6758.4 Da for the polymorphic product).
P53-1 (R72P)
TABLE-US-00002
[0265] PCR Product length: 407 bp (SEQ ID NO: 1) US4-p53-ex4-F ctg aggacctggt cctctgactg ctcttttcac ccatctacag tcccccttgc cgtcccaagc aatggatgat ttgatgctgt ccccggacga tattgaacaa tggttcactg aagacccagg tccagatgaa gctcccagaa P53/72 72R tgccagaggc tgctccccgc gtggcccctg caccagcagc tcctacaccg gcggcccctg c 72P caccagcccc ctcctggccc ctgtcatctt ctgtcccttc ccagaaaacc taccagggca gctacggttt ccgtctgggc ttcttgcatt ctgggacagc caagtctgtg acttgcacgg tcagttgccc tgaggggctg gcttccatga gacttcaa US5-p53/2-4R Primers (SEQ ID NOs: 2-4) p53-ex4FUS4 ccc agt cac gac gtt gta aaa cgc tga gga cct ggt cct ctg ac US5P53/4R agc gga taa caa ttt cac aca ggt tga agt ctc atg gaa gcc P53/72 gcc aga ggc tgc tcc cc
Masses
TABLE-US-00003
[0266] Product Termination: SEQ Allele ddC # Length Mass P53/72 gccagaggctgctcccc 5 17 5132.4 Cod72 G Arg gccagaggctgctccccgc 6 19 5734.8 (wt) Cod72 C Pro gccagaggctgctccccc 7 18 5405.6
Biotinylated US5 primer is used in the PCR amplification.
LPL-1 (N291S)
[0267] Amino acid exchange asparagine to serine at codon 291 of the lipoprotein lipase gene.
TABLE-US-00004 PCR Product length: 251 bp (SEQ ID NO: 15) US4-LPL-F2 (SEQ ID NO: 16) gcgctccatt catctcttca tcgactctct gttgaatgaa gaaaatccaa gtaaggccta caggtgcagt tccaaggaag cctttgagaa agggctctgc ttgagttgta gaaagaaccg LPL-2 291N ctgcaacaat ctgggctatg agatcaataa agtcagagcc aaaagaagca gcaaaatgta g 291S cctgaagact cgttctcaga tgccc US4-LPL-R2 Primers (SEQ ID NOs: 16-18): US4-LPL-F2 ccc agt cac gac gtt gta aaa cgg cgc tcc att cat ctc ttc US5-LPL-R2 agc gga taa caa ttt cac aca ggg ggc atc tga gaa cga gtc LPL-2 caa tct ggg cta tga gat ca
Masses
TABLE-US-00005
[0268] Product Termination: SEQ Allele ddA, ddT # Length Mass LPL-2 caatctgggctatgagatca 19 20 6141 291 Asn caatctgggctatgagatcaa 20 21 6438.2 291 Ser caatctgggctatgagatcagt 21 22 6758.4
Biotinylated US5 primer is used in the PCR amplification.
P21-1 (S31R)
[0269] Amino acid exchange serine to arginine at codon 31 of the tumor suppressor gene p21. Product length: 207 bp (SEQ ID NO: 8)
TABLE-US-00006 Product length: 207 bp (SEQ ID NO: 8) US4p21c31-2F gtcc gtcagaaccc atgcggcagc p21/31-3 31S aaggcctgcc gccgcctctt cggcccagtg gacagcgagc agctgagccg cgactgtgat a 31R gcgctaatgg cgggctgcat ccaggaggcc cgtgagcgat ggaacttcga ctttgtcacc gagacaccac tggaggg US5p21-2R Primers (SEQ ID NOs: 9-11) US4p21c31-2F ccc agt cac gac gtt gta aaa cgg tcc gtc aga acc cat gcg g US5p21-2R acc gga taa caa ttt cac aca ggc tcc agt ggt gtc tcg gtg ac P21/31-3 cag cga gca gct gag
Masses
TABLE-US-00007
[0270] Product Termination: SEQ Allele ddC # Length Mass p21/31-3 cagcgagcagctgag 12 15 4627 P21/31-3 Ser cagcgagcagctgagc 13 16 4900.2 (wt) P21/31-3 Arg cagcgagcagctgagac 14 17 5213.4
Biotinylated US5 primer is used in the PCR amplification.
[0271] Each of the Caucasian subject DNA samples was individually analyzed by MALDI-TOF mass spectrometry to determine the identity of the nucleotide at the polymorphic sites. The genotypic results of each assay can be entered into the database. The results were then sorted according to age and/or sex to determine the distribution of allelic frequencies by age and/or sex. As depicted in the Figures showing histograms of the results, in each case, there was a differential distribution of the allelic frequencies of the genetic markers for the p21, p53 and lipoprotein lipase gene polymorphisms.
[0272] FIG. 8 shows the results of the p21 genetic marker assays and reveals a statistically significant decrease (from 13.3% to 9.2%) in the frequency of the heterozygous genotype (S31R) in Caucasians with age (18-49 years of age compared to 50-79 years of age). The frequencies of the homozygous (S31 and R31) genotypes for the two age groups are also shown, as are the overall frequencies of the S31 and R31 alleles in the two age groups (designated as *S31 and *R31, respectively in the Figure).
[0273] FIGS. 7A-C show the results of the p53 genetic marker assays and reveals a statistically significant decrease (from 6.7% to 3.7%) in the frequency of the homozygous polymorphic genotype (P72) in Caucasians with age (18-59 years of age compared to 60-79 years of age). The frequencies of the homozygous "wild-type" genotype (R72) and the heterozygous genotype (R72P) for the two age groups are also shown, as are the overall frequencies of the R72 and P72 alleles in the two age groups (designated as *R72 and
[0274] P72, respectively in the Figure). These results are consistent with the observation that allele is not benign, as p53 regulates expression of a second protein, p21, which inhibits cyclin-dependent kinases (CDKs) needed to drive cells through the cell-cycle (a mutation in either gene can disrupt the cell cycle leading to increased cell division).
[0275] FIG. 2C shows the results of the lipoprotein lipase gene genetic marker assays and reveals a statistically significant decrease (from 1.97% to 0.54%) in the frequency of the polymorphic allele (S291) in Caucasian males with age (see also Reymer et al. (1995) Nature Genetics 10:28-34). The frequencies of this allele in Caucasian females of different age groups are also shown.
Example 2
[0276] This example describes the use of MALDI-TOF mass spectrometry to analyze DNA samples of a number of subjects as individual samples and as pooled samples of multiple subjects to assess the presence or absence of a polymorphic allele (the 353Q allele) of the Factor VII gene and determine the frequency of the allele in the group of subjects. The results of this study show that essentially the same allelic frequency can be obtained by analyzing pooled DNA samples as by analyzing each sample separately and thereby demonstrate the quantitative nature of MALDI-TOF mass spectrometry in the analysis of nucleic acids.
[0277] Factor VII
[0278] Factor VII is a serine protease involved in the extrinsic blood coagulation cascade. This factor is activated by thrombin and works with tissue factor (Factor III) in the processing of Factor X to Factor Xa. There is evidence that supports an association between polymorphisms in the Factor VII gene and increased Factor VII activity which can result in an elevated risk of ischemic cardiovascular disease, including myocardial infarction. The polymorphism investigated in this study is R353Q (i.e., a substitution of a glutamic acid residue for an arginine residue at codon 353 of the Factor VII gene) (see Table 5).
Analysis of DNA Samples for the Presence or Absence of the 353Q Allele of the Factor VII Gene
[0279] Genomic DNA was isolated from separate blood samples obtained from a large number of subjects divided into multiple groups of 92 subjects per group. Each sample of genomic DNA was analyzed using the BiomassPROBEยฎ assay as described in Example 1 to determine the presence or absence of the 353Q polymorphism of the Factor VII gene.
[0280] First, DNA from each sample was amplified in a polymerase chain reaction using primers F7-353FUS4 (SEQ ID NO: 24) and F7-353RUS5 (SEQ ID NO: 26) as shown below and using standard conditions, for example, as described in Example 1. One of the primers was biotinylated to permit immobilization of the amplification product to a solid support. The purified amplification products were immobilized via a biotin-avidin linkage to streptavidin-coated beads and the double-stranded DNA was denatured. A detection primer was then annealed to the immobilized DNA using conditions such as, for example, described in Example 1. The detection primer is shown as F7-353-P (SEQ ID NO: 27) below. The PROBE extension reaction was carried out using conditions, for example, such as those described in Example 1. The reaction was performed using ddG.
[0281] The DNA was denatured to release the extended primers from the immobilized template. Each of the resulting extension products was separately analyzed by MALDI-TOF mass spectrometry. A matrix such as 3-hydroxypicolinic acid (3-HPA) and a UV laser could be used in the MALDI-TOF mass spectrometric analysis. The products resulting from the reaction conducted on a "wild-type" allele template (wherein codon 353 encodes an arginine) and from the reaction conducted on a polymorphic 353Q allele template (wherein codon 353 encodes a glutamic acid) are shown below and designated as 353 CGG and 353 CAG, respectively. The masses for each product as can be measured by MALDI-TOF mass spectrometry are also provided (i.e., 5646.8 Da for the wild-type product and 5960 Da for the polymorphic product).
[0282] The MALDI-TOF mass spectrometric analyses of the PROBE reactions of each DNA sample were first conducted separately on each sample (250 nanograms total concentration of DNA per analysis). The allelic frequency of the 353Q polymorphism in the group of 92 subjects was calculated based on the number of individual subjects in which it was detected.
[0283] Next, the samples from 92 subjects were pooled (250 nanograms total concentration of DNA in which the concentration of any individual DNA is 2.7 nanograms), and the pool of DNA was subjected to MALDI-TOF mass spectrometric analysis. The area under the signal corresponding to the mass of the 353Q polymorphism PROBE extension product in the resulting spectrum was integrated in order to quantitate the amount of DNA present. The ratio of this amount to total DNA was used to determine the allelic frequency of the 353Q polymorphism in the group of subjects. This type of individual sample vs. pooled sample analysis was repeated for numerous different groups of 92 different samples.
[0284] The frequencies calculated based on individual MALDI-TOF mass spectrometric analysis of the 92 separate samples of each group of 92 are compared to those calculated based on MALDI-TOF mass spectrometric analysis of pools of DNA from 92 samples in FIG. 9. These comparisons are shown as "pairs" of bar graphs in the Figure, each pair being labeled as a separate "pool" number, e.g., P1, P16, P2, etc. Thus, for example, for P1, the allelic frequency of the polymorphism calculated by separate analysis of each of the 92 samples was 11.41%, and the frequency calculated by analysis of a pool of all of the 92 DNA samples was 12.09%.
[0285] The similarity in frequencies calculated by analyzing separate DNA samples individually and by pooling the DNA samples demonstrates that it is possible, through the quantitative nature of MALDI-TOF mass spectrometry, to analyze pooled samples and obtain accurate frequency determinations. The ability to analyze pooled DNA samples significantly reduces the time and costs involved in the use of the non-selected, healthy databases as described herein. It has also been shown that it is possible to decrease the DNA concentration of the individual samples in a pooled mixture from 2.7 nanograms to 0.27 nanograms without any change in the quality of the spectrum or the ability to quantitate the amount of sample detected.
Factor VII R353Q PROBE Assay
[0286] PROBE Assay for cod353 CGG>CAG (Arg>Gln), Exon 9 G>A.
[0287] PCR fragment: 134 bp (incl. US tags; SEQ ID Nos. 22 and 23)
Frequency of A allele: Europeans about 0.1, Japanese/Chinese about 0.03-0.05 (Thromb. Haemost. 1995, 73:617-22: Diabetoloaia 1998, 41:760-6):
TABLE-US-00008 F7-353FUS4> 1201 GTGCCGGCTA CTCGGATGGC AGCAAGGACT CCTGCAAGGG GGACAGTGGA GGCCCACATG F7-353-P> A <F7-353RUS5 1261 CCACCCACTA CCGGGGCACG TGGTACCTGA CGGGCATCGT CAGCTGGGGC CAGGGCTGCG Primers (SEQ ID NOs: 24-26) Tmgs F7-353FUS4 CCC AGT CAC GAC GTT GTA AAA CGA TGG CAG CAA GGA CTC CTG 64ยฐ C. F7-353-P CAC ATG CCA CCC ACT ACC F7-353RUS5 AGC GGA TAA CAA TTT CAC ACA GGT GAC GAT GCC CGT CAG GTA C 64ยฐ C.
Masses
TABLE-US-00009
[0288] Product Termination: SEQ Allele ddG # Length Mass F7-353-P atgccacccactacc 27 18 5333.6 353 CGG cacatgccacccactaccg 28 19 5646.8 353 CAG cacatgccacccactaccag 29 20 5960 US5-bio agcggataacaatttcacacagg 30 23 7648.6 bio-
Conclusion
[0289] The above examples demonstrate an effect of altered frequency of disease causing genetic factors within the general population. Interpretation of those results allows prediction of the medical relevance of polymorphic genetic alterations. In addition, conclusions can be drawn with regard to their penetrance, diagnostic specificity, positive predictive value, onset of disease, most appropriate onset of preventive strategies, and the general applicability of genetic alterations identified in isolated populations to panmixed populations. Therefore, an age- and sex-stratified population-based sample bank that is ethnically homogenous is a suitable tool for rapid identification and validation of genetic factors regarding their potential medical utility.
Example 3
Morbidity and Mortality Markers
Sample Band and Initial Screening
[0290] Healthy samples were obtained through the blood bank of San Bernardino, Calif. Donors signed prior to the blood collection a consent form and agreed that their blood will be used in genetic studies with regard to human aging. All samples were anomymized. Tracking back of samples is not possible.
Isolation of DNA from Blood Samples of a Healthy Donor Population
[0291] Blood is obtained from a donor by venous puncture and preserved with 1 mM EDTA pH 8.0. Ten milliliters of whole blood from each donor was centrifuged at 2000รg. One milliliter of the buffy coat was added to 9 milliliters of 155 mM NH4Cl, 10 mM KHCO3, and 0.1 mM Na2EDTA, incubated 10 minutes at room temperature and centrifuged for 10 minutes at 2000รg. The supernatant was removed, and the white cell pellet was washed in 155 mM NH4Cl, 10 mM KHCO3, and 0.1 mM Na2EDTA and resuspended in 4.5 milliliters of 50 mM Tris, 5 mM EDTA, and 1')/0 SDS. Proteins were precipitated from the cell lysate by 6M Ammonium Acetate, pH 7.3, and separated from the nucleic acid by centrifugation 3000รg. The nucleic acid was recovered from the supernatant by the addition of an equal volume of 100% isopropanol and centrifugation at 2000รg. The dried nucleic acid pellet was hydrated in 10 mM Tris pH 7.6 and 1 mM Na2EDTA and stored at 4C.
[0292] In this study, samples were pooled as shown in Table 1. Both parents of the blood donors were of Caucasian origin.
TABLE-US-00010 TABLE 1 Pool ID Sex Age-range # individuals SP1 Female 18-39 years 276 SP2 Males 18-39 years 276 SP3 Females 60-69 years 184 SP4 Males 60-79 years 368
More than 400 SNPs were tested using all four pools. After one test run 34 assays were selected to be re-assayed at least once. Finally, 10 assays showed repeatedly differences in allele frequencies of several percent and, therefore, fulfilled the criteria to be tested using the individual samples. Average allele frequency and standard deviation is tabulated in Table 2.
TABLE-US-00011 TABLE 2 Assay ID SP1 SP1-STD SP2 SP2-STD SP3 SP3-STD SP4 SP4-STD 47861 0.457 0.028 0.433 0.042 0.384 0.034 0.380 0.015 47751 0.276 0.007 0.403 0.006 0.428 0.052 0.400 0.097 48319 0.676 0.013 0.627 0.018 0.755 0.009 0.686 0.034 48070 0.581 0.034 0.617 0.045 0.561 n.a. 0.539 0.032 49807 0.504 0.034 0.422 0.020 0.477 0.030 0.556 0.005 49534 0.537 0.017 0.503 n.a. 0.623 0.023 0.535 0.009 49733 0.560 0.006 0.527 0.059 0.546 0.032 0.436 0.016 49947 0.754 0.008 0.763 0.047 0.736 0.052 0.689 0.025 50128 0.401 0.022 0.363 0.001 0.294 0.059 0.345 0.013 63306 0.697 0.012 0.674 0.013 0.712 0.017 0.719 0.005
[0293] So far, 7 out of the 10 potential morbidity markers were fully analyzed. Additional information about genes in which these SNPs are located was gathered through publicly available databases, including Genbank.
AKAPS
[0294] Candidate morbidity and mortality markers include housekeeping genes, such as genes involved in signal transduction. Among such genes are the A-kinase anchoring proteins (AKAPs) genes, which participate in signal transduction pathways involving protein phosphorylation. Protein phosphorylation is an important mechanism for enzyme regulation and the transduction of extracellular signals across the cell membrane in eukaryotic cells. A wide variety of cellular substrates, including enzymes, membrane receptors, ion channels and transcription factors, can be phosphorylated in response to extracellular signals that interact with cells. A key enzyme in the phosphorylation of cellular proteins in response to hormones and neurotransmitters is cyclic AMP (cAMP)-dependent protein kinase (PKA). Upon activation by cAMP, PKA thus mediates a variety of cellular responses to such extracellular signals. An array of PKA isozymes are expressed in mammalian cells. The PKAs usually exist as inactive tetramers containing a regulatory (R) subunit dimer and two catalytic (C) subunits. Genes encoding three C subunits (Cฮฑ, Cฮฒ and Cฮณ) and four R subunits (RIฮฑ, RIฮฒ, RIIฮฑ and RIIฮฒ) have been identified [see Takio et al. (1982) Proc. Natl. Acad. Sci. U.S.A. 79:2544-2548; Lee et al. (1983) Proc. Natl. Acad. Sci. U.S.A. 80:3608-3612; Jahnsen et al. (1996) J. Biol. Chem. 261:12352-12361; Clegg et al. (1988) Proc. Natl. Acad. Sci. U.S.A. 85:3703-3707; and Scott (1991) Pharmacol. Ther. 50:123-145]. The type I (RI) ฮฑ and type II (RII) ฮฑ subunits are distributed ubiquitously, whereas RIฮฒ and RIIฮฒ are present mainly in brain [see. e.g., Miki and Eddy (1999) J. Biol. Chem. 274:29057-29062]. The type I PKA holoenzyme (RIฮฑ and RIฮฒ) is predominantly cytoplasmic, whereas the majority of type II PKA (RIIฮฑ and RIIฮฒ) associates with cellular structures and organelles [Scott (1991) Pharmacol. Ther. 50:123-145]. Many hormones and other signals act through receptors to generate cAMP which binds to the R subunits of PKA and releases and activates the C subunits to phosphorylate proteins. Because protein kinases and their substrates are widely distributed throughout cells, there are mechanisms in place in cells to localize protein kinase-mediated responses to different signals. One such mechanism involves subcellular targeting of PKAs through association with anchoring proteins, referred to as A-kinase anchoring proteins (AKAPs), that place PKAs in close proximity to specific organelles or cytoskeletal components and particular substrates thereby providing for more specific PKA interactions and localized responses [see, e.g., Scott et al. (1990) J. Biol. Chem. 265:21561-21566; Bregman et al. (1991) J. Biol. Chem. 266:7207-7213; and Miki and Eddy (1999) J. Biol. Chem. 274:29057-29062]. Anchoring not only places the kinase close to the substrates, but also positions the PKA holoenzyme at sites where it can optimally respond to fluctuations in the second messenger cAMP [Mochly-Rosen (1995) Science 268:247-251; Faux and Scott (1996) Trends Biochem. Sci. 21:312-315; Hubbard and Cohen (1993) Trends Biochem. Sci. 18:172-177].
[0295] Up to 75% of type II PKA is localized to various intracellular sites through association of the regulatory subunit (RII) with AKAPs [see, e.g., Hausken et al. (1996) J. Biol. Chem. 271:29016-29022]. RII subunits of PKA bind to AKAPs with nanomolar affinity [Carr et al. (1992) J. Biol. Chem. 267:13376-13382], and many AKAP-RII complexes have been isolated from cell extracts. RI subunits of PKA bind to AKAPs with only micromolar affinity [Burton et al. (1997) Proc. Natl. Acad. Sci. U.S.A. 94:11067-11072]. Evidence of binding of a PKA RI subunit to an AKAP has been reported [Miki and Eddy (1998) J. Biol. Chem 273:34384-34390] in which RIฮฑ-specific and RIฮฑ/RIIฮฑ dual specificity PKA anchoring domains were identified on FSC1/AKAP82. Additional dual specific AKAPs, referred to as D-AKAP1 and D-AKAP2, which interact with the type I and type II regulatory subunits of PKA have also been reported [Huang et al. (1997) J. Biol. Chem. 272:8057-8064; Huang et al. (1997) Proc. Natl. Acad. Sci. U.S.A. 94:11184-11189]
[0296] More than 20 AKAPs have been reported in different tissues and species. Complementary DNAs (cDNAs) encoding AKAPs have been isolated from diverse species, ranging from Caenorhabditis elegans and Drosophila to human [see, e.g., Colledge and Scott (1999) Trends Cell Biol. 9:216-221]. Regions within AKAPs that mediate association with RII subunits of PKA have been identified. These regions of approximately 10-18 amino acid residues vary substantially in primary sequence, but secondary structure predictions indicate that they are likely to form an amphipathic helix with hydrophobic residues aligned along one face of the helix and charged residues along the other [Carr et al. (1991) J. Biol. Chem. 266:14188-14192; Carr et al. (1992) J. Biol. Chem. 267:13376-13382]. Hydrophobic amino acids with a long aliphatic side chain, e.g., valine, leucine or isoleucine, can participate in binding to RII subunits [Glantz et al. (1993) J. Biol. Chem. 268:12796-12804].
[0297] Many AKAPs also have the ability to bind to multiple proteins, including other signaling enzymes. For example, AKAP79 binds to PKA, protein kinase C (PKC) and the protein phosphatase calcineurin (PP2B) [Coghlan et al. (1995) Science 267:108-112 and Klauck et al. (1996) Science 271:1589-1592]. Therefore, the targeting of AKAP79 to neuronal postsynaptic membranes brings together enzymes with opposite catalytic activities in a single complex.
[0298] AKAPs thus serve as potential regulatory mechanisms that increase the selectivity and intensity of a cAMP-mediated response. There is a need, therefore, to identify and elucidate the structural and functional properties of AKAPs in order to gain a complete understanding of the important role these proteins play in the basic functioning of cells.
AKAP10
[0299] The sequence of a human AKAP10 cDNA (also referred to as D-AKAP2) is available in the GenBank database, at accession numbers AF037439 (SEQ ID NO: 31) and NM 007202. The AKAP10 gene is located on chromosome 17.
[0300] The sequence of a mouse D-AKAP2 cDNA is also available in the GenBank database (see accession number AF021833). The mouse D-AKAP2 protein contains an RGS domain near the amino terminus that is characteristic of proteins that interact with Gฮฑ subunits and possess GTPase activating protein-like activity [Huang et al. (1997) Proc. Natl. Acad. Sci. U.S.A. 94:11184-11189]. The human AKAP10 protein also has sequences homologous to RGS domains. The carboxy-terminal 40 residues of the mouse D-AKAP2 protein are responsible for the interaction with the regulatory subunits of PKA. This sequence is fairly well conserved between the mouse D-AKAP2 and human AKAP10 proteins.
Polymorphisms of the Human AKAP10 Gene and Polymorphic AKAP10 Proteins
[0301] Polymorphisms of AKAP genes that alter gene expression, regulation, protein structure and/or protein function are more likely to have a significant effect on the regulation of enzyme (particularly PKA) activity, cellular transduction of signals and responses thereto and on the basic functioning of cells than polymorphisms that do not alter gene and/or protein function. Included in the polymorphic AKAPs provided herein are human AKAP10 proteins containing differing amino acid residues at position number 646.
[0302] Amino acid 646 of the human AKAP10 protein is located in the carboxy-terminal region of the protein within a segment that participates in the binding of R-subunits of PKAs. This segment includes the carboxy-terminal 40 amino acids.
[0303] The amino acid residue reported for position 646 of the human AKAP10 protein is an isoleucine. Polymorphic human AKAP10 proteins provided herein have the amino acid sequence but contain residues other than isoleucine at amino acid position 646 of the protein. In particular embodiments of the polymorphic human AKAP10 proteins provided herein, the amino acid at position 646 is a valine, leucine or phenylalanine residue.
An A to G Transition at Nucleotide 2073 of the Human AKAP10 Coding Sequence
[0304] As described herein, an allele of the human AKAP10 gene that contains a specific polymorphism at position 2073 of the coding sequence and thereby encodes a valine at position 646 has been detected in varying frequencies in DNA samples from younger and older segments of the human population. In this allele, the A at position 2073 of the AKAP10 gene coding sequence is changed from an A to a G, giving rise to an altered sequence in which the codon for amino acid 646 changes from ATT, coding for isoleucine, to GTT, coding for valine.
Morbidity Marker 1: Human Protein Kinase A Anchoring Protein (AKAP10-1)
PCR Amplification and BiomassPROBE Assay Detection of AKAP10-1 in a Healthy Donor Population
PCR Amplification of Donor Population for AKAP 10
[0305] PCR primers were synthesized by OPERON using phosphoramidite chemistry.
[0306] Amplification of the AKAP10 target sequence was carried out in single 50 ฮผl PCR reaction with 100 ng-1 ug of pooled human genomic DNAs in a 50 ฮผl PCR reaction. Individual DNA concentrations within the pooled samples were present in equal concentration with the final concentration ranging from 1-25 ng. Each reaction containing 1รPCR buffer (Qiagen, Valencia, Calif.), 2OO uM dNTPs, 1U Hotstar Taq polymerase (Qiagen, Valencia, Calif.), 4 mM MgCl2, and 25 pmol of the forward primer containing the universal primer sequence and the target specific sequence 5'-TCTCAATCATGTGCATTGAGG-3'(SEQ ID NO: 45), 2 pmol of the reverse primer 5'-AGCGGATAACAATTTCACACAGGGATCACACAGCCATCAGCAG-3' (SEQ ID NO: 46), and I0 pmol of a biotinylated universal primer complementary to the 5' end of the PCR amplicon 5'-AGCGGATAACAATTTCACACAGG-3'(SEQ ID NO: 47). After an initial round of amplification with the target with the specific forward and reverse primer, the 5' biotinylated universal primer then hybridized and acted as a reverse primer thereby introducing a 3' biotin capture moiety into the molecule. The amplification protocol results in a 5'-biotinylated double stranded DNA amplicon and dramatically reduces the cost of high throughput genotyping by eliminating the need to 5' biotin label each forward primer used in a genotyping. Thermal cycling was performed in 0.2 mL tubes or 96 well plate using an MJ Research Thermal Cycler (calculated temperature) with the following cycling parameters: 94ยฐ C. for 5 min; 45 cycles: 94ยฐ C. for 20 sec, 56ยฐ C. for 30 sec, 72ยฐ C. for 60 sec; 72ยฐ C. 3 min.
Immobilization of DNA
[0307] The 50 ฮผl PCR reaction was added to 25 ul of streptavidin coated magnetic bead (Dynal) prewashed three times and resuspended in 1M NH4Cl, 0.06M NH4OH. The PCR amplicons were allowed to bind to the beads for 15 minutes at room temperature. The beads were then collected with a magnet and the supernatant containing unbound DNA was removed. The unbound strand was released from the double stranded amplicons by incubation in 100 mM NaOH and washing of the beads three times with 10 mM Tris pH 8.0.
BiomassPROBE Assay Analysis of Donor Population for AKAP10-1 (Clone 48319)
[0308] Genotyping using the BiomassPROBE assay methods was carried out by resuspending the DNA coated magnetic beads in 26 mM Tris-HCl pH 9.5, 6.5 mM MgCl2 and 50 mM each of dTTP and 50 mM each of ddCTP, ddATP, ddGTP, 2.5U of a thermostable DNA polymerase (Ambersham) and 20 pmol of a template specific oligonucleotide PROBE primer 5'-CTGGCGCCCACGTGGTCAA-3' (SEQ ID NO: 48) (Operon). Primer extension occurs with three cycles of oligonucleotide primer hybridization and extension. The extension products were analyzed after denaturation from the template with 50 mM NH4Cl and transfer of 150 mL each sample to a silicon chip preloaded with 150 mL of H3PA matrix material. The sample material was allowed to crystallize and was analyzed by MALDI-TOF (Bruker, PerSeptive). The SNP that is present in AKAP10-1 is a T to C transversion at nucleotide number 156277 of the sequence of a genomic clone of the AKAP10 gene (GenBank Accession No. AC005730) (SEQ ID NO: 36). SEQ ID NO: 35: represents the nucleotide sequence of human chromosome 17, which contains the genomic nucleotide sequence of the human AKAP10 gene, and SEQ ID NO: 36 represents the nucleotide sequence of human chromosome 17, which contains the genomic nucleotide sequence of the human AKAP10-1 allele. The mass of the primer used in the BioMass probe reaction was 5500.6 daltons. In the presence of the SNP, the primer is extended by the addition of ddC, which has a mass of 5773.8. The wildtype gene results in the addition of dT and ddG to the primer to produce an extension product having a mass of 6101 daltons.
[0309] The frequency of the SNP was measured in a population of age selected healthy individuals. Five hundred fifty-two (552) individuals between the ages of 18-39 years (276 females, 276 males) and 552 individuals between the ages of 60-79 (184 females between the ages of 60-69, 368 males between the age of 60-79) were tested for the presence of the polymorphism localized in the non-translated 3' region of AKAP 10. Differences in the frequency of this polymorphism with increasing age groups were observed among healthy individuals. Statistical analysis showed that the significance level for differences in the allelic frequency for alleles between the "younger" and the "older" populations was p=0.0009 and for genotypes was p=0.003. Differences between age groups are significant. For the total population allele significance is p=0.0009, and genotype significance is p=0.003.
[0310] This marker led to the best significant result with regard to allele and genotype frequencies in the age-stratified population. FIG. 19 shows the allele and genotype frequency in both genders as well as in the entire population. For the latter, the significance for alleles was p=0.0009 and for genotypes was p=0.003. The young and old populations were in Hardy-Weinberg equilibrium. A preferential change of one particular genotype was not observed.
[0311] The polymorphism is localized in the non-translated 3'-region of the gene encoding the human protein kinase A anchoring protein (AKAP10). The gene is located on chromosome 17. Its structure includes 15 exons and 14 intervening sequences (introns). The encoded protein is responsible for the sub-cellular localization of the cAMP-dependent protein kinase and, therefore, plays a key role in the G-protein mediated receptor-signaling pathway (Huang et al. PNAS (1007) 94:11184-11189). Since its localization is outside the coding region, this polymorphism is most likely in linkage disequilibrium (LD) with other non-synonymous polymorphisms that could cause amino acid substitutions and subsequently alter the function of the protein. Sequence comparison of different Genbank database entries concerning this gene revealed further six potential polymorphisms of which two are supposed to change the respective amino acid (see Table 3).
TABLE-US-00012 TABLE 3 Exon Codon Nucleotides Amino acid 3 100 GCT > GCC Ala > Ala 4 177 AGT > GTG Met > Val 8 424 GGG > GGC Gly > Gly 10 524 CCG > CTG Pro > Leu 12 591 GTG > GTC Val > Val 12 599 CGC > CGA Arg > Arg
Morbitity Marker 2: Human Protein Kinase a Anchoring Protein (AKAP10-5) Discovery of AKAP10-5 Allele (SEQ ID NO: 33)
[0312] Genomic DNA was isolated from blood (as described above) of seventeen (17) individuals with a genotype CC at the AKAP10-1 gene locus and a single heterozygous individual (CT) (as described). A target sequence in the AKAP10-1 gene which encodes the C-terminal PKA binding domain was amplified using the polymerase chain reaction. PCR primers were synthesized by OPERON using phosphoramidite chemistry. Amplification of the AKAP10-1 target sequence was carried out in individual 50 ฮผl PCR reaction with 25 ng of human genomic DNA templates. Each reaction containing 1รPCR buffer (Qiagen, Valencia, Calif.), 200 ฮผM dNTPs, IU Hotstar Taq polymerase (Qiagen, Valencia, Calif.), 4 mM MgCl2, 25 pmol of the forward primer (Ex13F) containing the universal primer sequence and the target specific sequence 5'-TCC CAA AGT GCT GGA ATT AC-3' (SEQ ID NO: 53), and 2 pmol of the reverse primer (Ex14R) 5'-GTC CAA TAT ATG CAA ACA GTT G-3' (SEQ ID NO: 54). Thermal cycling was performed in 0.2 mL tubes or 96 well plate using an MJ Research Thermal Cycler (MJ Research, Waltham, Mass.) (calculated temperature) with the following cycling parameters: 94ยฐ C. for 5 min; 45 cycles; 94ยฐ C. for 20 sec, 56ยฐ C. for 30 sec, 72ยฐ C. for 60 sec; 72ยฐ C. 3 min. After amplification the amplicons were purified using a chromatography (Mo Bio Laboratories (Solana Beach, Calif.)).
[0313] The sequence of the 18 amplicons, representing the target region, was determined using a standard Sanger cycle sequencing method with 25 nmol of the PCR amplicon, 3.2 uM DNA sequencing primer 5'-CCC ACA GCA GTT AAT CCT TC-3'(SEQ ID NO: 55), and chain terminating dRhodamine labeled 2', 3' dideoxynucleotides (PE Biosystems, Foster City, Calif.) using the following cycling parameters: 96ยฐ C. for 15 seconds; 25 cycles: 55ยฐ C. for 15 seconds, 60ยฐ C. for 4 minutes. The sequencing products precipitated by 0.3M NaOAc and ethanol. The precipitate was centrifuged and dried. The pellets were resuspended in deionized formamide and separated on a 5% polyacrylimide gel. The sequence was determined using the "Sequencher" software (Gene Codes, Ann Arbor, Mich.).
[0314] The sequence of all 17 of the amplicons, which are homozygous for the AKAP10-1 SNP of the amplicons, revealed a polymorphism at nucleotide position 152171 (numbering for GenBank Accession No. AC005730 for AKAP10 genomic clone (SEQ ID NO: 35)) with A replaced by G. This SNP also can be designated as located at nucleotide 2073 of a cDNA clone of the wildtype AKAP10 (GenBank Accession No. AF037439) (SEQ ID NO: 31). The amino acid sequence of the human AKAP10 protein is provided as SEQ ID NO: 34. This single nucleotide polymorphism was designated as AKAP10-5 (SEQ ID NO: 33) and resulted in a substitution of a valine for an isoleucine residue at amino acid position 646 of the amino acid sequence of human AKAP10 (SEQ ID NO: 32).
PCR Amplification and BiomassPROBE Assay Detection of AKAP10-5 in a Healthy Donor Population
[0315] The healthy population stratified by age is a very efficient and a universal screening tool for morbidity associated genes by allowing for the detection of changes of allelic frequencies in the young compared to the old population. Individual samples of this healthy population base can be pooled to further increase the throughput.
[0316] Healthy samples were obtained through the blood bank of San Bernardino, Calif. Both parents of the blood donors were of Caucasian origin. Practically a healthy subject, when human, is defined as human donor who passes blood bank criteria to donate blood for eventual use in the general population. These criteria are as follows: free of detectable viral, bacterial, mycoplasma, and parasitic infections; not anemic; and then further selected based upon a questionnaire regarding history (see FIG. 3). Thus, a healthy population represents an unbiased population of sufficient health to donate blood according to blood bank criteria, and not further selected for any disease state. Typically such individuals are not taking any medications.
[0317] PCR primers were synthesized by OPERON using phosphoramidite chemistry. Amplification of the AKAP10 target sequence was carried out in a single 50 ฮผl PCR reaction with 100 ng-1 ฮผg of pooled human genomic DNAs in a 50 ฮผl PCR reaction. Individual DNA concentrations within the pooled samples were present in equal concentration with the final concentration ranging from 1-25 ng. Each reaction contained 1รPCR buffer (Qiagen, Valencia, Calif.), 200 ฮผM dNTPs, 1U Hotstar Taq polymerase (Qiagen, Valencia, Calif.), 4 mM MgCl2, and 25 pmol of the forward primer containing the universal primer sequence and the target specific sequence 5'-AGCGGATAACAATTTCACACAGGGAGCTAGCTTGGAAGATTGC-3' (SEQ ID NO: 41), 2 pmol of the reverse primer 5'-GTCCAATATATGCAAACAGTTG-3' (SEQ ID NO: 54), and 10 pmol of a biotinylated universal primer complementary to the 5' end of the PCR amplicon B10:5'-AGCGGATAACAATTTCACACAGG-3' (SEQ ID NO: 43). After an initial round of amplification with the target with the specific forward and reverse primer, the 5' biotinylated universal primer can then be hybridized and acted as a forward primer thereby introducing a 5' biotin capture moiety into the molecule. The amplification protocol resulted in a 5'-biotinylated double stranded DNA amplicon and dramatically reduced the cost of high throughput genotyping by eliminating the need to 5' biotin label every forward primer used in a genotyping.
[0318] Thermal cycling was performed in 0.2 mL tubes or 96 well plate using an MJ Research Thermal Cycler (calculated temperature) with the following cycling parameters: 94ยฐ C. for 5 min; 45 cycles: 94ยฐ C. for 20 sec, 56ยฐ C. for 30 sec; 72ยฐ C. for 60 sec; 72ยฐ C. 3 min.
Immobilization of DNA
[0319] The 50 ฮผl PCR reaction was added to 25 ฮผL of streptavidin coated magnetic beads (Dynal, Oslo, Norway), which were prewashed three times and resuspended in 1M NH4Cl, 0.06M NH4OH. The 5' end of one strand of the double stranded PCR amplicons were allowed to bind to the beads for 15 minutes at room temperature. The beads were then collected with a magnet, and the supernatant containing unbound DNA was removed. The hybridized but unbound strand was released from the double stranded amplicons by incubation in 100 mM NaOH and washing of the beads three times with 10 mM Tris pH 8.0.
Detection of AKAP10-5 Using BiomassPROBEยฎ Assay
[0320] BiomassPROBEยฎ assay of primer extension analysis (see, U.S. Pat. No. 6,043,031) of donor population for AKAP 10-5 (SEQ ID NO: 33) was performed. Genotyping using these methods was carried out by resuspending the DNA coated magnetic beads in 26 mM Tris-HCL pH 9.5, 6.5 mM MgCl2, 50 mM dTTP, 50 mM each of ddCTP, ddATP, ddGTP, 2.5U of a thermostable DNA polymerase (Ambersham), and 20 pmol of a template specific oligonucleotide PROBE primer 5'-ACTGAGCCTGCTGCATAA-3' (SEQ ID NO: 44) (Operon). Primer extension occurs with three cycles of oligonucleotide primer with hybridization and extension. The extension products were analyzed after denaturation from the template with 50 mM NH4Cl and transfer of 150 mL of each sample to a silicon chip preloaded with 150 nl of H3PA matrix material. The sample material was allowed to crystallize and analyzed by MALDI-TOF (Bruker, PerSeptive). The primer has a mass of 5483.6 daltons. The SNP results in the addition of a ddC to the primer, giving a mass of 5756.8 daltons for the extended product. The wild type results in the addition a T and ddG to the primer giving a mass of 6101 daltons.
[0321] The frequency of the SNP was measured in a population of age selected healthy individuals. Seven hundred thirteen (713) individuals under 40 years of age (360 females, 353 males) and 703 individuals over 60 years of age (322 females, 381 males) were tested for the presence of the SNP, AKAP10-5 (SEQ ID NO: 33). Results are presented below in Table 4.
TABLE-US-00013 TABLE 4 AKAP10-5 (2073V) frequency comparison in 2 age groups <40 >60 delta G allele Female Alleles *G 38.6 34.6 4.0 *A 61.4 65.4 Genotypes G 13.9 11.8 2.1 GA 49.4 45.7 A 36.7 42.5 Male Alleles *G 41.4 37.0 4.4 *A 58.6 63.0 Genotypes G 18.4 10.8 7.7 GA 45.9 52.5 A 35.7 36.7 Total Alleles *G 40.0 35.9 4.1 *A 60.0 64.1 Genotypes G 16.1 11.2 4.9 GA 47.7 49.4 A 36.2 39.4
[0322] FIG. 20 graphically shows these results of allele and genotype distribution in the age and sex stratified Caucasian population.
Morbidity Marker 3: Human Methionine Sulfoxide Reductase a (msrA)
[0323] The age-related allele and genotype frequency of this marker in both genders and the entire population is shown in FIG. 21. The decrease of the homozygous CC genotype in the older male population is highly significant.
Methionine Sulfoxide Reductase A (#63306)
[0324] PCR Amplification and BiomassPROBE Assay Detection of the Human Methionine Sulfoxide Reductase a (h-Msr-A) in a Healthy Donor Population PCR Amplification of Donor Population for h-Msr-A
[0325] PCR primers were synthesized by OPERON using phosphoramidite chemistry. Amplification of the AKAP10 target sequence was carried out in single 50 ฮผl PCR reaction with 100 ng-1 ug of pooled human genomic DNA templates in a 50 ฮผl PCR reaction. Individual DNA concentrations within the pooled samples were present in an equal concentration with the final concentration ranging from 1-25 ng. Each reaction containing I X PCR buffer (Qiagen, Valencia, Calif.), 200 ฮผM dNTPs, 1U Hotstar Taq polymerase (Qiagen, Valencia, Calif.), 4 mM MgCl2, 25 pmol of the forward primer containing the universal primer sequence and the target specific sequence 5'-TTTCTCTGCACAGAGAGGC-3' (SEQ ID NO: 49), 2 pmol of the reverse primer 5'-AGCGGATAACAATTTCACACAGGGCTGAAATCCTTCGCTTTACC-3' (SEQ ID NO: 50), and 10 pmol of a biotinylated universal primer complementary to the 5' end of the PCR amplicon 5'-AGCGGATAACAATTTCACACAGG-3' (SEQ ID NO: 51). After an initial round of amplification of the target with the specific forward and reverse primers, the 5' biotinylated universal primer was then hybridized and acted as a reverse primer thereby introducing a 3' biotin capture moiety into the molecule. The amplification protocol results in a 5'-biotinylated double stranded DNA amplicon and dramatically reduces the cost of high throughput genotyping by eliminating the need to 5' biotin label each forward primer used in a genotyping. Thermal cycling was performed in 0.2 mL tubes or 96 well plate using an MJ Research Thermal Cycler (calculated temperature) with the following cycling parameters: 94ยฐ C. for 5 min; 45 cycles: 94ยฐ C. for 20 sec, 56ยฐ C. for 30 sec, 72ยฐ C. for 60 sec; 72ยฐ C. 3 min.
Immobilization of DNA
[0326] The 50 ฮผl PCR reaction was added to 25 ul of streptavidin coated magnetic bead (Dynal) prewashed three times and resuspended in 1M NH4Cl, 0.06M NH4OH. The PCR amplicons were allowed to bind to the beads for 15 minutes at room temperature. The beads were then collected with a magnet and the supernatant containing unbound DNA was removed. The unbound strand was released from the double stranded amplicons by incubation in 100 mM NaOH and washing of the beads three times with 10 mM Tris pH 8.0.
BiomassPROBE Assay Analysis of Donor Population for h-Msr A
[0327] Genotyping using the BiomassPROBE assay methods was carried out by resuspending the DNA coated magnetic beads in 26 mM Tris-HCl pH 9.5, 6.5 mM MgCl2, 50 mM of dTTPs and 50 mM each of ddCTP, ddATP, ddGTP, 2.5U of a thermostable DNA polymerase (Amersham), and 20 pmol of a template specific oligonucleotide PROBE primer 5'-CTGAAAAGGGAGAGAAAG-3' (Operon) (SEQ ID NO: 52). Primer extension occurs with three cycles of oligonucleotide primer with hybridization and extension. The extension products were analyzed after denaturation from the template with 50 mM NH4Cl and transfer of 150 nl each sample to a silicon chip preloaded with 150 nl of H3PA matrix material. The sample material was allowed to crystallize and analyzed by MALDI-TOF (Bruker, PerSeptive). The SNP is represented as a T to C transversion in the sequence of two ESTs. The wild type is represented by having a T at position 128 of GenBank Accession No. AW 195104, which represents the nucleotide sequence of an EST which is a portion of the wild type human msrA gene (SEQ ID NO: 39). The SNP is presented as a C at position 129 of GenBank Accession No. AW 874187, which represents the nucleotide sequence of an EST which is a portion of an allele of the human msrA gene (SEQ ID NO: 40).
[0328] In a genomic sequence the SNP is represented as an A to G transversion. The primer utilized in the BioMass probe reaction had a mass of 5654.8 daltons. In the presence of the SNP the primer is extended by the incorporation of a ddC and has a mass of 5928. In the presence of the wildtype the primer is extended by adding a dT and a DDC to produce a mass of 6232.1 daltons.
[0329] The frequency of the SNP was measured in a population of age selected healthy individuals. Five hundred fifty-two (552) individuals between the ages of 18-39 years (276 females, 276 males and 552 individuals between the age of 60-79 (184 females between the ages of 60-69, 368 males between the age of 60-79) were tested for the presence of the polymorphism localized in the nontranslated 3' region of h-msr-A.
[0330] Genotype difference between male age group among healthy individuals is significant. For the male population allele significance is p=0.0009 and genotype significance is p=0.003. The age-related allele and genotype frequency of this marker in both genders and the entire population is shown in FIG. 21. The decrease of the homozygous CC genotype in the older male population is highly significant.
[0331] The polymorphism is localized in the non-translated 3'-region of the gene encoding the human methionine sulfoxide reductase (h-msrA). The exact localization is 451 base pairs downstream the stop codon (TAA). It is likely that this SNP is in linkage disequilibrium (LD) with another polymorphism more upstream in the coding or promoter region; thus, it does not directly cause morbidity. The enzyme methionine sulfoxide reductase has been proposed to exhibit multiple biological functions. It can serve to repair oxidative protein damage but also play an important role in the regulation of proteins by activation or inactivation of their biological functions (Moskovitz et al. (1990) PNAS 95:14071-14075). It has also been shown that its activity is significantly reduced in brain tissues of Alzheimer patients (Gabbita et al., (1999) J. Neurochem 73:1660-1666). It is scientifically conceivable that proteins involved in the metabolism of reactive oxygen species are associated to disease.
Conclusion
[0332] The use of the healthy population provides for the identification of morbidity markers. The identification of proteins involved in the G-protein coupled signaling transduction pathway or in the detoxification of oxidative stress can be considered as convincing results. Further confirmation and validation of other potential polymorphisms already identified in silico in the gene encoding the human protein kinase A anchoring protein could even provide stronger association to morbidity and demonstrate that this gene product is a suitable pharmaceutical or diagnostic target.
Example 4
MALDI-TOF Mass Spectrometry Analysis
[0333] All of the products of the enzyme assays listed below were analyzed by MALDI-TOF mass spectrometry. A diluted matrix solution (0.15 ฮผL) containing of 10:1 3-hydroxypicolinic acid:ammonium citrate in 1:1 water:acetonitrile diluted 2.5-fold with water was pipetted onto a SpectroChip (Sequenom, Inc.) and was allowed to crystallize. Then, 0.15 ฮผL of sample was added. A linear PerSeptive Voyager DE mass spectrometer or Bruker Biflex MALDI-TOF mass spectrometer, operating in positive ion mode, was used for the measurements. The sample plates were kept at 18.2 kV for 400 nm after each UV laser shot (approximate 250 laser shots total), and then the target voltage was raised to 20 kV. The original spectra were digitized at 500 MHz.
Example 5
Sample Conditioning
[0334] Where indicated in the examples below, the products of the enzymatic digestions were purified with ZipTips (Millipore, Bedford, Mass.). The ZipTips were pre-wetted with 10 ฮผL 50% acetonitrile and equilibrated 4 times with 10 ฮผl 0.1 M TEAAc. The oligonucleotide fragments were bound to the C18 in the ZipTip material by continuous aspiration and dispension of each sample into the ZipTip. Each digested oligonucleotide was conditioned by washing with 10 ฮผL 0.1 M TEAAc, followed by 4 washing steps with 10 ฮผL H2O. DNA fragments were eluted from the Ziptip with 7 ฮผL 50% acetonitrile.
[0335] Any method for condition the samples can be employed. Methods for conditioning, which generally is used to increase peak resolution, are well known (see, e.g., International PCT application No. WO 98/20019).
Example 6
DNA Glycosylase-Mediated Sequence Analysis
[0336] DNA Glycosylases modifies DNA at each position that a specific nucleobase resides in the DNA, thereby producing abasic sites. In a subsequent reaction with another enzyme, a chemical, or heat, the phosphate backbone at each abasic site can be cleaved.
[0337] The glycosylase utilized in the following procedures was uracil-DNA glycosylase (UDG). Uracil bases were incorporated into DNA fragments in each position that a thymine base would normally occupy by amplifying a DNA target sequence in the presence of uracil. Each uracil substituted DNA amplicon was incubated with UDG, which cleaved each uracil base in the amplicon, and was then subjected to conditions that effected backbone cleavage at each abasic site, which produced DNA fragments. DNA fragments were subjected to MALDI-TOF mass spectrometry analysis. Genetic variability in the target DNA was then assessed by analyzing mass spectra.
[0338] Glycosylases specific for nucleotide analogs or modified nucleotides, as described herein, can be substituted for UDG in the following procedures. The glycosylase methods described hereafter, in conjunction with phosphate backbone cleavage and MALDI, can be used to analyze DNA fragments for the purposes of SNP scanning, bacteria typing, methylation analysis, microsatellite analysis, genotyping, and nucleotide sequencing and re-sequencing.
A. Genotyping
[0339] A glycosylase procedure was used to genotype the DNA sequence encoding UCP-2 (Uncoupling Protein 2). The sequence for UCP-2 is deposited in GenBank under accession number AF096289. The sequence variation genotyped in the following procedure was a cytosine (C-allele) to thymine (T-allele) variation at nucleotide position 4790, which results in a alanine to valine mutation at position 55 in the UCP-2 polypeptide.
[0340] DNA was amplified using a PCR procedure with a 50 ฮผL reaction volume containing of 5 pmol biotinylated primer having the sequence 5'-TGCTTATCCCTGTAGCTACCCTGTCTTGGCCTTGCAGATCCAA-3' (SEQ ID NO: 91), 15 pmol non-biotinylated primer having the sequence 5'-AGCGGATAACAATTTCACACAGGCCATCACACCGCGGTACTG-3' (SEQ ID NO: 92), 200 ฮผM dATP, 200 ฮผM dCTP, 200 ฮผM dGTP, 600 ฮผM dUTP (to fully replace dTTP), 1.5 mM to 3 mM MgCl2, 1 U of HotStarTaq polymerase, and 25 ng of CEPH DNA. Amplification was effected with 45 cycles at an annealing temperature of 56ยฐ C.
[0341] The amplification product was then immobilized onto a solid support by incubating 50 ฮผL of the amplification reaction with 5 ฮผL of prewashed Dynabeads for 20 minutes at room temperature. The supernatant was removed, and the beads were incubated with 50 ฮผL of 0.1 M NaOH for 5 minutes at room temperature to denature the double-stranded PCR product in such a fashion that single-stranded DNA was linked to the beads. The beads were then neutralized by three washes with 50 ฮผL 10 mM TrisHCl (pH 8). The beads were resuspended in 10 ฮผL of a 60 mM TrisHCl/1 mM EDTA (pH 7.9) solution, and 1 U uracil DNA glycosylase was added to the solution for 45 minutes at 37ยฐ C. to remove uracil nucleotides present in the single-stranded DNA linked to the beads. The beads were then washed two times with 25 ฮผL of 10 mM TrisHCl (pH 8) and once with 10 ฮผL of water. The biotinylated strands were then eluted from the beads with 12 ฮผL of 2 M NH4OH at 60ยฐ C. for 10 minutes. The backbone of the DNA was cleaved by incubating the samples for 10 min at 95ยฐ C. (with a closed lid), and ammonia was evaporated from the samples by incubating the samples for 11 min at 80ยฐ C.
[0342] The cleavage fragments were then analyzed by MALDI-TOF mass spectrometry as described in Example 4. The T-allele generated a unique fragment of 3254 Daltons. The C-allele generated a unique fragment of 4788 Daltons. These fragments were distinguishable in mass spectra. Thus, the above-identified procedure was successfully utilized to genotype individuals heterozygous for the C-allele and T-allele in UCP-2.
B. Glycosylase Analysis Utilizing Pooled DNA Samples
[0343] The glycosylase assay was conducted using pooled samples to detect genetic variability at the UCP-2 locus. DNA of known genotype was pooled from eleven individuals and was diluted to a fixed concentration of 5 ng/ฮผL. The procedure provided in Example 3A was followed using 2 pmol of forward primer having a sequence of 5'-CCCAGTCACGACGTTGTAAAACGTCTTGGCCTTGCAGATCCAAG-3' (SEQ ID NO: 93) and 15 pmol of reverse primer having the sequence 5'-AGCGGATAACAATTTCACACAGGCCATCACACCGCGGTACTG-3' (SEQ ID NO: 94). In addition, 5 pmol of biotinylated primer having the sequence 5'bioCCCAGTCACGACGTTGTAAAACG 3' (SEQ ID NO: 97) can be introduced to the PCR reaction after about two cycles. The fragments were analyzed via MALDI-TOF mass spectroscopy (Example 4). As determined in Example 3A, the T-allele, which generated a unique fragment of 3254 Daltons, could be distinguished in mass spectra from the C-allele, which generated a unique fragment of 4788 Daltons. Allelic frequency in the pooled samples was quantified by integrating the area under each signal corresponding to an allelic fragment. Integration was accomplished by hand calculations using equations well known to those skilled in the art. In the pool of eleven samples, this procedure suggested that 40.9% of the individuals harbored the T allele and 59.09% of the individuals harbored the C allele.
C. Glycosylase-Mediated Microsatellite Analysis
[0344] A glycosylase procedure was utilized to identify microsatellites of the Bradykinin Receptor 2 (BKR-2) sequence. The sequence for BKR-2 is deposited in GenBank under accession number X86173. BKR-2 includes a SNP in the promoter region, which is a C to T variation, as well as a SNP in a repeated unit, which is a G to T variation. The procedure provided in Example 3A was utilized to identify the SNP in the promotor region, the SNP in the microsattelite repeat region, and the number of repeated units in the microsattelite region of BKR-2. Specifically, a forward PCR primer having the sequence 5'-CTCCAGCTGGGCAGGAGTGC-3' (SEQ ID NO: 95) and a reverse primer having the sequence 5'-CACTTCAGTCGCTCCCT-3' (SEQ ID NO: 96) were utilized to amplify BKR-2 DNA in the presence of uracil. The amplicon was fragmented by UDG followed by backbone cleavage. The cleavage fragments were analyzed by MALDI-TOF mass spectrometry as described in Example 4.
[0345] With regard to the SNP in the BKR-2 promotor region having a C to T variation, the C-allele generated a unique fragment having a mass of 7342.4 Daltons, and the T-allele generated a unique fragment having a mass of 7053.2 Daltons. These fragments were distinguishable in mass spectra. Thus, the above-identified procedure was successfully utilized to genotype individuals heterozygous for the C-allele and T-allele in the promotor region of BKR-2.
[0346] With regard to the SNP in the BKR-2 repeat region having a G to T variation, the T-allele generated a unique fragment having a mass of 1784 Daltons, which was readily detected in a mass spectrum. Hence, the presence of the T-allele was indicative of the G to T sequence variation in the repeat region of BKR-2.
[0347] In addition, the number of repeat regions was distinguished between individuals having two repeat sequences and individuals having three repeat sequences in BKR-2. The DNA of these individuals did not harbor the G to T sequence variation in the repeat sequence as each repeat sequence contained a G at the SNP locus. The number of repeat regions was determined in individual samples by calculating the area under a signal corresponding to a unique DNA fragment having a mass of 2771.6 Daltons. This signal in spectra generated from individuals having two repeat regions had an area that was thirty-three percent less than the area under the same signal in spectra generated from individuals having three repeat regions. Thus, the procedures discussed above can be utilized to genotype individuals for the number of repeat sequences present in BKR-2.
D. Bisulfite Treatment Coupled with Glycosylase Digestion
[0348] Bisulfite treatment of genomic DNA can be utilized to analyze positions of methylated cytosine residues within the DNA. Treating nucleic acids with bisulfite deaminates cytosine residues to uracil residues, while methylated cytosine remains unmodified. Thus, by comparing the sequence of a PCR product generated from genomic DNA that is not treated with bisulfite with the sequence of a PCR product generated from genomic DNA that is treated with bisulfite, the degree of methylation in a nucleic acid as well as the positions where cytosine is methylated can be deduced.
[0349] Genomic DNA (2 ฮผg) was digested by incubation with 1 ฮผL of a restriction enzyme at 37ยฐ C. for 2 hours. An aliquot of 3 M NaOH was added to yield a final concentration of 0.3M NaOH in the digestion solution. The reaction was incubated at 37ยฐ C. for 15 minutes followed by treatment with 5.35M urea, 4.44M bisulfite, and 10 mM hydroquinone, where the final concentration of hydroquinone is 0.5 mM.
[0350] The sample that was treated with bisulfite (sample A) was compared to the same digestion sample that had not undergone bisulfite treatment (sample B). After sample A was treated with bisulfite as described above, sample A and sample B were amplified by a standard PCR procedure. The PCR procedure included the step of overlaying each sample with mineral oil and then subjecting the sample to thermocycling (20 cycles of 15 minutes at 55ยฐ C. followed by 30 seconds at 95ยฐ C.). The PCR reaction contained four nucleotide bases, C, A, G, and U. The mineral oil was removed from each sample, and the PCR products were purified with glassmilk. Sodium iodide (3 volumes) and glassmilk (5 ฮผL) were added to samples A and B. The samples were then placed on ice for 8 minutes, washed with 420 ฮผL cold buffer, centrifuged for 10 seconds, and the supernatant fractions were removed. This process was repeated twice and then 25 ฮผL of water was added. Samples were incubated for 5 minutes at 37ยฐ C., were centrifuged for 20 seconds, and the supernatant fraction was collected, and then this incubation/centrifugation/supernatant fraction collection procedure was repeated. 50 ฮผL 0.1 M NaOH was then added to the samples to denature the DNA. The samples were incubated at room temperature for 5 minutes, washed three times with 50 ฮผL of 10 mM TrisHCl (pH 8), and resuspended in 10 ฮผL 60 mM TrisHCl/1 mM EDTA, pH 7.9.
[0351] The sequence of PCR products from sample A and sample B were then treated with 2U of UDG (MBI Fermentas) and then subjected to backbone cleavage, as described herein. The resulting fragments from each of sample A and sample B were analyzed by MALDI-TOF mass spectroscopy as described in Example 4. Sample A gave rise to a greater number of fragments than the number of fragments arising from sample B, indicative that the nucleic acid harbored at least one methylated cytosine moiety.
Example 7
Fen-Ligase-Mediated Haplotyping
[0352] Haplotyping procedures permit the selection of a fragment from one of an individual's two homologous chromosomes and to genotype linked SNPs on that fragment. The direct resolution of haplotypes can yield increased information content, improving the diagnosis of any linked disease genes or identifying linkages associated with those diseases. In previous studies, haplotypes were typically reconstructed indirectly through pedigree analysis (in cases where pedigrees were available) through laborious and unreliable allele-specific PCR or through single-molecule dilution methods well known in the art.
[0353] A haplotyping procedure was used to determine the presence of two SNPs, referred to as SNP1 and SNP2, located on one strand in a DNA sample. The haplotyping procedure used in this assay utilized Fen-1, a site-specific "flap" endonuclease that cleaves DNA "flaps" created by the overlap of two oligonucleotides hybridized to a target DNA strand. The two overlapping oligonucleotides in this example were short arm and long arm allele-specific adaptors. The target DNA was an amplified nucleic acid that had been denatured and contained SNP1 and SNP2.
[0354] The short arm adaptor included a unique sequence not found in the target DNA. The 3' distal nucleotide of the short arm adaptor was identical to one of the SNP1 alleles. Moreover, the long arm adaptor included two regions: a 3' region complementary to the short arm and a 5'gene-specific region complementary to the fragment of interest adjacent to the SNP. If there was a match between the adaptor and one of the homologues, the Fen enzyme recognized and cleaved the overlapping flap. The short arm of the adaptor was then ligated to the remainder of the target fragment (minus the SNP site). This ligated fragment was used as the forward primer for a second PCR reaction in which only the ligated homologue was amplified. The second PCR product (PCR2) was then analyzed by mass spectrometry. If there was no match between the adaptors and the target DNA, there was no overlap, no cleavage by Fen-1, and thus no PCR2 product of interest.
[0355] If there was more than one SNP in the sequence of interest, the second SNP (SNP2) was found by using an adaptor that was specific for SNP2 and hybridizing the adaptor to the PCR2 product containing the first SNP. The Fen-ligase and amplification procedures were repeated for the PCR2 product containing the first SNP. If the amplified product yielded a second SNP, then SNP1 and SNP2 were on the same fragment.
[0356] If the SNP is unknown, then four allele-specific adaptors (e.g. C, G, A, and T) can be used to hybridize with the target DNA. The substrates are then treated with the Fen-ligase protocol, including amplification. The PCR2 products can be analyzed by PROBE, as described herein, to determine which adaptors were hybridized to the DNA target and thus identify the SNPs in the sequence.
[0357] A Fen-ligase assay was used to detect two SNPs present in Factor VII. These SNPs are located 814 base pairs apart from each other. SNP1 was located at position 8401 (C to T), and SNP2 was located at 9215 (G to A).
A. First Amplification Step
[0358] A PCR product (PCR1) was generated for a known heterozygous individual at SNP1, a short distance from the 5' end of the SNP. Specifically, a 10 ฮผL PCR reaction was performed by mixing 1.5 mM MgCl2, 200 ฮผM of each dNTP, 0.5 U HotStar polymerase, 0.1 ฮผM of a forward primer having the sequence 5'-GCG CTC CTG TCG GTG CCA (SEQ ID NO: 56), 0.1 ฮผM of a reverse primer having the sequence 5'-GCC TGA CTG GTG GGG CCC (SEQ ID NO: 57), and 1 ng of genomic DNA. The annealing temperature was 58ยฐ C., and the amplification process yielded fragments that were 861 bp in length.
[0359] The PCR1 reaction mixture was divided in half and was treated with an exonuclease 1/SAP mixture (0.22 ฮผL mixture/5 ฮผL PCR1 reaction) which contained 1.0 ฮผL SAP and 0.1 ฮผL exon1. The exonuclease treatment was done for 30 minutes at 37ยฐ C. and then 20 minutes at 85ยฐ C. to denature the DNA.
B. Adaptor Oligonucleotides
[0360] A solution of allele-specific adaptors (C and T), containing of one long and one short oligonucleotide per adaptor, was prepared. The long arm and short arm oligonucleotides of each adaptor (10 ฮผM) were mixed in a 1:1 ratio and heated for 30 seconds at 95ยฐ C. The temperature was reduced in 2ยฐ C. increments to 37ยฐ C. for annealing. The C-adaptor had a short arm sequence of 5'-CAT GCA TGC ACG GTC (SEQ ID NO: 58) and a long arm sequence of 5'-CAG AGA GTA CCC CTC GAO CGT GCA TGC ATG (SEQ ID NO: 59). Hence, the long arm of the adaptor was 30 bp (15 bp gene-specific), and the short arm was 15 bp. The T-adaptor had a short arm sequence of 5'-CAT GCA TGC ACG GTT (SEQ ID NO: 60) and a long arm sequence of 5'-GTA CGT ACG TGC CAA CTC CCC ATG AGA GAC (SEQ ID NO: 61). The adaptor could also have a hairpin structure in which the short and long arm are separated by a loop containing of 3 to 10 nucleotides (SEQ ID NO: 118).
C. FEN-Ligase Reaction
[0361] In two tubes (one tube for each allele-specific adaptor per sample) was placed a solution (Solution A) containing of 3.5 ฮผl 10 mM 16% PEG/50 mM MOPS, 1.2 ฮผl 25 mM MgCl2, 1.5 ฮผl 10ร Ampligase Buffer, and 2.5 ฮผl PCR1. Each tube containing Solution A was incubated at 95ยฐ C. for 5 minutes to denature the PCR1 product. A second solution (Solution B) containing of 1.65 ฮผl Ampligase (Thermostable ligase, Epicentre Technologies), 1.65 ฮผl 200 ng/ฮผl MFEN (from Methanocuccus jannaschii), and 3.0 ฮผl of an allele specific adaptor (C or T) was prepared. Thus, different variations of Solution B, each variation containing of different allele-specific adaptors, were made. Solution B was added to Solution A at 95ยฐ C. and incubated at 55ยฐ C. for 3 hours. The total reaction volume was 15.0 ฮผl per adaptor-specific reaction. For a bi-allelic system, 2ร15.0 ฮผl reactions were required.
[0362] The Fen-ligase reaction in each tube was then deactivated by adding 8.0 ฮผl 10 mM EDTA. Then, 1.0 ฮผl exoIII/Buffer (70%130%) solution was added to each sample and incubated 30 minutes at 37ยฐ C., 20 minutes at 70ยฐ C. (to deactivate exoIII), and 5 minutes at 95ยฐ C. (to denature the sample and dissociate unused adaptor from template). The samples were cooled in an ice slurry and purified on UltraClean PCR Clean-up (MoBio) spin columns which removed all fragments less than 100 base pairs in length. The fragments were eluted with 50 ฮผl H2O.
D. Second Amplification Step
[0363] A second amplification reaction (PCR2) was conducted in each sample tube using the short arm adaptor (C or T) sequence as the forward primer (minus the SNP1 site). Only the ligated homologue was amplified. A standard PCR reaction was conducted with a total volume of 10.0 ฮผl containing of 1ร Buffer (final concentration), 1.5 mM final concentration MgCl2, 200 ฮผM final concentration dNTPs, 0.5 U HotStar polymerase, 0.1 ฮผM final concentration forward primer 5'-CAT GCA TGC ACG GT (SEQ ID NO: 62), 0.1 ฮผM final concentration reverse primer 5'-GCC TGA CTG GTG GGG CCC (SEQ ID NO: 63), and 1.0 ฮผl of the purified FEN-ligase reaction solution. The annealing temperature was 58ยฐ C. The PCR2 product was analyzed by MALDI TOF mass spectroscopy as described in Example 4. The mass spectrum of Fen SNP1 showed a mass of 6084.08 Daltons, representing the C allele.
E. Genotyping Additional SNPs
[0364] The second SNP (SNP2) can be found by using an adaptor that is specific for SNP2 and hybridizing that adaptor to the PCR2 product containing the first SNP. The Fen-ligase and amplification procedures are repeated for the PCR2 product containing the first SNP. If the amplified product yields a second SNP, then SN1 and SN2 are on the same fragment. The mass spectrum of SNP2, representing the T allele, showed a mass of 6359.88 Daltons.
[0365] This assay also can be performed upon pooled DNA to yield haplotype frequencies as described herein. The Fen-ligase assay can be used to analyze multiplexes as described herein.
Example 8
Nickase-Mediated Sequence Analysis
[0366] A DNA nickase, or DNase, was used to recognize and cleave one strand of a DNA duplex. NY2A nickase and NYS1 nickase (Megabase), which cleave DNA at the following sites:
TABLE-US-00014 NY2A: 5' . . . R AG . . . 3' 3' . . . YโTC . . . 5' where R = A or G and Y = C or T NYS1: 5' . . . โCC[A/G/T] . . . 3' 3' . . . GG[T/C/A] . . . 5'
were used.
A. Nickase Digestion
[0367] Tris-HCl (10 mM), KCl (10 mM, pH 8.3), magnesium acetate (25 mM), BSA (1 mg/mL), and 6 U of Cvi NY2A or Cvi NYS1 Nickase (Megabase Research) were added to 25 pmol of double-stranded oligonucleotide template having a sequence of 5'-CGC AGG GTT TCC TCG TCG CAC TGG GCA TGT G-3' (SEQ ID NO: 90, Operon, Alameda, Calif.) synthesized using standard phosphoramidite chemistry. With a total volume of 20 ฮผL, the reaction mixture was incubated at 37ยฐ C. for 5 hours, and the digestion products were purified using ZipTips (Millipore, Bedford, Mass.) as described in Example 5. The samples were analyzed by MALDI-TOF mass spectroscopy as described in Example 1. The nickase Cvi NY2A yielded three fragments with masses 4049.76 Daltons, 5473.14 Daltons, and 9540.71 Daltons. The Cvi NYS1 nickase yielded fragments with masses 2063.18 Daltons, 3056.48 Daltons, 6492.81 Daltons, and 7450.14 Daltons.
B. Nickase Digestion of Pooled Samples
[0368] DQA (HLA ClassII-DQ Alpha, expected fragment size=225 bp) was amplified from the genomic DNA of 100 healthy individuals. DQA was amplified using standard PCR chemistry in a reaction having a total volume of 50 ฮผL containing of 10 mM Tris-HCl, 10 mM KCl (pH 8.3), 2.5 mM MgCl2, 200 ฮผM of each dNTP, 10 pmol of a forward primer having the sequence 5'-GTG CTG CAG GTG TAA ACT TGT ACC AG-3'(SEQ ID NO: 64), 10 pmol of a reverse primer having the sequence 5'-CAC GGA TCC GGT AGC AGC GGT AGA GTT G-3'(SEQ ID NO: 65), 1 U DNA polymerase (Stoffel fragment, Perkin Elmer), and 200 ng human genomic DNA (2 ng DNA/individual). The template was denatured at 94ยฐ C. for 5 minutes. Thermal cycling was continued with a touch-down program that included 45 cycles of 20 seconds at 94ยฐ C., 30 seconds at 56ยฐ C., 1 minute at 72ยฐ C., and a final extension of 3 minutes at 72ยฐ C. The crude PCR product was used in the subsequent nickase reaction.
[0369] The unpurified PCR product was subjected to nickase digestion. Tris-HCl (10 mM), KCl (10 mM, pH 8.3), magnesium acetate (25 mM), BSA (1 mg/mL), and 5 U of Cvi NY2A or Cvi NYS1 Nickase (Megabase Research) were added to 25 pmol of the amplified template with a total reaction volume of 20 ฮผL. The mixture was then incubated at 37ยฐ C. for 5 hours. The digestion products were purified with either ZipTips (Millipore, Bedford, Mass.) as described in Example 5. The samples were analyzed by MALDI-TOF mass spectroscopy as described in Example 4. This assay also can be used to do multiplexing and standardless genotyping as described herein.
[0370] To simplify the nickase mass spectrum, the two complementary strands can be separated after digestion by using a single-stranded undigested PCR product as a capture probe. This probe (preparation shown below in Example 8C) can be hybridized to the nickase fragments in hybridization buffer containing 200 mM sodium citrate and 1% blocking reagent (Boehringer Mannheim). The reaction is heated to 95ยฐ C. for 5 minutes and cooled to room temperature over 30 minutes by using a thermal cycler (PTC-200 DNA engine, MJ Research, Waltham, Mass.). The capture probe-nickase fragment is immobilized on 140 ฮผg of streptavidin-coated magnetic beads. The beads are subsequently washed three times with 70 mM ammonium citrate. The captured single-stranded nickase fragments are eluted by heating to 80ยฐ C. for 5 minutes in 5 ฮผL of 50 mM ammonium hydroxide.
C. Preparation of Capture Probe
[0371] The capture probe is prepared by amplifying the human ฮฒ-globin gene (3' end of intron 1 to 5' end of exon 2) via PCR methods in a total volume of 50 ฮผL containing of GeneAmp 1ร PCR Buffer II, 10 mM Tris-HCl, pH 8.3, 50 mM KCl, 2 mM MgCl2, 0.2 mM dNTP mix, 10 ฮผmol of each primer (forward primer 5'-ACTGGGCATGTGGAGACAG-3'(SEQ ID NO: 66) and biotinylated reverse primer bio5'-GCACTTTCTTGCCATGAG-3'(SEQ ID: 67), 2 U of AmpliTaq Gold, and 200 ng of human genomic DNA. The template is denatured at 94ยฐ C. for 8 minutes. Thermal cycling is continued with a touch-down program that included 11 cycles of 20 seconds at 94ยฐ C., 30 seconds at 64ยฐ C., 1 minute at 72ยฐ C.; and a final extension of 5 minutes at 72ยฐ C. The amplicon is purified using UltraCleanยฎ PCR clean-up kit (MO Bio Laboratories, Solano Beach, Calif.).
Example 9
Multiplex Type IIS SNP Assay
[0372] A Type IIS assay was used to identify human gene sequences with known SNPs. The Type IIS enzyme used in this assay was Fok I which effected double-stranded cleavage of the target DNA. The assay involved the steps of amplification and Fok I treatment of the amplicon. In the amplification step, the primers were designed so that each PCR product of a designated gene target was less than 100 bases such that a Fok I recognition sequence was incorporated at the 5' and 3' end of the amplicon. Therefore, the fragments that were cleaved by Fok I included a center fragment containing the SNP of interest.
[0373] Ten human gene targets with known SNPs were analyzed by this assay. Sequences of the ten gene targets, as well as the primers used to amplify the target regions, are found in Table 5. The ten targets were lipoprotein lipase, prothrombin, factor V, cholesterol ester transfer protein (CETP), factor VII, factor XIII, HLA-H exon 2, HLA-H exon 4, methylenetetrahydrofolate reductase (MTHR), and P53 exon 4 codon 72.
[0374] Amplification of the ten human gene sequences were carried out in a single 50 ฮผL volume PCR reaction with 20 ng of human genomic DNA template in 5 PCR reaction tubes. Each reaction vial contained 1รPCR buffer (Qiagen), 200 ฮผM dNTPs, 1U Hotstar Taq polymerase (Qiagen), 4 mM MgCl2, and 10 pmol of each primer. US8, having sequence of 5'TCAGTCACGACGTT3'(SEQ ID NO: 68), and US9, having sequence of 5'CGGATAACAATTTC3'(SEQ ID NO: 69), were used for the forward and reverse primers respectively. Moreover, the primers were designed such that a Fok I recognition site was incorporated at the 5' and 3' ends of the amplicon. Thermal cycling was performed in 0.2 mL tubes or a 96 well plate using a MJ Research Thermal Cycler (calculated temperature) with the following cycling parameters: 94ยฐ C. for 5 minutes; 45 cycles: 94ยฐ C. for 20 seconds, 56ยฐ C. for 20 seconds, 72ยฐ C. for 60 seconds; and 72ยฐ C. for 3 minutes.
[0375] Following PCR, the sample was treated with 0.2 U Exonuclease I (Amersham Pharmacia) and S Alkaline Phosphotase (Amersham Pharmacia) to remove the unincorporated primers and dNTPs. Typically, 0.2 U of exonuclease I and SAP were added to 5 ฮผL of the PCR sample. The sample was then incubated at 37ยฐ C. for 15 minutes. Exonuclease I and SAP were then inactivated by heating the sample up to 85ยฐ C. for 15 minutes. Fok I digestion was performed by adding 2 U of Fok I (New England Biolab) to the 5 uL PCR sample and incubating at 37ยฐ C. for 30 minutes. Since the Fok I restriction sites are located on both sides of the amplicon, the 5' and 3' cutoff fragments have higher masses than the center fragment containing the SNP. The sample was then purified by anion exchange and analyzed by MALDI-TOF mass spectrometry as described in Example 4. The masses of the gene fragments from this multiplexing experiment are listed in Table 6. These gene fragments were resolved in mass spectra thereby allowing multiplex analysis of sequence variability in these genes.
TABLE-US-00015 TABLE 5 Genes for Multiplex Type IIS Assay Seq. ID Seq. ID Gene Sequence No. Primers No. Lipoprotein cctttgagaa agggctctgc ttgagttgta 98-99 5' 70 Lipase gaaagaaccg ctgcaacaat caatttcatcgctggatgcaatctggg (Asn291Ser) ctatgagatc 3' ctgggctatg agatca[a g]taa agtcagagcc 5' 71 aaaagaagca gcaaaatgta caatttcacacagcggatgcttcttttg gctctgact 3' Prothrombin 26731 gaattatttt tgtgtttcta aaactatggt 100-101 5' 72 tcccaataaa agtgactctc tcagtcacgacgttggatgccaataa aagtgactctcagc 3' 26781 agc[g a]agcctc aatgctccca 5' 73 gtgctattca tgggcagctc tctgggctca cggataacaatttcggatgcactggg agcattgaggc 3' Factor V taataggact acttctaatc tgtaagagca 102-103 5' 74 (Arg506Gln) gatccctgga caggc[g a]agga tcagtcacgacgttggatgagcagat ccctggacaggc 3' atacaggtat tttgtccttg aagtaacctt tcag 5' 75 cggataacaatttcggatggacaaa atacctgtattcc 3' Cholesterol 1261 ctcaccatgg gcatttgatt gcagagcage 104-105 5' 76 ester tccgagtcc[g a] tccagagctt tcagtcacgacgttggatgcagagc transfer agctccgagtc 3' protein 1311 cctgcagtca atgatcaccg ctgtgggcat 5' 77 (CETP)(I405V) ccctgaggtc atgtctcgta cagcggtgatcattggatgcagga agctctgg 3' Factor VII 1221 agcaaggact cctgcaaggg ggacagtgga 106-107 5' 78 (R353Q) ggcccacatg ccacccacta tcagtcacgacgttggatgcccacat gccacccactac 3' 1271 cc[a g]gggcacg tggtacctga 5' 79 cgggcatcgt cagctggggc cagggctgcg cggataacaatttcggatgcccgtca ggtaccacg 3' Factor XIII 111 caataactct aatgcagcgg aagatgacct 108-109 5' 80 (V34L) gcccacagtg gagcttcagg tcagtcacgacgttggatgcccaca gtggagcttcag 3' 161 gc[g t]tggtgcc ccggggcgtc 5' gctcataccttgcaggatgacg 81 aacctgcaag gtatgagcat accccccttc 3' HLA-H exon 2 361 ttgaagcttt gggctacgtg gatgaccagc 110-111 5' 82 (His63Asp) tgttcgtgtt ctatgat[c g]at tcagtcacgacgttggatgaccagct gttcgtgttc 3' 411 gagagtcgcc gtgtggagcc ccgaactcca 5' 83 tgggtttcca gtagaatttc tacatggagttcggggatgcacac ggcgactctc 3' HLA-H exon 4 1021 ggataacctt ggctgtaccc cctggggaag 112-113 5' 84 (Cys282Tyr) agcagagata tacgt[g a]ccag tcagtcacgacgttggatggggaag agcagagatatacgt 3' 1071 gtggagcacc caggcctgga tcagcccctc 5' 85 attgtgatct gggagccctc gagggactgatccaggatgggtg ctccac 3' Methylentetra- 761 tgaagcactt gaagga gaag gtatctgcgg 114-115 5' 86 hydro- gag[c t]cgattt catcacg tcagtcacgacgttggatggggaag folateredctase agcagagatatacgt 3' (MTHR) 811 cagcttttct ttgaggctga cacattcttc 5' 87 (Ala222Val) gagggactgatccaggatgggtg ctccac 3' P53 Exon4 12101 tccagatgaa gctcccagaa tgccagaggc 116-117 5' 88 Codon tgctcccc[g c]c gtggcccctg gatgaagctcccaggatgccaga 72 (Arg72Pro) ggc 3' 89 12151 caccagcagc tcctacaccg gcggcccctg 5' gccgccggtgtaggatgctgctgct tgc 3'
TABLE-US-00016 TABLE 6 The mass of Center Fragments for Ten Different SNP Typing by IIS Assay Gene LPL FV CETP FVII FXIII (.sup.Asn291.sup.ser) Prothrombin (.sup.Arg506.sup.Gln) (.sup.I405.sup.V) (.sup.R353.sup.Q) (.sup.V34) Genotype A G G A G A G A G A G T +strand 6213 6229 5845 5829 5677 5661 3388 3372 6128 6112 5058 5033 mass (Da) -strand 6129 6114 5949 5964 5472 5487 3437 3452 6174 6189 4916 4940 mass (Da) Gene Hlah2 Hlah4 MTHR(.sup.Ala222.sup.val) P53exon4(.sup.Arg72.sup.Pro) Genotype C G G A C T G C +strand mass 5889 5929 4392 4376 4400 4415 4586 4546 (Da) -strand mass - 5836 5796 4319 4334 4368 4352 4724 4764 (Da)
Example 10
Exemplary Use of Parental Medical History Parameter for Stratification of Healthy Datebase
[0376] A healthy database can be used to associate a disease state with a specific allele (SNP) that has been found to show a strong association between age and the allele, in particular the homozygous genotype. The method involves using the same healthy database used to identify the age dependent association, however stratification is by information given by the donors about common disorders from which their parents suffered (the donor's familial history of disease). There are three possible answers a donor could give about the health status of their parents: neither were affected, one was affected or both were affected. Only donors above a certain minimum age, depending on the disease, are utilized, as the donors parents must be old enough to have exhibited clinical disease phenotypes. The genotype frequency in each of these groups is determined and compared with each other. If there is an association of the marker in the donor to a disease the frequency of the heterozygous genotype will be increased. The frequency of the homozygous genotype should not increase, as it should be significantly underrepresented in the healthy population.
Example 11
Method and Device for Identifying a Biological Sample
Description
[0377] A method and device for identifying a biological sample is provided. Referring now to FIG. 24, an apparatus 10 for identifying a biological sample is disclosed. The apparatus 10 for identifying a biological sample generally comprises a mass spectrometer 15 communicating with a computing device 20. In an embodiment, the mass spectrometer can be a MALDI-TOF mass spectrometer manufactured by Bruker-Franzen Analytik GmbH; however, it will be appreciated that other mass spectrometers can be substituted. The computing device 20 is typically a general purpose computing device. It will be appreciated that the computing device could be alternatively configured, for example, it can be integrated with the mass spectrometer or could be part of a computer in a larger network system.
[0378] The apparatus 10 for identifying a biological sample can operate as an automated identification system having a robot 25 with a robotic arm 27 configured to deliver a sample plate 29 into a receiving area 31 of the mass spectrometer 15. In such a manner, the sample to be identified can be placed on the plate 29 and automatically received into the mass spectrometer 15. The biological sample is then processed in the mass spectrometer to generate data indicative of the mass of DNA fragments in the biological sample. This data can be sent directly to computing device 20, or can have some preprocessing or filtering performed within the mass spectrometer. In an embodiment, the mass spectrometer 15 transmits unprocessed and unfiltered mass spectrometry data to the computing device 20. It will be appreciated that the analysis in the computing device can be adjusted to accommodate preprocessing or filtering performed within the mass spectrometer.
[0379] Referring now to FIG. 25, a general method 35 for identifying a biological sample is shown. In method 35, data are received into a computing device from a test instrument in block 40. Generally the data are received in a raw, unprocessed and unfiltered form, but alternatively can have some form of filtering or processing applied. The test instrument of an exemplary embodiment is a mass spectrometer as described above. It will be appreciated that other test instruments could be substituted for the mass spectrometer.
[0380] The data generated by the test instrument, and in particular the mass spectrometer, includes information indicative of the identification of the biological sample. More specifically, the data are indicative of the DNA composition of the biological sample. Typically, mass spectrometry data gathered from DNA samples obtained from DNA amplification techniques are noisier than, for example, those from typical protein samples. This is due in part because protein samples are more readily prepared in more abundance, and protein samples are more easily ionizable as compared to DNA samples. Accordingly, conventional mass spectrometer data analysis techniques are generally ineffective for DNA analysis of a biological sample.
[0381] To improve the analysis capability so that DNA composition data can be more readily discerned, an embodiment uses wavelet technology for analyzing the DNA mass spectrometry data. Wavelets are an analytical tool for signal processing, numerical analysis, and mathematical modeling. Wavelet technology provides a basic expansion function which is applied to a data set. Using wavelet decomposition, the data set can be simultaneously analyzed in the time and frequency domains. Wavelet transformation is the technique of choice in the analysis of data that exhibit complicated time (mass) and frequency domain information, such as MALDI-TOF DNA data. Wavelet transforms as described herein have superior denoising properties as compared to conventional Fourier analysis techniques. Wavelet transformation has proven to be particularly effective in interpreting the inherently noisy MALDI-TOF spectra of DNA samples. In using wavelets, a "small wave" or "scaling function" is used to transform a data set into stages, with each stage representing a frequency component in the data set. Using wavelet transformation, mass spectrometry data can be processed, filtered, and analyzed with sufficient discrimination to be useful for identification of the DNA composition for a biological sample.
[0382] Referring again to FIG. 25, the data received in block 40 is denoised in block 45. The denoised data then has a baseline correction applied in block 50. A baseline correction is generally necessary as data coming from the test instrument, in particular a mass spectrometer instrument, has data arranged in a generally exponentially decaying manner. This generally exponential decaying arrangement is not due to the composition of the biological sample, but is a result of the physical properties and characteristics of the test instrument, and other chemicals involved in DNA sample preparation. Accordingly, baseline correction substantially corrects the data to remove a component of the data attributable to the test system, and sample preparation characteristics.
[0383] After denoising in block 45 and the baseline correction in block 50, a signal remains which is generally indicative of the composition of the biological sample. Due to the extraordinary discrimination required for analyzing the DNA composition of the biological sample, the composition is not readily apparent from the denoised and corrected signal. For example, although the signal can include peak areas, it is not yet clear whether these "putative" peaks actually represent a DNA composition, or whether the putative peaks are the result of a systemic or chemical aberration. Further, any call of the composition of the biological sample would have a probability of error which would be unacceptable for clinical or therapeutic purposes. In such critical situations, there needs to be a high degree of certainty that any call or identification of the sample is accurate. Therefore, additional data processing and interpretation is necessary before the sample can be accurately and confidently identified.
[0384] Since the quantity of data resulting from each mass spectrometry test is typically thousands of data points, and an automated system can be set to perform hundreds or even thousands of tests per hour, the quantity of mass spectrometry data generated is enormous. To facilitate efficient transmission and storage of the mass spectrometry data, block 55 shows that the denoised and baseline corrected data are compressed.
[0385] In one embodiment, the biological sample is selected and processed to have only a limited range of possible compositions. Accordingly, it is therefore known where peaks indicating composition should be located, if present. Taking advantage of knowing the location of these expected peaks, in block 60 the method 35 matches putative peaks in the processed signal to the location of the expected peaks. In such a manner, the probability of each putative peak in the data being an actual peak indicative of the composition of the biological sample can be determined. Once the probability of each peak is determined in block 60, then in block 65 the method 35 statistically determines the composition of the biological sample, and determines if confidence is high enough to calling a genotype.
[0386] Referring again to block 40, data are received from the test instrument, which can be a mass spectrometer. In a specific illustration, FIG. 26 shows an example of data from a mass spectrometer. The mass spectrometer data 70 generally comprises data points distributed along an x-axis 71 and a y-axis 72. The x-axis 71 represents the mass of particles detected, while the y-axis 72 represents a numerical concentration of the particles. As can be seen in FIG. 26, the mass spectrometry data 70 is generally exponentially decaying with data at the left end of the x-axis 73 generally decaying in an exponential manner toward data at the heavier end 74 of the x-axis 71. The general exponential presentation of the data is not indicative of the composition of the biological sample, but is more reflective of systematic error and characteristics. Further, as described above and illustrated in FIG. 26, considerable noise exists in the mass spectrometry DNA data 70.
[0387] Referring again to block 45, where the raw data received in block 40 is denoised, the denoising process will be described in more detail. As illustrated in FIG. 25, the denoising process generally entails 1) performing a wavelet transformation on the raw data to decompose the raw data into wavelet stage coefficients; 2) generating a noise profile from the highest stage of wavelet coefficients; and 3) applying a scaled noise profile to other stages in the wavelet transformation. Each step of the denoising process is further described below.
[0388] Referring now to FIG. 27, the wavelet transformation of the raw mass spectrometry data is generally diagramed. Using wavelet transformation techniques, the mass spectrometry data 70 is sequentially transformed into stages. In each stage, the data are represented in a high stage and a low stage, with the low stage acting as the input to the next sequential stage. For example, the mass spectrometry data 70 is transformed into stage 0 high data 82 and stage 0 low data 83. The stage 0 low data 83 is then used as an input to the next level transformation to generate stage 1 high data 84 and stage 1 low data 85. In a similar manner, the stage 1 low data 85 is used as an input to be transformed into stage 2 high data 86 and stage 2 low data 87. The transformation is continued until no more useful information can be derived by further wavelet transformation. For example, in the one embodiment a 24-point wavelet is used. More particularly a wavelet commonly referred to as the Daubechies 24 is used to decompose the raw data. It will be appreciated that other wavelets can be used for the wavelet transformation. Since each stage in a wavelet transformation has one-half the data points of the previous stage, the wavelet transformation can be continued until the stage n low data 89 has around 50 points. Accordingly, the stage n high 88 would contain about 100 data points. Since the exemplary wavelet is 24 points long, little data or information can be derived by continuing the wavelet transformation on a data set of around 50 points.
[0389] FIG. 28 shows an example of stage 0 high data 95. Since stage 0 high data 95 is generally indicative of the highest frequencies in the mass spectrometry data, stage 0 high data 95 will closely relate to the quantity of high frequency noise in the mass spectrometry data. In FIG. 29, an exponential fitting formula has been applied to the stage 0 high data 95 to generate a stage 0 noise profile 97. In particular, the exponential fitting formula is in the format A0+A1 EXP(-A2m). It will be appreciated that other exponential fitting formulae or other types of curve fits can be used.
[0390] Referring now to FIG. 30, noise profiles for the other high stages are determined. Since the later data points in each stage will likely be representative of the level of noise in each stage, only the later data points in each stage are used to generate a standard deviation figure that is representative of the noise content in that particular stage. More particularly, in generating the noise profile for each remaining stage, only the last five percent of the data points in each stage are analyzed to determined a standard deviation number. It will be appreciated that other numbers of points, or alternative methods could be used to generate such a standard deviation figure.
[0391] The standard deviation number for each stage is used with the stage 0 noise profile (the exponential curve) 97 to generate a scaled noise profile for each stage. For example, FIG. 30 shows that stage 1 high data 98 has stage 1 high data 103 with the last five percent of the data points represented by area 99. The points in area 99 are evaluated to determine a standard deviation number indicative of the noise content in stage 1 high data 103. The standard deviation number is then used with the stage 0 noise profile 97 to generate a stage 1 noise profile.
[0392] In a similar manner, stage 2 high 100 has stage 2 high data 104 with the last five percent of points represented by area 101. The data points in area 101 are then used to calculate a standard deviation number which is then used to scale the stage 0 noise profile 97 to generate a noise profile for stage 2 data. This same process is continued for each of the stage high data as shown by the stage n high 105. For stage n high 105, stage n high data 108 has the last five percent of data points indicated in area 106. The data points in area 106 are used to determine a standard deviation number for stage n. The stage n standard deviation number is then used with the stage 0 noise profile 97 to generate a noise profile for stage n. Accordingly, each of the high data stages has a noise profile.
[0393] FIG. 31 shows how the noise profile is applied to the data in each stage. Generally, the noise profile is used to generate a threshold which is applied to the data in each stage. Since the noise profile is already scaled to adjust for the noise content of each stage, calculating a threshold permits further adjustment to tune the quantity of noise removed. Wavelet coefficients below the threshold are ignored while those above the threshold are retained. Accordingly, the remaining data have a substantial portion of the noise content removed.
[0394] Due to the characteristics of wavelet transformation, the lower stages, such as stage 0 and 1, will have more noise content than the later stages such as stage 2 or stage n. Indeed, stage n low data are likely to have little noise at all. Therefore, in an embodiment, the noise profiles are applied more aggressively in the lower stages and less aggressively in the later stages. For example, FIG. 31 shows that stage 0 high threshold is determined by multiplying the stage 0 noise profile by a factor of four. In such a manner, significant numbers of data points in stage 0 high data 95 will be below the threshold and therefore eliminated. Stage 1 high threshold 112 is set at two times the noise profile for the stage 1 high data, and stage 2 high threshold 114 is set equal to the noise profile for stage 2 high. Following this geometric progression, stage n high threshold 116 is therefore determined by scaling the noise profile for each respective stage n high by a factor equal to (1/2n-2). It will be appreciated that other factors can be applied to scale the noise profile for each stage. For example, the noise profile can be scaled more or less aggressively to accommodate specific systemic characteristics or sample compositions. As indicated above, stage n low data does not have a noise profile applied as stage n low data 118 is assumed to have little or no noise content. After the scaled noise profiles have been applied to each high data stage, the mass spectrometry data 70 has been denoised and is ready for further processing. A wavelet transformation of the denoised signal results in the sparse data set 120 as shown in FIG. 31.
[0395] Referring again to FIG. 25, the mass spectrometry data received in block 40 has been denoised in block 45 and is now passed to block 50 for baseline correction. Before performing baseline correction, the artifacts introduced by the wavelet transformation procedure can be removed. Wavelet transformation results vary slightly depending upon which point of the wavelet is used as a starting point. For example, an exemplary embodiment uses the 24-point Daubechies-24 wavelet. By starting the transformation at the 0 point of the wavelet, a slightly different result will be obtained than if starting at points 1 or 2 of the wavelet. Therefore, the denoised data are transformed using every available possible starting point, with the results averaged to determine a final denoised and shifted signal. For example, FIG. 33 shows that the wavelet coefficient is applied 24 different times and then the results averaged to generate the final data set. It will be appreciated that other techniques can be used to accommodate the slight error introduced due to wavelet shifting.
[0396] The formula 125 is generally indicated in FIG. 33. Once the signal has been denoised and shifted, a denoised and shifted signal 130 is generated as shown in FIG. 58. FIG. 34 shows an example of the wavelet coefficient 135 data set from the denoised and shifted signal 130.
[0397] FIG. 36 shows that putative peak areas 145, 147, and 149 are located in the denoised and shifted signal 150. The putative peak areas are systematically identified by taking a moving average along the signal 150 and identifying sections of the signal 150 which exceed a threshold related to the moving average. It will be appreciated that other methods can be used to identify putative peak areas in the signal 150.
[0398] Putative peak areas 145, 147 and 149 are removed from the signal 150 to create a peak-free signal 155 as shown in FIG. 37. The peak-free signal 155 is further analyzed to identify remaining minimum values 157, and the remaining minimum values 157 are connected to generate the peak-free signal 155.
[0399] FIG. 38 shows a process of using the peak-free signal 155 to generate a baseline 170 as shown in FIG. 39. As shown in block 162, a wavelet transformation is performed on the peak-free signal 155. All the stages from the wavelet transformation are eliminated in block 164 except for the n low stage. The n low stage will generally indicate the lowest frequency component of the peak-free signal 155 and therefore will generally indicate the system exponential characteristics. Block 166 shows that a signal is reconstructed from the n low coefficients and the baseline signal 170 is generated in block 168.
[0400] FIG. 39 shows a denoised and shifted data signal 172 positioned adjacent a correction baseline 170. The baseline correction 170 is subtracted from the denoised and shifted signal 172 to generate a signal 175 having a baseline correction applied as shown in FIG. 40. Although such a denoised, shifted, and corrected signal is sufficient for most identification purposes, the putative peaks in signal 175 are not identifiable with sufficient accuracy or confidence to call the DNA composition of a biological sample.
[0401] Referring again to FIG. 25, the data from the baseline correction 50 is now compressed in block 55; the compression technique used in an exemplary embodiment is detailed in FIG. 41. In FIG. 41 the data in the baseline corrected data are presented in an array format 182 with x-axis points 183 having an associated data value 184. The x-axis is indexed by the non-zero wavelet coefficients, and the associated value is the value of the wavelet coefficient. In the illustrated data example in table 182, the maximum value 184 is indicated to be 1000. Although a particularly advantageous compression technique for mass spectrometry data is shown, it will be appreciated that other compression techniques can be used. The data also can be stored without compression.
[0402] In compressing the data according to one embodiment, an intermediate format 186 is generated. The intermediate format 186 generally comprises a real number having a whole number portion 188 and a decimal portion 190. The whole number portion is the x-axis point 183 while the decimal portion is the value data 184 divided by the maximum data value. For example, in the data 182 a data value "25" is indicated at x-axis point "100". The intermediate value for this data point would be "100.025".
[0403] From the intermediate compressed data 186 the final compressed data 195 is generated. The first point of the intermediate data file becomes the starting point for the compressed data. Thereafter each data point in the compressed data 195 is calculated as follows: the whole number portion (left of the decimal) is replaced by the difference between the current and the last whole number. The remainder (right of the decimal) remains intact. For example, the starting point of the compressed data 195 is shown to be the same as the intermediate data point which is "100.025". The comparison between the first intermediate data point "100.025" and the second intermediate data point "150.220" is "50.220". Therefore, "50.220" becomes the second point of the compressed data 195. In a similar manner, the second intermediate point is "150.220" and the third intermediate data point is "500.0001". Therefore, the third compressed data becomes "350.000". The calculation for determining compressed data points is continued until the entire array of data points is converted to a single array of real numbers.
[0404] FIG. 42 generally describes the method of compressing mass spectrometry data, showing that the data file in block 201 is presented as an array of coefficients in block 202. The data starting point and maximum is determined as shown in block 203, and the intermediate real numbers are calculated in block 204 as described above. With the intermediate data points generated, the compressed data are generated in block 205. The described compression method is highly advantageous and efficient for compressing data sets such as a processed data set from a mass spectrometry instrument. The method is particularly useful for data, such as mass spectrometry data, that uses large numbers and has been processed to have occasional lengthy gaps in x-axis data. Accordingly, an x-y data array for processed mass spectrometry data can be stored with an effective compression rate of 10ร or more. Although the compression technique is applied to mass spectrometry data, it will be appreciated that the method can also advantageously be applied to other data sets.
[0405] Referring again to FIG. 25, peak heights are now determined in block 60. The first step in determining peak height is illustrated in FIG. 43 where the signal 210 is shifted left or right to correspond with the position of expected peaks. As the set of possible compositions in the biological sample is known before the mass spectrometry data are generated, the possible positioning of expected peaks is already known. These possible peaks are referred to as expected peaks, such as expected peaks 212, 214, and 216. Due to calibration or other errors in the test instrument data, the entire signal can be shifted left or right from its actual position, therefore, putative peaks located in the signal, such as putative peaks 218, 222, and 224 can be compared to the expected peaks 212, 214, and 216, respectively. The entire signal is then shifted such that the putative peaks align more closely with the expected peaks.
[0406] Once the putative peaks have been shifted to match expected peaks, the strongest putative peak is identified in FIG. 44. In one embodiment, the strongest peak is calculated as a combination of analyzing the overall peak height and area beneath the peak. For example, a moderately high but wide peak would be stronger than a very high peak that is extremely narrow. With the strongest putative peak identified, such as putative peak 225, a Gaussian 228 curve is fit to the peak 225. Once the Gaussian is fit, the width (W) of the Gaussian is determined and will be used as the peak width for future calculations.
[0407] As generally addressed above, the denoised, shifted, and baseline-corrected signal is not sufficiently processed for confidently calling the DNA composition of the biological sample. For example, although the baseline has generally been removed, there are still residual baseline effects present. These residual baseline effects are therefore removed to increase the accuracy and confidence in making identifications.
[0408] To remove the residual baseline effects, FIG. 45 shows that the putative peaks 218, 222, and 224 are removed from the baseline corrected signal. The peaks are removed by identifying a center line 230, 232, and 234 of the putative peaks 218, 222, and 224, respectively and removing an area to the left and to the right of the identified center line. For each putative peak, an area equal to twice the width (W) of the Gaussian is removed from the left of the center line, while an area equivalent to 50 daltons is removed from the right of the center line. It has been found that the area representing 50 daltons is adequate to sufficiently remove the effect of salt adducts which can be associated with an actual peak. Such adducts appear to the right of an actual peak and are a natural effect from the chemistry involved in acquiring a mass spectrum. Although a 50 Dalton buffer has been selected, it will be appreciated that other ranges or methods can be used to reduce or eliminate adduct effects.
[0409] The peaks are removed and remaining minima 247 located as shown in FIG. 46 with the minima 247 connected to create signal 245. A quartic polynomial is applied to signal 245 to generate a residual baseline 250 as shown in FIG. 47. The residual baseline 250 is subtracted from the signal 225 to generate the final signal 255 as indicated in FIG. 48. Although the residual baseline is the result of a quartic fit to signal 245, it will be appreciated that other techniques can be used to smooth or fit the residual baseline.
[0410] To determine peak height, as shown in FIG. 49, a Gaussian such as Gaussian 266, 268, and 270 is fit to each of the peaks, such as peaks 260, 262, and 264, respectively. Accordingly, the height of the Gaussian is determined as height 272, 274, and 276. Once the height of each Gaussian peak is determined, then the method of identifying a biological compound 35 can move into the genotyping phase 65 as shown in FIG. 25.
[0411] An indication of the confidence that each putative peak is an actual peak can be discerned by calculating a signal-to-noise ratio for each putative peak. Accordingly, putative peaks with a strong signal-to-noise ratio are generally more likely to be an actual peak than a putative peak with a lower signal-to-noise ratio. As described above and shown in FIG. 50, the height of each peak, such as height 272, 274, and 276, is determined for each peak, with the height being an indicator of signal strength for each peak. The noise profile, such as noise profile 97, is extrapolated into noise profile 280 across the identified peaks. At the center line of each of the peaks, a noise value is determined, such as noise value 282, 283, and 284. With a signal values and a noise values generated, signal-to-noise ratios can be calculated for each peak. For example, the signal-to-noise ratio for the first peak in FIG. 50 would be calculated as signal value 272 divided by noise value 282, and in a similar manner the signal-to-noise ratio of the middle peak in FIG. 50 would be determined as signal 274 divided by noise value 283.
[0412] Although the signal-to-noise ratio is generally a useful indicator of the presence of an actual peak, further processing has been found to increase the confidence by which a sample can be identified. For example, the signal-to-noise ratio for each peak in the exemplary embodiment can be adjusted by the goodness of fit between a Gaussian and each putative peak. It is a characteristic of a mass spectrometer that sample material is detected in a manner that generally complies with a normal distribution. Accordingly, greater confidence will be associated with a putative signal having a Gaussian shape than a signal that has a less normal distribution. The error resulting from having a non-Gaussian shape can be referred to as a "residual error".
[0413] Referring to FIG. 51, a residual error is calculated by taking a root mean square calculation between the Gaussian 293 and the putative peak 290 in the data signal. The calculation is performed on data within one width on either side of a center line of the Gaussian. The residual error is calculated as:
[(G-R)2/N],
[0414] where G is the Gaussian signal value, R is the putative peak value, and N is the number of points from -W to +W. The calculated residual error is used to generate an adjusted signal-to-noise ratio, as described below.
[0415] An adjusted signal noise ratio is calculated for each putative peak using the formula (S/N)*EXP.sup.(-0.1*R), where S/N is the signal-to-noise ratio, and R is the residual error determined above. Although the exemplary embodiment calculates an adjusted signal-to-noise ratio using a residual error for each peak, it will be appreciated that other techniques can be used to account for the goodness of fit between the Gaussian and the actual signal.
[0416] Referring now to FIG. 52, a probability is determined that a putative peak is an actual peak. In making the determination of peak probability, a probability profile 300 is generated where the adjusted signal-to-noise ratio is the x-axis and the probability is the y-axis. Probability is necessarily in the range between a 0% probability and a 100% probability, which is indicated as 1. Generally, the higher the adjusted signal-to-noise ratio, the greater the confidence that a putative peak is an actual peak.
[0417] At some target value for the adjusted signal-to-noise, it has been found that the probability is 100% that the putative peak is an actual peak and can confidently be used to identify the DNA composition of a biological sample. The target value of adjusted signal-to-noise ratio where the probability is assumed to be 100% is a variable parameter which is to be set according to application specific criteria. For example, the target signal-to-noise ratio will be adjusted depending upon trial experience, sample characteristics, and the acceptable error tolerance in the overall system. More specifically, for situations requiring a conservative approach where error cannot be tolerated, the target adjusted signal-to-noise ratio can be set to, for example, 10 and higher. Accordingly, 100% probability will not be assigned to a peak unless the adjusted signal-to-noise ratio is 10 or over.
[0418] In other situations, a more aggressive approach can be taken as sample data is more pronounced or the risk of error can be reduced. In such a situation, the system can be set to assume a 100% probability with a 5 or greater target signal-to-noise ratio. Of course, an intermediate signal-to-noise ratio target figure can be selected, such as 7, when a moderate risk of error can be assumed. Once the target adjusted signal-to-noise ratio is set for the method, then for any adjusted signal-to-noise ratio ฮฑ probability can be determined that a putative peak is an actual peak.
[0419] Due to the chemistry involved in performing an identification test, especially a mass spectrometry test of a sample prepared by DNA amplifications, the allelic ratio between the signal strength of the highest peak and the signal strength of the second (or third and so on) highest peak should fall within an expected ratio. If the allelic ratio falls outside of normal guidelines, the exemplary embodiment imposes an allelic ratio penalty to the probability. For example, FIG. 53 shows an allelic penalty 315 which has an x-axis 317 that is the ratio between the signal strength of the second highest peak divided by signal strength of the highest peak. The y-axis 319 assigns a penalty between 0 and 1 depending on the determined allelic ratio. In the exemplary embodiment, it is assumed that allelic ratios over 30% are within the expected range and therefore no penalty is applied. Between a ratio of 10% and 30%, the penalty is linearly increased until at allelic ratios below 10% it is assumed the second-highest peak is not real. For allelic ratios between 10% and 30%, the allelic penalty chart 315 is used to determine a penalty 319, which is multiplied by the peak probability determined in FIG. 52 to determine a final peak probability. Although the exemplary embodiment incorporates an allelic ratio penalty to account for a possible chemistry error, it will be appreciated that other techniques can be used. Similar treatment will be applied to the other peaks.
[0420] With the peak probability of each peak determined, the statistical probability for various composition components can be determined, as an example, in order to determine the probability of each of three possible combinations of two peaks, --peak G, peak C and combinations GG, CC and GC. FIG. 54 shows an example where a most probable peak 325 is determined to have a final peak probability of 90%. Peak 325 is positioned such that it represents a G component in the biological sample. Accordingly, it can be maintained that there is a 90% probability that G exists in the biological sample. Also in the example shown in FIG. 54, the second highest probability is peak 330 which has a peak probability of 20%. Peak 330 is at a position associated with a C composition. Accordingly, it can be maintained that there is a 20% probability that C exists in the biological sample.
[0421] With the probability of G existing (90%) and the probability of C existing (20%) as a starting point, the probability of combinations of G and C existing can be calculated. For example, FIG. 54 indicates that the probability of GG existing 329 is calculated as 72%. This is calculated as the probability of GG is equal to the probability of G existing (90%) multiplied by the probability of C not existing (100%-20%). So if the probability of G existing is 90% and the probability of C not existing is 80%, the probability of GG is 72%.
[0422] In a similar manner, the probability of CC existing is equivalent to the probability of C existing (20%) multiplied by the probability of G not existing (100%-90%). As shown in FIG. 54, the probability of C existing is 20% while the probability of G not existing is 10%, so therefore the probability of CC is only 2%. Finally, the probability of GC existing is equal to the probability of G existing (90%) multiplied by the probability of C existing (20%). So if the probability of G existing is 90% and the probability of C existing is 20%, the probability of GC existing is 18%. In summary form, then, the probability of the composition of the biological sample is:
TABLE-US-00017 probability of GG: 72%; probability of GC: 18%; and probability of CC: 2%.
[0423] Once the probabilities of each of the possible combinations has been determined, FIG. 55 is used to decide whether or not sufficient confidence exists to call the genotype. FIG. 55 shows a call chart 335 which has an x-axis 337 which is the ratio of the highest combination probability to the second highest combination probability. The y-axis 339 simply indicates whether the ratio is sufficiently high to justify calling the genotype. The value of the ratio can be indicated by M 340. The value of M is set depending upon trial data, sample composition, and the ability to accept error. For example, the value M can be set relatively high, such as to a value 4 so that the highest probability must be at least four times greater than the second highest probability before confidence is established to call a genotype. If a certain level of error can be acceptable, the value of M can be set to a more aggressive value, such as to 3, so that the ratio between the highest and second highest probabilities needs to be only a ratio of 3 or higher. Of course, moderate value can be selected for M when a moderate risk can be accepted. Using the example of FIG. 54, where the probability of GG was 72% and the probability of GC was 18%, the ratio between 72% and 18% is 4.0, therefore, whether M is set to 3, 3.5, or 4, the system would call the genotype as GG. Although the exemplary embodiment uses a ratio between the two highest peak probabilities to determine if a genotype confidently can be called, it will be appreciated that other methods can be substituted. It will also be appreciated that the above techniques can be used for calculating probabilities and choosing genotypes (or more general DNA patterns) containing of combinations of more than two peaks.
[0424] Referring now to FIG. 56, a flow chart is shown generally defining the process of statistically calling genotype described above. In FIG. 56 block 402 shows that the height of each peak is determined and that in block 404 a noise profile is extrapolated for each peak. The signal is determined from the height of each peak in block 406 and the noise for each peak is determined using the noise profile in block 408. In block 410, the signal-to-noise ratio is calculated for each peak. To account for a non-Gaussian peak shape, a residual error is determined in block 412 and an adjusted signal-to-noise ratio is calculated in block 414. Block 416 shows that a probability profile is developed, with the probability of each peak existing found in block 418. An allelic penalty can be applied in block 420, with the allelic penalty applied to the adjusted peak probability in block 422. The probability of each combination of components is calculated in block 424 with the ratio between the two highest probabilities being determined in block 426. If the ratio of probabilities exceeds a threshold value then the genotype is called in block 428.
[0425] In another embodiment, the computing device 20 (FIG. 24) supports "standardless" genotyping by identifying data peaks that contain putative SNPs. Standardless genotyping is used, for example, where insufficient information is known about the samples to determine a distribution of expected peak locations, against which an allelic penalty as described above can be reliably calculated. This permits the computing device to be used for identification of peaks that contain putative SNPs from data generated by any assay that fragments a targeted DNA molecule. For such standardless genotyping, peaks that are associated with an area under the data curve that deviates significantly from the typical area of other peaks in the data spectrum are identified and their corresponding mass (location along the x-axis) is determined.
[0426] More particularly, peaks that deviate significantly from the average area of other peaks in the data are identified, and the expected allelic ratio between data peaks is defined in terms of the ratio of the area under the data peaks. Theoretically, where each genetic loci has the same molar concentration of analyte, the area under each corresponding peak should be the same, thus producing a 1.0 ratio of the peak area between any two peaks. In accordance with the methods provided herein, peaks having a smaller ratio relative to the other peaks in the data will not be recognized as peaks. More particularly, peaks having an area ratio smaller than 30% relative to a nominal value for peak area will be assigned an allelic penalty. The mass of the remaining peaks (their location along the x-axis of the data) will be determined based on oligonucleotide standards.
[0427] FIG. 57 shows a flow diagram representation of the processing by the computing device 20 (FIG. 24) when performing standardless genotyping. In the first operation, represented by the flow diagram box numbered 502, the computing device receives data from the mass spectrometer. Next, the height of each putative peak in the data sample is determined, as indicated by the block 504. After the height of each peak in the mass spectrometer data is determined, a de-noise process 505 is performed, beginning with an extrapolation of the noise profile (block 506), followed by finding the noise of each peak (block 508) and calculating the signal to noise ratio for each data sample (block 510). Each of these operations can be performed in accordance with the description above for denoise operations 45 of FIG. 25. Other suitable denoise operations will occur to those skilled in the art.
[0428] The next operation is to find the residual error associated with each data point. This is represented by the block 512 in FIG. 57. The next step, block 514, involves calculating an adjusted signal to noise ratio for each identified peak. A probability profile is developed next (block 516), followed by a determination of the peak probabilities at block 518. In an exemplary embodiment, the denoise operations of FIG. 57, comprising block 502 to block 518, comprise the corresponding operations described above in conjunction with FIG. 56 for block 402 through block 418, respectively.
[0429] The next action for the standardless genotype processing is to determine an allelic penalty for each peak, indicated by the block 524. As noted above, the standardless genotype processing of FIG. 57 determines an allelic penalty by comparing area under the peaks. Therefore, rather than compare signal strength ratios to determine an allelic penalty, such as described above for FIG. 53, the standardless processing determines the area under each of the identified peaks and compares the ratio of those areas. Determining the area under each peak can be computed using conventional numerical analysis techniques for calculating the area under a curve for experimental data.
[0430] Thus, the allelic penalty is assigned in accordance with FIG. 58, which shows that no penalty is assigned to peaks having a peak area relative to an expected average area value that is greater than 0.30 (30%). The allelic penalty is applied to the peak probability value, which can be determined according to the process such as described in FIG. 52. It should be apparent from FIG. 58 that the allelic penalty imposed for peaks below a ratio of 30% is that such peaks will be removed from further measurement and processing. Other penalty schemes, however, can be imposed in accordance with knowledge about the data being processed, as determined by those skilled in the art.
[0431] After the allelic penalty has been determined and applied, the standardless genotype processing compares the location of the remaining putative peaks to oligonucleotide standards to determine corresponding masses in the processing for block 524. For standardless genotype data, the processing of the block 524 is performed to determine mass and genotype, rather than performing the operations corresponding to block 424, 426, and 428 of FIG. 33. Techniques for performing such comparisons and determining mass will be known to those skilled in the art.
[0432] In another embodiment, the computing device 20 (FIG. 24) permits the detection and determination of the mass (location along the x-axis of the data) of the sense and antisense strand of fragments generated in the assay. If desired, the computing device can also detect and determine the quantity (area under each peak) of the respective sense and antisense strands, using a similar technique to that described above for standardless genotype processing. The data generated for each type of strand can then be combined to achieve a data redundancy and to thereby increase the confidence level of the determined genotype. This technique obviates primer peaks that are often observed in data from other diagnostic methods, thereby permitting a higher level of multiplexing. In addition, when quantitation is used in pooling experiments, the ratio of the measured peak areas is more reliably calculated than the peak identifying technique, due to data redundancy.
[0433] FIG. 23 is a flow diagram that illustrates the processing implemented by the computing device 20 to perform sense and antisense processing. In the first operation, represented by the flow diagram box numbered 602, the computing device receives data from the mass spectrometer. This data will include data for the sense strand and antisense strand of assay fragments. Next, the height of each putative peak in the data sample is determined, as indicated by the block 604. After the height of each peak in the mass spectrometer data is determined, a de-noise process 605 is performed, beginning with an operation that extrapolates the noise profile (block 606), followed by finding the noise of each peak (block 608) and calculating the signal to noise ratio for each data sample (block 610). Each of these operations can be performed in accordance with the description above for the denoise operations 45 of FIG. 25. Other suitable denoise operations will occur to those skilled in the art. The next operation is to find the residual error associated with each data point. This is represented by the block 612 in FIG. 36.
[0434] After the residual error for the data of the sense strand and antisense strand has been performed, processing to identify the genotypes will be performed for the sense strand and also for the antisense strand. Therefore, FIG. 23 shows that processing includes sense strand processing (block 630) and antisense strand processing (block 640). Each block 630, 640 includes processing that corresponds to adjusting the signal to noise ratio, developing a probability profile, determining an allelic penalty, adjusting the peak probability by the allelic penalty, calculating genotype probabilities, and testing genotype probability ratios, such as described above in conjunction with blocks 414 through 426 of FIG. 56. The processing of each block 630, 640 can, if desired, include standardless processing operations such as described above in conjunction with FIG. 57. The standardless processing can be included in place of or in addition to the processing operations of FIG. 56.
[0435] After the genotype probability processing is completed, the data from the sense strand and antisense strand processing is combined and compared to expected database values to obtain the benefits of data redundancy as between the sense strand and antisense strand. Those skilled in the art will understand techniques to take advantage of known data redundancies between a sense strand and antisense strand of assay fragments. This processing is represented by the block 650. After the data from the two strands is combined for processing, the genotype processing is performed (block 660) and the genotype is identified.
[0436] Since modifications will be apparent to those of skill in this art, it is intended that this invention be limited only by the scope of the appended claims.
Sequence CWU
1
1
1211361DNAArtificial SequenceDescription of Artificial Sequence Synthetic
polynucleotide 1ctgaggacct ggtcctctga ctgctctttt cacccatcta
cagtccccct tgccgtccca 60agcaatggat gatttgatgc tgtccccgga cgatattgaa
caatggttca ctgaagaccc 120aggtccagat gaagctccca gaatgccaga ggctgctccc
cgcgtggccc ctgcaccagc 180agctcctaca ccggcggccc ctgcaccagc cccctcctgg
cccctgtcat cttctgtccc 240ttcccagaaa acctaccagg gcagctacgg tttccgtctg
ggcttcttgc attctgggac 300agccaagtct gtgacttgca cggtcagttg ccctgagggg
ctggcttcca tgagacttca 360a
361244DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 2cccagtcacg acgttgtaaa
acgctgagga cctggtcctc tgac 44342DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
3agcggataac aatttcacac aggttgaagt ctcatggaag cc
42417DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 4gccagaggct gctcccc
17517DNAArtificial SequenceDescription of Artificial Sequence
Synthetic probe 5gccagaggct gctcccc
17619DNAArtificial SequenceDescription of Artificial
Sequence Synthetic probe 6gccagaggct gctccccgc
19718DNAArtificial SequenceDescription of
Artificial Sequence Synthetic probe 7gccagaggct gctccccc
188161DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
8gtccgtcaga acccatgcgg cagcaaggcc tgccgccgcc tcttcggccc agtggacagc
60gagcagctga gccgcgactg tgatgcgcta atggcgggct gcatccagga ggcccgtgag
120cgatggaact tcgactttgt caccgagaca ccactggagg g
161943DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 9cccagtcacg acgttgtaaa acggtccgtc agaacccatg cgg
431044DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 10agcggataac aatttcacac aggctccagt ggtgtctcgg tgac
441115DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 11cagcgagcag ctgag
151215DNAArtificial SequenceDescription of
Artificial Sequence Synthetic probe 12cagcgagcag ctgag
151316DNAArtificial
SequenceDescription of Artificial Sequence Synthetic probe
13cagcgagcag ctgagc
161417DNAArtificial SequenceDescription of Artificial Sequence Synthetic
probe 14cagcgagcag ctgagac
1715205DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 15gcgctccatt catctcttca tcgactctct
gttgaatgaa gaaaatccaa gtaaggccta 60caggtgcagt tccaaggaag cctttgagaa
agggctctgc ttgagttgta gaaagaaccg 120ctgcaacaat ctgggctatg agatcaataa
agtcagagcc aaaagaagca gcaaaatgta 180cctgaagact cgttctcaga tgccc
2051642DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
16cccagtcacg acgttgtaaa acggcgctcc attcatctct tc
421742DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 17agcggataac aatttcacac agggggcatc tgagaacgag tc
421820DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 18caatctgggc tatgagatca
201920DNAArtificial SequenceDescription of Artificial
Sequence Synthetic probe 19caatctgggc tatgagatca
202021DNAArtificial SequenceDescription of
Artificial Sequence Synthetic probe 20caatctgggc tatgagatca a
212122DNAArtificial
SequenceDescription of Artificial Sequence Synthetic probe
21caatctgggc tatgagatca gt
2222120DNAArtificial SequenceDescription of Artificial Sequence Synthetic
polynucleotide 22gtgccggcta ctcggatggc agcaaggact cctgcaaggg
ggacagtgga ggcccacatg 60ccacccacta ccggggcacg tggtacctga cgggcatcgt
cagctggggc cagggctgcg 12023120DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 23gtgccggcta ctcggatggc
agcaaggact cctgcaaggg ggacagtgga ggcccacatg 60ccacccacta ccagggcacg
tggtacctga cgggcatcgt cagctggggc cagggctgcg 1202442DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
24cccagtcacg acgttgtaaa acgatggcag caaggactcc tg
422518DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 25cacatgccac ccactacc
182643DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 26agcggataac aatttcacac aggtgacgat gcccgtcagg tac
432715DNAArtificial SequenceDescription of Artificial
Sequence Synthetic probe 27atgccaccca ctacc
152819DNAArtificial SequenceDescription of
Artificial Sequence Synthetic probe 28cacatgccac ccactaccg
192920DNAArtificial
SequenceDescription of Artificial Sequence Synthetic probe
29cacatgccac ccactaccag
203023DNAArtificial SequenceDescription of Artificial Sequence Synthetic
probe 30agcggataac aatttcacac agg
23312363DNAHomo sapiensCDS(138)..(2123) 31gcggcttgtt gataatatgg
cggctggagc tgcctgggca tcccgaggag gcggtggggc 60ccactcccgg aagaagggtc
ccttttcgcg ctagtgcagc ggcccctctg gacccggaag 120tccgggccgg ttgctga atg
agg gga gcc ggg ccc tcc ccg cgc cag tcc 170 Met
Arg Gly Ala Gly Pro Ser Pro Arg Gln Ser 1
5 10 ccc cgc acc ctc cgt ccc
gac ccg ggc ccc gcc atg tcc ttc ttc cgg 218Pro Arg Thr Leu Arg Pro
Asp Pro Gly Pro Ala Met Ser Phe Phe Arg 15
20 25 cgg aaa gtg aaa ggc aaa gaa
caa gag aag acc tca gat gtg aag tcc 266Arg Lys Val Lys Gly Lys Glu
Gln Glu Lys Thr Ser Asp Val Lys Ser 30
35 40 att aaa gct tca ata tcc gta
cat tcc cca caa aaa agc act aaa aat 314Ile Lys Ala Ser Ile Ser Val
His Ser Pro Gln Lys Ser Thr Lys Asn 45 50
55 cat gcc ttg ctg gag gct gca gga
cca agt cat gtt gca atc aat gcc 362His Ala Leu Leu Glu Ala Ala Gly
Pro Ser His Val Ala Ile Asn Ala 60 65
70 75 att tct gcc aac atg gac tcc ttt tca
agt agc agg aca gcc aca ctt 410Ile Ser Ala Asn Met Asp Ser Phe Ser
Ser Ser Arg Thr Ala Thr Leu 80
85 90 aag aag cag cca agc cac atg gag gct
gct cat ttt ggt gac ctg ggc 458Lys Lys Gln Pro Ser His Met Glu Ala
Ala His Phe Gly Asp Leu Gly 95 100
105 aga tct tgt ctg gac tac cag act caa gag
acc aaa tca agc ctt tct 506Arg Ser Cys Leu Asp Tyr Gln Thr Gln Glu
Thr Lys Ser Ser Leu Ser 110 115
120 aag acc ctt gaa caa gtc ttg cac gac act att
gtc ctc cct tac ttc 554Lys Thr Leu Glu Gln Val Leu His Asp Thr Ile
Val Leu Pro Tyr Phe 125 130
135 att caa ttc atg gaa ctt cgg cga atg gag cat
ttg gtg aaa ttt tgg 602Ile Gln Phe Met Glu Leu Arg Arg Met Glu His
Leu Val Lys Phe Trp 140 145 150
155 tta gag gct gaa agt ttt cat tca aca act tgg tcg
cga ata aga gca 650Leu Glu Ala Glu Ser Phe His Ser Thr Thr Trp Ser
Arg Ile Arg Ala 160 165
170 cac agt cta aac aca atg aag cag agc tca ctg gct gag
cct gtc tct 698His Ser Leu Asn Thr Met Lys Gln Ser Ser Leu Ala Glu
Pro Val Ser 175 180
185 cca tct aaa aag cat gaa act aca gcg tct ttt tta act
gat tct ctt 746Pro Ser Lys Lys His Glu Thr Thr Ala Ser Phe Leu Thr
Asp Ser Leu 190 195 200
gat aag aga ttg gag gat tct ggc tca gca cag ttg ttt atg
act cat 794Asp Lys Arg Leu Glu Asp Ser Gly Ser Ala Gln Leu Phe Met
Thr His 205 210 215
tca gaa gga att gac ctg aat aat aga act aac agc act cag aat
cac 842Ser Glu Gly Ile Asp Leu Asn Asn Arg Thr Asn Ser Thr Gln Asn
His 220 225 230
235 ttg ctg ctt tcc cag gaa tgt gac agt gcc cat tct ctc cgt ctt
gaa 890Leu Leu Leu Ser Gln Glu Cys Asp Ser Ala His Ser Leu Arg Leu
Glu 240 245 250
atg gcc aga gca gga act cac caa gtt tcc atg gaa acc caa gaa tct
938Met Ala Arg Ala Gly Thr His Gln Val Ser Met Glu Thr Gln Glu Ser
255 260 265
tcc tct aca ctt aca gta gcc agt aga aat agt ccc gct tct cca cta
986Ser Ser Thr Leu Thr Val Ala Ser Arg Asn Ser Pro Ala Ser Pro Leu
270 275 280
aaa gaa ttg tca gga aaa cta atg aaa agt ata gaa caa gat gca gtg
1034Lys Glu Leu Ser Gly Lys Leu Met Lys Ser Ile Glu Gln Asp Ala Val
285 290 295
aat act ttt acc aaa tat ata tct cca gat gct gct aaa cca ata cca
1082Asn Thr Phe Thr Lys Tyr Ile Ser Pro Asp Ala Ala Lys Pro Ile Pro
300 305 310 315
att aca gaa gca atg aga aat gac atc ata gca agg att tgt gga gaa
1130Ile Thr Glu Ala Met Arg Asn Asp Ile Ile Ala Arg Ile Cys Gly Glu
320 325 330
gat gga cag gtg gat ccc aac tgt ttc gtt ttg gca cag tcc ata gtc
1178Asp Gly Gln Val Asp Pro Asn Cys Phe Val Leu Ala Gln Ser Ile Val
335 340 345
ttt agt gca atg gag caa gag cac ttt agt gag ttt ctg cga agt cac
1226Phe Ser Ala Met Glu Gln Glu His Phe Ser Glu Phe Leu Arg Ser His
350 355 360
cat ttc tgt aaa tac cag att gaa gtg ctg acc agt gga act gtt tac
1274His Phe Cys Lys Tyr Gln Ile Glu Val Leu Thr Ser Gly Thr Val Tyr
365 370 375
ctg gct gac att ctc ttc tgt gag tca gcc ctc ttt tat ttc tct gag
1322Leu Ala Asp Ile Leu Phe Cys Glu Ser Ala Leu Phe Tyr Phe Ser Glu
380 385 390 395
tac atg gaa aaa gag gat gca gtg aat atc tta caa ttc tgg ttg gca
1370Tyr Met Glu Lys Glu Asp Ala Val Asn Ile Leu Gln Phe Trp Leu Ala
400 405 410
gca gat aac ttc cag tct cag ctt gct gcc aaa aag ggg caa tat gat
1418Ala Asp Asn Phe Gln Ser Gln Leu Ala Ala Lys Lys Gly Gln Tyr Asp
415 420 425
gga cag gag gca cag aat gat gcc atg att tta tat gac aag tac ttc
1466Gly Gln Glu Ala Gln Asn Asp Ala Met Ile Leu Tyr Asp Lys Tyr Phe
430 435 440
tcc ctc caa gcc aca cat cct ctt gga ttt gat gat gtt gta cga tta
1514Ser Leu Gln Ala Thr His Pro Leu Gly Phe Asp Asp Val Val Arg Leu
445 450 455
gaa att gaa tcc aat atc tgc agg gaa ggt ggg cca ctc ccc aac tgt
1562Glu Ile Glu Ser Asn Ile Cys Arg Glu Gly Gly Pro Leu Pro Asn Cys
460 465 470 475
ttc aca act cca tta cgt cag gcc tgg aca acc atg gag aag gtc ttt
1610Phe Thr Thr Pro Leu Arg Gln Ala Trp Thr Thr Met Glu Lys Val Phe
480 485 490
ttg cct ggc ttt ctg tcc agc aat ctt tat tat aaa tat ttg aat gat
1658Leu Pro Gly Phe Leu Ser Ser Asn Leu Tyr Tyr Lys Tyr Leu Asn Asp
495 500 505
ctc atc cat tcg gtt cga gga gat gaa ttt ctg ggc ggg aac gtg tcg
1706Leu Ile His Ser Val Arg Gly Asp Glu Phe Leu Gly Gly Asn Val Ser
510 515 520
ccg act gct cct ggc tct gtt ggc cct cct gat gag tct cac cca ggg
1754Pro Thr Ala Pro Gly Ser Val Gly Pro Pro Asp Glu Ser His Pro Gly
525 530 535
agt tct gac agc tct gcg tct cag tcc agt gtg aaa aaa gcc agt att
1802Ser Ser Asp Ser Ser Ala Ser Gln Ser Ser Val Lys Lys Ala Ser Ile
540 545 550 555
aaa ata ctg aaa aat ttt gat gaa gcg ata att gtg gat gcg gca agt
1850Lys Ile Leu Lys Asn Phe Asp Glu Ala Ile Ile Val Asp Ala Ala Ser
560 565 570
ctg gat cca gaa tct tta tat caa cgg aca tat gcc ggg aag atg aca
1898Leu Asp Pro Glu Ser Leu Tyr Gln Arg Thr Tyr Ala Gly Lys Met Thr
575 580 585
ttt gga aga gtg agt gac ttg ggg caa ttc atc cgg gaa tct gag cct
1946Phe Gly Arg Val Ser Asp Leu Gly Gln Phe Ile Arg Glu Ser Glu Pro
590 595 600
gaa cct gat gta agg aaa tca aaa gga tcc atg ttc tca caa gct atg
1994Glu Pro Asp Val Arg Lys Ser Lys Gly Ser Met Phe Ser Gln Ala Met
605 610 615
aag aaa tgg gtg caa gga aat act gat gag gcc cag gaa gag cta gct
2042Lys Lys Trp Val Gln Gly Asn Thr Asp Glu Ala Gln Glu Glu Leu Ala
620 625 630 635
tgg aag att gct aaa atg ata gtc agt gac att atg cag cag gct cag
2090Trp Lys Ile Ala Lys Met Ile Val Ser Asp Ile Met Gln Gln Ala Gln
640 645 650
tat gat caa ccg tta gag aaa tct aca aag tta tgactcaaaa cttgagataa
2143Tyr Asp Gln Pro Leu Glu Lys Ser Thr Lys Leu
655 660
aggaaatctg cttgtgaaaa ataagagaac ttttttccct tggttggatt cttcaacaca
2203gccaatgaaa acagcactat atttctgatc tgtcactgtt gtttccaggg agagaatggg
2263gagacaatcc taggacttcc accctaatgc agttacctgt agggcataat tggatggcac
2323atgatgtttc acacagtgag gagtctttaa aggttaccaa
236332662PRTHomo sapiens 32Met Arg Gly Ala Gly Pro Ser Pro Arg Gln Ser
Pro Arg Thr Leu Arg 1 5 10
15 Pro Asp Pro Gly Pro Ala Met Ser Phe Phe Arg Arg Lys Val Lys Gly
20 25 30 Lys Glu
Gln Glu Lys Thr Ser Asp Val Lys Ser Ile Lys Ala Ser Ile 35
40 45 Ser Val His Ser Pro Gln Lys
Ser Thr Lys Asn His Ala Leu Leu Glu 50 55
60 Ala Ala Gly Pro Ser His Val Ala Ile Asn Ala Ile
Ser Ala Asn Met 65 70 75
80 Asp Ser Phe Ser Ser Ser Arg Thr Ala Thr Leu Lys Lys Gln Pro Ser
85 90 95 His Met Glu
Ala Ala His Phe Gly Asp Leu Gly Arg Ser Cys Leu Asp 100
105 110 Tyr Gln Thr Gln Glu Thr Lys Ser
Ser Leu Ser Lys Thr Leu Glu Gln 115 120
125 Val Leu His Asp Thr Ile Val Leu Pro Tyr Phe Ile Gln
Phe Met Glu 130 135 140
Leu Arg Arg Met Glu His Leu Val Lys Phe Trp Leu Glu Ala Glu Ser 145
150 155 160 Phe His Ser Thr
Thr Trp Ser Arg Ile Arg Ala His Ser Leu Asn Thr 165
170 175 Met Lys Gln Ser Ser Leu Ala Glu Pro
Val Ser Pro Ser Lys Lys His 180 185
190 Glu Thr Thr Ala Ser Phe Leu Thr Asp Ser Leu Asp Lys Arg
Leu Glu 195 200 205
Asp Ser Gly Ser Ala Gln Leu Phe Met Thr His Ser Glu Gly Ile Asp 210
215 220 Leu Asn Asn Arg Thr
Asn Ser Thr Gln Asn His Leu Leu Leu Ser Gln 225 230
235 240 Glu Cys Asp Ser Ala His Ser Leu Arg Leu
Glu Met Ala Arg Ala Gly 245 250
255 Thr His Gln Val Ser Met Glu Thr Gln Glu Ser Ser Ser Thr Leu
Thr 260 265 270 Val
Ala Ser Arg Asn Ser Pro Ala Ser Pro Leu Lys Glu Leu Ser Gly 275
280 285 Lys Leu Met Lys Ser Ile
Glu Gln Asp Ala Val Asn Thr Phe Thr Lys 290 295
300 Tyr Ile Ser Pro Asp Ala Ala Lys Pro Ile Pro
Ile Thr Glu Ala Met 305 310 315
320 Arg Asn Asp Ile Ile Ala Arg Ile Cys Gly Glu Asp Gly Gln Val Asp
325 330 335 Pro Asn
Cys Phe Val Leu Ala Gln Ser Ile Val Phe Ser Ala Met Glu 340
345 350 Gln Glu His Phe Ser Glu Phe
Leu Arg Ser His His Phe Cys Lys Tyr 355 360
365 Gln Ile Glu Val Leu Thr Ser Gly Thr Val Tyr Leu
Ala Asp Ile Leu 370 375 380
Phe Cys Glu Ser Ala Leu Phe Tyr Phe Ser Glu Tyr Met Glu Lys Glu 385
390 395 400 Asp Ala Val
Asn Ile Leu Gln Phe Trp Leu Ala Ala Asp Asn Phe Gln 405
410 415 Ser Gln Leu Ala Ala Lys Lys Gly
Gln Tyr Asp Gly Gln Glu Ala Gln 420 425
430 Asn Asp Ala Met Ile Leu Tyr Asp Lys Tyr Phe Ser Leu
Gln Ala Thr 435 440 445
His Pro Leu Gly Phe Asp Asp Val Val Arg Leu Glu Ile Glu Ser Asn 450
455 460 Ile Cys Arg Glu
Gly Gly Pro Leu Pro Asn Cys Phe Thr Thr Pro Leu 465 470
475 480 Arg Gln Ala Trp Thr Thr Met Glu Lys
Val Phe Leu Pro Gly Phe Leu 485 490
495 Ser Ser Asn Leu Tyr Tyr Lys Tyr Leu Asn Asp Leu Ile His
Ser Val 500 505 510
Arg Gly Asp Glu Phe Leu Gly Gly Asn Val Ser Pro Thr Ala Pro Gly
515 520 525 Ser Val Gly Pro
Pro Asp Glu Ser His Pro Gly Ser Ser Asp Ser Ser 530
535 540 Ala Ser Gln Ser Ser Val Lys Lys
Ala Ser Ile Lys Ile Leu Lys Asn 545 550
555 560 Phe Asp Glu Ala Ile Ile Val Asp Ala Ala Ser Leu
Asp Pro Glu Ser 565 570
575 Leu Tyr Gln Arg Thr Tyr Ala Gly Lys Met Thr Phe Gly Arg Val Ser
580 585 590 Asp Leu Gly
Gln Phe Ile Arg Glu Ser Glu Pro Glu Pro Asp Val Arg 595
600 605 Lys Ser Lys Gly Ser Met Phe Ser
Gln Ala Met Lys Lys Trp Val Gln 610 615
620 Gly Asn Thr Asp Glu Ala Gln Glu Glu Leu Ala Trp Lys
Ile Ala Lys 625 630 635
640 Met Ile Val Ser Asp Ile Met Gln Gln Ala Gln Tyr Asp Gln Pro Leu
645 650 655 Glu Lys Ser Thr
Lys Leu 660 332363DNAArtificial SequenceDescription
of Artificial Sequence Synthetic polynucleotide 33gcggcttgtt
gataatatgg cggctggagc tgcctgggca tcccgaggag gcggtggggc 60ccactcccgg
aagaagggtc ccttttcgcg ctagtgcagc ggcccctctg gacccggaag 120tccgggccgg
ttgctga atg agg gga gcc ggg ccc tcc ccg cgc cag tcc 170
Met Arg Gly Ala Gly Pro Ser Pro Arg Gln Ser
1 5 10 ccc cgc acc ctc
cgt ccc gac ccg ggc ccc gcc atg tcc ttc ttc cgg 218Pro Arg Thr Leu
Arg Pro Asp Pro Gly Pro Ala Met Ser Phe Phe Arg 15
20 25 cgg aaa gtg aaa ggc
aaa gaa caa gag aag acc tca gat gtg aag tcc 266Arg Lys Val Lys Gly
Lys Glu Gln Glu Lys Thr Ser Asp Val Lys Ser 30
35 40 att aaa gct tca ata tcc
gta cat tcc cca caa aaa agc act aaa aat 314Ile Lys Ala Ser Ile Ser
Val His Ser Pro Gln Lys Ser Thr Lys Asn 45
50 55 cat gcc ttg ctg gag gct
gca gga cca agt cat gtt gca atc aat gcc 362His Ala Leu Leu Glu Ala
Ala Gly Pro Ser His Val Ala Ile Asn Ala 60 65
70 75 att tct gcc aac atg gac tcc
ttt tca agt agc agg aca gcc aca ctt 410Ile Ser Ala Asn Met Asp Ser
Phe Ser Ser Ser Arg Thr Ala Thr Leu 80
85 90 aag aag cag cca agc cac atg gag
gct gct cat ttt ggt gac ctg ggc 458Lys Lys Gln Pro Ser His Met Glu
Ala Ala His Phe Gly Asp Leu Gly 95
100 105 aga tct tgt ctg gac tac cag act
caa gag acc aaa tca agc ctt tct 506Arg Ser Cys Leu Asp Tyr Gln Thr
Gln Glu Thr Lys Ser Ser Leu Ser 110 115
120 aag acc ctt gaa caa gtc ttg cac gac
act att gtc ctc cct tac ttc 554Lys Thr Leu Glu Gln Val Leu His Asp
Thr Ile Val Leu Pro Tyr Phe 125 130
135 att caa ttc atg gaa ctt cgg cga atg gag
cat ttg gtg aaa ttt tgg 602Ile Gln Phe Met Glu Leu Arg Arg Met Glu
His Leu Val Lys Phe Trp 140 145
150 155 tta gag gct gaa agt ttt cat tca aca act
tgg tcg cga ata aga gca 650Leu Glu Ala Glu Ser Phe His Ser Thr Thr
Trp Ser Arg Ile Arg Ala 160 165
170 cac agt cta aac aca atg aag cag agc tca ctg
gct gag cct gtc tct 698His Ser Leu Asn Thr Met Lys Gln Ser Ser Leu
Ala Glu Pro Val Ser 175 180
185 cca tct aaa aag cat gaa act aca gcg tct ttt tta
act gat tct ctt 746Pro Ser Lys Lys His Glu Thr Thr Ala Ser Phe Leu
Thr Asp Ser Leu 190 195
200 gat aag aga ttg gag gat tct ggc tca gca cag ttg
ttt atg act cat 794Asp Lys Arg Leu Glu Asp Ser Gly Ser Ala Gln Leu
Phe Met Thr His 205 210 215
tca gaa gga att gac ctg aat aat aga act aac agc act
cag aat cac 842Ser Glu Gly Ile Asp Leu Asn Asn Arg Thr Asn Ser Thr
Gln Asn His 220 225 230
235 ttg ctg ctt tcc cag gaa tgt gac agt gcc cat tct ctc cgt
ctt gaa 890Leu Leu Leu Ser Gln Glu Cys Asp Ser Ala His Ser Leu Arg
Leu Glu 240 245
250 atg gcc aga gca gga act cac caa gtt tcc atg gaa acc caa
gaa tct 938Met Ala Arg Ala Gly Thr His Gln Val Ser Met Glu Thr Gln
Glu Ser 255 260 265
tcc tct aca ctt aca gta gcc agt aga aat agt ccc gct tct cca
cta 986Ser Ser Thr Leu Thr Val Ala Ser Arg Asn Ser Pro Ala Ser Pro
Leu 270 275 280
aaa gaa ttg tca gga aaa cta atg aaa agt ata gaa caa gat gca gtg
1034Lys Glu Leu Ser Gly Lys Leu Met Lys Ser Ile Glu Gln Asp Ala Val
285 290 295
aat act ttt acc aaa tat ata tct cca gat gct gct aaa cca ata cca
1082Asn Thr Phe Thr Lys Tyr Ile Ser Pro Asp Ala Ala Lys Pro Ile Pro
300 305 310 315
att aca gaa gca atg aga aat gac atc ata gca agg att tgt gga gaa
1130Ile Thr Glu Ala Met Arg Asn Asp Ile Ile Ala Arg Ile Cys Gly Glu
320 325 330
gat gga cag gtg gat ccc aac tgt ttc gtt ttg gca cag tcc ata gtc
1178Asp Gly Gln Val Asp Pro Asn Cys Phe Val Leu Ala Gln Ser Ile Val
335 340 345
ttt agt gca atg gag caa gag cac ttt agt gag ttt ctg cga agt cac
1226Phe Ser Ala Met Glu Gln Glu His Phe Ser Glu Phe Leu Arg Ser His
350 355 360
cat ttc tgt aaa tac cag att gaa gtg ctg acc agt gga act gtt tac
1274His Phe Cys Lys Tyr Gln Ile Glu Val Leu Thr Ser Gly Thr Val Tyr
365 370 375
ctg gct gac att ctc ttc tgt gag tca gcc ctc ttt tat ttc tct gag
1322Leu Ala Asp Ile Leu Phe Cys Glu Ser Ala Leu Phe Tyr Phe Ser Glu
380 385 390 395
tac atg gaa aaa gag gat gca gtg aat atc tta caa ttc tgg ttg gca
1370Tyr Met Glu Lys Glu Asp Ala Val Asn Ile Leu Gln Phe Trp Leu Ala
400 405 410
gca gat aac ttc cag tct cag ctt gct gcc aaa aag ggg caa tat gat
1418Ala Asp Asn Phe Gln Ser Gln Leu Ala Ala Lys Lys Gly Gln Tyr Asp
415 420 425
gga cag gag gca cag aat gat gcc atg att tta tat gac aag tac ttc
1466Gly Gln Glu Ala Gln Asn Asp Ala Met Ile Leu Tyr Asp Lys Tyr Phe
430 435 440
tcc ctc caa gcc aca cat cct ctt gga ttt gat gat gtt gta cga tta
1514Ser Leu Gln Ala Thr His Pro Leu Gly Phe Asp Asp Val Val Arg Leu
445 450 455
gaa att gaa tcc aat atc tgc agg gaa ggt ggg cca ctc ccc aac tgt
1562Glu Ile Glu Ser Asn Ile Cys Arg Glu Gly Gly Pro Leu Pro Asn Cys
460 465 470 475
ttc aca act cca tta cgt cag gcc tgg aca acc atg gag aag gtc ttt
1610Phe Thr Thr Pro Leu Arg Gln Ala Trp Thr Thr Met Glu Lys Val Phe
480 485 490
ttg cct ggc ttt ctg tcc agc aat ctt tat tat aaa tat ttg aat gat
1658Leu Pro Gly Phe Leu Ser Ser Asn Leu Tyr Tyr Lys Tyr Leu Asn Asp
495 500 505
ctc atc cat tcg gtt cga gga gat gaa ttt ctg ggc ggg aac gtg tcg
1706Leu Ile His Ser Val Arg Gly Asp Glu Phe Leu Gly Gly Asn Val Ser
510 515 520
ccg act gct cct ggc tct gtt ggc cct cct gat gag tct cac cca ggg
1754Pro Thr Ala Pro Gly Ser Val Gly Pro Pro Asp Glu Ser His Pro Gly
525 530 535
agt tct gac agc tct gcg tct cag tcc agt gtg aaa aaa gcc agt att
1802Ser Ser Asp Ser Ser Ala Ser Gln Ser Ser Val Lys Lys Ala Ser Ile
540 545 550 555
aaa ata ctg aaa aat ttt gat gaa gcg ata att gtg gat gcg gca agt
1850Lys Ile Leu Lys Asn Phe Asp Glu Ala Ile Ile Val Asp Ala Ala Ser
560 565 570
ctg gat cca gaa tct tta tat caa cgg aca tat gcc ggg aag atg aca
1898Leu Asp Pro Glu Ser Leu Tyr Gln Arg Thr Tyr Ala Gly Lys Met Thr
575 580 585
ttt gga aga gtg agt gac ttg ggg caa ttc atc cgg gaa tct gag cct
1946Phe Gly Arg Val Ser Asp Leu Gly Gln Phe Ile Arg Glu Ser Glu Pro
590 595 600
gaa cct gat gta agg aaa tca aaa gga tcc atg ttc tca caa gct atg
1994Glu Pro Asp Val Arg Lys Ser Lys Gly Ser Met Phe Ser Gln Ala Met
605 610 615
aag aaa tgg gtg caa gga aat act gat gag gcc cag gaa gag cta gct
2042Lys Lys Trp Val Gln Gly Asn Thr Asp Glu Ala Gln Glu Glu Leu Ala
620 625 630 635
tgg aag att gct aaa atg ata gtc agt gac gtt atg cag cag gct cag
2090Trp Lys Ile Ala Lys Met Ile Val Ser Asp Val Met Gln Gln Ala Gln
640 645 650
tat gat caa ccg tta gag aaa tct aca aag tta tgactcaaaa cttgagataa
2143Tyr Asp Gln Pro Leu Glu Lys Ser Thr Lys Leu
655 660
aggaaatctg cttgtgaaaa ataagagaac ttttttccct tggttggatt cttcaacaca
2203gccaatgaaa acagcactat atttctgatc tgtcactgtt gtttccaggg agagaatggg
2263gagacaatcc taggacttcc accctaatgc agttacctgt agggcataat tggatggcac
2323atgatgtttc acacagtgag gagtctttaa aggttaccaa
236334662PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 34Met Arg Gly Ala Gly Pro Ser Pro Arg Gln Ser
Pro Arg Thr Leu Arg 1 5 10
15 Pro Asp Pro Gly Pro Ala Met Ser Phe Phe Arg Arg Lys Val Lys Gly
20 25 30 Lys Glu
Gln Glu Lys Thr Ser Asp Val Lys Ser Ile Lys Ala Ser Ile 35
40 45 Ser Val His Ser Pro Gln Lys
Ser Thr Lys Asn His Ala Leu Leu Glu 50 55
60 Ala Ala Gly Pro Ser His Val Ala Ile Asn Ala Ile
Ser Ala Asn Met 65 70 75
80 Asp Ser Phe Ser Ser Ser Arg Thr Ala Thr Leu Lys Lys Gln Pro Ser
85 90 95 His Met Glu
Ala Ala His Phe Gly Asp Leu Gly Arg Ser Cys Leu Asp 100
105 110 Tyr Gln Thr Gln Glu Thr Lys Ser
Ser Leu Ser Lys Thr Leu Glu Gln 115 120
125 Val Leu His Asp Thr Ile Val Leu Pro Tyr Phe Ile Gln
Phe Met Glu 130 135 140
Leu Arg Arg Met Glu His Leu Val Lys Phe Trp Leu Glu Ala Glu Ser 145
150 155 160 Phe His Ser Thr
Thr Trp Ser Arg Ile Arg Ala His Ser Leu Asn Thr 165
170 175 Met Lys Gln Ser Ser Leu Ala Glu Pro
Val Ser Pro Ser Lys Lys His 180 185
190 Glu Thr Thr Ala Ser Phe Leu Thr Asp Ser Leu Asp Lys Arg
Leu Glu 195 200 205
Asp Ser Gly Ser Ala Gln Leu Phe Met Thr His Ser Glu Gly Ile Asp 210
215 220 Leu Asn Asn Arg Thr
Asn Ser Thr Gln Asn His Leu Leu Leu Ser Gln 225 230
235 240 Glu Cys Asp Ser Ala His Ser Leu Arg Leu
Glu Met Ala Arg Ala Gly 245 250
255 Thr His Gln Val Ser Met Glu Thr Gln Glu Ser Ser Ser Thr Leu
Thr 260 265 270 Val
Ala Ser Arg Asn Ser Pro Ala Ser Pro Leu Lys Glu Leu Ser Gly 275
280 285 Lys Leu Met Lys Ser Ile
Glu Gln Asp Ala Val Asn Thr Phe Thr Lys 290 295
300 Tyr Ile Ser Pro Asp Ala Ala Lys Pro Ile Pro
Ile Thr Glu Ala Met 305 310 315
320 Arg Asn Asp Ile Ile Ala Arg Ile Cys Gly Glu Asp Gly Gln Val Asp
325 330 335 Pro Asn
Cys Phe Val Leu Ala Gln Ser Ile Val Phe Ser Ala Met Glu 340
345 350 Gln Glu His Phe Ser Glu Phe
Leu Arg Ser His His Phe Cys Lys Tyr 355 360
365 Gln Ile Glu Val Leu Thr Ser Gly Thr Val Tyr Leu
Ala Asp Ile Leu 370 375 380
Phe Cys Glu Ser Ala Leu Phe Tyr Phe Ser Glu Tyr Met Glu Lys Glu 385
390 395 400 Asp Ala Val
Asn Ile Leu Gln Phe Trp Leu Ala Ala Asp Asn Phe Gln 405
410 415 Ser Gln Leu Ala Ala Lys Lys Gly
Gln Tyr Asp Gly Gln Glu Ala Gln 420 425
430 Asn Asp Ala Met Ile Leu Tyr Asp Lys Tyr Phe Ser Leu
Gln Ala Thr 435 440 445
His Pro Leu Gly Phe Asp Asp Val Val Arg Leu Glu Ile Glu Ser Asn 450
455 460 Ile Cys Arg Glu
Gly Gly Pro Leu Pro Asn Cys Phe Thr Thr Pro Leu 465 470
475 480 Arg Gln Ala Trp Thr Thr Met Glu Lys
Val Phe Leu Pro Gly Phe Leu 485 490
495 Ser Ser Asn Leu Tyr Tyr Lys Tyr Leu Asn Asp Leu Ile His
Ser Val 500 505 510
Arg Gly Asp Glu Phe Leu Gly Gly Asn Val Ser Pro Thr Ala Pro Gly
515 520 525 Ser Val Gly Pro
Pro Asp Glu Ser His Pro Gly Ser Ser Asp Ser Ser 530
535 540 Ala Ser Gln Ser Ser Val Lys Lys
Ala Ser Ile Lys Ile Leu Lys Asn 545 550
555 560 Phe Asp Glu Ala Ile Ile Val Asp Ala Ala Ser Leu
Asp Pro Glu Ser 565 570
575 Leu Tyr Gln Arg Thr Tyr Ala Gly Lys Met Thr Phe Gly Arg Val Ser
580 585 590 Asp Leu Gly
Gln Phe Ile Arg Glu Ser Glu Pro Glu Pro Asp Val Arg 595
600 605 Lys Ser Lys Gly Ser Met Phe Ser
Gln Ala Met Lys Lys Trp Val Gln 610 615
620 Gly Asn Thr Asp Glu Ala Gln Glu Glu Leu Ala Trp Lys
Ile Ala Lys 625 630 635
640 Met Ile Val Ser Asp Val Met Gln Gln Ala Gln Tyr Asp Gln Pro Leu
645 650 655 Glu Lys Ser Thr
Lys Leu 660 35162025DNAHomo sapiens 35gaattcctat
ttcaaaagaa acaaatgggc caagtatggt ggctcatacc tgtaatccca 60gcactttggg
aggccgaggt gagtgggtca cttgaggtca ggagttccag gccagtctgg 120ccaacatggt
gaaacactgt ctctactaaa aatacaaaaa ttagccgggc gtggtggcgg 180gcacctgtaa
tcccagctac tcaggaggct gaggcaggag aattgcttga acctgggaga 240tggaggttgc
agtgagccga gatcgcgcca ctgctctcca gcctgggtgg cagagtgaga 300ctctgtctca
aaaagaaaca aagaaataaa tgaaacaatt ttgttcacat atatttcaca 360aatttgaaat
gttaaaggta ttatggtcac tgatatcctg tttcattctt tatataatca 420ttaagtttga
aatgtatact tgcactacta acacagtagt taatcttagt cctacaagtt 480actgctttta
cacaatatat tttcgtaata tgtatgcact ggtgtttatg tacgtgttta 540tgtttatatc
tgttaaaatt agcagtttcc atctttttct attttgtacc atcacatcag 600ttcagaagga
ttgacagagc aaaatgattt gatgaagtat aaaagtcaca tggtgagtgg 660cataaataca
actctgaaca attaggaggc tcactattga ctggaactaa actgcaagcc 720agaaagacac
atatcctata tgtcaagaga tgtaccaccc aggcagttaa agaagggaag 780tacacataga
aagcacaatg gtgaataatt aaaaaattgg aatttatcag acactggatt 840catttgctcc
taaagtcaga gtcctctatt gtttttttgt ttttgtgggt ttctttttaa 900atttttttat
tttttgtaga gtcggagtct cactgtgtta cccgggctgg tctagaactc 960ctggcctcaa
acaaacctcc tgcctcagct tcccaaagca ttgggattac agacatgagc 1020cactgagccc
agcccagacg ctttagcatt tatgaagctt ctgaaatagt tgtagaaacc 1080gcataagctt
tccatgtcac tttcaaagtt tgatggtctc tttagtaaac caaccaagtt 1140attcctcaag
ggcaaaataa catttctcag tgcaaaactg atgcacttca ttaccaaaag 1200gaaaagacca
caactataga ggcgtcattg aaagctgcac tcttcagagg ccaaaaaaaa 1260aggtacaaac
acatactaat ggaacattct ttagaagagc cccaaagtta atgataaaca 1320ttttcatcaa
agagaaaaga gaacaaggtg ttagcaaatt cctctatcaa ataacactaa 1380acatcaagga
acatcaatgg catgccatgt ggaagaggaa gtgctagctc atgtacaaac 1440cagtagataa
tttcaacttg ctgccgaatg aaacctcttt gcaaggtatg aatcagcact 1500tctcatgttt
gttttgcttt gttttgtttt gtttttagag acaggccctt gctctgtcac 1560acaggctgga
gtgcagtggc acgatcagag ctcactgcaa cctgaaactc ctgggctcaa 1620gggatcctcc
tgccttagcc tcccaagtag ctgggactac aggcccacca tgcccagcta 1680attttttaaa
ttttctatag agatgggatc tcactagcac ctttcatgtt tgatgttcat 1740atacaacgac
caaggtacaa tgtggaaaag ggtctcaggg atctaaagtg aaggaggacc 1800agaaagaaaa
ggggttgcta catagagtag aagaagttgc acttcatgcc agtctacaac 1860actgctgttt
tcctcagagc agagttgatg atctaaatca ggggtcccca acccccagtt 1920catagcctgt
taggaaccgg gccacacagc aggaggtgag caataggcaa gcgagcatta 1980ccacctgggc
ttcacctccc gtcagatcag tgatgtcatt agattctcat aggaccatga 2040accctattgt
gaactgagca tgcaagggat gtaggttttc cgctctttat gagactctaa 2100tgccggaaga
tctgtcactg tcttccatca ccctgagatg ggaacatcta gttgcaggaa 2160aacaacctca
gggctcccat tgattctata ttacagtgag ttgtatcatt atttcattct 2220atattacaat
gtaataataa tagaaataaa ggcacaatag gccaggcgtg gtggctcaca 2280cctgtaatcc
cagcacttcg ggaggccaag gcaggcggat cacgaggtca ggagatcgag 2340accatcctgg
ctaaaacggt gaaaccccgt ctactaaaaa ttcaaaaaaa aattagccgg 2400gtgtggtggt
gggcacctgt agtcccagct actcgagagg ctgaggcagg agaatggtgt 2460gaacctggga
ggcagagctt gaggtaagcc gagatcacgc cactgcactc cagcctgggc 2520gacagagcga
tactctgtct caaaaaaaaa aaaaaaaaaa aaagaaataa agtgaacaat 2580aaatgtaatg
tggctgaatc attccaaaac aatcccccca ccccagttca cggaaaaatt 2640ctcccacaaa
accagtccct ggtgccaaaa aggttgggga ccgctaatct aaataatcta 2700atcttcattc
aatgctaaaa aatgaataaa ctttttttta aatacacggt ctcactttgt 2760tgcccaggct
ggagtacggt ggcatgatca cagctcactg tagcctcaat cacccaggcc 2820ccagcgatcc
tcccacctaa acttcctgag tagctgggac tacaggcacg caccaccatg 2880cccagctaat
ttttaaattt tttatagaga tgggggtctc accatgttgc ccagactggt 2940ctcaaaccct
gggctcaagt gatcctccct caaactcctg gactcaagtg atcctccttc 3000cttggcctcc
caaagtgctg ggattacaag catgagccac tgtacccagc tggataaaca 3060ttttaagtcg
cactacagtc atggacaatc aggcttttca acatgcagta tggacagtga 3120gtcccagggt
ctgcttttcc atactgaaat acatgtgata ctaaggagaa aggtgctcgc 3180aaggatattt
aaaatgaaga atatttaaaa tgaggaaaaa actgtttctt catgactttg 3240ataaggctga
taaagaccat ttctgtgatc tcaggtgatt cactcaagta gtatatttca 3300gtaatcatta
tctggaacag cctgaatctt aaccaaaata ccatgatttt ttaatgctgt 3360tatgatacct
tgatgatatg accaaactgc aatgtaggca gctaaatctc cacgagtttg 3420acttccccga
gagttgacag ttttcttcac aaattaaaga aatatatttt ttgatacatg 3480attggcatat
ttaaaaacta cactgaaatg ctgcaaaatg atataaagaa acattttcca 3540gaatcaaatg
caatcaaaga gtggattagg aatctactca ccattatcaa ctaaatagaa 3600acacttggac
tgggtgtggt ggctcacatc tgtaatctca gcactttggg aggccaaggc 3660aggtggattg
cttgaggcca ggagctcaag accagcctga gcaacatagc aaaactctgt 3720ctctacaaaa
aaaaaaaaaa attaaccagg catggtggca gatgcttgta atcccagcta 3780ctctggaagc
tgaagtagga ggactgcttg agcccaggag atcaagactg cagtgagccg 3840tggtcatgct
gcgccacagc ctgagtgaca gagagagacc ctgtctcaaa aacaaaaaca 3900aacaaaaaac
acttaacctt cctgtttttt gctgttgttg ttgttgtttg tttgttttga 3960gatggagtct
cactctgttg cccaggctgg agtgcagtgg cgtgatcttg gctcactgca 4020agctctgcct
cccgggttca cgccattctc ctgcctcagc ctcccgagta gctgggacta 4080taggcgcccg
ccaccacgcc cggctacttt tttgcatttt tagtagagat ggggtttcac 4140cgtgttagcc
aggatggtct tgatctcctg acctcgtgat ccacctgcct cggcctccca 4200aagtgctggg
attacaggca tgagccaccg cacccggcca acctttctgt tttttagttt 4260gatatgcttg
ttaactcagc agctgaaaga atgctgaaag tggccttcag taaaaaaatt 4320tcactagaat
ctctacatcc atatttaatc tgaatgcata tccagattga tcagttagag 4380caaaaacact
catcatcatt cctgatgacc tctaattctg gtttcggctt tctatttcaa 4440tggaaacaga
ataaggaaag aaatggaagg gctctggaaa tttgtcctgg gctatagata 4500ctatcaaaga
tcaccaacaa taagatctct cctataaata taaaacaagt ataattaatt 4560ttttaattat
ttttttctct tcagaggatt ttatttcaag ataaaacata acttctaccc 4620atactattga
ttccaaaggt tagaaaaagt gtttttcctc atcttatcct tcaaagaggt 4680cacagcaatg
caaacatcta taaaatgcct ctgcataatt gtcagaagct atagtccaga 4740aatcattgaa
aatgcttttc cattttaagc ttaggtgagg tgtcttagga aacctctatg 4800acaacttact
ctatttattg ggaggtaaac tcccagactc tcccagggtc tcctgtattg 4860atctcatttt
ttaggcttcc taatcccttg aagcacaatc gaaaaagccc tggatctctt 4920ttctgcacat
atcatcgcgg aattcattcg gcttccagca agctgacact ccatgataca 4980agcggcctcg
cccttctccg gacgccagtc cttgctgcgg ttagctagga tgaggggttt 5040gctgggcttc
agtgcaggct tctgcgggtt cccaagccgc accaggtggc ctcacaggct 5100ggatgtcacc
attgcacact gagctcctgg caggctgtac caatttttta attatttaat 5160atttattttt
aaaattatgg tgaatatttt ggtattctgc tctaaaatag gcccataaat 5220gcacagcaga
tatctcttgg aacccacagc tttccactgg aagaactaag tatttttctt 5280ttaaagatgc
tactaagtct ctgaaaagtc cagatcctct acctctttcc atcccaaact 5340aagacttgga
atttatgaga gatctagcta acagaaatcc cagacacatc attggttctt 5400cccagagtgc
agtcctccta aagaggctca gccctaagca ggcccctgca ccaggagggt 5460gggtctgaga
cccacatagc acttcccaag gtgcatgctc cagagaggca ctgaaacagc 5520tgagcacaag
cctgcaagcc tggagaactc tcacagtcag aacggagggg gcccagtggg 5580actaacataa
agagaaaagg gaacacagag aaatggatgg caccaacaac cagcaaagcc 5640ttcatggcca
atgaaagcat cagtgacggg gccagaaccc tcatccccaa agactcttca 5700ctgcctttag
tgaaaaacaa tggctagaga gtgaagttat gatcatgtat agagaggtaa 5760agttacattt
ttatattctg actctgctaa tgtgaaattc cctatctgct agactaaaag 5820tttcagacac
cctgttcaaa tatcccatta gttgctagag acttaaaatg aacagaacgc 5880acattgtcag
gatgactatt accaaaaaat caaaagacag caagtattgg tgaggatgta 5940gagaaactgg
aacttttgtg cactgtttat gagaatgtaa aatggagcag ctgctgtgga 6000aaagagtatg
caggttcctc aaagagtaaa accaagatgt ggaaacaact aaatgcccat 6060cagtggatga
aggggtagac aatatgtggt atatacatac catggagtac tattcagcct 6120ctaaaaaaaa
aaaaggaaat tctataacat gcaacagcat ggatgaatct tgaggacatt 6180ttgctaatga
aataaggcag tcatagaaag acaaatactg cacgactcca cttatatgag 6240ataccaaaaa
tagacaaatt catagaatca aagagtacaa tggaggttac ctggagctgc 6300agggcgggaa
acgaggagtt actaatcaac gaacataacg ttgcagttaa gtaagatgaa 6360taagctctca
agatcagctg tacaacactg tacctagagt caacaataat gtattgtaca 6420cttaaaaatt
tgttaagggt agattaacaa atgtagtaga tccacaaatg tggttaagtg 6480ttcttaccac
agtaaaataa aaaaagaata tcaagcccag gagttcgaga ctagcctggg 6540taacatggtg
aaaccctgtc tctacagaaa atacaaaaat tagccagctg tggaggtgca 6600ctcctaggga
ggctgaggtg ggaggcttgc ttgagcccag gaggtcaagg ctgcagtgag 6660ccatgattgc
accactgtac tccagcccag atgacagagc aagacaccac cccccccaaa 6720aaaagaaaaa
gaatatcaaa cattttaaaa gatcagatac gcaagaacaa caacaaaaaa 6780gagatgaaca
gagcatcgac cctcatctag tgggattctt ggtctaactg aaaaacagac 6840attgagagac
aaacaatgac agtgatgtga tcacagcaat tacacaggta tcccctgggg 6900actgcagaag
aaaggaggaa tgcctaactt tcagaaaata gagaaagcgt caaacagttg 6960gtgaaagcct
tccaaaacta gagagaactg cacacaccaa atcacagaaa gaagaaaagc 7020cgtgggagat
tctgggaccc accggctatt tttgatggct gaacaccctg ctgcaggaga 7080gacaggagct
ggaaagcatg gtgggatgaa acctcaaaca gctttgcctg cattgcttaa 7140gatgactggg
cttgattaac tctagtcaat ggggacaatt caatcaaaga agaaagatgc 7200tcaaattcac
attttagaat gattttttat ggcagtatgg ggaatagatt aaaagagagt 7260gaagctggag
gcaagaaact tgttaagagg caactgaaac agtctagatg ataaataata 7320aactgacaga
gtgactagaa aaatcagaac aggctgaatc aacagatacc tagatgaaaa 7380taacaggact
tgatcaccag ttgtatcttg gagaggaagg agttgtttcc ttgctttccc 7440tacgactggg
aatacggaag gtttgccgtg tgtattggtt atatactggt gtgtagccaa 7500tcactgacaa
ccatttagca gcttaaaaca caaaggctta tctcccagtt tctgtgggcc 7560aggaatctaa
gataggctta gctggctggt tctggctcag agtttctcaa gaggttgcaa 7620tcaagatgtc
agctggggtt gcatcatctg aaggctcaac tggggccgga gggtccactt 7680ccaaggagtt
cactcacctg cctgacaagg cagtgctggt tgttggcagg agatctcaat 7740tcattgccaa
gtgagcctct ctatagcatt gctggaacat cctccccatc tggcagttgg 7800cttctctcag
catgagtgat ctgagagaga gagcaaggag gaagccacag tgttcttcct 7860actcctactc
ctaacactat ggacctactc ctaacactct cacttctgcc ttattccatt 7920agttagaaag
ggaactaagc tccacctctt gaaataagaa gtgtcaaaga atttgtggat 7980atatttaaaa
atcatcacac tgtggaagtg gatagggggt tcaattaatg ctgaacttga 8040aatgcctgag
acattcaaat gtccaacagg caatgaacat acccatagat ggtcatgact 8100ttagcaagaa
tagaggaaga tcacagaatt aaggaggaat tgaaaggtaa aagaagtgga 8160gtcagattcc
ccctgaaaag tgagccatga aaggaacttt aactattgag ttagaggtca 8220gagtaggaaa
tttcggtgga attctttttt aaagaaagga accatataag catgttttga 8280ggtagaggga
gaataaatca gtagacaggg agaggtaaaa aacataaatg ataggggata 8340gttgacaaag
gtcttggcag aatcccttac ccattgactt ggggccaaga gagggacact 8400tctttgtttg
agggataagg aaaataagaa agaatgggtg ctatttagtg tggtcctgtc 8460tctagggcaa
acgcataggt aacaaactgt gtgtgttagg aatatagatg tgacctcaca 8520ttgagattct
cacctcaaat ccattttgtt gttacctgta ccttcctacc ttctcttttt 8580gctacatgca
gactgctgtt ttgtcttcct ggcctgttcc aggtttcagc attctggcat 8640atctgctacc
ctgttcccaa acctctctag agtccatgct ccttccttgg atagtgtttg 8700attgggccac
gtatctaaga agtgatgcct tcagttaggc ctgagaacct cctctatgga 8760aatctccatc
agtgaccctg acagacttgg tatcttggag atgtcactgc tcccagcctg 8820tggtctagga
gaatctcagc ctgggcctct agtagtatgg ataaggcgtt aaggtatctt 8880tgaaccagag
tctgtcatat tcctcaatgt gggacagata aaacagtggt agtgctggtg 8940tttctgagct
agaactctgg tttttggtct agattctttg atgtatgacc tttcagaggt 9000attaaaattt
gttctaatac aatgttcaat acaaatgtag ttccttttct gttaggacct 9060caacaaaaca
tgaccaactg tagatgaaca ttaaactatg acaattcatg gaaatgaata 9120cagtaatacc
tgcggttccc ccattttagc agtcactatg gtgacatttg gcacaaatgg 9180ctatttaagg
gtgcttttgt taaaacctac catcttacta ggcacatgat attgaaacta 9240atgaaataat
ggagaaactt cttaaaaact tttaatgaat aaagtgatga agtgataata 9300ttttagctgc
tatttataaa gtgactatta caggtcaaac attcttctag ggtttttttg 9360ttgaagttgt
cacatttaat ccttaataac ccactatgag tcaggtattc ttctctcccc 9420tttggacagt
tggggaaatg ggggtcagag aggttaggta atttgctcag ggccacacaa 9480cctgcatgta
gaaaatctga gatttgtaca ggaacgtatc aaactctgaa gtccatgctt 9540ctattttccc
atgctgcctt tctaataaaa ggtaactaat gctactggat gctgccccca 9600aagtgagtca
ctttcacccc accctacttg attttctcca taaaactaat cacatcctga 9660caacttattt
attgctgatc tcccccacta gattataaac tcaataaaag caagatcctt 9720gtctgctgaa
tatcagtacc taaaacgctg tctagcacag agcaagtaat taatatttgt 9780tgaatgaaca
aataaaggaa aaaaattcaa aggaagaaaa agccctaaaa cagatgttta 9840cctaaacata
cattttaaaa gaaagcatat aacaaattca ggacagaatt taaatttgat 9900tttttaaaga
aataaccaag tgctagctgg gcacagtggc tcacacctgt aatcctagca 9960ctctgggagg
ccgaggcagg cagatcactt gaggtcaaga gttcaagacc agcctggcca 10020acatggtgaa
acctgtctct actaaaaata cagaaattat ccaggcatgg tggcaggtcc 10080ctgtaacccc
agctactcag gaggctgagt caggagaatt gcttgaaccc aggaggcaga 10140ggttgcagtg
ggccaagatt gcaccactgc actccagcct gagtaacaaa gcaagactct 10200gtctgaagga
gaaggaaaga aagaaggaaa gaaggaaaga aggaaagaag gaaagaagga 10260aagaaagaaa
gaaagaaaga aagaaagaaa gaaagaaaga aagaaagaaa gaaagaaaga 10320aagaaagaaa
aagaaagaaa gaaagaaaga accaagtgct tatttgggac ctactatgct 10380atgtttttcc
atgcacgcta ttttcagtaa agcagttagc aaacttgcaa gatcataaca 10440acaaatatat
gcttctataa ctctaaaatt gtgctttaag aagttcctct ttaccagctc 10500atgtatgcat
tagttttcta agagttacta gtaacttttt ccctggagaa tatccacagc 10560cagtttattt
aaccaaagga ggatgcttac taacatgaag ttatcaaatg tgagcctaag 10620ttgggccagt
tcatgttaat atactccaga acaaaaacca tcctactgtc ctctgacaat 10680tttacctgaa
aattcatttt ccacattacc aaggagccag ggtaggagaa tatagaaaga 10740ccacccaaga
atccttactt ctttcagcaa aatcaattca aagtaggtaa ctaaacacat 10800gccctaacaa
tgaatagcag attgtgctca gaagaatgat ctacaacatc ttactgtgaa 10860ggaactactg
aaatattcca ataagacttc tctccaaaat gattttattg aatttgcatt 10920ttaaaaaata
ttttaagcct aaattttaaa aggtttgata ttggtacatg aatagacaaa 10980cagacatgga
ctagaccaag aattaggttc aaacatatac aggaatttaa tatacgataa 11040atctagtatt
ccaaaggaac caacaaatgg tgttcagaca gcaggatagg catcaggaaa 11100aacacagttg
ggcaccctac cttactccta acaccaggag taactgaagg agcaccaaat 11160atttatttat
tttaattata gttttaagtt ctagggtacg tgtgcacaac atgcaggttt 11220attacatagg
tatacatgtg ccatgttggt gaggagcacc aaatatttaa aagaaaaaaa 11280ttggccaggg
gcggtggctc acacctgtaa tcccagcact ttgggaggcc aaggtgggca 11340gatcacctga
ggtcgggagt tcgagaccag cctgagcaac atggagaaac cccatctcta 11400ctaaaaatac
aaaattagcc aggcatggtg gcacatgcct gtaatcccag ctacttggga 11460ggctgaggca
ggagaatagc tttaatctgg gaggcacagg ttgcggtgag ctgagatatt 11520gcactccagc
ctgggcaaca agagcaaaac ttcaactcaa aaaaattaat aaataaataa 11580aaataaagaa
agaaaagaaa aaaatgaaaa tagtataatt agcagaagaa aacaccgtag 11640aatcctcgga
ctcttaggat ggggaatgcc tataatataa aaaccctgaa gttataaaag 11700agaaaatcac
ctacatacaa accaaatctt tctacatgcc taaaacatag cacaaacaca 11760gctaaataat
catagctgaa tgaactggga aaacaaaact tgactcatat ccagacagag 11820ttaattttcc
tacacataaa gagtacctat ataaacccaa caaaaaaacc accactaacc 11880caaaataaaa
atgtgacagg taatgaacag gtagttcaca gagaatacaa atggctcttc 11940ggcacataag
atgctcagac tgacttttac ttatttattt tttgagagac agggtctcac 12000gatgttgccc
aggttaggct caaactcctg ggctcaaatg atagtaccag gactacaggt 12060gtgccccacc
gcacctggct cctcaaccac ctgtattaac aggaaatgca aaataaaact 12120ttcaaatcta
ttttacctat tagaatggca aaaatttgaa aaacttcaaa catcatcatg 12180ttggtgagaa
tgtgaggaga ctggcactct cattttttgc tgatagcata tatatactga 12240tggcttctat
ggaaagcaat ctggcagcgt ctatcaaatg tacaagtgca tatatccttt 12300gacaaagcaa
ttccactcta ggaatgtgtt ctatatggtt gtgcttcctg gggctgggaa 12360ctgggagcta
agggacaggg gcagaagata atcttctttt ccctccttcc ccgttaaaca 12420tgttgaattt
tatatactgt aatatattat ttttcacaaa agataatttt taagcgatat 12480gtctgggaat
tttttttttt cttttctgag acagggtctc actctgtcat ccaggctgga 12540atgccatggt
atgatctcag ctgactgcag cctcgacctc ctgggttcaa gcaatcctcc 12600cacctcagcc
tcctgagtag ctgggactac aggcacgtgc catcatgcta atttttgtat 12660atacagggtc
tcactatgtt gcccaggcta atgtcaaact cctaggctca agcaatccac 12720ccacctcagg
ctccaaagtg ctgggattac aggcgtgagc caccgcgcct ggccctggga 12780attcttacaa
aagaaaaaat atctactctc cccttctatt aaagtcaaaa cagagaagga 12840aattcaacct
ataatgaaag tagagaaggg cctcaaccct gagcaacaaa cacaaaggct 12900atttctgaga
caggaatttg ctgaacaaaa tcgagggaag atgacaagaa tcaagactca 12960cttctcggct
gggcgcagtg gctcacacct gtaatcccag cactttggga ggccgaggcg 13020gacagatcac
gaggtcagga gattgagacc atactggcta acacagtgaa acccagtctc 13080tactaaaaat
acaaaaaatt agccgggcgt ggtggcaggt gcctgtagtc ccagctactt 13140gggaagctga
ggcaggagaa tggcgtgaac ccaggaagcg gagcttgcag tgagccgaga 13200tcacgccact
gcactccagc ctgggtgaca gagcaagact ctgtctcaaa aaaaaaaaaa 13260aagactcatt
tctctagatc ttgagccgta ttcaaattta tctcagctta gtgagaggtt 13320aaagcaagga
atatccttcc ctgtgggccc tgctccttac tgaaggaagg taacggatga 13380gtcaaggaca
ccaatggaga aaagcactaa caccattatc tgatgaacat tacgtgaaga 13440agggtaagaa
gtgaagtgga attgctgaag aagtcagtga aagcggacat tcatttgggg 13500aaatggaata
taggaaatcc ataaaagtga ttaaaaagat gttagaggct gaggcggggg 13560gaccacaggg
tcaggagatc gagaccatcc tggctaacac ggtgaaaccc catctctact 13620aaaaatacaa
aaaattagcc aggcgtggtg gcaggcacct gtagtcccaa ctactcggga 13680gactgaggca
ggagaatggc atgaacctgg gagacggagc ttgcagtgag ccgagatcac 13740gccactgcac
tccagcctgg gtgacagagt gagactccat ctcaaaaaaa aaagttagat 13800acgagagata
aagatccaac agacacacaa ctgctaattc tgaacagaac aaaacaaatg 13860gcacaggaaa
agaaaattta agatataaca ccggaaaact ttcctgaaat tgagtaactg 13920aatctatagc
ttgaaagggt ttagcatatg ccaagaaaaa tcagtagagt ccaaccagca 13980caagacacat
ctagcaaggc tggtgattct accaacacag agaaagaagt gggtgaccca 14040taatgcggaa
aaaggcagac catctgcagt cttctccaga acactggagt ctgaagacaa 14100aagaatgctg
cctactgagc cagaagggag agaaagtgac ccaacacatc tttaccaagt 14160tagaatgtca
cgcattattt aaaggctgca aaagccatga aagacatgaa agaacacaag 14220catttacaac
atgaaagaac acaagcattc tcatactcaa gaatccttaa gaaaaatgta 14280gtcctaatcc
agcccactga aagttaaatg tacttaatgt gctcattaat gggaacttca 14340tagcttcaaa
tcagtctggt cccatctacc aacatctctc gcccggcttt cctgcaatag 14400tcagcacctt
tccctcctcc cagtcttgtc ccctggagtc tgctctcagc atagcagagt 14460gaccacatca
acacccaagt cagagccctc cagtgcgcac tggtctacaa agcccttccc 14520accccccacc
ccacgtgccc tccggatcct tgtgacgtgt ctcctgcata ccctagcagc 14580cctggcctcc
tcactgcccc tcctgtacat caggaaggcg actccttgag tcttggctct 14640ggccgcctcc
tccacctgca gtgagttaac tcccttacct actctaggtc attgctcaaa 14700tgtcagcatc
tcaatggggc cctccctgac taccctattt aaattctaca tactcccctt 14760gaccccatgg
acctcactca ccctattcca cttttattct tacaatttag cacttgttct 14820cttctaacgt
attctaagac ttactcattt attacattgt ttgccacccc ctctagtaca 14880taaactccag
aggggcaggg atttctgtct atttattcat ttctttatcc ctaggacata 14940gaacagggca
tagttcagag tattcaatgt tatcaatgaa tgaactagca gtagtaccag 15000ttccagttag
gcacagaatt aaatctaaat agaattaaat ctcatggtct gggttaacta 15060tggatagaaa
attagatata attttaagaa gcctagaaag aaaaaattaa taatgtaaaa 15120ataatattaa
tttgataata ataacaaaaa ctctgccagg cactgtggct caaatctgca 15180atcccagcta
ctcaggaggc tgaggtggaa ggatcacttg agaccagagt tcaagactca 15240gcctaggcaa
cacggcaaga aactgtctct aaaaaaatta aaacttaaat ttttaaaaaa 15300gaattctcaa
agcgtcacaa aaactggaga ttaaggtaca ggaagtgtga agtaatatta 15360ctatgctaat
ggtttttttt ttttttagaa aggtataacc aaaagatttc tttctcaagt 15420cgataaactg
agaaagataa gcatatcttc caattaacag agggggagga aaagccagat 15480acaacaaaat
aagatataaa ttagtttcca gttgaaaaca agagtaggag ttattttgca 15540tcacctcacc
tgtgacctcc cccagcccaa aaaacactac tgataaacag ggtagaaaag 15600catcatctca
gataaagcag gaaaaactgc cacagtctca aaccacaaac tataagcaca 15660cacctggcca
accctgccaa gtctgggctc agtaggagga acgtgctgag agctaggatg 15720taccaactta
gacattctgt gggatacaga tgtccctgga agggtcacac catctcaaag 15780gcacctgtaa
tgcccactga ttacagccac catatgtgag agagaaactc agggcactta 15840gagagtataa
caagaacctt atgtcatctg agatgaggaa tcctcagccc tgcaaattaa 15900ccaactcttt
agaacaactg gcaaaacata aatatccaca acttttgttt cagtaattcc 15960actcttagat
atcaatccaa agtacatgag acagcagata cacacacaaa atggtattta 16020ctgcagcatt
gtttataata gcaaaaaaca agaaataatc catatgtctc aataggatac 16080tgggtacatg
agggtatgta cccatcattc aaccatcaaa aagagtgata tggatgtcca 16140cagatggaca
taaaaagctg tgtgttacgt gaaaacaaac tcaagcagca gcaggatggg 16200cttatgatag
tcagtatgag ctaatttctg gaaaaaaaaa tctagtgtgt gcacagaaaa 16260catctgaaag
aacagaaaca aaactatcag cagaatattg agatgtttta ctaagttgta 16320tatctatact
gcttgtaatt tttaccccaa gcaagaatta ctttttggaa aaagaaaatt 16380caggaaataa
agcatttctt taaacttcat gtttaaacaa atggtgatgg aataaaagag 16440ttcttattca
tcataaacac acacagcaca catgcacgca tgtgcgtgag cacacccttt 16500acttgataaa
taccatgttg aatattttag tctttccttt taggttctat cccttcactc 16560aaaatgcggt
tataaataaa tgtacttttc atgtgccttc tgcctaaacc cactttaata 16620taactttaca
gtcccattat cattatagtc tcaaagctag actcagcctg aaactaccct 16680ttcatttgga
acccttatta aaatgccaca tacagctcct tcaaataaaa acaaacccta 16740ggacctgaca
ctaggcttcc tttgttgcta ctcataatgg ccaagttctg tgcttataat 16800acatcttctt
tcattttatt gctacatatc caagggtttt atatgttttt cttattatat 16860cttaattcaa
aacaccatca cgctcttttc cagatgaaaa taaggaaaag aaattgagca 16920actgactgac
ttaaaggtca taaaactata tagtagcaga gtcagcaaaa gaagaaacac 16980acatctccca
agtagaggct gaaaaccagt accattcacc tccagggtga gctatataca 17040gattacaaag
tcaccttctc taaatgttca aactgaatcc catacccata ctttaccact 17100acctcgtaag
aacagcctca gatcttgtta tagccttttt tttagcatgc tgaagccaat 17160aaaatgcttc
ccattcagca agagaaacaa gttctgaaac actgaataat ctgcccaggg 17220cctatgaaca
tttccactgt gagaaatgtt ctccactgtg tggagaagat ccttactctt 17280ctccacacag
gcagaacatt agaaaaattc ttggattcta tgatgcacag cttaggagtc 17340tgtttagcac
aatttaagtc caaatagtta ttaaatcctc ctctgttcca gaaacagtgc 17400taaatactgt
gaatataaaa attgaaaaga tactctcctg gctcccaaga aagtcagcca 17460gatagaggag
acacaggcac acaaatcact gtcacatgaa gctctacctc cctaacttca 17520aacgagggcc
taagtcacca agaatacagt agcagttgtg actacgagta actactataa 17580ttcaatactt
tatcttccct tagaaaactc ttctcccttg gaaatttatt tgcatttcta 17640aataccattc
cttactaaaa ggaagcaggg ctccttgggg aaatagctga ttctaggtgt 17700ggactatgaa
atgaaaatgg tgagtctggg acatcccatg ttgcccagaa atcaaggaac 17760tgcccaaaga
ttaacagagt catgttaaat ggacctaaga gtgaaccaga aggagctcac 17820tttgccccgc
gtggaacaat ttcaagaaaa acatgacagt aatgaattat aaaacatgaa 17880ttaaaataca
tattggtact aaaaagagaa caaaaggatg tggctttgga taaagctctt 17940cttcatggaa
gaataccagc taataaatgt aaaggaaatg agagaattag aaaaattatc 18000attttgtaaa
ccttaatata ttcacctaga catgctaaaa ccactgagta aaaggctgct 18060tgggaagagg
atgctcacat gatctcagag tttcacacca cagataattt attagataca 18120ggaaggaaga
tgtgatcaag cttcctgtga cccccagcca ggccccacaa cactatgtgc 18180ctccttgtga
tgtgggagct acacagcatc gcccacacag cttctcgcca aaactgtttg 18240aagctaatca
caagggaaga actggacagc ttctgaccat gagacgctcc accagacaac 18300ttgcttggcc
tctccaaaga aacttgcttg gcctctccaa agaaaactca gtttcattta 18360aaaacaaaac
taattattta aaaacaaacg aaaagcaagt tgtggacttg agctccaggg 18420acagagcaga
catacttttc cctgttcttc ccagtaagtg gtaataaaaa ccctcaacac 18480tagatataaa
acaaatataa gaaggttctg gaaggggaag aggaggcaga ctatccaggt 18540gccttgaggc
ccacagaaca acccagtgat gggttcactg ggtcttcttt ttgcttcatt 18600atctcagact
tggagctgaa gcagcaggca acttcaaaac accaaggggc acagattgaa 18660aagccccaag
aaaagcctgc cctctctagc caaaggacca ggaaggagac agtctaatga 18720gatggaacac
atttagacag taactgccca tttaccagca ataactgagc agggagccta 18780gacttccagt
cttgtgagga cgtaccaagg tacccaacac ccccaccaag gctgagtaag 18840gactgcgact
tttatccctg catggcagta gtaaggagcc catccctcac ccgccagcag 18900tgtcagggga
acctggactt ccactcccac ccaggagtga tgaggccctc cctgctgggg 18960tcatgtcaga
ggaggcctag tggagattca gtgacttaac cttttcccag agataatgag 19020gccacctttc
ctccctcttc ccccatggtg acagtgaaag cactgtggca agcagtaggc 19080actcctaccc
ctcctagcca gggaggtatc agggaggcca agtagggaac cagaataccc 19140acaaccaccc
agcagcaaca ggggtccccc accccattgg gtgtcaatgg aagcagagcg 19200gaaagcctgg
atatttaccc ccatctagaa gtaacaagct gatgtccccc ttcttctact 19260acaatggtgt
tcaaaacagg tttaaataag gtctagagtc tgataacgta atacccaaat 19320cgttgaagtt
ttcattgagg atcatttata ccaagagtca ggaagatccc aaactgaaag 19380agagaaaaga
caattgacag acactagcac taagagagca cagatattag aactacctga 19440aaggatgtta
aagcacatat cataagcctc aacaggctgg gcgcggtggc tcacgcctgt 19500aaccccagca
ctttgggagg ccgaggcagg tggatcacaa gatcaggaga tcgagaccat 19560cctggctaac
acggtgaaac cccgtctcta ctaaaaatac aaaaaaaaat agcaaggcat 19620ggtggtgggc
acctgtagtc ccagctactc gggagcctga ggcaggagaa tggcatgaac 19680ctgggaagag
gagcagtgag ccgagatcgc accaccgcac tccagcctgg gcaacagagc 19740aagacttcgt
cccaaaaaaa aaaaaaaaaa aaaaaaaagc ctcaacaaac aactacaaac 19800gtgcttgaaa
caaatgaaaa aaaaatcttg gcaaagaaat aaaagatata tattttggcc 19860aggtgcagtg
gctcacagcc tgtaatccct gcactttggg aggctgaggc aggcggatca 19920cctgaggtca
ggagtttgag accagcctga ccaacatgga gaaaccccgt ctctactaaa 19980aatacaaaat
tagccagtca tggtggcaca tgcctgtaat cctagctact caggaggccg 20040aggcaggaga
atcgcttgaa ctcaggaggt ggaggttgcg gtgagccgag atcccgccat 20100tgcacattgc
actccagcct gggcaacaag agcaaaactc catctcaaaa aaatagatac 20160atattttaat
ggaaatttta gaattgaaaa atacagtaac caaattgaat ggaaagacaa 20220catagaatgg
agggggcaga caaaataatc agtgaacttc aacagaaaat aatagaaatt 20280acccaatatg
aagaacagaa agaaaataga ctggccaaaa aataaagaag aaaaaagagg 20340agcagcagga
ggaatgatgg aaaaagagaa aggaaggaag gaagggaagg agggagggaa 20400ggagtgaggg
agaaagtctc aaagacctct gagactaaaa taaaagatct aacacttgtc 20460atcagggtcc
aggaaagaga caaagatggc acagctggaa acgtattcaa aaaataatag 20520ctgaaaactt
cccaaatttg gcaagagaca taaacctata gattcgaaat gctgaacccc 20580aaataaaaag
cccaataaaa tccacaccaa aatacatcat agtcaaactt ctgaaaagac 20640gaaaagagaa
aacgtcttga aagcagtgag tgaaacaaca cttcatgtat aagggaaaaa 20700caattcaagt
aacagatttc ttacagaaat taaggaagcc agaaggaaat gacacaatgg 20760ttttcaagtg
ctgaaagaaa agaagtgtca acacaaaatt ctagattcag taaaaatatc 20820cttcaagaat
caatgggaaa tcaagacagt ctcagataaa gcaaaataag agaatatgtt 20880gccagcagat
ctcccctaaa ggaatggcaa aaggaagatc atgcaacaga ccaaaaaatg 20940atgaaagaag
gaatccagaa acatcaagaa gaaagaaata acatagtaag caaaaataca 21000tgtaattaca
ataaaatttc tatctcctct taagacttct aaattatatt gatggttgaa 21060gcaaaaatta
taaccctgtc tgaagtgctt ctactaaatg tatgcagaga attataaatg 21120gggaaagtat
aggtttctat acctcattga agtggtaaaa tgacaacact gtgaaaagtt 21180acatacacac
acacacgtaa gtatatataa atatatgtgt gtatatgtgt gtgtatatat 21240atatatacat
ataatgtaat acagcaacca ctaacaacac tatacaaaga gataataacc 21300aaaaacaatt
tagataaatt gaaatggaat tctaaaaaat attcaaatac tctacaggaa 21360gacaagacaa
aaagagaaaa aaagaggagg acaaactaaa ttttttaaaa acataaataa 21420aatggtagac
ttaagcccta acttatcaat aattacataa atgtaaatga tctaattata 21480tcaattaaaa
gacagagata gcagagttaa tttaaaaaca tagctataag aaacctgctt 21540tgggctgagt
gcagtgactc acacttgtaa tcccagcact tcgggaggcc aaggcgggtg 21600gatcacctga
ggtcaggagt tccagaccag cctggacaac atggtaatac cccatctcta 21660ctaaaaatac
aaaaaaatta gccaggcatg gtggcacacg cctgtagtcc caactactca 21720ggaggctgcg
acacaagaac tgcttgaacc cgggcagcag aggtagcagt gggccaagat 21780tgcgccactc
cagcctgaac gacagagtga gactccacct cagttgaaaa acaaaaaaga 21840aacctgcttt
aaatatacca acatatgttg gttgaaatta aaagaataaa atatatcatg 21900aaaacattaa
tcaaaagaaa ggagtggcta tattaataac ataaaataga cttcagagaa 21960aagaaaattt
caagagacag gaataaaagg atcaagaaaa gatcctgaaa gaaaagcagg 22020caaatcaatc
attctgcttg gagattcaac accctctctt aacaactgat agaacaacta 22080gacaaaaaaa
tcagcatgga gttgagaaga acttaacacc actgaacaac aggatctaat 22140agacatttac
ggaacactct acccaacaat agcaaaataa acattctttt caagtattca 22200ctgaacatat
ccttagaccc taccctgggc cataaaacaa agctcactag tgattgccga 22260aggcttggat
ggacagtgga agagctgcat ggggagggag aaggtgacag ttaaagagtg 22320taggatttct
ttttgggata atgaaaatgt tccaaaattg attgtggtga tgttggcgca 22380actctacaaa
tataaaaaag gccattgaat tgtacgtttt aagtgggtga aacatatggt 22440atgtggatta
tatctaacgc tttttaaaaa cttaacacat ttcaaagaat agaagtcata 22500cagagtgtgc
tctactggaa tcaaactaga aagaggtaac tggaggataa cgagaaaagc 22560ctccaaatac
ttgaaaactg gacagcacat ttctaaaatc atccgtgggt caaagatatt 22620catttctgat
attcattttt attgtttaat gtatttttaa aaatttctta agggaaataa 22680actgactaaa
aatgaatatg gctgggtgcg gtggctcacg cctgtgatcc cagcactttg 22740ggaggccgag
gctggtggat cacaagatca ggagttcgag accagcctgg ccaagatggt 22800gaaaccccgt
ctcaactaaa aaactacaaa aagtagccaa gcgcagtggc gggagcctgt 22860ggtcccagct
acttgggagg ctgaggtagg agaatcgctt gaacacaggc agcagaggtt 22920gcagtgagcc
aagattgtgc cactgcacgc cagcctgggc gacagagact gcctcaaaaa 22980aaaaaaaaaa
aaaaagaata tcaaaatttg tgggacatag ttaaagcaat gctgagaggg 23040aaatttataa
cactaaatgt ttacattaga aaagagaaaa agtttcaaat caatagtctc 23100cactcccatc
tcaagaacac agaagatgaa gagcaaaata aacccaaagc aagcaaaaga 23160aagaaaatat
aaaaataaat cagtaaaatt gaaaacagaa acacaataaa gaaaatcagt 23220gaaacaaagt
actgattctt cgaaagatta ataaaattga caaacctcta gcaaggctaa 23280caaacaaaaa
agaaagaaga cacggattac cagttattag aatgaaagca taattagaaa 23340caactctaca
cattataaat ttgacaatgt agatgaaatg gactaattac tgaaaaaaca 23400caaattacca
caactcaccc aatatgaaat agataattgg gatagcctga taactactga 23460gaaaattgaa
tttgtaattt taacactctt aaaacagaaa cattaaactt aatattttat 23520aaatattaga
taaggtaatt atacccttcc ttaacaaata aaaacgacaa attattttgc 23580agctaaagag
atgtatgtac tgtgaaaaat atcttcagaa aaatagaact ttgtttgaag 23640aataaggatt
taaaaaatgt ttttaactct caagaagcaa atatctgggc ccagatggtt 23700tcactgaaga
attctaccaa atgtttaatg aagaattacc accaactcta catagcatct 23760ttgagaaaac
tgaagagaag ggaacatctc ccagttcatt ttatgaagtg ggtgttactc 23820tgatactaga
actgtataag gacagctact cttgacacac tgcctatggg tagctctgct 23880ctgcaggaac
agtcagaaaa aaaaaaaaaa gaagcactgg acaagggcag tataaaaaaa 23940gaaaactggg
ccaggtgcag tggctcacac ctgtaatctc agcactttgg gaggctgacg 24000ctggtggatc
acctgaggtc aggagtttga gactagcctg gccaacatgg taaaaccctg 24060tctctactaa
aatacaaaaa ttagccaggc agggtggtgg ggaaaataaa aaggaaaaaa 24120aaacaaaaat
aaactgcaga ccaatatcct tcatgagtat agacacaaaa ctccttaaac 24180tccttaacaa
aatattagca agtagaagca atatataaaa ataattatac accatgatca 24240agtgggactt
attccagaaa cgcaagtctg gttcaacatt tgaaaacaag gtaacccact 24300atatgaacgt
actaaagagg aaaactacat aatcacatca atcaatgcag aaaaaagcat 24360ttgccaaaat
ccaatatcca ttcatgatac tctaataaga aaaataagaa taaaggggaa 24420attccttgac
ttgataaagc ttacaaaaga ctacaaaagc ttacagctaa cctatactta 24480atggtgaaaa
actaaatgct ttcccctacg atcaggaaca aagcaaggat gttcactctc 24540attgctctta
tttaacatag ccctgaagtt ctaacttgtg caaaacgata agaaagggaa 24600atgaaagacc
tgcagattgg caaagaagaa ataaaactgt tcctgtttgc agatgacatg 24660attgtctcat
agaaaatgta aagcaactag gggtaggggg gcagtggaga cacgctggtc 24720aaaggatacc
aaatttcagt taggaggagt aagttcaaga tacctattgc acaacatggt 24780aactatactt
aatatattgt attcttgaaa atactaaaag agtgggtgtt aagcgttctc 24840accacaaaaa
tgataactat gtgaagtaat gcatacgtta attagcacaa cgtatattac 24900tccaaaacat
catgttgtac atgataaata cacacaattt tatctgtcag tttaaaaaca 24960catgattttg
gccaggcaca gtggctcata cctgtaatcc cagcatttta ggaggctgag 25020gcgagcagaa
aacttgaggt cgggagtttg agaccagaat ggtcaacata gtgaaatccc 25080gtctccacta
ataatacaaa aattagcagg atgtggtggc gtgcacctgt agacccagct 25140acttgggagg
ctgaggcacg agaattgctt gaacaaggga ggcagaggtt gcagtgagct 25200gggtgccact
gcattccagc ctggtgacag agtgagactc catctcaaaa aaaataaaat 25260aaagcatgac
ttttcttaaa tgcaaagcag ccaagcgcag tggctcatgc ctgtaatccc 25320accactttgg
gaggccgagg caggcagatc acaaggtcag gagtttgaga ccagcctgac 25380caacatggtg
aaaccccatc tctactaaaa aatatataaa ttagccaggc atgtgtagtc 25440tcagctactc
aggaggctga ggcaggagaa tcacttgaac ccggaggcag aggttgcagt 25500gttgagccac
cgcactccag cctgggtgag agaacgagac tccgtctcaa aaaaaaaaag 25560caaaataacc
taattttaaa aacactaaaa ctactaagtg aattcagtaa gtctttagga 25620ttcaggatat
atgatgaaca tacaaaaatc aattgagctg gacaaaggag gattgtttta 25680ggtcagtagt
ttgaggctgt aatgcacaat gattgtgcct gtgaatagct gctgtgctcc 25740agcctgagca
gcataatgag accacatctc tatttaaaaa aaaaaaaatt gtatctctat 25800gtactagcaa
taagcacatg ggtactaaaa ttaaaaacat aataaatact gtttttaatt 25860gcctgaaaaa
aatgaaatac ttacatataa atctaacaaa atgtgcagga cttgtgtgct 25920gaaaactaca
aaacgctgat aaaagaaatc aaagaagact taaatagcgt gaaatatacc 25980atgcttatag
gttggaaaac ttaatatagt aaagatgcca attttatcca aattattaca 26040caggataaca
ttattactac caaaatccca gaaaaatttt acatagatat agacaagatc 26100atacaaaaat
gtatacggaa atatgcaaag gaactagagt agctaaaaca aatttgaaaa 26160agaaaaataa
agtgggaaga atcagtctat ccagtttcaa gacttacata gctacagtaa 26220tcaagactgt
gatattgaca gagggacagc tatagatcaa tgcaaccaaa tagagaacta 26280agaaagaagc
acacacaaat atgcccaaat gatttctgac aaaggtgtta aaacacttca 26340acgggggaag
atatgtctct cattaaaggg tgtagagtca ttgcacatct ataggcaaaa 26400agatgaacct
gaacctcaca ccctacagaa aaattaactc aaaatgactc aaggactaaa 26460cataagatat
acatctataa aacatttaga aaaaggccac gcacggtggc tcacgctcgt 26520aatcccagca
ctttgggagg ccaaggcagg tggatcacct aaggtcagga gtttgagacc 26580agccggatca
acatggagaa gccccatctc tactaaaaat acaaaattag ctggacgtgg 26640tggcacatgc
ctgtaatccc agctacttgg gaggctgagg catgagaatc gcttgaaccc 26700ggggggcaga
ggttgcggtg agccaagatc acaccattgc actccagcct gggcaacaag 26760agcaaaactc
caactcaaaa aaaaaaaaaa aaaggaaaaa tagaaaatct ttgggatgta 26820aggcgaggta
aagaattctt acacttgatg ccaaactaag atctataagg ccagtcgtgg 26880tggctcatgc
ctgtaattcc agcactttgg tcaactagat gaaaggtata tgggaattca 26940ctgtattatt
ctttcaactt ttctgtaggt ttgacatttt tttagtaaaa aattggggga 27000aagacctgac
gcagtggctc acacctgtaa tcccagcact ttgggaggcc ggggcaggtg 27060gatcacacgg
tcaggagttc gagaccagcc tggccaacat ggtgaaaccc cgtctctacc 27120aaaaatataa
aaaattagcc gggtgtcatg gtgcatgcct gtaatcccag ctactgagga 27180ggctgaggca
ggagaatcac ttgaacctgg gaggtggaag ttgcagtgag ccgagattgt 27240gccactgcac
tccagccttg ggtgacagag cgagactccg tctcaaaaga aaaaaaaaaa 27300aaagaatatc
aaacgcttac tttagaaact atttaaagga gccagaattt aattgtatta 27360gtatttagag
caatttttat gctccatggc attgttaaat agagcaacca gctaacaatt 27420agtggagttc
aacagctgtt aaatttgcta actgtttagg aagagagccc tatcaatatc 27480actgtcattt
gaggctgaca ataagcacac ccaaagctgt acctccttga ggagcaacat 27540aaggggttta
accctgttag ggtgttaatg gtttggatat ggtttgtttg gccccaccga 27600gtctcatgtt
gaaatttgtt ccccagtact ggaggtgggg ccttattgga aggtgtctga 27660gtcatggggg
tggcatatcc ctcctgaatg gtttggtgcc attcttgcag gaatgagtga 27720gttcttactc
ttagttccca caacaactgg ttattaaaaa cagcctggca ctttccccca 27780tctctcgctt
cctctctcac catgtgatct cactggttcc ccttcccttt atgcaatgag 27840tggaagcagc
ctgaagccct cgccagaagc agatagtgat gccatgcttc ttgtacagcc 27900tacaaaacca
tgagcccaat aaaccttttt tctttataaa ttatccagcc tcaggtattc 27960ctttatagca
agacaaatga accaagacag ggggaaatca acttcattaa aataatctat 28020gcagtcacta
aacaaataag aacaagaggc tccagaagtg ggaagccaat acccagagtt 28080cctacaatac
agtatctgaa aagtccagtt tccaaccaaa aaatatatat atacaggccg 28140gacatggtag
cttatgtctg taatcccagc actttgggat gctgaggcgg gcagatcacc 28200ctaggtcagg
agttcgagac cagcctggcc aatatggcaa aaccccgtct ctactaaaaa 28260tacaaaaatt
agccaggcat ggtggtggat gcctgtaatc ccagctactc gggaggctga 28320ggcagggaat
cacttgaacc caggaggcag aggttgcagt gagccgagat cacgccactg 28380aactccagcc
tgggcaacaa agtgagactc cacctcaaaa aaaaaaaaaa tatacatata 28440tatatgtgtg
tgtgtgtgtg tgcgcgcgtg tgtgtatata cacatacaca tatatacata 28500tatacagaca
cacatatata tatgaagcat gaaaagaaac aaggaagtat gaaccatact 28560ttctgtggtt
atgataggat ggggtatcac gggggaagta gacaagggaa actgcaagtg 28620agagcaaaca
gttatcagat ttaacagaaa aagactttgg agtaaccatt ataaatatgt 28680ccacagaatt
aaagaaaagc gtgattaaaa aaggaaagga aagtatcata acaatattac 28740tccaaataga
gaatatcaat aaaggcatag aaattataaa atataataca atggaaattc 28800cggagttgaa
aggtagaata actaaaattt aaaattcact agagaaggtt caacactata 28860tttgaactgg
cagaagaaaa atttagtgag acaaatatac ttcaatagac attattcaaa 28920tgaaaaataa
aaagaaaaaa gaatgaagaa aaataaacag aatctcagca aaatgtggca 28980caccattaat
cacattaaca tatgcatact gagagtaccg gaagcagatg agaaagagga 29040agaaaaaata
ttcaaatgat ggccagtaac ttcctagatt tttgttttaa agcaataacc 29100tatacaatca
agaaactcaa tgaattccaa gtaggataaa tacaaaaaga accacaaaca 29160gatacaccat
ggtaaaaatg ctgtaagtca aaaacagaga aaatattgaa agcagctaga 29220ggaaaactta
taagagaacc tcacttacaa aagaacatca cttataaaag aaccacaata 29280atagaaacag
ttgacctctc atcagaaaca atgaatgata acatatttga agtgctcaaa 29340gaaaaaaaat
aaagattcct atatacgaca aagctgtctt tcaaaaatat acatccaaaa 29400ggattgaaac
cagggtcttg aagagttatt tgtacatcca tgttcatagc agcattattc 29460acaatagcca
aaaggtagaa gcaacccaag ggtccatcga caaataaata aaatgtggta 29520tatgtataca
caatggaatt tattcagtat taaaaaggaa tgaaattctg acacatgcta 29580caacatggct
aaaccttgag aacactatgc taagtgaaat aagccagcca caaaaggaca 29640aataccatat
tacttcactt gtatgaaata cctagggtag tcaaattcag agatagaaag 29700taaaacagtg
gttgccaagg gctgagggag ggagtaacgt ggagttattg ttgaatgggt 29760acagaatttc
agttttgcaa gataaaaaga gttctggaga cagatggtgg tgagggtggt 29820acaacaatac
aaatatactt tatactactg aacagtatac ttaaaaatga ttaacatggt 29880gaaaccccgt
ctctactaaa aatacaaaaa aattagctgg gtgtggtggc gggcacctgt 29940aatcccagct
acttgggagg ctgaggcagc agaattgctt gaaaccagaa ggcggaggtt 30000gcagtgagct
gagattgcgc caccgcactc tagcctgggc aataagagca aaactccgtc 30060tcaaaaaata
aaaaataaaa aaaatttaaa aatgattaag caggaggcca ggcacggtgg 30120ctcacaccta
taatgccagc actttgggag gccgaggcag gcgatcactt gagaccagga 30180gtttgagacc
agcctggcca acatggcaaa accctgtctc tgctaaaaat acaaaaatta 30240gccaggcatg
gtggcatata cttataatcc cagctactgg tgagactgag acacgagaat 30300tgcttgaacc
caggaggcag agattgcagt gagtcgagat cgcgccactg aattccagcc 30360tgggcgacag
agcaagattc tgtctcgaaa aaacaaaaac aaaaacaaaa agcaaaacca 30420aaaaataatt
aagcaggaaa cgagattgct gctgaggagg agaaagatgt gcaggaccaa 30480ggctcatgag
agcacaaaac ttttcaaaaa atgtttaatg attaaaatgg taaattttat 30540atgtatctta
ccacaaaaaa aagggctggg gggcaggaaa tgaaggtgaa ataaagacat 30600cccagagaaa
caaaagtaga gaatttgttg ccttagaaga aacaccacag gaagttcttc 30660aggctgaaaa
caagtgaccc cagagggtaa tctgaattct cacagaaaat tgaagcatag 30720cagtaaaggt
tattctgtaa ctatgacact aacaatgcat attttttcct ttcttctctg 30780aaatgattta
aaaagcaatt gcataaaata ttatatataa agcctattgt tgaacctata 30840acatatatag
aaatatactt gtaatatatt tgcaaataac tgcacaaaag agagttggaa 30900caaagctgtt
actaggctaa agaaattact acagatagta aagtaatata acagggaact 30960taaaaataaa
attttaaaaa atttaaaaat aataattaca acaataatat ggttgggttt 31020gtaatattaa
tagacataat acaaaaatac cacaaaaagg gaagaagaca atagaactac 31080ataggaataa
cattttggta tctaactaga attaaattat aaatatgaag tatattctgg 31140taagttaaga
cacacatgtt aaaccctaga tactaaaaag taactcacat aaatacagta 31200aaaaaataaa
taaaataatt aaaatgtttg tattagtttc ctcagggtac agtaacaaac 31260taccacaaat
tgagtggctt aacacaactt aaatgtattt tctcccagtt ctggaggcta 31320aacacctgca
atcaaggtga gtacagggcc atgctccctg tgaaggctct aggaaagaat 31380cctcccttgt
ctcttccagc ttccagtggt tctcagtaac cctaagtgct ccttggcttg 31440tagctatatc
attcctagca accagaaaga agaaaataat aaagattatg gcaaaaaata 31500atgaaatcaa
aaggagaaaa atggaaaaaa ataaataaaa ccaaaagcta gttctttgaa 31560aagatcaacc
aagttaacaa accttttaac tagactgaca aaaaggaggt aagactcaaa 31620ttactagaat
cagaaataaa agaggggaca ttactaatga gggattagaa aagaatacta 31680cgaacaaatg
tgtgccaaca aattagaaaa cttagatgaa atggacaggt tcctaggaca 31740acatcaacta
ccaaaattta ctcaagaaga aagagacaat ttgaatgagc tataacaagg 31800gaagagactg
aattgacaac caagaaacta tccacaaaga aaatcccagg cccagaagat 31860ttcactgtga
aattctttca aacttataaa tataaattaa catcagttct tcacaaactc 31920ctccaaaaaa
aagaacagat ctctatttac aggcgatacg atctttagaa aatcctaagg 31980gaactactaa
gacactatga taactgataa acaagttcag caaggctgca ggatagaaaa 32040ccaatataca
aaaatctatt atatttctat acacttgcag tgaacaaccc aaaaatgaga 32100ttaagaaaat
aattcaattt acaataacat caaaaagaat aaaaacactc aaaaataaat 32160ttattcaagt
aagtgcaaaa cttatactct agaagctaca aaacactgtt aaaagaaatt 32220aaaggtttac
ataaatgaaa aactatccca tgttcatgga tcaaaagact tattactggc 32280aatgctctcc
aaattgatct ataaattcaa caaaatcctt atcaaaatcc cagatgaggc 32340tgggggtggc
ggttcatgcc tgtaatccca gcactttggg aggctgaggc acgcagatta 32400cctgaggtcg
ggagctcgag atcagcctga ccaacatgga gaaaccctat ctcttctaaa 32460aatacaaaat
tagtcaggcg tggtggcaca tgcctataat cccagctact cgggaagctg 32520aggcaggaga
atcgcttgaa cccaggaggc agaggttgca gtgagccaag atcgtgccat 32580tgcactccag
cctgggcaac aagagcaaaa ttccatctca aaaaaaaaaa aaaaaaaatc 32640ccagatgact
tcactgttga aattgaaaag attattctaa aattcacatg gaattgcaag 32700accttgagaa
tagccaaaac aaacttgaaa aacacgaaca aaatatagga tgactcactt 32760gccaattgca
aatgttacga cacagcaaca gtaatcaaga ctgtgtggta ctggcaaaag 32820acacatacat
acatacatat caatggaata taattgagag tacagaaaca agcctaaaca 32880tctatggtaa
gtgcttttct atttttttct tttttttttt cttttttgta gagatagaat 32940ctcaccatgt
tgcccaggct ggtcttcaac ttctgggctc aagcaatcct cccactgtgg 33000cctcccaaag
tgctgggata actggcatga gccaccacat ccagcccaga tgattttcaa 33060aaaagtcaac
aagaccattc ttttcaacaa ataggtctgg gatgatcaga tagtcacatg 33120aaaaaaaaaa
tgaagttgga ccctccatca cactaaagtg ctgcgattat aggcatcagc 33180caccacatcc
agcccaaatg attttcaaaa aggtcaacaa gaccattctt ttcaacaaat 33240aggtctggga
taatcagata gtcacatgaa aaaaaaaatg aagttggacc ctccatcaca 33300ccatatgcaa
aaattaattc aaaaatgaat tgatgactta aacgtaagag ttacgactgt 33360aaaactctta
gaaggaaaca tacgggtaaa tcttaaagac gttaggtttg acaaagaatt 33420cttagacatg
acaccaaaag catgaccaac taaggtaaaa tagggtaaat tgtacctacc 33480aaaatgaaaa
acctttgtgc tggaaaggac accatcaaga aatggaaagc caaaatagcc 33540aaggcaatat
taagcaaaaa gaacaaagct ggaggcatca tactacctga cttcaaagca 33600acagtaacca
aaacagcatg gtactagtag aaaaacagac acatagacca atggaacaga 33660ataaagaacc
caaaaataaa tccacatatt tatagtcaac tgatttttga caatgacacc 33720ccttcaataa
atgatactag gaaaactgga tatcgatatg cagaagaata aaactagacc 33780cctatctctc
accatataga aaaatcaact cagactgaat taaagacttg aatgtaagac 33840ccaaaactat
aaaactactg gtagaaaaca taaggaaaaa cgcttcagga cattggtcca 33900ggcaaagatc
ttatggctaa aacctcaaaa acacaggcaa caaaaacaaa aatggaaaaa 33960tagcacttta
ttaaactaaa aagctcctgc acagcaaagg aaacaacaga atgaaaagac 34020aacctgtaga
atgggagaaa atatttgcaa actatccatc catcaaggga ctagtatcca 34080gaacacacaa
gtgactaaaa caactcaaca gcaaaaaagc aaataatctg gtttttatat 34140gggcaaaaga
tctgaataaa cattctcaaa ggaagacata caaatgtcac tatcattctg 34200ccagtaccac
actgtcttga ttacttgtta gtgtataaat ttttaaattg ggaagtgtga 34260gtcatcctac
actttgttct tgtttttcaa gtttgttttg gctattctgg gagccttgca 34320agtataaaat
agccaacaag tatgaaaaaa tgctcaccat cactaatcat cagagaaata 34380aaaatcaaga
ccactatgag atatcctctc actccagtta gaatggctac tatcaaaaag 34440acaaaatata
atggatgctg gcaaagattt ggagaaaggg gaactcctat acactgtggg 34500tagggatgca
aattggtaat ggccattatg gaaaataata ctgaggtttt tcaaaaaact 34560gaaaatagaa
ctaccatatg atccagcaac cctactactg ggtatttatc caaaggaaag 34620aagtcagtat
actgaagaaa tatatgcact ctcatgttaa ttgcaacact gttcacaaca 34680gccaagacag
ggaataaatc taaatgtgca tcaacagatg aatggataaa gaaaatgtgg 34740catatacact
caatagaata ctattcagcc attaaagaag aatgaaatcc tgtcatccca 34800gcaacatgga
tgaacctgga ggacattata tttaatgaaa taagtaaagc acaaaaagat 34860aaacagtaca
tgttctcact cagacatggg tgctaaaaag aaaatggggt cacagaatta 34920gaaggggagg
cttgggaaaa gttaatggat aaaaatttac agctatgtaa gaagaataag 34980ttttagtgtt
ctatagaact gtagggcgag tatagttacc aataacttat tgtacatgtt 35040caaaaagcta
gaagagattt tggatgttcc cagcacaaag gaatgataaa tgtttgtgat 35100gatggatatc
ctaattaccc tgattcaatc attacacatt gcatacatgt atcaaattat 35160cactctgtac
ctcataaata tgtataatta ttacgtcaac aaaaaaagga aaaaaaagaa 35220aattaagaca
acccacataa tggaagaaat aaaatatctg caaattatat atatctgata 35280aatatttaat
atttataata tataaagaac tcctacaact caagaacaac aacaaaacaa 35340cccaattcaa
aaatgggtaa aagccttgaa tatacactta tctaaagact atatacaatt 35400ggccaataaa
gacacgaaaa gatgctcaac atcactagtc atcagggaaa tataaatcaa 35460aaccacaatg
tagaatgtag acaccacttc atatgcacta ggatggctag aataaaaagg 35520taataacaaa
tgttggtaag gatgtgaaaa aatcagaaac ctcattcgct gctgttggga 35580atgtaaagtg
atgcagccac tttggaaaac agtctggcag ctcctcaaat tattaaatac 35640agagttaccg
tatgacccag gaatattcct cctgggtcta taaccaaaaa aatgaaaaca 35700tatatccaca
taaaaacttg tacatgggca tttatagcaa cattattcat aacagcaaag 35760gtggtaagaa
cccatatgcc catcatctga tgaacaggta aataacatgc ggtattatcc 35820atacactaga
atattatctg cccatacaag gagtgacatc cagctacatg ctacaaggat 35880gaatctcgga
aaccttatgc taagtgaaag aagccagtca caaatgacca cagattatga 35940ttccatgcat
cggaaatgac cagaataggg aaatctatag agacagaaag tagattagtg 36000gttgggtggg
gctgggagga caggtagtac actactttcc cagaactact ggaacaaagt 36060accacaaact
ggggagctta aacatagaaa ttgatttcct cacagttctg gagactagga 36120ctctgagatc
aaggtgtcag cagagctggt tctttctgag ggccctgagg caaggctctg 36180tcccaggcct
ctctccttgg ctggcaggtg gccatcttct ccctgcgtct tcacatcatc 36240ttttctctgt
gtgtgcccat gtccaaattt tgattggctc attctgggtc atggccaatt 36300gctatgcaca
aagtgaagtc tacttccaaa agaagggaag agggaacact gactaggcta 36360aacttatagt
cattttaatg tccgcttttc ctatgagatt gtgaacacac agaagtaggg 36420tttttatcta
cattgtgcaa agtttaataa gaaaaataga attcaagaga agcagttcaa 36480tagcaggaat
ttaatatggg aactaattac aaggtttagg gcaggactaa aaagccagtt 36540gggatggtga
gccaacccag agattagcaa cagtgggacc ccatctacct accacccatg 36600aagctggaag
gataaaggag gggctattat cagagtccac aagccagtgt cagagtcctt 36660ggctggagct
gggaccaccc tagagacact gtgcaaagca gaaaacaagg gggaaaaacc 36720ctgacttctc
ccttcctccc acctttcaat ctcccactag tgcttcctac tagccatact 36780tggccagaga
cagtgacaag gaacactgca aaatgaagtt tgtaggaatc atctccctct 36840gagacagaga
aatatggaag ggtagaaaat gaatcagagg ataaagagaa aaaaccctga 36900gtactatctt
atttatcttt gtatctccag tgcctaatct gtctctcaaa aaaggaaagc 36960aattgagaga
aactgaaaac tccaattgaa atgaaagaat ggagaattac tggactagaa 37020gagaagagaa
aaatttattc cgcatagagt aaacaagaat ggattcacaa aggacgtgat 37080gaatgaaaag
ctataatcag caaagatttg ccagagaaat taaaaagtgg taaactcagc 37140cacgctgtac
aacctgaagg cacaatgcat gaaaacgttt caagaaatga caagatttga 37200agtcaaattc
taagtgcttt tccagaatct ctcaagacga ttatatagct accccatttt 37260attaaataaa
atggaaactt actaaacttt ccccttgtat taaactaaca tatgtcctaa 37320tagcaaacga
ttctggaatt cctagagtaa aatatatttc gtcaaagtgt attgctcttt 37380taatattctg
ctgacctcct tttgctattt aggatatttg tatacacatc acacgtaaat 37440ttggtctata
gtttacatct acgggcttat actgttcttt ttttcatttt tttaaaattt 37500ccaaccccca
gtatccatat actgctctct atcagggtta ttttaacttt gtaaaatcag 37560ctgagatgct
ttccatgttt ttttttttta ttttctgcca catttgaata gcataggagt 37620taccaccatc
aaccttggat tatttaagca ttcacgattc cacgtgtgga ttttttattc 37680agagtctttc
ttgtcattcc tgctatcagc acagaaccca atctcagctt tccagctata 37740ctctcacccc
atggaatttg cagatgaagt tcaaaaggac ctttgcatta tcctgcctcg 37800ccctcttccc
ccttcattta gacatcacct tcttctagaa cgtcttacct gacatgccct 37860gctcccaacc
cctgctgccc aattgtgtgc tctcccgtgt cctggcctgc catcctcttt 37920agtaattgcc
tgctccctca tctgtctccc cacccagaca ttaagctgaa tagactggat 37980ttgtgtcttg
tccatcacta taatctcagc acctagtacc tagtaggtac ttaccatgta 38040ttcattagca
aaatgttatg tataaccttg caccttaaaa acaagagaag gaagacaaaa 38100ttaagtctta
agactatggt ttagaacatg gatcagaaac tacagtctgc agcccaaatc 38160cagaccaaat
gaagagacca tgttcattta catacaacct atagcagctt tcacactaca 38220ggagcagagc
taagtagttc caagggaaca cacggccctg caaagcctaa aatatttact 38280ctatagctct
tcacagaaaa agttttcaga tccctcgttt agaactcttg ttcatatgca 38340atttcactaa
accatagttt tttgggtttg tttggttttt tttggcaaaa aggaatgagc 38400cgatccagaa
aaggttgaaa agaatgaatc attactgctg aaagaatgtg cacacagtcc 38460gtcagtattc
tgctgccatg ctgacaccca tccaatagtg tcatgagatg cagcagctac 38520tactgtgttc
tcaatgccga gtccacccac tccataacca tgtccaagca atcttgggaa 38580catcatcacc
atgcttgttt atccttaagg tattgcctca catacagcag tggctggtca 38640taaagtcaaa
tgacactagt ggccaggagg tcaagagaat gagtgaggac aggtgggtag 38700gcagcccagg
ccctagcaac agcaggagct cacccctcag tcactctagc caggactgaa 38760atacttttca
ccctttcaag agagactagg aatctggatt tttatgtgaa atatcttgat 38820tactaaatgt
tgtcaacaga catgtcaaaa ggtaaaacta agtaagttca tggggcagat 38880tgactattca
ggttatagaa ttaaggattc ttatccaaca cagataccaa ccaaaaagct 38940gacgtataac
atattaggag aaactatgtg cactgtcgaa acatcaacaa ggggctaatg 39000tctaaaatag
tctatattgg attccagttg aaacatgggg aaaggacatg aacaggcaac 39060ttatgtcaat
ggaaactcaa aaagataaca agcatatata aaagcattct caaattcagt 39120agtaaacaga
cagatgcaaa taaaaagagg gaaactgctg ccgggcacag tggctcacac 39180ctgtaatccc
agcactttgg gaggccgagg cgggcggatc atgaagtcag gagatcgaga 39240ccatcctggc
taacatggtg aaaccccgtc tctactgaaa acacaaaaaa ttagccaggc 39300gtagtggtgg
gcaccagtag tcccagctac tcaggaggtt gaggcaggag aatggcatga 39360acccaggagg
cggagattgc agtgagccga gaccatgcca ctgcactcca gcctgggcga 39420ctgagtgaaa
ctccatctca aaaaatataa taataattat aattataata ataataaata 39480gtaaataaat
aaaaagagag agactgctaa agtctagaaa gttgaatgat gccaagcgca 39540tgcaaagatc
agggccttgg gatggccggg tgcagtggct cacgcctgta atcccaccac 39600tttgggaggc
caaggcgggc ggatcatgag gtcaagagat caagaccatc ctggccgaca 39660cagtgaaacc
cggtctctac taaaagtaca aaaaaatata tatatatata tatattatta 39720tattatatat
atatatatca gagccttggg aatccttgtg tgctgctggg gaaggtagtg 39780gtgcagccac
ccttgacagc aatctggcag tacttggtta tattaagtat aggcacacac 39840cacgaccagg
cagtcctact cctgggtcta aatcccaaag aattctcaca caagtccata 39900aggagacatg
tacgaggctc attcagcatt actgggagtg ggaatcaacc tgggtgtcca 39960tctacaggag
acgagatgga caaaatgtgg tggatattaa gaccagaatc accaagtaac 40020agagatgggt
ggtgagtgac aatcctaaga tacagaataa aggctagaac atgatgccat 40080tcatgtaaat
taaaaataga tgcacacaaa gcagtatacg cgtgaccctt gaatagcaca 40140ggtttgaact
gcctgtgtcc acttacatgt ggattttctt ccacttctgc tacccccaag 40200acagcaagac
caacccctct tcttcctcct ccccctcagc ctactcaaca tgaagatgac 40260aaggatgaag
acttttatga taatccaatt ccaaggaact aatgaaaagt atattttctc 40320ttccttatga
ttttctttat ctctagctta cattattcta agaatatggt acataataca 40380catcacacgc
aaaataaatg ttaattgact gtttatatta tgggtaaggc ttccactcaa 40440cagtaggctg
tcagtagtta agttttggga gtcaaaagtt atacacagat tttcaactgt 40500gcaggcaatc
agttcccctg accccctcat tgttcacggg tcaactgtat atacacaaaa 40560gtattatatg
aacctcatta gaatagctgt ctatagggag aagagaatga gagtgggata 40620aaacggaatg
aacaaataaa ccaacaaatg cattaacaag caaaacaaca gaggggcttg 40680catgggccag
tgatgataaa gggctaagaa tgagaatata attaattcaa ttcctcacac 40740ctgaggtcta
aaaccaagga aagggagggc caggcgtgga ggctcacgcc tgtaatccca 40800gcactttggg
aggctgaggc gggcggatca caagattagg agtttgagat cagcctggcc 40860aacacagtga
aagcccatct ctacaaaaaa tacaagaatt acccaggtgt ggtggcacat 40920gcctgtagtt
agctactctg gaggctgagg caggagaatc acttgaaccc aggaggcgga 40980ggttgcaggg
agccgagatc acaccattgc actccagcct gggtgacaga gtaagactct 41040gtctcaaaaa
aataaaaaaa ataaaaaaac agagaaaggg aggaaactag atccaggctg 41100actagataca
gcctttagag ttagaaaaga tgatttgaca atctaagccc acactcagat 41160tgaatgaaat
tgaaaagcct ttcaaactaa aacatttaat tacaccatct gctgcagaca 41220gaactcagac
aactcaaaca ggtaatgtca gcgtggtgtt ttatatcacc accctcaaca 41280cagaataaaa
atcagctgca tgtgaagcag tgactagaat gaagaaaagg ctgcttctta 41340cttccttcta
gtggttcttt ccgaaaacat taataggcac cagctctatg catgtcaccc 41400tgcagggaga
catggggtat ataactatga cttactgttc attcctcaag gaattcccaa 41460tcttgtggaa
gattatacac aatgaggcaa caaaaactat ccaataaaac cacggaaaag 41520aagccagtga
caaagaagcc agtgatgaaa ggccctgtga gcagagctga tggccatttg 41580gggaagaaag
accaacatgg atgggggtga tcagggtggc tccgtgggaa agctggaaga 41640gaagtggcag
atctctgagc tggatgatgg gccactacca tctgtatatg gctaattaaa 41700gaccatgtgt
ggatttttta ttcagctctt tcgtgtcatt cctgctatca gcacagaacc 41760caatctcaac
tttccagcta tattgagcta aacttctcac ctcatggaat ttgcagataa 41820agttcaaaag
gatccttgcc ttttcaaaat aattttgaat ggttgagtag tccctctgtg 41880ctctctcact
gacaccctct caaggctgct gagcacgtgc catgctatgg ctttctccaa 41940catcaggaaa
tgttctccac tcagtttcac cttaatacaa atgtgttctc tcttcagaga 42000aggcaaaaaa
attcatgacc atctgactgg gagaagtcat ttctaggtaa agtgtccatc 42060tttttctgag
gaacacagga ggaaaatctt acagaaaaga gttaacacag caggcctaag 42120actgcttttt
aaaataaata aataaataaa taaataaata aataaataaa taaataaata 42180aataaatgaa
tgatagggtc ttctgtattg gccaggctag tctcaaattc ctggcttcaa 42240gagatcctcc
caccttggtc tcccacagtg ttgggattat agacatgagc cattgtgctt 42300ggcccaagac
tgttattctt aaaaagtctc ataaaaagca tggttaatcc ttggctggca 42360cctgggaact
tagatttcag aagggttccc accatccaac ctggaaagag ggactcactg 42420tgcctaaatt
attgtgtggt ttatgctgaa ctcctgcttt tcttcaggta gcgtggaatg 42480tggtatgtgc
tgggcaaagg gggcctgcat gaccagcccc caataaaaac cctgggtgtt 42540gggtctctag
tgagtttccc tggtagacag catttcacat gcgttgtcac agctccttcc 42600tcggggagtt
aagcacatac atcctgtgtg actgcactgg gagaggatgc ttggaagctt 42660gtgcctggct
tcctttggac ttggccccat gcacctttcc ctttgctgat tgtgctttgt 42720atcctttcac
tgtaataaat tacagccgtg agtacaccac atgctgagtc ttccaagtga 42780accaccagat
ctgagcatgg tcctgggggc ccccaacaca gaaataaatt ataaaagacc 42840aaggactggg
catggtggcc catgccggta atctcagcgc tttgggaggc cgaggcagga 42900ggaccagtta
agcccaaaag ttcaaagtta cagtgaccta tgactgcgcc aatgcactct 42960aacctgggag
acagagcaag accctgtccc caaaacaata aactaaacac atacttctgc 43020cttccaagtg
tcttaaaatt caatggaatg gtagaaacat ttttaaaaca ctaaatcaaa 43080agaaacctgg
aaaacaagag tgccgatggc caactaaaat gtctaggaaa tttctgaaaa 43140gtaaaaagta
ctcagaacca gattacctga gcaaaccata gcccaataca agcttgggag 43200gaggctgtta
tgcagaagga aatggtaaca ggtttccagg aacagacttg taacagcaga 43260tagaacagca
gaggtagaac ctgacaaggt gattacctgg ggaactgcag tctgaatgac 43320caggactgtt
ggacccttcc cctcacatgg aatacacacg ccactcagca gcacaccaca 43380gctcttcaac
aatcacagga ggcacgctac gcctagtaag acaggaaaaa aggaattctc 43440aaacttcgaa
gatgaacaca taaagaatca ccaagttttt attcagtatg atgaaacagg 43500gacactgaat
caacagaaca caaacccaag caaagataat tactagagca catagaagaa 43560attattagat
attcttggga agacctaagg ggacattata aagagcaagc agttggtatg 43620tgacgatctt
tgtgatatac caagaaataa aaacacagga tgaagaccag atagagaata 43680atgctactat
ttgtgcaaaa aaggagaaat ggagaatctg attcatattt gcttgtattt 43740gcatgaagaa
actttggaag gtacataagt aactaacaac aatggttacc tacttgtaag 43800gcgagagaag
taagaggaca ggaatggtgg gaacaccttt tgtgtccgga attggtgggt 43860tcttggtctg
acttggagaa tgaagccgtg gaccctcgcg gtgagcgtaa cagttcttaa 43920aggcggtgtg
tctggagttt gttccttctg atgtttggat gtgttcggag tttcttcctt 43980ctggtgggtt
cgtagtctcg ctgactcagg agtgaagctg cagaccttcg cggcgagtgt 44040tacagctctt
aagggggcgc atctagagtt gttcgttcct cctggtgagt tcgtggtctc 44100gctagcttca
ggagtgaagc tgcagacctt cgaggtgtgt gttgcagctc atatagacag 44160tgcagaccca
aagagtgagc agtaataaga acgcattcca aacatcaaaa ggacaaacct 44220tcagcagcgc
ggaatgcgac cgcagcacgt taccactctt ggctcgggca gcctgctttt 44280attctcttat
ctggccacac ccatatcctg ctgattggtc cattttacag agagccgact 44340gctccatttt
acagagaacc gattggtcca tttttcagag agctgattgg tccattttga 44400cagagtgctg
attggtgcgt ttacaatccc tgagctagac acagggtgct gactggtgta 44460tttacaatcc
cttagctaga cataaaggtt ctcaagtccc caccagactc aggagcccag 44520ctggcttcac
ccagtggatc cggcatcagt gccacaggtg gagctgcctg ccagtcccgc 44580gccctgcgcc
cgcactcctc agccctctgg tggtcgatgg gactgggcgc cgtggagcag 44640ggggtggtgc
tgtcagggag gctcgggccg cacaggagcc caggaggtgg gggtggctca 44700ggcatggcgg
gccgcaggtc atgagcgctg ccccgcaggg aggcagctaa ggcccagcga 44760gaaatcgggc
acagcagctg ctggcccagg tgctaagccc ctcactgcct ggggccgttg 44820gggccggctg
gccggccgct cccagtgcgg ggcccgccaa gcccacgccc accgggaact 44880cacgctggcc
cgcaagcacc gcgtacagcc ccggttcccg cccgcgcctc tccctccaca 44940cctccctgca
aagctgaggg agctggctcc agccttggcc agcccagaaa ggggctccca 45000cagtgcagcg
gtgggctgaa gggctcctca agcgcggcca gagtgggcac taaggctgag 45060gaggcaccga
gagcgagcga ggactgccag cacgctgtca cctctcactt tcatttatgc 45120ctttttaata
cagtctggtt ttgaacactg attatcttac ctattttttt tttttttttt 45180tgagatggag
tcgctctctg tcgcccagac tggagtgcag tggtgccatc ctggctcact 45240gcaagctccg
cctcccgggt tcacaccatt ctcctgcctc aacctcctga gtagctggga 45300ctacaggcaa
tcgccaccac gcccagctaa ttttttattt tatttttttt ttagtagaag 45360cggagtttca
ccatgttagc cagatggtct caatctcctg acctcgtgat ccatccgcct 45420cggcctccca
aagtgctggg attacagacg tgagccactg cgccctgcct atcttaccta 45480tttcaaaagt
taaactttaa gaagtagaaa cccgtggcca ggcgtggtgg ctcacgcctg 45540taaccccagc
actttgggag gccgaggcgg gcggatcacg aggtcaggag atcgagatca 45600tcctggttaa
cacagtgaaa ccccgtcgct actaaaaata caaaaaatta gccgggcgtg 45660gtggtgggca
ccggcagtcc tcgctactgg ggaggctgag gcaggagaat ggcgtgaacc 45720tgggaggcag
agcttgcagt gagccgagat agtgccattg ccttccagcc tgggcgacag 45780agcgagactc
cacctcaaaa aaaaaaaaaa aaaatagaga cccggaaagt taaaaatatg 45840ataatcaata
tttaaaaaca ctcaagagat gggctaaaga gttgacggaa caaatctaaa 45900tattagattg
gtgacctgca aaaccagccc aaggaacatc ccagaatgca gcccataaag 45960ataaagagag
catttccgct gggcacagtg gtatggcagg ggaattgcct gagtccaaga 46020gttgcaggtc
acattgaacc acaccattgc actccaggcc tgggcaacac agcaatactc 46080tgtctcaaaa
aaaaaaaaaa ttaaattaaa aaagacagaa tatttgagag aaaaaaatgc 46140ttatttcaag
aaacatgaaa gataaatcaa gatattctaa ttcccaagta agaataattc 46200cagaagcaga
aaatagaata gaggcaagga aacactcaaa acttctccag tgccatagaa 46260atgtgtatta
atctttagaa tgaaacggac taccaaatgc tgagcaggaa gaacaaaaga 46320gatccactct
taagccagtg tggtgcccaa gcgcagtggc tcatgcctgt aatcccagca 46380ctttgggagg
ccgaggcagg tggatcacct gaggtcagga gtttgagatc agtcaggcca 46440acatggtgaa
accctgtctg tactaaaaat acaaacatta gctgggtatg gtggtgcaca 46500tctgtaatcc
caactacttg ggaggctaag gcaggagaat cacttgaaac caggaggtgg 46560aggttgtagt
gagccgagat catgccacac tcccagcctg ggtgacagag caagattcca 46620tctcaaaaaa
aaaatccact cctagacaaa taatagttaa attttagaac accaaggaga 46680aagaaaaaaa
attgtaaagc ttcagagaaa ataaacatta actacaaaga aacgagagtc 46740agacgcgtgc
acttcttcct agataccagc agataaagca atatctccaa aattcagaag 46800gttttaacgt
agaatcctat acccagtcaa gaatattcac atggaaaagt gaaataaaaa 46860acattgttta
aacatgcaag ggttcagaaa gtttaccatt cacagaatcc ctgaaaacaa 46920aaccaaataa
tcacttaagg actcattaag aaaacaaatg aaataaaagc accaatgatg 46980agtaaataat
cagaaaaatt tacagtttac ctaaataact gtttatgcat aatgtatgaa 47040aacccaaaaa
tttaatatgg gacagaatta aaatcatgat aagattcttt tttgctttac 47100tcatggagag
ttcacataaa cagattatct tttaatagca agagaaaaaa atgtttagat 47160atgtgtgaaa
aactaagggt accaaaacag tgcaaattca tttatcatca ggaaaatcca 47220aattaaaacc
acagtatcca ccagaataac taaaaggtaa aagacagaaa ttaccaagag 47280ttggcaagaa
tgtggagcaa ccacatatac ttctggggta aataagttgg tgcaaccggt 47340actgaaaact
gtttgctagt atctactaaa accgagcaca tgcacagact acaaccaagc 47400agttccactc
ccagatacac actcaacaga aatgcacaca ctcactcaac aaaagacgtg 47460tactagagtg
ttcatgtact tactattcat aatagtccaa aaatgcaaac aaccaactgc 47520caatcaaagt
caaatgtata tctatattag ggatatatac aatggcatat acacagcaat 47580gagaatgaaa
tgaaccagct cggcacagtg gttcatgcct gtaatctcag cactttgggc 47640gggtaaggca
ggcagatcac ttgaggtcag aaatttgaga ctagcctggc caacacggtt 47700aaaacctgtc
cccactaaaa acacaaaaat tagccgggca tagtggttgc aggcctgtaa 47760ttccagctac
tcgggaggct gggttgggag aatcgtttga acccgaaagc cggaggtcgc 47820agtgagcgga
gatcgtgcca ctgcactcca gcctggacga tagagcaaga ctccgtctca 47880aaaaaggaaa
tcaaaaatat aaaataagat gacaggaata atccgcaaaa gatcagtaat 47940caaaataaat
ataaatgggc taaagctacc tattaaaaga caaagatttc acacccataa 48000ggatagctac
tatcaaaaaa agagagagaa taacagatgt tagcaaggat gtatggaaac 48060tgaaattctc
acgcattgct ggtgagaata taaaatggtt cagcctctgc ggaaaacact 48120atgctgggtc
atcaaaaaat taaaaataga agtactactt gatccaacaa ttctacttct 48180gggtatatac
ccaaataact gaaagcaggg tcttgaagag atatttgtac acccatgatc 48240atggcagcat
tattcataat agctatgatg tggaaccaac ataaatatcc tttgataaat 48300atatggataa
gcaaaatgtg gtgtatacat tcaatggaat attaattagc aataaaaatg 48360aagaaaattc
tgacacatgc tacaacatgg atgaaccttg agggcattac attaaatgaa 48420ataagccagt
tataaaaaga caaatactat atgaggtact atattagata ctcatgcaag 48480gtacctaaaa
taggcaaatt catagagaca aaaagcagaa tggtggttgc caggggctgc 48540ggtaatggat
acagagcttc aattttgtaa gatgaaaaaa ttctggagat tggttgcata 48600acaatgtgca
cacacttaac actggggaac tgtaaactta aaagtagtaa atggtaaaaa 48660taaaaataat
aaataataaa ttttatgtta ttttaccaca atatttatta aaagacaaag 48720attaactaat
taaacaaaat ccagccataa gctaatggta agagtaacaa ttaaagaaga 48780cacagaaaat
tgaaaatcag tgactagaaa aagatattcc atataaatgc taacaaaaag 48840caagtacagc
aatataaaga gaatgaacaa aaaaaaaatt aaataagatg gctcgtttat 48900tcccaaaagg
tacaattcac caagaagata caagaattgt gaacctttaa gcacataaaa 48960cagcttcaaa
aatacaacat ttaaagaaaa atatatatta aacatagaaa tagtacaaaa 49020acccctacaa
gaatcataat gggagtcttc aatacaactc tccatatcaa caggtcaaac 49080agagaaaaaa
aataagttaa ggatgcagaa aacctgaatt accatcaata aacttgagat 49140taatatagaa
ctgtataccc aatatactaa gagttcaggg aacagtcgtg actgacagtg 49200gactgcaaat
taatctgttc ttaatctttg tttttctttc agcactgtgg cagaatagag 49260atcctaaaaa
ccttccagct acaaaacatc tttttaaaaa tataaaaaaa tacaaaaata 49320actctgaaat
caatagaaga cacatggtga aaccaaaatt ctagaataca gggagaataa 49380aggcattttc
agatattaca aaaacagaaa attgatcatt gctgaagtaa tttctaaaga 49440atgtacttga
gggagaagaa aaatgttcca aagaaaagta tctgtgatac aagaaggaat 49500ggaaagtgaa
gaaatggtaa acaggtagat aaagctaata aatgttgacc tagaaaataa 49560caaaaacaat
agcaataatg tctcgttgga agggttgaag taaaaataca attaaggcca 49620aatgtgaggt
aagtggaatg aaagaattag aagtccttgc cttgttcaca ggactgatta 49680aataaatgag
ccaggttttc cattcaaaca gttaaaactt gaacaaaata aactcaaatt 49740aagtagaaag
ataaaaaaca gaaattaatg tcatagaaaa ataaaaaatc aatagaatta 49800atcaataaat
cctggttaat aaaagctggt tctttgaaag gattaataaa ataatcatta 49860agcaagtctg
atcaaaaaaa aagagaaaag gtaccaaaaa aagtactgta tcagaaagag 49920aacatacaga
tacatacaga tatgtaagag tctgttttct tacaccagaa tactatatac 49980aacattatgc
tagcatatat taaatttcaa taatgttaat gattttctag gaaaacagaa 50040aatattaaat
ttactttgaa gaaacagaaa aactgagaaa aataaatgat catgaaaaaa 50100atgaaaaggt
aattaaatac tgatattaac tgcctaaaca acaccagcag cagcccaggc 50160agtctgcagt
caagttctgc caaacttgag ggaacagata attcttctat tccagagcat 50220agaaaatgat
ggaaagtttc ccaatttaat cagagaggac agcctgatcc ttgttatgaa 50280cacagataaa
aatggggtaa actatatgcc aaactcagat accaaaaccc taaataagat 50340gctagcttat
tgatgtgaac aatccaaaag tgcattttaa attagcccag ggttttagag 50400aaagaaaatc
tagcaatgtg accaccactt atgttaacaa ttttaagacg aaaatctaca 50460tgatcatatc
aatgcatgct acacaaaagc atttgggcaa aaaacccaac acccaccctt 50520gactttttaa
actcttagta attaggcata aacagaaatg tacttaatgt gatagaatac 50580actcggtgaa
gatacagagg gaatgctccc taaaaccaag cccaagacaa agattcctat 50640ttaacctcaa
tagtcaacac tgcagcgaga gtaatctatg gaagacaagg aaaaaagtaa 50700aaacatgaga
gacatctgtt gtttaacaga caataagatc acctacttgg aagaggcaaa 50760cgaatcaagc
gaaaaactat taaaactgag acaggcttta gtatggaggc tcagcttcag 50820ctgtagtttg
ggctaccaaa ttcaactcgc ttgcttggag agttaatcct gcaaagctaa 50880tttctgttga
ggtattagga ttgacaagcc tgtgctcctc cctcctcccc catcttcaac 50940actgaaataa
cacggtgttt ggaactggat aacagaatct tccaaaaaca aaaattgtcc 51000tgaagggctg
acttgtgccc ttactcaaaa aacactttat ctgctgcctg cagctcctac 51060agttgctggt
ggataagcct gccaaccagc tcggcgtaat tcttcctgca gagggcaagg 51120aagagcactt
tcacaggaaa atttttttcc gaactgtatg ccgcttatta cataaactta 51180cgtgctggca
aatggagctc cagcaaaata agatattcag agtcaaactt ccttaggaaa 51240aaaaaaaaaa
aaaagcaagc acataacact aatttccttg catgggcact ggggaaggag 51300gtcgttactt
ccgcacgccc gcaggtccgc accaccggga aacccacggg caccgcgcgc 51360tgcccccggg
ccttccaggt gcactgcgcc gcggcgcccc agctgacccg ggatgcgcag 51420ccctagccct
tcccctgtca ccccggccag gaaggggcgg gagcgcggcg gacgccgagg 51480gcgaagggct
tctcggtcct ctgcaccacg cagcaccccc aaggcacaac agggagggtg 51540cgggaggctc
ccgagaccca ggagccgggg ccgggcgtgc ccgcgcacct gtcccactgc 51600ggcgagggct
ggggtcgcct ccagggccgc agctgtcggg agccacctgg ctctcagtcc 51660cgggtccctg
cgacaaccct cgggcccgga ggggaggagg cggccacctg ccgctgccac 51720ctgcggcacc
ggtcccaccg ctccgggccg ggcaggacag gccaggacgt ccctcctggg 51780ctggggacag
gacacgcgac gaggggaccg gggcccccgc ggcgaagacg cagcacgcct 51840tcccagaaag
gcagtcccgt gcccccacga cggactgccg gacccccgcg ctcgcccgcc 51900catcccttca
gaccacgcgg ctgaggcgca aagagccggc cggcgggcgg gctggcggcg 51960cggctagtac
tcaccggccc cgctggctca gcgccgccgc aacccccagc ggccacggct 52020ccgggcgctc
actgatgctc aggagaggga cccgcgctcc gccggcgcct ccagccatcg 52080ccgccagggg
gcgagcgcga gccgcgcggg gctcgctggg agatgtagta cccggaccgc 52140cgcctgcgcc
gtcctccttc agccggcggc cgggggcccc ctctctccca gctctcagtg 52200tctcatctcc
ctatctgctc atcctctggt cgcacataat cgatgtttgg gcgtcccaag 52260ccagatgtgg
accccatttc cgcactctac actggaggtt ttctaagggt ggtgcccgga 52320ccagcagctt
cagcctcatc tgggaacttg agaaaatgca gattctccgt cccacccagc 52380ctattcggtt
tttcctgcac taaaaccatg aaggtggggc ccagcagtcc acattctcgc 52440aagcccgtca
agtgattctg aggcgccctc cagtttgaga gctatgctca cggcctcacc 52500tccgccccgc
aaggagcccg gtcttgcctg tggcgctagc cgcacacgga cacctcatcc 52560tgcggggccc
gcccccccgc tgcaccctca ccgcccaacg cctcctccgg gatgcagcgg 52620aggcgcctgg
aagtcggcaa ggtcaacatc cccctcagca tcttccctac cctcacggct 52680cctcctccag
gggtgcctca tggccagggg ttagaaagag ccactgtgtt tcttgacatg 52740gaagtggcct
aagaccttaa tgaaaactgc aggagtggaa tgacagaacc tttggtcata 52800cttgagggcg
tgaagctcaa atgaggagga aggaaaggat ccagggagaa taaccaaccc 52860tggcaagttg
tggcgcccag gtagaggggc gagcctaggc tagcggttct cgaccagggc 52920cggtgttgcc
cctcctcgcc gccccgcgta catttgggga ggtctggaga catttttggt 52980tgtcatgatg
cgggagttgc tactgttgcc taagtgggta gacacgaggg tgctcctcaa 53040catcctacct
gaaggacagg actgccccac aaggaagaat gatccggccc caaataagaa 53100accctgggct
ggtcagcaac aacccctttg ttctgagaag agaggaggaa agaataaaag 53160aagtggggtg
aagttttggt ttggtagagg aaacttgaag acattttcac tggaaaggaa 53220gagaggaaga
ggagggagat gtctgtaagg acgagcaaac cgggtgacag ctgatttcct 53280catattgaag
taatgagtcc tagttataat aaattcctaa taaaaaccca gtttatccct 53340gcaataaact
tgtctttttt ttttaaatat actgcttgat tctgtttgct aatattttat 53400ttacaggctt
tgcattgata tgcaaaaatg agatgggcaa taattttctt tttgaatgtc 53460taatgttgtt
tggtttcaga atcaatgtta tgctcacatc ataaaaaatt tggaaccgag 53520gcaggaggag
tgcttgaggc cagaagttcg agaccagtct aggaaacaca gtgagacccc 53580cccatctcta
caaaaaaaaa aaaagaaaaa aaaatgggca tgtttgcttt ttccttttac 53640tctgaacaat
ttaaggagca ttaaaattat ctattctttg aggtttgatc atttcccagt 53700taaaaatgtt
cctcccagcc tgatgctttc tttggggagg gtaaatcttt taaggctaga 53760aaagtttctt
ctgtggcaat tttattattt acattttaaa aattattcta gagttaattt 53820tgataaagca
tgtatttctt aaaacaaatt atcctttttt tccagatgtt caagtgtatt 53880tgcataaagt
tgaggaaagt agtcttttgt gaatctttta acttctccca aatatcttat 53940tttgtgtatt
tttgcttctt tattttgtta acttttaaaa gtgtattttt ttttcaaaga 54000atcagctctt
aggtttatgt ttttggttat actggagctt ttttcttctt ctttttaaaa 54060tattttttct
cctttatttt ttagacgtat tttgatctaa cgtaatcgga agaaggtaaa 54120ttagaatctt
ttgttactat tgtgttttta tttctcctta tttctctgaa gtcctgcttt 54180ataaatagta
ccatgttatt tgtgcataaa tattcatttg tcttatattc ttgggaattt 54240tcccacttca
tcataaaatg accttccttg tctcatttaa tgtgttcaaa ctttgccctg 54300aatttaactt
tgtctgatat tttaccatcc tgctgaattt tgtttgttac cccaaacaac 54360ctttgctgtt
ttcgtctttt ctgaaccctt tattttaggt aatcccttga attagagcac 54420taagttttgc
tttgtgatta aatctgaaaa tctttatctt gccatagatg agttgagccc 54480tattcatgtg
acagctatat tatgctgttt catagccctt ttggtccttt tttcactctt 54540gcattgcata
ttttgtgttt attgtgtttt gtgtttcttc tgataatttg gaaggtttgt 54600atttttattc
agggagttgc cttataatca tactccgcaa tacacatcgt cctcagtttc 54660ttcagactgt
ctgttaactc cctattctga ataaaaatga cattgtaatt tccctctttt 54720ttctttaccc
cttttcttct cctcacctaa tgtaaatgat tttatccttc tttagtattt 54780gcttttttaa
ttaactacat ttataaatat ctttatcact tgatttttaa atcagctttg 54840aatgagatat
ttggattcct agatataaaa gatgttaatt ataccatttc cacgttagta 54900ggtttataaa
atcatacatt ctgctgtgta accataatcc cacgtttgtt ttagttccac 54960tcctacagtt
aaaagattca gaagtattat taacagttat tttgccatag ttttttcccc 55020aacccatttt
gtggtaagtt atgatcctgc tttagtttct taagaataat ttatagagca 55080gagtgtggtg
gctcacgttt gtaatcccag cactttggga gacaagaggt agaaggatcg 55140cttgaagcca
gcagttcaag accaccctga gcaacatagt gagaccttgt ctctacaaaa 55200aattttaaaa
tttagccaga cgtagtggcg tgtgcctata gtcccagcta ctcaggaggc 55260tgaggcaaga
ggattgctag agcccagaag tttgaggctg cagtgacctc tgattgtgcc 55320actgcacccc
agtctgggca agaaagtgag aacctatctc tttaaaataa caataataac 55380ttatgaaaat
tatattccct gagtttttca tgtttaaaaa tatttgttgc ctttatcctg 55440taaaagtttg
agtataaatt cttgggttat actttattta ttgaagaatg tataagtatt 55500gtcttctaga
attgagtgtt gctgtaatga aaccagaagt cagcctggtt tatttttcct 55560cagaaatgag
gtaattgccg gccggacacc gtggctcatg cctgtaatcc caacactttg 55620ggaggccgag
acaggtggat cacgaggtca ggagattgag accatcctgg ctaacatggt 55680gaaaccccgg
ctctactaaa agtacaaaaa gttagctggg catggtggtg gacgcctgta 55740atcccagcta
cccgggaggc tgaggcagga gaatggcgtg aacctgggag gaggagcttg 55800cagagagctg
agatcgcgcc actgcactcc agcctgggcg acagagtgag actccgtctc 55860aaaaaaacaa
aaaaaaaaca aagaagtgaa gtaattgcca tgatgctcca agaattatct 55920ctttgtctat
gaaatccaga aatctcactg ttatacattt tggaattatt attctgggcc 55980aatatttcct
gggacacaat agattgactc tatagattta attttttttt tttttttgag 56040acagagtctc
actgcaatct cagcttactg caacctctgc ctcacgggtt caagcaattc 56100tcctgcctca
gcctcccaag tagctgggac tacaggcgcg tggcaccatg cctggctaat 56160ttttgtcttt
ttagtagaga cagggtttca ccatgttggc caggctggtc ttgaacgcct 56220aacctcaagt
gatccacctg cctcagcctc ccaaagtgct gggattacag gcgtgagcca 56280ccatgcccag
cctcaattcc tctttctatc tggtaatttt tctgaagttg aaaacatttg 56340ttctaatacg
ttatttcagt gttcttctaa gatgtgtaaa gcaccctatt cccaggtcag 56400cccccatctt
gctagtgagc tcggctggtt cttcacaaga gctctggttt tctcctgctt 56460aatctcaagt
acctctgtca gcctccacct ggtttatgat ttggagtttt ttggtttttg 56520ttttttgttt
ttgacagagt cttactctgt cacccaggct ggagagcagt ggcataatct 56580cagctcactg
caacctctgt ctcccaggtt tgagcgattc tcctgcctca gcctactgag 56640tagctgggat
tacaggcgcg tgccaccaca cccggctaat ttttgtattt ttagtagaga 56700tggggtttca
ccatgttggc cagggtggtc ttgaactcct gacctcaggt aatccacctg 56760cctcagcctc
ccaaagtgct gagattacag gcgtgagcca ccgcgcctgg catggtttgg 56820agttttaatc
tgtagtttta ataaagatag tgcttatgtt tgtgtttctt atatttcttg 56880gtactcttgg
gtaatttgta agatccccat atctacacaa gaagtccatt ttcaattctt 56940ttcttcagac
tgtttatttt attttatttt attttatttt tatgtttgag atggagtctc 57000gctgtgtcac
ttctggaggc tggagtgcag tggcgcgatc tcaggtcact gcaacctccg 57060tctcccgggt
tcaagcaatt ctcctgcctc agcctcccga gtagctggga ttacaggcac 57120ctgccacttt
ttaatttttt tagagacaga gtctcgcttt gttgaccagg ctggagtgcg 57180gtggtgcaat
catggctgac tataacctcc aaatcctggg ctcaagtgat cctcctgcct 57240cagcctcctg
agtagctggg actacaggca catgccacca tgcccagtta attttaattt 57300ttttgtagag
acagggtctc catatgttgc ccaggctggc ctcctactcc tggcctcaag 57360taatcctcct
acctcagcct cccaaattac taggattata agcatgagcc accatgccca 57420gccttgttct
actactttaa tttcatatgt taggtgacca tgtaattgat catccaaacc 57480aggatactgt
aagaatgaaa gaggctgaca gtagtatgat gctgggacta gcattgtgca 57540ctgagattat
ttctgggaaa gcaggagata cggtcaccct acttatagtg tgcttgtctt 57600tggattgttg
aatttggagt ttctatttgc aggcttattt caactgggca gccttgatcc 57660gccctgccca
gcaatgctac cgttctctcc accgggtctc tgggacccct tcagtcacta 57720tacttagctc
agttccccac cctcccactc cctaaaagcg taaccaggaa tcctgcctca 57780ggtctactgc
cgtcttccgt gggctgtttc agttcctatt acccagagtc aaactcccag 57840cattccctac
ctgattccag acttggagtc cagagcttta acctcttcag gccaactccc 57900cactttgcat
ttctgtccct atatcttagt ccatggagat acatttcatg tctttgagtc 57960tacttacaaa
gtaaattttg ctgtttttta attttttttt tgagatggag tcttgccctg 58020tcacccaggc
tgtggtgcaa tgacgccatc tcggctcact gcaacctccg cctcctgggt 58080tcaagcgatt
catctgcctc agcctcccaa gtagctgtga ttacagacag gcaccaccac 58140gcccagctaa
ttttttttat cttttagtag agacagggtt tcaccatgtt ggccaggctg 58200gtcttgaatt
cctgacctcg tgatctgccc atctcggcct cccaaagtgc tgagattaca 58260ggcgtgagcc
actgtgccca gccaattttg ctttttttat atttcattgc tatatgttta 58320gaggataagt
ttacagtgct atatgcattc ccaaatatta gaccaaaaaa atctccaaaa 58380aattagaaag
aaaatccaaa aaatctcaaa aaataccaaa aagcaacaat ctcacagacc 58440atactcactg
acccccaata aaataaaatt agaaattaac cacaacttaa caaaataaag 58500tactcaagtc
agagaggaaa gaggaaataa acatcaaaat tacaaagtct aggcggtggc 58560tcacgcctgt
aatcccagca ctttgggagg ccaaggcggg cagatcacaa ggtcaggaat 58620tcgagaccag
cctggccaat atggtgaaac cccgtttcca ctaaaaatac aaaaattagc 58680caggcatagt
gatgtgtgcc tgtaatccag ccacttggga ggctgaggca ggagaatcac 58740tgaacccagg
gagacgaaga ttgcagtgag ccaaaatcgt gccactgcac ttcggcctgg 58800gtgacaaagc
gagactccat ctcaaaaaaa aaaaaattac aaactcttta gatagaaatt 58860ttggtgtttt
tttttgagac ggagtctcac tctgtcgcag aggctggagt gcagtgggac 58920tatgtcagct
caccgcaacc tccatctcct ggattcaagc aattctcctg tctcagcctc 58980ccaagtagct
aggattacag gcgcccacca ccagacccag ctagttttta tatttttagt 59040agagatggtg
tttcaccatg ttggccaggc tggtctcaaa ctcctgacct caagtgatcc 59100acctgcttca
gcctcccaaa gtgctcagat tacaggcgtg agccaccgca ccccacctag 59160atagaaattt
caacatgagg ccgggcacaa tggctcacgc ctgtaatctc agcacttcag 59220gaggctgagg
cgtgggagga tcacttgggc ccaggagttc aggaccagca tgggtgacag 59280agacagaccc
tgtctctatt tatttgaaaa aaaaaaaaaa aaagagagag agaaagaaat 59340ttcaacatga
aaagtatctc tcaaaccctt cgagatgttg gcaaaaagcg actcaaagga 59400aaatgtatta
ctgtgtgtga atttgcttga aaataagaaa gaggccgggt gtggtggcta 59460acacctgtaa
tcccaacact ctgggagtcc gaatcaagtg gatcatgagg tcaggagatc 59520gagaccatcc
tggctaacat ggtgaaaccc tgtctctact aaaaatacaa aaaattagct 59580aggcgcggtg
gctcatgcct gtaatcccag cactttggga ggctgaggca ggtggatcac 59640ctgaggtcag
gggtttgaga ccagcctggc ctacatggtg aaacctcgtc tcttctacaa 59700atacaaaaat
tagctgggcg tggtggtggg tgcctgtaat cccagctact cagaggctga 59760ggcaggagaa
tcgcttgaac ccgggaggcg gaggttgcgg tgagccgaga tcgcaccact 59820acactccagc
ctgggcaaca gcctgggtga cacagtgaga ctccatctca aaaaatacaa 59880aaaattagct
gggtgtggtg gcctgcgcct gtagtcccag ctacccggga ggctgaggca 59940ggagaatgga
gtgaacctgg gaggaggagc ttgcagtgag ccgagatccc accactgcac 60000tccagcctgg
gcgacagagc aagactcttg tctcaaaaaa aagaaaaaaa aaggaaaaaa 60060gaaccctgat
aataaagaaa ccaaatgttc aactctcaaa gctcggacac tttaaagaaa 60120taattaataa
aggcagaagt taaagggagg atgataaagc aatttttttt gttggttttt 60180ttgagatgga
gtcttgctct gtcacccagg ctggagtgca gtgatgcgat cttggctcac 60240tgcaacctct
gcctcccggg ttcaagcaat tctcctgcct cagcctcctg agtagctggt 60300actacaggtg
cgcgccacct ggcccagcta atttttgtat ttttattaga gacggggttt 60360caccatattt
gttaggctgg tctcaaactc ctgatctcag gtaatctgcc cacctcggcc 60420tctcaaagtg
ctgggattac aggcaggcgc caccgcgcct ggcctaaagc aaaatattgg 60480ttctgtgcaa
aaggtcaata aaaagagcaa acgtttacaa actggagcca gcacccattc 60540agctcagtgt
gtctggagaa aaaacaatct cgcttcagaa ttcatgatta cgcagccctt 60600tttgcttcct
aaaaatccta ctatgttgct gttgaccatt ctctctcttt ctctctctct 60660tgctttctct
ccagaaaagc tattcagaca ttctcctctt tcctcaaacc tccaacactt 60720cctcctccat
ccttagcctc agctgctgac ctcacttcta atcattgaga aaccaggaga 60780agcatttaag
agtgaacctc cgcctccccg cacgggcaaa accacccacc cacagaattg 60840tgccccaatt
ctgcgtcctc tcctctcacc atggatggac ggtccaggct ccgagccaaa 60900gccaggcctc
ccctggagct ctggatccac cacctgcagc ttctcaggca gggccccagc 60960agctcccctg
ctcccttgta ccatcaatcc ctcccctcac tgggtcactc ccaacaatat 61020atatatttag
tgatgtttct cccatgtggt aaaatcactt agcctctctc ctcccccagc 61080tactatccta
tttgtttctt tccattctct gcaaaacttc tcaaagcatt gtgtctatgt 61140gctgactcca
tttatcttct cccgttctct gctgagtcct tcccacagac tctcacccca 61200gttactccat
gaaatgacct ctgcactgcc acatccaatg gtgaatgttc agttcttaat 61260tttattcagt
ctttcagcag catttgacct ggccgatcac tccctcttct taaaaatact 61320tttctcagcc
aggcgtgatg gctcacacct gtaatcccaa cactttggga ggccaaggcg 61380ggaggatcat
gagagcccag gagttcaaga tcagcctggg caacatggca agaccctatc 61440tctacaaaaa
ctaaaaagta gccagtgtga tggcatgcac ctgtagtccc atctacttag 61500gaggctgagg
cagtaggatg acttgagcct gggaaatcaa ggctgcagtg agccatgatt 61560gcaccactgc
actccagcct gagtgacagc gagaccctgt ctcaaaaaga caaaatagga 61620aacttttctc
agcatattcc tctgattctc ctgctgcttc tgtctgcaca gattcagtct 61680cctttgccgg
ttcttcctca tcctcctgat ctcttgacct tgaagtgccc cagagtacag 61740tctttttttt
tttttttgag acgcagtctc gtctgtcacc caagctggag tgcaatggcg 61800aggtctcagc
tcatgcaacc tctgcctcct gggttcaagc gattctcctg cctcagcctc 61860ccaagtagcc
aggactacag gcacatgcca ccatgcccag caaattgttg tatttttagt 61920agagacaggg
ttttactata ttggccacgc tggtctcaaa ctcctgaact cgtgaaccac 61980ccgcctcggc
ctcccaaagt gctgagatta caggcatgag ccaccacacc cggcccagag 62040tacagtcttt
agacggcctc tctacctata cttgctcccc tcataaactc ctcctgcctc 62100atggctttaa
ataccatcgg tagactgatg actcccatat ttctcttttt tttttggaga 62160cggagtctcg
ctcagtcccc caggctggag tgcagtggcg cgatctcggc tcactgcaag 62220ctccacctgc
caagttcaca ccattctcct acctcagcct ctccagtagc tgggactaca 62280ggcacccgcc
accacgcctg gctaattttt ttgtattttt agtagagatg gggtttcacc 62340atgttagcca
ggatggtctc gatctcctga cctcgtgatc cgcccatctc ggcctcccaa 62400agtgctggga
ttataggtgt gagccaccgt gcccagccga tgactcccat atttctatct 62460cttgctgtgt
gggagttctc ctcagaactc catactcata aatccaactc tcataaatag 62520tatctcaaat
gggcaatatg ctcaaaagtc aattcctact tttctcccta aacttgcttt 62580cctgcagtct
ccaccatctt aatgtccaat ctaacattag gaggcaaaaa ctttgaagtc 62640attcttgact
cttctctatt acacacccta tccaatcttt ctgcagatcc agtcgacccc 62700caaatccagt
tagctctcat catctcccct gttaccccct ggtccaggcc atcttcctct 62760ctcacctgaa
tcactgcagc attctcctca ctggtctctt tggttctgtt ttcactccac 62820cttagcatag
tctccacaga gcagtcagag ggatcctttt aaagtgtaat tcccatcctg 62880tccctgctct
gctcaaaacc ctgtcgtgat tcccgtttta atctgtcaga ttaaaagcca 62940gagtctttcc
agtgacctac atgatctgcc tattatcacc tcccacttct ttccccttgc 63000tcactccact
ccagctctgc agctgtcctt tctgtttcct gaacagccca gattttgctt 63060ctttagaacc
tttgtatttg ctgtcccctc tgtctggaat gtttttccag gaagtcacct 63120ggctctctcc
tgcacttcct tcctgaccac catgtttaaa aatcactcaa acacacttca 63180ggccggacat
ggtggctcac gcctgtaatc ccagcacttt gggaggccaa ggtgggtgga 63240tcacctgagg
tcaggagttc gagaccagcc tggccaacat ggtgaaactt cgtctctact 63300acaaatacaa
atagtagcca ggtgtagtgg cacacacctg taatctcagc tactcaggag 63360gctgaggcag
gagaatcgct tgaacccaga aggcagagga ggtgcagtga gccaagatca 63420cgccacaaca
ccccagcctg ggtgacagag caagacccca tctcaaaaaa aaaaaaagaa 63480aaaaaaatca
cacaaacaca cttctcttca tattcctttt ccaagtttta tttttctcca 63540gaatacttta
cattgtttta atggaagttc tccgtttccc cccaactaga atggatactt 63600cctgcaggta
ggcactctag tcctcccatc caagtactaa ccaggctcaa ccctgcttag 63660cttctgagag
caggggagat caggcctgtt cagggtggta tggcccagga attttgattc 63720tgttttattc
attgctgttc tgttgattct cttttgttcc tcctcctagt gctgagaaca 63780ctacttgtac
ataataagca ttcaataaat atttgttgaa tgaatgactt gttgaatgaa 63840ttaatctcag
aaatgcagga ctggttctac attagaaaat ttttcaaggt cattctctgt 63900tgtcgtaaca
cattaagaga ggaaaatttt gtactctaaa tcatttgata aaatacatac 63960tgatttctgt
tttcaaaaac tcttagtggc tgggcgaggt ggctcacatc tataatccca 64020gcattttggg
aggacgaggt gggcggatca cttgaggtca ggagtttgag accagcctgg 64080ccatcatggt
gaaaccctat ctctactgaa aatagaaaaa ttagccgggt gtggtggcgc 64140atgcctgtag
tcccagctac ctgggaggct gaggcaggag aatggcttga acccgggagg 64200cggaggttgc
agtgagccaa gatcatgcca ttgcactcca gcctgggtaa cagagtgaga 64260ctccatctca
aaagaaaact cttagtgagt ttaggaatcc aaggaagacc ctcaaactaa 64320atagataatc
tagctaccag aagccttcag taaaccttaa cactccatgg tgaaacatta 64380gaaacattcc
tactaaaaga caggctaaga atgcctgcaa tcttcacggc tagtccaaga 64440agtcaaaaag
aagaaatgag cgctgattta aaaaaataaa caaacaaaaa actaccgatg 64500cagaggctgg
cagcaaggac tgaaggactg tacagtactt gcctggagca ggcggatggc 64560cacacccctg
cgaagcctgc tcagctggct gggggacgct ccagtgtgtg agtggcagga 64620tgcagggtac
ttcctctgcc agggagttgc actggggaga tcctccccca ctcacacttt 64680ggcagctggg
gctttggaat gtgacttagc ttctgtcaaa gggtcaatcc accctttgat 64740atatgatgca
aaggcgaaca tatgatgcaa aggtgagaga acagcccaaa ttaggacttt 64800taccacagct
gtggaggtgg acagcgacag tggtgggccc tggccagact tttcatgctc 64860aaaggtggtg
gttgttcttc ctacttcttg tccctccagg gcttcctttg cctgtgtgct 64920gaacctgctt
cttttaattt tttttaactt ttttaaattt ttaattgttt taattaaaac 64980aaattttgaa
aactgtctga acctgctttt gaaccctgct atgatttgaa tgtttgtccc 65040ctgccaaact
gattttgaaa cttaatctcc aaagtggcaa tattgagatg gggctttaag 65100cagtgactgg
atcatgagag ctctgacctc atgagtggat taatggatta atgagttgtc 65160atgggagtgg
catcagtggc tttataagag gaagaattaa gacctgagct agcatggtcg 65220ccccttcacc
atttgatatc ttacactgcc taggggctct gcagagagtc cccaccaaca 65280agaaggctct
caccagatac agctcctcaa ccttgtactt ctcagcctct gtaactgtaa 65340gaaataaatg
ccttttcttt atgaattacc cagtttcaga tattctgtta taaacaatag 65400aaaacgaact
aaggcaaact ctcatgattc tactgccatg ccattccaat aaactccctt 65460tatgcttaag
agagccagag ttggccaggc gtggtgactc acgcctgtaa ttccagcact 65520ttgggaggcc
gaggcaggtg gatcacaagg tcaggagatc gagaccatcc tggctaacac 65580ggtgaaaccc
cgtctctact aaaaatacaa aaaaattagc tgggcgtggt agtgggtgcc 65640tgtagtccca
gctactcggg aggctgaagc aggaggagaa tggcgtggac ccaggaggcg 65700gagcttgcag
tgagtcgaga tcgtgccact gcactccagc ctgggtgaca gaatgagact 65760ccgtctcaaa
aaaaaagaga gccagagttt atttctgttg cttgcaacca agaaatctgg 65820ctggtgcact
gaagtttcca taaataatag caatttaaag actctttcca agccaggcaa 65880tgcctagcct
tgtgtagtcc ttgtggtaat acattcattc attcatttgt tcaaccaact 65940gtgctccaga
gactaagaat acaaaaatgg gggccgggtg tggtggctca cacctataat 66000cctagcactt
tgggaggccg aggcaggtag atcacctgag gtcaggagtt cgagaccaac 66060ctggccaaaa
tggtgaaacc cctactctac taaaaataca aaaaattagc tgggggtggt 66120ggcggacacc
tgtaatccca gctactcgtg agactgaggc aggagaatca cttgaacccg 66180ggaggcagag
gttgcagtga gccgagatcg caccactgca ctccagcctg ggcaacaaga 66240gcgaaactcc
acctcgaaaa aaaaaaaaaa aaaaaaagag ggccggggct gggcgcagtg 66300gctcacgcct
gtaatcccag cactctggga ggccaaggca ggagaattac gaggtcagca 66360gatcgagacc
agcctgacca acatggtgaa accccatctc tactaaaaat acaaaaatta 66420tccgggcgtg
gtggcgcaca cctctagtcc cagctacttg ggaggctgag gcaggagaat 66480cgcttgaacc
cgggaggcag aggttgcagt gagccgaaat catgccactg cactccagcc 66540tgggtgacag
agtgagactc cgtctcaaaa aaaaaataaa aaaaaaaaaa gaattcaaaa 66600attgtagagt
tatagtgtgc ttctagttta gttgagagga catctgtcct tcaaggaagg 66660ctagaatcta
taccctgagt ccttactgaa atcaatccag cagtcaaaac atgggaccaa 66720cgatcacagc
agtaagatag gaagagcacc tttgtacatt tagctcatgt tgagataagc 66780cactgacaga
gctgaaggaa gctcacagtt ctgggttcca tcctttggca tttaaaaaga 66840aaagtgctaa
gaaaattcgg ttggtcacgg tggctcacgc ctgtaatccc aacactttga 66900gaggccaagg
caggcagatc acgaggtcag gagttcgaaa ccagcctggc caacatggtg 66960aaaccccgtc
tctactaaaa acagaaaaat tagccgggca tggtggcgca tgcctataat 67020cccagctact
caggaggctg aggcaggaga attgcttgaa cccgggaggg ggaggttgca 67080gcgagtgaga
gcaggccact gcactccagc ctgggagaca gagcaagact ctgtctcaaa 67140aaaaaaaaag
aaaaaaagaa agaaaggaaa aaaagaaaga aaaaaaaaga aaaaagaaaa 67200ttcaggccag
gccaggcctg gtggctcaca cctgtaatcc caacactttg ggaggctgaa 67260gcgagacggt
gccttagccc aggagtttga gaccagcctg agcaacatag cgagaccctg 67320tctctataaa
aaaaaatttt tttttggcca gacgcagtgg ctcacgcctg taatcccagc 67380actttgggag
gccgaggcag gtggatcacg aggtcaggag atggagacca tcctggctaa 67440cacggtgaaa
ccccatctct actaaaaaat acaaaaaatt aaccgggcgt ggtggcgggc 67500gcctgtagtc
ccagctactc gggaggctga ggcaggagaa tggcgtgaac ccgggaggcg 67560gagcttgcag
tgagccgaga ttgcgccact gcactccaga ctgggagaga gtgagactcc 67620gtctcaaaaa
aaaaaaaaaa aaaaaaaaat taattgtcag gtgtgctggc atgcagctgt 67680agtcctagct
actcgggagg ctgaggtaag aagatcgctt gagcccagga gttcaaggct 67740gcagtaatag
tgcctctcac tctaccctgg gtgacaatga gaccctctct caaaaagaaa 67800gaaaaaaggg
aaagaagaaa agaaagaaag aaagagaaga aaggaaggaa gaaagaaaga 67860aaaagaaaag
gaaggaagga agaagaaaaa aaaagaaaga aagaaaagag agagaagttc 67920aaagaccaaa
gggtcaggat cccaaaatag tttttatgtt ttatttattt atttacttat 67980ttatttttga
gacagtatgg ctctgtcgcc caggctggag tgcagtgatg cgattgcggc 68040tcactgcagc
ctccaaactg ggctcaggtg gccctcccac ctcagcctcc cgagtagctg 68100ggaccacagg
cgcgtgccac catgcccagc taatttttta attctttgta gagatgaggt 68160ctctatatgc
tgcccaggct ggtctcgagc tcctgggctt aagccatcca cccgcctggg 68220cctcccaaag
tgctgggatt acagaagtga gccaccgcgc ctaatcgggt ggtttgtttg 68280tttattgacg
gggtctcgct gctgcccagg ctggagtgcc agtggctgtt cacaggtgca 68340gtcctggagc
attgcatcag ctcttgggct ctagcgatcc tccagagtag ctgcagctgg 68400gattccaggc
gcgccaccgc gcggggctca gaatgggttt ttatattgag ggttatgctg 68460ccacctagag
gatatatgta gtaccgaact gtgtgcgcag ggaggctgag gttgcagtga 68520gccaagatga
tgccagggca ctccagcgtg ggtgacagag caagatttca tctcaaaaaa 68580aaaaaaaaaa
aaaaaaaaaa aagaattgaa agtaaggtct tgaagagata tttgtgcctg 68640tatggtcata
gcagtattaa ctttgaccca ctagctaaaa cacaaaagca acatgtgtct 68700gtcagcaggt
gaacggataa acaaaatgtg gtatatatgt acaattgaat attattcagc 68760ctttaaaaag
gaataaaagg ctggatgcgg gggctcacgc ctgtaatcct aacactttgg 68820gagactgagg
tgggtggatc acccgaggtt aggagtttga gaacagcctg gccaacatgg 68880tgaaacttca
tctctactaa aaatactaaa attagccggg catggtggca cttgtctgta 68940atccaagcta
ctggggaggc taaggcagga gaattgcttg aactcaggag ccggaggttg 69000cagtgagcta
agatggcacc actgcactcc agcctgggca acagagtgag actccatctc 69060aaaacaaaca
aacaaaaaat tattatttcc aaagaaacaa gaccctgggt ccatttccca 69120gcccacacct
gatgttgact cacaacacac agcctggttt gctatgagcc tgcttcattt 69180aattgtcacc
ttaacttcac atcaccctca agtcctggaa taactctttg ctgacctttg 69240tgtgctgagc
catctccatg tcgctcaacg tgcagtccct ctcactgcac tgagtcaata 69300gccagacgtg
gtctgactgc agggtcatcc ttggtggctt aggctgactc gggcatagca 69360gggtgctctg
agacctcacc gcatataggc tttgccccca ataaactcta tataatattc 69420atattatgtg
gtctgggtgt gtgtagcttt gcactgtctt ctcgtgacag tgccctcaac 69480ctctttccca
ggatttcctc ctctacctcc tcaagtccca ctgctctgca aagaccaaaa 69540gctgcagagt
cccagctccc tcctttacac cccacgacgc agcctcctct ctcagaaccc 69600tttaaacaga
gtcttttact gcagatccca agaacagcca cacccctctc tcccacccac 69660tccagacaca
cccaggtaat tatagcaccc agggtaacta tgtagatgga gtccctggaa 69720catgtggata
gtgccccctg ggagtatgca aaagcaacat tgctggcacc tgcagagaac 69780agggtgacat
ccaggaatca gagcatgggc ctctgggagg tagggatgtg gccaggcagg 69840ctgccaaaaa
ttggtagagc aaggccacag gatctttctg accttccttc caaacagagg 69900ctcctgtact
ggtgatccct gtgttgattg accactccct tcctgggggt cgtggtctct 69960gtcccagttg
cccggacttc tgtgagtgtc ctactgaggt ccttttcatg agaagcatgc 70020tgtccttcca
cctgctggga gcaagagtga caacttcaat actataatag cagtggcata 70080cagagaagaa
gaaagatgaa gtggcaagaa aaacaggctt ccaagcagga gtttttctat 70140aaaaacaaaa
acgtttacaa gcaaactttt tataaagggc tagatagtaa atattttagg 70200ctttgagagc
cacatagact tgtttgcagg gactcaatgt cgctattgta gtttgaaagc 70260agccatcagg
gttatgtaaa tgagtgagtc tgattttgtt tcagcaaaat tttatttacc 70320aaaacagaca
atgagtgggc tggatttggc ccatgatcct tagtttgcca actcctgctt 70380tgggctcacc
cagatctgat tttgaattct ggctctgcta ctggttagct gcaggagctt 70440ggaaggctct
ctgagcctgt ttcctcatct gtaaaattaa agcaataatt tctaacactc 70500aagagtgtta
cctcacgcct gtaatcccag cactttggag gctgaggcag gcggatcacc 70560tgaggtcaga
agttcaagac cagcgtggcc aacgtggcaa aaccctgtct ctactaaaaa 70620atacaaaaag
tagccgggca tggtggcgcg catctgtaat cccagctact tgggaggctg 70680aggcagggat
actgctagaa cctgggaggt ggagcgtgca gtgagtggag atcacacctc 70740cacactccag
cctggccgac agagcgagac tccatctcaa aaaaaaaaaa aaaaagagtg 70800ttagaaggtt
ttgagataat gaataaaaga tgccttgtgt atactaagta ttcaacaact 70860gatagctgca
ttggtctaat tataacagtt tagaagcgat tgagtcaaca aatgctggat 70920ttgtcaggga
ggacttccta tcaggaggta gatcttgggc tgagtcctga agcaaagata 70980ggcattggat
agaggagttg agagaacacc ctaggactgt tattattatt attcgacacg 71040gagtctcttg
ctctgtcacc caggctggag tgcagtggcg cgatctcggc tcactgcaac 71100ctctgcctcc
caggttcaag cgattctcct gcctcctaag tagctgagac tacaggtgtg 71160tgccaccaca
cccggctaat ttttatattt ttagtagaga cagagtttca ccatgttggc 71220catgctggtc
tcgaactcct gacttcaggt gatccacccg cctcagcctc ccaaagtgct 71280ggaataacag
atgtgagcca ccgcacccag cccagaacca tttttcaatc cttggctctg 71340ccttttatta
gctgcaagat ctcaggcaat ttatttaacc tctccaaaga ctcattttct 71400cattcacaaa
atgaggcaaa taataatatc tactatccca ggttgtcatg agaattaaat 71460gcaacatgac
atttaatgaa atgagaagtc ccttggacat taactggcta aagtatgtgc 71520tcgacaagga
tatcatttta ggtggatact tagcatctca gaactgatgc tcacaatgga 71580atatcattga
aacgcattaa aattcatttt aaatgattgt aggtagtgag gcaattgaaa 71640gaagaagaca
agaggactga ttataatgct tcaggctcac tagtctcctt ttaggaggga 71700aaaacaattt
caagttaaat tttaggctct agatttttac ccctgctgct cattagaatc 71760acccagattg
atgaaatcag agcccatctg aggctgtgtt tttcatctcc agaatgagag 71820ctgttgtggg
gattaagttt ttgaaaaagt acatctaaca ggtgatcgaa aatgatagtg 71880atattattgc
agtgatggtc attattgttg ttattattat actgaaagag gcttcagttt 71940tctgatccat
aaagtgaggg aattgcatga gaccattgct aagattcctt ctagctctgt 72000ttttttgttt
ttgtttttta gacagagtct ctgtcgccca ggctggagtg caatggcatg 72060atcttggctc
actgcaacct ccgcctcccg ggttcaaatg atcctcctgt ctcagcctcc 72120gaagtagctg
ggactacagg cacacaccac catgcccagc taacttttat atttttaata 72180gaggtggggt
ttcaccatat tggtcaggct ggtctcaaac tcctgacctc aggtgatcca 72240cccgcctcgg
cctcccaaca tgctgggatt acaggcatga gccactgtgc ccaacccctt 72300ctagctttct
tgatcactga ttctagggtt ctctgctgaa atatatttga gacatcctgg 72360ataaaagatc
atgcaagagc tcccaatatg gtattaataa ttgattctgg aggcttagct 72420actcctgatg
gattagacat gactcaactg cctctcttat gtgtacaaca caacaacaca 72480accaagaaag
gttattctgg cattccattt attcagttta tttacagccc ttacttccag 72540cagcacgtta
aagatatggc cagggccggg tgcagtggct caagtctgta atcccaggac 72600tttgggaggc
caaggtgggc ggatcacaag gtcaggagtt tgagaatctg gcaattcttc 72660agacttagaa
gcaaccagct cgataacaca gtcttgtgtg ggctctccct ctgtccctcc 72720ctcgcttccc
tcatttctca tccctgcccc tgagactgtg caccttcaca tagccctgcc 72780atgagacctt
catctcaggc tttgctttct ggggtaactg aggctaaaca ctgagtggcc 72840ctaaaagagg
attgggattt ggaagttaga ttattcacca gagaacagac tttgctgatg 72900atcaggccca
ggttgtaatt gttgaaaaaa agagaggatg catagtctta tctcatctcc 72960tagtcaaagt
caacaccatg ataaataaga gtcaaatcct gagatgtgaa ttggggacat 73020ttgagtggtt
aaccctgaga agcttgcacc ttcagacccc tcaatacccc tgctccccag 73080agaaggctgg
acattgacct cagcacaggc aggagccctg caagatgcca tttgtcctac 73140taaagatgga
cccctccact ctgtttctag gtaaataacc aaagtcaagt ctccacacag 73200cctgagcaag
aaagtcagag cctgctacag gagaaaatac cacactggcc aaaggattca 73260ctagccctgg
ccactgtgtg tgggaggaac cagggaatca tgtgtgggag tcaatgttga 73320agctgttgga
ctgggggtgg ggtggaatat aagcctggcc ctggggagtt tttcccgttt 73380gagggccttt
acccacaact caagatccag tgctatagca ggagatccca gagctagtcc 73440taacagatgg
tcaggattga acttggccta gagtaaaatg aggaggatag tgccagaact 73500ttctcaacat
actattgagg aagaggtcag aaggcttaag gaggtagtgt aactggaaag 73560gggtcctgat
ccagacccca ggagagggtt cttggacctt gcataagaaa gagttcgaga 73620cgagtccacc
cagtaaagtg aaagcaattt tattaaagaa gaaacagaaa aatggctact 73680ccatagagca
gcgacatggg ctgcttaact gagtgttctt atgattattt cttgattcta 73740tgctaaacaa
agggtggatt atttgtgagg tttccaggaa aggggcaggg atttcccaga 73800actgatggat
ccccccactt ttagaccata tagagtaact tcctgacgtt gccatggcgt 73860ttgtaaactg
tcatggccct ggagggaatg tcttttagca tgttaatgta ttataatgtg 73920tataatgagc
agtgaggacg gccagaggtc gctttcatca ccatcttggt tttggtgggt 73980tttggccggc
ttctttatca catcctgttt tatgagcagg gtctttatga cctataactt 74040ctcctgccga
cctcctatct cctcctgtga ctaagaatgc agcctagcag gtctcagcct 74100cattttacca
tggagtcgct ctgattccaa tgcctctgac agcaggaatg ttggaattga 74160attactatgc
aagacctgag aagccattgg aggacacagc cttcattagg acactggcat 74220ctgtgacagg
ctgggtggtg gtaattgtct gttggccagt gtggactgtg ggagatgcta 74280ctactgtaag
atatgacaag gtttctcttc aaacaggctg atccgcttct tattctctaa 74340ttccaagtac
caccccccgc ctttcttctc cttttccttc tttctgattt tactacatgc 74400ccaggcatgc
tacggcccca gctcacattc ctttccttat ttaaaaatgg actggggctg 74460ggcgcggtgg
ctcatgcctg taatcccagc actttgggag gccgaggcgg gcggatcatg 74520aggtcaggag
atcgagacca tcctggctaa cacggtgaaa ccccgtctct actaaaaatg 74580caaaaacatt
agccaggcgt ggttgcaggt gcctgcagtc ccagcggctc aggaggctga 74640ggcaggagaa
tggcgtgaac ctgggaggtg gaggttgcaa tgagccgaga ttgtgccact 74700gcactccagc
ctgggtgaca gagcgagact ccgtctcaaa aaaaaaaaaa aaaaaaaaaa 74760tagctgggca
tggtggcgcg tgcctgtaat accagctact ctggaggctg aggcaagaga 74820atcgcttgaa
cccagtaggc ggaagttgca gtgagccgag atcttgacac tgcactccag 74880cctggtgaca
gagtgagact ctgtctcaaa aaaaaaaaaa agaaaaaaaa agacagaaag 74940aaagagcaca
gacagagtca caggtatttg cagtaggaag ctgtcaggtt agagtgcacg 75000gaaatagaaa
gtatatttta cacttacagc acatcttcgt ttgattagcc acatttaaaa 75060tactgaatag
caacgtgtgg ctatttagta ttcactaaaa tcttggacag tgcaagtcta 75120aagaatcctt
gatccgtccg gcatggtggc tcacgccttt aatcccagca ctttgggagg 75180ccaaggtgga
aggatcactt aaggtcagga gttcgagacc agcctggcca acatggtgaa 75240acctcgtctc
tactaataat acaaaaaaaa ttagccgggc atggtggtgc atgcctgtaa 75300tcccaggtac
ttgggaggct gaggcaggag aatagcttga atccaggagg cgctgcagtg 75360agccgagatc
atgccatgcc actactgcac tccagcctgg gcaacagagt gagactgtct 75420caaaaaaaaa
aaaaaaattg ttgggcgtgg tggctcacgc ctgtaatccc agcactttgg 75480gaggctgagg
ggggtggatc acctgggttc tggagttcga gaccagcctg gccaacatgg 75540tgaaacccca
tctctactaa aaatacaaaa attagctggg cgtggtggtg ggcacctgaa 75600atctcagcta
ctcaggaggc tgaggcagga gaatttcttg aacccaggag gcagaggttg 75660cagtgagcca
agatcgcgcc tctgcactcc atcctgggtg gcagagcaag actatgtctc 75720aaaaaaaaaa
aaaaaaatac ttgattgtct ggacattctg cagaacatca tatggagaca 75780ctatgttgac
gacatcatgc tgattgtaag caagaaatgg caagtgttcc agaaacacag 75840tcaagacaca
tacatgccag aaggtgagat ataaactcta ctaagattca gtggcctgcc 75900acactggtga
catttttaaa cctgctagat gtttgtgtag aaaaggattt aaccttgccc 75960aaagaggggt
ctggcctttg tccccagcta ctggacataa tctctttaaa ctcttgaaat 76020atcattcctg
atagaagtat ttttgttttg actaggggcc ttgggccagc cagatagcaa 76080caatgtgatc
tgggttgggg gctttggatc aggtggcatc agtgtgacct cctgagtggc 76140tagagactag
aatcaaccac atgggcagac aacccagctt acatgatgga attccaataa 76200agactttgga
cacaagggct tgggtaagct ttcctggttg gcaatgctct atactgggaa 76260acccattctg
actccatagg gagaggacaa ctggatattc tcatttggta cctccctggg 76320ctttgcccta
tgcatttttc ccttgtctga ttattattat tattatgaga tggaatctcg 76380ctctgtcacc
caggctggag tgcagtggaa tgatctcaac tcactgcaac ctctgcctcc 76440ccggttcaag
cgattttcct gtctcggcct cccgagtagc tgggactaca gatgcatacc 76500accacacccg
gctaattttt ttgtattttt agtagagacg gggtttcacg ttagccagga 76560tggtctcgat
ctcctgacct catgttccgc ctgcctcggc ctctcaaagt gctaggaata 76620catgtgtgag
ccaccgcgcc cagccccctt ggctgattat taaagtgtat ccttgagctg 76680tagtaaatta
taaccgtgaa tataacagct tttagtgagt tttgtgagca cttctagcaa 76740attatcaaac
ctaaggatag ccttggggac ccctgaactt gcagttggtg tcagaaataa 76800gggtgctcat
gtgtgtacca tgccctctaa ttttgtagtt aattaacttt cacaacttta 76860ttattaccgc
ttacactcaa tgtttattca catttatcca cataccactt attctagtgc 76920cttgcatcaa
agactttcta tctcatgtac tttattctgc ttgaagtaaa tcctttagga 76980tattcttttt
tttttttaaa ctttgcacat acatactttt attttttatt tatttttaat 77040tttgttattt
ttgtgggtac gtagtagata tatgtattta tggagtacat gagatgtttt 77100gatacaggca
tgcaatgtga aataagcaca tcatggagaa tggggtatcc atcctctcaa 77160gcaatttatc
cttcaagtta caaacaatcc aattacactc tttaagttat tttaaaatgt 77220acatttaatt
ttgtattgac tagagtcact ctgttgtgct atcaaatata attttttttt 77280tttttgagac
agagtctcac tcagtggccc agactgaaag tgcagtggca caagctcggc 77340tcacttcaat
ctctgcctcc ctggttcaag cgaatctcct gcctcagcct cccacatagc 77400tgggattaca
ggcacacacc accatgccca gctaattttt atattttttt agtagagacg 77460ggttttcgcc
atgttggcca ggctggtctt gaactcctgg cctcaaatga tctgaccacc 77520tcagcctccc
aaagtgctag gattacaggc atgagccacc acacctggcc aaaatagaat 77580attctttagt
gaggtctgct ggtgacaatt tttttctttt ttttgagact gagtctcgct 77640gttgtcagct
tgggctggag tgcaatagca cgatctcagc tcactgcaac ctccacctcc 77700cggattccag
caattctcct gcctcagcct cccaagtagc tgagagatta caggcaccca 77760ccaccacacg
cggctaattt ttgtattttt agtagaaatg ggggttcacc gtgttggcca 77820ggctggtctc
gaactcctga cctcaggtga tccacccacc ttggcctccc aaagtgctgg 77880gattacaagc
atgagccacc acgcacagcc aattttttcc gtttttgtct gaaatcttat 77940tttgtgtcat
ctttgaaata tatttttgat ggatataaaa ttgttggttg atagttatta 78000tcattattat
tattattttg agacagggtc tcactctgtt gcctatgctg gggtgtagta 78060atgtgatctc
ggttcactgc agacttgacc tcctagggct caggtgatct tcccacctca 78120gcctccctag
tagctgggac tacagatgca tgccaccata cccaactaat ttttctattt 78180tttgtagaga
tgaggctttg ccacatttcc caggctggtc tctaactcct gagctctagc 78240aatccaccca
ccttggcctt acaaagtgct gggccatgac tagccagcag ttacttttta 78300tagcatattg
aatatttaat atgaatcttc tggcatccac tgtaactgtt taaaaaatca 78360gctgtttact
tggcactctt tttttttttt ttttttttga gacagagtct tgccctgtcg 78420cccaggctgg
agtgcagtgg cgtgatcttg gctcactgca agctctgcct cccgggttca 78480cgccattctc
ctgcctcagc ctccggagta gctgggacta aaggcgcccg ccaccacgcc 78540cggctgattt
ttttgtattt ttcgtagagt tggggtttca ccgtgttagc caggatggtc 78600tcgatctcct
gacctcgtga tctgtccgcc tcggcctccc aaagtgctgg gattataggc 78660gtgagccacc
gcgcccagcc tctttttttt ttttttttag acggagtctt actctgtcat 78720ctaggctggt
gtacagtggc gtgatctcag ctcagtgcaa cctccacctc ctgcctcagc 78780ctgccaaata
gctgggatta caggtgcgta ccatcacgcc cggctaattt ttgtattttc 78840agtagagatg
gggtttcacc atgttagaca ggctggtctc gaactcctgg cctcaagtga 78900tctgcctgcc
ccagcctccc aaagattaca ggcatgagcc accgcacccg gccaagtagc 78960actcctttga
aggtaatctg cttcccctac ccctagcaat ttttaacaat ttttcttcat 79020ttttatttcc
tgaagttttg ttattaataa tctgtgtgca gatttctttg tatttctttt 79080gtttgcagtt
catagtgatt cttgaattag tgtgttggtt tctgttatca ccacaggaaa 79140attgtcagcc
gttagctttt caaatatttc cttgctaaat tctctcttct cccctttcgg 79200tacaattgat
ttgattaaaa ctaaaaccag ggccgggtgc agtgactcat gcctgtaatc 79260ccaacacttt
gagaggctga ggcaggtgga tcacctaagc tcaggagttc aagaccagcc 79320tggccaatat
ggtgaaaccc cgtctctact aaaaatacaa aaattaccag gcatggtggc 79380acacatttgt
agtcaggagg ctgaggcagg agaattgctt gaatccagga ggtggaggtt 79440gcagtgagct
gagatcccac cactgcagtc tggcctgggc gacagagtga gatgagaatc 79500tgtctcgaaa
aaaaaagtta tgaatgtttg ataaactata tttgttagaa tgtttgttgt 79560agaatactat
tcattgattt ttaaacaatg ttagattaaa ccattcactg gatttgtgat 79620aattaactta
ctgattttac ctcactgatt tgttgtaatt aatacaactg gtataaaaag 79680actgtgacga
ggccgggcat ggtggctccc gcctataatc ccagcacttt gggaggctga 79740ggcaggcgga
tcacctgagg tcaggagttc aagaccagcc tgaccaacat ggtgaaaccc 79800catctttact
aaaaatacaa aattagccgg tcgtggtggt gcatgcctgt aatcccagct 79860cttcgggagg
ctgtggcagg agaatcactt gaacccggga ggtggaggtt gcagtgagcc 79920gatatcgcgc
cattgcactc cagcctgggc aacaagagcg aaactccgtc taaaaaaaaa 79980aaagaaaaaa
aacacataaa acaaaacaac actgtgacgg ttcccaaaaa ttaggagcat 80040aattaaagga
actcctgata aaaattaatt ttatcttaca tgtaaactaa aatgacttta 80100tgaagttaat
tcagaaatac aatgcagggt attagtttgc cacagctgcg tattcagcct 80160aatgtaatat
tcttgttatt tttaaattct tcttttaact ttactcatat gtggatcatc 80220aaatttcaaa
agattaaatg acaatactct tagcagcaag cttccctaag catataaaca 80280ttttaatggg
tgatgattca gaaggtaccc gaagaatatg tactgccaga tatcattcac 80340ccccatatac
ctgcccgaca gacatcccat tttgggaccc tggataaatg tgtgggtgga 80400gagaaagata
ggagaaagtg gtataagcaa atggctttgg agtctgattg acagcgattg 80460aaatcctgtc
tctacctctt aacagcctca tgatcctaca taagttaccc cgatcctcag 80520ggccacatct
gtaaattggg ggttgcgatg gcagccatct cacagggtct cttttcgggg 80580aagggcagga
attatggatt aagtgagcta gtaattgtaa agcacttaat acaaggaggg 80640cgcataataa
gtacttcata aataatgacg gccattatca tgactgaggt gtatgcagct 80700gtcggggatt
acggcgactt cagaatttct ggtgggcagg gctcaaaggc agcaaatcac 80760actggaagtc
gaggtgaggc actgcttctg cacagactgc ttagctggag agaatgagga 80820aggcttagag
gagatttaga ggaacttaga gtcctccgcc tccaactctg tgggatctgc 80880tcccgtgcca
gagacattca ggggatttct cgcactctcc cctcccctac gtccctcccg 80940ccccatccaa
ctaaccacac aacacataca aaatagcccc tgcgaggttc tgcacgctgg 81000aagggaacag
gagaagggcg ctgcgctttc ttgctgatgc cctgtacttg ggcccctggt 81060agacacagcc
acttgtcccc tcagcctgca gagaaatccc acgtagaccg cgcccgggtc 81120cttggcttca
gccaatctcc ctttggtggg ggtgggatgc acgatccaag gttttattgg 81180ctacagacag
cggggtgtgg tccgccaaga acacagattg gctcccgagg gcatctcgga 81240tccctggtgg
ggcgccgctc agcctcccgg tgcaggcccg gccgaggcca ggaggaagcg 81300gccagaccgc
gtccattcgg cgccagctca ctccggacgt ccggagcctc tgccagcgct 81360gcttccgtcc
agtgcgcctg gacgcgctgt ccttaactgg agaaaggctt caccttgaaa 81420tccaggcttc
atccctagtt agcgtgtgac cttgagcagt tgactttatt tttcagtgcc 81480tagttttcca
gataccagga ctgactccaa ggactattac tcatctggag ggtttagcac 81540agtaccgtcg
catagtaaat ttccatgtca gttttggtta cctttcatgc acttgcaaac 81600atgccatgct
ctgaaacgaa ataggcacat cttttttttt ttttttttta aggagtcttc 81660ctctcgccca
ggctggagtg cagtggcgcg atcttggctc actgcaacct ccacctcccg 81720tgttcgagat
tctcctgcct cagcctcctg attagctggg actacaggca tgccacgacg 81780cccagttaat
ttttgtattt ttagtagaga cggggtttcg ccatcttggc caggctggtc 81840taactcctga
cctcaggtga tctgactgcc tcagcctctc aaagtgttgg gattacaggc 81900ataagccact
gcatctggcc agaaatgaaa taagtaaatc ttttaacctg ctctaacaat 81960atagtgaaaa
gaccatatta ttattagagc aggttaaggg atttgcctat ttcgggttct 82020agttatagtc
ttaaacttgg acattcttgt agaaagtaaa aagtttcctc ttcaaagttc 82080cccttcttgt
taaagaatac atcataagtg ttagaagtaa tagtttattt taaagactaa 82140ctttcttcaa
gcctccttgc tttgtgctaa taactctttg ttaagcccta tcctatgtaa 82200ctgttggaca
tgctcacagg cacgttccag ttcacagcct atgccccttc cttatttgga 82260aatgttattg
cttccttaaa cctttcggta agcaacttcc tctccttctt cgttcttcct 82320tgcacttacc
tatttagaaa gttttaggct attagcaaat cggctatcag tttaagagtg 82380tgaggtcccg
ctccagccaa tggatgcagg acatagcagt gaggacgacc caaatgcgta 82440agggataaat
atgtttgctt ttcctttgtt caggtgtgct ctcgacatcg ttccatctgc 82500gattgagcac
cctttctgca gaaagtaaag attgccttgc tggagatctt ttgtctccgt 82560gctgactttt
cttcgtggca ccgattatct atttctaaca attttggtat ttctaacatt 82620ctgaacaatc
ttgggctagt tgtctcttct gggcctgttt ccccatccgt cacatgataa 82680acttcattgg
tttaaaaacc ccagcgaaca tttattgagt tactattacc ttcctgccct 82740ccccaacccc
aaccccaggg agcagttaca acctcagccg ctgagcgcac tcgccgggtg 82800ttaagaagca
ccaaagacag ggaggcttga ttgattttgc tttgggagta gagggtcaga 82860agattcacag
gaaaatggca tttgagcaag gatgattcac tggagctagc ttttaaatac 82920tggcgaggct
tttatgttgc agtcccttac aaagttgagc attcgcaggg actgcactcc 82980gaaataagcc
cgcttcccct tttcattcgc taatgatcca gggagctgct ggttccgcat 83040gcggcaggtt
gtgccttttc ctaatcaggg ttctgcatcg cctcgaaccc gcaggccgtg 83100gcgggttctc
ctgaggaagc agggactggg gtgcagggtg aagctgctcg tgccggccag 83160cgcctgtgag
caaaactcaa acggaggagc aggaggggtc gagctggagc gtggcagggt 83220tgaccctgcc
ttttagaagg gcacaatttg aagggtaccc aggggccgga agccggggac 83280ctaaggcccg
ccccgttcca gctgctggga gggctcccgc cccagggagt tagttttgca 83340gagactgggt
ctgcagcgct ccaccggggg ccggcgacag acgccacaaa acagctgcag 83400gaacggtggc
tcgctccagg cacccagggc ccgggaaaga ggcgcgggta gcacgcgcgg 83460gtcacgtggg
cgatgcgggc gtgcgcccct gcacccgcgg gagggggatg gggaaaaggg 83520gcggggccgg
cgcttgacct cccgtgaagc ctagcgcggg gaaggaccgg aactccgggc 83580gggcggcttg
ttgataatat ggcggctgga gctgcctggg catcccgagg aggcggtggg 83640gcccactccc
ggaagaaggg tcccttttcg cgctagtgca gcggcccctc tggacccgga 83700agtccgggcc
ggttgctgaa tgaggggagc cgggccctcc ccgcgccagt ccccccgcac 83760cctccgtccc
gacccgggcc ccgccatgtc cttcttccgg cggaaaggta gctgaggggg 83820cgccggcggg
gagtcaggcc gggcctcagg ggcggcggtg gggcaggtgg gcctgcgagg 83880gctttcccca
aggcggcagc aaggccttca gcgagcctcg acctcggcgc agatgccccc 83940tgagtgcctt
gctctgctcc gggactcttc tgggagggag aaggtggcct tcttgcgcga 84000ggtcagagga
gtattgtcgc gctggttcag aagcgattgc taaagcccat agaagttcct 84060gcctgtttgg
ttaagaacag ttcttaggtg ggggttagtt tttttgtgtt tctttgagga 84120ccgtggatca
agatcaagga aatctcttta gaaccttatt atggaagtct gaagtttcca 84180aatgttgagg
gttttatgtc taaaagcaac acgtgaaaaa attgttttct tcacccagtg 84240ctgtcttcca
atttcctctt tggggggagg ggtagttact gctgttacta aaataaaatt 84300acttattgct
aaagttcccc aacaggaaga ccactacttt tgatgacttt ggcaagtttg 84360ctaactactg
gaaccctaac ttacaaacga actacttaca tttttgattt ccagttgtat 84420tacctgccca
atgtttacgt agaaacagct taattttgat tctgggtaac gttgttgcac 84480ttcattaaaa
atacatatcc gaagtgagca agtatgggtc tgtggacagc agtgattttt 84540cctgtcaatt
cctgttgctt cagataaaat gtaccagaca gaggccgggc gcggtggctc 84600acgcctgtaa
tcccagcact ttgggaggct tggcgggtgg atcacctgag atcgggagtt 84660caagaccagc
ctgaccaaca tggagaaacc ccgtgtctac taaaaataca aaattagcca 84720gggtggtggc
gcatgcctgt aatgccagct acttgggagg ctgaagcagg agaatcgctt 84780gaacctggga
ggcggaggtt gcggtgagcc gagatagcac cattgcactc cagcctgggc 84840aaaaagagcg
aaactccgtc tcaaaaaaaa agtaccagac agaaatgggt tttgttttct 84900ttttttgttt
tgagacggag tttcgctctt gttgcccagg ctcgagtgca atggcgcgat 84960ctcagtctcg
gctcactgca acctctgtct cccaggttta atcgattctc ctgcctcagc 85020ctcccaagta
gctgggatta cccatgcccc accatgcccg gctaattttt gtatttttag 85080tagaaacggg
gcttcaccat gttaggctgg tcttgaaccc ctgacctcaa gtgggcctcc 85140cacctcggcc
tcccaaagtg ccaggattac aggcatgagc caccgcggcc agccagaaat 85200gggttttgga
aaaagcacta aacaaaatcg aacttggttt catatgacag ctctgctgct 85260aactgtaaca
ggggcagacc agttaaccta cttttctgtc ttctgtcagc tgagaattag 85320atgattccca
aaggcccatt gaactctgaa tgactttaaa tacttcttct taagtgggta 85380cacggttttg
gtaactgatg ccaggtgatg aatgcatgaa agtgcttaat gaatgaaacc 85440ggtaaaatag
taggaggaag ctttattggt aaggcagggg tatacctaat agctctctaa 85500tttattggta
ttgaagtggt taacttttgt ttttttaagg ggggaaaaca ttctaagaat 85560aatgaggcaa
actgcatatt gcacaagaga ctgttgtctc tattcaacaa ataccttttg 85620agtgtccaga
gtctgccagg tgctgtgcta ggccctcacg attgagtagt gaaccagaga 85680atgtccctgc
acccatggag cttattgtct actggggtag acagataata aataagcaaa 85740caaatcttct
ctcttctccc tttcgctcca tgtaagtgtg tgtgtatagg tgtatactta 85800caagttgagt
aaagtgttat gaaagattaa gaggagaaat gcattttggt tagatgttag 85860aggactcagc
aggtgacctt gaaacttaga gctgaaggat cagtaggagg taactagaga 85920ggccagggaa
tcgcatgttc aaaggccagg aggcaagaaa gagcatggtg cccttcaaga 85980gaggaaagaa
ggctactgtg actggagcat agatgtaggc aagtgttggg tgattgagag 86040ctctacgggc
catggttagg ttttattcct aatgccgaga tgccaaacat ggtggttcat 86100atctgtaatc
ccagtatttt aggaggccga ggcaggaata tagcttgaac ccaggagttc 86160aagaccagcc
tgagcaacat gagacctgta caaaacattt aaaaaattgc tgggtatgat 86220ggtgcacacc
tgtggtccca gctactcagg aggctgaggc agaaggatca cttgagccta 86280ggaggtggag
gctacaatga gccatatttg agtcactaca ctccagcctg gatgacaaag 86340tgagaccatg
tgtcaaacaa aatacagaaa gaatattaat ttaaaatttt gaaagaggag 86400tgatctgaac
ttatatctta aaaagatcat tctagggcat ggtggctcat gcctgtaatc 86460aagggctttg
ggaggctgag acaggaggat cacctgaggc cagttcgaga tcaacctgta 86520cagcatagag
agactccatc tctacaaaaa gaaaaaataa atagctgggt gttgtgagtt 86580attcaggagg
ctgaagcaga aagatcactt gagcccagga gtttgaggct gcagtaagct 86640atgatcccac
cactgcaaca cagtgagatc ttgtctcaaa aaaaaaaaaa aatcattcta 86700ggtgcttttt
ggaggctgga tgtggtaaga gtagaagctg gagatggtcc tgttagggat 86760tcgattcaga
ctttaaatac catcaatgca ttgagtccca aatttacatc actacgttgg 86820atccttgccc
ctgaatccag actggtatat ccaactttag gttcagtttg tatctctacc 86880tgaccaatat
agaggtgtcc agtcttttgg cttccctagg ccacattgga agaagaattg 86940tcttgagcca
cacatagagt acactaacgc taacaatagc agatgagcta aaaaaaaatc 87000gcaaaactta
taatgtttta agaaagttta cgaatttgtg ttgggcacat tcagagccat 87060cctgggccgc
gggatggaca agcttaatcc agtagatacc ttcaacttac aatatctaaa 87120attttatgcc
agatttagtc attttaaacc tgctcatcag tttttctcaa gaagtagtat 87180tttggctttt
tttcttttct tttttttgag atggagtttc gctcttatcg ttcaagctgg 87240agtgcagtgg
cggatcttgg ctcactgcaa cctccgcctc ctgggttcaa gtgattctcc 87300tgcctcagcc
tcgcaagtag ctggaattac aggcatgcgc caccatgacc agctaatttt 87360tggagacagg
gtttcaccat gttggtcagg ctggttttgt actcctgacc tcaggtgatc 87420tgcctgcctc
ggcctcccaa aggctgggat tacaggcatg agccaccgct cccggctgca 87480tttttggatt
tttagttgct cagcccaaaa ctttagtaca tctttgaacc tcttctttcc 87540tcctactcta
tatctgatcc atcagcaaat ctgttaggtc tacctcacac atatcgaaat 87600cctaccacgt
ctcaccatct gtgacaatta acaccctggt ctaggcagtc atctctgtta 87660agattgagtg
gttaaggatg tcctctaagg agatgacatt caaatcttag cttaaatgtc 87720aagagggagc
tggttttata aagattgagg aggcagcatt attttgccat aggcttccat 87780ttggtttcca
ttccattctt gatacttatg gtatatattc aaaacaaatg cacagaaaca 87840gacccaggta
tattgggaat ttcggatata gagttcctag ttgggaaaag atagactgat 87900ctgtaaatga
tgctagttat ccatcatctg gcaaaaaata atttcctgcc tcctctcata 87960tatctcagat
caacagactt tttctgttaa gggccaaatc ataaatattt taggctttcc 88020agaccatatg
gtttctgtca cactctcctt tatccttgaa gccatagaca atatgtaaac 88080aaatgggcat
ggctgtgcta cgataaaact ttacttacaa aaactggtag tgggccagtt 88140taggcatggc
cagcactttg ggaggctaag gcagatggat cacttggggt caggagtttg 88200agaccagcct
ggccaacatg gtgaaaccct gtctctacta aaaatacaaa aaatagctgg 88260gcatggtggt
gggtgtctat aattccagct actctggagg ctaagacaca agaatcactt 88320gaacccagga
ggcagaggtt gcagtgagct gagatagcac cactgcactc cagccagggt 88380gacggagtct
taaagcaaaa caaaacaaaa ggtagtgggt tgtatttggc ccatgggctg 88440tagtttgcca
atccctgatg cagaaacaaa ttccaggtaa ataagagcct ggaatgttaa 88500aaaaacaaaa
cttgaagtca tgtagaagaa caggtagggg gaacaatcct gatctcagga 88560taggaaggga
tattgcttaa aataagacac aggaaaatat aatccatgtt gtgtaaattt 88620gactacgtta
aaacttaaaa ctttcgccaa gcgcggtggc tcacgcctgt aataccagta 88680ctttgggagg
ccgaggtgag cagatcacca ggtcaggaga ttgagaccat cctggctaac 88740acggtgaaac
cccgtctcta ctaaaaatac aaaacattag ccgggcgtgg tggcgggcgc 88800ctgtagtccc
agctacttgg gaggctgagg caggagaatg gcctgaaccc gggaggcgaa 88860gcttgcagtg
agctgagatc gcgccactgc actccagcct gggcgacaga gtgagattcc 88920gtctcaaaaa
aacaaaacaa aacaaagcaa aaaacctaaa actttcatac aataaagtat 88980acctaagata
cttctagaag agaagattta catccaggac gtgtatggaa tttctgcaag 89040taataagtaa
aagacaaggg acatgaagag gcagttcaca aaagaggaag ccaaaatgac 89100caataaacat
gaaaggatgt ttaacctcaa aggaaacaag gaaatgaatt aaaaacatca 89160aatgccattt
caaaactagt aagttggcaa aattaaaaat accaaggatg agaatatgaa 89220gcatggctat
atgagtgcat ggaatggtac agtcactttc attaaaaatg cacataattt 89280gttttttatt
tatttttttg agacagtcta tgtcgcccag gctagaatgc agtggcatga 89340tctcggctca
ccacaatctc tgcctcctgg gttcaagcaa ttctcctgcc tcagcctcct 89400gagtagctgg
gattacaggc acatgccaca acgcccggtt aagttttgta tttttagtag 89460agacagggtt
ttgccatgtt ggccaggctg gtctcgaact cctgacctca ggtgagctgc 89520ttcccaaagt
gctgggatta gaggcgtgag ccaatgctcc tggctgaaaa aaatgcacat 89580aatttgttac
ctagcaattc catgtctaga ggcttatcct agagaaattc ttgcttatat 89640gcataggaag
acgtgtacta gaatgttcac tagttgaatg tttaagtgaa aattaggaaa 89700taaagtaaat
gttcattaac aggaaaatga gtaaaggtat atttataaaa caattaagta 89760gctaaaatga
ataaactaga gctgcgtgaa tgaactagaa ctggttcaat agtcatgtca 89820gattattgaa
tgaatacagg tcagatatgt atagagtgtc atttgtgtaa ttaatttttt 89880tttttttttt
gagatggagt ctcactctgt tgcccaggct ggagtgcagt ggcgtgatct 89940cagctcactg
caacctccac ctcctgggtt aaagtgattc tcctgcctca gcctcccgag 90000tagttgggat
tacaggcatg caccaccatg cccagctcat tttcctattt ttagtggcca 90060cagggtttca
ccatgttggc caggctggtc ttgaactcct gacctcaagt gttccaccca 90120acttggcctc
ccaaagtgct aggattacag gcgtgagcca ccgtgctcag ccatttgcgt 90180gatttttaaa
gatgtgcaga ataatgccat taaaaaaaat acacatacat gtatatatat 90240acacgtttgg
ctgggtgtgg tggctcacac ctgtaatccc agcactttgg gaggctgagg 90300caggaggatc
acttgagccc aggtgtacaa gactagcctg ggcgagatag caagacccca 90360tctcaacaac
agaaaggata attaggtatg gtggcatgag aggatcactt gagcccagga 90420gttcgagtgt
tatcaggcca ctgcactcta gcctggacaa caaagcaaga ccgtgtctca 90480aaaaaataaa
aataaaaagt atttgtatgt ggtcatagtc aaaaaacgta catggaagga 90540aaatgtcttt
atttatttat ttattttttt ttttttaaga cagagtcttg ctctgtcacc 90600caggctgggg
tacagtggtg taatctcagc tcaccgcaat ctcggcctcc cgggttcaag 90660cgattcttct
gcctcagcct tctaagtagc tgggactaca ggtacccgcc accacaccct 90720gctaattctt
gtgttttcag tagagacagg gtttcaccat gttggcaagg ctggtctcga 90780actcctgacc
ttaagtgagc cacccgcctt ggcctcccaa agtcctggga ttacaggtgt 90840gagccactgc
gcttggccag gaaatatcta atttagtaag tatttatatc tgggaaagga 90900agggtcaggt
ggtgattcat aggaactcta aagtctatgt ataatactta gggggacaga 90960aggaaataaa
gcaaaatgct gatatttgat tgttgagttg tgtatatgtt agaagtataa 91020cataggagat
ctgattgata gtaggagaat gtttttaggt ggtaaaagtg gaaccgtggt 91080ggtttgtttt
ggcagtagaa tcagttggtc atagtttgta tgtggaaggt aataaacaga 91140ccatgttaag
gatgacttcc ggaattttgg tctgagtagt gggtggatga cagtgtcatt 91200catgagggaa
gatgaagact gaggtaggaa caggtttggg agaagatgac atgttccctt 91260ttagacaagt
ggaattatgg aagatggcag gtaggtggtt agctatatga atttgagata 91320aaagatttag
gatggagata taaatttagg agtaacagcg tatctatggt attgtaagcc 91380ttaagaatgg
gtaggatcag ccaggaaata cagatgtata tgcagaagag aggagtcaag 91440gaagccaaga
caagttaatg tttaaagtga gtgatgtagt ccatgggcag atgctgctga 91500gagggctgca
aacaccagtg accctacaac atttttaaat gtcgtcttcc tgacagcagt 91560gatcagtacc
tgcaacgatc ttatttattt ttttcatgtt agtctccaca cacttgaatg 91620tagacttttt
gaaggcaaaa tcattgcctt ttctgagctg ggagcatgtc tggcacatac 91680caagcactca
acagttgatg tattgacttc atccagatac tctgagggcg agttatttcc 91740tgctactagc
ctttcacctt tcaatgttta agagcacaaa tacagagatg ggcacgtttt 91800ggcatttctt
attttgataa ccttttcctg gtaagatttt ttaatgttga aaaaaaaaaa 91860caagaaaaga
gggttaaaaa tagtcttatg tcagatcctg tgatagaatt cacacttggc 91920ttaagctgct
gggcaccttc ctatcttgga tgtcatatta gcttatctac agcagaattt 91980ttactgtttt
atgtagtaag gaagcaatta tatgattatt ttacagacaa attattcttt 92040atcttttatt
tttttagacg gagtctctct ttgtctccca ggctggagta cagtgtcgcg 92100atctcggctc
actgcaacct ccgcctcctg ggttcaagca attctctgcc tcagcctccc 92160aagtagctgg
gcttacaggt gtccgccacc acacccagct cattgttttg tatttttagt 92220agagatgggg
tttcaccatg ttggccaggc tggtcttgag ctactgacct caggtgatcc 92280acccgccttg
gcatcccaaa gtgctggaat tacaggcgtg agccaccgtg cctggcccag 92340acaaattatt
atactctgag tgttagaggc ttaggatgtt ttcacttgat gctatgggag 92400gaataagtaa
taagatatga tacacaacca aagacctttc ttcactatgc ttctagtagc 92460tagtactatg
gatgacacat ggtaataata ttggttagca tttgtcctca atttactgtg 92520ctagttactc
ttctaagccc cttacaggta tatatttttt ttcatcaata atcctctaag 92580gtagttttta
ttattgacct aattttataa atcaagaaaa ttaagaccca gagaagtaag 92640taacttgtcc
aagatcacat ggcttataag tggtagagcc agaatttgac cccagatgtt 92700gtgactacat
tgtctctcca taagcaggtt caactctttt gactggatgc tgttccaagg 92760tcacttcctt
agagaagcct ttgctgacaa ctaccctcct gtgccctcct ccaaggctgt 92820ccattgttct
agaactttga atactcatct tagaataaag ctggtctaat ttttacagtg 92880ttatagaatg
gatctctgac tgcaaaagtt ggtcataatt atctttttat gttctagtga 92940aaggcaaaga
acaagagaag acctcagatg tgaagtccat taaaggtaag ttctgccctt 93000ggcagtccac
tgcattaaaa agtgatgtgc tttgcatttg tgagttcttt aatcctgtta 93060tactctctct
tttggcatta atcatttctg ccttatttta taattactta tgattttgat 93120ttatttccct
ctttaacctg tataatgctt taacatctag catataataa gtaggctttt 93180tttttttttt
tttttttgga gacggagtct tgctctgtta cccaggctgg agtgcagtgg 93240cgcgatcttg
gctcactgca agctctgtct cccgggttca caccattctc ctgcctcagc 93300ctccccagca
gctgggacta caggtgcacg gcgccacgcc tggctaattt tttgtatttt 93360ttagtagaga
cagagtttca ccatgttagc cagtatggtc tcgatctcct gaccttgtga 93420tccgcccgcc
tcggcctccc aaagtgctgg gattacaagc gtgagccacc gcacccggcc 93480gtaagtaggc
tttttttacc ttaattttat ttttttgaga tggagtcttg ctcttatccc 93540caggctggag
tgcagtggtg ccatctcggc tcactgcagc atccacctcc cgggttcaag 93600cgattctcct
gcctcagcct cccgagtagc tgggattaca ggtggccgcc accatgccca 93660gctaattttt
gtatttttag tagagacagg gtttcaccgt gttggccagg ccagtctcaa 93720actcctgacc
tcaagtgatc cactcgcctt ggcctcccaa agtcctggga ttacaggcgt 93780gagccaccat
gcctggccat aagtaggctt ttactgagcc ttgtgtgtat tggctatcct 93840agtgattaca
gtgaaccagt gcccttctta ttaatcacac atttaattgt tccctaaaag 93900tgattagttc
actttattta tttagtaaga caaaaaatga agaatactct taactgagca 93960gtctgttaac
tgtaggaaag cactgacact tataaggctt agttttctgt catttatcca 94020gaagtatggt
tgattacagt ttttactttt ttatttgaat gaacaacctt aatttaaaat 94080atattttgtt
tattttttgt tgggatcgat acattgtcct tgtttataga ttagagcatg 94140ctttttaaag
atgctgtatt actcactgat tttatttgtc cagtgtacag agattgaagt 94200gggaaaatta
taatggaaat tgtttccata gtcattacat attaatttca tcaatttatt 94260tccataaaat
ctgtagattg ctacttattt agatttttcc ttcaaatgtt tttatgttgt 94320attgcttgca
ctgagtattt attctatatg ctcaatttgc tggagaagaa gactaattat 94380aacttaggca
agttgtaaaa ttagggaaaa aagtaaggta ccttacagcc tagtttactt 94440atttcttatg
taaagccagt tagattccac attagttcaa actgccttct ttgagcaaaa 94500cttgattggc
agtgataaag gcttaaagcc cttctcaagc agagacctgt aaagactaga 94560tctgactgta
gtagaaggaa ggaacttaga tgtttcaggc agtgagaaca ccagtcttcc 94620actctaaact
ttgccactaa cagtatgacc ttgggaagtt gtaactttct tcagattctt 94680catttgttga
atggggggat tggcctagct aatttctaaa tctctactgg gctaaaaaat 94740tctgtgctta
tactctgatt atgaagtaca taatctgtgc ttaacattca ctgacttatc 94800cttaggataa
tacagaagca gtacaagaaa cagcccctca agatgtttgc agtctggtta 94860gaaagacaaa
cttatacaca gaacagtagc aaatagacca aaataataat agctgccatt 94920tatagaacac
ttcttctgtt ctgggcatta gacaaaaact gactataacg gtgaacaaaa 94980aagacttagg
tcctgccctc attgaactta cagattagta ggggagagga acattaatca 95040agtaattcca
cagatggctt agcctagatt ggtagtgatg gaagtaaaga gatgtgaacg 95100gacttgaaaa
aaaattcgga ggcaaaatgg atagaagttt attattgatt aaatatgagg 95160tgtgagagag
agggatattt aagattgata cctaccttct ggcttgccta acagaaccaa 95220aacaggaaat
tatatgttca gttttgttat gttgggtggg aggtgctttt gagtcattca 95280tttatatatg
ttatatatgt tattttatat gcatagtaat tttaaggtct gagttttaaa 95340ccaaaggtta
gagagtgatt ttttagagtc tagcaaacct aagttgaaat cctgcctgtt 95400gaaatggctg
tttactagct cattaaccta gggcaaagta ttcaacttgt tttcattttt 95460gtcttcatct
ctaaaatgag gaaaatatgg tcttacaaga ttgtcctgag agatagatga 95520aataatatcc
aaaaaaaaaa aaggtacata gagaaactcg tatagtgcct ggtatatagt 95580aggtcctcca
ttggtagcta tcattatcta gttttaacat agccttcagt ttgttgaatt 95640agtcaaactg
agtgaagcac tgcaaggaat tcagaggaat ttgagatcaa caaatgattt 95700ctgaagttta
gggaagactt catggcaatg acacttacct tgtataaaag ttgaagaata 95760agaaagattt
gaatgagaga ttctttctct tctccctacc agcccagctt cttatttgag 95820gatatattgg
gcaaaggggc cttcagacaa gtagagggag atttttacag aaagattgag 95880atgaaggtat
agaaggctgt aaagaccaga aaagagaatt gagacagagg aagcaggaag 95940ccactgtagg
tttttgagca agatattgat gctgtaagta tggtgtttat gaaaggttag 96000tctggaagag
atttgcagga tggagacccc ggaagttttt ttgttataat acagaaagac 96060ttgcactgag
ggtgaggtgt taaaaataaa caggtaagta aatgtttaaa catcttgaag 96120gaaaagtcaa
caaatcttgg caagtaaaca gataacagtg aaaaagaatg ggaccaagat 96180tttgagtttt
ggagactggt ggattgaaca gacagggaaa ttgagaggag aatcagatga 96240tgatgtttta
agttgatatt tagacagatt gtgcttgaga tggtaaagtc aatgtgggtg 96300ggaatgctta
gtagcgagta atcagtgata caagaccaaa gcccaggtca aagacaagtc 96360acagatacag
atcagggctt tttcatctgc tccacagagg tgtaccctag gagctgttgc 96420aaacagtcca
tgtggagggt gtgagtaaga tgtttccctt gaatttgcca gaattacttt 96480tttgttgttg
ttgttgtttt ttctgagaca gattctcgct ctgttgccca ggctggaggg 96540cagtggcgag
atcgcgcagc tcactgcaac ctctgcctct cgggttcgag tgattctcct 96600gcctcagcct
cccaagtagc tgggattaca ggcttgtgcc accaagccca gctaatttct 96660tttgtatttt
tagtagagat ggggtttcac catgttggcc agactggtct cgaactcctg 96720gcctcgtgat
ctgcctgcct cagcctccaa aagttctggg attacaggcg tgaaccactg 96780cacccggtcc
cttgttaagt ttattttggt gggaagcaaa ggaggtttca gcttttaaaa 96840agtttgaaaa
ttattgctct ggtaataatt aaagatttga gagtaaatat gctttctagc 96900agaaagaata
aaagaagaac agatagcctc aagaagggga gccaaagaag caggctatat 96960ctgacacact
gggtgttgat aaatgggtat taaaagaatg agagcaatga gcagatagaa 97020gaggaaatta
ggagagtata ataccatgga gaccaagaaa gatagactat caggaaggag 97080tggtaaaaat
aagttactag ttctaagaga gatgttaaga gggaccgggg aaagccttgt 97140acaaatgagt
tagtagcatt ttacattata tacatctaat taagaaacaa tgcgagagtc 97200tcaccattcc
tatagactct tacttgtact tgtctgaaca cgaaaactgg cttttgttta 97260taaataagct
aaaaattatt ttgctccaat ttctcatgaa aataaaaata aaccttcttt 97320taacattgaa
aaaatagttt gaagacagtc actcttcatt ttgtaattcc cacaactatt 97380attgaatgac
tgaaattatc tttattctga agccaaaggg gtgatactga tatttcttca 97440gactactaaa
aatatatttt atgaattttt agtgtgcttt atcttttttt gttttttttt 97500ttgagatgga
gtttcactcc cgttgctcag gctggagggc agtggtgcaa tctcagctca 97560ctgcaacctt
cgcctcccag attcaagcaa ttctcctgcc tcggtctccc aagtagctgg 97620gattacaggc
acctgccccc acacccagct aattttttgt atttttagta gagacagggt 97680ttcaccatgt
tggtcaggct ggtcttgaac tcctgacctc aggtgatcca cccaccttgg 97740cctcccaaag
tactgcgatt gcaggcatga gccaccatgc ctggcctgag gaatattttt 97800ctaggttccc
cccaccccaa gcatttattc tgcaatttta gttttgttcc taaagcaagc 97860aaggtttaag
gatttaaaaa taatccgtat tttagaatgc tttctggctt tgttactttt 97920tatccacagt
agaagttctc agagaatgat ctccctcttt taatttaact ttttggcaca 97980gtattttgag
aattataaat aatattagaa tgttttctgg ctgggtgtgg tggctcatgc 98040ctgtaatcct
ggctacttgg gaggctgagg caggagaatc acttgaacat gggaggcaga 98100ggttgcagtg
agccgaggtc atgccactgc actccagcct gggtgacaga gcaagactct 98160gtctgggaaa
aaaaaaaaaa aaaaaaagag tgttttcttt cctattttcc accacttgat 98220taagttactt
ttcctcttaa gtattttttg ctgagtatgc tgacttaaga gtaatgttac 98280aaaatttaat
ttttaaagtt ctctgaaagc ccctttatga gagttttagg ctatcaaatt 98340gtgtttaatt
cttaacaatt ttttgaaaaa ttatagcttc aatatccgta cattccccac 98400aaaaaagcac
taaaaatcat gccttgctgg aggctgcagg accaagtcat gttgcaatca 98460atgccatttc
tgccaacatg gactcctttt caagtagcag gacagccaca cttaagaagc 98520agccaagcca
catggaggcc gctcattttg gtgacctggg taagtaacta tcatttttta 98580ttaacttgta
ttagaaggat ttgagtacaa tatgtgaaac ttctgtcata ggatacagaa 98640ctatataatt
ggaaagtgct ttggaaaaaa tgtatttaaa ataacagcta caagtataat 98700gggtagctgt
gttgtgttcc tgtaaatata gaatataaag catgcccagt agaaaaacaa 98760gcatttccag
aagaaatata tctgatcact aaatataaat atatgaaaaa gatgtctcac 98820tttattactg
agggaagtgc aaattaaaat aatcagttaa tgttctccta acacattagc 98880atatttttta
aagtttgaca atttgaatgt cagtgaagat gcagggaaat acccctccta 98940tttagtgata
atataatctg gtgaagactc tttggaaagc aatttggaaa tcagtataaa 99000atatgcatgt
catttaggcc actctttcta agacctagcc ctcagatatg ctcattcata 99060tgtgcaggtg
tgtatgtgtg tgtgtgtgtg tgtgtgtgtg tgtatatgta tgtatgtatg 99120tatgtatgta
tgtatgttga aggctattca ttatagtatt gtttgtgata gcaaaaaatt 99180atggacaaca
tataaatatc tgttataggg aaataaccaa attgtggtat acgcatgctc 99240tggagtataa
tatagccatt tgtttctatt tatttatttt cttgagacag ggttttactc 99300tgttgcccag
gctggagtgc agtggtatga tcatggttca ctgcagcctt cacctcctgg 99360gcacaagcca
ttctctcgcc tcagcctcca gagttactag gactgcaggc atgtgtcacc 99420acacccagat
aattttttaa ttttttgtag agacagggtc tcactatgtt gcctaagctg 99480gtctcaaact
cctggcctca agcaattctc ccacacaggc ctcccaaagt gctgggatta 99540ccaacgtgaa
ccaccacacc tggttcagtg tagccattta gaaatctaaa aaagacgtgg 99600gaaaatgtct
aaggcatgtt taaatgtgag aaaagcaagt cacagtatgc atggtaaaat 99660ccgttatatt
aaaataagtt cttccaaaac aaaaacatat gcaggagacc tttattttgt 99720cagtatttct
tacccaaatt tctgcactta gaaaattgca tgtcatgttg tcataagttg 99780aaaaaaagat
ccatgaacca atggacttct aataaaatca gtcctgcttt tgacatctct 99840ctctactttt
gtgtatattc aaaccagagt gtcaatgtgt ttgtggggca cacttagcaa 99900taatacatag
cagacaaaat gcatatagct cagagagtaa aattgtaagt tttgctagat 99960cactcataaa
ttgctgatga gaatttaaaa tggtgcagat gctctggaaa acaggcagtt 100020tctttctttc
tttttttttt tctttttgag acagggtctc actctgttgc gcaggctgga 100080gtacagtggc
gtgattacaa ctcactgcag cctcaccctc ctcaggttca ggtgatcctc 100140cctcagtctc
ctgagtagct gggactatag gcatgcacca ccacgcctgg ctaatttttg 100200tatttttttt
tttttttttt gtagagacgg ggtttcgcca tgtttcccag gctggtctca 100260aactcctgga
atcaagcgat ccacttgcgt aggcctccca aagtgctggg attacgggcg 100320tgagctactg
tgcctggcct aggcagtttg tttgtttgtt tgtttgtttg tttatttatt 100380tgtagacgga
gtctcacagg ctggagtgca gtggcccaat ttttggctca ctgcaacctc 100440cgcctcccag
gttcaagcta ttctcctgcc tcagcctcct gagtagctgg gatgacaggt 100500gcctgccata
atgcctggct gatttttgta tatttagtag atatggggtt tcaccatgtt 100560ggtcaggctg
gttttgaact cctgacctca ggtgatcagc ccgcctcggc ctcccaaagt 100620gctgggatta
caggcatgag ccgtcatccc tggctggtgg tttcttatga cgtgaaacat 100680gcaattacca
tatgacctag cagttgcact ctgtatttat cccagataaa tgaaaactta 100740ccttccaata
aaaacctgtg cacaaatgtt catagcagct taatattgaa aaactggatg 100800ttcttcagca
ggtgaatgaa ctggttcatt cataccatgg aataccattc agcaataaaa 100860aggaacaaac
tgttgataca tttaaccacc tggatgaata tcaagggaat tatgctgtca 100920gacaaaaacc
agtccctaaa gactacatat agtatgattc cgtttggata atattcttga 100980aatagagaaa
ttaagagaaa tgaaaagatt agtgtttgcc agatgttaga gacagggagg 101040tgagaggggt
aagtgggtgt agttataaaa gtgcaacatg agggatcttt gtgatgttga 101100agttgtatct
tggcagtgga tgcagaaatc tcaatgtgat aaaattacaa agaactaaaa 101160acaagaatga
gtatagataa aactggggaa atctgaacaa gttagagtgt tgtatcactg 101220tcagtatctt
agagtgatat tgtactatag ctttgcaaga tgttaccatg ggagaaacta 101280aagtgtacaa
gggatctcta ggtattatta tttttttaga gatggggttt cactatgttc 101340cccaggccgg
tcttgaactc ctgggctcta gtgatccgcc tgccccagcc tcctaaagta 101400ctggaattac
aggcgtgagc gaccatgcct ggccctttca gtattgtatc ttagaacttc 101460atgtgaatct
agcattatct catagaattt aattaaaaga aattgtaaac ctcacagaag 101520atcagaattt
cctcaagttt gtgatgttga caaagatgaa ctagttgaca ctgacagtaa 101580gactgaggat
gaagacacga cgtgcttcaa aaaaatgatt tgaatatcaa tggattaaga 101640agaactcttt
tgacaaattg atgaaaccct cagtcagttt tataagaatg cccatcttta 101700tgatcatgct
atgaaagcca atttttaaaa aaattttttg tctttcctaa caattagctt 101760gtggttataa
tttaaattta gttaaatata agataaatga ttttttatta agtttagttt 101820catttttcaa
ggtacgatct caaagctact ctttaaccta ctatgaatga ataatgctga 101880gttcataaca
tctttgtaga tatatccaca attttccctc aggataagtg cctacaagtg 101940gaattactgg
actgaaaata atgcagtttg ctaagacttt gctatctgtt cctgaatgct 102000cctccaaaaa
ggttttgcca gtttacatcc tcatgaccag cgaatgagag tgttgcctat 102060tttcctgtgc
ccttgttact gcttaataat ttttgaaaaa aatctaattt gacagacaaa 102120aatgcatttt
atgttaattt gcttttctgg gatttttaat gaggttgagt atagttttta 102180atatttttat
tggccccttt ggaactagta tcataagttt tttttcttaa gaatttatgt 102240agtctgggct
gggcgcagtg gctcacgcct gcaatcccag cactttggga ggccgaggtg 102300ggtggattgc
cgaaggtcag gagtttgaga ccatcctgac caacatggtg aaaccgaatc 102360tctactaaaa
gtacaaaaac tagctcagcg tggtggcggg tgcctgtaat cccagctact 102420taggaggctg
agtcaagaga atcgcttgaa cccgggaggt ggaggttggt tgcattgagc 102480cgagatcgcg
ccattgctct ccagcctagg caacaagagt gaaaagtctc aaaaaaaaaa 102540aaaaaaaaaa
aaaaaagaat ttacatggtc tgaattgcca ttaaaagaga tatgagaatt 102600attgagtaac
aaataacttt ttaataattt aggcaagttt tggacgattg tactttgttt 102660agaaaccaaa
agcatagtat ttgtagtttt tttatttact ttagttgcta ggaagtaaac 102720tttattcaag
gtctctggta ccagttgttg ctaaaagtga ttgactaatc tgtcaatctg 102780aaattatttg
ttgctgaact gctaattctt ttgcttctat cttttaggca gatcttgtct 102840ggactaccag
actcaagaga ccaaatcaag cctttctaag acccttgaac aagtcttgca 102900cgacactatt
gtcctccctt acttcattca attcatggaa cttcggcgaa tggagcattt 102960ggtgaaattt
tggttagagg ctgaaagttt tcattcaaca acttggtcgc gaataagagc 103020acacagtcta
aacacagtga agcagagctc actggctgag cctgtctctc catctaaaaa 103080gcatgaaact
acagcgtctt ttttaactga ttctcttgat aagagattgg aggattctgg 103140ctcagcacag
ttgtttatga ctcattcaga aggaattgac ctgaataata gaactaacag 103200cactcagaat
cacttgctgc tttcccagga atgtgacagt gcccattctc tccgtcttga 103260aatggccaga
gcaggaactc accaagtttc catggaaacc caagaatctt cctctacact 103320tacagtagcc
agtagaaata gtcccgcttc tccactaaaa gaattgtcag gaaaactaat 103380gaaaagtgag
tatgtgattt tcttgtgtgt acatatgtgt ctcactttct ttttttaatt 103440tactaagcag
aacttcagat gaggaataaa atgattggaa tatttttttt ctcctctaac 103500tacttgtaaa
tttgggagaa tttggagagt gtagtagagt cagatcagtg tatggaaaag 103560gagcaggagt
gactggacct tctaagaagt gtgttatcag aattagtaaa tgaagggtca 103620aatgtcctac
ttttcccctc cactgatttt gacatcaaac cattatccac atagccttat 103680ttcctccctc
ggtcttaatt ttattaatat tttactgcac tttgcagata aaatttttaa 103740aaaattttta
aaaattgcca ataagtgaca tttattaagt tcagtgctta gtgtatattt 103800ggattttatt
tattagtcac aagacctttg tgcaggtagt aggcatgatt atcttttttt 103860ttttgagatg
gagtcttgct ctgtcgccca ggctggagtg caatggcgcg gtctcggctc 103920actgcaacct
ccgggttcat gccattctcc tgcctcagcc tcccaaatag ctgggactac 103980aggcgcctgc
caccacaccc ggctaatttt tttgtatttt tagtagagac ggggtttcac 104040catgttcgcc
aggatggtct cgatctcctg actttgtgat ccgcctgcct cggcctccca 104100aagtgctggg
attacaggca tgagccaccg cgcccggact gattatctta tttacacatg 104160agaaaaccag
ggcttagaaa ggttaggtaa cttcctctag gttgtacagt aaatgtggac 104220ctagaagcat
tttgacaaga gcacctgttt ttttttcttc tctattagtt tagaaattat 104280atactcttaa
ttatcacctg ggattttgat tagacagcct tcatgttctt tttcatctta 104340aatgttcttt
gtgtcttaaa gggctaagtg atttcttcag atcttttagt tcactcattc 104400tcagtgaact
aaaatgaggt ctaatctgct actgaatcaa gttttcagca tgttatttcc 104460ttcctccctc
cctccctcct tccttccctc aaccaggctc ccgaggagct gggattacag 104520gcgcccgcca
ccactcctgg ctaattttta tattttagta gagacggggt ttcaccatgt 104580tggtcaggct
gatcttgaac tcctgacctc aagtgaccca cctgcctcgg cctcccaaag 104640tgctgggatt
acaggcatga atcaccacac ctgacggcat gttattttca tcgcaaagtt 104700actgtaagct
gggagaagtg gcacacactt gtactcccag ctactcagga agcttaaggt 104760gagaagattg
cttgagccca ggagttttga gaccaacctg ggcaacacag caagacccca 104820gctcaaacaa
agaaaaaaag ttattgaatt ttttatttct atggatcatt ttttgtagtt 104880tcttattcct
ttcacccttc attcccactt ttgatcccat cttttattta tttagtttta 104940ttaaatgtat
atttgtctga taattctgct atctacagtt ttttgtggac ctgactcagc 105000atttctttgt
ttcttcggat tcagactgtt ggtggcttgt gattttagtg atttttggcc 105060gtgaacatgt
ttcttggact tttgtctgtg ggaattctct gtgtactctg tataaattaa 105120gttacttcag
gtgttttgca ttttcttttg ccatgcacct ggggcctggg tcactaccct 105180tctggtacca
cttaaaactg aatttttgtc ttgggtgctc gtactgatcc tgtatgagta 105240caggtttata
cttactgtag aaatatggtg tttgattatg gggtattgtc ccagatggtg 105300ctggagtatt
aatatgctct ctgttaaact taatgtgttg tccctgtaaa actccaaaat 105360tctgaattcc
agaatactac tggccccaaa tgtttaagat aagggcactg cctgtatttg 105420tttctgcctc
ccactatttt ccttagttta acacaaactc acctttttaa aaaacatttt 105480gagagaattc
agtattggga agagtttcta acctgtttct ggaaatggaa gtccaaagtc 105540tgtttctgta
attgtttttt ttttgagatg gagtctcact ctgtcaccca ggctggagtg 105600caatgacgta
ctctcagctc actgcaacct ccacctcccg ggttcaagcg attctcttgc 105660ctcagccccc
tgagtagctg ggattacagg tgcccaccac catgcctggc tgatttttgt 105720atttttagaa
gagatggggt ttcgccatgt tggccaggct ggtcttgaac tcctgacttt 105780gtgatctgcc
cacctcagcc tcccaaagtg ctaggattat gtttctgtaa ttgtaataca 105840tttattgttt
ttagaaactg tctttgcttt agtggtaatt ttcaataaaa atagaaatag 105900cagtggagtt
attaaaagag cattagttac atttttccct ttttcattat cttcaaatat 105960tatatatagt
aagtttgacc tttttaaaat gtatacttgt atcagtttta acacatacat 106020agattcctgt
aactgtcacc actataaggg taaagaacag ttagttcctt cacctttgaa 106080gtcaagcccc
acctctatcc caacacttgg caaccgctga tctttctccg tctcaatagc 106140tttgcctttt
ctcttttttt ttcttatttt tttttttgag acagcgtctt gctctgtcgc 106200ccgagctgga
gtgcagtgag gcaatctcgg ctcactgcaa cctccgcctc ctgggttcaa 106260gcagttctcc
tgccttagcc tccctagtag ctgggattat aggcacgcac caccacaccc 106320ggctgatttt
tttgtatttt tagtagaaat ggggtttcac catgttggcc aggctggtct 106380caaactcttg
acctcaagtg atccacctgc ctcggcctcc caaagtgctg ggattacagg 106440cgtgagccac
tgtgcccaat caggactttt tttttttaaa tttacattca acttgtcatt 106500tttttcttgt
atggattgtg ccttcagagt cacacctaag agccctttgc ctaagcaaag 106560gtcatgaaga
ttttctcata tgtttccttt taaaagtatt gtggttggcc aggtgccatg 106620gcttatgcct
gtaatctcag cactttgaga agctgaggtg ggcagattac gaggtcagga 106680gatcgagacc
atcctggcta atgcggtgaa accccatctc tactaaaaat acaaaaaaaa 106740aaaaaaatta
gccgggcgtg gtggcgggca cctgtagtcc cagctacttg agaggttgag 106800gcaggagaat
agtgtgaacc cgggaggtgg agcttgcagt gagccgagat cgcgccactg 106860cactccagcc
tgggcaacac agtgagactc catctcaaaa aaaaaaaaaa agtattatgg 106920ttttacactt
tacgtttaga tatatatctt ttttgagtta atgtcgtata agtatgaggg 106980ttacgtcaga
ttttttgttt tttgtttatt tttacatatg gatgtctagt tgttctaata 107040ccatttgttg
aaaagacaac ctttactcca ttgaattgcc tttgtacttt tgccatattt 107100gtctaggcct
gtttttggac tcctttttct gtttcatgat gtgtgtgtct attcctttgt 107160taataccaca
tggtcttaat tactgtatag taagtcttaa aattgggtaa tgctggcctt 107220ataaaacgaa
ttgggaagtt tttattttta ctcttatttc cattttctag aagagattgt 107280gtagaattgg
tgtcatttct tctttagata tttggttgaa ttgggaagtg atgccatctg 107340ggcctagggt
tttgtttttt gtgtgtgaga cagagtctca cttctgtcac ccaggttgga 107400gtgcagtggt
gagatcttgg cttactgcaa cctctgcctc ccaggttcaa gttatcctcc 107460tgcctcagcc
tcccaaatag ctgggattac aagcgtgtgc caccatgccc gactaatttt 107520tgtattttta
atgcagacag ggtttcacca tgttagccaa gctggtctcg aacttgtgac 107580ctcaagtgat
tagcccacct tggcctccca aagtgttagg attatagatg tgagccaccg 107640tgcctggcag
gggcctaggg ttttcttttt cagagtattt taaactatga attcagatta 107700tttaatagat
ataggactat ttaagttatc tgtttcttct tgagtgaatt tttactgtag 107760tttatggcct
ttgagtaatt aattgtattg aattgtcaaa tttatgagcg tgtaattatt 107820tatagcattt
cgggtttgta gtggtatccc tcttttattc ctggtgttgg caattgtgtc 107880ttgtttttct
ttgtcagatt gtatagggat ttattagtct tttcaaagaa ctagcttttg 107940ttttgatttt
tctgttgttt tgttttcaat tttattgatt ttctgctctt tattatttct 108000tttctattat
ttctgcttgc tttgggttta ttttactctt ttttttttct ccaagttgct 108060taaagtagaa
acttagattt ctggtttgag acctttcttt tctaagataa gcatttaata 108120ctgtaaattt
ccttctaacc actgctttag ttacaccccc acaaattctg gtattttgaa 108180ctgagcacaa
atgaaatgtt ctaatttccc ttgaatctta ttcttttacc aatgaattat 108240ttagaaatat
gttatttagt ttgcaagcaa ttggagactt ttttcctgtt atttttctac 108300catttatttc
tcatttcatt atattatggt cagagaatat attttgaatg atttcattta 108360ttaattttta
aaaataacat taaaaaattt tttaaaatgt gaatatacca catacagtat 108420aaagattgta
cattctgttt ttggacagtt ttctataaat gtcaagttga tttagttggt 108480taatgatggt
gttcagtttt tctttattct tgctgatact ttgtatgcag ttatatcact 108540ttattactca
gaagagtgtt gaactttcca actacaattt ttttttccaa ttttactttc 108600agctctatct
ggttttgctt catgtatttt gaggctctgt tgttaggtgt gtacacattc 108660aggatgatat
cttctgggtg aattgcctgt tttatcatta tgtaattccc tctttatggt 108720aattttcctt
gttctaagat cagaaatatc tgttgtccaa tttatataga cactgcagct 108780ttcatttgat
tagtgcttgc atggcatatc tttttccatt tttttacttt tgatctacct 108840ttataattct
atttaaaggg ggcttcttgt aggcagcata tagttgggta gtgttattta 108900tttatttatt
tatttattta tttatttatt tattgagaca gagttttgct cttgttgccc 108960aagctggagt
gcagtggtgc aatcctggct taccacaacc tccacctcct gggttgcagt 109020gattctcctg
cctcagcctc ccaagtagct gggattacag gcacgcgcac catgcctggc 109080tgattttttg
tatttttagt agaaacggat tttcaccatg ttagccaggc tcgtcttgaa 109140ctcctgacct
caggtgatcc acctgctttg gcctcccaaa gtgctgggat tacaggcgtg 109200agccactgca
cccggctgag tcatgttatt tttaatcttt tctcacaata cagggttttt 109260gttggtaaat
ttaattattt taatataaat tttagtataa ttatttacat taaatgtaac 109320tgttgcactg
gggtatttat aatgtgtaaa tataattatt ggtattaata taattatatt 109380actcataata
atattaatat ctttggattt agattaccag tttagtatat gtttttctgt 109440ttctccctct
ttgatttccc cttttttgct tttttttttt ttttaattct tatttttttt 109500tagtatttgt
tgatcattct tgggtgtttc ttggagaggg ggatttggca gggtcatagg 109560acaatagttg
agggaaggtc agcagataaa catgtgaaca aggtctctgg ttttcctaga 109620cagaggaccc
tgcggccttc tgcagtgttt gtgtccctgg gtacttgaga ttagggagtg 109680gtgatgactc
ttaacgagca tgctgccttc aagcatctgt ttaacaaagc acatcttgca 109740ccacccttaa
tccatttaac cctgagtggt aatagcacat gtttcagaga gcagggggtt 109800gggggtaagg
ttatagatta acagcatccc aaggcagaag aatttttctt agtacagaac 109860aaaatggagt
ctcccatgtc tacttctttc tacacagaca cagtaacaat ctgatctctc 109920tttcttttcc
ccacatttcc cccttttcta ttcgacaaaa ctgccatcgt catcatggcc 109980cgttctcaat
gagctgttgg gtacacctcc cagacggggt ggcagctggg cagaggggct 110040cctcacttcc
cagatggggc agccgggcag aggcgccccc cacctcccag acggggcagt 110100ggccgggcgg
aggcgccccc cacctccctc ccggatgggg cggctggccg ggcgggggct 110160gaccccccac
ctccctcccg gacggggcgg ctggccgggc gggggctgac cccccacctc 110220cctcccagat
ggggcggctg gccgggcggg ggctgccccc cacctccctc ccggacgggg 110280cggctgccgg
gctgaggggc tcctcacttc gcagaccggg cggctgccgg gcggaggggc 110340tcctcacttc
tcagacgggg cggccgggca gagacgctcc tcacctccca gatggggtgg 110400cggtcgggca
gagacactcc tcagttccca gacggggtcg cggccgggca gaggcgctcc 110460tcccatccca
gacggggcgg cggggcagag gtggtcccca catctcagac gatgggctgc 110520cgggcagaga
cactcctcac ttcctagacg ggatggcagc cgggaagagg tgctcctcac 110580ttcccagacg
gggcggccgg tcagaggggc tcctcacatc ccagacgatg ggcggctagg 110640cagagacgct
cctcacttcc cggacggggt ggcggccggg cagaggctgc aatctcggca 110700ctttgggagg
ccaaggcagg cggctgggaa gtggaggttg tagggagctg agatcacgcc 110760actgcactcc
agcctgggca acattgagca ttgagtgagc gagactccgt ctgcaatcct 110820ggcacctcgg
gaggccgagg caggcagatc actcgcggtc aggagctgga gaccagcccg 110880gccaacacag
cgaaaccccg tctccaccaa aaaatgcaaa aaccagtcag gtgtggcggc 110940gtgcgcctgc
aatcccaggc actctgcagg ctgaggcagg agaatcaggc agggaggttg 111000cagtgagccg
agatggcggc agtacagtcc agcctcggct ttcacaactt tggtggcatc 111060agagggagac
cggggagagg gagagggaga cgagggagag cccctttttt gctttctttt 111120ggattatttg
aatttttcct taaatttatt tatcttactt atttatttat ttttttgagt 111180gattctcctg
ccacagctcc caagtagctg ggactgcagg catgtgccac tacacccagc 111240taattttttt
gtatttttag tagagacagg gtttcaccat attggccagg ctggtcttga 111300actcttgacc
tcaagtgatc cacctgcctc ggcctcccaa agtgctggga ttacaggcgt 111360gagccaccat
gccctgcctt tttctagaat ttatatattg agttcttgat tgtatctttt 111420tatgtaggct
ttttagtggc ttctctagga attacaatat acatactttt cacagtgtac 111480tcacatttaa
tattttgtaa cttcaagtgg aatgtagaaa acttaaccac cataaaaata 111540gaactaggga
tgaggttaaa aaagagagag aaaagaaatg taataaagat ttaataacac 111600cgtttttttt
tttttttctc tttttttttt gagacagagt ctctctttct gttaccaggc 111660tggagtgcag
tggcgtgatc ttggctcact gcaacctccg cctcctgggt tcaagtgttt 111720ctcctgcctc
agcctactga gtagctggga ttacaggtgc gcgccaccat gcccagctaa 111780tttttgtatt
tttagtagag acggtttcac tgtgttggcc aggatggtct cgatttcttg 111840accttgtgat
tcgctctcct cagcctccca aagtgctggg attacaggcg tgagccaccg 111900cgcccggcta
agtctttaaa tatttttttg acattgcact ttttctcttt tccttctagg 111960attttagtaa
cccaaatgtt agttttgtta ttgtttggca ggttcctgag gctttcctta 112020cttctttaaa
tttttttttc ctgttgttca gcttcgaaaa tttctattca tctgtcttca 112080aattcactgg
ttctttcccg ttatttccat tctgttattg agtctttgta gtgaatttta 112140aattttgttt
attatgtttt ttagttctaa aattttcttt ttttgtgtat gtcttatact 112200ttgctcctga
aactcttatt tgtttcagga gtgatcttat ttcttagagc atggttttag 112260tagctactta
aaatttgttt tatcatccca gcatatgtgt cctcttgatt gtcttttctc 112320ttgtgagata
atgggatttt ctggttcttt atatgacaat taattttgga ttgtatcttg 112380gacagtttga
cttacgttac atgattctga atcttgttta aatcctgtgg aaaatattga 112440agtttttgct
ttaacaagca gttgacctag ttaggttcag tccacaaatt ctaagcagca 112500ttctgtcggc
tctggttcca tcatcagttc agttttgtat cttatctgct tatgtgcctt 112560tctgtgtcca
gtctgggacc tggccaatgg tcaggtccca aagcctttgt acacttttag 112620aagcagggcc
atgcacaccc agctcacgag tggccccggg agtgcacata caactcgacg 112680ttttcatggg
ctccttcttt tctgtgatgt ccctgacacg ttctgccttc taagaacctc 112740cctttatccc
tttcctgttg tctggctaga aagtcagggc tttagattcc ctatacttca 112800gcacacttcc
tgtagctatg tcaacctctg tggccacgac ttcttcttct tgggactgca 112860gtttctcttg
tcagaaagta ggattcttgg agctgctgtc attgctgctg tggctgctct 112920gatgctgcct
gggagtcgaa ggagagaaag gaacaaaaca aaacaaccca ggggatttcc 112980tccactctct
ttgatccgtg agagccccct ttcctgttcc tcagaccaga aatagagggc 113040ctgtcttgga
acttcttctt tgtgcatctg gtgtgcagtt tcagcttttg agtccaggcc 113100aggaggtgct
ggacaaactt gtcaggagta cggaggtact gcaagttctg attacttttc 113160tcagtccacc
tgcttccaag tccttggatg catttgtcca ttgttttgag ttgcattcca 113220tgggagagac
agaagagtgt gcttatttca tcttgacata cttattagga tttcatatca 113280aatcaacgga
tgatattctc tatattaatt tgctgttttc cctttagcaa gcacattagg 113340aaaataacac
tttaacaccc gcctttggtg gtttctgtca taattattaa tacttgactt 113400tttttttttt
tttgagacgg agtctcactc tgtcctttga ggcattgtcc ccataaactt 113460ttggtaaagc
atcaataatt ttatctttca tccacacaag cttcaccata aatttgatgt 113520ttattcttcc
attttagcag aattcatgtt gctccaatag gggctgtctt caaactgatg 113580ttttctcctt
cttagtgcct cagagtagat cctgttcaga tacgttataa caggttaata 113640tgagtttatt
ttggtgtaaa agtactttga aattcatgca tagttttttc atcatatgca 113700ttttccatag
ctttgaacac ccccatgtaa ctctcctctt ccacaaacca aacaatgaaa 113760aagcaccttt
gtgatggaag tttattttgc aataggaact cacagtgatc taagccctgc 113820tattcatgaa
tataattcat tactggagtc caagttgctt tttggttttt gaagttctct 113880tcttcccttg
caggtataga acaagatgca gtgaatactt ttaccaaata tatatctcca 113940gatgctgcta
aaccaatacc aattacagaa gcaatgagaa atgacatcat aggtaagcag 114000tgcttgaaac
tatggcaaaa aaaaaatgac aaaaaatgca cagaactgac aattttcgtt 114060attgactaag
ataatttttt cttaacatgg aatttagcag ttcccttcct aatttgtttt 114120ctgagtattt
tttatatcgg attatagctc actttaaaag tttctcggct gcattcggtg 114180cgagggtctt
tgcctgggcc agatgggctg cagtgtagcg ggtgctcagg cctgcccgct 114240gctgagcagc
cgggccggcg ggcggctacg ctaaccggca cagaccaccg gatggactgg 114300ccggcagccc
cgcaccagtg cacgaagtgg gcgggacaga aacttctggg gttggaagtc 114360cagtgaggct
aaaagccggt accaaagtct ctaggcatca gggctgcagc ccaagagtct 114420cacgaccagt
gggcaactgg atggccagac aggtgtctca gtggtggcct ctccgtctca 114480gggcttcatc
ccacttctca gtgggcctga cgtccctggg caccctggat gtctacctgc 114540attagccaga
gccatcacat ggcctgtgac ttgccttttt ttgccagttg attgtgccac 114600acacagtgtc
atttctgtgt catttggcac agctggaggt gcaaggagga gggcagcctc 114660atgtccagtc
ccagtttcac gtaactttat tcttctgaat aaagacaatt tgctaacctt 114720aaaaaaaaaa
aaaaaaaaaa agtttttctt atatgttgga cccaaattct taggctttaa 114780cctgaataac
aatgacagca agatcaataa atagtacaca tttattaaac actcactgtg 114840tcccagacaa
tattccaagc actttttatg gatagactca ttttaacttc taaagaactt 114900tgtgggataa
atacagttat tttatagatg aagaaactga agcacagaga agttaagtgc 114960tttgtccagg
gtaacagctc agatatggca gagtcaggat ttgaaactag accctcacat 115020accttaactg
ctgtgctgtg gcagtgtttt tcatactgta ggttgggacc agccttctct 115080tatgccctca
ccccctgcca aaaaaaaaaa aaaaaaaaaa aaatatatat atatatatat 115140atatatatat
atatatatat aatatatata tatataaaat atatatatat ataaaatata 115200tgtattagta
tatatgcata tatagtatat attatatatt agtatatata ctaatatata 115260atatacatat
tagtgtgtgt atatatatat atactagaat aaaaaaatca aagtatctca 115320gagtagtaag
gacaaacatt tcagaaaaat gttttcatta tatatacatg tatgtatgtg 115380tatgctgatt
caacaaatat atttcttata ggttatagca aaatagtttg aaagctttta 115440ctgtgtttta
tcaggaagac cttaggtgaa cgtatattca cagataaaag aggttattta 115500ttcattcaat
aaatattaca ttctcataag tcctaatatt atgtattttt attcttcaaa 115560aaagttagta
tttgtgattt atgaaataag acatgttctt gcacttttag cagatctgtc 115620ccgatgttgg
gcttctttaa tccttagtgt gggtgctttg cactcactca ctgctgggga 115680cagcaagacc
cctgttagtc tcagctgtgt ttcttaaatt ggcccactgt accttccagt 115740tagctattct
ggggtccatg tcatgttggc tccattttcc ttttctttct cccacacaga 115800tacctataac
ggctataaca taggcctggt ggctgttggt ggcttatccc tatctgcttg 115860tatttaaggg
gtactgtttc actgagtttt gctgacagat gttgtcatga gatttgaggt 115920tttctgtgtt
gttgctctat ttttatgtgg gaatttgcta ctatcatcat ccctagacca 115980gcttttccta
gtaatacaac agggatgttc tgactgatta gagtttgcct gtttgaagaa 116040ttggttggct
agtgattttt ttttgagggg agtctgtacc agttaatagc ctgactggcg 116100tgtggataaa
aaggaagcag tttcaagtca aataaaacac ttaaaatgaa accacactgc 116160aactctcttt
cttttactta agcttaatca aattaatgat gatgtaatcc catgaaggaa 116220aagtcttctg
aaggatcaag ttgataacat tttgtgatca aagaatttga gaaaacctct 116280atcccagtgt
ctatcattat atattttagg atgttaatta cctgtgtggc tttaggcaag 116340tcatttttcc
tccttgagcc ccattcttaa tcctgtccaa attatttgtc tcctcttgca 116400gttggactat
tttaatatag ctgtccttca agtgagtttt gttcaaagga gccttcactt 116460tagctcttac
tgtgtaccca ctttgcatag tcttgtttta aatgtaatcc ttggattttt 116520ggtgttgcta
actaattact gtttttatgt gaggatttag agtgatccag aatctatact 116580tgcactacct
ccttcatctt ccacaaatgt ttgaagtggt agaattttta aaaactttga 116640aggtacagct
gacagaattt gctgatggtt tggaagtgag tggtatgaga gggaaaaaaa 116700ggaataaagc
atgactgcat tttttgtttg tttgtttgtt tgtttttgag acggagtctc 116760actctcgcca
ggctggagtg cagtggcgtg atcttggctc acggcaacct ccgcctcctg 116820ggttcaagcg
attcccctgc ctcagcctcc caagtagctg ggactacagg cgctcgccac 116880cacgcctggc
taattttttt ttttgtattt tagtagaaac ggggtttcac cgtgttggcc 116940aggatggtct
ccatctcctg acctcatgat ctactcacct tggcctccca aagtgctgag 117000gttacaggca
tatatataag catataaagt gtgttatagc atacaaacag gtatatatat 117060aaacatgcag
tccacacagc tgataggaat gaggcagtag tgaaggagaa gttgatgtag 117120gagaggggac
agttgttaca ggaaagaagt ctggaggcag aagggatgaa ttccagtgct 117180cacatagaag
attgcttaga tgggagcaag gacaatttat ctagagtcac aggaaagaat 117240gcagtacacg
ggtagagatg caggtgagtt gaaagatgtg agagatgatg gaaataattt 117300tctgattgct
tctatattct caaggaagca ggaagcaaag tcctcagcaa agagaataga 117360agaggtgtta
aatatttgag aaaggagatg tactgtagaa aaaaaaaaaa ctcagtttct 117420ccttctgaac
tctcacaaaa cagaaccctt ccatgactct agttgtgtgg ggttttttcc 117480ctgtcagcta
ccaattctgc agatgattgt tcagtgaaca ccaactgggt gtcctctaag 117540tcagttcagt
tctcacactg tttacctgga gatagcatca gatcccacag attgaggact 117600ctgtcccaca
agactgcctc cacttcagat gccagtctca agtacaagtt gtggcctgtg 117660cttctgactg
accttctata aattggagtt cccacagtcc cctccttggg ttcaataaat 117720ttgctagagc
agctctcaga actcagggaa atgctttaca tatatttacc catttattat 117780aaaggatatt
acaaaggata cagattgaac aggcagatgg aagagatgca tgggcaaggt 117840atgggagagg
ggcacagagc ttccatgcac tctccaggtc atgccaccct ccaagaacct 117900ctacagattt
agctattcag aagcccccct ccccattctg tccttttggg ttttttgtgg 117960agacttcatt
atataggcat gattgatcat tggctattgg tgatcagctc aaccttcagc 118020cccctcatcc
cgggaggttg gtgggtaggg ctgaaagtcc caaacgtgta attctgcctt 118080ggtctttctg
gtgattagcc ctcatcctaa agctctttag aggccacagc cacaagtcat 118140ctcattagcc
ttcaaaagaa tccagagatt ccatgaattt taggcgctgt atgctaagaa 118200actggctaaa
ggccagttgc aatgtctcag gcctgtaatc ccagcacttt gggaggctga 118260ggcaggagga
tcgtttcagg ccatgagatc aaaaccagcc tggtcaacat agtgagaccc 118320ccttacaaaa
aatttaaaaa ttggccaggc gtaatagctc ttgtctgtag tctcagctac 118380tcagaaggct
gaggatcact gagccctgga gttgaaggca gcagtgagcc atgatcgtgc 118440cactgactcc
ggcttgggtg acaaagtgag accttgtctc agaagaaaaa ggaaaaaaaa 118500aaaactgggc
aaagactaaa taacatattt cacagtatca cagatttgta ttgtctagga 118560aagtgaatgt
aaacagacca ggacactagt atgatccctt ggtttcatga aggtcccact 118620aaagtcatga
acacaaagtg agactaggca tcatgttata tggtttttcc agccatgttt 118680aacagctagc
taaatagcta attgtttcgc tgcagtttat tttagcagtt ccttatttta 118740gcacatttca
tgttttaaaa tttctaccaa taacatttta ataaactttt ttacagataa 118800cttcacaaat
ccataatttt ttaagttaca atcccagaaa tagaattgct cattgaaagg 118860gtatgttcat
ttttaaagtt atgctagaaa ctgccaaatt gccttcagaa aaaggtgttt 118920gtatccccac
taacactagt gttagttttc ttgtgccctt gctcaagtat acatattatt 118980aaaaacaatg
ttgggccagt ttactagata aaaggtgtag tgcctcctta ttctaatcta 119040tttgattact
agtgagtatg tatgtctttt cacgttggtc attttatgtt tgttcctttg 119100tggattgtca
tgtcctttgc tcatttttct tttggaacat ttcttagtag tttataagag 119160ctcttggtat
tttaatgata gtaacctttt aactgtcatg catgctgcaa atcttttttc 119220tgtttgtttg
cctttgtatt ttgtttttgg agggtttcta tgtataggaa ttaaatttta 119280tgttgttaaa
tcttttgatt tctgcttttg catatgtact tcaaaagact ttctatttta 119340agatcaagtg
ttacctgtat tttcttttag ttctatttaa aacctcttaa tttatatgcc 119400tgtgctgtta
actcccaagt tgattcacaa gtgtgtatac atagtttgaa tttagtggca 119460atttaattat
ttacaacttc ttttgcagca aggatttgtg gagaagatgg acaggtggat 119520cccaactgtt
tcgttttggc acagtccata gtctttagtg caatggagca agagtaagtt 119580agttcatatt
ttcacattgt gcatcctagg gaatttgggt tcattgttag gaatgggctt 119640cactcagcta
aaaacaaagt atttttgaga atttaaatat tttggatatt tacaagatca 119700tataaagcat
actctatctt ggttaacagt ttcttttaaa tataaattat gtgaactctt 119760aaaattttca
ttttcatttt caatgttaat atttcctaag ttaaaataat ttgtttttag 119820ttctgaaata
atttggggag tgattgagtc tgtagtgatt atgactatta gaattggttt 119880atttatttaa
ataatgcatg tcttcagatg gctctcctaa tttgttagtt aggctttaag 119940ctaaatggat
gctatataac taaatccaca tagatttgtt gaaatggctc cagaggtttt 120000ttagatttat
tactgctatg tgcccttaaa aaaaatctat tcattctttc acttaacatt 120060tatcagaaga
gtgctctgtg taagacgtgg ttaggcatag tgccagtctt gaaggaagtt 120120acagcctaat
aaaagacata gggcatgttg tttggttact gtaatatgaa gtggcatgtg 120180ttaaatgtca
ggggagaact acaaagtcat aaaaaggtgg gagagattac atacaggtaa 120240aggaatcagg
aatgacacca tggggagtaa ggtagtgttg acctaggcct ttaagataca 120300atagggacag
tatggaaaga gtatattttt cccacttaaa ctctttcctt ggtcgttccc 120360tcaaattttc
ccttttgtcc atgtgcaggc actttagtga gtttctgcga agtcaccatt 120420tctgtaaata
ccagattgaa gtgctgacca gtggaactgt ttacctggct gacattctct 120480tctgtgagtc
agccctcttt tatttctctg aggtaaagtc tgcatttctt ttcacactct 120540attcgagcat
tccagcctct aactatcaat gctggggccc tgtctatagg aaataacaca 120600gaagagccaa
gtcatttcca aaaagatgta tcattgtttc aagttgtttc tgatggcaag 120660agtaatttaa
taatatatta gagagaacat gaaaattcaa tgtattaaat aactctaatt 120720ttgagaaacc
taattaaact actgcatgta agagagtgca tgtttttaat tatttggagc 120780tattttaaaa
ccacagaatt tgaaacttgc ttccagtgca taaattgcag accagacttc 120840agaagagaaa
aaaagtagta aattttttct tatgctcatc atttttactt tagtcacttg 120900ataggattgc
ccagtgaaga agcatttgca acagacaatg agtatattaa tctttttgag 120960gcatacagtt
tagtataatg ctctttgtta ggcttcaaca agtgaaatta ttttgttgga 121020aagcaaatga
ctattaagta gaaagaggat tcccagtctc acaaagcagt aatttagaca 121080ctcgattctg
cctctttaca agaatacagg tactcagttg atttgttttc tcactccctt 121140tctttgctat
aagtttaaat caacaatttg tttaggttaa tatgtcctca tggaatggtg 121200gaaatgatca
gatataaaat atttggtttg gttagtttac tctttatatg tttgctggca 121260aggaaccaca
aatccagttt agtataattt ttactctagt tcactaaaag tttgcatcca 121320gctgtgtagg
tagtgtttgt ttcttgttaa cttttttttc gtctaaaaga atactttaaa 121380acttttcaat
ctcaaatgac tgtaacttgc tgacaggtgt taacagaaga agtagatctt 121440tttgtttttt
gcttatgacc tgtattttaa tatttgagct tatagattag agattgtgag 121500agaaatctgt
ttatagtctt attttccctt gtgtattttt tcttcctagt acatggaaaa 121560agaggatgca
gtgaatatct tacaattctg gttggcagca gataacttcc agtctcagct 121620tgctgccaaa
aagggccaat atgatggaca ggaggcacag aatgatgcca tgattttata 121680tgacaagtga
gttatattga tagatggatt cagcagatac ttattgaaca tttgatatgt 121740tttgtggaaa
taaagatgaa taaactcagt ctctgttgtc aaggagctca caggaggcag 121800cataaaagct
gcttttatat ggtgtttgta aagctttggg ggttcttaga acaaaagttt 121860ctgctgggaa
aggggaggtg tatgtggggt aaacaggatg gcaatggtgg tgttcaagga 121920gtgtttccca
gaagagagat tttgtttgga tcccaaagaa agaagggaat tttgctaccc 121980agagaaggca
gaaaacaaca ttctaggcaa aggcattggc ccagaagcca tggaaacgta 122040ggggaaagtg
gcactttcaa gaaacttgag tttagataat caaaggagtg gggaataaat 122100atgaggatgc
tggtactaat tggaatagat tgtaagggac cttgaatgcc tatttatggg 122160tatattatac
tttctgtata aatctgctca ggcacgttgt taattagttt tttattagtt 122220ttcactgaaa
atgagaggat ggaaacatca tacagtaaac aaaattgaaa atatctggtc 122280aggcagatga
tgagcttgtg gccagctctg taacgtatgg tattcttttc atttaacttt 122340tcttactctg
taaaaaaagt aattcgtggt cgggcacggt ggctcactcc tgtaatcaca 122400acactttgag
aggcagaggc aggtgaatcg cttgagccca ggaatttgag accagcctgg 122460gcaacatggc
aaaacccgcc tttactaaaa atacaaaaat tagctgagcg tgatggcgtg 122520cgcctgttgt
cctagctact taggggcctg aggcagaagg atcacctgag ccttgggagg 122580tcgaggctgc
agtgagctgt gatccactgt actccaccct gggcagggca gtagagtgag 122640accctgtctc
caaaaaaaaa aaaaacaaca aaggtaattt gttatttgta tccttaagca 122700aatgctaaag
gggtaacttg gggatagaga aaagtccaca gatgttaggg tttgaagaca 122760ctaatagtat
ctaggccagt ggttcctgaa cattagtctg tgggctcttg ctgggctgtc 122820tgcataggaa
tcacctgaga gcttattaaa aataggtttt caggctggtt gcggtggctc 122880acgcctataa
tcccagcact ttgggaggct gaggcaggcg gattacttga ggtcaggcgt 122940tcaagaccag
cctggccaac atggtaaaac cccgtctcta ctaaaaatac aagaattagc 123000caggcatgat
ggcacacacc tgtaatccca gctactcagg aggctgagga aggagaattg 123060ctcgagcccg
ggaggtggag gttgcagtga gcggagatca tgccactgca ctccaggctg 123120gctgacagag
ggagactctg tctcagaaaa aaaaaaaaaa ataggttttc agtctgggta 123180ccggtggctc
acacctgtaa tcccagcact ttgggaggcc aaggcaggca gatcacttga 123240ggtcaggagt
ttgagaactg cctggccaac atagtgaaac cttgtctcta ctagaaacta 123300caaaaaatta
actgggcatt ttgacgggtg cctataatcc cagctactag ggaggctgag 123360gcaggagaat
tgcttgaacc cgggaggcag aggactgcat ctcaaaaaaa aaaaaaaaaa 123420aaaggtttcc
agtccccctg tctcagaaat tctgattctg caggtttgag gtgtgaccag 123480gaatctttat
ttttagaaga cataccagat aattctgata aatagccagt ttagggatgt 123540agtctaattt
tcctattttg caagtaagga aaataaggcc cagagaggta atgattttct 123600caaagtcaca
gaacaagtta gtggcagaat ttggactgga atgcagttct taatgttctg 123660tccagtgttt
attctggtac agtatgtttg tagaaggtat tacgtaagaa acattgttat 123720atagatgttg
agataggaag agtttacatt tagaaatttg gtctaaaatg cctgaacatt 123780caagtcgtgg
aggagtattg accaacttac tcaatacaac ataggagatt cacattttgt 123840tacaaaaatg
ctgatttaaa aggagagttt tctttttttt cttctttttt attttttgag 123900atggagtctt
gctctgtcac ccaggctaga gtgcagtgac acgatctcag ctcactgcaa 123960cctccacctc
ctgggttcaa gcggttctcc tgcctcagcc tcctgagtag ctgggattac 124020aggtgggggc
caccacgccc agctaatttt tgtattttta gtagagacag ggtttcacca 124080tgttggccag
gccggtcttg aactcctgac ctcaagtgat ccacccacca ctgcctccca 124140aagtgctggg
attataggcg tgagccactg tgcccagcct gcttgttttt gtatcatata 124200tatgcatcat
cataatcatg cattatcaac ctttgtattt ctgtcaggac atagaaacca 124260ttagagtgct
tggaagagag cctttttttt tttctcgcat ttaatgcttt ttttggtatt 124320catttcataa
tcagcttacc aaaacattac ctgcattata ccccatcaag gtagaaatct 124380ttgtgttatc
aatattggtt actccctttc cacaccgagt catcagtaag tcctgttcta 124440tccaaatagg
tcatatgcat ctagctcacc cctcagtgct gttttgtttt gaatttgtac 124500atgtttactc
ctgatgcctt gtagttatga tgatgtgttc ttattttatt ctgtgcatac 124560aagttctcag
ctcgcttttt agggaaaatg accatgtctt cctttcctat aaattccttt 124620ctatctatca
agtcctcaac agagaatagg tacccataaa tatgtgattg ttagtttctt 124680tgcctcagtt
gtagtctgat ccttacagct tttaaacaac agtagagttc accgtcaaga 124740actaaggatg
gttggcaggc agatagaaag gtagcaagtt gacccaacta tctctgggga 124800agtgggaaca
aagaaaggtt acatcagcac tgtcatcaca tagctctata gttctaggcc 124860tgcaggctca
atcaagtagc cttgtataag attctctgga ggaggtgctg aaagttgctt 124920atacttgcta
tggaatttga ttttacttcg gatatctttt taccataggt acttctccct 124980ccaagccaca
catcctcttg gatttgatga tgttgtacga ttagaaattg aatccaatat 125040ctgcagggaa
ggtgggccac tccccaactg tttcacaact ccattacgtc aggcctggac 125100aaccatggag
aaggtaaccc agaacttcaa acgtatcaaa ctacaagaag ttttattggt 125160agaactcata
aaatataagg tgggaaaacc aagcagaata gcacagtgga aattgaagca 125220gtccagcaaa
gtgattaaga gcagaggcct tgagtctggc ctggtatgta cagtcacgtg 125280ccacataaca
ttttagtcaa cagtggactg cgtgtacgat ggtcctgtac gattataatg 125340gatcaaagct
ggtagtgcaa taataacaaa agttagaaaa aataaatttt aataagtaaa 125400aaagaaaaaa
gaaaaactaa aaagataaaa gaataaccaa gaacaaaaca aaaaaaatta 125460taatggagct
gaaaaatctc tgttgcctca tatttactgt actatacttt taatcattat 125520tttagagtgc
tccttctact tactaagaaa acagttaact gtaaaacagc ttcagacagg 125580tccttcagga
ggtttccaga aggaggcatt gttatcaaag gagatgacgg ctccatgcgt 125640gttactgccc
ctgaagacct tccagtggga caagatgtgg aggtgaaaga aagtgttatt 125700gatgatcctg
accctgtgta ggcttaggct aatgtgggtg tttgtcttag tttttaacaa 125760acaaatttaa
aaagaaaaaa aaaattaaaa atagaaaaaa gcttataaaa taaggatata 125820atgaaaatat
ttttgtacag ctgtatatgt ttgtgtttta agctgttatg acaacagagt 125880caaaaagcta
aaaaaagtaa aacagttaaa aagttacagt aagctaattt attattaaag 125940aaaaaaattt
taaataaatt tagtgtagcc taagtgtaca gtgtaagtct acagtagtgt 126000acaataatgt
gctaggcctt cacattcact taccactcac tcgctgactc acccagagca 126060acttccagtc
ttgcaagctc cattcatggt aagtgcccta tacagatgta ccatttttta 126120tcttttatac
tgtattttta ctgtgccttt tctgtatttg tgtttaaata cacaaattct 126180taccattgca
atagtggcct acgatattca ttatagtaac atgtgataca ggtttgtagc 126240ccaaaagcaa
taggttgtac catatagcca aggggtgtag taggccatac catctaggtt 126300tgtataagta
cactctgtga tgttagcaca atggcaagca gcctaacgga aattctgttt 126360attgattgat
tgattgattg attgattgag acagagtttc actccattgt ccaggctgga 126420gtgcagttgc
acagtcttgg cacactgcaa cttctgcctc ccaggttcaa ccaattatcc 126480tgcctcatcc
tcccaagtag ctgggattac aggcaggcac caccatacct ggctaatttt 126540tgtattttag
tagagacagg gtttcaccat tttggccagg ctgttctcga actcctgacc 126600ttaagtgatc
tgcctgcttt ggcctccgaa agtgctggga ttacaggcat gagctaccat 126660gcctgggcag
taactgaaat tctctaatgc cattttcctt atctgtaaag tgacgataat 126720atgcacgttt
acctcaaagt tactttgatg attaaagtaa ggtaatgtat ataaaataca 126780tattaacata
gtacctgaca catggtaagc atcaaaaaat gttaactact tttattacta 126840ttattattac
gtatttttaa ataattagag agcagtatca aaaattagct gggcgtagtg 126900gcatgcacct
atagttccag ctactcagga ggctgaagct ggaggattgc atgagcctgg 126960gaattaaagg
ctgcagtgag ccgtgttcat gcccctgcac tccagccttg gtgacagagc 127020aagaccctgt
cttgaacaat taaagaaggc attatgccgc aacgttagct tagaaatgat 127080ccacatatat
caccagtaac tgtcaacagg attggaaccc tagttttggg tattatgatc 127140acaaggtatt
attaatagct tattaataat aaagcgttgg ctaggcacgg cgactcacat 127200ctgtaatccc
agcactttgg gaggccgagg tgggtggatc acctgaggtc aggagtttga 127260gaccagcctg
accaacatgg agaaacccca tctctactaa aaatacaaaa ttagccgggc 127320gtggtggtgc
atgcctgtaa tcccagctac ttaggaggct gaggcaggaa aatctcttga 127380acccgggagg
cagaggttgc agtgagctga gatcgcacca ttgcactcca gcctgggcaa 127440caagagcaaa
actccgtctc aaaaatataa ttataataaa taaataaaag taaagtattg 127500atgtttgtga
atgatttatt cttctaatga actagaggag atttttccag gaatttcaga 127560gccagtgagg
ttatgttgct tgtatgtgtc atgtgtatcc aggtgaaaaa acttaattaa 127620acgctattat
ataataccat acataaaaac tgaattttag gaatactgaa gaatgacata 127680tagaagtcaa
atcattaaat agctagtagt aaacagaata gagtgtcagc tgttacccaa 127740tgatgataat
attttcacga ttaaaattaa accttttctg attttaaagg aaaagttcag 127800atctgtatca
tataaagaat gtaaattttc agggtaataa aattaaaatg cagagagaaa 127860aatgcaaaaa
tagttcttac tagatgtgtg tatgtaagga acttagacta attttaagaa 127920cactgtcaag
accctggtag ttaggtagga aaaaagacat gaatgattca ttcaacaaaa 127980actttgagta
tttctgtgct agatggtagt gttacagtgg taaacaaaat aaatgtgttt 128040ctgctatcct
ggagcttagt ctacaaaaaa ggtacatatt ggccgggcac ggtggctcac 128100gcctgtaatc
ctagcacttt ggaagatcga ggcgggtgga tcacctgagg tcaggagttc 128160aagaccagct
tggccaacat ggcgaaaccc cgtctctact aaaaatacaa aaattaactg 128220ggtgtggtgg
cggacacctg taatcccagc tactcgggag gctgaggcag gagaatcact 128280tgaacctggg
agacagaggt tccagtgagt cgagatcatg ccactgcatt ccagcccggg 128340ggacaaaagc
gaaaatacgt ctcaaaaaaa caaaaacaaa caacaaaggc acgtattaaa 128400tacgaacata
aatatttaca aattatactg aataagttct catgtttatt atttgcttgt 128460ccagttacaa
acttttcctt cgtagaatta gaaatataaa taataaacat gagaactcat 128520tcagtataat
taataattat taaatgtaaa taaaaacatc tatgtacaat taggcattta 128580tttaagaatt
atttgaaaaa aaaacaatgt ggaaacagat attttgatat attgctagtg 128640attgaaattg
ataatgttct tttgaagagt aaagtgacca tatatattaa agttaaaatt 128700taactcagca
atcacacgcc tggtgagtta tcttaaggaa atcagtttga aagtaaaatc 128760aatatatgca
caaagacttt aacatttatc ataaaccaga aaaatcgagt ttcaaattat 128820atcctatgga
ctattttctg ctaaaaagta ttaatatcaa ctttatgtaa tactttcgtg 128880acaaatattt
tgggggagaa aacccaacaa aattacatgc attgtaattt tttttttttt 128940ttttttttta
gacagtcttg ctccagcgtc caggctggag tgcagtggtg caatctcggc 129000tcactgcaac
ctccatctcc caggttcaag caattctcct gcctcaggcc tcccgagtag 129060ctgggattac
aggcgctcac caccatgcct agctaatttt tatagttttt agtagagatg 129120gggtttcatc
atgttggcca ggctggtctt gaactcctgg tctcaagtga tccgtctgcc 129180tcggcctcct
agagtgctga gattacaggt gtaagccact gcacccagcc ttatgcatta 129240taattttaat
ttgtaaactg tacaaaggga taatacttgt agtacaacaa gaagtaaaaa 129300catttgttat
aggtagttaa catttgtaac cagtagaatt ataggtaaaa tttatttatt 129360taaaacagtt
ttagttggat ttgatttcaa ctttaaaata atgcttttca tctctatcag 129420gtctttttgc
ctggcttttt gtccagcaat ctttattata aatatttgaa tgatctcatc 129480cattcggttc
gaggagatga atttctgggc gggaacgtgt cgctgactgc tcctggctct 129540gttggccctc
ctgatgagtc tcacccaggg agttctgaca gctctgcgtc tcaggtattg 129600actgattgcg
tctgccatta gggagaaaag catacacatc ctttccttca catcccagta 129660acagatccta
ttatttgtaa attttaagtt gtggaaaaaa aagataaaag ccaggcacag 129720tggcctgtgc
ctgtaatccc agcactttgg gaggctgcgg tgggcggatc acacgaggtc 129780aggaattcga
gaccagcctg gccgacatgg tgaaacccca tctctactaa aaatacaaaa 129840attagccggg
catggtggca ggcacctgta atcctagcta cttgggaggc tgaggcagga 129900gaatcgcttg
aacccaggag gcagaggttg caatgaacca aaatcacgcc actgcactcc 129960agcctgggtg
acaaagtgag actgtgtctc aaaaaaaaaa aaaaaagaga gaaataaaat 130020tagcctactt
actatcttct aatcaaagca tttgtggtaa cttaaaatat actgtattgt 130080aaagtatcat
gctgtttcat ttaggccatt attctatttg aatctgtggc tgtttctctt 130140aataaatcaa
gtaatatgga atatattcat agcctctgaa gagctcttta tgtaagtatt 130200tatttaggat
actttttgta aaataagtga atgaattctt aggtctcctt tttttttctt 130260ttcttgagac
agggtctcct cgctgcaacc tggaaattct gggctcaaat aatccaccca 130320ccacagcctc
ctgaatagct gggactagag gcatgcacca ccacgcctgg ctaatttgaa 130380attttttttt
ggccaggcat gatggttcac gcctgtaatc ccagcacttt gggagaccga 130440ggcaggcaga
tcacgaggtc gggagatgga gaccagcctg gccaacgtgg tgaaaccccg 130500tctctactaa
aaatacaaaa attagctggt tatggtggct catgcctgta atcccagcta 130560cttgggaggc
tgaggcagga gaatggcttc aaccagggag tcggaggttg cagtgagccg 130620agatcacgcc
actgcactcc tgcatggtga cagagtgaga ctccatctca aaaaaaattt 130680tttttttaaa
tgatggagtc ttgctgtgtt gctcaggctg gtcttgaacc cctgacctca 130740aatgccgcct
gcttcagcct aagtttcttt tttttttgta aagagacagg gtcttgctat 130800gttggccagg
gtagtctcaa actcctggct tcaagcagtc ctcccacctt ggcctctcaa 130860agtgctggga
ttacaggcgt gaaccactac ctataatgtt gtgtttcact caaggccttt 130920tgatttcgtt
ttgcattacc gtgccacatt gtgcatttcc ttgacctttt ttgggttttt 130980tggagtgctt
tcatatgtta aaccatacct gattctcctc aaaatcacac aaagtagaat 131040atcctaagac
aagaaatcta aggaggcata aagaagttaa ctggttttat taaactcaca 131100cagtaaatga
tagagccaga aatattcccc ttctagtgtt cttcaccatc agcttaatgt 131160agcataataa
ttttctaatt actgttgaca aataaataac cctttgaatt ttcaatactg 131220ggccttggat
aaattttcct aatttgtaag agagtattat cgtattgcca tttacaaagc 131280tctcctgagt
atctttttct tctgttaagt ttacctagga gataaactgc tgagtatggt 131340tgccattttg
gttttttgat ataggttaga atgtcttggt tttttttttt tttttttttg 131400gtttttgttg
ttgtcattgt ttgagacagc atcttgctct gtcgcccagg ctggagtgca 131460atggcacgat
cgtggctcac tgcaacctcc acctcccggg ttcaagcaat tctcctgcct 131520cagcttcctg
agtagctggg attacaggca tgtgcaacca cacctggcta atttttgtgt 131580ttttagtaga
gaaggggttt caccatgttg gtcaggctgg tattgaactg ctgacctcat 131640gatccacctg
cctcggcctc ccaaagtgct gggattgcag gcatgagcca ctgcacctgg 131700ctgaatgtct
tgtttttgat taggcactta agaaaggcct aggtactaac cataaaatat 131760atttttatac
cttttgttga tactatatat atagaaaact gcacttatca taaccttaga 131820caccttgaag
aatgttcaca agcagaacta acccatgtga cccagcatcc agatcaaaaa 131880cagcattatc
agcccctcta gaagccctct tgggcccctt ccattcactg tccttcttgt 131940caccagggta
gctactatcc tgacttttga tggcatagat tagcattacc tgttcttgtc 132000attttataaa
taaaaccata ctgtgtattc ttttcttgta cagctttatt gtgctaattc 132060acatttacat
catacaattc agtggttttt atatggtcac agagttaggt aaccattacc 132120acatcgattt
tagaacattt ttttcactcc agatagaaac cccctttact taaactccaa 132180atcccccact
ccaccagccc taggcagcca ctagtctact ttttatctct atagagacaa 132240tagatttgct
tattctggac atttcataaa catggaaccg tatattatgt ggtcttttgt 132300tgccaactgt
ctttcactta gcatcatgtg ttcaaaagag catcatgtta tccatgtttg 132360gcatgtatca
gaattttatt cctcattatg gccaaatatc ccattgcaag gatttatgac 132420attttatttg
aattgtaccc tcctttctgc catttatcaa taatgctact gtgaccattt 132480gtgtacaagt
ttttgtgtgg atacaggttt tctttttgtt tttaaatttg aggtggagtc 132540ttgctctgtc
gcccaggctg gagtgcagtg gcacaatctc ggctcactgc aacctctgtc 132600tcctgggttc
aagcagttct cctgcctcag cctcccgagt atctgggact ataggcacgc 132660accaccacgc
ccagctaatt ttttagtaga gatggggttt caccatgttg gccagtctgg 132720tctcgaactc
ttgacctcaa gtgatccacc catctcggcc tcccaaagtg ctgggattac 132780aggggtgagc
cactatgccc ggctgtggtt ttcatttctt ttgttgtata tacataggag 132840tagaattgct
gagtcaagag gtaactctta aacttattga aaaactgcca gattgttttc 132900cgaaaaggct
gcaccatttt gcaatcccac cagcagtgta tgagttttac agcttctcca 132960catttcattg
gaacttatta tctgtttggc tgtttttaaa aatgatagtc attccaataa 133020gttctacttc
agtgtggttt ttgcacttct ctgatgagta atgatgttga gcatcttttc 133080atttgcttat
tggcctttgt tctagctttg gaaaaatgtt tattcaaatc ctttggccat 133140ttttattttt
atttttattt atttattttt ttttgagacc aagtctcact ctgtcagcca 133200ggctggagta
caatggtgtg gtctcagctc actgcaacct ccgcctcctg tgttcaagtg 133260attctcctgc
ctcagcctcc cgagtagctg ggattacatt tcaggcacct gccagcatgc 133320cgggctgatt
tttgtatttt tactagtgac agggtttcac catgttagcc aggctggtca 133380caaactcctg
acctcaggtg atctgcctgc ctaggcttcc caaagtgctg ggattacagg 133440cgtgagccat
tgggcccagc ctagattttc ttttttcttt ttttttttga gaaggagtct 133500tgctcttgtt
gcccaggctg gagtgcaatg gcacaatctt ggctcactgc aacctctgcc 133560tcctgggttc
aagcgatttt cctgcctcag cctccccagt agctgggatt acaggtgcct 133620accaccacac
ccagctaact tttgtatttt ttttagagac agggtttcac catgttggcc 133680aggctggtct
caactcctga cctcaggtga tccacctgcc ttggcctccc gaagtgctgg 133740gattaccggc
atgagctacc aggcccagcc aattttctca ttatattgcc caggctggtc 133800tcaaactcct
gggttcaagt gatcctcctg ccttggcctc ccaaagtgtg gggagtacag 133860gcgtgagcca
ccttgctcag cccctttgcc catttttaaa ttagattgcc tttttatatt 133920gagtttcagg
agtcctttat atattctaga taaatgtccc ttatcaaatt atattatttc 133980caggtatttt
cttcattctg tgagttgtct ttcctctacc ttttaaaaaa ggtgggtttt 134040tgtttgtttg
tttgtttgtt tttttaagat aaggtctcat tctgctgccc aggctggagt 134100gcagtggcac
aatcacagct cactgccacc tcaacttcct gggccgaagt gatcctctta 134160cttcagcctc
ctgaatagct agggccatag atacacacta tcacacccag cttttttttt 134220ctgtttgtag
agacagatct tactgtgttg cccaagttgg tctcaaactc taggctcaaa 134280gtgattctcc
cacctctgcc tcccagagtg ctgggattac aggtgtgagc cacacgcaac 134340ctgtcttttc
actattaata gtgtcttcct gcttcagcct cccgagtagc tgggattaca 134400ggcacccacc
accatgcctg gctaattttt ttgcattttt agtagagaca gtgtttcacc 134460atgttcaccc
ggctggtctt gaactcctga cctcaggtga ttcacctgcc atggcctccc 134520aaagtgctgg
gattacaggc gtgagccact gcacccggcc aaaatattgc cttcttaaca 134580gtattgtctt
ctaatttgtg aacatggatg tatcttcatg tatttatgtg ttctttcatt 134640tcagcagaat
tttgtagttt tcagagtaga agcctttcac ctccttgggt catttattcc 134700tatgttttaa
gttcttttcg attccattat aaatagaatt gttttcttaa tttcattttc 134760agattgtttg
atgagagagc atagaaatac aagtgatttt tacatgttga tcttgcaact 134820tcaactttga
taaatctgat tgttagctct aatagttttc ttgtggattc tttaggattt 134880tcaatatata
agatcatgtc atttatggat agagatagtt ttttttctgg ctagaactta 134940cagagcaatg
atgagtagaa gtggcagaag caaaaatctt tgtcttgttt cctatctgac 135000agggaaagct
ttcagtttca tcatttaata tgatgttagg tgtgggtttt caataaatgc 135060cttttttcag
attcaggaat ttccctatca ttcctgattt tttaaggctt tttttttttt 135120ttaaatcatg
aaagggtgtt gaatattgtc atgttctttc tgtatcagta taaatgatcc 135180tatggatttt
gggttttatt ctgttgatgt gaaatattaa ttgattttca gatgttaaac 135240caaccttgca
tacctgagat gaatctcact tggtcatggt gtataatctt ttcaatatgc 135300tgctggattc
catttactgg tattttgttg aagattttgt atctgaacgc ttaagataac 135360atttacactc
tatcagaaat gaattgacca taaatgtgag agtgtatttg tgggttcttg 135420attctcttcc
attccaaaga tagacataca tccgtctgta tgtctgtctt tatgccagta 135480ccatactctc
ttgattacta ttgctttgta ataagttttg aaatcagaaa gtataaatga 135540gattttggta
tctgagtaac agtcctcata gaattagttg ggaaatattc cctctttatt 135600ctggtccctc
tttctttttt gtttaactgt gtatcttgga gattgttcct tctcaacaca 135660tgagagccgc
tttccctacc ctcccacccc tgctatagag aggtctataa gtgtctgttc 135720aattatttta
tttacttaac ctattactta gtcggggaca ttaagcttgt ttatgtcttt 135780tattttaaac
aatgctgcag tgaataatct tgtatataag tcattttcca tcaatataag 135840tctctctgta
actgaatttt tagaagtgga atttctaggt caacctatgg ctctgtattt 135900cacaaaaata
ccaattctgg tttttcttgt ggaggtgggg agtaggaggt agaatgctgg 135960aggagaactt
gctgtactca gctggctagt cattttagaa aggtttcctt agcttctttt 136020tgtcatatgg
cctcaccaag aatcaaaaac attcctattt accctgtaaa catggggctt 136080tactacccaa
gatacatatt tctggatgta tgacagcttt tcatattgaa gaaataatgc 136140tgtgagtaca
gcacatttgt tggaacttag gtcgttaaga atgtcttata aattcataca 136200ttatacattt
tattttattt tattttttag tttttgatac agagtcttcc tctgtcgccc 136260aggccagcgt
gcagtggtac aatcttggct cactgcgacc tccatctcct gggctcaagt 136320gattctcatg
tctcagcctc cagagtagct atggttacag gcatgcacca ccatgcccgg 136380ctaatttttt
tatttttagt agaaactggg tttcaccata ttgaccatgc tggcctcgaa 136440ctcttggcct
caagtgatcg gcctgcctca gcctcccaaa gtgctgggat ccttgtattg 136500ggtaaaagat
gaatattgag ggctgcatgg tggctcatac ctgtaatccc agcactttct 136560gagactgagg
tgggaggagt cctggagccc aggagggtga ggctgcagtg agttgtgatc 136620gcgccattgc
acttcaacct aggaattata ggcttcagtc actgtgcccg gcatgtacat 136680tttaatattg
tgctttcctc ttttagctat agtatgaggt tacatttcag agtcattgtt 136740gttaagcatc
ttaatagtga tgaggttgag tgaaagttac ttctatttca aacactgaag 136800aaaattttgt
acaaatctgt cacattccaa gcccaggact gattgtttca tatacttcta 136860attttacaat
ttctattgta gtccagtgtg aaaaaagcca gtattaaaat actgaaaaat 136920tttgatgaag
cgataattgt ggatgcggca agtctggatc cagaatcttt atatcaacgg 136980acatatgccg
ggtaagctta gctcatgcct agaattttta caagtgtaaa taactttgca 137040tcttttaaat
tttttaatta aattttacat ttttttctaa tctattatta tatgcccaga 137100actttcactt
agagtgtgca gtataatgtg gtggttaagt ataaaggctc tggagtgact 137160tcctgggttt
taatcttggc tctgccattt attggcagcc gctaacctct tggtatctca 137220gtttcttcat
ctgtaaaatg agaataataa agtgaaaaga tgccaacatc atttactctg 137280ggctgcataa
ctgatacttg gaaaaagtat tcctttgagt ttaagaatta agttggttat 137340tcattttagc
ttgtaataaa aagatagtga ttcataggat atgccactta ctgaaattta 137400ccacagatcc
aatcataaaa tcactttctc ttccctaaag atagcttgat taacatgtaa 137460aggtgtgtaa
aggcttgatt acactaccct gatccgtacc ccagttccca gcagcaccat 137520gaaaaaggga
tttcaacata tttaattact ttcagtagaa agtaacagtg gtaggccagg 137580cgcagtggct
cacacctgta atcccagcac tttgggaggc cgaggtgggc ggatcacgag 137640gtcaggagat
tgagaccatc ctggctaaca cgatgaaacc ccgtctctac taaaaataca 137700aaaaattagc
cgggcatggt ggcaggcacc tgtagtccca gctacttggg aggctgagac 137760aggagaatgg
cgtgagcccg ggaggcggag cttgcagtga gcttagattg tgccactgca 137820ctccagcctg
cgcagtggag cgagactctt gtctcaaaaa aaaagaaagt aacagtggta 137880ttgggagact
gaggagccta gaaagtactt gaaggaagta aaaggtttgt ttgaccacat 137940tgtatttgga
aagccagctt tttcagctgt gtcagctttg tgtagtgatt tttagttctt 138000cttttagaaa
ataacggaca aggccgggca cggtggctca cgcctgtaat cccaccactt 138060tgggaggccg
agacgggcgg attacctgat ctcaggagtt cgagaccagc ctgggcaaca 138120tggtgaaacc
ccgtctctac taaaatacaa aaagttagcc gggcgtggtg gcgtgtgcct 138180gtagtcccag
ctactccgga ggctgaggca ggagaattgc ttgaacccgg gaggcggagg 138240ttgcagtgag
ccaagatcac accattgcac tgcagcctgc gcgacagagt aagactctgt 138300ctcaaaaaat
aataataaaa taaaaaagaa tggacagtaa acctaaatga gttcattccc 138360aaagatgatg
ttattcttaa gggatggttc atttatttaa gaccttacat aaagtctatc 138420aattgcgtga
tttttcactt ctgtaattgt gtgtatgtat aatgtaaata tatatgtttt 138480tgttttgttt
tggttttttg agacggagtc tcgctctgtt gctcaggctg gaatgcagtg 138540gtgcaatctc
agctctctgc aacctctgtc tcccaggttc aagcgtttct tctgcctcat 138600cctcccaagt
agctgggact acaggcacgt gccaccacgc ccggctaatt ttttgtattt 138660ttagtagaga
tggggtttca ccgtgttagc caggatggtc tcaatctcct gacctcgtga 138720tccacccgcc
ttggcttccc aaagtgttgc tattacaggc atgagccacc acacccagca 138780tgtatttttt
aaatgtataa aatgaagcag aaaagagaaa tgataatttt tcttcatctt 138840gaaagattat
cttcaccagg cgcagtggct cacacttgta atcccagcac tttgggaggc 138900ctcggcaggc
ggctcacttg agttcgaaac cagcctggcc gacatggtga aactccgtct 138960ctactaaaaa
taaataaata aagatggttt taatatatgt tttagtttta tgattttagc 139020atctttctga
aatttttctc aaggcaagta aatttgtatc agttggtata ttggtaccca 139080tctatgaaat
aacttattag gaagatatct ctaaaataag atcactttgc ctaaaataaa 139140ctgatatatt
gatgttcaca gaatttttct tttaaccgac ttgataaatg cattattctt 139200gacgtcaagt
gatccacctt cctcagcctc ccaaagtgct gggattacac acatgagcca 139260ccgcacctgg
cattattctt ataaaaggtt aaatttctag ttaagtttaa tgtcctcttt 139320gttcatgtac
cattgcttat tttcttccct tcctactcac agtaatcatt cttatggtat 139380gcacttttgt
ttgcttattt ttatgtaatt gatattacgc tccattctgt acgttgtact 139440ttcattcaca
gtgagttttg gacattccta tgttcatcta tacagactta cttcatttta 139500actacactgt
agtattccgt atgtaatatt tactataact catcactgta gcagagcatc 139560tcatagtgta
tgtattactg ttttgccatt ttggtatcaa tgagtattta agtcatttgc 139620agtttttccc
tcttataccc agtattacag aggatctctt tttatatgct tctttgtacc 139680aagaggcaga
ttaaaaaatt tttttttgaa aaaatttttg aaaaaaaatg aaatgaagtc 139740tcactatgtt
gcccaggctg gtctcaaact cctaggctca agcaatcctt ccatcttggc 139800ctcccaaagt
gctggggtta caggcatgag ccaccatgcc tggcctacat tttaaatttt 139860gatagctctt
acaatttact ttgtaaagta tctgcatcat tttatgttct caccagtctt 139920taataagaat
acttcatact tttggctgga cacagtggct cacgcctgta atcccagcac 139980tttgggaggc
cgaggcgggc agatcaagag atcgagacca ccctggccaa tatggtgaaa 140040ccctgtctct
actaaaaata caaaaattag ctgggcgtgg tggcgcaccc gtagtcccag 140100ctactcgaga
ggctgagaca ggagaatcac ttgaacccgg gaggtggagg ttgcagtgaa 140160cttagatcac
accactgcac tccagcctag caacagagtg agactctgtc tcaaaaaaaa 140220aaaagaatac
ttcagactta attttttttc cagtcttaag tgtttgctaa tgagattgag 140280tttcttttgg
tatgtctctt gattgttcag gttttttctt ttatgaattg actgttcatc 140340tctttttcac
attatttctg ttgggtgatt ttattagtga cttgttaaaa ttctgtatat 140400tttttcagca
tgacacttca ttattcaaaa aaaaaaaaag attctctatg tttctcgata 140460ctaatcattg
gttggtaata ccttaaaaat aagaccctta ctgtattttt tgcttttttt 140520tttttttttt
tttttttttt tttgagatag agtcttgctc tgttgcccag gctggagtgc 140580aatggtatga
tctcggctct cagctcactg caactgcaac ctctacctcc ctgtttcaag 140640caattctcct
gccttagcct cccaagtagc tgggattaca ggcatccacc accacaccca 140700gctaattttt
gtatttttag tagagacagg gtttcaccat gttggccagg ctggtctcaa 140760actactggcc
tcaagtgatc cgcctgcctc ggcatcccaa agtactggga ttacaggcat 140820gagccacagt
gcctagccac tttttgcttt ttaactttgt tttatagtac tatagtttta 140880gtataaacag
atgtatgtat acacacaact atggctttat aatatgtttc agtcattgtt 140940agagcaaggc
ctaccttttg ggtgcttctt ttacaaaatt gtcttggcta ttcttgtgcc 141000ttttttctta
tttgtgaatt ttagaattgt gaattacctg ttgactcacc atgttttgta 141060aactgaggat
tttgaatgga attgcactca attaaagatt atcttgcttt ctgtgcagca 141120atgttttatt
tcaaataatc cctactttaa attacttagg atagctataa attgtgtttc 141180tggctttcta
gatttagatg aaacgcttta aattgattgt tttctcctaa atttaaaact 141240gattgttaga
agttaaagtc ttctgttcat tcttatttag gaagatgaca tttggaagag 141300tcagtgactt
ggggcaattc atccgagaat ctgagcctga acctgatgta aggaaatcaa 141360aaggtttgtg
gtgtttttat acttcatatt aagcctttac tcacattagt gattgactgt 141420aagtcaaaga
ccacttaagg tttaaactgt ttattttgta aagtaaccac tgtatctttc 141480accttgtgtt
tatagtcaga agtaagtaca agggcttcct gtagtcacat ctttatgcaa 141540tctcctctga
atcaaaagtt agtgaacttg ctttgccact ccagaaggca catgaatatg 141600aaaaagcatt
gtctattttc ttatttaatg gcaaaatacc cgacctaagt tggacttaat 141660gtttgagacc
gtttatttta ttaaattata ttttttctct tttctttttt ttttttgaga 141720cagttcttgc
tctgtcaccc agaccggagt gcagtggtct gaccgcacct cactgcaacc 141780tctgcttcct
aggttcaagc gattttcctg cctcatcctc ctgagtagct gggactacaa 141840gtgcgcacca
ccacacctgg ctaatttttg tatttttagc agagatgagg tttcaccacg 141900ttggctaggc
tggtctcata ctcctgacct caagcaatcc atccgccttg gcttcccaaa 141960gtgctgggat
tacaagtgtg agccaccatg cctggcctta ttaaattatt tttattaaat 142020ttcctcaaga
ttgatgaaag taatgaaata taaaagtaat gaaatatatg tggaaaatag 142080actggattaa
gaaaatgtgg cacatataca ccatggatac tatgcagcca taaaaaagga 142140tgagttcatg
tcctttgtag ggacatggat gaagctggaa accatcattc tgagcaaact 142200gtctcaagga
tagaaaacca aacaccgcat gctctcactc ataggtggga attgaacaat 142260gagaacactt
ggacacaggg tggggaacat cacacgctgg ggcctgtcgt ggggtggggg 142320gctgggggag
gaatagcatt aggagatata cctaatataa atgacgagtt aatgggtgca 142380gcacaccaac
atggtacatg tatacatatg taacaaagct gcacgttgtg cacatgtacc 142440ctagaactta
aagtataata aatttaaaaa aaataaatat atgtggaaaa tattaatagg 142500tcaaaattca
aattgttcat ttaatcagaa gagtagttta gtcaaatcca agggttagac 142560aacagaaatc
ttttttgtca agtgcattct ttgtgactga tttcattttc ttcctggttt 142620acacaggaag
atttcagaaa caaatgtgga tccgtgacag atggtatcta gaagttttta 142680gtttggttga
attgacagta ttttattgag taaaagatac taatttttgt aagaagaaaa 142740attcaatttt
gataagtatg tttaagatta agagctattg gccaggcgct gtggctcatg 142800cctgtaatcc
tagcactttg ggaagctgga gcaggtgggt cacgaggtca agagattgag 142860accatcctgg
ccaacatggt gaaaccctgt ctctactaaa ttagccaggc gtggtggcac 142920atgcctgtgc
acccgcctcc gggtttaagc gatcctactg cctcaggctc ctgagtagct 142980gggattacag
gcgccatggc taatttttgc atttttagta gagacagggt ttcactacat 143040tggccaggct
ggtctggtct caaactcctg acctcaggtg atctgcccgc cttagcctcc 143100caaagtgctg
ggattacagg catgattcac catgtctggc catttatctt attttctttt 143160tttttttttt
ttttgtttga gacggagtct tgctgtgtcg cccagagctg gagtgcaatg 143220gtgcgatctc
agctcactgc aacctctgcc tcctgggttc aagcaattct cctgcctcag 143280tcttccaagt
agctgggatt acaggcgcgt gccaccacat ctagctaatt tttgtatttt 143340tagtagagac
agggtttcac catgttggcc aggctggtct cggaactcct gacctcgtaa 143400tctgcccacc
tcggcctccc aaagtgctga gattacaagt gtgagccact gtgcccagcc 143460atcttatttt
ctttcttttt ttttgtcggg tgggaggggg acagagtcta gctctgtcgc 143520caggcttggc
tcactgcaac ctctgccccc caggttctag caattattct gcctcagcct 143580cccaagtagc
tgggattata ggcacctgcc accacgcctg gctaattttt tgttattttt 143640agtagagatg
gggttttgct atgttgacca tgctggcctc aagtgatccg cccaccttgg 143700cctcccaaag
tactgggctt acaggcgtga gcttgtattg ggtaaaagaa caatattggg 143760ggctgcatgg
tggttcatac ctgtaatctg agcactttgt gagactgaga tggaaggagt 143820gttggagccc
aggagggtga ggctgcggct gcagtgaatt gtgatcacgc cattgcactt 143880ccacctaggt
aatggagcaa gaccatgtct ctaaaaaaca aaacacaatt tttttaagga 143940atactgggaa
gaggtcagtg gtggttttag aacagaggaa gtgccagatg acctttgtga 144000ggcattggcc
aggaagaact ctacagtgtc tttaggtagc ttctgtccat aaggataatg 144060gggtctcctc
cccagtatta atagaaaatc tctgagctgt ttttttttgt ttgtttgttt 144120tgtttttttt
tcctgagatg gagtctctct ctgtcggcca ggctggagtg ctgtggcgcg 144180atcttggctc
actgcaagct ctgcctccca ggttcacacc attctcctgc ctcagcctcc 144240caagtagctg
ggactacagg tgtccaccac cacgcccagc taattttttg ttatttttag 144300tagagatggg
gtttcaccat gtcagccagg atggtctcga tctcctgacc tcgtgatccg 144360ctcgcctctg
ccttgcaaag tgctggagtt acaggcgtga gccaccgtgc ctggcctggt 144420ttttttgttg
ttgttattta tttatttatt tatttatttt ttgagacaga ctctcgctct 144480gtcgcccggg
ctggagtgta gtggcacgat gtcggctcac tgcaagctct gcctgccagg 144540ttcaagccat
tctcctgcct cagcctcctg agtagcaggg accacaggcg ctcgccacca 144600cgcccggcta
attttttgta tttttagaag agacggggtt tcaccgcatt agccaggatg 144660gtctcgatct
cctgatgtcg tgatccgccc acctcggcct cccaaagtgc tgggattaca 144720ggtgtgagcc
accgtgcctg gcctgatttt tttttttttt taatctggtc tcatacctct 144780gacagctcat
gaagaagtgc tcctgcttca tatgtatatg tgttagcata gtgttaacat 144840agcataggtg
ttcggtgttt gcagtttctg tttgttttat atgaattaag gtgtattatg 144900agcagttgaa
gatatatagg aaattttttc ccaaaccact atctctgctc gttctattca 144960ttcagtctgt
ttatgttatt ccttcattca ttcattttat agaacagtgg agtgcctact 145020gtatgcatct
attgttctgg gtcctgggga agaaaacaaa gttcctgctt tcatggaact 145080tacattatat
tggcggagac agtaacagac aaacaaatgt agcctgtgta catgtgttac 145140atgaaaagca
gggtaggggg ctgggagaga gtagtaggga gtgctatttt cgaggtggtt 145200gtcaggaaag
gcctcactga ggaggtggca ttttgagtag acctgagcgc agcgggggcg 145260taagcccagg
cagcatgtgg aggaagagtg ttcttggtga aaggaacaag gatagaggcc 145320cgaagctaga
gagctcagca tgatcaagga acagcaagcc ccgtgtggct ggaatggagt 145380gagcaaagga
atgagcagta gaaggtgagt gagttgggag gtcaccagag accatggcaa 145440ggacttgaaa
gtgtcaggga cacattggaa gttggagcag ggaaatgatg ggatttatgt 145500tttgtttttg
ttttatgttt agtgttttta agggattgct ctatcagcta tttggaaaat 145560ttagtgtagg
gcttcaagaa gagaagcaga gaaacaacat tcttgccata gtcatagtct 145620aagtaaggga
tgatggtggt gtggattagg ctggtagtgg aagaccagtc cagttcgggt 145680tgtatttgaa
ggtagaggca aaaagattat atttctacca gcaagcccat ctatgaagtt 145740acttgtatta
ttaatttaat tgagacatgc ccacataaac taataaatag gaatttctgc 145800agtttggtta
aacacccctg tatatcctgg ttcttctttt agttgtccag atgtctcttt 145860aagtcaagta
ttttttggtg gtgtaggagc ctagagattg aatttattca cccaaaaggc 145920atttgagtga
ttactatgtg ccaggcacta tgctgaatgc caaggatgta aataagaggg 145980cgtagtctca
gtctgtttta ctccagcttg gttccttttt aatgaccctg acttgttaag 146040catatcagtt
atcctacaga atgtttaatc ttctgtactt tcctggttgt gttatttagc 146100ttatttctct
ttccttgaca tttcttgtaa actggaagtt acacctatag tcttgatgat 146160tcgtgttaca
cattttagat tagaacacat catgtgttgt atatggtgtt tttgaaagcc 146220tctctgtata
ttggtctgta cattaaaatg ttgcctgaat ggatacacat aaaatttaac 146280agtgattaca
ttagagatga gaagaaagag gtgcctttta cttttcaata taccttttcc 146340tctgcttttt
gaactttctt gccctatgca tacgttattg cttaatcatc cacctcatct 146400cttcccctgt
ggctttctgt tgcatttgga atgaaatcta gcctctttgc tgttacctgt 146460ggatgtccct
tgctggcctc tatcacctta ctttgaacca ctcctttcat ggactgagct 146520ctcattggac
tatcttttat tcttttgctg aagtttcttc actttgagtg cctctgcagt 146580tgctatttca
tggctgtggc aagccctgcc atggctttca tgcaaggatg gttcctcctt 146640ctcatctcaa
tattatctct tcagagaggg accttcccaa ctccgatgat ctaaaatcct 146700ttgtatatac
cactcactac cacttctttc ttttcttttc cttttatctt tttttttttt 146760tttttttttt
gagatagggt cttgctctgt tgcccaggct ggaatcacga ctcactgcag 146820cctcatcttc
ttgggctcaa atgatcctct cacctcagcc tctcgagtag ctggaactgc 146880aggcacacac
caccatactt ggcttattat tttacttttt gtagagacag ggtttcacca 146940aggctggtct
caagctcctg ccgcaagcaa tccacatctc tcagcctccc aaagtattgg 147000gattatagga
gtgagccact actcctggcc tattttctta ttcactgtct aaaattatct 147060tgttcattta
tttacatact tgtttatagc ttatttctca gctggacatg gtgcctcaca 147120cctgtaatct
caatactttg ggaggctggg ttggagaatt ggttgagccc aggacttcaa 147180gaccagcctg
ggcaacaaag tgagaccctg tctataaaaa attgtttaaa aattagctgg 147240gcatggtggc
acatgcctgt ggtcccagct acttgggagg cagaggtggg agaatcgctt 147300gggcccagga
ggttgaggcg acggtgagcc atgattgtgc cactgcactc tagcctagtg 147360acagagtgag
accatgtgtc taaaaagtaa ataaaaatag tttctctttc atgactagaa 147420tattacctct
atgtgggcag ggagtttgtc tatactattt ggcactatat ttcctgattc 147480tgaaattatg
cctagcacat ggtaagtact ccttaaatat ttattgactg aattatttaa 147540tacttaagaa
tttcatttgg gattatctga gtggtaagat tacggattat atttatgtaa 147600gaaaaaatca
ttttttaaac ttggttgccc tttgccacac tgacatagac actaagtttt 147660cttagccaga
ttacttccga ggatactcac agaggccatt ctcttctcaa tccccaaata 147720attgatattt
cttagcactt tcaagctaat gcaattctta gatgatgtat ctgtgtatat 147780catatcctca
ttctacaaat gtagaaattg aagtctgggc acagtggctc tcacctgtaa 147840tctcagcagt
ttgggaggcc aaggcgagcg gatcactgag gacaagagtt aagaccagcc 147900tggccaacat
ggtaaagcct tgcctctatt aaaaatacaa caattagggc cgggcgtggt 147960ggctcacgcc
tataatccca gcacgttggg aggccaaggc aggcagatca cgaggtcagg 148020agttcgagac
catcctggct aacacagtga aaccccatct ctactaaaaa tacaaaaaat 148080tagccaggca
tggtggcacg cgcttgtagt cccagctatc gggaggctga ggcaggtgaa 148140tcccttgaac
ccgggaggcg gaggttgcaa tgagctgaga ttgcaccgct gaactccagc 148200ctggtcaaca
gagggagact ctgtctcaaa aaaaaaaaaa aaaaacaatt agccaggcgt 148260ggtggcgggt
acgagtacct gtaatcccag ctactaggga ggctgaggga ggagaatcac 148320ttaaacccag
gaggtggagt ttgcagcggg ctgataatgc accactacat tccagcctgg 148380gcaacagagt
gagactctgt cttaaaaaaa aaaaaaagaa agaaagaaat tgaggaatgt 148440ggagattgtg
gtctgtgatt tgttaggaat cacacagcag gttagtagca actacagggc 148500tttggttcag
aataccacct tgacaatggt ttgtttacag ttcggctccc cttcctctgc 148560ctttctctcc
ttccttattg agggcagctg gaaagaattt tcatcattta ctagcctata 148620gctttaattt
gagttttgaa accttgataa tagagcacag aggaaaagac tgagttttct 148680ttttttgaga
cagtcttgct ctatggccca ggctggagtg cagtgacacc atctcagctg 148740gttgcaacct
ctgcctccca ggttcaagca attctgcctc agcctctcga gtagctgaga 148800ttacaggcac
gtgtcaccac gcccagctaa ttttctgttt ttgtttcgtt ttgttttttt 148860ctgagatgga
gtcttgctct gtcacccagg ctggagtgca gtggtgcgat gttggctcac 148920tcaaacctct
gtctcctggg ttcaagcaat tcttctgcct cagcctcccc agtagctggg 148980actacaggta
cgtgccacca tccctagttc atttttgtat gtttagtaga gatggggttt 149040cactatgttg
accaggctgg tctcgaactc ctgatctcag gtgatctact cgtctcagtt 149100tcccaaagtg
ctgggattat tggcacacgc ctatttttgt atttttagta gagacggggt 149160ttcaccatgt
tggttagact ggtctcaaac ttctgacctc aagtgatttg cccgccccag 149220cctcccaaag
tgctgggatt acaggcgtga gccaccgtgc ccagccaaga ttgagttttg 149280aaaagagcct
tctgagatta tgagaagggc aagcaagata acttaagaag ttacattaaa 149340atcatctaag
agacagtgta acaagaagga attgtaaaat gatgttatga gcacgtgccc 149400aatgtagtgg
caatcccttg tgcttcgata cattggtggg agacaaaact gtacttaaat 149460tgataaatcc
cttacatgtc attttaagga gcttagactg actcccatca tgtagacatc 149520agagatttct
tttttttttt tttttttttt tttttttttt tttgtgacag agttttgctc 149580ttgttgccga
ggctggagtg caatggcgtg atctcggctc accacaacct ccacctccca 149640ggttcaagca
attctcctgc ctcagcctcc cgagtagctg ggattacagc catgcaccac 149700cacgcctggc
taattttgta tttttagtag agacggggtt tctccatgtt gtggctggtc 149760tcgaactcct
gacctcaggt gatcctcccg cctcagccac ccaaagttct gaaattacag 149820gcgtgagcca
ccgcgcccag cccagagatt tctaaacaga gttctaacca gatgcttttc 149880cctgtcagta
gaatgagaat gaattggagg tgggagagac tggcatgagg gacaccagtc 149940agccagtgga
attagctggt aatgttgata ggagaagaaa aagattcaaa gttaggtagt 150000ggtagcaaga
attagaggga aggtcggatt tatgatatgt ccaaggttga attctaaggt 150060gaaatttggt
ggcagatttc atgtgtaaat tgggaaggta gattgagttt ttttaacatg 150120ggttttctaa
catgtcaata gagtgactct gcaggggggc ctgacgagag aacagtgcat 150180ggggtgattc
aacagccagt tgagccttca tgcagagcat ttaacactgt gactctgtag 150240actctggttg
gcagtaaaat ttcattaaac caatatttaa acccttaggt aataataaaa 150300attgagggaa
aaggatccag gttttgtatt ttttatgaat tcagttattg aattaaacag 150360gaccttgcct
caagaaataa tctaccaaca attaacttgt tttaaagcaa agttaggaag 150420tgagcatgtt
caaattatta aataaaaaag taagctgtgt atttcattca tagaaataga 150480ggctggccta
cttcggatga ttctcagcat gtgattacag atgtgggctt atacatccta 150540gggagttaag
gcgtactctg gcttggatag agtagagctc tttgaaactc ttctctcacc 150600cagctagttt
atatagacta gagaactaga atgtagcagc atactctgtc ttagaagccc 150660ttttatatag
gagctggtct ggaaggtttg aaaacataac aaatgtgttg gtgtctccca 150720atgtattgct
agattcttac ccaagagcat tatcctggtt agggtttggt ttggttttgt 150780tttgtttttt
aatgtttgcc acaaactaac actagatgtt agttctttca tcaagtgagg 150840agagtagaag
aaaagtccag aactctgaaa caccttttca aaagtttttc aagccatgat 150900gtttgcaagt
taaatgctct gttatgtaag caatataatc agtttttatt aatgtaacat 150960tccttagtgt
tttggggtat cacacaaaaa agaatatcca tatctggaag caacagcttt 151020taaataagag
cattgtggtg gtggtggtga tagtggtttt tttttttttt tttgagttgg 151080agtctcgctc
tgttgcccag gttggagtgc agtggcacga tctcagctcg cttcaacctc 151140tgctcccagg
ttcaagcaat tcttctgcct cagcctcctg agtagctggg attataggca 151200cctgctacca
tgcctggctg atttttatta ttttagtaga gacaggtttc accatgttgg 151260ccaggctggt
cttgaactct taacctcagg tgaatcaccc acctcggcct cccaaagtgc 151320tggaattaca
ggcatgaacc accatggcca gccaaataag agcattttta atgtaaaatt 151380atgcatgaaa
tgtacattca attttgtctt tgtttactag gatccatgtt ctcacaagct 151440atgaagaaat
gggtgcaagg aaatactgat gaggtaaatc ctacctttag gataaaaaga 151500tttctgttta
taagtgccac cctcatgtaa gtgaggttta aaattttcct tttctttagg 151560tcccatgttt
aagcagcatg gcacatttat gttctcttac ccagaatgta ccaagaaagg 151620gtggtccctt
cttaacatct aacaattgcc tggtagtagc agtgaaggta tcttcagtca 151680gaggctagga
ccactgaagg atatacatgc attcaagttt ccatcagcca gcaggcatca 151740gtaatcagtg
tgtagatcaa aagctcaaat gtttccttcc ccactggcag ttttacttca 151800agtagtggag
gcttgctttt ttaatagtta attaagtaca ttgagagatg ggaggtgaaa 151860aaaggaaaat
gttttatttt gaccatctaa tatgaaagta gttcggtgtt aggtatccag 151920tagttgacac
tggaagacag ggaatgacat gttaatattc atagccagag ggtggcccag 151980gttttttcgt
acatgggaat gaaattctta tccaaataag tagaaattat gtgcgtaagc 152040catttgttaa
gagcactgag tatgtgcatc tcgatccatc taatgaataa ccattatcac 152100cagtttaaat
tattttcttt aggcccagga agagctagct tggaagattg ctaaaatgat 152160agtcagtgac
attatgcagc aggctcagta tgatcaaccg ttagagaaat ctacaaaggt 152220aaggatgact
tcgttttgtg taaactaaaa agtattattt tccaggtgta aaaataaaaa 152280agaacataag
gggtttcttt gcctttgaag gattaactgc tgtggggatt accttcttat 152340cataagcaac
tagaaaattg acaaactaaa tgaaacaact gtttgcatat attggacaat 152400gggcaataca
gggaaaccat ggaaaccaaa cagagcccag tagtcttgct gaacgaaaga 152460gttaaatatc
aaagttcagg ccaggtgcag tggctcacgc ctgtaatccc agcactttgg 152520gaggccaagg
cgggtgaatc acttgaggtc aggagttcaa gaccagcctg gccaacatgg 152580tgaaaccctg
tcttagccgg gtgtggtggc aggcacctgt aatcccaact atttgggagg 152640ctgaggcagg
agaatcgctt gaaccaggga ggcggaggtt gcagtgagcc gagatcacac 152700cactgcactc
cagcctgggc gacgagcgaa accccatttc aaaaaaaaaa tcaaagttca 152760gagagctcaa
tttgagtaga agttgtagga taaggtagca gaaaagagga agctgcccag 152820aaagaaagcc
gtagagatat ttagagagat tcccatggat ccttggccta ggagtgatct 152880gtatatgtgt
ggggtgaaaa cgcatgtgtc caggtagaga accccccaga aattagtagg 152940ctgaatgatt
gctggaacat agggctaaga aaagttcatg gccagaagga tctggccaga 153000gtagagagac
ttagtaatac acaaggcatt gggtagtgtc ttcacagagg ttatgcctta 153060ctactgaaga
taaattagtc ctagagtaca agcacctgaa ccaagtttca aagcaaattt 153120ttaaagggtc
aaattaccta acaactgcat gccaaaacaa aggcctaacc ctctttacag 153180taacacaaca
aaattcagca cttcacagtg taaagttaga atgtctgacg tccaggctgg 153240gcgcagtggc
tcatgcctgt aatcccagca ctttgggagg ccgaggcagg tagatgacct 153300gaggtcagga
gttcaagacc agcctggcta acatggtgca accccgtctc tattaaaaat 153360acaaaaactt
agccaggcat ggtggccggc acctgtgatc ccggctactt gggaggctga 153420ggcaggagaa
ttgcctgaac ccaggaggtg aaggttgcag tgagccgaga tcgcaccact 153480gcactctggt
ctgggcaaaa agagcaaaac tcaggctcaa aaaaaaaaaa gaatgtctga 153540cgtcaatcac
aaattaccaa gcatgacatg aagttgacct ataaccagga gaaaactcaa 153600tctatagaaa
cagacccaga tgtgagaaag atgatgaatt tagcagacaa agaccatcaa 153660gtggctattt
taaatattaa aaatatgttc aagtggccag gtgcagtggc tcatgcctgt 153720aatcccagca
ctttgggagg ccaaggtggg taggagttca agaccagctt ggccaatatg 153780gtgaaacccc
ttctctacta aaaatacaaa aaaattagct gggcatggtg gcaggtgcct 153840atagtcccag
ctatatggga ggctgaggca caagaatcac ttgaacccgg gaggtggagg 153900ttgaggttgc
agtaagccga gattgtgcca cttgtactcc agcctggaca acagagtgag 153960actctgtctc
aaaaaaaaaa aaaaaaaagt taaagaaaac aagagtataa tgagaaaaat 154020gcaaaatagt
tttaaaagaa ccaaatggaa tttcttaaaa taaaaaatac cagaaatggg 154080ggccgggcgt
ggtagctcac gtctataatc ccagcacttt gtgggggctg aggcaggcag 154140atcacctgag
atcggtagtt caaggccagc ctgaccaaca tggagaaacc tcatctctac 154200taaaaataca
aaattagctg ggcgtggtgg cgcattgcct gtaatcccag ctacttggga 154260ggctgaggca
ggagaattgc ttgaacccgg gaggcagagg ttgcggtgag ctgagattgc 154320accagtgcac
tccagcttgg gccacaagag tgaaactccg tctcaaaaaa aaaacaaaaa 154380aaaacagtag
actcgaagaa ctagctgagt ttttctttac tttaggcagt aagtgtgacc 154440ttttgcaggt
gactacttta gttcctcatg tcctcattag tagatcagag aaattcgaca 154500ccaaaacccc
aaaagaaaaa ccccttctaa tcctcattcc atgattttat gaatgcatga 154560agtcctaggc
ctgcgaagga atactcattc tctttatcct gtgttgatac ctctctgctt 154620caacctccaa
ctcgacattt gcctatagga tgtacttgga cattcagcat aaactacctc 154680acaccattac
tgaattgctt catgtgcaca tgtcccatgc cacaataccg gggaccttgt 154740cttccgtgat
atttgtccgc agtgctgtga ctacaggagg gagtcagtga atgtctgcat 154800gtgtgtcttt
accatccctc ttgaatatgc tctagggtta attcctagaa gtagaattac 154860tctattgaaa
attggcaata tttttcattc taatatctat tgccaacatg ggaaagcaag 154920tctggatgcc
agtccttgtt atatgcccct tgggtaagtt acgtaacctc tttaagcttc 154980tgttcactca
tattttaaca aggaaaatta caatatttta cctcacaaaa ttgtagtcag 155040cttctggctg
tcttaaactc tggtatatag taaacactaa gtgttggtgt ccatccttaa 155100tttgtaataa
taggtcactt gttagagaaa tgcaccttac cattttcttt tcttttcttt 155160tttcagttat
gactcaaaac ttgagataaa ggaaatctgc ttgtgaaaaa taagagaact 155220tttttccctt
ggttggattc ttcaacacag ccaatgaaaa cagcactata tttctgatct 155280gtcactgttg
tttccaggag agaatgggag acaatcctag acttccacca taatgcagtt 155340acctgtaggc
ataattgatg cacatgatgt tcacacagtg agagtcttaa agatacaaaa 155400tggtattgtt
tacattacta gaaaattatt agttttccaa tggcaataac ccatttatga 155460gagtgtttta
gcctactgga atagacaggg accacatcct ctgggaagca gataagcata 155520gaactgatac
ttgatgcaca ctcgtagtgg taactcatcc ctaatcagca ttgtaaagca 155580ggtgccagag
gtggtttgct ttgtccttcc aaagcaggtg agtcagcccc accgagagcc 155640aggcagcttt
gagtggcagc gtggtgctag cagcttcagc ggaacagggt gagagttaat 155700tatgcagtct
tcttgacagc ggcattaatt tggaaggaaa ctgacaagtc atgggtcaag 155760tttcagtgac
ttcctccttc ctctgatggc agtatatagt tttcacattt taattcctcc 155820tcctgagatg
cactatactt aaaaccattc tctcccctgc taacagaagg gtgtgaatct 155880ggtttacttt
gagcattagg atttgcccct ttggaattct gcactccagt tacttaactt 155940tcccttcaga
atacatgtgg aaagaaagaa agaaatagcg atgactccac ttttgcccct 156000gtggcacctt
gaacaaagca gttcttccca aattatactt tttttttttt taaataaggt 156060gagcaggatg
actggggaga gagaaacatt tgactttgac tgcctccccc attctttgct 156120gtgagctgga
aagtgtgcag ttggtcgtct ttcttctcct ttctttagga tagtaagaga 156180ctcactcact
gcacttctgc tcagttggct tctgcatcgg gatcacacag ccatcagcag 156240gactgcccag
ttggtgagca cactccattg accacgtggc gccagcgctt cctcaatgca 156300catgattgag
aggaaagaaa gttctcttag atgttactgc ttttgctcag actttgcaaa 156360aaaaaaaata
tatatatata tgtataaata tataattatt aatcactttt gtccttgaga 156420aagtcttgaa
tgaacagaga atttattcca ttgcaatatt tgattgtata gaggcacact 156480gtttcatcga
cagaagaagc aaaaaggctt tgtgtaagtt tttggtacta tgtaccacct 156540ctgttattct
tttaaagctg aagtattcat gtacttaaac catattatat ttaattgtgt 156600ttgattttaa
aatatatata tatgaattct atttaaaatt gtgtcaactt tctgctttca 156660gggcatttat
ggctcttctg ttgaaatata ttgatctttc caaatatttt catttgcttt 156720ctaaaaaccc
agaacatgag ccactactgg actttgcctt gtgtttgaag tgtatggcat 156780aaacccaagg
tttttattag tcatctatgc tgtgattaat tcattttgtt cttttaacaa 156840aatatttcca
tccacttcac attgcttcaa tctttaacag aaaagcaata taaaggttat 156900agaataaaat
gtggttttgg gcaactcttg ctgcctctgc atgttttgga ataacaattt 156960ctacaagact
ctaggctgtt taaactagtg ctttcagtta agataaattc taatcatttc 157020tttgtatata
cattttgtgc ttctgagcta gagatgccaa gtagttgtaa actgcttata 157080aagagaatag
cagcaaattt gagactcggc tacttttttc tgccccacct gctttgagac 157140acagaagcgg
agtgtggccc gaaattatta gccagattta atatttgatc taaagtaggt 157200ccttgtactc
attttaaagt tggaatttga ttcctccaac attgagcacc caccatgttc 157260caggctctgt
gcattgtgcc cacaaaataa gattccctgg tggagttttt atgggttcaa 157320ataatcagtt
gaacaccctt catctttatc atgttgttga cattgacaca aattgtttaa 157380aaagaaaaga
tattagagag aaagtggtac ctttgtaact tgatgtgtct tcatcattcg 157440gtaagatttg
atgaaagtaa aaagcaaatg tcagccaaat ccagtgaaca gcaataaaac 157500agggagtaac
tttttataac tttttctact tggatttcaa cattcagtag agcttttcga 157560aatgtaagta
gtttacagta ctggaggttt gactagttca gtaggaattt ggaggggaag 157620gtcattctga
attgtaacaa agtacaaact tctttgctgt tttatttaag tactgagagc 157680taagcacctg
atgaagtgac tgacctctct ccagtgacag tgtttgggta cctgcctgac 157740ttcaggagtg
gggtttatgt ttctacacag tgaccttttc tctcgccctc tcctccctct 157800tgcccacaca
ccagttgatt ggacctgggt tgaactcctg atccagacag gcccaagaca 157860gttcttaatg
ttaagaattt tggggccggg cacggtggct catgcctgta attgcaacac 157920tttgggaggc
cgagacaggc ggatcacttg aggtcagggg ttcgaggcca gcctggccaa 157980catggtgaaa
ccctgtcttt actaaaaata caaaaattag ctgggcatgg tggcgcacgc 158040ctgtaatccc
agctacgtgg gtggctgaga caggggaatc gcttgaacct ggaggcggag 158100gttgtgcaat
gagccgagac cgtgtcactg cattccagcc tgggtgacag agggagactc 158160tgtctccaaa
aataaaaata agaaaaagaa ttttgggcta ggtgcagtgg ctcacgcctg 158220taattacagc
attttggaag gcccaagatg ggcagatcac ttgaggacag gagttcgaga 158280ccagcctgga
caacatggtg aaactccatc tctactaaaa agacaaaagt tagccagatg 158340tggtgatggg
cacctataat cctagctcct cgggaggctg gggcaggaga atcacttgaa 158400cccaggaagc
agagattgca gtgagccaag atcacatctc tgcactccag cctgggcaac 158460agagcaagac
tctgtctcaa aaaaaaaaga atttggccag gcgcagtggt tcacgcctgt 158520aatcccagca
ctttgggagg ccaaggcagg cagatcacga ggtcaggaga tcgagattgt 158580cctggctaac
atggtgaaac cctgtctcta ctaaaaatac aaaacattag ccgggtgtgg 158640tggtgggcac
ctgtagtccc agctactagg gaggctgagg cagaggaagg atgtgaaccc 158700aggaggcgga
gcttgcagta agccaagatc gtgccactgc actacagtct gggcgacaga 158760gtgagactcc
gtctcaaaaa aaaaaagaat tttggccggg tgcggtggca catgcctgta 158820gtcccagcac
tttgggagac caaagtgggc ggattacctg aggtcaggag ttcaagacca 158880gtccggccaa
tatggcgaaa ccctgtctct tactaaaaaa aatacaaaaa ttagccaggt 158940gtggtggcgg
gcacctgggg aggctgaggc agggagaaat gcttgaaccg gggaggcaga 159000ggttgcagta
agccaagatc gtgccactgc actccagagc aagactcttt ctcaaaaaaa 159060aaaaaaaaag
aattttgcat ggggaaggag agatactgtt caccatctgg aatggtgctt 159120ggatgtggca
cttacaaaat caggagccag cactgcatgg acaaacagaa gcatgtgggc 159180ctgagatagc
aggtaccttg ataaccctga agacatcctt ggtttctgca tctattcctg 159240catccttgca
ttggactaca ttaatctgtc agttatcctt ataatgattt ttgatttttt 159300ttttttgaga
tggagtttcg ctcttgttgc ccaggctgga gtgcaatggc acgatctcgg 159360ctcaccacaa
cctccacctc ccaggttcaa gtgattctgc tgcctcagcc tcctgagtaa 159420ctgggattac
aggcatgcgc caccacacct ggctaatttt gtatttttag tagagacggg 159480gtttctccat
gttggtcagg ctggtctcga actcccaacc tcaggtgatc accctgtctc 159540ggcctcccaa
agtgctggga ttacaggcgt aagccatggt acccggtctg ttttttgatt 159600ttttgaaacc
agtctgaagt gagttttttt aattacgtga aaggagtttg gctaaaatac 159660tgccatactg
ccctaatgcc taatgattat gtattctcag catgtctgca aagtactgct 159720gatttctgga
gaataatttt tctttagtaa acttcactta agtcgtcatg tgtattctct 159780caaaatggta
tcctaaccta atggagctaa aagacacccc ttgtttttat aacaagcagt 159840tactgaggcc
caggaagggg agaagtccct ggcttgtgag atgatcacca ttagaactca 159900ggcctgggcc
agtgcctttt catgcttctc agatccttcc aaagaataat gaagattata 159960accgctttta
gcaattgtaa taaacccaga aatagaaagc tttttggtta gagtactggt 160020agaagtttgg
cgggagagat aatttttaca aaatttgtaa atacctgcca attctatata 160080ctaggcaagg
tctctggcct tgtaaaaccc ctcaaggtta caactttggt ggcccacact 160140aatagttacc
cactgaggcc ctctccgggt gaacattgag cactagagga agcccctctg 160200cttgggcagg
actgggcgtg gtgcagagta ggagcggtga tactgtggat tctgggcagg 160260tggagatggc
cagtgatgtc caataaagga cactggaggg agcagtgtga gtaaaggccc 160320tgagggcatt
catgttcagg gagggttgct gcccactggc ttgcttggca cacaggagag 160380tgggtattcc
tgccttagta actttatgta aacaagtatt tcctcagtct gttcctctca 160440aactgcctgc
tctggcacat tcagaatgtc acagaactca cctggatgca ttcagcccct 160500tgcctaaagg
tgacagtgca tctccttccc caccccaccc ctcataccac tgaagcacct 160560gtcagactgg
cccagtctgt gggcaaggag cctagagagg gcttagtttc agcttgaaag 160620gagctgggat
ttaccaagaa gcaaatgaga gacgaggatt gcaacaactg tgccatttcc 160680ccagcttcag
ctgactcctg tatattgact gtgccttcag actcatccgt aagtgacccc 160740aggctggcct
ctcccacatc acagtaagaa ttccacacac catacaactt ggaaagaggc 160800tccagctgaa
ggaagcccca cacttctttc aagtttttct tagtcttctc ttcttggcaa 160860agagtacctt
ttgtttcttc taattatgta actattggtt tagtaaatat tcacccattc 160920agtcaccctg
taagtggcag gcactgttta cagggacaca ggaaggaata aaaacttgca 160980ggcaccttgg
agcttgcatt ctattgaaga ggtaatggaa gttgggatag cagctaaact 161040atgctggtat
tggccaggcg cagtggctca cacctgtaat cccagcactt tggaggccaa 161100ggtgggcaga
tcatgaagtc aggagatcga gaccatcctg gctaacatgg tgaaaccccg 161160tctctactaa
aagtaaaaaa aaaaattagc caggtgtggt ggcgggcgcc tgtagtccca 161220gctacttggg
aggctgaggc aggagaatgg tgtgaaccca ggaggcgaag attgcagtga 161280gccgagatgg
caccactgca ctccagcctg ggtgacagag cgagactctg tctcagaaaa 161340aaaaaatatg
ctggtagttt tgattcaaga tggcctttgg agcccatgat ttaggtctcg 161400tacccaccaa
ggtctactgg aaaacatcag gctctcctgc tatagaccca tagggagagc 161460tgcagccgag
agggggagct gaagagaagt gccccttctg tgtcctgtca gcctcatcct 161520tccgcaagga
ccagttgctg tgccactcca ttcacttgct gcaagactgg aggtttttcc 161580tcaggtgttg
agcacctggt ttacaagatg tcagcatctt gatgcctgag accatcaagg 161640caagtctctg
aacagggctt accttagagt aaggcttaga agaggccgta aagtcagtct 161700cagctccgtg
gctctgcaga gctttgggac atgtgaattc ttaaaaacaa gactattgta 161760cagttactat
atgcatgcag tataaaatta taaccttgga aaatcctagc tagctgttga 161820gctaattcca
taaagtaatc agctcctgag ttctgcagtg gtaataataa tcagcataat 161880gagtaaacac
tgtgtgtgcc aggcagcgtc tcatttgatc cttgtgataa tcttgtaagt 161940actgattttc
tcccttcttt aaacaaagtt tttttttttt ttttagagag ggtctcacta 162000tgttgcccag
gctagtcttg aattc
16202536162025DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 36gaattcctat ttcaaaagaa acaaatgggc
caagtatggt ggctcatacc tgtaatccca 60gcactttggg aggccgaggt gagtgggtca
cttgaggtca ggagttccag gccagtctgg 120ccaacatggt gaaacactgt ctctactaaa
aatacaaaaa ttagccgggc gtggtggcgg 180gcacctgtaa tcccagctac tcaggaggct
gaggcaggag aattgcttga acctgggaga 240tggaggttgc agtgagccga gatcgcgcca
ctgctctcca gcctgggtgg cagagtgaga 300ctctgtctca aaaagaaaca aagaaataaa
tgaaacaatt ttgttcacat atatttcaca 360aatttgaaat gttaaaggta ttatggtcac
tgatatcctg tttcattctt tatataatca 420ttaagtttga aatgtatact tgcactacta
acacagtagt taatcttagt cctacaagtt 480actgctttta cacaatatat tttcgtaata
tgtatgcact ggtgtttatg tacgtgttta 540tgtttatatc tgttaaaatt agcagtttcc
atctttttct attttgtacc atcacatcag 600ttcagaagga ttgacagagc aaaatgattt
gatgaagtat aaaagtcaca tggtgagtgg 660cataaataca actctgaaca attaggaggc
tcactattga ctggaactaa actgcaagcc 720agaaagacac atatcctata tgtcaagaga
tgtaccaccc aggcagttaa agaagggaag 780tacacataga aagcacaatg gtgaataatt
aaaaaattgg aatttatcag acactggatt 840catttgctcc taaagtcaga gtcctctatt
gtttttttgt ttttgtgggt ttctttttaa 900atttttttat tttttgtaga gtcggagtct
cactgtgtta cccgggctgg tctagaactc 960ctggcctcaa acaaacctcc tgcctcagct
tcccaaagca ttgggattac agacatgagc 1020cactgagccc agcccagacg ctttagcatt
tatgaagctt ctgaaatagt tgtagaaacc 1080gcataagctt tccatgtcac tttcaaagtt
tgatggtctc tttagtaaac caaccaagtt 1140attcctcaag ggcaaaataa catttctcag
tgcaaaactg atgcacttca ttaccaaaag 1200gaaaagacca caactataga ggcgtcattg
aaagctgcac tcttcagagg ccaaaaaaaa 1260aggtacaaac acatactaat ggaacattct
ttagaagagc cccaaagtta atgataaaca 1320ttttcatcaa agagaaaaga gaacaaggtg
ttagcaaatt cctctatcaa ataacactaa 1380acatcaagga acatcaatgg catgccatgt
ggaagaggaa gtgctagctc atgtacaaac 1440cagtagataa tttcaacttg ctgccgaatg
aaacctcttt gcaaggtatg aatcagcact 1500tctcatgttt gttttgcttt gttttgtttt
gtttttagag acaggccctt gctctgtcac 1560acaggctgga gtgcagtggc acgatcagag
ctcactgcaa cctgaaactc ctgggctcaa 1620gggatcctcc tgccttagcc tcccaagtag
ctgggactac aggcccacca tgcccagcta 1680attttttaaa ttttctatag agatgggatc
tcactagcac ctttcatgtt tgatgttcat 1740atacaacgac caaggtacaa tgtggaaaag
ggtctcaggg atctaaagtg aaggaggacc 1800agaaagaaaa ggggttgcta catagagtag
aagaagttgc acttcatgcc agtctacaac 1860actgctgttt tcctcagagc agagttgatg
atctaaatca ggggtcccca acccccagtt 1920catagcctgt taggaaccgg gccacacagc
aggaggtgag caataggcaa gcgagcatta 1980ccacctgggc ttcacctccc gtcagatcag
tgatgtcatt agattctcat aggaccatga 2040accctattgt gaactgagca tgcaagggat
gtaggttttc cgctctttat gagactctaa 2100tgccggaaga tctgtcactg tcttccatca
ccctgagatg ggaacatcta gttgcaggaa 2160aacaacctca gggctcccat tgattctata
ttacagtgag ttgtatcatt atttcattct 2220atattacaat gtaataataa tagaaataaa
ggcacaatag gccaggcgtg gtggctcaca 2280cctgtaatcc cagcacttcg ggaggccaag
gcaggcggat cacgaggtca ggagatcgag 2340accatcctgg ctaaaacggt gaaaccccgt
ctactaaaaa ttcaaaaaaa aattagccgg 2400gtgtggtggt gggcacctgt agtcccagct
actcgagagg ctgaggcagg agaatggtgt 2460gaacctggga ggcagagctt gaggtaagcc
gagatcacgc cactgcactc cagcctgggc 2520gacagagcga tactctgtct caaaaaaaaa
aaaaaaaaaa aaagaaataa agtgaacaat 2580aaatgtaatg tggctgaatc attccaaaac
aatcccccca ccccagttca cggaaaaatt 2640ctcccacaaa accagtccct ggtgccaaaa
aggttgggga ccgctaatct aaataatcta 2700atcttcattc aatgctaaaa aatgaataaa
ctttttttta aatacacggt ctcactttgt 2760tgcccaggct ggagtacggt ggcatgatca
cagctcactg tagcctcaat cacccaggcc 2820ccagcgatcc tcccacctaa acttcctgag
tagctgggac tacaggcacg caccaccatg 2880cccagctaat ttttaaattt tttatagaga
tgggggtctc accatgttgc ccagactggt 2940ctcaaaccct gggctcaagt gatcctccct
caaactcctg gactcaagtg atcctccttc 3000cttggcctcc caaagtgctg ggattacaag
catgagccac tgtacccagc tggataaaca 3060ttttaagtcg cactacagtc atggacaatc
aggcttttca acatgcagta tggacagtga 3120gtcccagggt ctgcttttcc atactgaaat
acatgtgata ctaaggagaa aggtgctcgc 3180aaggatattt aaaatgaaga atatttaaaa
tgaggaaaaa actgtttctt catgactttg 3240ataaggctga taaagaccat ttctgtgatc
tcaggtgatt cactcaagta gtatatttca 3300gtaatcatta tctggaacag cctgaatctt
aaccaaaata ccatgatttt ttaatgctgt 3360tatgatacct tgatgatatg accaaactgc
aatgtaggca gctaaatctc cacgagtttg 3420acttccccga gagttgacag ttttcttcac
aaattaaaga aatatatttt ttgatacatg 3480attggcatat ttaaaaacta cactgaaatg
ctgcaaaatg atataaagaa acattttcca 3540gaatcaaatg caatcaaaga gtggattagg
aatctactca ccattatcaa ctaaatagaa 3600acacttggac tgggtgtggt ggctcacatc
tgtaatctca gcactttggg aggccaaggc 3660aggtggattg cttgaggcca ggagctcaag
accagcctga gcaacatagc aaaactctgt 3720ctctacaaaa aaaaaaaaaa attaaccagg
catggtggca gatgcttgta atcccagcta 3780ctctggaagc tgaagtagga ggactgcttg
agcccaggag atcaagactg cagtgagccg 3840tggtcatgct gcgccacagc ctgagtgaca
gagagagacc ctgtctcaaa aacaaaaaca 3900aacaaaaaac acttaacctt cctgtttttt
gctgttgttg ttgttgtttg tttgttttga 3960gatggagtct cactctgttg cccaggctgg
agtgcagtgg cgtgatcttg gctcactgca 4020agctctgcct cccgggttca cgccattctc
ctgcctcagc ctcccgagta gctgggacta 4080taggcgcccg ccaccacgcc cggctacttt
tttgcatttt tagtagagat ggggtttcac 4140cgtgttagcc aggatggtct tgatctcctg
acctcgtgat ccacctgcct cggcctccca 4200aagtgctggg attacaggca tgagccaccg
cacccggcca acctttctgt tttttagttt 4260gatatgcttg ttaactcagc agctgaaaga
atgctgaaag tggccttcag taaaaaaatt 4320tcactagaat ctctacatcc atatttaatc
tgaatgcata tccagattga tcagttagag 4380caaaaacact catcatcatt cctgatgacc
tctaattctg gtttcggctt tctatttcaa 4440tggaaacaga ataaggaaag aaatggaagg
gctctggaaa tttgtcctgg gctatagata 4500ctatcaaaga tcaccaacaa taagatctct
cctataaata taaaacaagt ataattaatt 4560ttttaattat ttttttctct tcagaggatt
ttatttcaag ataaaacata acttctaccc 4620atactattga ttccaaaggt tagaaaaagt
gtttttcctc atcttatcct tcaaagaggt 4680cacagcaatg caaacatcta taaaatgcct
ctgcataatt gtcagaagct atagtccaga 4740aatcattgaa aatgcttttc cattttaagc
ttaggtgagg tgtcttagga aacctctatg 4800acaacttact ctatttattg ggaggtaaac
tcccagactc tcccagggtc tcctgtattg 4860atctcatttt ttaggcttcc taatcccttg
aagcacaatc gaaaaagccc tggatctctt 4920ttctgcacat atcatcgcgg aattcattcg
gcttccagca agctgacact ccatgataca 4980agcggcctcg cccttctccg gacgccagtc
cttgctgcgg ttagctagga tgaggggttt 5040gctgggcttc agtgcaggct tctgcgggtt
cccaagccgc accaggtggc ctcacaggct 5100ggatgtcacc attgcacact gagctcctgg
caggctgtac caatttttta attatttaat 5160atttattttt aaaattatgg tgaatatttt
ggtattctgc tctaaaatag gcccataaat 5220gcacagcaga tatctcttgg aacccacagc
tttccactgg aagaactaag tatttttctt 5280ttaaagatgc tactaagtct ctgaaaagtc
cagatcctct acctctttcc atcccaaact 5340aagacttgga atttatgaga gatctagcta
acagaaatcc cagacacatc attggttctt 5400cccagagtgc agtcctccta aagaggctca
gccctaagca ggcccctgca ccaggagggt 5460gggtctgaga cccacatagc acttcccaag
gtgcatgctc cagagaggca ctgaaacagc 5520tgagcacaag cctgcaagcc tggagaactc
tcacagtcag aacggagggg gcccagtggg 5580actaacataa agagaaaagg gaacacagag
aaatggatgg caccaacaac cagcaaagcc 5640ttcatggcca atgaaagcat cagtgacggg
gccagaaccc tcatccccaa agactcttca 5700ctgcctttag tgaaaaacaa tggctagaga
gtgaagttat gatcatgtat agagaggtaa 5760agttacattt ttatattctg actctgctaa
tgtgaaattc cctatctgct agactaaaag 5820tttcagacac cctgttcaaa tatcccatta
gttgctagag acttaaaatg aacagaacgc 5880acattgtcag gatgactatt accaaaaaat
caaaagacag caagtattgg tgaggatgta 5940gagaaactgg aacttttgtg cactgtttat
gagaatgtaa aatggagcag ctgctgtgga 6000aaagagtatg caggttcctc aaagagtaaa
accaagatgt ggaaacaact aaatgcccat 6060cagtggatga aggggtagac aatatgtggt
atatacatac catggagtac tattcagcct 6120ctaaaaaaaa aaaaggaaat tctataacat
gcaacagcat ggatgaatct tgaggacatt 6180ttgctaatga aataaggcag tcatagaaag
acaaatactg cacgactcca cttatatgag 6240ataccaaaaa tagacaaatt catagaatca
aagagtacaa tggaggttac ctggagctgc 6300agggcgggaa acgaggagtt actaatcaac
gaacataacg ttgcagttaa gtaagatgaa 6360taagctctca agatcagctg tacaacactg
tacctagagt caacaataat gtattgtaca 6420cttaaaaatt tgttaagggt agattaacaa
atgtagtaga tccacaaatg tggttaagtg 6480ttcttaccac agtaaaataa aaaaagaata
tcaagcccag gagttcgaga ctagcctggg 6540taacatggtg aaaccctgtc tctacagaaa
atacaaaaat tagccagctg tggaggtgca 6600ctcctaggga ggctgaggtg ggaggcttgc
ttgagcccag gaggtcaagg ctgcagtgag 6660ccatgattgc accactgtac tccagcccag
atgacagagc aagacaccac cccccccaaa 6720aaaagaaaaa gaatatcaaa cattttaaaa
gatcagatac gcaagaacaa caacaaaaaa 6780gagatgaaca gagcatcgac cctcatctag
tgggattctt ggtctaactg aaaaacagac 6840attgagagac aaacaatgac agtgatgtga
tcacagcaat tacacaggta tcccctgggg 6900actgcagaag aaaggaggaa tgcctaactt
tcagaaaata gagaaagcgt caaacagttg 6960gtgaaagcct tccaaaacta gagagaactg
cacacaccaa atcacagaaa gaagaaaagc 7020cgtgggagat tctgggaccc accggctatt
tttgatggct gaacaccctg ctgcaggaga 7080gacaggagct ggaaagcatg gtgggatgaa
acctcaaaca gctttgcctg cattgcttaa 7140gatgactggg cttgattaac tctagtcaat
ggggacaatt caatcaaaga agaaagatgc 7200tcaaattcac attttagaat gattttttat
ggcagtatgg ggaatagatt aaaagagagt 7260gaagctggag gcaagaaact tgttaagagg
caactgaaac agtctagatg ataaataata 7320aactgacaga gtgactagaa aaatcagaac
aggctgaatc aacagatacc tagatgaaaa 7380taacaggact tgatcaccag ttgtatcttg
gagaggaagg agttgtttcc ttgctttccc 7440tacgactggg aatacggaag gtttgccgtg
tgtattggtt atatactggt gtgtagccaa 7500tcactgacaa ccatttagca gcttaaaaca
caaaggctta tctcccagtt tctgtgggcc 7560aggaatctaa gataggctta gctggctggt
tctggctcag agtttctcaa gaggttgcaa 7620tcaagatgtc agctggggtt gcatcatctg
aaggctcaac tggggccgga gggtccactt 7680ccaaggagtt cactcacctg cctgacaagg
cagtgctggt tgttggcagg agatctcaat 7740tcattgccaa gtgagcctct ctatagcatt
gctggaacat cctccccatc tggcagttgg 7800cttctctcag catgagtgat ctgagagaga
gagcaaggag gaagccacag tgttcttcct 7860actcctactc ctaacactat ggacctactc
ctaacactct cacttctgcc ttattccatt 7920agttagaaag ggaactaagc tccacctctt
gaaataagaa gtgtcaaaga atttgtggat 7980atatttaaaa atcatcacac tgtggaagtg
gatagggggt tcaattaatg ctgaacttga 8040aatgcctgag acattcaaat gtccaacagg
caatgaacat acccatagat ggtcatgact 8100ttagcaagaa tagaggaaga tcacagaatt
aaggaggaat tgaaaggtaa aagaagtgga 8160gtcagattcc ccctgaaaag tgagccatga
aaggaacttt aactattgag ttagaggtca 8220gagtaggaaa tttcggtgga attctttttt
aaagaaagga accatataag catgttttga 8280ggtagaggga gaataaatca gtagacaggg
agaggtaaaa aacataaatg ataggggata 8340gttgacaaag gtcttggcag aatcccttac
ccattgactt ggggccaaga gagggacact 8400tctttgtttg agggataagg aaaataagaa
agaatgggtg ctatttagtg tggtcctgtc 8460tctagggcaa acgcataggt aacaaactgt
gtgtgttagg aatatagatg tgacctcaca 8520ttgagattct cacctcaaat ccattttgtt
gttacctgta ccttcctacc ttctcttttt 8580gctacatgca gactgctgtt ttgtcttcct
ggcctgttcc aggtttcagc attctggcat 8640atctgctacc ctgttcccaa acctctctag
agtccatgct ccttccttgg atagtgtttg 8700attgggccac gtatctaaga agtgatgcct
tcagttaggc ctgagaacct cctctatgga 8760aatctccatc agtgaccctg acagacttgg
tatcttggag atgtcactgc tcccagcctg 8820tggtctagga gaatctcagc ctgggcctct
agtagtatgg ataaggcgtt aaggtatctt 8880tgaaccagag tctgtcatat tcctcaatgt
gggacagata aaacagtggt agtgctggtg 8940tttctgagct agaactctgg tttttggtct
agattctttg atgtatgacc tttcagaggt 9000attaaaattt gttctaatac aatgttcaat
acaaatgtag ttccttttct gttaggacct 9060caacaaaaca tgaccaactg tagatgaaca
ttaaactatg acaattcatg gaaatgaata 9120cagtaatacc tgcggttccc ccattttagc
agtcactatg gtgacatttg gcacaaatgg 9180ctatttaagg gtgcttttgt taaaacctac
catcttacta ggcacatgat attgaaacta 9240atgaaataat ggagaaactt cttaaaaact
tttaatgaat aaagtgatga agtgataata 9300ttttagctgc tatttataaa gtgactatta
caggtcaaac attcttctag ggtttttttg 9360ttgaagttgt cacatttaat ccttaataac
ccactatgag tcaggtattc ttctctcccc 9420tttggacagt tggggaaatg ggggtcagag
aggttaggta atttgctcag ggccacacaa 9480cctgcatgta gaaaatctga gatttgtaca
ggaacgtatc aaactctgaa gtccatgctt 9540ctattttccc atgctgcctt tctaataaaa
ggtaactaat gctactggat gctgccccca 9600aagtgagtca ctttcacccc accctacttg
attttctcca taaaactaat cacatcctga 9660caacttattt attgctgatc tcccccacta
gattataaac tcaataaaag caagatcctt 9720gtctgctgaa tatcagtacc taaaacgctg
tctagcacag agcaagtaat taatatttgt 9780tgaatgaaca aataaaggaa aaaaattcaa
aggaagaaaa agccctaaaa cagatgttta 9840cctaaacata cattttaaaa gaaagcatat
aacaaattca ggacagaatt taaatttgat 9900tttttaaaga aataaccaag tgctagctgg
gcacagtggc tcacacctgt aatcctagca 9960ctctgggagg ccgaggcagg cagatcactt
gaggtcaaga gttcaagacc agcctggcca 10020acatggtgaa acctgtctct actaaaaata
cagaaattat ccaggcatgg tggcaggtcc 10080ctgtaacccc agctactcag gaggctgagt
caggagaatt gcttgaaccc aggaggcaga 10140ggttgcagtg ggccaagatt gcaccactgc
actccagcct gagtaacaaa gcaagactct 10200gtctgaagga gaaggaaaga aagaaggaaa
gaaggaaaga aggaaagaag gaaagaagga 10260aagaaagaaa gaaagaaaga aagaaagaaa
gaaagaaaga aagaaagaaa gaaagaaaga 10320aagaaagaaa aagaaagaaa gaaagaaaga
accaagtgct tatttgggac ctactatgct 10380atgtttttcc atgcacgcta ttttcagtaa
agcagttagc aaacttgcaa gatcataaca 10440acaaatatat gcttctataa ctctaaaatt
gtgctttaag aagttcctct ttaccagctc 10500atgtatgcat tagttttcta agagttacta
gtaacttttt ccctggagaa tatccacagc 10560cagtttattt aaccaaagga ggatgcttac
taacatgaag ttatcaaatg tgagcctaag 10620ttgggccagt tcatgttaat atactccaga
acaaaaacca tcctactgtc ctctgacaat 10680tttacctgaa aattcatttt ccacattacc
aaggagccag ggtaggagaa tatagaaaga 10740ccacccaaga atccttactt ctttcagcaa
aatcaattca aagtaggtaa ctaaacacat 10800gccctaacaa tgaatagcag attgtgctca
gaagaatgat ctacaacatc ttactgtgaa 10860ggaactactg aaatattcca ataagacttc
tctccaaaat gattttattg aatttgcatt 10920ttaaaaaata ttttaagcct aaattttaaa
aggtttgata ttggtacatg aatagacaaa 10980cagacatgga ctagaccaag aattaggttc
aaacatatac aggaatttaa tatacgataa 11040atctagtatt ccaaaggaac caacaaatgg
tgttcagaca gcaggatagg catcaggaaa 11100aacacagttg ggcaccctac cttactccta
acaccaggag taactgaagg agcaccaaat 11160atttatttat tttaattata gttttaagtt
ctagggtacg tgtgcacaac atgcaggttt 11220attacatagg tatacatgtg ccatgttggt
gaggagcacc aaatatttaa aagaaaaaaa 11280ttggccaggg gcggtggctc acacctgtaa
tcccagcact ttgggaggcc aaggtgggca 11340gatcacctga ggtcgggagt tcgagaccag
cctgagcaac atggagaaac cccatctcta 11400ctaaaaatac aaaattagcc aggcatggtg
gcacatgcct gtaatcccag ctacttggga 11460ggctgaggca ggagaatagc tttaatctgg
gaggcacagg ttgcggtgag ctgagatatt 11520gcactccagc ctgggcaaca agagcaaaac
ttcaactcaa aaaaattaat aaataaataa 11580aaataaagaa agaaaagaaa aaaatgaaaa
tagtataatt agcagaagaa aacaccgtag 11640aatcctcgga ctcttaggat ggggaatgcc
tataatataa aaaccctgaa gttataaaag 11700agaaaatcac ctacatacaa accaaatctt
tctacatgcc taaaacatag cacaaacaca 11760gctaaataat catagctgaa tgaactggga
aaacaaaact tgactcatat ccagacagag 11820ttaattttcc tacacataaa gagtacctat
ataaacccaa caaaaaaacc accactaacc 11880caaaataaaa atgtgacagg taatgaacag
gtagttcaca gagaatacaa atggctcttc 11940ggcacataag atgctcagac tgacttttac
ttatttattt tttgagagac agggtctcac 12000gatgttgccc aggttaggct caaactcctg
ggctcaaatg atagtaccag gactacaggt 12060gtgccccacc gcacctggct cctcaaccac
ctgtattaac aggaaatgca aaataaaact 12120ttcaaatcta ttttacctat tagaatggca
aaaatttgaa aaacttcaaa catcatcatg 12180ttggtgagaa tgtgaggaga ctggcactct
cattttttgc tgatagcata tatatactga 12240tggcttctat ggaaagcaat ctggcagcgt
ctatcaaatg tacaagtgca tatatccttt 12300gacaaagcaa ttccactcta ggaatgtgtt
ctatatggtt gtgcttcctg gggctgggaa 12360ctgggagcta agggacaggg gcagaagata
atcttctttt ccctccttcc ccgttaaaca 12420tgttgaattt tatatactgt aatatattat
ttttcacaaa agataatttt taagcgatat 12480gtctgggaat tttttttttt cttttctgag
acagggtctc actctgtcat ccaggctgga 12540atgccatggt atgatctcag ctgactgcag
cctcgacctc ctgggttcaa gcaatcctcc 12600cacctcagcc tcctgagtag ctgggactac
aggcacgtgc catcatgcta atttttgtat 12660atacagggtc tcactatgtt gcccaggcta
atgtcaaact cctaggctca agcaatccac 12720ccacctcagg ctccaaagtg ctgggattac
aggcgtgagc caccgcgcct ggccctggga 12780attcttacaa aagaaaaaat atctactctc
cccttctatt aaagtcaaaa cagagaagga 12840aattcaacct ataatgaaag tagagaaggg
cctcaaccct gagcaacaaa cacaaaggct 12900atttctgaga caggaatttg ctgaacaaaa
tcgagggaag atgacaagaa tcaagactca 12960cttctcggct gggcgcagtg gctcacacct
gtaatcccag cactttggga ggccgaggcg 13020gacagatcac gaggtcagga gattgagacc
atactggcta acacagtgaa acccagtctc 13080tactaaaaat acaaaaaatt agccgggcgt
ggtggcaggt gcctgtagtc ccagctactt 13140gggaagctga ggcaggagaa tggcgtgaac
ccaggaagcg gagcttgcag tgagccgaga 13200tcacgccact gcactccagc ctgggtgaca
gagcaagact ctgtctcaaa aaaaaaaaaa 13260aagactcatt tctctagatc ttgagccgta
ttcaaattta tctcagctta gtgagaggtt 13320aaagcaagga atatccttcc ctgtgggccc
tgctccttac tgaaggaagg taacggatga 13380gtcaaggaca ccaatggaga aaagcactaa
caccattatc tgatgaacat tacgtgaaga 13440agggtaagaa gtgaagtgga attgctgaag
aagtcagtga aagcggacat tcatttgggg 13500aaatggaata taggaaatcc ataaaagtga
ttaaaaagat gttagaggct gaggcggggg 13560gaccacaggg tcaggagatc gagaccatcc
tggctaacac ggtgaaaccc catctctact 13620aaaaatacaa aaaattagcc aggcgtggtg
gcaggcacct gtagtcccaa ctactcggga 13680gactgaggca ggagaatggc atgaacctgg
gagacggagc ttgcagtgag ccgagatcac 13740gccactgcac tccagcctgg gtgacagagt
gagactccat ctcaaaaaaa aaagttagat 13800acgagagata aagatccaac agacacacaa
ctgctaattc tgaacagaac aaaacaaatg 13860gcacaggaaa agaaaattta agatataaca
ccggaaaact ttcctgaaat tgagtaactg 13920aatctatagc ttgaaagggt ttagcatatg
ccaagaaaaa tcagtagagt ccaaccagca 13980caagacacat ctagcaaggc tggtgattct
accaacacag agaaagaagt gggtgaccca 14040taatgcggaa aaaggcagac catctgcagt
cttctccaga acactggagt ctgaagacaa 14100aagaatgctg cctactgagc cagaagggag
agaaagtgac ccaacacatc tttaccaagt 14160tagaatgtca cgcattattt aaaggctgca
aaagccatga aagacatgaa agaacacaag 14220catttacaac atgaaagaac acaagcattc
tcatactcaa gaatccttaa gaaaaatgta 14280gtcctaatcc agcccactga aagttaaatg
tacttaatgt gctcattaat gggaacttca 14340tagcttcaaa tcagtctggt cccatctacc
aacatctctc gcccggcttt cctgcaatag 14400tcagcacctt tccctcctcc cagtcttgtc
ccctggagtc tgctctcagc atagcagagt 14460gaccacatca acacccaagt cagagccctc
cagtgcgcac tggtctacaa agcccttccc 14520accccccacc ccacgtgccc tccggatcct
tgtgacgtgt ctcctgcata ccctagcagc 14580cctggcctcc tcactgcccc tcctgtacat
caggaaggcg actccttgag tcttggctct 14640ggccgcctcc tccacctgca gtgagttaac
tcccttacct actctaggtc attgctcaaa 14700tgtcagcatc tcaatggggc cctccctgac
taccctattt aaattctaca tactcccctt 14760gaccccatgg acctcactca ccctattcca
cttttattct tacaatttag cacttgttct 14820cttctaacgt attctaagac ttactcattt
attacattgt ttgccacccc ctctagtaca 14880taaactccag aggggcaggg atttctgtct
atttattcat ttctttatcc ctaggacata 14940gaacagggca tagttcagag tattcaatgt
tatcaatgaa tgaactagca gtagtaccag 15000ttccagttag gcacagaatt aaatctaaat
agaattaaat ctcatggtct gggttaacta 15060tggatagaaa attagatata attttaagaa
gcctagaaag aaaaaattaa taatgtaaaa 15120ataatattaa tttgataata ataacaaaaa
ctctgccagg cactgtggct caaatctgca 15180atcccagcta ctcaggaggc tgaggtggaa
ggatcacttg agaccagagt tcaagactca 15240gcctaggcaa cacggcaaga aactgtctct
aaaaaaatta aaacttaaat ttttaaaaaa 15300gaattctcaa agcgtcacaa aaactggaga
ttaaggtaca ggaagtgtga agtaatatta 15360ctatgctaat ggtttttttt ttttttagaa
aggtataacc aaaagatttc tttctcaagt 15420cgataaactg agaaagataa gcatatcttc
caattaacag agggggagga aaagccagat 15480acaacaaaat aagatataaa ttagtttcca
gttgaaaaca agagtaggag ttattttgca 15540tcacctcacc tgtgacctcc cccagcccaa
aaaacactac tgataaacag ggtagaaaag 15600catcatctca gataaagcag gaaaaactgc
cacagtctca aaccacaaac tataagcaca 15660cacctggcca accctgccaa gtctgggctc
agtaggagga acgtgctgag agctaggatg 15720taccaactta gacattctgt gggatacaga
tgtccctgga agggtcacac catctcaaag 15780gcacctgtaa tgcccactga ttacagccac
catatgtgag agagaaactc agggcactta 15840gagagtataa caagaacctt atgtcatctg
agatgaggaa tcctcagccc tgcaaattaa 15900ccaactcttt agaacaactg gcaaaacata
aatatccaca acttttgttt cagtaattcc 15960actcttagat atcaatccaa agtacatgag
acagcagata cacacacaaa atggtattta 16020ctgcagcatt gtttataata gcaaaaaaca
agaaataatc catatgtctc aataggatac 16080tgggtacatg agggtatgta cccatcattc
aaccatcaaa aagagtgata tggatgtcca 16140cagatggaca taaaaagctg tgtgttacgt
gaaaacaaac tcaagcagca gcaggatggg 16200cttatgatag tcagtatgag ctaatttctg
gaaaaaaaaa tctagtgtgt gcacagaaaa 16260catctgaaag aacagaaaca aaactatcag
cagaatattg agatgtttta ctaagttgta 16320tatctatact gcttgtaatt tttaccccaa
gcaagaatta ctttttggaa aaagaaaatt 16380caggaaataa agcatttctt taaacttcat
gtttaaacaa atggtgatgg aataaaagag 16440ttcttattca tcataaacac acacagcaca
catgcacgca tgtgcgtgag cacacccttt 16500acttgataaa taccatgttg aatattttag
tctttccttt taggttctat cccttcactc 16560aaaatgcggt tataaataaa tgtacttttc
atgtgccttc tgcctaaacc cactttaata 16620taactttaca gtcccattat cattatagtc
tcaaagctag actcagcctg aaactaccct 16680ttcatttgga acccttatta aaatgccaca
tacagctcct tcaaataaaa acaaacccta 16740ggacctgaca ctaggcttcc tttgttgcta
ctcataatgg ccaagttctg tgcttataat 16800acatcttctt tcattttatt gctacatatc
caagggtttt atatgttttt cttattatat 16860cttaattcaa aacaccatca cgctcttttc
cagatgaaaa taaggaaaag aaattgagca 16920actgactgac ttaaaggtca taaaactata
tagtagcaga gtcagcaaaa gaagaaacac 16980acatctccca agtagaggct gaaaaccagt
accattcacc tccagggtga gctatataca 17040gattacaaag tcaccttctc taaatgttca
aactgaatcc catacccata ctttaccact 17100acctcgtaag aacagcctca gatcttgtta
tagccttttt tttagcatgc tgaagccaat 17160aaaatgcttc ccattcagca agagaaacaa
gttctgaaac actgaataat ctgcccaggg 17220cctatgaaca tttccactgt gagaaatgtt
ctccactgtg tggagaagat ccttactctt 17280ctccacacag gcagaacatt agaaaaattc
ttggattcta tgatgcacag cttaggagtc 17340tgtttagcac aatttaagtc caaatagtta
ttaaatcctc ctctgttcca gaaacagtgc 17400taaatactgt gaatataaaa attgaaaaga
tactctcctg gctcccaaga aagtcagcca 17460gatagaggag acacaggcac acaaatcact
gtcacatgaa gctctacctc cctaacttca 17520aacgagggcc taagtcacca agaatacagt
agcagttgtg actacgagta actactataa 17580ttcaatactt tatcttccct tagaaaactc
ttctcccttg gaaatttatt tgcatttcta 17640aataccattc cttactaaaa ggaagcaggg
ctccttgggg aaatagctga ttctaggtgt 17700ggactatgaa atgaaaatgg tgagtctggg
acatcccatg ttgcccagaa atcaaggaac 17760tgcccaaaga ttaacagagt catgttaaat
ggacctaaga gtgaaccaga aggagctcac 17820tttgccccgc gtggaacaat ttcaagaaaa
acatgacagt aatgaattat aaaacatgaa 17880ttaaaataca tattggtact aaaaagagaa
caaaaggatg tggctttgga taaagctctt 17940cttcatggaa gaataccagc taataaatgt
aaaggaaatg agagaattag aaaaattatc 18000attttgtaaa ccttaatata ttcacctaga
catgctaaaa ccactgagta aaaggctgct 18060tgggaagagg atgctcacat gatctcagag
tttcacacca cagataattt attagataca 18120ggaaggaaga tgtgatcaag cttcctgtga
cccccagcca ggccccacaa cactatgtgc 18180ctccttgtga tgtgggagct acacagcatc
gcccacacag cttctcgcca aaactgtttg 18240aagctaatca caagggaaga actggacagc
ttctgaccat gagacgctcc accagacaac 18300ttgcttggcc tctccaaaga aacttgcttg
gcctctccaa agaaaactca gtttcattta 18360aaaacaaaac taattattta aaaacaaacg
aaaagcaagt tgtggacttg agctccaggg 18420acagagcaga catacttttc cctgttcttc
ccagtaagtg gtaataaaaa ccctcaacac 18480tagatataaa acaaatataa gaaggttctg
gaaggggaag aggaggcaga ctatccaggt 18540gccttgaggc ccacagaaca acccagtgat
gggttcactg ggtcttcttt ttgcttcatt 18600atctcagact tggagctgaa gcagcaggca
acttcaaaac accaaggggc acagattgaa 18660aagccccaag aaaagcctgc cctctctagc
caaaggacca ggaaggagac agtctaatga 18720gatggaacac atttagacag taactgccca
tttaccagca ataactgagc agggagccta 18780gacttccagt cttgtgagga cgtaccaagg
tacccaacac ccccaccaag gctgagtaag 18840gactgcgact tttatccctg catggcagta
gtaaggagcc catccctcac ccgccagcag 18900tgtcagggga acctggactt ccactcccac
ccaggagtga tgaggccctc cctgctgggg 18960tcatgtcaga ggaggcctag tggagattca
gtgacttaac cttttcccag agataatgag 19020gccacctttc ctccctcttc ccccatggtg
acagtgaaag cactgtggca agcagtaggc 19080actcctaccc ctcctagcca gggaggtatc
agggaggcca agtagggaac cagaataccc 19140acaaccaccc agcagcaaca ggggtccccc
accccattgg gtgtcaatgg aagcagagcg 19200gaaagcctgg atatttaccc ccatctagaa
gtaacaagct gatgtccccc ttcttctact 19260acaatggtgt tcaaaacagg tttaaataag
gtctagagtc tgataacgta atacccaaat 19320cgttgaagtt ttcattgagg atcatttata
ccaagagtca ggaagatccc aaactgaaag 19380agagaaaaga caattgacag acactagcac
taagagagca cagatattag aactacctga 19440aaggatgtta aagcacatat cataagcctc
aacaggctgg gcgcggtggc tcacgcctgt 19500aaccccagca ctttgggagg ccgaggcagg
tggatcacaa gatcaggaga tcgagaccat 19560cctggctaac acggtgaaac cccgtctcta
ctaaaaatac aaaaaaaaat agcaaggcat 19620ggtggtgggc acctgtagtc ccagctactc
gggagcctga ggcaggagaa tggcatgaac 19680ctgggaagag gagcagtgag ccgagatcgc
accaccgcac tccagcctgg gcaacagagc 19740aagacttcgt cccaaaaaaa aaaaaaaaaa
aaaaaaaagc ctcaacaaac aactacaaac 19800gtgcttgaaa caaatgaaaa aaaaatcttg
gcaaagaaat aaaagatata tattttggcc 19860aggtgcagtg gctcacagcc tgtaatccct
gcactttggg aggctgaggc aggcggatca 19920cctgaggtca ggagtttgag accagcctga
ccaacatgga gaaaccccgt ctctactaaa 19980aatacaaaat tagccagtca tggtggcaca
tgcctgtaat cctagctact caggaggccg 20040aggcaggaga atcgcttgaa ctcaggaggt
ggaggttgcg gtgagccgag atcccgccat 20100tgcacattgc actccagcct gggcaacaag
agcaaaactc catctcaaaa aaatagatac 20160atattttaat ggaaatttta gaattgaaaa
atacagtaac caaattgaat ggaaagacaa 20220catagaatgg agggggcaga caaaataatc
agtgaacttc aacagaaaat aatagaaatt 20280acccaatatg aagaacagaa agaaaataga
ctggccaaaa aataaagaag aaaaaagagg 20340agcagcagga ggaatgatgg aaaaagagaa
aggaaggaag gaagggaagg agggagggaa 20400ggagtgaggg agaaagtctc aaagacctct
gagactaaaa taaaagatct aacacttgtc 20460atcagggtcc aggaaagaga caaagatggc
acagctggaa acgtattcaa aaaataatag 20520ctgaaaactt cccaaatttg gcaagagaca
taaacctata gattcgaaat gctgaacccc 20580aaataaaaag cccaataaaa tccacaccaa
aatacatcat agtcaaactt ctgaaaagac 20640gaaaagagaa aacgtcttga aagcagtgag
tgaaacaaca cttcatgtat aagggaaaaa 20700caattcaagt aacagatttc ttacagaaat
taaggaagcc agaaggaaat gacacaatgg 20760ttttcaagtg ctgaaagaaa agaagtgtca
acacaaaatt ctagattcag taaaaatatc 20820cttcaagaat caatgggaaa tcaagacagt
ctcagataaa gcaaaataag agaatatgtt 20880gccagcagat ctcccctaaa ggaatggcaa
aaggaagatc atgcaacaga ccaaaaaatg 20940atgaaagaag gaatccagaa acatcaagaa
gaaagaaata acatagtaag caaaaataca 21000tgtaattaca ataaaatttc tatctcctct
taagacttct aaattatatt gatggttgaa 21060gcaaaaatta taaccctgtc tgaagtgctt
ctactaaatg tatgcagaga attataaatg 21120gggaaagtat aggtttctat acctcattga
agtggtaaaa tgacaacact gtgaaaagtt 21180acatacacac acacacgtaa gtatatataa
atatatgtgt gtatatgtgt gtgtatatat 21240atatatacat ataatgtaat acagcaacca
ctaacaacac tatacaaaga gataataacc 21300aaaaacaatt tagataaatt gaaatggaat
tctaaaaaat attcaaatac tctacaggaa 21360gacaagacaa aaagagaaaa aaagaggagg
acaaactaaa ttttttaaaa acataaataa 21420aatggtagac ttaagcccta acttatcaat
aattacataa atgtaaatga tctaattata 21480tcaattaaaa gacagagata gcagagttaa
tttaaaaaca tagctataag aaacctgctt 21540tgggctgagt gcagtgactc acacttgtaa
tcccagcact tcgggaggcc aaggcgggtg 21600gatcacctga ggtcaggagt tccagaccag
cctggacaac atggtaatac cccatctcta 21660ctaaaaatac aaaaaaatta gccaggcatg
gtggcacacg cctgtagtcc caactactca 21720ggaggctgcg acacaagaac tgcttgaacc
cgggcagcag aggtagcagt gggccaagat 21780tgcgccactc cagcctgaac gacagagtga
gactccacct cagttgaaaa acaaaaaaga 21840aacctgcttt aaatatacca acatatgttg
gttgaaatta aaagaataaa atatatcatg 21900aaaacattaa tcaaaagaaa ggagtggcta
tattaataac ataaaataga cttcagagaa 21960aagaaaattt caagagacag gaataaaagg
atcaagaaaa gatcctgaaa gaaaagcagg 22020caaatcaatc attctgcttg gagattcaac
accctctctt aacaactgat agaacaacta 22080gacaaaaaaa tcagcatgga gttgagaaga
acttaacacc actgaacaac aggatctaat 22140agacatttac ggaacactct acccaacaat
agcaaaataa acattctttt caagtattca 22200ctgaacatat ccttagaccc taccctgggc
cataaaacaa agctcactag tgattgccga 22260aggcttggat ggacagtgga agagctgcat
ggggagggag aaggtgacag ttaaagagtg 22320taggatttct ttttgggata atgaaaatgt
tccaaaattg attgtggtga tgttggcgca 22380actctacaaa tataaaaaag gccattgaat
tgtacgtttt aagtgggtga aacatatggt 22440atgtggatta tatctaacgc tttttaaaaa
cttaacacat ttcaaagaat agaagtcata 22500cagagtgtgc tctactggaa tcaaactaga
aagaggtaac tggaggataa cgagaaaagc 22560ctccaaatac ttgaaaactg gacagcacat
ttctaaaatc atccgtgggt caaagatatt 22620catttctgat attcattttt attgtttaat
gtatttttaa aaatttctta agggaaataa 22680actgactaaa aatgaatatg gctgggtgcg
gtggctcacg cctgtgatcc cagcactttg 22740ggaggccgag gctggtggat cacaagatca
ggagttcgag accagcctgg ccaagatggt 22800gaaaccccgt ctcaactaaa aaactacaaa
aagtagccaa gcgcagtggc gggagcctgt 22860ggtcccagct acttgggagg ctgaggtagg
agaatcgctt gaacacaggc agcagaggtt 22920gcagtgagcc aagattgtgc cactgcacgc
cagcctgggc gacagagact gcctcaaaaa 22980aaaaaaaaaa aaaaagaata tcaaaatttg
tgggacatag ttaaagcaat gctgagaggg 23040aaatttataa cactaaatgt ttacattaga
aaagagaaaa agtttcaaat caatagtctc 23100cactcccatc tcaagaacac agaagatgaa
gagcaaaata aacccaaagc aagcaaaaga 23160aagaaaatat aaaaataaat cagtaaaatt
gaaaacagaa acacaataaa gaaaatcagt 23220gaaacaaagt actgattctt cgaaagatta
ataaaattga caaacctcta gcaaggctaa 23280caaacaaaaa agaaagaaga cacggattac
cagttattag aatgaaagca taattagaaa 23340caactctaca cattataaat ttgacaatgt
agatgaaatg gactaattac tgaaaaaaca 23400caaattacca caactcaccc aatatgaaat
agataattgg gatagcctga taactactga 23460gaaaattgaa tttgtaattt taacactctt
aaaacagaaa cattaaactt aatattttat 23520aaatattaga taaggtaatt atacccttcc
ttaacaaata aaaacgacaa attattttgc 23580agctaaagag atgtatgtac tgtgaaaaat
atcttcagaa aaatagaact ttgtttgaag 23640aataaggatt taaaaaatgt ttttaactct
caagaagcaa atatctgggc ccagatggtt 23700tcactgaaga attctaccaa atgtttaatg
aagaattacc accaactcta catagcatct 23760ttgagaaaac tgaagagaag ggaacatctc
ccagttcatt ttatgaagtg ggtgttactc 23820tgatactaga actgtataag gacagctact
cttgacacac tgcctatggg tagctctgct 23880ctgcaggaac agtcagaaaa aaaaaaaaaa
gaagcactgg acaagggcag tataaaaaaa 23940gaaaactggg ccaggtgcag tggctcacac
ctgtaatctc agcactttgg gaggctgacg 24000ctggtggatc acctgaggtc aggagtttga
gactagcctg gccaacatgg taaaaccctg 24060tctctactaa aatacaaaaa ttagccaggc
agggtggtgg ggaaaataaa aaggaaaaaa 24120aaacaaaaat aaactgcaga ccaatatcct
tcatgagtat agacacaaaa ctccttaaac 24180tccttaacaa aatattagca agtagaagca
atatataaaa ataattatac accatgatca 24240agtgggactt attccagaaa cgcaagtctg
gttcaacatt tgaaaacaag gtaacccact 24300atatgaacgt actaaagagg aaaactacat
aatcacatca atcaatgcag aaaaaagcat 24360ttgccaaaat ccaatatcca ttcatgatac
tctaataaga aaaataagaa taaaggggaa 24420attccttgac ttgataaagc ttacaaaaga
ctacaaaagc ttacagctaa cctatactta 24480atggtgaaaa actaaatgct ttcccctacg
atcaggaaca aagcaaggat gttcactctc 24540attgctctta tttaacatag ccctgaagtt
ctaacttgtg caaaacgata agaaagggaa 24600atgaaagacc tgcagattgg caaagaagaa
ataaaactgt tcctgtttgc agatgacatg 24660attgtctcat agaaaatgta aagcaactag
gggtaggggg gcagtggaga cacgctggtc 24720aaaggatacc aaatttcagt taggaggagt
aagttcaaga tacctattgc acaacatggt 24780aactatactt aatatattgt attcttgaaa
atactaaaag agtgggtgtt aagcgttctc 24840accacaaaaa tgataactat gtgaagtaat
gcatacgtta attagcacaa cgtatattac 24900tccaaaacat catgttgtac atgataaata
cacacaattt tatctgtcag tttaaaaaca 24960catgattttg gccaggcaca gtggctcata
cctgtaatcc cagcatttta ggaggctgag 25020gcgagcagaa aacttgaggt cgggagtttg
agaccagaat ggtcaacata gtgaaatccc 25080gtctccacta ataatacaaa aattagcagg
atgtggtggc gtgcacctgt agacccagct 25140acttgggagg ctgaggcacg agaattgctt
gaacaaggga ggcagaggtt gcagtgagct 25200gggtgccact gcattccagc ctggtgacag
agtgagactc catctcaaaa aaaataaaat 25260aaagcatgac ttttcttaaa tgcaaagcag
ccaagcgcag tggctcatgc ctgtaatccc 25320accactttgg gaggccgagg caggcagatc
acaaggtcag gagtttgaga ccagcctgac 25380caacatggtg aaaccccatc tctactaaaa
aatatataaa ttagccaggc atgtgtagtc 25440tcagctactc aggaggctga ggcaggagaa
tcacttgaac ccggaggcag aggttgcagt 25500gttgagccac cgcactccag cctgggtgag
agaacgagac tccgtctcaa aaaaaaaaag 25560caaaataacc taattttaaa aacactaaaa
ctactaagtg aattcagtaa gtctttagga 25620ttcaggatat atgatgaaca tacaaaaatc
aattgagctg gacaaaggag gattgtttta 25680ggtcagtagt ttgaggctgt aatgcacaat
gattgtgcct gtgaatagct gctgtgctcc 25740agcctgagca gcataatgag accacatctc
tatttaaaaa aaaaaaaatt gtatctctat 25800gtactagcaa taagcacatg ggtactaaaa
ttaaaaacat aataaatact gtttttaatt 25860gcctgaaaaa aatgaaatac ttacatataa
atctaacaaa atgtgcagga cttgtgtgct 25920gaaaactaca aaacgctgat aaaagaaatc
aaagaagact taaatagcgt gaaatatacc 25980atgcttatag gttggaaaac ttaatatagt
aaagatgcca attttatcca aattattaca 26040caggataaca ttattactac caaaatccca
gaaaaatttt acatagatat agacaagatc 26100atacaaaaat gtatacggaa atatgcaaag
gaactagagt agctaaaaca aatttgaaaa 26160agaaaaataa agtgggaaga atcagtctat
ccagtttcaa gacttacata gctacagtaa 26220tcaagactgt gatattgaca gagggacagc
tatagatcaa tgcaaccaaa tagagaacta 26280agaaagaagc acacacaaat atgcccaaat
gatttctgac aaaggtgtta aaacacttca 26340acgggggaag atatgtctct cattaaaggg
tgtagagtca ttgcacatct ataggcaaaa 26400agatgaacct gaacctcaca ccctacagaa
aaattaactc aaaatgactc aaggactaaa 26460cataagatat acatctataa aacatttaga
aaaaggccac gcacggtggc tcacgctcgt 26520aatcccagca ctttgggagg ccaaggcagg
tggatcacct aaggtcagga gtttgagacc 26580agccggatca acatggagaa gccccatctc
tactaaaaat acaaaattag ctggacgtgg 26640tggcacatgc ctgtaatccc agctacttgg
gaggctgagg catgagaatc gcttgaaccc 26700ggggggcaga ggttgcggtg agccaagatc
acaccattgc actccagcct gggcaacaag 26760agcaaaactc caactcaaaa aaaaaaaaaa
aaaggaaaaa tagaaaatct ttgggatgta 26820aggcgaggta aagaattctt acacttgatg
ccaaactaag atctataagg ccagtcgtgg 26880tggctcatgc ctgtaattcc agcactttgg
tcaactagat gaaaggtata tgggaattca 26940ctgtattatt ctttcaactt ttctgtaggt
ttgacatttt tttagtaaaa aattggggga 27000aagacctgac gcagtggctc acacctgtaa
tcccagcact ttgggaggcc ggggcaggtg 27060gatcacacgg tcaggagttc gagaccagcc
tggccaacat ggtgaaaccc cgtctctacc 27120aaaaatataa aaaattagcc gggtgtcatg
gtgcatgcct gtaatcccag ctactgagga 27180ggctgaggca ggagaatcac ttgaacctgg
gaggtggaag ttgcagtgag ccgagattgt 27240gccactgcac tccagccttg ggtgacagag
cgagactccg tctcaaaaga aaaaaaaaaa 27300aaagaatatc aaacgcttac tttagaaact
atttaaagga gccagaattt aattgtatta 27360gtatttagag caatttttat gctccatggc
attgttaaat agagcaacca gctaacaatt 27420agtggagttc aacagctgtt aaatttgcta
actgtttagg aagagagccc tatcaatatc 27480actgtcattt gaggctgaca ataagcacac
ccaaagctgt acctccttga ggagcaacat 27540aaggggttta accctgttag ggtgttaatg
gtttggatat ggtttgtttg gccccaccga 27600gtctcatgtt gaaatttgtt ccccagtact
ggaggtgggg ccttattgga aggtgtctga 27660gtcatggggg tggcatatcc ctcctgaatg
gtttggtgcc attcttgcag gaatgagtga 27720gttcttactc ttagttccca caacaactgg
ttattaaaaa cagcctggca ctttccccca 27780tctctcgctt cctctctcac catgtgatct
cactggttcc ccttcccttt atgcaatgag 27840tggaagcagc ctgaagccct cgccagaagc
agatagtgat gccatgcttc ttgtacagcc 27900tacaaaacca tgagcccaat aaaccttttt
tctttataaa ttatccagcc tcaggtattc 27960ctttatagca agacaaatga accaagacag
ggggaaatca acttcattaa aataatctat 28020gcagtcacta aacaaataag aacaagaggc
tccagaagtg ggaagccaat acccagagtt 28080cctacaatac agtatctgaa aagtccagtt
tccaaccaaa aaatatatat atacaggccg 28140gacatggtag cttatgtctg taatcccagc
actttgggat gctgaggcgg gcagatcacc 28200ctaggtcagg agttcgagac cagcctggcc
aatatggcaa aaccccgtct ctactaaaaa 28260tacaaaaatt agccaggcat ggtggtggat
gcctgtaatc ccagctactc gggaggctga 28320ggcagggaat cacttgaacc caggaggcag
aggttgcagt gagccgagat cacgccactg 28380aactccagcc tgggcaacaa agtgagactc
cacctcaaaa aaaaaaaaaa tatacatata 28440tatatgtgtg tgtgtgtgtg tgcgcgcgtg
tgtgtatata cacatacaca tatatacata 28500tatacagaca cacatatata tatgaagcat
gaaaagaaac aaggaagtat gaaccatact 28560ttctgtggtt atgataggat ggggtatcac
gggggaagta gacaagggaa actgcaagtg 28620agagcaaaca gttatcagat ttaacagaaa
aagactttgg agtaaccatt ataaatatgt 28680ccacagaatt aaagaaaagc gtgattaaaa
aaggaaagga aagtatcata acaatattac 28740tccaaataga gaatatcaat aaaggcatag
aaattataaa atataataca atggaaattc 28800cggagttgaa aggtagaata actaaaattt
aaaattcact agagaaggtt caacactata 28860tttgaactgg cagaagaaaa atttagtgag
acaaatatac ttcaatagac attattcaaa 28920tgaaaaataa aaagaaaaaa gaatgaagaa
aaataaacag aatctcagca aaatgtggca 28980caccattaat cacattaaca tatgcatact
gagagtaccg gaagcagatg agaaagagga 29040agaaaaaata ttcaaatgat ggccagtaac
ttcctagatt tttgttttaa agcaataacc 29100tatacaatca agaaactcaa tgaattccaa
gtaggataaa tacaaaaaga accacaaaca 29160gatacaccat ggtaaaaatg ctgtaagtca
aaaacagaga aaatattgaa agcagctaga 29220ggaaaactta taagagaacc tcacttacaa
aagaacatca cttataaaag aaccacaata 29280atagaaacag ttgacctctc atcagaaaca
atgaatgata acatatttga agtgctcaaa 29340gaaaaaaaat aaagattcct atatacgaca
aagctgtctt tcaaaaatat acatccaaaa 29400ggattgaaac cagggtcttg aagagttatt
tgtacatcca tgttcatagc agcattattc 29460acaatagcca aaaggtagaa gcaacccaag
ggtccatcga caaataaata aaatgtggta 29520tatgtataca caatggaatt tattcagtat
taaaaaggaa tgaaattctg acacatgcta 29580caacatggct aaaccttgag aacactatgc
taagtgaaat aagccagcca caaaaggaca 29640aataccatat tacttcactt gtatgaaata
cctagggtag tcaaattcag agatagaaag 29700taaaacagtg gttgccaagg gctgagggag
ggagtaacgt ggagttattg ttgaatgggt 29760acagaatttc agttttgcaa gataaaaaga
gttctggaga cagatggtgg tgagggtggt 29820acaacaatac aaatatactt tatactactg
aacagtatac ttaaaaatga ttaacatggt 29880gaaaccccgt ctctactaaa aatacaaaaa
aattagctgg gtgtggtggc gggcacctgt 29940aatcccagct acttgggagg ctgaggcagc
agaattgctt gaaaccagaa ggcggaggtt 30000gcagtgagct gagattgcgc caccgcactc
tagcctgggc aataagagca aaactccgtc 30060tcaaaaaata aaaaataaaa aaaatttaaa
aatgattaag caggaggcca ggcacggtgg 30120ctcacaccta taatgccagc actttgggag
gccgaggcag gcgatcactt gagaccagga 30180gtttgagacc agcctggcca acatggcaaa
accctgtctc tgctaaaaat acaaaaatta 30240gccaggcatg gtggcatata cttataatcc
cagctactgg tgagactgag acacgagaat 30300tgcttgaacc caggaggcag agattgcagt
gagtcgagat cgcgccactg aattccagcc 30360tgggcgacag agcaagattc tgtctcgaaa
aaacaaaaac aaaaacaaaa agcaaaacca 30420aaaaataatt aagcaggaaa cgagattgct
gctgaggagg agaaagatgt gcaggaccaa 30480ggctcatgag agcacaaaac ttttcaaaaa
atgtttaatg attaaaatgg taaattttat 30540atgtatctta ccacaaaaaa aagggctggg
gggcaggaaa tgaaggtgaa ataaagacat 30600cccagagaaa caaaagtaga gaatttgttg
ccttagaaga aacaccacag gaagttcttc 30660aggctgaaaa caagtgaccc cagagggtaa
tctgaattct cacagaaaat tgaagcatag 30720cagtaaaggt tattctgtaa ctatgacact
aacaatgcat attttttcct ttcttctctg 30780aaatgattta aaaagcaatt gcataaaata
ttatatataa agcctattgt tgaacctata 30840acatatatag aaatatactt gtaatatatt
tgcaaataac tgcacaaaag agagttggaa 30900caaagctgtt actaggctaa agaaattact
acagatagta aagtaatata acagggaact 30960taaaaataaa attttaaaaa atttaaaaat
aataattaca acaataatat ggttgggttt 31020gtaatattaa tagacataat acaaaaatac
cacaaaaagg gaagaagaca atagaactac 31080ataggaataa cattttggta tctaactaga
attaaattat aaatatgaag tatattctgg 31140taagttaaga cacacatgtt aaaccctaga
tactaaaaag taactcacat aaatacagta 31200aaaaaataaa taaaataatt aaaatgtttg
tattagtttc ctcagggtac agtaacaaac 31260taccacaaat tgagtggctt aacacaactt
aaatgtattt tctcccagtt ctggaggcta 31320aacacctgca atcaaggtga gtacagggcc
atgctccctg tgaaggctct aggaaagaat 31380cctcccttgt ctcttccagc ttccagtggt
tctcagtaac cctaagtgct ccttggcttg 31440tagctatatc attcctagca accagaaaga
agaaaataat aaagattatg gcaaaaaata 31500atgaaatcaa aaggagaaaa atggaaaaaa
ataaataaaa ccaaaagcta gttctttgaa 31560aagatcaacc aagttaacaa accttttaac
tagactgaca aaaaggaggt aagactcaaa 31620ttactagaat cagaaataaa agaggggaca
ttactaatga gggattagaa aagaatacta 31680cgaacaaatg tgtgccaaca aattagaaaa
cttagatgaa atggacaggt tcctaggaca 31740acatcaacta ccaaaattta ctcaagaaga
aagagacaat ttgaatgagc tataacaagg 31800gaagagactg aattgacaac caagaaacta
tccacaaaga aaatcccagg cccagaagat 31860ttcactgtga aattctttca aacttataaa
tataaattaa catcagttct tcacaaactc 31920ctccaaaaaa aagaacagat ctctatttac
aggcgatacg atctttagaa aatcctaagg 31980gaactactaa gacactatga taactgataa
acaagttcag caaggctgca ggatagaaaa 32040ccaatataca aaaatctatt atatttctat
acacttgcag tgaacaaccc aaaaatgaga 32100ttaagaaaat aattcaattt acaataacat
caaaaagaat aaaaacactc aaaaataaat 32160ttattcaagt aagtgcaaaa cttatactct
agaagctaca aaacactgtt aaaagaaatt 32220aaaggtttac ataaatgaaa aactatccca
tgttcatgga tcaaaagact tattactggc 32280aatgctctcc aaattgatct ataaattcaa
caaaatcctt atcaaaatcc cagatgaggc 32340tgggggtggc ggttcatgcc tgtaatccca
gcactttggg aggctgaggc acgcagatta 32400cctgaggtcg ggagctcgag atcagcctga
ccaacatgga gaaaccctat ctcttctaaa 32460aatacaaaat tagtcaggcg tggtggcaca
tgcctataat cccagctact cgggaagctg 32520aggcaggaga atcgcttgaa cccaggaggc
agaggttgca gtgagccaag atcgtgccat 32580tgcactccag cctgggcaac aagagcaaaa
ttccatctca aaaaaaaaaa aaaaaaaatc 32640ccagatgact tcactgttga aattgaaaag
attattctaa aattcacatg gaattgcaag 32700accttgagaa tagccaaaac aaacttgaaa
aacacgaaca aaatatagga tgactcactt 32760gccaattgca aatgttacga cacagcaaca
gtaatcaaga ctgtgtggta ctggcaaaag 32820acacatacat acatacatat caatggaata
taattgagag tacagaaaca agcctaaaca 32880tctatggtaa gtgcttttct atttttttct
tttttttttt cttttttgta gagatagaat 32940ctcaccatgt tgcccaggct ggtcttcaac
ttctgggctc aagcaatcct cccactgtgg 33000cctcccaaag tgctgggata actggcatga
gccaccacat ccagcccaga tgattttcaa 33060aaaagtcaac aagaccattc ttttcaacaa
ataggtctgg gatgatcaga tagtcacatg 33120aaaaaaaaaa tgaagttgga ccctccatca
cactaaagtg ctgcgattat aggcatcagc 33180caccacatcc agcccaaatg attttcaaaa
aggtcaacaa gaccattctt ttcaacaaat 33240aggtctggga taatcagata gtcacatgaa
aaaaaaaatg aagttggacc ctccatcaca 33300ccatatgcaa aaattaattc aaaaatgaat
tgatgactta aacgtaagag ttacgactgt 33360aaaactctta gaaggaaaca tacgggtaaa
tcttaaagac gttaggtttg acaaagaatt 33420cttagacatg acaccaaaag catgaccaac
taaggtaaaa tagggtaaat tgtacctacc 33480aaaatgaaaa acctttgtgc tggaaaggac
accatcaaga aatggaaagc caaaatagcc 33540aaggcaatat taagcaaaaa gaacaaagct
ggaggcatca tactacctga cttcaaagca 33600acagtaacca aaacagcatg gtactagtag
aaaaacagac acatagacca atggaacaga 33660ataaagaacc caaaaataaa tccacatatt
tatagtcaac tgatttttga caatgacacc 33720ccttcaataa atgatactag gaaaactgga
tatcgatatg cagaagaata aaactagacc 33780cctatctctc accatataga aaaatcaact
cagactgaat taaagacttg aatgtaagac 33840ccaaaactat aaaactactg gtagaaaaca
taaggaaaaa cgcttcagga cattggtcca 33900ggcaaagatc ttatggctaa aacctcaaaa
acacaggcaa caaaaacaaa aatggaaaaa 33960tagcacttta ttaaactaaa aagctcctgc
acagcaaagg aaacaacaga atgaaaagac 34020aacctgtaga atgggagaaa atatttgcaa
actatccatc catcaaggga ctagtatcca 34080gaacacacaa gtgactaaaa caactcaaca
gcaaaaaagc aaataatctg gtttttatat 34140gggcaaaaga tctgaataaa cattctcaaa
ggaagacata caaatgtcac tatcattctg 34200ccagtaccac actgtcttga ttacttgtta
gtgtataaat ttttaaattg ggaagtgtga 34260gtcatcctac actttgttct tgtttttcaa
gtttgttttg gctattctgg gagccttgca 34320agtataaaat agccaacaag tatgaaaaaa
tgctcaccat cactaatcat cagagaaata 34380aaaatcaaga ccactatgag atatcctctc
actccagtta gaatggctac tatcaaaaag 34440acaaaatata atggatgctg gcaaagattt
ggagaaaggg gaactcctat acactgtggg 34500tagggatgca aattggtaat ggccattatg
gaaaataata ctgaggtttt tcaaaaaact 34560gaaaatagaa ctaccatatg atccagcaac
cctactactg ggtatttatc caaaggaaag 34620aagtcagtat actgaagaaa tatatgcact
ctcatgttaa ttgcaacact gttcacaaca 34680gccaagacag ggaataaatc taaatgtgca
tcaacagatg aatggataaa gaaaatgtgg 34740catatacact caatagaata ctattcagcc
attaaagaag aatgaaatcc tgtcatccca 34800gcaacatgga tgaacctgga ggacattata
tttaatgaaa taagtaaagc acaaaaagat 34860aaacagtaca tgttctcact cagacatggg
tgctaaaaag aaaatggggt cacagaatta 34920gaaggggagg cttgggaaaa gttaatggat
aaaaatttac agctatgtaa gaagaataag 34980ttttagtgtt ctatagaact gtagggcgag
tatagttacc aataacttat tgtacatgtt 35040caaaaagcta gaagagattt tggatgttcc
cagcacaaag gaatgataaa tgtttgtgat 35100gatggatatc ctaattaccc tgattcaatc
attacacatt gcatacatgt atcaaattat 35160cactctgtac ctcataaata tgtataatta
ttacgtcaac aaaaaaagga aaaaaaagaa 35220aattaagaca acccacataa tggaagaaat
aaaatatctg caaattatat atatctgata 35280aatatttaat atttataata tataaagaac
tcctacaact caagaacaac aacaaaacaa 35340cccaattcaa aaatgggtaa aagccttgaa
tatacactta tctaaagact atatacaatt 35400ggccaataaa gacacgaaaa gatgctcaac
atcactagtc atcagggaaa tataaatcaa 35460aaccacaatg tagaatgtag acaccacttc
atatgcacta ggatggctag aataaaaagg 35520taataacaaa tgttggtaag gatgtgaaaa
aatcagaaac ctcattcgct gctgttggga 35580atgtaaagtg atgcagccac tttggaaaac
agtctggcag ctcctcaaat tattaaatac 35640agagttaccg tatgacccag gaatattcct
cctgggtcta taaccaaaaa aatgaaaaca 35700tatatccaca taaaaacttg tacatgggca
tttatagcaa cattattcat aacagcaaag 35760gtggtaagaa cccatatgcc catcatctga
tgaacaggta aataacatgc ggtattatcc 35820atacactaga atattatctg cccatacaag
gagtgacatc cagctacatg ctacaaggat 35880gaatctcgga aaccttatgc taagtgaaag
aagccagtca caaatgacca cagattatga 35940ttccatgcat cggaaatgac cagaataggg
aaatctatag agacagaaag tagattagtg 36000gttgggtggg gctgggagga caggtagtac
actactttcc cagaactact ggaacaaagt 36060accacaaact ggggagctta aacatagaaa
ttgatttcct cacagttctg gagactagga 36120ctctgagatc aaggtgtcag cagagctggt
tctttctgag ggccctgagg caaggctctg 36180tcccaggcct ctctccttgg ctggcaggtg
gccatcttct ccctgcgtct tcacatcatc 36240ttttctctgt gtgtgcccat gtccaaattt
tgattggctc attctgggtc atggccaatt 36300gctatgcaca aagtgaagtc tacttccaaa
agaagggaag agggaacact gactaggcta 36360aacttatagt cattttaatg tccgcttttc
ctatgagatt gtgaacacac agaagtaggg 36420tttttatcta cattgtgcaa agtttaataa
gaaaaataga attcaagaga agcagttcaa 36480tagcaggaat ttaatatggg aactaattac
aaggtttagg gcaggactaa aaagccagtt 36540gggatggtga gccaacccag agattagcaa
cagtgggacc ccatctacct accacccatg 36600aagctggaag gataaaggag gggctattat
cagagtccac aagccagtgt cagagtcctt 36660ggctggagct gggaccaccc tagagacact
gtgcaaagca gaaaacaagg gggaaaaacc 36720ctgacttctc ccttcctccc acctttcaat
ctcccactag tgcttcctac tagccatact 36780tggccagaga cagtgacaag gaacactgca
aaatgaagtt tgtaggaatc atctccctct 36840gagacagaga aatatggaag ggtagaaaat
gaatcagagg ataaagagaa aaaaccctga 36900gtactatctt atttatcttt gtatctccag
tgcctaatct gtctctcaaa aaaggaaagc 36960aattgagaga aactgaaaac tccaattgaa
atgaaagaat ggagaattac tggactagaa 37020gagaagagaa aaatttattc cgcatagagt
aaacaagaat ggattcacaa aggacgtgat 37080gaatgaaaag ctataatcag caaagatttg
ccagagaaat taaaaagtgg taaactcagc 37140cacgctgtac aacctgaagg cacaatgcat
gaaaacgttt caagaaatga caagatttga 37200agtcaaattc taagtgcttt tccagaatct
ctcaagacga ttatatagct accccatttt 37260attaaataaa atggaaactt actaaacttt
ccccttgtat taaactaaca tatgtcctaa 37320tagcaaacga ttctggaatt cctagagtaa
aatatatttc gtcaaagtgt attgctcttt 37380taatattctg ctgacctcct tttgctattt
aggatatttg tatacacatc acacgtaaat 37440ttggtctata gtttacatct acgggcttat
actgttcttt ttttcatttt tttaaaattt 37500ccaaccccca gtatccatat actgctctct
atcagggtta ttttaacttt gtaaaatcag 37560ctgagatgct ttccatgttt ttttttttta
ttttctgcca catttgaata gcataggagt 37620taccaccatc aaccttggat tatttaagca
ttcacgattc cacgtgtgga ttttttattc 37680agagtctttc ttgtcattcc tgctatcagc
acagaaccca atctcagctt tccagctata 37740ctctcacccc atggaatttg cagatgaagt
tcaaaaggac ctttgcatta tcctgcctcg 37800ccctcttccc ccttcattta gacatcacct
tcttctagaa cgtcttacct gacatgccct 37860gctcccaacc cctgctgccc aattgtgtgc
tctcccgtgt cctggcctgc catcctcttt 37920agtaattgcc tgctccctca tctgtctccc
cacccagaca ttaagctgaa tagactggat 37980ttgtgtcttg tccatcacta taatctcagc
acctagtacc tagtaggtac ttaccatgta 38040ttcattagca aaatgttatg tataaccttg
caccttaaaa acaagagaag gaagacaaaa 38100ttaagtctta agactatggt ttagaacatg
gatcagaaac tacagtctgc agcccaaatc 38160cagaccaaat gaagagacca tgttcattta
catacaacct atagcagctt tcacactaca 38220ggagcagagc taagtagttc caagggaaca
cacggccctg caaagcctaa aatatttact 38280ctatagctct tcacagaaaa agttttcaga
tccctcgttt agaactcttg ttcatatgca 38340atttcactaa accatagttt tttgggtttg
tttggttttt tttggcaaaa aggaatgagc 38400cgatccagaa aaggttgaaa agaatgaatc
attactgctg aaagaatgtg cacacagtcc 38460gtcagtattc tgctgccatg ctgacaccca
tccaatagtg tcatgagatg cagcagctac 38520tactgtgttc tcaatgccga gtccacccac
tccataacca tgtccaagca atcttgggaa 38580catcatcacc atgcttgttt atccttaagg
tattgcctca catacagcag tggctggtca 38640taaagtcaaa tgacactagt ggccaggagg
tcaagagaat gagtgaggac aggtgggtag 38700gcagcccagg ccctagcaac agcaggagct
cacccctcag tcactctagc caggactgaa 38760atacttttca ccctttcaag agagactagg
aatctggatt tttatgtgaa atatcttgat 38820tactaaatgt tgtcaacaga catgtcaaaa
ggtaaaacta agtaagttca tggggcagat 38880tgactattca ggttatagaa ttaaggattc
ttatccaaca cagataccaa ccaaaaagct 38940gacgtataac atattaggag aaactatgtg
cactgtcgaa acatcaacaa ggggctaatg 39000tctaaaatag tctatattgg attccagttg
aaacatgggg aaaggacatg aacaggcaac 39060ttatgtcaat ggaaactcaa aaagataaca
agcatatata aaagcattct caaattcagt 39120agtaaacaga cagatgcaaa taaaaagagg
gaaactgctg ccgggcacag tggctcacac 39180ctgtaatccc agcactttgg gaggccgagg
cgggcggatc atgaagtcag gagatcgaga 39240ccatcctggc taacatggtg aaaccccgtc
tctactgaaa acacaaaaaa ttagccaggc 39300gtagtggtgg gcaccagtag tcccagctac
tcaggaggtt gaggcaggag aatggcatga 39360acccaggagg cggagattgc agtgagccga
gaccatgcca ctgcactcca gcctgggcga 39420ctgagtgaaa ctccatctca aaaaatataa
taataattat aattataata ataataaata 39480gtaaataaat aaaaagagag agactgctaa
agtctagaaa gttgaatgat gccaagcgca 39540tgcaaagatc agggccttgg gatggccggg
tgcagtggct cacgcctgta atcccaccac 39600tttgggaggc caaggcgggc ggatcatgag
gtcaagagat caagaccatc ctggccgaca 39660cagtgaaacc cggtctctac taaaagtaca
aaaaaatata tatatatata tatattatta 39720tattatatat atatatatca gagccttggg
aatccttgtg tgctgctggg gaaggtagtg 39780gtgcagccac ccttgacagc aatctggcag
tacttggtta tattaagtat aggcacacac 39840cacgaccagg cagtcctact cctgggtcta
aatcccaaag aattctcaca caagtccata 39900aggagacatg tacgaggctc attcagcatt
actgggagtg ggaatcaacc tgggtgtcca 39960tctacaggag acgagatgga caaaatgtgg
tggatattaa gaccagaatc accaagtaac 40020agagatgggt ggtgagtgac aatcctaaga
tacagaataa aggctagaac atgatgccat 40080tcatgtaaat taaaaataga tgcacacaaa
gcagtatacg cgtgaccctt gaatagcaca 40140ggtttgaact gcctgtgtcc acttacatgt
ggattttctt ccacttctgc tacccccaag 40200acagcaagac caacccctct tcttcctcct
ccccctcagc ctactcaaca tgaagatgac 40260aaggatgaag acttttatga taatccaatt
ccaaggaact aatgaaaagt atattttctc 40320ttccttatga ttttctttat ctctagctta
cattattcta agaatatggt acataataca 40380catcacacgc aaaataaatg ttaattgact
gtttatatta tgggtaaggc ttccactcaa 40440cagtaggctg tcagtagtta agttttggga
gtcaaaagtt atacacagat tttcaactgt 40500gcaggcaatc agttcccctg accccctcat
tgttcacggg tcaactgtat atacacaaaa 40560gtattatatg aacctcatta gaatagctgt
ctatagggag aagagaatga gagtgggata 40620aaacggaatg aacaaataaa ccaacaaatg
cattaacaag caaaacaaca gaggggcttg 40680catgggccag tgatgataaa gggctaagaa
tgagaatata attaattcaa ttcctcacac 40740ctgaggtcta aaaccaagga aagggagggc
caggcgtgga ggctcacgcc tgtaatccca 40800gcactttggg aggctgaggc gggcggatca
caagattagg agtttgagat cagcctggcc 40860aacacagtga aagcccatct ctacaaaaaa
tacaagaatt acccaggtgt ggtggcacat 40920gcctgtagtt agctactctg gaggctgagg
caggagaatc acttgaaccc aggaggcgga 40980ggttgcaggg agccgagatc acaccattgc
actccagcct gggtgacaga gtaagactct 41040gtctcaaaaa aataaaaaaa ataaaaaaac
agagaaaggg aggaaactag atccaggctg 41100actagataca gcctttagag ttagaaaaga
tgatttgaca atctaagccc acactcagat 41160tgaatgaaat tgaaaagcct ttcaaactaa
aacatttaat tacaccatct gctgcagaca 41220gaactcagac aactcaaaca ggtaatgtca
gcgtggtgtt ttatatcacc accctcaaca 41280cagaataaaa atcagctgca tgtgaagcag
tgactagaat gaagaaaagg ctgcttctta 41340cttccttcta gtggttcttt ccgaaaacat
taataggcac cagctctatg catgtcaccc 41400tgcagggaga catggggtat ataactatga
cttactgttc attcctcaag gaattcccaa 41460tcttgtggaa gattatacac aatgaggcaa
caaaaactat ccaataaaac cacggaaaag 41520aagccagtga caaagaagcc agtgatgaaa
ggccctgtga gcagagctga tggccatttg 41580gggaagaaag accaacatgg atgggggtga
tcagggtggc tccgtgggaa agctggaaga 41640gaagtggcag atctctgagc tggatgatgg
gccactacca tctgtatatg gctaattaaa 41700gaccatgtgt ggatttttta ttcagctctt
tcgtgtcatt cctgctatca gcacagaacc 41760caatctcaac tttccagcta tattgagcta
aacttctcac ctcatggaat ttgcagataa 41820agttcaaaag gatccttgcc ttttcaaaat
aattttgaat ggttgagtag tccctctgtg 41880ctctctcact gacaccctct caaggctgct
gagcacgtgc catgctatgg ctttctccaa 41940catcaggaaa tgttctccac tcagtttcac
cttaatacaa atgtgttctc tcttcagaga 42000aggcaaaaaa attcatgacc atctgactgg
gagaagtcat ttctaggtaa agtgtccatc 42060tttttctgag gaacacagga ggaaaatctt
acagaaaaga gttaacacag caggcctaag 42120actgcttttt aaaataaata aataaataaa
taaataaata aataaataaa taaataaata 42180aataaatgaa tgatagggtc ttctgtattg
gccaggctag tctcaaattc ctggcttcaa 42240gagatcctcc caccttggtc tcccacagtg
ttgggattat agacatgagc cattgtgctt 42300ggcccaagac tgttattctt aaaaagtctc
ataaaaagca tggttaatcc ttggctggca 42360cctgggaact tagatttcag aagggttccc
accatccaac ctggaaagag ggactcactg 42420tgcctaaatt attgtgtggt ttatgctgaa
ctcctgcttt tcttcaggta gcgtggaatg 42480tggtatgtgc tgggcaaagg gggcctgcat
gaccagcccc caataaaaac cctgggtgtt 42540gggtctctag tgagtttccc tggtagacag
catttcacat gcgttgtcac agctccttcc 42600tcggggagtt aagcacatac atcctgtgtg
actgcactgg gagaggatgc ttggaagctt 42660gtgcctggct tcctttggac ttggccccat
gcacctttcc ctttgctgat tgtgctttgt 42720atcctttcac tgtaataaat tacagccgtg
agtacaccac atgctgagtc ttccaagtga 42780accaccagat ctgagcatgg tcctgggggc
ccccaacaca gaaataaatt ataaaagacc 42840aaggactggg catggtggcc catgccggta
atctcagcgc tttgggaggc cgaggcagga 42900ggaccagtta agcccaaaag ttcaaagtta
cagtgaccta tgactgcgcc aatgcactct 42960aacctgggag acagagcaag accctgtccc
caaaacaata aactaaacac atacttctgc 43020cttccaagtg tcttaaaatt caatggaatg
gtagaaacat ttttaaaaca ctaaatcaaa 43080agaaacctgg aaaacaagag tgccgatggc
caactaaaat gtctaggaaa tttctgaaaa 43140gtaaaaagta ctcagaacca gattacctga
gcaaaccata gcccaataca agcttgggag 43200gaggctgtta tgcagaagga aatggtaaca
ggtttccagg aacagacttg taacagcaga 43260tagaacagca gaggtagaac ctgacaaggt
gattacctgg ggaactgcag tctgaatgac 43320caggactgtt ggacccttcc cctcacatgg
aatacacacg ccactcagca gcacaccaca 43380gctcttcaac aatcacagga ggcacgctac
gcctagtaag acaggaaaaa aggaattctc 43440aaacttcgaa gatgaacaca taaagaatca
ccaagttttt attcagtatg atgaaacagg 43500gacactgaat caacagaaca caaacccaag
caaagataat tactagagca catagaagaa 43560attattagat attcttggga agacctaagg
ggacattata aagagcaagc agttggtatg 43620tgacgatctt tgtgatatac caagaaataa
aaacacagga tgaagaccag atagagaata 43680atgctactat ttgtgcaaaa aaggagaaat
ggagaatctg attcatattt gcttgtattt 43740gcatgaagaa actttggaag gtacataagt
aactaacaac aatggttacc tacttgtaag 43800gcgagagaag taagaggaca ggaatggtgg
gaacaccttt tgtgtccgga attggtgggt 43860tcttggtctg acttggagaa tgaagccgtg
gaccctcgcg gtgagcgtaa cagttcttaa 43920aggcggtgtg tctggagttt gttccttctg
atgtttggat gtgttcggag tttcttcctt 43980ctggtgggtt cgtagtctcg ctgactcagg
agtgaagctg cagaccttcg cggcgagtgt 44040tacagctctt aagggggcgc atctagagtt
gttcgttcct cctggtgagt tcgtggtctc 44100gctagcttca ggagtgaagc tgcagacctt
cgaggtgtgt gttgcagctc atatagacag 44160tgcagaccca aagagtgagc agtaataaga
acgcattcca aacatcaaaa ggacaaacct 44220tcagcagcgc ggaatgcgac cgcagcacgt
taccactctt ggctcgggca gcctgctttt 44280attctcttat ctggccacac ccatatcctg
ctgattggtc cattttacag agagccgact 44340gctccatttt acagagaacc gattggtcca
tttttcagag agctgattgg tccattttga 44400cagagtgctg attggtgcgt ttacaatccc
tgagctagac acagggtgct gactggtgta 44460tttacaatcc cttagctaga cataaaggtt
ctcaagtccc caccagactc aggagcccag 44520ctggcttcac ccagtggatc cggcatcagt
gccacaggtg gagctgcctg ccagtcccgc 44580gccctgcgcc cgcactcctc agccctctgg
tggtcgatgg gactgggcgc cgtggagcag 44640ggggtggtgc tgtcagggag gctcgggccg
cacaggagcc caggaggtgg gggtggctca 44700ggcatggcgg gccgcaggtc atgagcgctg
ccccgcaggg aggcagctaa ggcccagcga 44760gaaatcgggc acagcagctg ctggcccagg
tgctaagccc ctcactgcct ggggccgttg 44820gggccggctg gccggccgct cccagtgcgg
ggcccgccaa gcccacgccc accgggaact 44880cacgctggcc cgcaagcacc gcgtacagcc
ccggttcccg cccgcgcctc tccctccaca 44940cctccctgca aagctgaggg agctggctcc
agccttggcc agcccagaaa ggggctccca 45000cagtgcagcg gtgggctgaa gggctcctca
agcgcggcca gagtgggcac taaggctgag 45060gaggcaccga gagcgagcga ggactgccag
cacgctgtca cctctcactt tcatttatgc 45120ctttttaata cagtctggtt ttgaacactg
attatcttac ctattttttt tttttttttt 45180tgagatggag tcgctctctg tcgcccagac
tggagtgcag tggtgccatc ctggctcact 45240gcaagctccg cctcccgggt tcacaccatt
ctcctgcctc aacctcctga gtagctggga 45300ctacaggcaa tcgccaccac gcccagctaa
ttttttattt tatttttttt ttagtagaag 45360cggagtttca ccatgttagc cagatggtct
caatctcctg acctcgtgat ccatccgcct 45420cggcctccca aagtgctggg attacagacg
tgagccactg cgccctgcct atcttaccta 45480tttcaaaagt taaactttaa gaagtagaaa
cccgtggcca ggcgtggtgg ctcacgcctg 45540taaccccagc actttgggag gccgaggcgg
gcggatcacg aggtcaggag atcgagatca 45600tcctggttaa cacagtgaaa ccccgtcgct
actaaaaata caaaaaatta gccgggcgtg 45660gtggtgggca ccggcagtcc tcgctactgg
ggaggctgag gcaggagaat ggcgtgaacc 45720tgggaggcag agcttgcagt gagccgagat
agtgccattg ccttccagcc tgggcgacag 45780agcgagactc cacctcaaaa aaaaaaaaaa
aaaatagaga cccggaaagt taaaaatatg 45840ataatcaata tttaaaaaca ctcaagagat
gggctaaaga gttgacggaa caaatctaaa 45900tattagattg gtgacctgca aaaccagccc
aaggaacatc ccagaatgca gcccataaag 45960ataaagagag catttccgct gggcacagtg
gtatggcagg ggaattgcct gagtccaaga 46020gttgcaggtc acattgaacc acaccattgc
actccaggcc tgggcaacac agcaatactc 46080tgtctcaaaa aaaaaaaaaa ttaaattaaa
aaagacagaa tatttgagag aaaaaaatgc 46140ttatttcaag aaacatgaaa gataaatcaa
gatattctaa ttcccaagta agaataattc 46200cagaagcaga aaatagaata gaggcaagga
aacactcaaa acttctccag tgccatagaa 46260atgtgtatta atctttagaa tgaaacggac
taccaaatgc tgagcaggaa gaacaaaaga 46320gatccactct taagccagtg tggtgcccaa
gcgcagtggc tcatgcctgt aatcccagca 46380ctttgggagg ccgaggcagg tggatcacct
gaggtcagga gtttgagatc agtcaggcca 46440acatggtgaa accctgtctg tactaaaaat
acaaacatta gctgggtatg gtggtgcaca 46500tctgtaatcc caactacttg ggaggctaag
gcaggagaat cacttgaaac caggaggtgg 46560aggttgtagt gagccgagat catgccacac
tcccagcctg ggtgacagag caagattcca 46620tctcaaaaaa aaaatccact cctagacaaa
taatagttaa attttagaac accaaggaga 46680aagaaaaaaa attgtaaagc ttcagagaaa
ataaacatta actacaaaga aacgagagtc 46740agacgcgtgc acttcttcct agataccagc
agataaagca atatctccaa aattcagaag 46800gttttaacgt agaatcctat acccagtcaa
gaatattcac atggaaaagt gaaataaaaa 46860acattgttta aacatgcaag ggttcagaaa
gtttaccatt cacagaatcc ctgaaaacaa 46920aaccaaataa tcacttaagg actcattaag
aaaacaaatg aaataaaagc accaatgatg 46980agtaaataat cagaaaaatt tacagtttac
ctaaataact gtttatgcat aatgtatgaa 47040aacccaaaaa tttaatatgg gacagaatta
aaatcatgat aagattcttt tttgctttac 47100tcatggagag ttcacataaa cagattatct
tttaatagca agagaaaaaa atgtttagat 47160atgtgtgaaa aactaagggt accaaaacag
tgcaaattca tttatcatca ggaaaatcca 47220aattaaaacc acagtatcca ccagaataac
taaaaggtaa aagacagaaa ttaccaagag 47280ttggcaagaa tgtggagcaa ccacatatac
ttctggggta aataagttgg tgcaaccggt 47340actgaaaact gtttgctagt atctactaaa
accgagcaca tgcacagact acaaccaagc 47400agttccactc ccagatacac actcaacaga
aatgcacaca ctcactcaac aaaagacgtg 47460tactagagtg ttcatgtact tactattcat
aatagtccaa aaatgcaaac aaccaactgc 47520caatcaaagt caaatgtata tctatattag
ggatatatac aatggcatat acacagcaat 47580gagaatgaaa tgaaccagct cggcacagtg
gttcatgcct gtaatctcag cactttgggc 47640gggtaaggca ggcagatcac ttgaggtcag
aaatttgaga ctagcctggc caacacggtt 47700aaaacctgtc cccactaaaa acacaaaaat
tagccgggca tagtggttgc aggcctgtaa 47760ttccagctac tcgggaggct gggttgggag
aatcgtttga acccgaaagc cggaggtcgc 47820agtgagcgga gatcgtgcca ctgcactcca
gcctggacga tagagcaaga ctccgtctca 47880aaaaaggaaa tcaaaaatat aaaataagat
gacaggaata atccgcaaaa gatcagtaat 47940caaaataaat ataaatgggc taaagctacc
tattaaaaga caaagatttc acacccataa 48000ggatagctac tatcaaaaaa agagagagaa
taacagatgt tagcaaggat gtatggaaac 48060tgaaattctc acgcattgct ggtgagaata
taaaatggtt cagcctctgc ggaaaacact 48120atgctgggtc atcaaaaaat taaaaataga
agtactactt gatccaacaa ttctacttct 48180gggtatatac ccaaataact gaaagcaggg
tcttgaagag atatttgtac acccatgatc 48240atggcagcat tattcataat agctatgatg
tggaaccaac ataaatatcc tttgataaat 48300atatggataa gcaaaatgtg gtgtatacat
tcaatggaat attaattagc aataaaaatg 48360aagaaaattc tgacacatgc tacaacatgg
atgaaccttg agggcattac attaaatgaa 48420ataagccagt tataaaaaga caaatactat
atgaggtact atattagata ctcatgcaag 48480gtacctaaaa taggcaaatt catagagaca
aaaagcagaa tggtggttgc caggggctgc 48540ggtaatggat acagagcttc aattttgtaa
gatgaaaaaa ttctggagat tggttgcata 48600acaatgtgca cacacttaac actggggaac
tgtaaactta aaagtagtaa atggtaaaaa 48660taaaaataat aaataataaa ttttatgtta
ttttaccaca atatttatta aaagacaaag 48720attaactaat taaacaaaat ccagccataa
gctaatggta agagtaacaa ttaaagaaga 48780cacagaaaat tgaaaatcag tgactagaaa
aagatattcc atataaatgc taacaaaaag 48840caagtacagc aatataaaga gaatgaacaa
aaaaaaaatt aaataagatg gctcgtttat 48900tcccaaaagg tacaattcac caagaagata
caagaattgt gaacctttaa gcacataaaa 48960cagcttcaaa aatacaacat ttaaagaaaa
atatatatta aacatagaaa tagtacaaaa 49020acccctacaa gaatcataat gggagtcttc
aatacaactc tccatatcaa caggtcaaac 49080agagaaaaaa aataagttaa ggatgcagaa
aacctgaatt accatcaata aacttgagat 49140taatatagaa ctgtataccc aatatactaa
gagttcaggg aacagtcgtg actgacagtg 49200gactgcaaat taatctgttc ttaatctttg
tttttctttc agcactgtgg cagaatagag 49260atcctaaaaa ccttccagct acaaaacatc
tttttaaaaa tataaaaaaa tacaaaaata 49320actctgaaat caatagaaga cacatggtga
aaccaaaatt ctagaataca gggagaataa 49380aggcattttc agatattaca aaaacagaaa
attgatcatt gctgaagtaa tttctaaaga 49440atgtacttga gggagaagaa aaatgttcca
aagaaaagta tctgtgatac aagaaggaat 49500ggaaagtgaa gaaatggtaa acaggtagat
aaagctaata aatgttgacc tagaaaataa 49560caaaaacaat agcaataatg tctcgttgga
agggttgaag taaaaataca attaaggcca 49620aatgtgaggt aagtggaatg aaagaattag
aagtccttgc cttgttcaca ggactgatta 49680aataaatgag ccaggttttc cattcaaaca
gttaaaactt gaacaaaata aactcaaatt 49740aagtagaaag ataaaaaaca gaaattaatg
tcatagaaaa ataaaaaatc aatagaatta 49800atcaataaat cctggttaat aaaagctggt
tctttgaaag gattaataaa ataatcatta 49860agcaagtctg atcaaaaaaa aagagaaaag
gtaccaaaaa aagtactgta tcagaaagag 49920aacatacaga tacatacaga tatgtaagag
tctgttttct tacaccagaa tactatatac 49980aacattatgc tagcatatat taaatttcaa
taatgttaat gattttctag gaaaacagaa 50040aatattaaat ttactttgaa gaaacagaaa
aactgagaaa aataaatgat catgaaaaaa 50100atgaaaaggt aattaaatac tgatattaac
tgcctaaaca acaccagcag cagcccaggc 50160agtctgcagt caagttctgc caaacttgag
ggaacagata attcttctat tccagagcat 50220agaaaatgat ggaaagtttc ccaatttaat
cagagaggac agcctgatcc ttgttatgaa 50280cacagataaa aatggggtaa actatatgcc
aaactcagat accaaaaccc taaataagat 50340gctagcttat tgatgtgaac aatccaaaag
tgcattttaa attagcccag ggttttagag 50400aaagaaaatc tagcaatgtg accaccactt
atgttaacaa ttttaagacg aaaatctaca 50460tgatcatatc aatgcatgct acacaaaagc
atttgggcaa aaaacccaac acccaccctt 50520gactttttaa actcttagta attaggcata
aacagaaatg tacttaatgt gatagaatac 50580actcggtgaa gatacagagg gaatgctccc
taaaaccaag cccaagacaa agattcctat 50640ttaacctcaa tagtcaacac tgcagcgaga
gtaatctatg gaagacaagg aaaaaagtaa 50700aaacatgaga gacatctgtt gtttaacaga
caataagatc acctacttgg aagaggcaaa 50760cgaatcaagc gaaaaactat taaaactgag
acaggcttta gtatggaggc tcagcttcag 50820ctgtagtttg ggctaccaaa ttcaactcgc
ttgcttggag agttaatcct gcaaagctaa 50880tttctgttga ggtattagga ttgacaagcc
tgtgctcctc cctcctcccc catcttcaac 50940actgaaataa cacggtgttt ggaactggat
aacagaatct tccaaaaaca aaaattgtcc 51000tgaagggctg acttgtgccc ttactcaaaa
aacactttat ctgctgcctg cagctcctac 51060agttgctggt ggataagcct gccaaccagc
tcggcgtaat tcttcctgca gagggcaagg 51120aagagcactt tcacaggaaa atttttttcc
gaactgtatg ccgcttatta cataaactta 51180cgtgctggca aatggagctc cagcaaaata
agatattcag agtcaaactt ccttaggaaa 51240aaaaaaaaaa aaaagcaagc acataacact
aatttccttg catgggcact ggggaaggag 51300gtcgttactt ccgcacgccc gcaggtccgc
accaccggga aacccacggg caccgcgcgc 51360tgcccccggg ccttccaggt gcactgcgcc
gcggcgcccc agctgacccg ggatgcgcag 51420ccctagccct tcccctgtca ccccggccag
gaaggggcgg gagcgcggcg gacgccgagg 51480gcgaagggct tctcggtcct ctgcaccacg
cagcaccccc aaggcacaac agggagggtg 51540cgggaggctc ccgagaccca ggagccgggg
ccgggcgtgc ccgcgcacct gtcccactgc 51600ggcgagggct ggggtcgcct ccagggccgc
agctgtcggg agccacctgg ctctcagtcc 51660cgggtccctg cgacaaccct cgggcccgga
ggggaggagg cggccacctg ccgctgccac 51720ctgcggcacc ggtcccaccg ctccgggccg
ggcaggacag gccaggacgt ccctcctggg 51780ctggggacag gacacgcgac gaggggaccg
gggcccccgc ggcgaagacg cagcacgcct 51840tcccagaaag gcagtcccgt gcccccacga
cggactgccg gacccccgcg ctcgcccgcc 51900catcccttca gaccacgcgg ctgaggcgca
aagagccggc cggcgggcgg gctggcggcg 51960cggctagtac tcaccggccc cgctggctca
gcgccgccgc aacccccagc ggccacggct 52020ccgggcgctc actgatgctc aggagaggga
cccgcgctcc gccggcgcct ccagccatcg 52080ccgccagggg gcgagcgcga gccgcgcggg
gctcgctggg agatgtagta cccggaccgc 52140cgcctgcgcc gtcctccttc agccggcggc
cgggggcccc ctctctccca gctctcagtg 52200tctcatctcc ctatctgctc atcctctggt
cgcacataat cgatgtttgg gcgtcccaag 52260ccagatgtgg accccatttc cgcactctac
actggaggtt ttctaagggt ggtgcccgga 52320ccagcagctt cagcctcatc tgggaacttg
agaaaatgca gattctccgt cccacccagc 52380ctattcggtt tttcctgcac taaaaccatg
aaggtggggc ccagcagtcc acattctcgc 52440aagcccgtca agtgattctg aggcgccctc
cagtttgaga gctatgctca cggcctcacc 52500tccgccccgc aaggagcccg gtcttgcctg
tggcgctagc cgcacacgga cacctcatcc 52560tgcggggccc gcccccccgc tgcaccctca
ccgcccaacg cctcctccgg gatgcagcgg 52620aggcgcctgg aagtcggcaa ggtcaacatc
cccctcagca tcttccctac cctcacggct 52680cctcctccag gggtgcctca tggccagggg
ttagaaagag ccactgtgtt tcttgacatg 52740gaagtggcct aagaccttaa tgaaaactgc
aggagtggaa tgacagaacc tttggtcata 52800cttgagggcg tgaagctcaa atgaggagga
aggaaaggat ccagggagaa taaccaaccc 52860tggcaagttg tggcgcccag gtagaggggc
gagcctaggc tagcggttct cgaccagggc 52920cggtgttgcc cctcctcgcc gccccgcgta
catttgggga ggtctggaga catttttggt 52980tgtcatgatg cgggagttgc tactgttgcc
taagtgggta gacacgaggg tgctcctcaa 53040catcctacct gaaggacagg actgccccac
aaggaagaat gatccggccc caaataagaa 53100accctgggct ggtcagcaac aacccctttg
ttctgagaag agaggaggaa agaataaaag 53160aagtggggtg aagttttggt ttggtagagg
aaacttgaag acattttcac tggaaaggaa 53220gagaggaaga ggagggagat gtctgtaagg
acgagcaaac cgggtgacag ctgatttcct 53280catattgaag taatgagtcc tagttataat
aaattcctaa taaaaaccca gtttatccct 53340gcaataaact tgtctttttt ttttaaatat
actgcttgat tctgtttgct aatattttat 53400ttacaggctt tgcattgata tgcaaaaatg
agatgggcaa taattttctt tttgaatgtc 53460taatgttgtt tggtttcaga atcaatgtta
tgctcacatc ataaaaaatt tggaaccgag 53520gcaggaggag tgcttgaggc cagaagttcg
agaccagtct aggaaacaca gtgagacccc 53580cccatctcta caaaaaaaaa aaaagaaaaa
aaaatgggca tgtttgcttt ttccttttac 53640tctgaacaat ttaaggagca ttaaaattat
ctattctttg aggtttgatc atttcccagt 53700taaaaatgtt cctcccagcc tgatgctttc
tttggggagg gtaaatcttt taaggctaga 53760aaagtttctt ctgtggcaat tttattattt
acattttaaa aattattcta gagttaattt 53820tgataaagca tgtatttctt aaaacaaatt
atcctttttt tccagatgtt caagtgtatt 53880tgcataaagt tgaggaaagt agtcttttgt
gaatctttta acttctccca aatatcttat 53940tttgtgtatt tttgcttctt tattttgtta
acttttaaaa gtgtattttt ttttcaaaga 54000atcagctctt aggtttatgt ttttggttat
actggagctt ttttcttctt ctttttaaaa 54060tattttttct cctttatttt ttagacgtat
tttgatctaa cgtaatcgga agaaggtaaa 54120ttagaatctt ttgttactat tgtgttttta
tttctcctta tttctctgaa gtcctgcttt 54180ataaatagta ccatgttatt tgtgcataaa
tattcatttg tcttatattc ttgggaattt 54240tcccacttca tcataaaatg accttccttg
tctcatttaa tgtgttcaaa ctttgccctg 54300aatttaactt tgtctgatat tttaccatcc
tgctgaattt tgtttgttac cccaaacaac 54360ctttgctgtt ttcgtctttt ctgaaccctt
tattttaggt aatcccttga attagagcac 54420taagttttgc tttgtgatta aatctgaaaa
tctttatctt gccatagatg agttgagccc 54480tattcatgtg acagctatat tatgctgttt
catagccctt ttggtccttt tttcactctt 54540gcattgcata ttttgtgttt attgtgtttt
gtgtttcttc tgataatttg gaaggtttgt 54600atttttattc agggagttgc cttataatca
tactccgcaa tacacatcgt cctcagtttc 54660ttcagactgt ctgttaactc cctattctga
ataaaaatga cattgtaatt tccctctttt 54720ttctttaccc cttttcttct cctcacctaa
tgtaaatgat tttatccttc tttagtattt 54780gcttttttaa ttaactacat ttataaatat
ctttatcact tgatttttaa atcagctttg 54840aatgagatat ttggattcct agatataaaa
gatgttaatt ataccatttc cacgttagta 54900ggtttataaa atcatacatt ctgctgtgta
accataatcc cacgtttgtt ttagttccac 54960tcctacagtt aaaagattca gaagtattat
taacagttat tttgccatag ttttttcccc 55020aacccatttt gtggtaagtt atgatcctgc
tttagtttct taagaataat ttatagagca 55080gagtgtggtg gctcacgttt gtaatcccag
cactttggga gacaagaggt agaaggatcg 55140cttgaagcca gcagttcaag accaccctga
gcaacatagt gagaccttgt ctctacaaaa 55200aattttaaaa tttagccaga cgtagtggcg
tgtgcctata gtcccagcta ctcaggaggc 55260tgaggcaaga ggattgctag agcccagaag
tttgaggctg cagtgacctc tgattgtgcc 55320actgcacccc agtctgggca agaaagtgag
aacctatctc tttaaaataa caataataac 55380ttatgaaaat tatattccct gagtttttca
tgtttaaaaa tatttgttgc ctttatcctg 55440taaaagtttg agtataaatt cttgggttat
actttattta ttgaagaatg tataagtatt 55500gtcttctaga attgagtgtt gctgtaatga
aaccagaagt cagcctggtt tatttttcct 55560cagaaatgag gtaattgccg gccggacacc
gtggctcatg cctgtaatcc caacactttg 55620ggaggccgag acaggtggat cacgaggtca
ggagattgag accatcctgg ctaacatggt 55680gaaaccccgg ctctactaaa agtacaaaaa
gttagctggg catggtggtg gacgcctgta 55740atcccagcta cccgggaggc tgaggcagga
gaatggcgtg aacctgggag gaggagcttg 55800cagagagctg agatcgcgcc actgcactcc
agcctgggcg acagagtgag actccgtctc 55860aaaaaaacaa aaaaaaaaca aagaagtgaa
gtaattgcca tgatgctcca agaattatct 55920ctttgtctat gaaatccaga aatctcactg
ttatacattt tggaattatt attctgggcc 55980aatatttcct gggacacaat agattgactc
tatagattta attttttttt tttttttgag 56040acagagtctc actgcaatct cagcttactg
caacctctgc ctcacgggtt caagcaattc 56100tcctgcctca gcctcccaag tagctgggac
tacaggcgcg tggcaccatg cctggctaat 56160ttttgtcttt ttagtagaga cagggtttca
ccatgttggc caggctggtc ttgaacgcct 56220aacctcaagt gatccacctg cctcagcctc
ccaaagtgct gggattacag gcgtgagcca 56280ccatgcccag cctcaattcc tctttctatc
tggtaatttt tctgaagttg aaaacatttg 56340ttctaatacg ttatttcagt gttcttctaa
gatgtgtaaa gcaccctatt cccaggtcag 56400cccccatctt gctagtgagc tcggctggtt
cttcacaaga gctctggttt tctcctgctt 56460aatctcaagt acctctgtca gcctccacct
ggtttatgat ttggagtttt ttggtttttg 56520ttttttgttt ttgacagagt cttactctgt
cacccaggct ggagagcagt ggcataatct 56580cagctcactg caacctctgt ctcccaggtt
tgagcgattc tcctgcctca gcctactgag 56640tagctgggat tacaggcgcg tgccaccaca
cccggctaat ttttgtattt ttagtagaga 56700tggggtttca ccatgttggc cagggtggtc
ttgaactcct gacctcaggt aatccacctg 56760cctcagcctc ccaaagtgct gagattacag
gcgtgagcca ccgcgcctgg catggtttgg 56820agttttaatc tgtagtttta ataaagatag
tgcttatgtt tgtgtttctt atatttcttg 56880gtactcttgg gtaatttgta agatccccat
atctacacaa gaagtccatt ttcaattctt 56940ttcttcagac tgtttatttt attttatttt
attttatttt tatgtttgag atggagtctc 57000gctgtgtcac ttctggaggc tggagtgcag
tggcgcgatc tcaggtcact gcaacctccg 57060tctcccgggt tcaagcaatt ctcctgcctc
agcctcccga gtagctggga ttacaggcac 57120ctgccacttt ttaatttttt tagagacaga
gtctcgcttt gttgaccagg ctggagtgcg 57180gtggtgcaat catggctgac tataacctcc
aaatcctggg ctcaagtgat cctcctgcct 57240cagcctcctg agtagctggg actacaggca
catgccacca tgcccagtta attttaattt 57300ttttgtagag acagggtctc catatgttgc
ccaggctggc ctcctactcc tggcctcaag 57360taatcctcct acctcagcct cccaaattac
taggattata agcatgagcc accatgccca 57420gccttgttct actactttaa tttcatatgt
taggtgacca tgtaattgat catccaaacc 57480aggatactgt aagaatgaaa gaggctgaca
gtagtatgat gctgggacta gcattgtgca 57540ctgagattat ttctgggaaa gcaggagata
cggtcaccct acttatagtg tgcttgtctt 57600tggattgttg aatttggagt ttctatttgc
aggcttattt caactgggca gccttgatcc 57660gccctgccca gcaatgctac cgttctctcc
accgggtctc tgggacccct tcagtcacta 57720tacttagctc agttccccac cctcccactc
cctaaaagcg taaccaggaa tcctgcctca 57780ggtctactgc cgtcttccgt gggctgtttc
agttcctatt acccagagtc aaactcccag 57840cattccctac ctgattccag acttggagtc
cagagcttta acctcttcag gccaactccc 57900cactttgcat ttctgtccct atatcttagt
ccatggagat acatttcatg tctttgagtc 57960tacttacaaa gtaaattttg ctgtttttta
attttttttt tgagatggag tcttgccctg 58020tcacccaggc tgtggtgcaa tgacgccatc
tcggctcact gcaacctccg cctcctgggt 58080tcaagcgatt catctgcctc agcctcccaa
gtagctgtga ttacagacag gcaccaccac 58140gcccagctaa ttttttttat cttttagtag
agacagggtt tcaccatgtt ggccaggctg 58200gtcttgaatt cctgacctcg tgatctgccc
atctcggcct cccaaagtgc tgagattaca 58260ggcgtgagcc actgtgccca gccaattttg
ctttttttat atttcattgc tatatgttta 58320gaggataagt ttacagtgct atatgcattc
ccaaatatta gaccaaaaaa atctccaaaa 58380aattagaaag aaaatccaaa aaatctcaaa
aaataccaaa aagcaacaat ctcacagacc 58440atactcactg acccccaata aaataaaatt
agaaattaac cacaacttaa caaaataaag 58500tactcaagtc agagaggaaa gaggaaataa
acatcaaaat tacaaagtct aggcggtggc 58560tcacgcctgt aatcccagca ctttgggagg
ccaaggcggg cagatcacaa ggtcaggaat 58620tcgagaccag cctggccaat atggtgaaac
cccgtttcca ctaaaaatac aaaaattagc 58680caggcatagt gatgtgtgcc tgtaatccag
ccacttggga ggctgaggca ggagaatcac 58740tgaacccagg gagacgaaga ttgcagtgag
ccaaaatcgt gccactgcac ttcggcctgg 58800gtgacaaagc gagactccat ctcaaaaaaa
aaaaaattac aaactcttta gatagaaatt 58860ttggtgtttt tttttgagac ggagtctcac
tctgtcgcag aggctggagt gcagtgggac 58920tatgtcagct caccgcaacc tccatctcct
ggattcaagc aattctcctg tctcagcctc 58980ccaagtagct aggattacag gcgcccacca
ccagacccag ctagttttta tatttttagt 59040agagatggtg tttcaccatg ttggccaggc
tggtctcaaa ctcctgacct caagtgatcc 59100acctgcttca gcctcccaaa gtgctcagat
tacaggcgtg agccaccgca ccccacctag 59160atagaaattt caacatgagg ccgggcacaa
tggctcacgc ctgtaatctc agcacttcag 59220gaggctgagg cgtgggagga tcacttgggc
ccaggagttc aggaccagca tgggtgacag 59280agacagaccc tgtctctatt tatttgaaaa
aaaaaaaaaa aaagagagag agaaagaaat 59340ttcaacatga aaagtatctc tcaaaccctt
cgagatgttg gcaaaaagcg actcaaagga 59400aaatgtatta ctgtgtgtga atttgcttga
aaataagaaa gaggccgggt gtggtggcta 59460acacctgtaa tcccaacact ctgggagtcc
gaatcaagtg gatcatgagg tcaggagatc 59520gagaccatcc tggctaacat ggtgaaaccc
tgtctctact aaaaatacaa aaaattagct 59580aggcgcggtg gctcatgcct gtaatcccag
cactttggga ggctgaggca ggtggatcac 59640ctgaggtcag gggtttgaga ccagcctggc
ctacatggtg aaacctcgtc tcttctacaa 59700atacaaaaat tagctgggcg tggtggtggg
tgcctgtaat cccagctact cagaggctga 59760ggcaggagaa tcgcttgaac ccgggaggcg
gaggttgcgg tgagccgaga tcgcaccact 59820acactccagc ctgggcaaca gcctgggtga
cacagtgaga ctccatctca aaaaatacaa 59880aaaattagct gggtgtggtg gcctgcgcct
gtagtcccag ctacccggga ggctgaggca 59940ggagaatgga gtgaacctgg gaggaggagc
ttgcagtgag ccgagatccc accactgcac 60000tccagcctgg gcgacagagc aagactcttg
tctcaaaaaa aagaaaaaaa aaggaaaaaa 60060gaaccctgat aataaagaaa ccaaatgttc
aactctcaaa gctcggacac tttaaagaaa 60120taattaataa aggcagaagt taaagggagg
atgataaagc aatttttttt gttggttttt 60180ttgagatgga gtcttgctct gtcacccagg
ctggagtgca gtgatgcgat cttggctcac 60240tgcaacctct gcctcccggg ttcaagcaat
tctcctgcct cagcctcctg agtagctggt 60300actacaggtg cgcgccacct ggcccagcta
atttttgtat ttttattaga gacggggttt 60360caccatattt gttaggctgg tctcaaactc
ctgatctcag gtaatctgcc cacctcggcc 60420tctcaaagtg ctgggattac aggcaggcgc
caccgcgcct ggcctaaagc aaaatattgg 60480ttctgtgcaa aaggtcaata aaaagagcaa
acgtttacaa actggagcca gcacccattc 60540agctcagtgt gtctggagaa aaaacaatct
cgcttcagaa ttcatgatta cgcagccctt 60600tttgcttcct aaaaatccta ctatgttgct
gttgaccatt ctctctcttt ctctctctct 60660tgctttctct ccagaaaagc tattcagaca
ttctcctctt tcctcaaacc tccaacactt 60720cctcctccat ccttagcctc agctgctgac
ctcacttcta atcattgaga aaccaggaga 60780agcatttaag agtgaacctc cgcctccccg
cacgggcaaa accacccacc cacagaattg 60840tgccccaatt ctgcgtcctc tcctctcacc
atggatggac ggtccaggct ccgagccaaa 60900gccaggcctc ccctggagct ctggatccac
cacctgcagc ttctcaggca gggccccagc 60960agctcccctg ctcccttgta ccatcaatcc
ctcccctcac tgggtcactc ccaacaatat 61020atatatttag tgatgtttct cccatgtggt
aaaatcactt agcctctctc ctcccccagc 61080tactatccta tttgtttctt tccattctct
gcaaaacttc tcaaagcatt gtgtctatgt 61140gctgactcca tttatcttct cccgttctct
gctgagtcct tcccacagac tctcacccca 61200gttactccat gaaatgacct ctgcactgcc
acatccaatg gtgaatgttc agttcttaat 61260tttattcagt ctttcagcag catttgacct
ggccgatcac tccctcttct taaaaatact 61320tttctcagcc aggcgtgatg gctcacacct
gtaatcccaa cactttggga ggccaaggcg 61380ggaggatcat gagagcccag gagttcaaga
tcagcctggg caacatggca agaccctatc 61440tctacaaaaa ctaaaaagta gccagtgtga
tggcatgcac ctgtagtccc atctacttag 61500gaggctgagg cagtaggatg acttgagcct
gggaaatcaa ggctgcagtg agccatgatt 61560gcaccactgc actccagcct gagtgacagc
gagaccctgt ctcaaaaaga caaaatagga 61620aacttttctc agcatattcc tctgattctc
ctgctgcttc tgtctgcaca gattcagtct 61680cctttgccgg ttcttcctca tcctcctgat
ctcttgacct tgaagtgccc cagagtacag 61740tctttttttt tttttttgag acgcagtctc
gtctgtcacc caagctggag tgcaatggcg 61800aggtctcagc tcatgcaacc tctgcctcct
gggttcaagc gattctcctg cctcagcctc 61860ccaagtagcc aggactacag gcacatgcca
ccatgcccag caaattgttg tatttttagt 61920agagacaggg ttttactata ttggccacgc
tggtctcaaa ctcctgaact cgtgaaccac 61980ccgcctcggc ctcccaaagt gctgagatta
caggcatgag ccaccacacc cggcccagag 62040tacagtcttt agacggcctc tctacctata
cttgctcccc tcataaactc ctcctgcctc 62100atggctttaa ataccatcgg tagactgatg
actcccatat ttctcttttt tttttggaga 62160cggagtctcg ctcagtcccc caggctggag
tgcagtggcg cgatctcggc tcactgcaag 62220ctccacctgc caagttcaca ccattctcct
acctcagcct ctccagtagc tgggactaca 62280ggcacccgcc accacgcctg gctaattttt
ttgtattttt agtagagatg gggtttcacc 62340atgttagcca ggatggtctc gatctcctga
cctcgtgatc cgcccatctc ggcctcccaa 62400agtgctggga ttataggtgt gagccaccgt
gcccagccga tgactcccat atttctatct 62460cttgctgtgt gggagttctc ctcagaactc
catactcata aatccaactc tcataaatag 62520tatctcaaat gggcaatatg ctcaaaagtc
aattcctact tttctcccta aacttgcttt 62580cctgcagtct ccaccatctt aatgtccaat
ctaacattag gaggcaaaaa ctttgaagtc 62640attcttgact cttctctatt acacacccta
tccaatcttt ctgcagatcc agtcgacccc 62700caaatccagt tagctctcat catctcccct
gttaccccct ggtccaggcc atcttcctct 62760ctcacctgaa tcactgcagc attctcctca
ctggtctctt tggttctgtt ttcactccac 62820cttagcatag tctccacaga gcagtcagag
ggatcctttt aaagtgtaat tcccatcctg 62880tccctgctct gctcaaaacc ctgtcgtgat
tcccgtttta atctgtcaga ttaaaagcca 62940gagtctttcc agtgacctac atgatctgcc
tattatcacc tcccacttct ttccccttgc 63000tcactccact ccagctctgc agctgtcctt
tctgtttcct gaacagccca gattttgctt 63060ctttagaacc tttgtatttg ctgtcccctc
tgtctggaat gtttttccag gaagtcacct 63120ggctctctcc tgcacttcct tcctgaccac
catgtttaaa aatcactcaa acacacttca 63180ggccggacat ggtggctcac gcctgtaatc
ccagcacttt gggaggccaa ggtgggtgga 63240tcacctgagg tcaggagttc gagaccagcc
tggccaacat ggtgaaactt cgtctctact 63300acaaatacaa atagtagcca ggtgtagtgg
cacacacctg taatctcagc tactcaggag 63360gctgaggcag gagaatcgct tgaacccaga
aggcagagga ggtgcagtga gccaagatca 63420cgccacaaca ccccagcctg ggtgacagag
caagacccca tctcaaaaaa aaaaaaagaa 63480aaaaaaatca cacaaacaca cttctcttca
tattcctttt ccaagtttta tttttctcca 63540gaatacttta cattgtttta atggaagttc
tccgtttccc cccaactaga atggatactt 63600cctgcaggta ggcactctag tcctcccatc
caagtactaa ccaggctcaa ccctgcttag 63660cttctgagag caggggagat caggcctgtt
cagggtggta tggcccagga attttgattc 63720tgttttattc attgctgttc tgttgattct
cttttgttcc tcctcctagt gctgagaaca 63780ctacttgtac ataataagca ttcaataaat
atttgttgaa tgaatgactt gttgaatgaa 63840ttaatctcag aaatgcagga ctggttctac
attagaaaat ttttcaaggt cattctctgt 63900tgtcgtaaca cattaagaga ggaaaatttt
gtactctaaa tcatttgata aaatacatac 63960tgatttctgt tttcaaaaac tcttagtggc
tgggcgaggt ggctcacatc tataatccca 64020gcattttggg aggacgaggt gggcggatca
cttgaggtca ggagtttgag accagcctgg 64080ccatcatggt gaaaccctat ctctactgaa
aatagaaaaa ttagccgggt gtggtggcgc 64140atgcctgtag tcccagctac ctgggaggct
gaggcaggag aatggcttga acccgggagg 64200cggaggttgc agtgagccaa gatcatgcca
ttgcactcca gcctgggtaa cagagtgaga 64260ctccatctca aaagaaaact cttagtgagt
ttaggaatcc aaggaagacc ctcaaactaa 64320atagataatc tagctaccag aagccttcag
taaaccttaa cactccatgg tgaaacatta 64380gaaacattcc tactaaaaga caggctaaga
atgcctgcaa tcttcacggc tagtccaaga 64440agtcaaaaag aagaaatgag cgctgattta
aaaaaataaa caaacaaaaa actaccgatg 64500cagaggctgg cagcaaggac tgaaggactg
tacagtactt gcctggagca ggcggatggc 64560cacacccctg cgaagcctgc tcagctggct
gggggacgct ccagtgtgtg agtggcagga 64620tgcagggtac ttcctctgcc agggagttgc
actggggaga tcctccccca ctcacacttt 64680ggcagctggg gctttggaat gtgacttagc
ttctgtcaaa gggtcaatcc accctttgat 64740atatgatgca aaggcgaaca tatgatgcaa
aggtgagaga acagcccaaa ttaggacttt 64800taccacagct gtggaggtgg acagcgacag
tggtgggccc tggccagact tttcatgctc 64860aaaggtggtg gttgttcttc ctacttcttg
tccctccagg gcttcctttg cctgtgtgct 64920gaacctgctt cttttaattt tttttaactt
ttttaaattt ttaattgttt taattaaaac 64980aaattttgaa aactgtctga acctgctttt
gaaccctgct atgatttgaa tgtttgtccc 65040ctgccaaact gattttgaaa cttaatctcc
aaagtggcaa tattgagatg gggctttaag 65100cagtgactgg atcatgagag ctctgacctc
atgagtggat taatggatta atgagttgtc 65160atgggagtgg catcagtggc tttataagag
gaagaattaa gacctgagct agcatggtcg 65220ccccttcacc atttgatatc ttacactgcc
taggggctct gcagagagtc cccaccaaca 65280agaaggctct caccagatac agctcctcaa
ccttgtactt ctcagcctct gtaactgtaa 65340gaaataaatg ccttttcttt atgaattacc
cagtttcaga tattctgtta taaacaatag 65400aaaacgaact aaggcaaact ctcatgattc
tactgccatg ccattccaat aaactccctt 65460tatgcttaag agagccagag ttggccaggc
gtggtgactc acgcctgtaa ttccagcact 65520ttgggaggcc gaggcaggtg gatcacaagg
tcaggagatc gagaccatcc tggctaacac 65580ggtgaaaccc cgtctctact aaaaatacaa
aaaaattagc tgggcgtggt agtgggtgcc 65640tgtagtccca gctactcggg aggctgaagc
aggaggagaa tggcgtggac ccaggaggcg 65700gagcttgcag tgagtcgaga tcgtgccact
gcactccagc ctgggtgaca gaatgagact 65760ccgtctcaaa aaaaaagaga gccagagttt
atttctgttg cttgcaacca agaaatctgg 65820ctggtgcact gaagtttcca taaataatag
caatttaaag actctttcca agccaggcaa 65880tgcctagcct tgtgtagtcc ttgtggtaat
acattcattc attcatttgt tcaaccaact 65940gtgctccaga gactaagaat acaaaaatgg
gggccgggtg tggtggctca cacctataat 66000cctagcactt tgggaggccg aggcaggtag
atcacctgag gtcaggagtt cgagaccaac 66060ctggccaaaa tggtgaaacc cctactctac
taaaaataca aaaaattagc tgggggtggt 66120ggcggacacc tgtaatccca gctactcgtg
agactgaggc aggagaatca cttgaacccg 66180ggaggcagag gttgcagtga gccgagatcg
caccactgca ctccagcctg ggcaacaaga 66240gcgaaactcc acctcgaaaa aaaaaaaaaa
aaaaaaagag ggccggggct gggcgcagtg 66300gctcacgcct gtaatcccag cactctggga
ggccaaggca ggagaattac gaggtcagca 66360gatcgagacc agcctgacca acatggtgaa
accccatctc tactaaaaat acaaaaatta 66420tccgggcgtg gtggcgcaca cctctagtcc
cagctacttg ggaggctgag gcaggagaat 66480cgcttgaacc cgggaggcag aggttgcagt
gagccgaaat catgccactg cactccagcc 66540tgggtgacag agtgagactc cgtctcaaaa
aaaaaataaa aaaaaaaaaa gaattcaaaa 66600attgtagagt tatagtgtgc ttctagttta
gttgagagga catctgtcct tcaaggaagg 66660ctagaatcta taccctgagt ccttactgaa
atcaatccag cagtcaaaac atgggaccaa 66720cgatcacagc agtaagatag gaagagcacc
tttgtacatt tagctcatgt tgagataagc 66780cactgacaga gctgaaggaa gctcacagtt
ctgggttcca tcctttggca tttaaaaaga 66840aaagtgctaa gaaaattcgg ttggtcacgg
tggctcacgc ctgtaatccc aacactttga 66900gaggccaagg caggcagatc acgaggtcag
gagttcgaaa ccagcctggc caacatggtg 66960aaaccccgtc tctactaaaa acagaaaaat
tagccgggca tggtggcgca tgcctataat 67020cccagctact caggaggctg aggcaggaga
attgcttgaa cccgggaggg ggaggttgca 67080gcgagtgaga gcaggccact gcactccagc
ctgggagaca gagcaagact ctgtctcaaa 67140aaaaaaaaag aaaaaaagaa agaaaggaaa
aaaagaaaga aaaaaaaaga aaaaagaaaa 67200ttcaggccag gccaggcctg gtggctcaca
cctgtaatcc caacactttg ggaggctgaa 67260gcgagacggt gccttagccc aggagtttga
gaccagcctg agcaacatag cgagaccctg 67320tctctataaa aaaaaatttt tttttggcca
gacgcagtgg ctcacgcctg taatcccagc 67380actttgggag gccgaggcag gtggatcacg
aggtcaggag atggagacca tcctggctaa 67440cacggtgaaa ccccatctct actaaaaaat
acaaaaaatt aaccgggcgt ggtggcgggc 67500gcctgtagtc ccagctactc gggaggctga
ggcaggagaa tggcgtgaac ccgggaggcg 67560gagcttgcag tgagccgaga ttgcgccact
gcactccaga ctgggagaga gtgagactcc 67620gtctcaaaaa aaaaaaaaaa aaaaaaaaat
taattgtcag gtgtgctggc atgcagctgt 67680agtcctagct actcgggagg ctgaggtaag
aagatcgctt gagcccagga gttcaaggct 67740gcagtaatag tgcctctcac tctaccctgg
gtgacaatga gaccctctct caaaaagaaa 67800gaaaaaaggg aaagaagaaa agaaagaaag
aaagagaaga aaggaaggaa gaaagaaaga 67860aaaagaaaag gaaggaagga agaagaaaaa
aaaagaaaga aagaaaagag agagaagttc 67920aaagaccaaa gggtcaggat cccaaaatag
tttttatgtt ttatttattt atttacttat 67980ttatttttga gacagtatgg ctctgtcgcc
caggctggag tgcagtgatg cgattgcggc 68040tcactgcagc ctccaaactg ggctcaggtg
gccctcccac ctcagcctcc cgagtagctg 68100ggaccacagg cgcgtgccac catgcccagc
taatttttta attctttgta gagatgaggt 68160ctctatatgc tgcccaggct ggtctcgagc
tcctgggctt aagccatcca cccgcctggg 68220cctcccaaag tgctgggatt acagaagtga
gccaccgcgc ctaatcgggt ggtttgtttg 68280tttattgacg gggtctcgct gctgcccagg
ctggagtgcc agtggctgtt cacaggtgca 68340gtcctggagc attgcatcag ctcttgggct
ctagcgatcc tccagagtag ctgcagctgg 68400gattccaggc gcgccaccgc gcggggctca
gaatgggttt ttatattgag ggttatgctg 68460ccacctagag gatatatgta gtaccgaact
gtgtgcgcag ggaggctgag gttgcagtga 68520gccaagatga tgccagggca ctccagcgtg
ggtgacagag caagatttca tctcaaaaaa 68580aaaaaaaaaa aaaaaaaaaa aagaattgaa
agtaaggtct tgaagagata tttgtgcctg 68640tatggtcata gcagtattaa ctttgaccca
ctagctaaaa cacaaaagca acatgtgtct 68700gtcagcaggt gaacggataa acaaaatgtg
gtatatatgt acaattgaat attattcagc 68760ctttaaaaag gaataaaagg ctggatgcgg
gggctcacgc ctgtaatcct aacactttgg 68820gagactgagg tgggtggatc acccgaggtt
aggagtttga gaacagcctg gccaacatgg 68880tgaaacttca tctctactaa aaatactaaa
attagccggg catggtggca cttgtctgta 68940atccaagcta ctggggaggc taaggcagga
gaattgcttg aactcaggag ccggaggttg 69000cagtgagcta agatggcacc actgcactcc
agcctgggca acagagtgag actccatctc 69060aaaacaaaca aacaaaaaat tattatttcc
aaagaaacaa gaccctgggt ccatttccca 69120gcccacacct gatgttgact cacaacacac
agcctggttt gctatgagcc tgcttcattt 69180aattgtcacc ttaacttcac atcaccctca
agtcctggaa taactctttg ctgacctttg 69240tgtgctgagc catctccatg tcgctcaacg
tgcagtccct ctcactgcac tgagtcaata 69300gccagacgtg gtctgactgc agggtcatcc
ttggtggctt aggctgactc gggcatagca 69360gggtgctctg agacctcacc gcatataggc
tttgccccca ataaactcta tataatattc 69420atattatgtg gtctgggtgt gtgtagcttt
gcactgtctt ctcgtgacag tgccctcaac 69480ctctttccca ggatttcctc ctctacctcc
tcaagtccca ctgctctgca aagaccaaaa 69540gctgcagagt cccagctccc tcctttacac
cccacgacgc agcctcctct ctcagaaccc 69600tttaaacaga gtcttttact gcagatccca
agaacagcca cacccctctc tcccacccac 69660tccagacaca cccaggtaat tatagcaccc
agggtaacta tgtagatgga gtccctggaa 69720catgtggata gtgccccctg ggagtatgca
aaagcaacat tgctggcacc tgcagagaac 69780agggtgacat ccaggaatca gagcatgggc
ctctgggagg tagggatgtg gccaggcagg 69840ctgccaaaaa ttggtagagc aaggccacag
gatctttctg accttccttc caaacagagg 69900ctcctgtact ggtgatccct gtgttgattg
accactccct tcctgggggt cgtggtctct 69960gtcccagttg cccggacttc tgtgagtgtc
ctactgaggt ccttttcatg agaagcatgc 70020tgtccttcca cctgctggga gcaagagtga
caacttcaat actataatag cagtggcata 70080cagagaagaa gaaagatgaa gtggcaagaa
aaacaggctt ccaagcagga gtttttctat 70140aaaaacaaaa acgtttacaa gcaaactttt
tataaagggc tagatagtaa atattttagg 70200ctttgagagc cacatagact tgtttgcagg
gactcaatgt cgctattgta gtttgaaagc 70260agccatcagg gttatgtaaa tgagtgagtc
tgattttgtt tcagcaaaat tttatttacc 70320aaaacagaca atgagtgggc tggatttggc
ccatgatcct tagtttgcca actcctgctt 70380tgggctcacc cagatctgat tttgaattct
ggctctgcta ctggttagct gcaggagctt 70440ggaaggctct ctgagcctgt ttcctcatct
gtaaaattaa agcaataatt tctaacactc 70500aagagtgtta cctcacgcct gtaatcccag
cactttggag gctgaggcag gcggatcacc 70560tgaggtcaga agttcaagac cagcgtggcc
aacgtggcaa aaccctgtct ctactaaaaa 70620atacaaaaag tagccgggca tggtggcgcg
catctgtaat cccagctact tgggaggctg 70680aggcagggat actgctagaa cctgggaggt
ggagcgtgca gtgagtggag atcacacctc 70740cacactccag cctggccgac agagcgagac
tccatctcaa aaaaaaaaaa aaaaagagtg 70800ttagaaggtt ttgagataat gaataaaaga
tgccttgtgt atactaagta ttcaacaact 70860gatagctgca ttggtctaat tataacagtt
tagaagcgat tgagtcaaca aatgctggat 70920ttgtcaggga ggacttccta tcaggaggta
gatcttgggc tgagtcctga agcaaagata 70980ggcattggat agaggagttg agagaacacc
ctaggactgt tattattatt attcgacacg 71040gagtctcttg ctctgtcacc caggctggag
tgcagtggcg cgatctcggc tcactgcaac 71100ctctgcctcc caggttcaag cgattctcct
gcctcctaag tagctgagac tacaggtgtg 71160tgccaccaca cccggctaat ttttatattt
ttagtagaga cagagtttca ccatgttggc 71220catgctggtc tcgaactcct gacttcaggt
gatccacccg cctcagcctc ccaaagtgct 71280ggaataacag atgtgagcca ccgcacccag
cccagaacca tttttcaatc cttggctctg 71340ccttttatta gctgcaagat ctcaggcaat
ttatttaacc tctccaaaga ctcattttct 71400cattcacaaa atgaggcaaa taataatatc
tactatccca ggttgtcatg agaattaaat 71460gcaacatgac atttaatgaa atgagaagtc
ccttggacat taactggcta aagtatgtgc 71520tcgacaagga tatcatttta ggtggatact
tagcatctca gaactgatgc tcacaatgga 71580atatcattga aacgcattaa aattcatttt
aaatgattgt aggtagtgag gcaattgaaa 71640gaagaagaca agaggactga ttataatgct
tcaggctcac tagtctcctt ttaggaggga 71700aaaacaattt caagttaaat tttaggctct
agatttttac ccctgctgct cattagaatc 71760acccagattg atgaaatcag agcccatctg
aggctgtgtt tttcatctcc agaatgagag 71820ctgttgtggg gattaagttt ttgaaaaagt
acatctaaca ggtgatcgaa aatgatagtg 71880atattattgc agtgatggtc attattgttg
ttattattat actgaaagag gcttcagttt 71940tctgatccat aaagtgaggg aattgcatga
gaccattgct aagattcctt ctagctctgt 72000ttttttgttt ttgtttttta gacagagtct
ctgtcgccca ggctggagtg caatggcatg 72060atcttggctc actgcaacct ccgcctcccg
ggttcaaatg atcctcctgt ctcagcctcc 72120gaagtagctg ggactacagg cacacaccac
catgcccagc taacttttat atttttaata 72180gaggtggggt ttcaccatat tggtcaggct
ggtctcaaac tcctgacctc aggtgatcca 72240cccgcctcgg cctcccaaca tgctgggatt
acaggcatga gccactgtgc ccaacccctt 72300ctagctttct tgatcactga ttctagggtt
ctctgctgaa atatatttga gacatcctgg 72360ataaaagatc atgcaagagc tcccaatatg
gtattaataa ttgattctgg aggcttagct 72420actcctgatg gattagacat gactcaactg
cctctcttat gtgtacaaca caacaacaca 72480accaagaaag gttattctgg cattccattt
attcagttta tttacagccc ttacttccag 72540cagcacgtta aagatatggc cagggccggg
tgcagtggct caagtctgta atcccaggac 72600tttgggaggc caaggtgggc ggatcacaag
gtcaggagtt tgagaatctg gcaattcttc 72660agacttagaa gcaaccagct cgataacaca
gtcttgtgtg ggctctccct ctgtccctcc 72720ctcgcttccc tcatttctca tccctgcccc
tgagactgtg caccttcaca tagccctgcc 72780atgagacctt catctcaggc tttgctttct
ggggtaactg aggctaaaca ctgagtggcc 72840ctaaaagagg attgggattt ggaagttaga
ttattcacca gagaacagac tttgctgatg 72900atcaggccca ggttgtaatt gttgaaaaaa
agagaggatg catagtctta tctcatctcc 72960tagtcaaagt caacaccatg ataaataaga
gtcaaatcct gagatgtgaa ttggggacat 73020ttgagtggtt aaccctgaga agcttgcacc
ttcagacccc tcaatacccc tgctccccag 73080agaaggctgg acattgacct cagcacaggc
aggagccctg caagatgcca tttgtcctac 73140taaagatgga cccctccact ctgtttctag
gtaaataacc aaagtcaagt ctccacacag 73200cctgagcaag aaagtcagag cctgctacag
gagaaaatac cacactggcc aaaggattca 73260ctagccctgg ccactgtgtg tgggaggaac
cagggaatca tgtgtgggag tcaatgttga 73320agctgttgga ctgggggtgg ggtggaatat
aagcctggcc ctggggagtt tttcccgttt 73380gagggccttt acccacaact caagatccag
tgctatagca ggagatccca gagctagtcc 73440taacagatgg tcaggattga acttggccta
gagtaaaatg aggaggatag tgccagaact 73500ttctcaacat actattgagg aagaggtcag
aaggcttaag gaggtagtgt aactggaaag 73560gggtcctgat ccagacccca ggagagggtt
cttggacctt gcataagaaa gagttcgaga 73620cgagtccacc cagtaaagtg aaagcaattt
tattaaagaa gaaacagaaa aatggctact 73680ccatagagca gcgacatggg ctgcttaact
gagtgttctt atgattattt cttgattcta 73740tgctaaacaa agggtggatt atttgtgagg
tttccaggaa aggggcaggg atttcccaga 73800actgatggat ccccccactt ttagaccata
tagagtaact tcctgacgtt gccatggcgt 73860ttgtaaactg tcatggccct ggagggaatg
tcttttagca tgttaatgta ttataatgtg 73920tataatgagc agtgaggacg gccagaggtc
gctttcatca ccatcttggt tttggtgggt 73980tttggccggc ttctttatca catcctgttt
tatgagcagg gtctttatga cctataactt 74040ctcctgccga cctcctatct cctcctgtga
ctaagaatgc agcctagcag gtctcagcct 74100cattttacca tggagtcgct ctgattccaa
tgcctctgac agcaggaatg ttggaattga 74160attactatgc aagacctgag aagccattgg
aggacacagc cttcattagg acactggcat 74220ctgtgacagg ctgggtggtg gtaattgtct
gttggccagt gtggactgtg ggagatgcta 74280ctactgtaag atatgacaag gtttctcttc
aaacaggctg atccgcttct tattctctaa 74340ttccaagtac caccccccgc ctttcttctc
cttttccttc tttctgattt tactacatgc 74400ccaggcatgc tacggcccca gctcacattc
ctttccttat ttaaaaatgg actggggctg 74460ggcgcggtgg ctcatgcctg taatcccagc
actttgggag gccgaggcgg gcggatcatg 74520aggtcaggag atcgagacca tcctggctaa
cacggtgaaa ccccgtctct actaaaaatg 74580caaaaacatt agccaggcgt ggttgcaggt
gcctgcagtc ccagcggctc aggaggctga 74640ggcaggagaa tggcgtgaac ctgggaggtg
gaggttgcaa tgagccgaga ttgtgccact 74700gcactccagc ctgggtgaca gagcgagact
ccgtctcaaa aaaaaaaaaa aaaaaaaaaa 74760tagctgggca tggtggcgcg tgcctgtaat
accagctact ctggaggctg aggcaagaga 74820atcgcttgaa cccagtaggc ggaagttgca
gtgagccgag atcttgacac tgcactccag 74880cctggtgaca gagtgagact ctgtctcaaa
aaaaaaaaaa agaaaaaaaa agacagaaag 74940aaagagcaca gacagagtca caggtatttg
cagtaggaag ctgtcaggtt agagtgcacg 75000gaaatagaaa gtatatttta cacttacagc
acatcttcgt ttgattagcc acatttaaaa 75060tactgaatag caacgtgtgg ctatttagta
ttcactaaaa tcttggacag tgcaagtcta 75120aagaatcctt gatccgtccg gcatggtggc
tcacgccttt aatcccagca ctttgggagg 75180ccaaggtgga aggatcactt aaggtcagga
gttcgagacc agcctggcca acatggtgaa 75240acctcgtctc tactaataat acaaaaaaaa
ttagccgggc atggtggtgc atgcctgtaa 75300tcccaggtac ttgggaggct gaggcaggag
aatagcttga atccaggagg cgctgcagtg 75360agccgagatc atgccatgcc actactgcac
tccagcctgg gcaacagagt gagactgtct 75420caaaaaaaaa aaaaaaattg ttgggcgtgg
tggctcacgc ctgtaatccc agcactttgg 75480gaggctgagg ggggtggatc acctgggttc
tggagttcga gaccagcctg gccaacatgg 75540tgaaacccca tctctactaa aaatacaaaa
attagctggg cgtggtggtg ggcacctgaa 75600atctcagcta ctcaggaggc tgaggcagga
gaatttcttg aacccaggag gcagaggttg 75660cagtgagcca agatcgcgcc tctgcactcc
atcctgggtg gcagagcaag actatgtctc 75720aaaaaaaaaa aaaaaaatac ttgattgtct
ggacattctg cagaacatca tatggagaca 75780ctatgttgac gacatcatgc tgattgtaag
caagaaatgg caagtgttcc agaaacacag 75840tcaagacaca tacatgccag aaggtgagat
ataaactcta ctaagattca gtggcctgcc 75900acactggtga catttttaaa cctgctagat
gtttgtgtag aaaaggattt aaccttgccc 75960aaagaggggt ctggcctttg tccccagcta
ctggacataa tctctttaaa ctcttgaaat 76020atcattcctg atagaagtat ttttgttttg
actaggggcc ttgggccagc cagatagcaa 76080caatgtgatc tgggttgggg gctttggatc
aggtggcatc agtgtgacct cctgagtggc 76140tagagactag aatcaaccac atgggcagac
aacccagctt acatgatgga attccaataa 76200agactttgga cacaagggct tgggtaagct
ttcctggttg gcaatgctct atactgggaa 76260acccattctg actccatagg gagaggacaa
ctggatattc tcatttggta cctccctggg 76320ctttgcccta tgcatttttc ccttgtctga
ttattattat tattatgaga tggaatctcg 76380ctctgtcacc caggctggag tgcagtggaa
tgatctcaac tcactgcaac ctctgcctcc 76440ccggttcaag cgattttcct gtctcggcct
cccgagtagc tgggactaca gatgcatacc 76500accacacccg gctaattttt ttgtattttt
agtagagacg gggtttcacg ttagccagga 76560tggtctcgat ctcctgacct catgttccgc
ctgcctcggc ctctcaaagt gctaggaata 76620catgtgtgag ccaccgcgcc cagccccctt
ggctgattat taaagtgtat ccttgagctg 76680tagtaaatta taaccgtgaa tataacagct
tttagtgagt tttgtgagca cttctagcaa 76740attatcaaac ctaaggatag ccttggggac
ccctgaactt gcagttggtg tcagaaataa 76800gggtgctcat gtgtgtacca tgccctctaa
ttttgtagtt aattaacttt cacaacttta 76860ttattaccgc ttacactcaa tgtttattca
catttatcca cataccactt attctagtgc 76920cttgcatcaa agactttcta tctcatgtac
tttattctgc ttgaagtaaa tcctttagga 76980tattcttttt tttttttaaa ctttgcacat
acatactttt attttttatt tatttttaat 77040tttgttattt ttgtgggtac gtagtagata
tatgtattta tggagtacat gagatgtttt 77100gatacaggca tgcaatgtga aataagcaca
tcatggagaa tggggtatcc atcctctcaa 77160gcaatttatc cttcaagtta caaacaatcc
aattacactc tttaagttat tttaaaatgt 77220acatttaatt ttgtattgac tagagtcact
ctgttgtgct atcaaatata attttttttt 77280tttttgagac agagtctcac tcagtggccc
agactgaaag tgcagtggca caagctcggc 77340tcacttcaat ctctgcctcc ctggttcaag
cgaatctcct gcctcagcct cccacatagc 77400tgggattaca ggcacacacc accatgccca
gctaattttt atattttttt agtagagacg 77460ggttttcgcc atgttggcca ggctggtctt
gaactcctgg cctcaaatga tctgaccacc 77520tcagcctccc aaagtgctag gattacaggc
atgagccacc acacctggcc aaaatagaat 77580attctttagt gaggtctgct ggtgacaatt
tttttctttt ttttgagact gagtctcgct 77640gttgtcagct tgggctggag tgcaatagca
cgatctcagc tcactgcaac ctccacctcc 77700cggattccag caattctcct gcctcagcct
cccaagtagc tgagagatta caggcaccca 77760ccaccacacg cggctaattt ttgtattttt
agtagaaatg ggggttcacc gtgttggcca 77820ggctggtctc gaactcctga cctcaggtga
tccacccacc ttggcctccc aaagtgctgg 77880gattacaagc atgagccacc acgcacagcc
aattttttcc gtttttgtct gaaatcttat 77940tttgtgtcat ctttgaaata tatttttgat
ggatataaaa ttgttggttg atagttatta 78000tcattattat tattattttg agacagggtc
tcactctgtt gcctatgctg gggtgtagta 78060atgtgatctc ggttcactgc agacttgacc
tcctagggct caggtgatct tcccacctca 78120gcctccctag tagctgggac tacagatgca
tgccaccata cccaactaat ttttctattt 78180tttgtagaga tgaggctttg ccacatttcc
caggctggtc tctaactcct gagctctagc 78240aatccaccca ccttggcctt acaaagtgct
gggccatgac tagccagcag ttacttttta 78300tagcatattg aatatttaat atgaatcttc
tggcatccac tgtaactgtt taaaaaatca 78360gctgtttact tggcactctt tttttttttt
ttttttttga gacagagtct tgccctgtcg 78420cccaggctgg agtgcagtgg cgtgatcttg
gctcactgca agctctgcct cccgggttca 78480cgccattctc ctgcctcagc ctccggagta
gctgggacta aaggcgcccg ccaccacgcc 78540cggctgattt ttttgtattt ttcgtagagt
tggggtttca ccgtgttagc caggatggtc 78600tcgatctcct gacctcgtga tctgtccgcc
tcggcctccc aaagtgctgg gattataggc 78660gtgagccacc gcgcccagcc tctttttttt
ttttttttag acggagtctt actctgtcat 78720ctaggctggt gtacagtggc gtgatctcag
ctcagtgcaa cctccacctc ctgcctcagc 78780ctgccaaata gctgggatta caggtgcgta
ccatcacgcc cggctaattt ttgtattttc 78840agtagagatg gggtttcacc atgttagaca
ggctggtctc gaactcctgg cctcaagtga 78900tctgcctgcc ccagcctccc aaagattaca
ggcatgagcc accgcacccg gccaagtagc 78960actcctttga aggtaatctg cttcccctac
ccctagcaat ttttaacaat ttttcttcat 79020ttttatttcc tgaagttttg ttattaataa
tctgtgtgca gatttctttg tatttctttt 79080gtttgcagtt catagtgatt cttgaattag
tgtgttggtt tctgttatca ccacaggaaa 79140attgtcagcc gttagctttt caaatatttc
cttgctaaat tctctcttct cccctttcgg 79200tacaattgat ttgattaaaa ctaaaaccag
ggccgggtgc agtgactcat gcctgtaatc 79260ccaacacttt gagaggctga ggcaggtgga
tcacctaagc tcaggagttc aagaccagcc 79320tggccaatat ggtgaaaccc cgtctctact
aaaaatacaa aaattaccag gcatggtggc 79380acacatttgt agtcaggagg ctgaggcagg
agaattgctt gaatccagga ggtggaggtt 79440gcagtgagct gagatcccac cactgcagtc
tggcctgggc gacagagtga gatgagaatc 79500tgtctcgaaa aaaaaagtta tgaatgtttg
ataaactata tttgttagaa tgtttgttgt 79560agaatactat tcattgattt ttaaacaatg
ttagattaaa ccattcactg gatttgtgat 79620aattaactta ctgattttac ctcactgatt
tgttgtaatt aatacaactg gtataaaaag 79680actgtgacga ggccgggcat ggtggctccc
gcctataatc ccagcacttt gggaggctga 79740ggcaggcgga tcacctgagg tcaggagttc
aagaccagcc tgaccaacat ggtgaaaccc 79800catctttact aaaaatacaa aattagccgg
tcgtggtggt gcatgcctgt aatcccagct 79860cttcgggagg ctgtggcagg agaatcactt
gaacccggga ggtggaggtt gcagtgagcc 79920gatatcgcgc cattgcactc cagcctgggc
aacaagagcg aaactccgtc taaaaaaaaa 79980aaagaaaaaa aacacataaa acaaaacaac
actgtgacgg ttcccaaaaa ttaggagcat 80040aattaaagga actcctgata aaaattaatt
ttatcttaca tgtaaactaa aatgacttta 80100tgaagttaat tcagaaatac aatgcagggt
attagtttgc cacagctgcg tattcagcct 80160aatgtaatat tcttgttatt tttaaattct
tcttttaact ttactcatat gtggatcatc 80220aaatttcaaa agattaaatg acaatactct
tagcagcaag cttccctaag catataaaca 80280ttttaatggg tgatgattca gaaggtaccc
gaagaatatg tactgccaga tatcattcac 80340ccccatatac ctgcccgaca gacatcccat
tttgggaccc tggataaatg tgtgggtgga 80400gagaaagata ggagaaagtg gtataagcaa
atggctttgg agtctgattg acagcgattg 80460aaatcctgtc tctacctctt aacagcctca
tgatcctaca taagttaccc cgatcctcag 80520ggccacatct gtaaattggg ggttgcgatg
gcagccatct cacagggtct cttttcgggg 80580aagggcagga attatggatt aagtgagcta
gtaattgtaa agcacttaat acaaggaggg 80640cgcataataa gtacttcata aataatgacg
gccattatca tgactgaggt gtatgcagct 80700gtcggggatt acggcgactt cagaatttct
ggtgggcagg gctcaaaggc agcaaatcac 80760actggaagtc gaggtgaggc actgcttctg
cacagactgc ttagctggag agaatgagga 80820aggcttagag gagatttaga ggaacttaga
gtcctccgcc tccaactctg tgggatctgc 80880tcccgtgcca gagacattca ggggatttct
cgcactctcc cctcccctac gtccctcccg 80940ccccatccaa ctaaccacac aacacataca
aaatagcccc tgcgaggttc tgcacgctgg 81000aagggaacag gagaagggcg ctgcgctttc
ttgctgatgc cctgtacttg ggcccctggt 81060agacacagcc acttgtcccc tcagcctgca
gagaaatccc acgtagaccg cgcccgggtc 81120cttggcttca gccaatctcc ctttggtggg
ggtgggatgc acgatccaag gttttattgg 81180ctacagacag cggggtgtgg tccgccaaga
acacagattg gctcccgagg gcatctcgga 81240tccctggtgg ggcgccgctc agcctcccgg
tgcaggcccg gccgaggcca ggaggaagcg 81300gccagaccgc gtccattcgg cgccagctca
ctccggacgt ccggagcctc tgccagcgct 81360gcttccgtcc agtgcgcctg gacgcgctgt
ccttaactgg agaaaggctt caccttgaaa 81420tccaggcttc atccctagtt agcgtgtgac
cttgagcagt tgactttatt tttcagtgcc 81480tagttttcca gataccagga ctgactccaa
ggactattac tcatctggag ggtttagcac 81540agtaccgtcg catagtaaat ttccatgtca
gttttggtta cctttcatgc acttgcaaac 81600atgccatgct ctgaaacgaa ataggcacat
cttttttttt ttttttttta aggagtcttc 81660ctctcgccca ggctggagtg cagtggcgcg
atcttggctc actgcaacct ccacctcccg 81720tgttcgagat tctcctgcct cagcctcctg
attagctggg actacaggca tgccacgacg 81780cccagttaat ttttgtattt ttagtagaga
cggggtttcg ccatcttggc caggctggtc 81840taactcctga cctcaggtga tctgactgcc
tcagcctctc aaagtgttgg gattacaggc 81900ataagccact gcatctggcc agaaatgaaa
taagtaaatc ttttaacctg ctctaacaat 81960atagtgaaaa gaccatatta ttattagagc
aggttaaggg atttgcctat ttcgggttct 82020agttatagtc ttaaacttgg acattcttgt
agaaagtaaa aagtttcctc ttcaaagttc 82080cccttcttgt taaagaatac atcataagtg
ttagaagtaa tagtttattt taaagactaa 82140ctttcttcaa gcctccttgc tttgtgctaa
taactctttg ttaagcccta tcctatgtaa 82200ctgttggaca tgctcacagg cacgttccag
ttcacagcct atgccccttc cttatttgga 82260aatgttattg cttccttaaa cctttcggta
agcaacttcc tctccttctt cgttcttcct 82320tgcacttacc tatttagaaa gttttaggct
attagcaaat cggctatcag tttaagagtg 82380tgaggtcccg ctccagccaa tggatgcagg
acatagcagt gaggacgacc caaatgcgta 82440agggataaat atgtttgctt ttcctttgtt
caggtgtgct ctcgacatcg ttccatctgc 82500gattgagcac cctttctgca gaaagtaaag
attgccttgc tggagatctt ttgtctccgt 82560gctgactttt cttcgtggca ccgattatct
atttctaaca attttggtat ttctaacatt 82620ctgaacaatc ttgggctagt tgtctcttct
gggcctgttt ccccatccgt cacatgataa 82680acttcattgg tttaaaaacc ccagcgaaca
tttattgagt tactattacc ttcctgccct 82740ccccaacccc aaccccaggg agcagttaca
acctcagccg ctgagcgcac tcgccgggtg 82800ttaagaagca ccaaagacag ggaggcttga
ttgattttgc tttgggagta gagggtcaga 82860agattcacag gaaaatggca tttgagcaag
gatgattcac tggagctagc ttttaaatac 82920tggcgaggct tttatgttgc agtcccttac
aaagttgagc attcgcaggg actgcactcc 82980gaaataagcc cgcttcccct tttcattcgc
taatgatcca gggagctgct ggttccgcat 83040gcggcaggtt gtgccttttc ctaatcaggg
ttctgcatcg cctcgaaccc gcaggccgtg 83100gcgggttctc ctgaggaagc agggactggg
gtgcagggtg aagctgctcg tgccggccag 83160cgcctgtgag caaaactcaa acggaggagc
aggaggggtc gagctggagc gtggcagggt 83220tgaccctgcc ttttagaagg gcacaatttg
aagggtaccc aggggccgga agccggggac 83280ctaaggcccg ccccgttcca gctgctggga
gggctcccgc cccagggagt tagttttgca 83340gagactgggt ctgcagcgct ccaccggggg
ccggcgacag acgccacaaa acagctgcag 83400gaacggtggc tcgctccagg cacccagggc
ccgggaaaga ggcgcgggta gcacgcgcgg 83460gtcacgtggg cgatgcgggc gtgcgcccct
gcacccgcgg gagggggatg gggaaaaggg 83520gcggggccgg cgcttgacct cccgtgaagc
ctagcgcggg gaaggaccgg aactccgggc 83580gggcggcttg ttgataatat ggcggctgga
gctgcctggg catcccgagg aggcggtggg 83640gcccactccc ggaagaaggg tcccttttcg
cgctagtgca gcggcccctc tggacccgga 83700agtccgggcc ggttgctgaa tgaggggagc
cgggccctcc ccgcgccagt ccccccgcac 83760cctccgtccc gacccgggcc ccgccatgtc
cttcttccgg cggaaaggta gctgaggggg 83820cgccggcggg gagtcaggcc gggcctcagg
ggcggcggtg gggcaggtgg gcctgcgagg 83880gctttcccca aggcggcagc aaggccttca
gcgagcctcg acctcggcgc agatgccccc 83940tgagtgcctt gctctgctcc gggactcttc
tgggagggag aaggtggcct tcttgcgcga 84000ggtcagagga gtattgtcgc gctggttcag
aagcgattgc taaagcccat agaagttcct 84060gcctgtttgg ttaagaacag ttcttaggtg
ggggttagtt tttttgtgtt tctttgagga 84120ccgtggatca agatcaagga aatctcttta
gaaccttatt atggaagtct gaagtttcca 84180aatgttgagg gttttatgtc taaaagcaac
acgtgaaaaa attgttttct tcacccagtg 84240ctgtcttcca atttcctctt tggggggagg
ggtagttact gctgttacta aaataaaatt 84300acttattgct aaagttcccc aacaggaaga
ccactacttt tgatgacttt ggcaagtttg 84360ctaactactg gaaccctaac ttacaaacga
actacttaca tttttgattt ccagttgtat 84420tacctgccca atgtttacgt agaaacagct
taattttgat tctgggtaac gttgttgcac 84480ttcattaaaa atacatatcc gaagtgagca
agtatgggtc tgtggacagc agtgattttt 84540cctgtcaatt cctgttgctt cagataaaat
gtaccagaca gaggccgggc gcggtggctc 84600acgcctgtaa tcccagcact ttgggaggct
tggcgggtgg atcacctgag atcgggagtt 84660caagaccagc ctgaccaaca tggagaaacc
ccgtgtctac taaaaataca aaattagcca 84720gggtggtggc gcatgcctgt aatgccagct
acttgggagg ctgaagcagg agaatcgctt 84780gaacctggga ggcggaggtt gcggtgagcc
gagatagcac cattgcactc cagcctgggc 84840aaaaagagcg aaactccgtc tcaaaaaaaa
agtaccagac agaaatgggt tttgttttct 84900ttttttgttt tgagacggag tttcgctctt
gttgcccagg ctcgagtgca atggcgcgat 84960ctcagtctcg gctcactgca acctctgtct
cccaggttta atcgattctc ctgcctcagc 85020ctcccaagta gctgggatta cccatgcccc
accatgcccg gctaattttt gtatttttag 85080tagaaacggg gcttcaccat gttaggctgg
tcttgaaccc ctgacctcaa gtgggcctcc 85140cacctcggcc tcccaaagtg ccaggattac
aggcatgagc caccgcggcc agccagaaat 85200gggttttgga aaaagcacta aacaaaatcg
aacttggttt catatgacag ctctgctgct 85260aactgtaaca ggggcagacc agttaaccta
cttttctgtc ttctgtcagc tgagaattag 85320atgattccca aaggcccatt gaactctgaa
tgactttaaa tacttcttct taagtgggta 85380cacggttttg gtaactgatg ccaggtgatg
aatgcatgaa agtgcttaat gaatgaaacc 85440ggtaaaatag taggaggaag ctttattggt
aaggcagggg tatacctaat agctctctaa 85500tttattggta ttgaagtggt taacttttgt
ttttttaagg ggggaaaaca ttctaagaat 85560aatgaggcaa actgcatatt gcacaagaga
ctgttgtctc tattcaacaa ataccttttg 85620agtgtccaga gtctgccagg tgctgtgcta
ggccctcacg attgagtagt gaaccagaga 85680atgtccctgc acccatggag cttattgtct
actggggtag acagataata aataagcaaa 85740caaatcttct ctcttctccc tttcgctcca
tgtaagtgtg tgtgtatagg tgtatactta 85800caagttgagt aaagtgttat gaaagattaa
gaggagaaat gcattttggt tagatgttag 85860aggactcagc aggtgacctt gaaacttaga
gctgaaggat cagtaggagg taactagaga 85920ggccagggaa tcgcatgttc aaaggccagg
aggcaagaaa gagcatggtg cccttcaaga 85980gaggaaagaa ggctactgtg actggagcat
agatgtaggc aagtgttggg tgattgagag 86040ctctacgggc catggttagg ttttattcct
aatgccgaga tgccaaacat ggtggttcat 86100atctgtaatc ccagtatttt aggaggccga
ggcaggaata tagcttgaac ccaggagttc 86160aagaccagcc tgagcaacat gagacctgta
caaaacattt aaaaaattgc tgggtatgat 86220ggtgcacacc tgtggtccca gctactcagg
aggctgaggc agaaggatca cttgagccta 86280ggaggtggag gctacaatga gccatatttg
agtcactaca ctccagcctg gatgacaaag 86340tgagaccatg tgtcaaacaa aatacagaaa
gaatattaat ttaaaatttt gaaagaggag 86400tgatctgaac ttatatctta aaaagatcat
tctagggcat ggtggctcat gcctgtaatc 86460aagggctttg ggaggctgag acaggaggat
cacctgaggc cagttcgaga tcaacctgta 86520cagcatagag agactccatc tctacaaaaa
gaaaaaataa atagctgggt gttgtgagtt 86580attcaggagg ctgaagcaga aagatcactt
gagcccagga gtttgaggct gcagtaagct 86640atgatcccac cactgcaaca cagtgagatc
ttgtctcaaa aaaaaaaaaa aatcattcta 86700ggtgcttttt ggaggctgga tgtggtaaga
gtagaagctg gagatggtcc tgttagggat 86760tcgattcaga ctttaaatac catcaatgca
ttgagtccca aatttacatc actacgttgg 86820atccttgccc ctgaatccag actggtatat
ccaactttag gttcagtttg tatctctacc 86880tgaccaatat agaggtgtcc agtcttttgg
cttccctagg ccacattgga agaagaattg 86940tcttgagcca cacatagagt acactaacgc
taacaatagc agatgagcta aaaaaaaatc 87000gcaaaactta taatgtttta agaaagttta
cgaatttgtg ttgggcacat tcagagccat 87060cctgggccgc gggatggaca agcttaatcc
agtagatacc ttcaacttac aatatctaaa 87120attttatgcc agatttagtc attttaaacc
tgctcatcag tttttctcaa gaagtagtat 87180tttggctttt tttcttttct tttttttgag
atggagtttc gctcttatcg ttcaagctgg 87240agtgcagtgg cggatcttgg ctcactgcaa
cctccgcctc ctgggttcaa gtgattctcc 87300tgcctcagcc tcgcaagtag ctggaattac
aggcatgcgc caccatgacc agctaatttt 87360tggagacagg gtttcaccat gttggtcagg
ctggttttgt actcctgacc tcaggtgatc 87420tgcctgcctc ggcctcccaa aggctgggat
tacaggcatg agccaccgct cccggctgca 87480tttttggatt tttagttgct cagcccaaaa
ctttagtaca tctttgaacc tcttctttcc 87540tcctactcta tatctgatcc atcagcaaat
ctgttaggtc tacctcacac atatcgaaat 87600cctaccacgt ctcaccatct gtgacaatta
acaccctggt ctaggcagtc atctctgtta 87660agattgagtg gttaaggatg tcctctaagg
agatgacatt caaatcttag cttaaatgtc 87720aagagggagc tggttttata aagattgagg
aggcagcatt attttgccat aggcttccat 87780ttggtttcca ttccattctt gatacttatg
gtatatattc aaaacaaatg cacagaaaca 87840gacccaggta tattgggaat ttcggatata
gagttcctag ttgggaaaag atagactgat 87900ctgtaaatga tgctagttat ccatcatctg
gcaaaaaata atttcctgcc tcctctcata 87960tatctcagat caacagactt tttctgttaa
gggccaaatc ataaatattt taggctttcc 88020agaccatatg gtttctgtca cactctcctt
tatccttgaa gccatagaca atatgtaaac 88080aaatgggcat ggctgtgcta cgataaaact
ttacttacaa aaactggtag tgggccagtt 88140taggcatggc cagcactttg ggaggctaag
gcagatggat cacttggggt caggagtttg 88200agaccagcct ggccaacatg gtgaaaccct
gtctctacta aaaatacaaa aaatagctgg 88260gcatggtggt gggtgtctat aattccagct
actctggagg ctaagacaca agaatcactt 88320gaacccagga ggcagaggtt gcagtgagct
gagatagcac cactgcactc cagccagggt 88380gacggagtct taaagcaaaa caaaacaaaa
ggtagtgggt tgtatttggc ccatgggctg 88440tagtttgcca atccctgatg cagaaacaaa
ttccaggtaa ataagagcct ggaatgttaa 88500aaaaacaaaa cttgaagtca tgtagaagaa
caggtagggg gaacaatcct gatctcagga 88560taggaaggga tattgcttaa aataagacac
aggaaaatat aatccatgtt gtgtaaattt 88620gactacgtta aaacttaaaa ctttcgccaa
gcgcggtggc tcacgcctgt aataccagta 88680ctttgggagg ccgaggtgag cagatcacca
ggtcaggaga ttgagaccat cctggctaac 88740acggtgaaac cccgtctcta ctaaaaatac
aaaacattag ccgggcgtgg tggcgggcgc 88800ctgtagtccc agctacttgg gaggctgagg
caggagaatg gcctgaaccc gggaggcgaa 88860gcttgcagtg agctgagatc gcgccactgc
actccagcct gggcgacaga gtgagattcc 88920gtctcaaaaa aacaaaacaa aacaaagcaa
aaaacctaaa actttcatac aataaagtat 88980acctaagata cttctagaag agaagattta
catccaggac gtgtatggaa tttctgcaag 89040taataagtaa aagacaaggg acatgaagag
gcagttcaca aaagaggaag ccaaaatgac 89100caataaacat gaaaggatgt ttaacctcaa
aggaaacaag gaaatgaatt aaaaacatca 89160aatgccattt caaaactagt aagttggcaa
aattaaaaat accaaggatg agaatatgaa 89220gcatggctat atgagtgcat ggaatggtac
agtcactttc attaaaaatg cacataattt 89280gttttttatt tatttttttg agacagtcta
tgtcgcccag gctagaatgc agtggcatga 89340tctcggctca ccacaatctc tgcctcctgg
gttcaagcaa ttctcctgcc tcagcctcct 89400gagtagctgg gattacaggc acatgccaca
acgcccggtt aagttttgta tttttagtag 89460agacagggtt ttgccatgtt ggccaggctg
gtctcgaact cctgacctca ggtgagctgc 89520ttcccaaagt gctgggatta gaggcgtgag
ccaatgctcc tggctgaaaa aaatgcacat 89580aatttgttac ctagcaattc catgtctaga
ggcttatcct agagaaattc ttgcttatat 89640gcataggaag acgtgtacta gaatgttcac
tagttgaatg tttaagtgaa aattaggaaa 89700taaagtaaat gttcattaac aggaaaatga
gtaaaggtat atttataaaa caattaagta 89760gctaaaatga ataaactaga gctgcgtgaa
tgaactagaa ctggttcaat agtcatgtca 89820gattattgaa tgaatacagg tcagatatgt
atagagtgtc atttgtgtaa ttaatttttt 89880tttttttttt gagatggagt ctcactctgt
tgcccaggct ggagtgcagt ggcgtgatct 89940cagctcactg caacctccac ctcctgggtt
aaagtgattc tcctgcctca gcctcccgag 90000tagttgggat tacaggcatg caccaccatg
cccagctcat tttcctattt ttagtggcca 90060cagggtttca ccatgttggc caggctggtc
ttgaactcct gacctcaagt gttccaccca 90120acttggcctc ccaaagtgct aggattacag
gcgtgagcca ccgtgctcag ccatttgcgt 90180gatttttaaa gatgtgcaga ataatgccat
taaaaaaaat acacatacat gtatatatat 90240acacgtttgg ctgggtgtgg tggctcacac
ctgtaatccc agcactttgg gaggctgagg 90300caggaggatc acttgagccc aggtgtacaa
gactagcctg ggcgagatag caagacccca 90360tctcaacaac agaaaggata attaggtatg
gtggcatgag aggatcactt gagcccagga 90420gttcgagtgt tatcaggcca ctgcactcta
gcctggacaa caaagcaaga ccgtgtctca 90480aaaaaataaa aataaaaagt atttgtatgt
ggtcatagtc aaaaaacgta catggaagga 90540aaatgtcttt atttatttat ttattttttt
ttttttaaga cagagtcttg ctctgtcacc 90600caggctgggg tacagtggtg taatctcagc
tcaccgcaat ctcggcctcc cgggttcaag 90660cgattcttct gcctcagcct tctaagtagc
tgggactaca ggtacccgcc accacaccct 90720gctaattctt gtgttttcag tagagacagg
gtttcaccat gttggcaagg ctggtctcga 90780actcctgacc ttaagtgagc cacccgcctt
ggcctcccaa agtcctggga ttacaggtgt 90840gagccactgc gcttggccag gaaatatcta
atttagtaag tatttatatc tgggaaagga 90900agggtcaggt ggtgattcat aggaactcta
aagtctatgt ataatactta gggggacaga 90960aggaaataaa gcaaaatgct gatatttgat
tgttgagttg tgtatatgtt agaagtataa 91020cataggagat ctgattgata gtaggagaat
gtttttaggt ggtaaaagtg gaaccgtggt 91080ggtttgtttt ggcagtagaa tcagttggtc
atagtttgta tgtggaaggt aataaacaga 91140ccatgttaag gatgacttcc ggaattttgg
tctgagtagt gggtggatga cagtgtcatt 91200catgagggaa gatgaagact gaggtaggaa
caggtttggg agaagatgac atgttccctt 91260ttagacaagt ggaattatgg aagatggcag
gtaggtggtt agctatatga atttgagata 91320aaagatttag gatggagata taaatttagg
agtaacagcg tatctatggt attgtaagcc 91380ttaagaatgg gtaggatcag ccaggaaata
cagatgtata tgcagaagag aggagtcaag 91440gaagccaaga caagttaatg tttaaagtga
gtgatgtagt ccatgggcag atgctgctga 91500gagggctgca aacaccagtg accctacaac
atttttaaat gtcgtcttcc tgacagcagt 91560gatcagtacc tgcaacgatc ttatttattt
ttttcatgtt agtctccaca cacttgaatg 91620tagacttttt gaaggcaaaa tcattgcctt
ttctgagctg ggagcatgtc tggcacatac 91680caagcactca acagttgatg tattgacttc
atccagatac tctgagggcg agttatttcc 91740tgctactagc ctttcacctt tcaatgttta
agagcacaaa tacagagatg ggcacgtttt 91800ggcatttctt attttgataa ccttttcctg
gtaagatttt ttaatgttga aaaaaaaaaa 91860caagaaaaga gggttaaaaa tagtcttatg
tcagatcctg tgatagaatt cacacttggc 91920ttaagctgct gggcaccttc ctatcttgga
tgtcatatta gcttatctac agcagaattt 91980ttactgtttt atgtagtaag gaagcaatta
tatgattatt ttacagacaa attattcttt 92040atcttttatt tttttagacg gagtctctct
ttgtctccca ggctggagta cagtgtcgcg 92100atctcggctc actgcaacct ccgcctcctg
ggttcaagca attctctgcc tcagcctccc 92160aagtagctgg gcttacaggt gtccgccacc
acacccagct cattgttttg tatttttagt 92220agagatgggg tttcaccatg ttggccaggc
tggtcttgag ctactgacct caggtgatcc 92280acccgccttg gcatcccaaa gtgctggaat
tacaggcgtg agccaccgtg cctggcccag 92340acaaattatt atactctgag tgttagaggc
ttaggatgtt ttcacttgat gctatgggag 92400gaataagtaa taagatatga tacacaacca
aagacctttc ttcactatgc ttctagtagc 92460tagtactatg gatgacacat ggtaataata
ttggttagca tttgtcctca atttactgtg 92520ctagttactc ttctaagccc cttacaggta
tatatttttt ttcatcaata atcctctaag 92580gtagttttta ttattgacct aattttataa
atcaagaaaa ttaagaccca gagaagtaag 92640taacttgtcc aagatcacat ggcttataag
tggtagagcc agaatttgac cccagatgtt 92700gtgactacat tgtctctcca taagcaggtt
caactctttt gactggatgc tgttccaagg 92760tcacttcctt agagaagcct ttgctgacaa
ctaccctcct gtgccctcct ccaaggctgt 92820ccattgttct agaactttga atactcatct
tagaataaag ctggtctaat ttttacagtg 92880ttatagaatg gatctctgac tgcaaaagtt
ggtcataatt atctttttat gttctagtga 92940aaggcaaaga acaagagaag acctcagatg
tgaagtccat taaaggtaag ttctgccctt 93000ggcagtccac tgcattaaaa agtgatgtgc
tttgcatttg tgagttcttt aatcctgtta 93060tactctctct tttggcatta atcatttctg
ccttatttta taattactta tgattttgat 93120ttatttccct ctttaacctg tataatgctt
taacatctag catataataa gtaggctttt 93180tttttttttt tttttttgga gacggagtct
tgctctgtta cccaggctgg agtgcagtgg 93240cgcgatcttg gctcactgca agctctgtct
cccgggttca caccattctc ctgcctcagc 93300ctccccagca gctgggacta caggtgcacg
gcgccacgcc tggctaattt tttgtatttt 93360ttagtagaga cagagtttca ccatgttagc
cagtatggtc tcgatctcct gaccttgtga 93420tccgcccgcc tcggcctccc aaagtgctgg
gattacaagc gtgagccacc gcacccggcc 93480gtaagtaggc tttttttacc ttaattttat
ttttttgaga tggagtcttg ctcttatccc 93540caggctggag tgcagtggtg ccatctcggc
tcactgcagc atccacctcc cgggttcaag 93600cgattctcct gcctcagcct cccgagtagc
tgggattaca ggtggccgcc accatgccca 93660gctaattttt gtatttttag tagagacagg
gtttcaccgt gttggccagg ccagtctcaa 93720actcctgacc tcaagtgatc cactcgcctt
ggcctcccaa agtcctggga ttacaggcgt 93780gagccaccat gcctggccat aagtaggctt
ttactgagcc ttgtgtgtat tggctatcct 93840agtgattaca gtgaaccagt gcccttctta
ttaatcacac atttaattgt tccctaaaag 93900tgattagttc actttattta tttagtaaga
caaaaaatga agaatactct taactgagca 93960gtctgttaac tgtaggaaag cactgacact
tataaggctt agttttctgt catttatcca 94020gaagtatggt tgattacagt ttttactttt
ttatttgaat gaacaacctt aatttaaaat 94080atattttgtt tattttttgt tgggatcgat
acattgtcct tgtttataga ttagagcatg 94140ctttttaaag atgctgtatt actcactgat
tttatttgtc cagtgtacag agattgaagt 94200gggaaaatta taatggaaat tgtttccata
gtcattacat attaatttca tcaatttatt 94260tccataaaat ctgtagattg ctacttattt
agatttttcc ttcaaatgtt tttatgttgt 94320attgcttgca ctgagtattt attctatatg
ctcaatttgc tggagaagaa gactaattat 94380aacttaggca agttgtaaaa ttagggaaaa
aagtaaggta ccttacagcc tagtttactt 94440atttcttatg taaagccagt tagattccac
attagttcaa actgccttct ttgagcaaaa 94500cttgattggc agtgataaag gcttaaagcc
cttctcaagc agagacctgt aaagactaga 94560tctgactgta gtagaaggaa ggaacttaga
tgtttcaggc agtgagaaca ccagtcttcc 94620actctaaact ttgccactaa cagtatgacc
ttgggaagtt gtaactttct tcagattctt 94680catttgttga atggggggat tggcctagct
aatttctaaa tctctactgg gctaaaaaat 94740tctgtgctta tactctgatt atgaagtaca
taatctgtgc ttaacattca ctgacttatc 94800cttaggataa tacagaagca gtacaagaaa
cagcccctca agatgtttgc agtctggtta 94860gaaagacaaa cttatacaca gaacagtagc
aaatagacca aaataataat agctgccatt 94920tatagaacac ttcttctgtt ctgggcatta
gacaaaaact gactataacg gtgaacaaaa 94980aagacttagg tcctgccctc attgaactta
cagattagta ggggagagga acattaatca 95040agtaattcca cagatggctt agcctagatt
ggtagtgatg gaagtaaaga gatgtgaacg 95100gacttgaaaa aaaattcgga ggcaaaatgg
atagaagttt attattgatt aaatatgagg 95160tgtgagagag agggatattt aagattgata
cctaccttct ggcttgccta acagaaccaa 95220aacaggaaat tatatgttca gttttgttat
gttgggtggg aggtgctttt gagtcattca 95280tttatatatg ttatatatgt tattttatat
gcatagtaat tttaaggtct gagttttaaa 95340ccaaaggtta gagagtgatt ttttagagtc
tagcaaacct aagttgaaat cctgcctgtt 95400gaaatggctg tttactagct cattaaccta
gggcaaagta ttcaacttgt tttcattttt 95460gtcttcatct ctaaaatgag gaaaatatgg
tcttacaaga ttgtcctgag agatagatga 95520aataatatcc aaaaaaaaaa aaggtacata
gagaaactcg tatagtgcct ggtatatagt 95580aggtcctcca ttggtagcta tcattatcta
gttttaacat agccttcagt ttgttgaatt 95640agtcaaactg agtgaagcac tgcaaggaat
tcagaggaat ttgagatcaa caaatgattt 95700ctgaagttta gggaagactt catggcaatg
acacttacct tgtataaaag ttgaagaata 95760agaaagattt gaatgagaga ttctttctct
tctccctacc agcccagctt cttatttgag 95820gatatattgg gcaaaggggc cttcagacaa
gtagagggag atttttacag aaagattgag 95880atgaaggtat agaaggctgt aaagaccaga
aaagagaatt gagacagagg aagcaggaag 95940ccactgtagg tttttgagca agatattgat
gctgtaagta tggtgtttat gaaaggttag 96000tctggaagag atttgcagga tggagacccc
ggaagttttt ttgttataat acagaaagac 96060ttgcactgag ggtgaggtgt taaaaataaa
caggtaagta aatgtttaaa catcttgaag 96120gaaaagtcaa caaatcttgg caagtaaaca
gataacagtg aaaaagaatg ggaccaagat 96180tttgagtttt ggagactggt ggattgaaca
gacagggaaa ttgagaggag aatcagatga 96240tgatgtttta agttgatatt tagacagatt
gtgcttgaga tggtaaagtc aatgtgggtg 96300ggaatgctta gtagcgagta atcagtgata
caagaccaaa gcccaggtca aagacaagtc 96360acagatacag atcagggctt tttcatctgc
tccacagagg tgtaccctag gagctgttgc 96420aaacagtcca tgtggagggt gtgagtaaga
tgtttccctt gaatttgcca gaattacttt 96480tttgttgttg ttgttgtttt ttctgagaca
gattctcgct ctgttgccca ggctggaggg 96540cagtggcgag atcgcgcagc tcactgcaac
ctctgcctct cgggttcgag tgattctcct 96600gcctcagcct cccaagtagc tgggattaca
ggcttgtgcc accaagccca gctaatttct 96660tttgtatttt tagtagagat ggggtttcac
catgttggcc agactggtct cgaactcctg 96720gcctcgtgat ctgcctgcct cagcctccaa
aagttctggg attacaggcg tgaaccactg 96780cacccggtcc cttgttaagt ttattttggt
gggaagcaaa ggaggtttca gcttttaaaa 96840agtttgaaaa ttattgctct ggtaataatt
aaagatttga gagtaaatat gctttctagc 96900agaaagaata aaagaagaac agatagcctc
aagaagggga gccaaagaag caggctatat 96960ctgacacact gggtgttgat aaatgggtat
taaaagaatg agagcaatga gcagatagaa 97020gaggaaatta ggagagtata ataccatgga
gaccaagaaa gatagactat caggaaggag 97080tggtaaaaat aagttactag ttctaagaga
gatgttaaga gggaccgggg aaagccttgt 97140acaaatgagt tagtagcatt ttacattata
tacatctaat taagaaacaa tgcgagagtc 97200tcaccattcc tatagactct tacttgtact
tgtctgaaca cgaaaactgg cttttgttta 97260taaataagct aaaaattatt ttgctccaat
ttctcatgaa aataaaaata aaccttcttt 97320taacattgaa aaaatagttt gaagacagtc
actcttcatt ttgtaattcc cacaactatt 97380attgaatgac tgaaattatc tttattctga
agccaaaggg gtgatactga tatttcttca 97440gactactaaa aatatatttt atgaattttt
agtgtgcttt atcttttttt gttttttttt 97500ttgagatgga gtttcactcc cgttgctcag
gctggagggc agtggtgcaa tctcagctca 97560ctgcaacctt cgcctcccag attcaagcaa
ttctcctgcc tcggtctccc aagtagctgg 97620gattacaggc acctgccccc acacccagct
aattttttgt atttttagta gagacagggt 97680ttcaccatgt tggtcaggct ggtcttgaac
tcctgacctc aggtgatcca cccaccttgg 97740cctcccaaag tactgcgatt gcaggcatga
gccaccatgc ctggcctgag gaatattttt 97800ctaggttccc cccaccccaa gcatttattc
tgcaatttta gttttgttcc taaagcaagc 97860aaggtttaag gatttaaaaa taatccgtat
tttagaatgc tttctggctt tgttactttt 97920tatccacagt agaagttctc agagaatgat
ctccctcttt taatttaact ttttggcaca 97980gtattttgag aattataaat aatattagaa
tgttttctgg ctgggtgtgg tggctcatgc 98040ctgtaatcct ggctacttgg gaggctgagg
caggagaatc acttgaacat gggaggcaga 98100ggttgcagtg agccgaggtc atgccactgc
actccagcct gggtgacaga gcaagactct 98160gtctgggaaa aaaaaaaaaa aaaaaaagag
tgttttcttt cctattttcc accacttgat 98220taagttactt ttcctcttaa gtattttttg
ctgagtatgc tgacttaaga gtaatgttac 98280aaaatttaat ttttaaagtt ctctgaaagc
ccctttatga gagttttagg ctatcaaatt 98340gtgtttaatt cttaacaatt ttttgaaaaa
ttatagcttc aatatccgta cattccccac 98400aaaaaagcac taaaaatcat gccttgctgg
aggctgcagg accaagtcat gttgcaatca 98460atgccatttc tgccaacatg gactcctttt
caagtagcag gacagccaca cttaagaagc 98520agccaagcca catggaggcc gctcattttg
gtgacctggg taagtaacta tcatttttta 98580ttaacttgta ttagaaggat ttgagtacaa
tatgtgaaac ttctgtcata ggatacagaa 98640ctatataatt ggaaagtgct ttggaaaaaa
tgtatttaaa ataacagcta caagtataat 98700gggtagctgt gttgtgttcc tgtaaatata
gaatataaag catgcccagt agaaaaacaa 98760gcatttccag aagaaatata tctgatcact
aaatataaat atatgaaaaa gatgtctcac 98820tttattactg agggaagtgc aaattaaaat
aatcagttaa tgttctccta acacattagc 98880atatttttta aagtttgaca atttgaatgt
cagtgaagat gcagggaaat acccctccta 98940tttagtgata atataatctg gtgaagactc
tttggaaagc aatttggaaa tcagtataaa 99000atatgcatgt catttaggcc actctttcta
agacctagcc ctcagatatg ctcattcata 99060tgtgcaggtg tgtatgtgtg tgtgtgtgtg
tgtgtgtgtg tgtatatgta tgtatgtatg 99120tatgtatgta tgtatgttga aggctattca
ttatagtatt gtttgtgata gcaaaaaatt 99180atggacaaca tataaatatc tgttataggg
aaataaccaa attgtggtat acgcatgctc 99240tggagtataa tatagccatt tgtttctatt
tatttatttt cttgagacag ggttttactc 99300tgttgcccag gctggagtgc agtggtatga
tcatggttca ctgcagcctt cacctcctgg 99360gcacaagcca ttctctcgcc tcagcctcca
gagttactag gactgcaggc atgtgtcacc 99420acacccagat aattttttaa ttttttgtag
agacagggtc tcactatgtt gcctaagctg 99480gtctcaaact cctggcctca agcaattctc
ccacacaggc ctcccaaagt gctgggatta 99540ccaacgtgaa ccaccacacc tggttcagtg
tagccattta gaaatctaaa aaagacgtgg 99600gaaaatgtct aaggcatgtt taaatgtgag
aaaagcaagt cacagtatgc atggtaaaat 99660ccgttatatt aaaataagtt cttccaaaac
aaaaacatat gcaggagacc tttattttgt 99720cagtatttct tacccaaatt tctgcactta
gaaaattgca tgtcatgttg tcataagttg 99780aaaaaaagat ccatgaacca atggacttct
aataaaatca gtcctgcttt tgacatctct 99840ctctactttt gtgtatattc aaaccagagt
gtcaatgtgt ttgtggggca cacttagcaa 99900taatacatag cagacaaaat gcatatagct
cagagagtaa aattgtaagt tttgctagat 99960cactcataaa ttgctgatga gaatttaaaa
tggtgcagat gctctggaaa acaggcagtt 100020tctttctttc tttttttttt tctttttgag
acagggtctc actctgttgc gcaggctgga 100080gtacagtggc gtgattacaa ctcactgcag
cctcaccctc ctcaggttca ggtgatcctc 100140cctcagtctc ctgagtagct gggactatag
gcatgcacca ccacgcctgg ctaatttttg 100200tatttttttt tttttttttt gtagagacgg
ggtttcgcca tgtttcccag gctggtctca 100260aactcctgga atcaagcgat ccacttgcgt
aggcctccca aagtgctggg attacgggcg 100320tgagctactg tgcctggcct aggcagtttg
tttgtttgtt tgtttgtttg tttatttatt 100380tgtagacgga gtctcacagg ctggagtgca
gtggcccaat ttttggctca ctgcaacctc 100440cgcctcccag gttcaagcta ttctcctgcc
tcagcctcct gagtagctgg gatgacaggt 100500gcctgccata atgcctggct gatttttgta
tatttagtag atatggggtt tcaccatgtt 100560ggtcaggctg gttttgaact cctgacctca
ggtgatcagc ccgcctcggc ctcccaaagt 100620gctgggatta caggcatgag ccgtcatccc
tggctggtgg tttcttatga cgtgaaacat 100680gcaattacca tatgacctag cagttgcact
ctgtatttat cccagataaa tgaaaactta 100740ccttccaata aaaacctgtg cacaaatgtt
catagcagct taatattgaa aaactggatg 100800ttcttcagca ggtgaatgaa ctggttcatt
cataccatgg aataccattc agcaataaaa 100860aggaacaaac tgttgataca tttaaccacc
tggatgaata tcaagggaat tatgctgtca 100920gacaaaaacc agtccctaaa gactacatat
agtatgattc cgtttggata atattcttga 100980aatagagaaa ttaagagaaa tgaaaagatt
agtgtttgcc agatgttaga gacagggagg 101040tgagaggggt aagtgggtgt agttataaaa
gtgcaacatg agggatcttt gtgatgttga 101100agttgtatct tggcagtgga tgcagaaatc
tcaatgtgat aaaattacaa agaactaaaa 101160acaagaatga gtatagataa aactggggaa
atctgaacaa gttagagtgt tgtatcactg 101220tcagtatctt agagtgatat tgtactatag
ctttgcaaga tgttaccatg ggagaaacta 101280aagtgtacaa gggatctcta ggtattatta
tttttttaga gatggggttt cactatgttc 101340cccaggccgg tcttgaactc ctgggctcta
gtgatccgcc tgccccagcc tcctaaagta 101400ctggaattac aggcgtgagc gaccatgcct
ggccctttca gtattgtatc ttagaacttc 101460atgtgaatct agcattatct catagaattt
aattaaaaga aattgtaaac ctcacagaag 101520atcagaattt cctcaagttt gtgatgttga
caaagatgaa ctagttgaca ctgacagtaa 101580gactgaggat gaagacacga cgtgcttcaa
aaaaatgatt tgaatatcaa tggattaaga 101640agaactcttt tgacaaattg atgaaaccct
cagtcagttt tataagaatg cccatcttta 101700tgatcatgct atgaaagcca atttttaaaa
aaattttttg tctttcctaa caattagctt 101760gtggttataa tttaaattta gttaaatata
agataaatga ttttttatta agtttagttt 101820catttttcaa ggtacgatct caaagctact
ctttaaccta ctatgaatga ataatgctga 101880gttcataaca tctttgtaga tatatccaca
attttccctc aggataagtg cctacaagtg 101940gaattactgg actgaaaata atgcagtttg
ctaagacttt gctatctgtt cctgaatgct 102000cctccaaaaa ggttttgcca gtttacatcc
tcatgaccag cgaatgagag tgttgcctat 102060tttcctgtgc ccttgttact gcttaataat
ttttgaaaaa aatctaattt gacagacaaa 102120aatgcatttt atgttaattt gcttttctgg
gatttttaat gaggttgagt atagttttta 102180atatttttat tggccccttt ggaactagta
tcataagttt tttttcttaa gaatttatgt 102240agtctgggct gggcgcagtg gctcacgcct
gcaatcccag cactttggga ggccgaggtg 102300ggtggattgc cgaaggtcag gagtttgaga
ccatcctgac caacatggtg aaaccgaatc 102360tctactaaaa gtacaaaaac tagctcagcg
tggtggcggg tgcctgtaat cccagctact 102420taggaggctg agtcaagaga atcgcttgaa
cccgggaggt ggaggttggt tgcattgagc 102480cgagatcgcg ccattgctct ccagcctagg
caacaagagt gaaaagtctc aaaaaaaaaa 102540aaaaaaaaaa aaaaaagaat ttacatggtc
tgaattgcca ttaaaagaga tatgagaatt 102600attgagtaac aaataacttt ttaataattt
aggcaagttt tggacgattg tactttgttt 102660agaaaccaaa agcatagtat ttgtagtttt
tttatttact ttagttgcta ggaagtaaac 102720tttattcaag gtctctggta ccagttgttg
ctaaaagtga ttgactaatc tgtcaatctg 102780aaattatttg ttgctgaact gctaattctt
ttgcttctat cttttaggca gatcttgtct 102840ggactaccag actcaagaga ccaaatcaag
cctttctaag acccttgaac aagtcttgca 102900cgacactatt gtcctccctt acttcattca
attcatggaa cttcggcgaa tggagcattt 102960ggtgaaattt tggttagagg ctgaaagttt
tcattcaaca acttggtcgc gaataagagc 103020acacagtcta aacacagtga agcagagctc
actggctgag cctgtctctc catctaaaaa 103080gcatgaaact acagcgtctt ttttaactga
ttctcttgat aagagattgg aggattctgg 103140ctcagcacag ttgtttatga ctcattcaga
aggaattgac ctgaataata gaactaacag 103200cactcagaat cacttgctgc tttcccagga
atgtgacagt gcccattctc tccgtcttga 103260aatggccaga gcaggaactc accaagtttc
catggaaacc caagaatctt cctctacact 103320tacagtagcc agtagaaata gtcccgcttc
tccactaaaa gaattgtcag gaaaactaat 103380gaaaagtgag tatgtgattt tcttgtgtgt
acatatgtgt ctcactttct ttttttaatt 103440tactaagcag aacttcagat gaggaataaa
atgattggaa tatttttttt ctcctctaac 103500tacttgtaaa tttgggagaa tttggagagt
gtagtagagt cagatcagtg tatggaaaag 103560gagcaggagt gactggacct tctaagaagt
gtgttatcag aattagtaaa tgaagggtca 103620aatgtcctac ttttcccctc cactgatttt
gacatcaaac cattatccac atagccttat 103680ttcctccctc ggtcttaatt ttattaatat
tttactgcac tttgcagata aaatttttaa 103740aaaattttta aaaattgcca ataagtgaca
tttattaagt tcagtgctta gtgtatattt 103800ggattttatt tattagtcac aagacctttg
tgcaggtagt aggcatgatt atcttttttt 103860ttttgagatg gagtcttgct ctgtcgccca
ggctggagtg caatggcgcg gtctcggctc 103920actgcaacct ccgggttcat gccattctcc
tgcctcagcc tcccaaatag ctgggactac 103980aggcgcctgc caccacaccc ggctaatttt
tttgtatttt tagtagagac ggggtttcac 104040catgttcgcc aggatggtct cgatctcctg
actttgtgat ccgcctgcct cggcctccca 104100aagtgctggg attacaggca tgagccaccg
cgcccggact gattatctta tttacacatg 104160agaaaaccag ggcttagaaa ggttaggtaa
cttcctctag gttgtacagt aaatgtggac 104220ctagaagcat tttgacaaga gcacctgttt
ttttttcttc tctattagtt tagaaattat 104280atactcttaa ttatcacctg ggattttgat
tagacagcct tcatgttctt tttcatctta 104340aatgttcttt gtgtcttaaa gggctaagtg
atttcttcag atcttttagt tcactcattc 104400tcagtgaact aaaatgaggt ctaatctgct
actgaatcaa gttttcagca tgttatttcc 104460ttcctccctc cctccctcct tccttccctc
aaccaggctc ccgaggagct gggattacag 104520gcgcccgcca ccactcctgg ctaattttta
tattttagta gagacggggt ttcaccatgt 104580tggtcaggct gatcttgaac tcctgacctc
aagtgaccca cctgcctcgg cctcccaaag 104640tgctgggatt acaggcatga atcaccacac
ctgacggcat gttattttca tcgcaaagtt 104700actgtaagct gggagaagtg gcacacactt
gtactcccag ctactcagga agcttaaggt 104760gagaagattg cttgagccca ggagttttga
gaccaacctg ggcaacacag caagacccca 104820gctcaaacaa agaaaaaaag ttattgaatt
ttttatttct atggatcatt ttttgtagtt 104880tcttattcct ttcacccttc attcccactt
ttgatcccat cttttattta tttagtttta 104940ttaaatgtat atttgtctga taattctgct
atctacagtt ttttgtggac ctgactcagc 105000atttctttgt ttcttcggat tcagactgtt
ggtggcttgt gattttagtg atttttggcc 105060gtgaacatgt ttcttggact tttgtctgtg
ggaattctct gtgtactctg tataaattaa 105120gttacttcag gtgttttgca ttttcttttg
ccatgcacct ggggcctggg tcactaccct 105180tctggtacca cttaaaactg aatttttgtc
ttgggtgctc gtactgatcc tgtatgagta 105240caggtttata cttactgtag aaatatggtg
tttgattatg gggtattgtc ccagatggtg 105300ctggagtatt aatatgctct ctgttaaact
taatgtgttg tccctgtaaa actccaaaat 105360tctgaattcc agaatactac tggccccaaa
tgtttaagat aagggcactg cctgtatttg 105420tttctgcctc ccactatttt ccttagttta
acacaaactc acctttttaa aaaacatttt 105480gagagaattc agtattggga agagtttcta
acctgtttct ggaaatggaa gtccaaagtc 105540tgtttctgta attgtttttt ttttgagatg
gagtctcact ctgtcaccca ggctggagtg 105600caatgacgta ctctcagctc actgcaacct
ccacctcccg ggttcaagcg attctcttgc 105660ctcagccccc tgagtagctg ggattacagg
tgcccaccac catgcctggc tgatttttgt 105720atttttagaa gagatggggt ttcgccatgt
tggccaggct ggtcttgaac tcctgacttt 105780gtgatctgcc cacctcagcc tcccaaagtg
ctaggattat gtttctgtaa ttgtaataca 105840tttattgttt ttagaaactg tctttgcttt
agtggtaatt ttcaataaaa atagaaatag 105900cagtggagtt attaaaagag cattagttac
atttttccct ttttcattat cttcaaatat 105960tatatatagt aagtttgacc tttttaaaat
gtatacttgt atcagtttta acacatacat 106020agattcctgt aactgtcacc actataaggg
taaagaacag ttagttcctt cacctttgaa 106080gtcaagcccc acctctatcc caacacttgg
caaccgctga tctttctccg tctcaatagc 106140tttgcctttt ctcttttttt ttcttatttt
tttttttgag acagcgtctt gctctgtcgc 106200ccgagctgga gtgcagtgag gcaatctcgg
ctcactgcaa cctccgcctc ctgggttcaa 106260gcagttctcc tgccttagcc tccctagtag
ctgggattat aggcacgcac caccacaccc 106320ggctgatttt tttgtatttt tagtagaaat
ggggtttcac catgttggcc aggctggtct 106380caaactcttg acctcaagtg atccacctgc
ctcggcctcc caaagtgctg ggattacagg 106440cgtgagccac tgtgcccaat caggactttt
tttttttaaa tttacattca acttgtcatt 106500tttttcttgt atggattgtg ccttcagagt
cacacctaag agccctttgc ctaagcaaag 106560gtcatgaaga ttttctcata tgtttccttt
taaaagtatt gtggttggcc aggtgccatg 106620gcttatgcct gtaatctcag cactttgaga
agctgaggtg ggcagattac gaggtcagga 106680gatcgagacc atcctggcta atgcggtgaa
accccatctc tactaaaaat acaaaaaaaa 106740aaaaaaatta gccgggcgtg gtggcgggca
cctgtagtcc cagctacttg agaggttgag 106800gcaggagaat agtgtgaacc cgggaggtgg
agcttgcagt gagccgagat cgcgccactg 106860cactccagcc tgggcaacac agtgagactc
catctcaaaa aaaaaaaaaa agtattatgg 106920ttttacactt tacgtttaga tatatatctt
ttttgagtta atgtcgtata agtatgaggg 106980ttacgtcaga ttttttgttt tttgtttatt
tttacatatg gatgtctagt tgttctaata 107040ccatttgttg aaaagacaac ctttactcca
ttgaattgcc tttgtacttt tgccatattt 107100gtctaggcct gtttttggac tcctttttct
gtttcatgat gtgtgtgtct attcctttgt 107160taataccaca tggtcttaat tactgtatag
taagtcttaa aattgggtaa tgctggcctt 107220ataaaacgaa ttgggaagtt tttattttta
ctcttatttc cattttctag aagagattgt 107280gtagaattgg tgtcatttct tctttagata
tttggttgaa ttgggaagtg atgccatctg 107340ggcctagggt tttgtttttt gtgtgtgaga
cagagtctca cttctgtcac ccaggttgga 107400gtgcagtggt gagatcttgg cttactgcaa
cctctgcctc ccaggttcaa gttatcctcc 107460tgcctcagcc tcccaaatag ctgggattac
aagcgtgtgc caccatgccc gactaatttt 107520tgtattttta atgcagacag ggtttcacca
tgttagccaa gctggtctcg aacttgtgac 107580ctcaagtgat tagcccacct tggcctccca
aagtgttagg attatagatg tgagccaccg 107640tgcctggcag gggcctaggg ttttcttttt
cagagtattt taaactatga attcagatta 107700tttaatagat ataggactat ttaagttatc
tgtttcttct tgagtgaatt tttactgtag 107760tttatggcct ttgagtaatt aattgtattg
aattgtcaaa tttatgagcg tgtaattatt 107820tatagcattt cgggtttgta gtggtatccc
tcttttattc ctggtgttgg caattgtgtc 107880ttgtttttct ttgtcagatt gtatagggat
ttattagtct tttcaaagaa ctagcttttg 107940ttttgatttt tctgttgttt tgttttcaat
tttattgatt ttctgctctt tattatttct 108000tttctattat ttctgcttgc tttgggttta
ttttactctt ttttttttct ccaagttgct 108060taaagtagaa acttagattt ctggtttgag
acctttcttt tctaagataa gcatttaata 108120ctgtaaattt ccttctaacc actgctttag
ttacaccccc acaaattctg gtattttgaa 108180ctgagcacaa atgaaatgtt ctaatttccc
ttgaatctta ttcttttacc aatgaattat 108240ttagaaatat gttatttagt ttgcaagcaa
ttggagactt ttttcctgtt atttttctac 108300catttatttc tcatttcatt atattatggt
cagagaatat attttgaatg atttcattta 108360ttaattttta aaaataacat taaaaaattt
tttaaaatgt gaatatacca catacagtat 108420aaagattgta cattctgttt ttggacagtt
ttctataaat gtcaagttga tttagttggt 108480taatgatggt gttcagtttt tctttattct
tgctgatact ttgtatgcag ttatatcact 108540ttattactca gaagagtgtt gaactttcca
actacaattt ttttttccaa ttttactttc 108600agctctatct ggttttgctt catgtatttt
gaggctctgt tgttaggtgt gtacacattc 108660aggatgatat cttctgggtg aattgcctgt
tttatcatta tgtaattccc tctttatggt 108720aattttcctt gttctaagat cagaaatatc
tgttgtccaa tttatataga cactgcagct 108780ttcatttgat tagtgcttgc atggcatatc
tttttccatt tttttacttt tgatctacct 108840ttataattct atttaaaggg ggcttcttgt
aggcagcata tagttgggta gtgttattta 108900tttatttatt tatttattta tttatttatt
tattgagaca gagttttgct cttgttgccc 108960aagctggagt gcagtggtgc aatcctggct
taccacaacc tccacctcct gggttgcagt 109020gattctcctg cctcagcctc ccaagtagct
gggattacag gcacgcgcac catgcctggc 109080tgattttttg tatttttagt agaaacggat
tttcaccatg ttagccaggc tcgtcttgaa 109140ctcctgacct caggtgatcc acctgctttg
gcctcccaaa gtgctgggat tacaggcgtg 109200agccactgca cccggctgag tcatgttatt
tttaatcttt tctcacaata cagggttttt 109260gttggtaaat ttaattattt taatataaat
tttagtataa ttatttacat taaatgtaac 109320tgttgcactg gggtatttat aatgtgtaaa
tataattatt ggtattaata taattatatt 109380actcataata atattaatat ctttggattt
agattaccag tttagtatat gtttttctgt 109440ttctccctct ttgatttccc cttttttgct
tttttttttt ttttaattct tatttttttt 109500tagtatttgt tgatcattct tgggtgtttc
ttggagaggg ggatttggca gggtcatagg 109560acaatagttg agggaaggtc agcagataaa
catgtgaaca aggtctctgg ttttcctaga 109620cagaggaccc tgcggccttc tgcagtgttt
gtgtccctgg gtacttgaga ttagggagtg 109680gtgatgactc ttaacgagca tgctgccttc
aagcatctgt ttaacaaagc acatcttgca 109740ccacccttaa tccatttaac cctgagtggt
aatagcacat gtttcagaga gcagggggtt 109800gggggtaagg ttatagatta acagcatccc
aaggcagaag aatttttctt agtacagaac 109860aaaatggagt ctcccatgtc tacttctttc
tacacagaca cagtaacaat ctgatctctc 109920tttcttttcc ccacatttcc cccttttcta
ttcgacaaaa ctgccatcgt catcatggcc 109980cgttctcaat gagctgttgg gtacacctcc
cagacggggt ggcagctggg cagaggggct 110040cctcacttcc cagatggggc agccgggcag
aggcgccccc cacctcccag acggggcagt 110100ggccgggcgg aggcgccccc cacctccctc
ccggatgggg cggctggccg ggcgggggct 110160gaccccccac ctccctcccg gacggggcgg
ctggccgggc gggggctgac cccccacctc 110220cctcccagat ggggcggctg gccgggcggg
ggctgccccc cacctccctc ccggacgggg 110280cggctgccgg gctgaggggc tcctcacttc
gcagaccggg cggctgccgg gcggaggggc 110340tcctcacttc tcagacgggg cggccgggca
gagacgctcc tcacctccca gatggggtgg 110400cggtcgggca gagacactcc tcagttccca
gacggggtcg cggccgggca gaggcgctcc 110460tcccatccca gacggggcgg cggggcagag
gtggtcccca catctcagac gatgggctgc 110520cgggcagaga cactcctcac ttcctagacg
ggatggcagc cgggaagagg tgctcctcac 110580ttcccagacg gggcggccgg tcagaggggc
tcctcacatc ccagacgatg ggcggctagg 110640cagagacgct cctcacttcc cggacggggt
ggcggccggg cagaggctgc aatctcggca 110700ctttgggagg ccaaggcagg cggctgggaa
gtggaggttg tagggagctg agatcacgcc 110760actgcactcc agcctgggca acattgagca
ttgagtgagc gagactccgt ctgcaatcct 110820ggcacctcgg gaggccgagg caggcagatc
actcgcggtc aggagctgga gaccagcccg 110880gccaacacag cgaaaccccg tctccaccaa
aaaatgcaaa aaccagtcag gtgtggcggc 110940gtgcgcctgc aatcccaggc actctgcagg
ctgaggcagg agaatcaggc agggaggttg 111000cagtgagccg agatggcggc agtacagtcc
agcctcggct ttcacaactt tggtggcatc 111060agagggagac cggggagagg gagagggaga
cgagggagag cccctttttt gctttctttt 111120ggattatttg aatttttcct taaatttatt
tatcttactt atttatttat ttttttgagt 111180gattctcctg ccacagctcc caagtagctg
ggactgcagg catgtgccac tacacccagc 111240taattttttt gtatttttag tagagacagg
gtttcaccat attggccagg ctggtcttga 111300actcttgacc tcaagtgatc cacctgcctc
ggcctcccaa agtgctggga ttacaggcgt 111360gagccaccat gccctgcctt tttctagaat
ttatatattg agttcttgat tgtatctttt 111420tatgtaggct ttttagtggc ttctctagga
attacaatat acatactttt cacagtgtac 111480tcacatttaa tattttgtaa cttcaagtgg
aatgtagaaa acttaaccac cataaaaata 111540gaactaggga tgaggttaaa aaagagagag
aaaagaaatg taataaagat ttaataacac 111600cgtttttttt tttttttctc tttttttttt
gagacagagt ctctctttct gttaccaggc 111660tggagtgcag tggcgtgatc ttggctcact
gcaacctccg cctcctgggt tcaagtgttt 111720ctcctgcctc agcctactga gtagctggga
ttacaggtgc gcgccaccat gcccagctaa 111780tttttgtatt tttagtagag acggtttcac
tgtgttggcc aggatggtct cgatttcttg 111840accttgtgat tcgctctcct cagcctccca
aagtgctggg attacaggcg tgagccaccg 111900cgcccggcta agtctttaaa tatttttttg
acattgcact ttttctcttt tccttctagg 111960attttagtaa cccaaatgtt agttttgtta
ttgtttggca ggttcctgag gctttcctta 112020cttctttaaa tttttttttc ctgttgttca
gcttcgaaaa tttctattca tctgtcttca 112080aattcactgg ttctttcccg ttatttccat
tctgttattg agtctttgta gtgaatttta 112140aattttgttt attatgtttt ttagttctaa
aattttcttt ttttgtgtat gtcttatact 112200ttgctcctga aactcttatt tgtttcagga
gtgatcttat ttcttagagc atggttttag 112260tagctactta aaatttgttt tatcatccca
gcatatgtgt cctcttgatt gtcttttctc 112320ttgtgagata atgggatttt ctggttcttt
atatgacaat taattttgga ttgtatcttg 112380gacagtttga cttacgttac atgattctga
atcttgttta aatcctgtgg aaaatattga 112440agtttttgct ttaacaagca gttgacctag
ttaggttcag tccacaaatt ctaagcagca 112500ttctgtcggc tctggttcca tcatcagttc
agttttgtat cttatctgct tatgtgcctt 112560tctgtgtcca gtctgggacc tggccaatgg
tcaggtccca aagcctttgt acacttttag 112620aagcagggcc atgcacaccc agctcacgag
tggccccggg agtgcacata caactcgacg 112680ttttcatggg ctccttcttt tctgtgatgt
ccctgacacg ttctgccttc taagaacctc 112740cctttatccc tttcctgttg tctggctaga
aagtcagggc tttagattcc ctatacttca 112800gcacacttcc tgtagctatg tcaacctctg
tggccacgac ttcttcttct tgggactgca 112860gtttctcttg tcagaaagta ggattcttgg
agctgctgtc attgctgctg tggctgctct 112920gatgctgcct gggagtcgaa ggagagaaag
gaacaaaaca aaacaaccca ggggatttcc 112980tccactctct ttgatccgtg agagccccct
ttcctgttcc tcagaccaga aatagagggc 113040ctgtcttgga acttcttctt tgtgcatctg
gtgtgcagtt tcagcttttg agtccaggcc 113100aggaggtgct ggacaaactt gtcaggagta
cggaggtact gcaagttctg attacttttc 113160tcagtccacc tgcttccaag tccttggatg
catttgtcca ttgttttgag ttgcattcca 113220tgggagagac agaagagtgt gcttatttca
tcttgacata cttattagga tttcatatca 113280aatcaacgga tgatattctc tatattaatt
tgctgttttc cctttagcaa gcacattagg 113340aaaataacac tttaacaccc gcctttggtg
gtttctgtca taattattaa tacttgactt 113400tttttttttt tttgagacgg agtctcactc
tgtcctttga ggcattgtcc ccataaactt 113460ttggtaaagc atcaataatt ttatctttca
tccacacaag cttcaccata aatttgatgt 113520ttattcttcc attttagcag aattcatgtt
gctccaatag gggctgtctt caaactgatg 113580ttttctcctt cttagtgcct cagagtagat
cctgttcaga tacgttataa caggttaata 113640tgagtttatt ttggtgtaaa agtactttga
aattcatgca tagttttttc atcatatgca 113700ttttccatag ctttgaacac ccccatgtaa
ctctcctctt ccacaaacca aacaatgaaa 113760aagcaccttt gtgatggaag tttattttgc
aataggaact cacagtgatc taagccctgc 113820tattcatgaa tataattcat tactggagtc
caagttgctt tttggttttt gaagttctct 113880tcttcccttg caggtataga acaagatgca
gtgaatactt ttaccaaata tatatctcca 113940gatgctgcta aaccaatacc aattacagaa
gcaatgagaa atgacatcat aggtaagcag 114000tgcttgaaac tatggcaaaa aaaaaatgac
aaaaaatgca cagaactgac aattttcgtt 114060attgactaag ataatttttt cttaacatgg
aatttagcag ttcccttcct aatttgtttt 114120ctgagtattt tttatatcgg attatagctc
actttaaaag tttctcggct gcattcggtg 114180cgagggtctt tgcctgggcc agatgggctg
cagtgtagcg ggtgctcagg cctgcccgct 114240gctgagcagc cgggccggcg ggcggctacg
ctaaccggca cagaccaccg gatggactgg 114300ccggcagccc cgcaccagtg cacgaagtgg
gcgggacaga aacttctggg gttggaagtc 114360cagtgaggct aaaagccggt accaaagtct
ctaggcatca gggctgcagc ccaagagtct 114420cacgaccagt gggcaactgg atggccagac
aggtgtctca gtggtggcct ctccgtctca 114480gggcttcatc ccacttctca gtgggcctga
cgtccctggg caccctggat gtctacctgc 114540attagccaga gccatcacat ggcctgtgac
ttgccttttt ttgccagttg attgtgccac 114600acacagtgtc atttctgtgt catttggcac
agctggaggt gcaaggagga gggcagcctc 114660atgtccagtc ccagtttcac gtaactttat
tcttctgaat aaagacaatt tgctaacctt 114720aaaaaaaaaa aaaaaaaaaa agtttttctt
atatgttgga cccaaattct taggctttaa 114780cctgaataac aatgacagca agatcaataa
atagtacaca tttattaaac actcactgtg 114840tcccagacaa tattccaagc actttttatg
gatagactca ttttaacttc taaagaactt 114900tgtgggataa atacagttat tttatagatg
aagaaactga agcacagaga agttaagtgc 114960tttgtccagg gtaacagctc agatatggca
gagtcaggat ttgaaactag accctcacat 115020accttaactg ctgtgctgtg gcagtgtttt
tcatactgta ggttgggacc agccttctct 115080tatgccctca ccccctgcca aaaaaaaaaa
aaaaaaaaaa aaatatatat atatatatat 115140atatatatat atatatatat aatatatata
tatataaaat atatatatat ataaaatata 115200tgtattagta tatatgcata tatagtatat
attatatatt agtatatata ctaatatata 115260atatacatat tagtgtgtgt atatatatat
atactagaat aaaaaaatca aagtatctca 115320gagtagtaag gacaaacatt tcagaaaaat
gttttcatta tatatacatg tatgtatgtg 115380tatgctgatt caacaaatat atttcttata
ggttatagca aaatagtttg aaagctttta 115440ctgtgtttta tcaggaagac cttaggtgaa
cgtatattca cagataaaag aggttattta 115500ttcattcaat aaatattaca ttctcataag
tcctaatatt atgtattttt attcttcaaa 115560aaagttagta tttgtgattt atgaaataag
acatgttctt gcacttttag cagatctgtc 115620ccgatgttgg gcttctttaa tccttagtgt
gggtgctttg cactcactca ctgctgggga 115680cagcaagacc cctgttagtc tcagctgtgt
ttcttaaatt ggcccactgt accttccagt 115740tagctattct ggggtccatg tcatgttggc
tccattttcc ttttctttct cccacacaga 115800tacctataac ggctataaca taggcctggt
ggctgttggt ggcttatccc tatctgcttg 115860tatttaaggg gtactgtttc actgagtttt
gctgacagat gttgtcatga gatttgaggt 115920tttctgtgtt gttgctctat ttttatgtgg
gaatttgcta ctatcatcat ccctagacca 115980gcttttccta gtaatacaac agggatgttc
tgactgatta gagtttgcct gtttgaagaa 116040ttggttggct agtgattttt ttttgagggg
agtctgtacc agttaatagc ctgactggcg 116100tgtggataaa aaggaagcag tttcaagtca
aataaaacac ttaaaatgaa accacactgc 116160aactctcttt cttttactta agcttaatca
aattaatgat gatgtaatcc catgaaggaa 116220aagtcttctg aaggatcaag ttgataacat
tttgtgatca aagaatttga gaaaacctct 116280atcccagtgt ctatcattat atattttagg
atgttaatta cctgtgtggc tttaggcaag 116340tcatttttcc tccttgagcc ccattcttaa
tcctgtccaa attatttgtc tcctcttgca 116400gttggactat tttaatatag ctgtccttca
agtgagtttt gttcaaagga gccttcactt 116460tagctcttac tgtgtaccca ctttgcatag
tcttgtttta aatgtaatcc ttggattttt 116520ggtgttgcta actaattact gtttttatgt
gaggatttag agtgatccag aatctatact 116580tgcactacct ccttcatctt ccacaaatgt
ttgaagtggt agaattttta aaaactttga 116640aggtacagct gacagaattt gctgatggtt
tggaagtgag tggtatgaga gggaaaaaaa 116700ggaataaagc atgactgcat tttttgtttg
tttgtttgtt tgtttttgag acggagtctc 116760actctcgcca ggctggagtg cagtggcgtg
atcttggctc acggcaacct ccgcctcctg 116820ggttcaagcg attcccctgc ctcagcctcc
caagtagctg ggactacagg cgctcgccac 116880cacgcctggc taattttttt ttttgtattt
tagtagaaac ggggtttcac cgtgttggcc 116940aggatggtct ccatctcctg acctcatgat
ctactcacct tggcctccca aagtgctgag 117000gttacaggca tatatataag catataaagt
gtgttatagc atacaaacag gtatatatat 117060aaacatgcag tccacacagc tgataggaat
gaggcagtag tgaaggagaa gttgatgtag 117120gagaggggac agttgttaca ggaaagaagt
ctggaggcag aagggatgaa ttccagtgct 117180cacatagaag attgcttaga tgggagcaag
gacaatttat ctagagtcac aggaaagaat 117240gcagtacacg ggtagagatg caggtgagtt
gaaagatgtg agagatgatg gaaataattt 117300tctgattgct tctatattct caaggaagca
ggaagcaaag tcctcagcaa agagaataga 117360agaggtgtta aatatttgag aaaggagatg
tactgtagaa aaaaaaaaaa ctcagtttct 117420ccttctgaac tctcacaaaa cagaaccctt
ccatgactct agttgtgtgg ggttttttcc 117480ctgtcagcta ccaattctgc agatgattgt
tcagtgaaca ccaactgggt gtcctctaag 117540tcagttcagt tctcacactg tttacctgga
gatagcatca gatcccacag attgaggact 117600ctgtcccaca agactgcctc cacttcagat
gccagtctca agtacaagtt gtggcctgtg 117660cttctgactg accttctata aattggagtt
cccacagtcc cctccttggg ttcaataaat 117720ttgctagagc agctctcaga actcagggaa
atgctttaca tatatttacc catttattat 117780aaaggatatt acaaaggata cagattgaac
aggcagatgg aagagatgca tgggcaaggt 117840atgggagagg ggcacagagc ttccatgcac
tctccaggtc atgccaccct ccaagaacct 117900ctacagattt agctattcag aagcccccct
ccccattctg tccttttggg ttttttgtgg 117960agacttcatt atataggcat gattgatcat
tggctattgg tgatcagctc aaccttcagc 118020cccctcatcc cgggaggttg gtgggtaggg
ctgaaagtcc caaacgtgta attctgcctt 118080ggtctttctg gtgattagcc ctcatcctaa
agctctttag aggccacagc cacaagtcat 118140ctcattagcc ttcaaaagaa tccagagatt
ccatgaattt taggcgctgt atgctaagaa 118200actggctaaa ggccagttgc aatgtctcag
gcctgtaatc ccagcacttt gggaggctga 118260ggcaggagga tcgtttcagg ccatgagatc
aaaaccagcc tggtcaacat agtgagaccc 118320ccttacaaaa aatttaaaaa ttggccaggc
gtaatagctc ttgtctgtag tctcagctac 118380tcagaaggct gaggatcact gagccctgga
gttgaaggca gcagtgagcc atgatcgtgc 118440cactgactcc ggcttgggtg acaaagtgag
accttgtctc agaagaaaaa ggaaaaaaaa 118500aaaactgggc aaagactaaa taacatattt
cacagtatca cagatttgta ttgtctagga 118560aagtgaatgt aaacagacca ggacactagt
atgatccctt ggtttcatga aggtcccact 118620aaagtcatga acacaaagtg agactaggca
tcatgttata tggtttttcc agccatgttt 118680aacagctagc taaatagcta attgtttcgc
tgcagtttat tttagcagtt ccttatttta 118740gcacatttca tgttttaaaa tttctaccaa
taacatttta ataaactttt ttacagataa 118800cttcacaaat ccataatttt ttaagttaca
atcccagaaa tagaattgct cattgaaagg 118860gtatgttcat ttttaaagtt atgctagaaa
ctgccaaatt gccttcagaa aaaggtgttt 118920gtatccccac taacactagt gttagttttc
ttgtgccctt gctcaagtat acatattatt 118980aaaaacaatg ttgggccagt ttactagata
aaaggtgtag tgcctcctta ttctaatcta 119040tttgattact agtgagtatg tatgtctttt
cacgttggtc attttatgtt tgttcctttg 119100tggattgtca tgtcctttgc tcatttttct
tttggaacat ttcttagtag tttataagag 119160ctcttggtat tttaatgata gtaacctttt
aactgtcatg catgctgcaa atcttttttc 119220tgtttgtttg cctttgtatt ttgtttttgg
agggtttcta tgtataggaa ttaaatttta 119280tgttgttaaa tcttttgatt tctgcttttg
catatgtact tcaaaagact ttctatttta 119340agatcaagtg ttacctgtat tttcttttag
ttctatttaa aacctcttaa tttatatgcc 119400tgtgctgtta actcccaagt tgattcacaa
gtgtgtatac atagtttgaa tttagtggca 119460atttaattat ttacaacttc ttttgcagca
aggatttgtg gagaagatgg acaggtggat 119520cccaactgtt tcgttttggc acagtccata
gtctttagtg caatggagca agagtaagtt 119580agttcatatt ttcacattgt gcatcctagg
gaatttgggt tcattgttag gaatgggctt 119640cactcagcta aaaacaaagt atttttgaga
atttaaatat tttggatatt tacaagatca 119700tataaagcat actctatctt ggttaacagt
ttcttttaaa tataaattat gtgaactctt 119760aaaattttca ttttcatttt caatgttaat
atttcctaag ttaaaataat ttgtttttag 119820ttctgaaata atttggggag tgattgagtc
tgtagtgatt atgactatta gaattggttt 119880atttatttaa ataatgcatg tcttcagatg
gctctcctaa tttgttagtt aggctttaag 119940ctaaatggat gctatataac taaatccaca
tagatttgtt gaaatggctc cagaggtttt 120000ttagatttat tactgctatg tgcccttaaa
aaaaatctat tcattctttc acttaacatt 120060tatcagaaga gtgctctgtg taagacgtgg
ttaggcatag tgccagtctt gaaggaagtt 120120acagcctaat aaaagacata gggcatgttg
tttggttact gtaatatgaa gtggcatgtg 120180ttaaatgtca ggggagaact acaaagtcat
aaaaaggtgg gagagattac atacaggtaa 120240aggaatcagg aatgacacca tggggagtaa
ggtagtgttg acctaggcct ttaagataca 120300atagggacag tatggaaaga gtatattttt
cccacttaaa ctctttcctt ggtcgttccc 120360tcaaattttc ccttttgtcc atgtgcaggc
actttagtga gtttctgcga agtcaccatt 120420tctgtaaata ccagattgaa gtgctgacca
gtggaactgt ttacctggct gacattctct 120480tctgtgagtc agccctcttt tatttctctg
aggtaaagtc tgcatttctt ttcacactct 120540attcgagcat tccagcctct aactatcaat
gctggggccc tgtctatagg aaataacaca 120600gaagagccaa gtcatttcca aaaagatgta
tcattgtttc aagttgtttc tgatggcaag 120660agtaatttaa taatatatta gagagaacat
gaaaattcaa tgtattaaat aactctaatt 120720ttgagaaacc taattaaact actgcatgta
agagagtgca tgtttttaat tatttggagc 120780tattttaaaa ccacagaatt tgaaacttgc
ttccagtgca taaattgcag accagacttc 120840agaagagaaa aaaagtagta aattttttct
tatgctcatc atttttactt tagtcacttg 120900ataggattgc ccagtgaaga agcatttgca
acagacaatg agtatattaa tctttttgag 120960gcatacagtt tagtataatg ctctttgtta
ggcttcaaca agtgaaatta ttttgttgga 121020aagcaaatga ctattaagta gaaagaggat
tcccagtctc acaaagcagt aatttagaca 121080ctcgattctg cctctttaca agaatacagg
tactcagttg atttgttttc tcactccctt 121140tctttgctat aagtttaaat caacaatttg
tttaggttaa tatgtcctca tggaatggtg 121200gaaatgatca gatataaaat atttggtttg
gttagtttac tctttatatg tttgctggca 121260aggaaccaca aatccagttt agtataattt
ttactctagt tcactaaaag tttgcatcca 121320gctgtgtagg tagtgtttgt ttcttgttaa
cttttttttc gtctaaaaga atactttaaa 121380acttttcaat ctcaaatgac tgtaacttgc
tgacaggtgt taacagaaga agtagatctt 121440tttgtttttt gcttatgacc tgtattttaa
tatttgagct tatagattag agattgtgag 121500agaaatctgt ttatagtctt attttccctt
gtgtattttt tcttcctagt acatggaaaa 121560agaggatgca gtgaatatct tacaattctg
gttggcagca gataacttcc agtctcagct 121620tgctgccaaa aagggccaat atgatggaca
ggaggcacag aatgatgcca tgattttata 121680tgacaagtga gttatattga tagatggatt
cagcagatac ttattgaaca tttgatatgt 121740tttgtggaaa taaagatgaa taaactcagt
ctctgttgtc aaggagctca caggaggcag 121800cataaaagct gcttttatat ggtgtttgta
aagctttggg ggttcttaga acaaaagttt 121860ctgctgggaa aggggaggtg tatgtggggt
aaacaggatg gcaatggtgg tgttcaagga 121920gtgtttccca gaagagagat tttgtttgga
tcccaaagaa agaagggaat tttgctaccc 121980agagaaggca gaaaacaaca ttctaggcaa
aggcattggc ccagaagcca tggaaacgta 122040ggggaaagtg gcactttcaa gaaacttgag
tttagataat caaaggagtg gggaataaat 122100atgaggatgc tggtactaat tggaatagat
tgtaagggac cttgaatgcc tatttatggg 122160tatattatac tttctgtata aatctgctca
ggcacgttgt taattagttt tttattagtt 122220ttcactgaaa atgagaggat ggaaacatca
tacagtaaac aaaattgaaa atatctggtc 122280aggcagatga tgagcttgtg gccagctctg
taacgtatgg tattcttttc atttaacttt 122340tcttactctg taaaaaaagt aattcgtggt
cgggcacggt ggctcactcc tgtaatcaca 122400acactttgag aggcagaggc aggtgaatcg
cttgagccca ggaatttgag accagcctgg 122460gcaacatggc aaaacccgcc tttactaaaa
atacaaaaat tagctgagcg tgatggcgtg 122520cgcctgttgt cctagctact taggggcctg
aggcagaagg atcacctgag ccttgggagg 122580tcgaggctgc agtgagctgt gatccactgt
actccaccct gggcagggca gtagagtgag 122640accctgtctc caaaaaaaaa aaaaacaaca
aaggtaattt gttatttgta tccttaagca 122700aatgctaaag gggtaacttg gggatagaga
aaagtccaca gatgttaggg tttgaagaca 122760ctaatagtat ctaggccagt ggttcctgaa
cattagtctg tgggctcttg ctgggctgtc 122820tgcataggaa tcacctgaga gcttattaaa
aataggtttt caggctggtt gcggtggctc 122880acgcctataa tcccagcact ttgggaggct
gaggcaggcg gattacttga ggtcaggcgt 122940tcaagaccag cctggccaac atggtaaaac
cccgtctcta ctaaaaatac aagaattagc 123000caggcatgat ggcacacacc tgtaatccca
gctactcagg aggctgagga aggagaattg 123060ctcgagcccg ggaggtggag gttgcagtga
gcggagatca tgccactgca ctccaggctg 123120gctgacagag ggagactctg tctcagaaaa
aaaaaaaaaa ataggttttc agtctgggta 123180ccggtggctc acacctgtaa tcccagcact
ttgggaggcc aaggcaggca gatcacttga 123240ggtcaggagt ttgagaactg cctggccaac
atagtgaaac cttgtctcta ctagaaacta 123300caaaaaatta actgggcatt ttgacgggtg
cctataatcc cagctactag ggaggctgag 123360gcaggagaat tgcttgaacc cgggaggcag
aggactgcat ctcaaaaaaa aaaaaaaaaa 123420aaaggtttcc agtccccctg tctcagaaat
tctgattctg caggtttgag gtgtgaccag 123480gaatctttat ttttagaaga cataccagat
aattctgata aatagccagt ttagggatgt 123540agtctaattt tcctattttg caagtaagga
aaataaggcc cagagaggta atgattttct 123600caaagtcaca gaacaagtta gtggcagaat
ttggactgga atgcagttct taatgttctg 123660tccagtgttt attctggtac agtatgtttg
tagaaggtat tacgtaagaa acattgttat 123720atagatgttg agataggaag agtttacatt
tagaaatttg gtctaaaatg cctgaacatt 123780caagtcgtgg aggagtattg accaacttac
tcaatacaac ataggagatt cacattttgt 123840tacaaaaatg ctgatttaaa aggagagttt
tctttttttt cttctttttt attttttgag 123900atggagtctt gctctgtcac ccaggctaga
gtgcagtgac acgatctcag ctcactgcaa 123960cctccacctc ctgggttcaa gcggttctcc
tgcctcagcc tcctgagtag ctgggattac 124020aggtgggggc caccacgccc agctaatttt
tgtattttta gtagagacag ggtttcacca 124080tgttggccag gccggtcttg aactcctgac
ctcaagtgat ccacccacca ctgcctccca 124140aagtgctggg attataggcg tgagccactg
tgcccagcct gcttgttttt gtatcatata 124200tatgcatcat cataatcatg cattatcaac
ctttgtattt ctgtcaggac atagaaacca 124260ttagagtgct tggaagagag cctttttttt
tttctcgcat ttaatgcttt ttttggtatt 124320catttcataa tcagcttacc aaaacattac
ctgcattata ccccatcaag gtagaaatct 124380ttgtgttatc aatattggtt actccctttc
cacaccgagt catcagtaag tcctgttcta 124440tccaaatagg tcatatgcat ctagctcacc
cctcagtgct gttttgtttt gaatttgtac 124500atgtttactc ctgatgcctt gtagttatga
tgatgtgttc ttattttatt ctgtgcatac 124560aagttctcag ctcgcttttt agggaaaatg
accatgtctt cctttcctat aaattccttt 124620ctatctatca agtcctcaac agagaatagg
tacccataaa tatgtgattg ttagtttctt 124680tgcctcagtt gtagtctgat ccttacagct
tttaaacaac agtagagttc accgtcaaga 124740actaaggatg gttggcaggc agatagaaag
gtagcaagtt gacccaacta tctctgggga 124800agtgggaaca aagaaaggtt acatcagcac
tgtcatcaca tagctctata gttctaggcc 124860tgcaggctca atcaagtagc cttgtataag
attctctgga ggaggtgctg aaagttgctt 124920atacttgcta tggaatttga ttttacttcg
gatatctttt taccataggt acttctccct 124980ccaagccaca catcctcttg gatttgatga
tgttgtacga ttagaaattg aatccaatat 125040ctgcagggaa ggtgggccac tccccaactg
tttcacaact ccattacgtc aggcctggac 125100aaccatggag aaggtaaccc agaacttcaa
acgtatcaaa ctacaagaag ttttattggt 125160agaactcata aaatataagg tgggaaaacc
aagcagaata gcacagtgga aattgaagca 125220gtccagcaaa gtgattaaga gcagaggcct
tgagtctggc ctggtatgta cagtcacgtg 125280ccacataaca ttttagtcaa cagtggactg
cgtgtacgat ggtcctgtac gattataatg 125340gatcaaagct ggtagtgcaa taataacaaa
agttagaaaa aataaatttt aataagtaaa 125400aaagaaaaaa gaaaaactaa aaagataaaa
gaataaccaa gaacaaaaca aaaaaaatta 125460taatggagct gaaaaatctc tgttgcctca
tatttactgt actatacttt taatcattat 125520tttagagtgc tccttctact tactaagaaa
acagttaact gtaaaacagc ttcagacagg 125580tccttcagga ggtttccaga aggaggcatt
gttatcaaag gagatgacgg ctccatgcgt 125640gttactgccc ctgaagacct tccagtggga
caagatgtgg aggtgaaaga aagtgttatt 125700gatgatcctg accctgtgta ggcttaggct
aatgtgggtg tttgtcttag tttttaacaa 125760acaaatttaa aaagaaaaaa aaaattaaaa
atagaaaaaa gcttataaaa taaggatata 125820atgaaaatat ttttgtacag ctgtatatgt
ttgtgtttta agctgttatg acaacagagt 125880caaaaagcta aaaaaagtaa aacagttaaa
aagttacagt aagctaattt attattaaag 125940aaaaaaattt taaataaatt tagtgtagcc
taagtgtaca gtgtaagtct acagtagtgt 126000acaataatgt gctaggcctt cacattcact
taccactcac tcgctgactc acccagagca 126060acttccagtc ttgcaagctc cattcatggt
aagtgcccta tacagatgta ccatttttta 126120tcttttatac tgtattttta ctgtgccttt
tctgtatttg tgtttaaata cacaaattct 126180taccattgca atagtggcct acgatattca
ttatagtaac atgtgataca ggtttgtagc 126240ccaaaagcaa taggttgtac catatagcca
aggggtgtag taggccatac catctaggtt 126300tgtataagta cactctgtga tgttagcaca
atggcaagca gcctaacgga aattctgttt 126360attgattgat tgattgattg attgattgag
acagagtttc actccattgt ccaggctgga 126420gtgcagttgc acagtcttgg cacactgcaa
cttctgcctc ccaggttcaa ccaattatcc 126480tgcctcatcc tcccaagtag ctgggattac
aggcaggcac caccatacct ggctaatttt 126540tgtattttag tagagacagg gtttcaccat
tttggccagg ctgttctcga actcctgacc 126600ttaagtgatc tgcctgcttt ggcctccgaa
agtgctggga ttacaggcat gagctaccat 126660gcctgggcag taactgaaat tctctaatgc
cattttcctt atctgtaaag tgacgataat 126720atgcacgttt acctcaaagt tactttgatg
attaaagtaa ggtaatgtat ataaaataca 126780tattaacata gtacctgaca catggtaagc
atcaaaaaat gttaactact tttattacta 126840ttattattac gtatttttaa ataattagag
agcagtatca aaaattagct gggcgtagtg 126900gcatgcacct atagttccag ctactcagga
ggctgaagct ggaggattgc atgagcctgg 126960gaattaaagg ctgcagtgag ccgtgttcat
gcccctgcac tccagccttg gtgacagagc 127020aagaccctgt cttgaacaat taaagaaggc
attatgccgc aacgttagct tagaaatgat 127080ccacatatat caccagtaac tgtcaacagg
attggaaccc tagttttggg tattatgatc 127140acaaggtatt attaatagct tattaataat
aaagcgttgg ctaggcacgg cgactcacat 127200ctgtaatccc agcactttgg gaggccgagg
tgggtggatc acctgaggtc aggagtttga 127260gaccagcctg accaacatgg agaaacccca
tctctactaa aaatacaaaa ttagccgggc 127320gtggtggtgc atgcctgtaa tcccagctac
ttaggaggct gaggcaggaa aatctcttga 127380acccgggagg cagaggttgc agtgagctga
gatcgcacca ttgcactcca gcctgggcaa 127440caagagcaaa actccgtctc aaaaatataa
ttataataaa taaataaaag taaagtattg 127500atgtttgtga atgatttatt cttctaatga
actagaggag atttttccag gaatttcaga 127560gccagtgagg ttatgttgct tgtatgtgtc
atgtgtatcc aggtgaaaaa acttaattaa 127620acgctattat ataataccat acataaaaac
tgaattttag gaatactgaa gaatgacata 127680tagaagtcaa atcattaaat agctagtagt
aaacagaata gagtgtcagc tgttacccaa 127740tgatgataat attttcacga ttaaaattaa
accttttctg attttaaagg aaaagttcag 127800atctgtatca tataaagaat gtaaattttc
agggtaataa aattaaaatg cagagagaaa 127860aatgcaaaaa tagttcttac tagatgtgtg
tatgtaagga acttagacta attttaagaa 127920cactgtcaag accctggtag ttaggtagga
aaaaagacat gaatgattca ttcaacaaaa 127980actttgagta tttctgtgct agatggtagt
gttacagtgg taaacaaaat aaatgtgttt 128040ctgctatcct ggagcttagt ctacaaaaaa
ggtacatatt ggccgggcac ggtggctcac 128100gcctgtaatc ctagcacttt ggaagatcga
ggcgggtgga tcacctgagg tcaggagttc 128160aagaccagct tggccaacat ggcgaaaccc
cgtctctact aaaaatacaa aaattaactg 128220ggtgtggtgg cggacacctg taatcccagc
tactcgggag gctgaggcag gagaatcact 128280tgaacctggg agacagaggt tccagtgagt
cgagatcatg ccactgcatt ccagcccggg 128340ggacaaaagc gaaaatacgt ctcaaaaaaa
caaaaacaaa caacaaaggc acgtattaaa 128400tacgaacata aatatttaca aattatactg
aataagttct catgtttatt atttgcttgt 128460ccagttacaa acttttcctt cgtagaatta
gaaatataaa taataaacat gagaactcat 128520tcagtataat taataattat taaatgtaaa
taaaaacatc tatgtacaat taggcattta 128580tttaagaatt atttgaaaaa aaaacaatgt
ggaaacagat attttgatat attgctagtg 128640attgaaattg ataatgttct tttgaagagt
aaagtgacca tatatattaa agttaaaatt 128700taactcagca atcacacgcc tggtgagtta
tcttaaggaa atcagtttga aagtaaaatc 128760aatatatgca caaagacttt aacatttatc
ataaaccaga aaaatcgagt ttcaaattat 128820atcctatgga ctattttctg ctaaaaagta
ttaatatcaa ctttatgtaa tactttcgtg 128880acaaatattt tgggggagaa aacccaacaa
aattacatgc attgtaattt tttttttttt 128940ttttttttta gacagtcttg ctccagcgtc
caggctggag tgcagtggtg caatctcggc 129000tcactgcaac ctccatctcc caggttcaag
caattctcct gcctcaggcc tcccgagtag 129060ctgggattac aggcgctcac caccatgcct
agctaatttt tatagttttt agtagagatg 129120gggtttcatc atgttggcca ggctggtctt
gaactcctgg tctcaagtga tccgtctgcc 129180tcggcctcct agagtgctga gattacaggt
gtaagccact gcacccagcc ttatgcatta 129240taattttaat ttgtaaactg tacaaaggga
taatacttgt agtacaacaa gaagtaaaaa 129300catttgttat aggtagttaa catttgtaac
cagtagaatt ataggtaaaa tttatttatt 129360taaaacagtt ttagttggat ttgatttcaa
ctttaaaata atgcttttca tctctatcag 129420gtctttttgc ctggcttttt gtccagcaat
ctttattata aatatttgaa tgatctcatc 129480cattcggttc gaggagatga atttctgggc
gggaacgtgt cgctgactgc tcctggctct 129540gttggccctc ctgatgagtc tcacccaggg
agttctgaca gctctgcgtc tcaggtattg 129600actgattgcg tctgccatta gggagaaaag
catacacatc ctttccttca catcccagta 129660acagatccta ttatttgtaa attttaagtt
gtggaaaaaa aagataaaag ccaggcacag 129720tggcctgtgc ctgtaatccc agcactttgg
gaggctgcgg tgggcggatc acacgaggtc 129780aggaattcga gaccagcctg gccgacatgg
tgaaacccca tctctactaa aaatacaaaa 129840attagccggg catggtggca ggcacctgta
atcctagcta cttgggaggc tgaggcagga 129900gaatcgcttg aacccaggag gcagaggttg
caatgaacca aaatcacgcc actgcactcc 129960agcctgggtg acaaagtgag actgtgtctc
aaaaaaaaaa aaaaaagaga gaaataaaat 130020tagcctactt actatcttct aatcaaagca
tttgtggtaa cttaaaatat actgtattgt 130080aaagtatcat gctgtttcat ttaggccatt
attctatttg aatctgtggc tgtttctctt 130140aataaatcaa gtaatatgga atatattcat
agcctctgaa gagctcttta tgtaagtatt 130200tatttaggat actttttgta aaataagtga
atgaattctt aggtctcctt tttttttctt 130260ttcttgagac agggtctcct cgctgcaacc
tggaaattct gggctcaaat aatccaccca 130320ccacagcctc ctgaatagct gggactagag
gcatgcacca ccacgcctgg ctaatttgaa 130380attttttttt ggccaggcat gatggttcac
gcctgtaatc ccagcacttt gggagaccga 130440ggcaggcaga tcacgaggtc gggagatgga
gaccagcctg gccaacgtgg tgaaaccccg 130500tctctactaa aaatacaaaa attagctggt
tatggtggct catgcctgta atcccagcta 130560cttgggaggc tgaggcagga gaatggcttc
aaccagggag tcggaggttg cagtgagccg 130620agatcacgcc actgcactcc tgcatggtga
cagagtgaga ctccatctca aaaaaaattt 130680tttttttaaa tgatggagtc ttgctgtgtt
gctcaggctg gtcttgaacc cctgacctca 130740aatgccgcct gcttcagcct aagtttcttt
tttttttgta aagagacagg gtcttgctat 130800gttggccagg gtagtctcaa actcctggct
tcaagcagtc ctcccacctt ggcctctcaa 130860agtgctggga ttacaggcgt gaaccactac
ctataatgtt gtgtttcact caaggccttt 130920tgatttcgtt ttgcattacc gtgccacatt
gtgcatttcc ttgacctttt ttgggttttt 130980tggagtgctt tcatatgtta aaccatacct
gattctcctc aaaatcacac aaagtagaat 131040atcctaagac aagaaatcta aggaggcata
aagaagttaa ctggttttat taaactcaca 131100cagtaaatga tagagccaga aatattcccc
ttctagtgtt cttcaccatc agcttaatgt 131160agcataataa ttttctaatt actgttgaca
aataaataac cctttgaatt ttcaatactg 131220ggccttggat aaattttcct aatttgtaag
agagtattat cgtattgcca tttacaaagc 131280tctcctgagt atctttttct tctgttaagt
ttacctagga gataaactgc tgagtatggt 131340tgccattttg gttttttgat ataggttaga
atgtcttggt tttttttttt tttttttttg 131400gtttttgttg ttgtcattgt ttgagacagc
atcttgctct gtcgcccagg ctggagtgca 131460atggcacgat cgtggctcac tgcaacctcc
acctcccggg ttcaagcaat tctcctgcct 131520cagcttcctg agtagctggg attacaggca
tgtgcaacca cacctggcta atttttgtgt 131580ttttagtaga gaaggggttt caccatgttg
gtcaggctgg tattgaactg ctgacctcat 131640gatccacctg cctcggcctc ccaaagtgct
gggattgcag gcatgagcca ctgcacctgg 131700ctgaatgtct tgtttttgat taggcactta
agaaaggcct aggtactaac cataaaatat 131760atttttatac cttttgttga tactatatat
atagaaaact gcacttatca taaccttaga 131820caccttgaag aatgttcaca agcagaacta
acccatgtga cccagcatcc agatcaaaaa 131880cagcattatc agcccctcta gaagccctct
tgggcccctt ccattcactg tccttcttgt 131940caccagggta gctactatcc tgacttttga
tggcatagat tagcattacc tgttcttgtc 132000attttataaa taaaaccata ctgtgtattc
ttttcttgta cagctttatt gtgctaattc 132060acatttacat catacaattc agtggttttt
atatggtcac agagttaggt aaccattacc 132120acatcgattt tagaacattt ttttcactcc
agatagaaac cccctttact taaactccaa 132180atcccccact ccaccagccc taggcagcca
ctagtctact ttttatctct atagagacaa 132240tagatttgct tattctggac atttcataaa
catggaaccg tatattatgt ggtcttttgt 132300tgccaactgt ctttcactta gcatcatgtg
ttcaaaagag catcatgtta tccatgtttg 132360gcatgtatca gaattttatt cctcattatg
gccaaatatc ccattgcaag gatttatgac 132420attttatttg aattgtaccc tcctttctgc
catttatcaa taatgctact gtgaccattt 132480gtgtacaagt ttttgtgtgg atacaggttt
tctttttgtt tttaaatttg aggtggagtc 132540ttgctctgtc gcccaggctg gagtgcagtg
gcacaatctc ggctcactgc aacctctgtc 132600tcctgggttc aagcagttct cctgcctcag
cctcccgagt atctgggact ataggcacgc 132660accaccacgc ccagctaatt ttttagtaga
gatggggttt caccatgttg gccagtctgg 132720tctcgaactc ttgacctcaa gtgatccacc
catctcggcc tcccaaagtg ctgggattac 132780aggggtgagc cactatgccc ggctgtggtt
ttcatttctt ttgttgtata tacataggag 132840tagaattgct gagtcaagag gtaactctta
aacttattga aaaactgcca gattgttttc 132900cgaaaaggct gcaccatttt gcaatcccac
cagcagtgta tgagttttac agcttctcca 132960catttcattg gaacttatta tctgtttggc
tgtttttaaa aatgatagtc attccaataa 133020gttctacttc agtgtggttt ttgcacttct
ctgatgagta atgatgttga gcatcttttc 133080atttgcttat tggcctttgt tctagctttg
gaaaaatgtt tattcaaatc ctttggccat 133140ttttattttt atttttattt atttattttt
ttttgagacc aagtctcact ctgtcagcca 133200ggctggagta caatggtgtg gtctcagctc
actgcaacct ccgcctcctg tgttcaagtg 133260attctcctgc ctcagcctcc cgagtagctg
ggattacatt tcaggcacct gccagcatgc 133320cgggctgatt tttgtatttt tactagtgac
agggtttcac catgttagcc aggctggtca 133380caaactcctg acctcaggtg atctgcctgc
ctaggcttcc caaagtgctg ggattacagg 133440cgtgagccat tgggcccagc ctagattttc
ttttttcttt ttttttttga gaaggagtct 133500tgctcttgtt gcccaggctg gagtgcaatg
gcacaatctt ggctcactgc aacctctgcc 133560tcctgggttc aagcgatttt cctgcctcag
cctccccagt agctgggatt acaggtgcct 133620accaccacac ccagctaact tttgtatttt
ttttagagac agggtttcac catgttggcc 133680aggctggtct caactcctga cctcaggtga
tccacctgcc ttggcctccc gaagtgctgg 133740gattaccggc atgagctacc aggcccagcc
aattttctca ttatattgcc caggctggtc 133800tcaaactcct gggttcaagt gatcctcctg
ccttggcctc ccaaagtgtg gggagtacag 133860gcgtgagcca ccttgctcag cccctttgcc
catttttaaa ttagattgcc tttttatatt 133920gagtttcagg agtcctttat atattctaga
taaatgtccc ttatcaaatt atattatttc 133980caggtatttt cttcattctg tgagttgtct
ttcctctacc ttttaaaaaa ggtgggtttt 134040tgtttgtttg tttgtttgtt tttttaagat
aaggtctcat tctgctgccc aggctggagt 134100gcagtggcac aatcacagct cactgccacc
tcaacttcct gggccgaagt gatcctctta 134160cttcagcctc ctgaatagct agggccatag
atacacacta tcacacccag cttttttttt 134220ctgtttgtag agacagatct tactgtgttg
cccaagttgg tctcaaactc taggctcaaa 134280gtgattctcc cacctctgcc tcccagagtg
ctgggattac aggtgtgagc cacacgcaac 134340ctgtcttttc actattaata gtgtcttcct
gcttcagcct cccgagtagc tgggattaca 134400ggcacccacc accatgcctg gctaattttt
ttgcattttt agtagagaca gtgtttcacc 134460atgttcaccc ggctggtctt gaactcctga
cctcaggtga ttcacctgcc atggcctccc 134520aaagtgctgg gattacaggc gtgagccact
gcacccggcc aaaatattgc cttcttaaca 134580gtattgtctt ctaatttgtg aacatggatg
tatcttcatg tatttatgtg ttctttcatt 134640tcagcagaat tttgtagttt tcagagtaga
agcctttcac ctccttgggt catttattcc 134700tatgttttaa gttcttttcg attccattat
aaatagaatt gttttcttaa tttcattttc 134760agattgtttg atgagagagc atagaaatac
aagtgatttt tacatgttga tcttgcaact 134820tcaactttga taaatctgat tgttagctct
aatagttttc ttgtggattc tttaggattt 134880tcaatatata agatcatgtc atttatggat
agagatagtt ttttttctgg ctagaactta 134940cagagcaatg atgagtagaa gtggcagaag
caaaaatctt tgtcttgttt cctatctgac 135000agggaaagct ttcagtttca tcatttaata
tgatgttagg tgtgggtttt caataaatgc 135060cttttttcag attcaggaat ttccctatca
ttcctgattt tttaaggctt tttttttttt 135120ttaaatcatg aaagggtgtt gaatattgtc
atgttctttc tgtatcagta taaatgatcc 135180tatggatttt gggttttatt ctgttgatgt
gaaatattaa ttgattttca gatgttaaac 135240caaccttgca tacctgagat gaatctcact
tggtcatggt gtataatctt ttcaatatgc 135300tgctggattc catttactgg tattttgttg
aagattttgt atctgaacgc ttaagataac 135360atttacactc tatcagaaat gaattgacca
taaatgtgag agtgtatttg tgggttcttg 135420attctcttcc attccaaaga tagacataca
tccgtctgta tgtctgtctt tatgccagta 135480ccatactctc ttgattacta ttgctttgta
ataagttttg aaatcagaaa gtataaatga 135540gattttggta tctgagtaac agtcctcata
gaattagttg ggaaatattc cctctttatt 135600ctggtccctc tttctttttt gtttaactgt
gtatcttgga gattgttcct tctcaacaca 135660tgagagccgc tttccctacc ctcccacccc
tgctatagag aggtctataa gtgtctgttc 135720aattatttta tttacttaac ctattactta
gtcggggaca ttaagcttgt ttatgtcttt 135780tattttaaac aatgctgcag tgaataatct
tgtatataag tcattttcca tcaatataag 135840tctctctgta actgaatttt tagaagtgga
atttctaggt caacctatgg ctctgtattt 135900cacaaaaata ccaattctgg tttttcttgt
ggaggtgggg agtaggaggt agaatgctgg 135960aggagaactt gctgtactca gctggctagt
cattttagaa aggtttcctt agcttctttt 136020tgtcatatgg cctcaccaag aatcaaaaac
attcctattt accctgtaaa catggggctt 136080tactacccaa gatacatatt tctggatgta
tgacagcttt tcatattgaa gaaataatgc 136140tgtgagtaca gcacatttgt tggaacttag
gtcgttaaga atgtcttata aattcataca 136200ttatacattt tattttattt tattttttag
tttttgatac agagtcttcc tctgtcgccc 136260aggccagcgt gcagtggtac aatcttggct
cactgcgacc tccatctcct gggctcaagt 136320gattctcatg tctcagcctc cagagtagct
atggttacag gcatgcacca ccatgcccgg 136380ctaatttttt tatttttagt agaaactggg
tttcaccata ttgaccatgc tggcctcgaa 136440ctcttggcct caagtgatcg gcctgcctca
gcctcccaaa gtgctgggat ccttgtattg 136500ggtaaaagat gaatattgag ggctgcatgg
tggctcatac ctgtaatccc agcactttct 136560gagactgagg tgggaggagt cctggagccc
aggagggtga ggctgcagtg agttgtgatc 136620gcgccattgc acttcaacct aggaattata
ggcttcagtc actgtgcccg gcatgtacat 136680tttaatattg tgctttcctc ttttagctat
agtatgaggt tacatttcag agtcattgtt 136740gttaagcatc ttaatagtga tgaggttgag
tgaaagttac ttctatttca aacactgaag 136800aaaattttgt acaaatctgt cacattccaa
gcccaggact gattgtttca tatacttcta 136860attttacaat ttctattgta gtccagtgtg
aaaaaagcca gtattaaaat actgaaaaat 136920tttgatgaag cgataattgt ggatgcggca
agtctggatc cagaatcttt atatcaacgg 136980acatatgccg ggtaagctta gctcatgcct
agaattttta caagtgtaaa taactttgca 137040tcttttaaat tttttaatta aattttacat
ttttttctaa tctattatta tatgcccaga 137100actttcactt agagtgtgca gtataatgtg
gtggttaagt ataaaggctc tggagtgact 137160tcctgggttt taatcttggc tctgccattt
attggcagcc gctaacctct tggtatctca 137220gtttcttcat ctgtaaaatg agaataataa
agtgaaaaga tgccaacatc atttactctg 137280ggctgcataa ctgatacttg gaaaaagtat
tcctttgagt ttaagaatta agttggttat 137340tcattttagc ttgtaataaa aagatagtga
ttcataggat atgccactta ctgaaattta 137400ccacagatcc aatcataaaa tcactttctc
ttccctaaag atagcttgat taacatgtaa 137460aggtgtgtaa aggcttgatt acactaccct
gatccgtacc ccagttccca gcagcaccat 137520gaaaaaggga tttcaacata tttaattact
ttcagtagaa agtaacagtg gtaggccagg 137580cgcagtggct cacacctgta atcccagcac
tttgggaggc cgaggtgggc ggatcacgag 137640gtcaggagat tgagaccatc ctggctaaca
cgatgaaacc ccgtctctac taaaaataca 137700aaaaattagc cgggcatggt ggcaggcacc
tgtagtccca gctacttggg aggctgagac 137760aggagaatgg cgtgagcccg ggaggcggag
cttgcagtga gcttagattg tgccactgca 137820ctccagcctg cgcagtggag cgagactctt
gtctcaaaaa aaaagaaagt aacagtggta 137880ttgggagact gaggagccta gaaagtactt
gaaggaagta aaaggtttgt ttgaccacat 137940tgtatttgga aagccagctt tttcagctgt
gtcagctttg tgtagtgatt tttagttctt 138000cttttagaaa ataacggaca aggccgggca
cggtggctca cgcctgtaat cccaccactt 138060tgggaggccg agacgggcgg attacctgat
ctcaggagtt cgagaccagc ctgggcaaca 138120tggtgaaacc ccgtctctac taaaatacaa
aaagttagcc gggcgtggtg gcgtgtgcct 138180gtagtcccag ctactccgga ggctgaggca
ggagaattgc ttgaacccgg gaggcggagg 138240ttgcagtgag ccaagatcac accattgcac
tgcagcctgc gcgacagagt aagactctgt 138300ctcaaaaaat aataataaaa taaaaaagaa
tggacagtaa acctaaatga gttcattccc 138360aaagatgatg ttattcttaa gggatggttc
atttatttaa gaccttacat aaagtctatc 138420aattgcgtga tttttcactt ctgtaattgt
gtgtatgtat aatgtaaata tatatgtttt 138480tgttttgttt tggttttttg agacggagtc
tcgctctgtt gctcaggctg gaatgcagtg 138540gtgcaatctc agctctctgc aacctctgtc
tcccaggttc aagcgtttct tctgcctcat 138600cctcccaagt agctgggact acaggcacgt
gccaccacgc ccggctaatt ttttgtattt 138660ttagtagaga tggggtttca ccgtgttagc
caggatggtc tcaatctcct gacctcgtga 138720tccacccgcc ttggcttccc aaagtgttgc
tattacaggc atgagccacc acacccagca 138780tgtatttttt aaatgtataa aatgaagcag
aaaagagaaa tgataatttt tcttcatctt 138840gaaagattat cttcaccagg cgcagtggct
cacacttgta atcccagcac tttgggaggc 138900ctcggcaggc ggctcacttg agttcgaaac
cagcctggcc gacatggtga aactccgtct 138960ctactaaaaa taaataaata aagatggttt
taatatatgt tttagtttta tgattttagc 139020atctttctga aatttttctc aaggcaagta
aatttgtatc agttggtata ttggtaccca 139080tctatgaaat aacttattag gaagatatct
ctaaaataag atcactttgc ctaaaataaa 139140ctgatatatt gatgttcaca gaatttttct
tttaaccgac ttgataaatg cattattctt 139200gacgtcaagt gatccacctt cctcagcctc
ccaaagtgct gggattacac acatgagcca 139260ccgcacctgg cattattctt ataaaaggtt
aaatttctag ttaagtttaa tgtcctcttt 139320gttcatgtac cattgcttat tttcttccct
tcctactcac agtaatcatt cttatggtat 139380gcacttttgt ttgcttattt ttatgtaatt
gatattacgc tccattctgt acgttgtact 139440ttcattcaca gtgagttttg gacattccta
tgttcatcta tacagactta cttcatttta 139500actacactgt agtattccgt atgtaatatt
tactataact catcactgta gcagagcatc 139560tcatagtgta tgtattactg ttttgccatt
ttggtatcaa tgagtattta agtcatttgc 139620agtttttccc tcttataccc agtattacag
aggatctctt tttatatgct tctttgtacc 139680aagaggcaga ttaaaaaatt tttttttgaa
aaaatttttg aaaaaaaatg aaatgaagtc 139740tcactatgtt gcccaggctg gtctcaaact
cctaggctca agcaatcctt ccatcttggc 139800ctcccaaagt gctggggtta caggcatgag
ccaccatgcc tggcctacat tttaaatttt 139860gatagctctt acaatttact ttgtaaagta
tctgcatcat tttatgttct caccagtctt 139920taataagaat acttcatact tttggctgga
cacagtggct cacgcctgta atcccagcac 139980tttgggaggc cgaggcgggc agatcaagag
atcgagacca ccctggccaa tatggtgaaa 140040ccctgtctct actaaaaata caaaaattag
ctgggcgtgg tggcgcaccc gtagtcccag 140100ctactcgaga ggctgagaca ggagaatcac
ttgaacccgg gaggtggagg ttgcagtgaa 140160cttagatcac accactgcac tccagcctag
caacagagtg agactctgtc tcaaaaaaaa 140220aaaagaatac ttcagactta attttttttc
cagtcttaag tgtttgctaa tgagattgag 140280tttcttttgg tatgtctctt gattgttcag
gttttttctt ttatgaattg actgttcatc 140340tctttttcac attatttctg ttgggtgatt
ttattagtga cttgttaaaa ttctgtatat 140400tttttcagca tgacacttca ttattcaaaa
aaaaaaaaag attctctatg tttctcgata 140460ctaatcattg gttggtaata ccttaaaaat
aagaccctta ctgtattttt tgcttttttt 140520tttttttttt tttttttttt tttgagatag
agtcttgctc tgttgcccag gctggagtgc 140580aatggtatga tctcggctct cagctcactg
caactgcaac ctctacctcc ctgtttcaag 140640caattctcct gccttagcct cccaagtagc
tgggattaca ggcatccacc accacaccca 140700gctaattttt gtatttttag tagagacagg
gtttcaccat gttggccagg ctggtctcaa 140760actactggcc tcaagtgatc cgcctgcctc
ggcatcccaa agtactggga ttacaggcat 140820gagccacagt gcctagccac tttttgcttt
ttaactttgt tttatagtac tatagtttta 140880gtataaacag atgtatgtat acacacaact
atggctttat aatatgtttc agtcattgtt 140940agagcaaggc ctaccttttg ggtgcttctt
ttacaaaatt gtcttggcta ttcttgtgcc 141000ttttttctta tttgtgaatt ttagaattgt
gaattacctg ttgactcacc atgttttgta 141060aactgaggat tttgaatgga attgcactca
attaaagatt atcttgcttt ctgtgcagca 141120atgttttatt tcaaataatc cctactttaa
attacttagg atagctataa attgtgtttc 141180tggctttcta gatttagatg aaacgcttta
aattgattgt tttctcctaa atttaaaact 141240gattgttaga agttaaagtc ttctgttcat
tcttatttag gaagatgaca tttggaagag 141300tcagtgactt ggggcaattc atccgagaat
ctgagcctga acctgatgta aggaaatcaa 141360aaggtttgtg gtgtttttat acttcatatt
aagcctttac tcacattagt gattgactgt 141420aagtcaaaga ccacttaagg tttaaactgt
ttattttgta aagtaaccac tgtatctttc 141480accttgtgtt tatagtcaga agtaagtaca
agggcttcct gtagtcacat ctttatgcaa 141540tctcctctga atcaaaagtt agtgaacttg
ctttgccact ccagaaggca catgaatatg 141600aaaaagcatt gtctattttc ttatttaatg
gcaaaatacc cgacctaagt tggacttaat 141660gtttgagacc gtttatttta ttaaattata
ttttttctct tttctttttt ttttttgaga 141720cagttcttgc tctgtcaccc agaccggagt
gcagtggtct gaccgcacct cactgcaacc 141780tctgcttcct aggttcaagc gattttcctg
cctcatcctc ctgagtagct gggactacaa 141840gtgcgcacca ccacacctgg ctaatttttg
tatttttagc agagatgagg tttcaccacg 141900ttggctaggc tggtctcata ctcctgacct
caagcaatcc atccgccttg gcttcccaaa 141960gtgctgggat tacaagtgtg agccaccatg
cctggcctta ttaaattatt tttattaaat 142020ttcctcaaga ttgatgaaag taatgaaata
taaaagtaat gaaatatatg tggaaaatag 142080actggattaa gaaaatgtgg cacatataca
ccatggatac tatgcagcca taaaaaagga 142140tgagttcatg tcctttgtag ggacatggat
gaagctggaa accatcattc tgagcaaact 142200gtctcaagga tagaaaacca aacaccgcat
gctctcactc ataggtggga attgaacaat 142260gagaacactt ggacacaggg tggggaacat
cacacgctgg ggcctgtcgt ggggtggggg 142320gctgggggag gaatagcatt aggagatata
cctaatataa atgacgagtt aatgggtgca 142380gcacaccaac atggtacatg tatacatatg
taacaaagct gcacgttgtg cacatgtacc 142440ctagaactta aagtataata aatttaaaaa
aaataaatat atgtggaaaa tattaatagg 142500tcaaaattca aattgttcat ttaatcagaa
gagtagttta gtcaaatcca agggttagac 142560aacagaaatc ttttttgtca agtgcattct
ttgtgactga tttcattttc ttcctggttt 142620acacaggaag atttcagaaa caaatgtgga
tccgtgacag atggtatcta gaagttttta 142680gtttggttga attgacagta ttttattgag
taaaagatac taatttttgt aagaagaaaa 142740attcaatttt gataagtatg tttaagatta
agagctattg gccaggcgct gtggctcatg 142800cctgtaatcc tagcactttg ggaagctgga
gcaggtgggt cacgaggtca agagattgag 142860accatcctgg ccaacatggt gaaaccctgt
ctctactaaa ttagccaggc gtggtggcac 142920atgcctgtgc acccgcctcc gggtttaagc
gatcctactg cctcaggctc ctgagtagct 142980gggattacag gcgccatggc taatttttgc
atttttagta gagacagggt ttcactacat 143040tggccaggct ggtctggtct caaactcctg
acctcaggtg atctgcccgc cttagcctcc 143100caaagtgctg ggattacagg catgattcac
catgtctggc catttatctt attttctttt 143160tttttttttt ttttgtttga gacggagtct
tgctgtgtcg cccagagctg gagtgcaatg 143220gtgcgatctc agctcactgc aacctctgcc
tcctgggttc aagcaattct cctgcctcag 143280tcttccaagt agctgggatt acaggcgcgt
gccaccacat ctagctaatt tttgtatttt 143340tagtagagac agggtttcac catgttggcc
aggctggtct cggaactcct gacctcgtaa 143400tctgcccacc tcggcctccc aaagtgctga
gattacaagt gtgagccact gtgcccagcc 143460atcttatttt ctttcttttt ttttgtcggg
tgggaggggg acagagtcta gctctgtcgc 143520caggcttggc tcactgcaac ctctgccccc
caggttctag caattattct gcctcagcct 143580cccaagtagc tgggattata ggcacctgcc
accacgcctg gctaattttt tgttattttt 143640agtagagatg gggttttgct atgttgacca
tgctggcctc aagtgatccg cccaccttgg 143700cctcccaaag tactgggctt acaggcgtga
gcttgtattg ggtaaaagaa caatattggg 143760ggctgcatgg tggttcatac ctgtaatctg
agcactttgt gagactgaga tggaaggagt 143820gttggagccc aggagggtga ggctgcggct
gcagtgaatt gtgatcacgc cattgcactt 143880ccacctaggt aatggagcaa gaccatgtct
ctaaaaaaca aaacacaatt tttttaagga 143940atactgggaa gaggtcagtg gtggttttag
aacagaggaa gtgccagatg acctttgtga 144000ggcattggcc aggaagaact ctacagtgtc
tttaggtagc ttctgtccat aaggataatg 144060gggtctcctc cccagtatta atagaaaatc
tctgagctgt ttttttttgt ttgtttgttt 144120tgtttttttt tcctgagatg gagtctctct
ctgtcggcca ggctggagtg ctgtggcgcg 144180atcttggctc actgcaagct ctgcctccca
ggttcacacc attctcctgc ctcagcctcc 144240caagtagctg ggactacagg tgtccaccac
cacgcccagc taattttttg ttatttttag 144300tagagatggg gtttcaccat gtcagccagg
atggtctcga tctcctgacc tcgtgatccg 144360ctcgcctctg ccttgcaaag tgctggagtt
acaggcgtga gccaccgtgc ctggcctggt 144420ttttttgttg ttgttattta tttatttatt
tatttatttt ttgagacaga ctctcgctct 144480gtcgcccggg ctggagtgta gtggcacgat
gtcggctcac tgcaagctct gcctgccagg 144540ttcaagccat tctcctgcct cagcctcctg
agtagcaggg accacaggcg ctcgccacca 144600cgcccggcta attttttgta tttttagaag
agacggggtt tcaccgcatt agccaggatg 144660gtctcgatct cctgatgtcg tgatccgccc
acctcggcct cccaaagtgc tgggattaca 144720ggtgtgagcc accgtgcctg gcctgatttt
tttttttttt taatctggtc tcatacctct 144780gacagctcat gaagaagtgc tcctgcttca
tatgtatatg tgttagcata gtgttaacat 144840agcataggtg ttcggtgttt gcagtttctg
tttgttttat atgaattaag gtgtattatg 144900agcagttgaa gatatatagg aaattttttc
ccaaaccact atctctgctc gttctattca 144960ttcagtctgt ttatgttatt ccttcattca
ttcattttat agaacagtgg agtgcctact 145020gtatgcatct attgttctgg gtcctgggga
agaaaacaaa gttcctgctt tcatggaact 145080tacattatat tggcggagac agtaacagac
aaacaaatgt agcctgtgta catgtgttac 145140atgaaaagca gggtaggggg ctgggagaga
gtagtaggga gtgctatttt cgaggtggtt 145200gtcaggaaag gcctcactga ggaggtggca
ttttgagtag acctgagcgc agcgggggcg 145260taagcccagg cagcatgtgg aggaagagtg
ttcttggtga aaggaacaag gatagaggcc 145320cgaagctaga gagctcagca tgatcaagga
acagcaagcc ccgtgtggct ggaatggagt 145380gagcaaagga atgagcagta gaaggtgagt
gagttgggag gtcaccagag accatggcaa 145440ggacttgaaa gtgtcaggga cacattggaa
gttggagcag ggaaatgatg ggatttatgt 145500tttgtttttg ttttatgttt agtgttttta
agggattgct ctatcagcta tttggaaaat 145560ttagtgtagg gcttcaagaa gagaagcaga
gaaacaacat tcttgccata gtcatagtct 145620aagtaaggga tgatggtggt gtggattagg
ctggtagtgg aagaccagtc cagttcgggt 145680tgtatttgaa ggtagaggca aaaagattat
atttctacca gcaagcccat ctatgaagtt 145740acttgtatta ttaatttaat tgagacatgc
ccacataaac taataaatag gaatttctgc 145800agtttggtta aacacccctg tatatcctgg
ttcttctttt agttgtccag atgtctcttt 145860aagtcaagta ttttttggtg gtgtaggagc
ctagagattg aatttattca cccaaaaggc 145920atttgagtga ttactatgtg ccaggcacta
tgctgaatgc caaggatgta aataagaggg 145980cgtagtctca gtctgtttta ctccagcttg
gttccttttt aatgaccctg acttgttaag 146040catatcagtt atcctacaga atgtttaatc
ttctgtactt tcctggttgt gttatttagc 146100ttatttctct ttccttgaca tttcttgtaa
actggaagtt acacctatag tcttgatgat 146160tcgtgttaca cattttagat tagaacacat
catgtgttgt atatggtgtt tttgaaagcc 146220tctctgtata ttggtctgta cattaaaatg
ttgcctgaat ggatacacat aaaatttaac 146280agtgattaca ttagagatga gaagaaagag
gtgcctttta cttttcaata taccttttcc 146340tctgcttttt gaactttctt gccctatgca
tacgttattg cttaatcatc cacctcatct 146400cttcccctgt ggctttctgt tgcatttgga
atgaaatcta gcctctttgc tgttacctgt 146460ggatgtccct tgctggcctc tatcacctta
ctttgaacca ctcctttcat ggactgagct 146520ctcattggac tatcttttat tcttttgctg
aagtttcttc actttgagtg cctctgcagt 146580tgctatttca tggctgtggc aagccctgcc
atggctttca tgcaaggatg gttcctcctt 146640ctcatctcaa tattatctct tcagagaggg
accttcccaa ctccgatgat ctaaaatcct 146700ttgtatatac cactcactac cacttctttc
ttttcttttc cttttatctt tttttttttt 146760tttttttttt gagatagggt cttgctctgt
tgcccaggct ggaatcacga ctcactgcag 146820cctcatcttc ttgggctcaa atgatcctct
cacctcagcc tctcgagtag ctggaactgc 146880aggcacacac caccatactt ggcttattat
tttacttttt gtagagacag ggtttcacca 146940aggctggtct caagctcctg ccgcaagcaa
tccacatctc tcagcctccc aaagtattgg 147000gattatagga gtgagccact actcctggcc
tattttctta ttcactgtct aaaattatct 147060tgttcattta tttacatact tgtttatagc
ttatttctca gctggacatg gtgcctcaca 147120cctgtaatct caatactttg ggaggctggg
ttggagaatt ggttgagccc aggacttcaa 147180gaccagcctg ggcaacaaag tgagaccctg
tctataaaaa attgtttaaa aattagctgg 147240gcatggtggc acatgcctgt ggtcccagct
acttgggagg cagaggtggg agaatcgctt 147300gggcccagga ggttgaggcg acggtgagcc
atgattgtgc cactgcactc tagcctagtg 147360acagagtgag accatgtgtc taaaaagtaa
ataaaaatag tttctctttc atgactagaa 147420tattacctct atgtgggcag ggagtttgtc
tatactattt ggcactatat ttcctgattc 147480tgaaattatg cctagcacat ggtaagtact
ccttaaatat ttattgactg aattatttaa 147540tacttaagaa tttcatttgg gattatctga
gtggtaagat tacggattat atttatgtaa 147600gaaaaaatca ttttttaaac ttggttgccc
tttgccacac tgacatagac actaagtttt 147660cttagccaga ttacttccga ggatactcac
agaggccatt ctcttctcaa tccccaaata 147720attgatattt cttagcactt tcaagctaat
gcaattctta gatgatgtat ctgtgtatat 147780catatcctca ttctacaaat gtagaaattg
aagtctgggc acagtggctc tcacctgtaa 147840tctcagcagt ttgggaggcc aaggcgagcg
gatcactgag gacaagagtt aagaccagcc 147900tggccaacat ggtaaagcct tgcctctatt
aaaaatacaa caattagggc cgggcgtggt 147960ggctcacgcc tataatccca gcacgttggg
aggccaaggc aggcagatca cgaggtcagg 148020agttcgagac catcctggct aacacagtga
aaccccatct ctactaaaaa tacaaaaaat 148080tagccaggca tggtggcacg cgcttgtagt
cccagctatc gggaggctga ggcaggtgaa 148140tcccttgaac ccgggaggcg gaggttgcaa
tgagctgaga ttgcaccgct gaactccagc 148200ctggtcaaca gagggagact ctgtctcaaa
aaaaaaaaaa aaaaacaatt agccaggcgt 148260ggtggcgggt acgagtacct gtaatcccag
ctactaggga ggctgaggga ggagaatcac 148320ttaaacccag gaggtggagt ttgcagcggg
ctgataatgc accactacat tccagcctgg 148380gcaacagagt gagactctgt cttaaaaaaa
aaaaaaagaa agaaagaaat tgaggaatgt 148440ggagattgtg gtctgtgatt tgttaggaat
cacacagcag gttagtagca actacagggc 148500tttggttcag aataccacct tgacaatggt
ttgtttacag ttcggctccc cttcctctgc 148560ctttctctcc ttccttattg agggcagctg
gaaagaattt tcatcattta ctagcctata 148620gctttaattt gagttttgaa accttgataa
tagagcacag aggaaaagac tgagttttct 148680ttttttgaga cagtcttgct ctatggccca
ggctggagtg cagtgacacc atctcagctg 148740gttgcaacct ctgcctccca ggttcaagca
attctgcctc agcctctcga gtagctgaga 148800ttacaggcac gtgtcaccac gcccagctaa
ttttctgttt ttgtttcgtt ttgttttttt 148860ctgagatgga gtcttgctct gtcacccagg
ctggagtgca gtggtgcgat gttggctcac 148920tcaaacctct gtctcctggg ttcaagcaat
tcttctgcct cagcctcccc agtagctggg 148980actacaggta cgtgccacca tccctagttc
atttttgtat gtttagtaga gatggggttt 149040cactatgttg accaggctgg tctcgaactc
ctgatctcag gtgatctact cgtctcagtt 149100tcccaaagtg ctgggattat tggcacacgc
ctatttttgt atttttagta gagacggggt 149160ttcaccatgt tggttagact ggtctcaaac
ttctgacctc aagtgatttg cccgccccag 149220cctcccaaag tgctgggatt acaggcgtga
gccaccgtgc ccagccaaga ttgagttttg 149280aaaagagcct tctgagatta tgagaagggc
aagcaagata acttaagaag ttacattaaa 149340atcatctaag agacagtgta acaagaagga
attgtaaaat gatgttatga gcacgtgccc 149400aatgtagtgg caatcccttg tgcttcgata
cattggtggg agacaaaact gtacttaaat 149460tgataaatcc cttacatgtc attttaagga
gcttagactg actcccatca tgtagacatc 149520agagatttct tttttttttt tttttttttt
tttttttttt tttgtgacag agttttgctc 149580ttgttgccga ggctggagtg caatggcgtg
atctcggctc accacaacct ccacctccca 149640ggttcaagca attctcctgc ctcagcctcc
cgagtagctg ggattacagc catgcaccac 149700cacgcctggc taattttgta tttttagtag
agacggggtt tctccatgtt gtggctggtc 149760tcgaactcct gacctcaggt gatcctcccg
cctcagccac ccaaagttct gaaattacag 149820gcgtgagcca ccgcgcccag cccagagatt
tctaaacaga gttctaacca gatgcttttc 149880cctgtcagta gaatgagaat gaattggagg
tgggagagac tggcatgagg gacaccagtc 149940agccagtgga attagctggt aatgttgata
ggagaagaaa aagattcaaa gttaggtagt 150000ggtagcaaga attagaggga aggtcggatt
tatgatatgt ccaaggttga attctaaggt 150060gaaatttggt ggcagatttc atgtgtaaat
tgggaaggta gattgagttt ttttaacatg 150120ggttttctaa catgtcaata gagtgactct
gcaggggggc ctgacgagag aacagtgcat 150180ggggtgattc aacagccagt tgagccttca
tgcagagcat ttaacactgt gactctgtag 150240actctggttg gcagtaaaat ttcattaaac
caatatttaa acccttaggt aataataaaa 150300attgagggaa aaggatccag gttttgtatt
ttttatgaat tcagttattg aattaaacag 150360gaccttgcct caagaaataa tctaccaaca
attaacttgt tttaaagcaa agttaggaag 150420tgagcatgtt caaattatta aataaaaaag
taagctgtgt atttcattca tagaaataga 150480ggctggccta cttcggatga ttctcagcat
gtgattacag atgtgggctt atacatccta 150540gggagttaag gcgtactctg gcttggatag
agtagagctc tttgaaactc ttctctcacc 150600cagctagttt atatagacta gagaactaga
atgtagcagc atactctgtc ttagaagccc 150660ttttatatag gagctggtct ggaaggtttg
aaaacataac aaatgtgttg gtgtctccca 150720atgtattgct agattcttac ccaagagcat
tatcctggtt agggtttggt ttggttttgt 150780tttgtttttt aatgtttgcc acaaactaac
actagatgtt agttctttca tcaagtgagg 150840agagtagaag aaaagtccag aactctgaaa
caccttttca aaagtttttc aagccatgat 150900gtttgcaagt taaatgctct gttatgtaag
caatataatc agtttttatt aatgtaacat 150960tccttagtgt tttggggtat cacacaaaaa
agaatatcca tatctggaag caacagcttt 151020taaataagag cattgtggtg gtggtggtga
tagtggtttt tttttttttt tttgagttgg 151080agtctcgctc tgttgcccag gttggagtgc
agtggcacga tctcagctcg cttcaacctc 151140tgctcccagg ttcaagcaat tcttctgcct
cagcctcctg agtagctggg attataggca 151200cctgctacca tgcctggctg atttttatta
ttttagtaga gacaggtttc accatgttgg 151260ccaggctggt cttgaactct taacctcagg
tgaatcaccc acctcggcct cccaaagtgc 151320tggaattaca ggcatgaacc accatggcca
gccaaataag agcattttta atgtaaaatt 151380atgcatgaaa tgtacattca attttgtctt
tgtttactag gatccatgtt ctcacaagct 151440atgaagaaat gggtgcaagg aaatactgat
gaggtaaatc ctacctttag gataaaaaga 151500tttctgttta taagtgccac cctcatgtaa
gtgaggttta aaattttcct tttctttagg 151560tcccatgttt aagcagcatg gcacatttat
gttctcttac ccagaatgta ccaagaaagg 151620gtggtccctt cttaacatct aacaattgcc
tggtagtagc agtgaaggta tcttcagtca 151680gaggctagga ccactgaagg atatacatgc
attcaagttt ccatcagcca gcaggcatca 151740gtaatcagtg tgtagatcaa aagctcaaat
gtttccttcc ccactggcag ttttacttca 151800agtagtggag gcttgctttt ttaatagtta
attaagtaca ttgagagatg ggaggtgaaa 151860aaaggaaaat gttttatttt gaccatctaa
tatgaaagta gttcggtgtt aggtatccag 151920tagttgacac tggaagacag ggaatgacat
gttaatattc atagccagag ggtggcccag 151980gttttttcgt acatgggaat gaaattctta
tccaaataag tagaaattat gtgcgtaagc 152040catttgttaa gagcactgag tatgtgcatc
tcgatccatc taatgaataa ccattatcac 152100cagtttaaat tattttcttt aggcccagga
agagctagct tggaagattg ctaaaatgat 152160agtcagtgac attatgcagc aggctcagta
tgatcaaccg ttagagaaat ctacaaaggt 152220aaggatgact tcgttttgtg taaactaaaa
agtattattt tccaggtgta aaaataaaaa 152280agaacataag gggtttcttt gcctttgaag
gattaactgc tgtggggatt accttcttat 152340cataagcaac tagaaaattg acaaactaaa
tgaaacaact gtttgcatat attggacaat 152400gggcaataca gggaaaccat ggaaaccaaa
cagagcccag tagtcttgct gaacgaaaga 152460gttaaatatc aaagttcagg ccaggtgcag
tggctcacgc ctgtaatccc agcactttgg 152520gaggccaagg cgggtgaatc acttgaggtc
aggagttcaa gaccagcctg gccaacatgg 152580tgaaaccctg tcttagccgg gtgtggtggc
aggcacctgt aatcccaact atttgggagg 152640ctgaggcagg agaatcgctt gaaccaggga
ggcggaggtt gcagtgagcc gagatcacac 152700cactgcactc cagcctgggc gacgagcgaa
accccatttc aaaaaaaaaa tcaaagttca 152760gagagctcaa tttgagtaga agttgtagga
taaggtagca gaaaagagga agctgcccag 152820aaagaaagcc gtagagatat ttagagagat
tcccatggat ccttggccta ggagtgatct 152880gtatatgtgt ggggtgaaaa cgcatgtgtc
caggtagaga accccccaga aattagtagg 152940ctgaatgatt gctggaacat agggctaaga
aaagttcatg gccagaagga tctggccaga 153000gtagagagac ttagtaatac acaaggcatt
gggtagtgtc ttcacagagg ttatgcctta 153060ctactgaaga taaattagtc ctagagtaca
agcacctgaa ccaagtttca aagcaaattt 153120ttaaagggtc aaattaccta acaactgcat
gccaaaacaa aggcctaacc ctctttacag 153180taacacaaca aaattcagca cttcacagtg
taaagttaga atgtctgacg tccaggctgg 153240gcgcagtggc tcatgcctgt aatcccagca
ctttgggagg ccgaggcagg tagatgacct 153300gaggtcagga gttcaagacc agcctggcta
acatggtgca accccgtctc tattaaaaat 153360acaaaaactt agccaggcat ggtggccggc
acctgtgatc ccggctactt gggaggctga 153420ggcaggagaa ttgcctgaac ccaggaggtg
aaggttgcag tgagccgaga tcgcaccact 153480gcactctggt ctgggcaaaa agagcaaaac
tcaggctcaa aaaaaaaaaa gaatgtctga 153540cgtcaatcac aaattaccaa gcatgacatg
aagttgacct ataaccagga gaaaactcaa 153600tctatagaaa cagacccaga tgtgagaaag
atgatgaatt tagcagacaa agaccatcaa 153660gtggctattt taaatattaa aaatatgttc
aagtggccag gtgcagtggc tcatgcctgt 153720aatcccagca ctttgggagg ccaaggtggg
taggagttca agaccagctt ggccaatatg 153780gtgaaacccc ttctctacta aaaatacaaa
aaaattagct gggcatggtg gcaggtgcct 153840atagtcccag ctatatggga ggctgaggca
caagaatcac ttgaacccgg gaggtggagg 153900ttgaggttgc agtaagccga gattgtgcca
cttgtactcc agcctggaca acagagtgag 153960actctgtctc aaaaaaaaaa aaaaaaaagt
taaagaaaac aagagtataa tgagaaaaat 154020gcaaaatagt tttaaaagaa ccaaatggaa
tttcttaaaa taaaaaatac cagaaatggg 154080ggccgggcgt ggtagctcac gtctataatc
ccagcacttt gtgggggctg aggcaggcag 154140atcacctgag atcggtagtt caaggccagc
ctgaccaaca tggagaaacc tcatctctac 154200taaaaataca aaattagctg ggcgtggtgg
cgcattgcct gtaatcccag ctacttggga 154260ggctgaggca ggagaattgc ttgaacccgg
gaggcagagg ttgcggtgag ctgagattgc 154320accagtgcac tccagcttgg gccacaagag
tgaaactccg tctcaaaaaa aaaacaaaaa 154380aaaacagtag actcgaagaa ctagctgagt
ttttctttac tttaggcagt aagtgtgacc 154440ttttgcaggt gactacttta gttcctcatg
tcctcattag tagatcagag aaattcgaca 154500ccaaaacccc aaaagaaaaa ccccttctaa
tcctcattcc atgattttat gaatgcatga 154560agtcctaggc ctgcgaagga atactcattc
tctttatcct gtgttgatac ctctctgctt 154620caacctccaa ctcgacattt gcctatagga
tgtacttgga cattcagcat aaactacctc 154680acaccattac tgaattgctt catgtgcaca
tgtcccatgc cacaataccg gggaccttgt 154740cttccgtgat atttgtccgc agtgctgtga
ctacaggagg gagtcagtga atgtctgcat 154800gtgtgtcttt accatccctc ttgaatatgc
tctagggtta attcctagaa gtagaattac 154860tctattgaaa attggcaata tttttcattc
taatatctat tgccaacatg ggaaagcaag 154920tctggatgcc agtccttgtt atatgcccct
tgggtaagtt acgtaacctc tttaagcttc 154980tgttcactca tattttaaca aggaaaatta
caatatttta cctcacaaaa ttgtagtcag 155040cttctggctg tcttaaactc tggtatatag
taaacactaa gtgttggtgt ccatccttaa 155100tttgtaataa taggtcactt gttagagaaa
tgcaccttac cattttcttt tcttttcttt 155160tttcagttat gactcaaaac ttgagataaa
ggaaatctgc ttgtgaaaaa taagagaact 155220tttttccctt ggttggattc ttcaacacag
ccaatgaaaa cagcactata tttctgatct 155280gtcactgttg tttccaggag agaatgggag
acaatcctag acttccacca taatgcagtt 155340acctgtaggc ataattgatg cacatgatgt
tcacacagtg agagtcttaa agatacaaaa 155400tggtattgtt tacattacta gaaaattatt
agttttccaa tggcaataac ccatttatga 155460gagtgtttta gcctactgga atagacaggg
accacatcct ctgggaagca gataagcata 155520gaactgatac ttgatgcaca ctcgtagtgg
taactcatcc ctaatcagca ttgtaaagca 155580ggtgccagag gtggtttgct ttgtccttcc
aaagcaggtg agtcagcccc accgagagcc 155640aggcagcttt gagtggcagc gtggtgctag
cagcttcagc ggaacagggt gagagttaat 155700tatgcagtct tcttgacagc ggcattaatt
tggaaggaaa ctgacaagtc atgggtcaag 155760tttcagtgac ttcctccttc ctctgatggc
agtatatagt tttcacattt taattcctcc 155820tcctgagatg cactatactt aaaaccattc
tctcccctgc taacagaagg gtgtgaatct 155880ggtttacttt gagcattagg atttgcccct
ttggaattct gcactccagt tacttaactt 155940tcccttcaga atacatgtgg aaagaaagaa
agaaatagcg atgactccac ttttgcccct 156000gtggcacctt gaacaaagca gttcttccca
aattatactt tttttttttt taaataaggt 156060gagcaggatg actggggaga gagaaacatt
tgactttgac tgcctccccc attctttgct 156120gtgagctgga aagtgtgcag ttggtcgtct
ttcttctcct ttctttagga tagtaagaga 156180ctcactcact gcacttctgc tcagttggct
tctgcatcgg gatcacacag ccatcagcag 156240gactgcccag ttggtgagca cactccattg
accacgcggc gccagcgctt cctcaatgca 156300catgattgag aggaaagaaa gttctcttag
atgttactgc ttttgctcag actttgcaaa 156360aaaaaaaata tatatatata tgtataaata
tataattatt aatcactttt gtccttgaga 156420aagtcttgaa tgaacagaga atttattcca
ttgcaatatt tgattgtata gaggcacact 156480gtttcatcga cagaagaagc aaaaaggctt
tgtgtaagtt tttggtacta tgtaccacct 156540ctgttattct tttaaagctg aagtattcat
gtacttaaac catattatat ttaattgtgt 156600ttgattttaa aatatatata tatgaattct
atttaaaatt gtgtcaactt tctgctttca 156660gggcatttat ggctcttctg ttgaaatata
ttgatctttc caaatatttt catttgcttt 156720ctaaaaaccc agaacatgag ccactactgg
actttgcctt gtgtttgaag tgtatggcat 156780aaacccaagg tttttattag tcatctatgc
tgtgattaat tcattttgtt cttttaacaa 156840aatatttcca tccacttcac attgcttcaa
tctttaacag aaaagcaata taaaggttat 156900agaataaaat gtggttttgg gcaactcttg
ctgcctctgc atgttttgga ataacaattt 156960ctacaagact ctaggctgtt taaactagtg
ctttcagtta agataaattc taatcatttc 157020tttgtatata cattttgtgc ttctgagcta
gagatgccaa gtagttgtaa actgcttata 157080aagagaatag cagcaaattt gagactcggc
tacttttttc tgccccacct gctttgagac 157140acagaagcgg agtgtggccc gaaattatta
gccagattta atatttgatc taaagtaggt 157200ccttgtactc attttaaagt tggaatttga
ttcctccaac attgagcacc caccatgttc 157260caggctctgt gcattgtgcc cacaaaataa
gattccctgg tggagttttt atgggttcaa 157320ataatcagtt gaacaccctt catctttatc
atgttgttga cattgacaca aattgtttaa 157380aaagaaaaga tattagagag aaagtggtac
ctttgtaact tgatgtgtct tcatcattcg 157440gtaagatttg atgaaagtaa aaagcaaatg
tcagccaaat ccagtgaaca gcaataaaac 157500agggagtaac tttttataac tttttctact
tggatttcaa cattcagtag agcttttcga 157560aatgtaagta gtttacagta ctggaggttt
gactagttca gtaggaattt ggaggggaag 157620gtcattctga attgtaacaa agtacaaact
tctttgctgt tttatttaag tactgagagc 157680taagcacctg atgaagtgac tgacctctct
ccagtgacag tgtttgggta cctgcctgac 157740ttcaggagtg gggtttatgt ttctacacag
tgaccttttc tctcgccctc tcctccctct 157800tgcccacaca ccagttgatt ggacctgggt
tgaactcctg atccagacag gcccaagaca 157860gttcttaatg ttaagaattt tggggccggg
cacggtggct catgcctgta attgcaacac 157920tttgggaggc cgagacaggc ggatcacttg
aggtcagggg ttcgaggcca gcctggccaa 157980catggtgaaa ccctgtcttt actaaaaata
caaaaattag ctgggcatgg tggcgcacgc 158040ctgtaatccc agctacgtgg gtggctgaga
caggggaatc gcttgaacct ggaggcggag 158100gttgtgcaat gagccgagac cgtgtcactg
cattccagcc tgggtgacag agggagactc 158160tgtctccaaa aataaaaata agaaaaagaa
ttttgggcta ggtgcagtgg ctcacgcctg 158220taattacagc attttggaag gcccaagatg
ggcagatcac ttgaggacag gagttcgaga 158280ccagcctgga caacatggtg aaactccatc
tctactaaaa agacaaaagt tagccagatg 158340tggtgatggg cacctataat cctagctcct
cgggaggctg gggcaggaga atcacttgaa 158400cccaggaagc agagattgca gtgagccaag
atcacatctc tgcactccag cctgggcaac 158460agagcaagac tctgtctcaa aaaaaaaaga
atttggccag gcgcagtggt tcacgcctgt 158520aatcccagca ctttgggagg ccaaggcagg
cagatcacga ggtcaggaga tcgagattgt 158580cctggctaac atggtgaaac cctgtctcta
ctaaaaatac aaaacattag ccgggtgtgg 158640tggtgggcac ctgtagtccc agctactagg
gaggctgagg cagaggaagg atgtgaaccc 158700aggaggcgga gcttgcagta agccaagatc
gtgccactgc actacagtct gggcgacaga 158760gtgagactcc gtctcaaaaa aaaaaagaat
tttggccggg tgcggtggca catgcctgta 158820gtcccagcac tttgggagac caaagtgggc
ggattacctg aggtcaggag ttcaagacca 158880gtccggccaa tatggcgaaa ccctgtctct
tactaaaaaa aatacaaaaa ttagccaggt 158940gtggtggcgg gcacctgggg aggctgaggc
agggagaaat gcttgaaccg gggaggcaga 159000ggttgcagta agccaagatc gtgccactgc
actccagagc aagactcttt ctcaaaaaaa 159060aaaaaaaaag aattttgcat ggggaaggag
agatactgtt caccatctgg aatggtgctt 159120ggatgtggca cttacaaaat caggagccag
cactgcatgg acaaacagaa gcatgtgggc 159180ctgagatagc aggtaccttg ataaccctga
agacatcctt ggtttctgca tctattcctg 159240catccttgca ttggactaca ttaatctgtc
agttatcctt ataatgattt ttgatttttt 159300ttttttgaga tggagtttcg ctcttgttgc
ccaggctgga gtgcaatggc acgatctcgg 159360ctcaccacaa cctccacctc ccaggttcaa
gtgattctgc tgcctcagcc tcctgagtaa 159420ctgggattac aggcatgcgc caccacacct
ggctaatttt gtatttttag tagagacggg 159480gtttctccat gttggtcagg ctggtctcga
actcccaacc tcaggtgatc accctgtctc 159540ggcctcccaa agtgctggga ttacaggcgt
aagccatggt acccggtctg ttttttgatt 159600ttttgaaacc agtctgaagt gagttttttt
aattacgtga aaggagtttg gctaaaatac 159660tgccatactg ccctaatgcc taatgattat
gtattctcag catgtctgca aagtactgct 159720gatttctgga gaataatttt tctttagtaa
acttcactta agtcgtcatg tgtattctct 159780caaaatggta tcctaaccta atggagctaa
aagacacccc ttgtttttat aacaagcagt 159840tactgaggcc caggaagggg agaagtccct
ggcttgtgag atgatcacca ttagaactca 159900ggcctgggcc agtgcctttt catgcttctc
agatccttcc aaagaataat gaagattata 159960accgctttta gcaattgtaa taaacccaga
aatagaaagc tttttggtta gagtactggt 160020agaagtttgg cgggagagat aatttttaca
aaatttgtaa atacctgcca attctatata 160080ctaggcaagg tctctggcct tgtaaaaccc
ctcaaggtta caactttggt ggcccacact 160140aatagttacc cactgaggcc ctctccgggt
gaacattgag cactagagga agcccctctg 160200cttgggcagg actgggcgtg gtgcagagta
ggagcggtga tactgtggat tctgggcagg 160260tggagatggc cagtgatgtc caataaagga
cactggaggg agcagtgtga gtaaaggccc 160320tgagggcatt catgttcagg gagggttgct
gcccactggc ttgcttggca cacaggagag 160380tgggtattcc tgccttagta actttatgta
aacaagtatt tcctcagtct gttcctctca 160440aactgcctgc tctggcacat tcagaatgtc
acagaactca cctggatgca ttcagcccct 160500tgcctaaagg tgacagtgca tctccttccc
caccccaccc ctcataccac tgaagcacct 160560gtcagactgg cccagtctgt gggcaaggag
cctagagagg gcttagtttc agcttgaaag 160620gagctgggat ttaccaagaa gcaaatgaga
gacgaggatt gcaacaactg tgccatttcc 160680ccagcttcag ctgactcctg tatattgact
gtgccttcag actcatccgt aagtgacccc 160740aggctggcct ctcccacatc acagtaagaa
ttccacacac catacaactt ggaaagaggc 160800tccagctgaa ggaagcccca cacttctttc
aagtttttct tagtcttctc ttcttggcaa 160860agagtacctt ttgtttcttc taattatgta
actattggtt tagtaaatat tcacccattc 160920agtcaccctg taagtggcag gcactgttta
cagggacaca ggaaggaata aaaacttgca 160980ggcaccttgg agcttgcatt ctattgaaga
ggtaatggaa gttgggatag cagctaaact 161040atgctggtat tggccaggcg cagtggctca
cacctgtaat cccagcactt tggaggccaa 161100ggtgggcaga tcatgaagtc aggagatcga
gaccatcctg gctaacatgg tgaaaccccg 161160tctctactaa aagtaaaaaa aaaaattagc
caggtgtggt ggcgggcgcc tgtagtccca 161220gctacttggg aggctgaggc aggagaatgg
tgtgaaccca ggaggcgaag attgcagtga 161280gccgagatgg caccactgca ctccagcctg
ggtgacagag cgagactctg tctcagaaaa 161340aaaaaatatg ctggtagttt tgattcaaga
tggcctttgg agcccatgat ttaggtctcg 161400tacccaccaa ggtctactgg aaaacatcag
gctctcctgc tatagaccca tagggagagc 161460tgcagccgag agggggagct gaagagaagt
gccccttctg tgtcctgtca gcctcatcct 161520tccgcaagga ccagttgctg tgccactcca
ttcacttgct gcaagactgg aggtttttcc 161580tcaggtgttg agcacctggt ttacaagatg
tcagcatctt gatgcctgag accatcaagg 161640caagtctctg aacagggctt accttagagt
aaggcttaga agaggccgta aagtcagtct 161700cagctccgtg gctctgcaga gctttgggac
atgtgaattc ttaaaaacaa gactattgta 161760cagttactat atgcatgcag tataaaatta
taaccttgga aaatcctagc tagctgttga 161820gctaattcca taaagtaatc agctcctgag
ttctgcagtg gtaataataa tcagcataat 161880gagtaaacac tgtgtgtgcc aggcagcgtc
tcatttgatc cttgtgataa tcttgtaagt 161940actgattttc tcccttcttt aaacaaagtt
tttttttttt ttttagagag ggtctcacta 162000tgttgcccag gctagtcttg aattc
162025371350DNAHomo sapiensCDS(213)..(917)
37gcggccgcgt cgacgtgaca gccggtacgc ccgggtttgg gcaacctcga ttacgggcgg
60cctccaggcc cgccagcagc gccccgcgcc gcccgcccgc gcccctgccg ccccccggtt
120ccggccgcgg accccactct ctgccgttcc ggctgcggct ccgctgccgg tagcgccgtc
180ccccgggacc acccttcggc tggcgccctc cc atg ctc tcg gcc acc cgg agg
233 Met Leu Ser Ala Thr Arg Arg
1 5
gct tgc cag ctc ctc ctc ctc cac agc ctc ttt ccc gtc ccg agg atg
281Ala Cys Gln Leu Leu Leu Leu His Ser Leu Phe Pro Val Pro Arg Met
10 15 20
ggc aac tcg gcc tcg aac atc gtc agc ccc cag gag gcc ttg ccg ggc
329Gly Asn Ser Ala Ser Asn Ile Val Ser Pro Gln Glu Ala Leu Pro Gly
25 30 35
cgg aag gaa cag acc cct gta gcg gcc aaa cat cat gtc aat ggc aac
377Arg Lys Glu Gln Thr Pro Val Ala Ala Lys His His Val Asn Gly Asn
40 45 50 55
aga aca gtc gaa cct ttc cca gag gga aca cag atg gct gta ttt gga
425Arg Thr Val Glu Pro Phe Pro Glu Gly Thr Gln Met Ala Val Phe Gly
60 65 70
atg gga tgt ttc tgg gga gct gaa agg aaa ttc tgg gtc ttg aaa gga
473Met Gly Cys Phe Trp Gly Ala Glu Arg Lys Phe Trp Val Leu Lys Gly
75 80 85
gtg tat tca act caa gtt ggt ttt gca gga ggc tat act tca aat cct
521Val Tyr Ser Thr Gln Val Gly Phe Ala Gly Gly Tyr Thr Ser Asn Pro
90 95 100
act tat aaa gaa gtc tgc tca gaa aaa act ggc cat gca gaa gtc gtc
569Thr Tyr Lys Glu Val Cys Ser Glu Lys Thr Gly His Ala Glu Val Val
105 110 115
cga gtg gtg tac cag cca gaa cac atg agt ttt gag gaa ctg ctc aag
617Arg Val Val Tyr Gln Pro Glu His Met Ser Phe Glu Glu Leu Leu Lys
120 125 130 135
gtc ttc tgg gag aat cac gac ccg acc caa ggt atg cgc cag ggg aac
665Val Phe Trp Glu Asn His Asp Pro Thr Gln Gly Met Arg Gln Gly Asn
140 145 150
gac cat ggc act cag tac cgc tcg gcc atc tac ccg acc tct gcc aag
713Asp His Gly Thr Gln Tyr Arg Ser Ala Ile Tyr Pro Thr Ser Ala Lys
155 160 165
caa atg gag gca gcc ctg agc tcc aaa gag aac tac caa aag gtt ctt
761Gln Met Glu Ala Ala Leu Ser Ser Lys Glu Asn Tyr Gln Lys Val Leu
170 175 180
tca gag cac ggc ttc ggc ccc atc act acc gac atc cgg gag gga cag
809Ser Glu His Gly Phe Gly Pro Ile Thr Thr Asp Ile Arg Glu Gly Gln
185 190 195
act ttc tac tat gcg gaa gac tac cac cag cag tac ctg agc aag aac
857Thr Phe Tyr Tyr Ala Glu Asp Tyr His Gln Gln Tyr Leu Ser Lys Asn
200 205 210 215
ccc aat ggc tac tgc ggc ctt ggg ggc acc ggc gtg tcc tgc cca gtg
905Pro Asn Gly Tyr Cys Gly Leu Gly Gly Thr Gly Val Ser Cys Pro Val
220 225 230
ggt att aaa aaa taattgctcc ccacatggtg ggcctttgag gttccagtaa
957Gly Ile Lys Lys
235
aaatgctttc aacaaattgg gcaatgcttg tgtgattcac aatcgtggca tttaaagtgc
1017acaaagtaca aaggaattta tacagattgg gtttaccgaa gtataatcta taggaggcgc
1077gatggcaagt tgataaaatg tgacttatct cctaataagt tatggtggga gtggagctgt
1137gcggtttcct gtgtcttctg gggtctgagt gaagatagca gggatgctgt gttcaccctt
1197cttggtagaa gctaaggtgt gagctgggag gttgctggac aggatggggg accccagaag
1257tcctttatct gtgctctctg cccgccagtg ccttacaatt tgcaaacgtg tatagcctca
1317gtgactcatt cgctgaaatc cttcgcttta cca
135038235PRTHomo sapiens 38Met Leu Ser Ala Thr Arg Arg Ala Cys Gln Leu
Leu Leu Leu His Ser 1 5 10
15 Leu Phe Pro Val Pro Arg Met Gly Asn Ser Ala Ser Asn Ile Val Ser
20 25 30 Pro Gln
Glu Ala Leu Pro Gly Arg Lys Glu Gln Thr Pro Val Ala Ala 35
40 45 Lys His His Val Asn Gly Asn
Arg Thr Val Glu Pro Phe Pro Glu Gly 50 55
60 Thr Gln Met Ala Val Phe Gly Met Gly Cys Phe Trp
Gly Ala Glu Arg 65 70 75
80 Lys Phe Trp Val Leu Lys Gly Val Tyr Ser Thr Gln Val Gly Phe Ala
85 90 95 Gly Gly Tyr
Thr Ser Asn Pro Thr Tyr Lys Glu Val Cys Ser Glu Lys 100
105 110 Thr Gly His Ala Glu Val Val Arg
Val Val Tyr Gln Pro Glu His Met 115 120
125 Ser Phe Glu Glu Leu Leu Lys Val Phe Trp Glu Asn His
Asp Pro Thr 130 135 140
Gln Gly Met Arg Gln Gly Asn Asp His Gly Thr Gln Tyr Arg Ser Ala 145
150 155 160 Ile Tyr Pro Thr
Ser Ala Lys Gln Met Glu Ala Ala Leu Ser Ser Lys 165
170 175 Glu Asn Tyr Gln Lys Val Leu Ser Glu
His Gly Phe Gly Pro Ile Thr 180 185
190 Thr Asp Ile Arg Glu Gly Gln Thr Phe Tyr Tyr Ala Glu Asp
Tyr His 195 200 205
Gln Gln Tyr Leu Ser Lys Asn Pro Asn Gly Tyr Cys Gly Leu Gly Gly 210
215 220 Thr Gly Val Ser Cys
Pro Val Gly Ile Lys Lys 225 230 235
39481DNAHomo sapiens 39ggcattattg gactgtaggt ttttattaaa acaaacattt
ctcatagctc taagcaaagc 60attagaattc atcaagcgga ctcacatctt ttctctgcac
agagaggggc tgaaaaggga 120gagaaagtcc cttatgtatg tctagatttg gtaaagcgaa
ggatttcagc gaatgagtca 180ctgaggctat acacgtttgc aaattgtaag gcactggcgg
gcagagagca cagataaagg 240acttctgggg tcccccatcc tgtccagcaa cctcccagct
cacaccttag cttctaccaa 300gaagggtgaa cacagcatcc ctgctatctt cactcagacc
ccagaaaacc cagggaaacc 360cgacagctcc actcccacca taacttatta ggagataagt
cacattttat caacttgcca 420tcgcgcctcc tatagattat acttcggtaa acccaatctg
tataaattcc tttgtacttt 480g
48140390DNAHomo sapiens 40ttttttttat tggactgtag
gtttttatta aaacaaacat ttctcatagc tctaagcaaa 60gcattagaat tcatcaagcg
gactcacatc ttttctctgc acagagaggg ctgaaaaggg 120agagaaagcc ccttatgtat
gtctagattt ggtaaagcga aggatttcag cgaatgagtc 180actgaggcta tacacgtttg
caaattgtaa ggcactggcg ggcagagagc acagataaag 240gacttttggg ggtcccccat
tcctgtccag caacctccca gctcacacct tagcttctac 300caagaagggg tgaacacagc
atccctgcta tcttcactca gacccccaga agacacagga 360aaccgcacag ctccactccc
accataactt 3904143DNAArtificial
SequenceDescription of Artificial Sequence Synthetic oligonucleotide
41agcggataac aatttcacac agggagctag cttggaagat tgc
434222DNAArtificial SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 42gtccaatata tgcaaacagt tg
224323DNAArtificial SequenceDescription of Artificial
Sequence Synthetic oligonucleotide 43agcggataac aatttcacac agg
234418DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
44actgagcctg ctgcataa
184521DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 45tctcaatcat gtgcattgag g
214643DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 46agcggataac aatttcacac agggatcaca cagccatcag cag
434723DNAArtificial SequenceDescription of Artificial
Sequence Synthetic oligonucleotide 47agcggataac aatttcacac agg
234819DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
48ctggcgccca cgtggtcaa
194919DNAArtificial SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 49tttctctgca cagagaggc
195044DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 50agcggataac aatttcacac agggctgaaa
tccttcgctt tacc 445123DNAArtificial
SequenceDescription of Artificial Sequence Synthetic oligonucleotide
51agcggataac aatttcacac agg
235218DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 52ctgaaaaggg agagaaag
185320DNAArtificial SequenceDescription of Artificial Sequence
Synthetic oligonucleotide 53tcccaaagtg ctggaattac
205422DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 54gtccaatata tgcaaacagt tg
225520DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
55cccacagcag ttaatccttc
205618DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 56gcgctcctgt cggtgcca
185718DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 57gcctgactgg tggggccc
185815DNAArtificial SequenceDescription of Artificial
Sequence Synthetic oligonucleotide 58catgcatgca cggtc
155930DNAArtificial
SequenceDescription of Artificial Sequence Synthetic oligonucleotide
59cagagagtac ccctcgaccg tgcatgcatg
306015DNAArtificial SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 60catgcatgca cggtt
156130DNAArtificial SequenceDescription of Artificial
Sequence Synthetic oligonucleotide 61gtacgtacgt gccaactccc
catgagagac 306214DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
62catgcatgca cggt
146318DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 63gcctgactgg tggggccc
186426DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 64gtgctgcagg tgtaaacttg taccag
266528DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 65cacggatccg gtagcagcgg tagagttg
286619DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 66actgggcatg tggagacag
196718DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
67gcactttctt gccatgag
186814DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 68tcagtcacga cgtt
146914DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 69cggataacaa tttc
147037DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 70caatttcatc gctggatgca atctgggcta tgagatc
377137DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 71caatttcaca cagcggatgc
ttcttttggc tctgact 377240DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
72tcagtcacga cgttggatgc caataaaagt gactctcagc
407337DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 73cggataacaa tttcggatgc actgggagca ttgaggc
377438DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 74tcagtcacga cgttggatga gcagatccct ggacaggc
387538DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 75cggataacaa tttcggatgg acaaaatacc
tgtattcc 387636DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
76tcagtcacga cgttggatgc agagcagctc cgagtc
367732DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 77cagcggtgat cattggatgc aggaagctct gg
327838DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 78tcagtcacga cgttggatgc ccacatgcca cccactac
387935DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 79cggataacaa tttcggatgc ccgtcaggta ccacg
358037DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 80tcagtcacga cgttggatgc
ccacagtgga gcttcag 378122DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
81gctcatacct tgcaggatga cg
228236DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 82tcagtcacga cgttggatga ccagctgttc gtgttc
368334DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 83tacatggagt tcggggatgc acacggcgac tctc
348440DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 84tcagtcacga cgttggatgg ggaagagcag
agatatacgt 408529DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
85gaggggctga tccaggatgg gtgctccac
298630DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 86tgaagcactt gaaggatgag ggtgtctgcg
308738DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 87cggataacaa tttcggatgc tgcgtgatga tgaaatcg
388826DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 88gatgaagctc ccaggatgcc agaggc
268927DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 89gccgccggtg taggatgctg ctggtgc
279031DNAArtificial
SequenceDescription of Artificial Sequence Synthetic oligonucleotide
90cgcagggttt cctcgtcgca ctgggcatgt g
319143DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 91tgcttatccc tgtagctacc ctgtcttggc cttgcagatc caa
439242DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 92agcggataac aatttcacac aggccatcac accgcggtac tg
429344DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 93cccagtcacg acgttgtaaa acgtcttggc
cttgcagatc caag 449442DNAArtificial
SequenceDescription of Artificial Sequence Synthetic primer
94agcggataac aatttcacac aggccatcac accgcggtac tg
429520DNAArtificial SequenceDescription of Artificial Sequence Synthetic
primer 95ctccagctgg gcaggagtgc
209617DNAArtificial SequenceDescription of Artificial Sequence
Synthetic primer 96cacttcagtc gctccct
179723DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 97cccagtcacg acgttgtaaa acg
2398100DNAHomo sapiens 98cctttgagaa
agggctctgc ttgagttgta gaaagaaccg ctgcaacaat ctgggctatg 60agatcaataa
agtcagagcc aaaagaagca gcaaaatgta 10099100DNAHomo
sapiens 99cctttgagaa agggctctgc ttgagttgta gaaagaaccg ctgcaacaat
ctgggctatg 60agatcagtaa agtcagagcc aaaagaagca gcaaaatgta
100100100DNAHomo sapiens 100gaattatttt tgtgtttcta aaactatggt
tcccaataaa agtgactctc agcgagcctc 60aatgctccca gtgctattca tgggcagctc
tctgggctca 100101100DNAHomo sapiens
101gaattatttt tgtgtttcta aaactatggt tcccaataaa agtgactctc agcaagcctc
60aatgctccca gtgctattca tgggcagctc tctgggctca
10010284DNAHomo sapiens 102taataggact acttctaatc tgtaagagca gatccctgga
caggcgagga atacaggtat 60tttgtccttg aagtaacctt tcag
8410384DNAHomo sapiens 103taataggact acttctaatc
tgtaagagca gatccctgga caggcaagga atacaggtat 60tttgtccttg aagtaacctt
tcag 84104100DNAHomo sapiens
104ctcaccatgg gcatttgatt gcagagcagc tccgagtccg tccagagctt cctgcagtca
60atgatcaccg ctgtgggcat ccctgaggtc atgtctcgta
100105100DNAHomo sapiens 105ctcaccatgg gcatttgatt gcagagcagc tccgagtcca
tccagagctt cctgcagtca 60atgatcaccg ctgtgggcat ccctgaggtc atgtctcgta
100106100DNAHomo sapiens 106agcaaggact cctgcaaggg
ggacagtgga ggcccacatg ccacccacta ccagggcacg 60tggtacctga cgggcatcgt
cagctggggc cagggctgcg 100107100DNAHomo sapiens
107agcaaggact cctgcaaggg ggacagtgga ggcccacatg ccacccacta ccggggcacg
60tggtacctga cgggcatcgt cagctggggc cagggctgcg
100108100DNAHomo sapiens 108caataactct aatgcagcgg aagatgacct gcccacagtg
gagcttcagg gcgtggtgcc 60ccggggcgtc aacctgcaag gtatgagcat accccccttc
100109100DNAHomo sapiens 109caataactct aatgcagcgg
aagatgacct gcccacagtg gagcttcagg gcttggtgcc 60ccggggcgtc aacctgcaag
gtatgagcat accccccttc 100110100DNAHomo sapiens
110ttgaagcttt gggctacgtg gatgaccagc tgttcgtgtt ctatgatcat gagagtcgcc
60gtgtggagcc ccgaactcca tgggtttcca gtagaatttc
100111100DNAHomo sapiens 111ttgaagcttt gggctacgtg gatgaccagc tgttcgtgtt
ctatgatgat gagagtcgcc 60gtgtggagcc ccgaactcca tgggtttcca gtagaatttc
100112100DNAHomo sapiens 112ggataacctt ggctgtaccc
cctggggaag agcagagata tacgtgccag gtggagcacc 60caggcctgga tcagcccctc
attgtgatct gggagccctc 100113100DNAHomo sapiens
113ggataacctt ggctgtaccc cctggggaag agcagagata tacgtaccag gtggagcacc
60caggcctgga tcagcccctc attgtgatct gggagccctc
10011480DNAHomo sapiens 114tgaagcactt gaaggagaag gtgtctgcgg gagccgattt
catcatcacg cagcttttct 60ttgaggctga cacattcttc
8011580DNAHomo sapiens 115tgaagcactt gaaggagaag
gtgtctgcgg gagtcgattt catcatcacg cagcttttct 60ttgaggctga cacattcttc
8011680DNAHomo sapiens
116tccagatgaa gctcccagaa tgccagaggc tgctccccgc gtggcccctg caccagcagc
60tcctacaccg gcggcccctg
8011780DNAHomo sapiens 117tccagatgaa gctcccagaa tgccagaggc tgctcccccc
gtggcccctg caccagcagc 60tcctacaccg gcggcccctg
8011848DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 118cagagagtac
ccctcaaccg tgcatgcatg aaacatgcat gcacggtt
48119361DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 119ctgaggacct ggtcctctga ctgctctttt
cacccatcta cagtccccct tgccgtccca 60agcaatggat gatttgatgc tgtccccgga
cgatattgaa caatggttca ctgaagaccc 120aggtccagat gaagctccca gaatgccaga
ggctgctccc cccgtggccc ctgcaccagc 180agctcctaca ccggcggccc ctgcaccagc
cccctcctgg cccctgtcat cttctgtccc 240ttcccagaaa acctaccagg gcagctacgg
tttccgtctg ggcttcttgc attctgggac 300agccaagtct gtgacttgca cggtcagttg
ccctgagggg ctggcttcca tgagacttca 360a
361120205DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
120gcgctccatt catctcttca tcgactctct gttgaatgaa gaaaatccaa gtaaggccta
60caggtgcagt tccaaggaag cctttgagaa agggctctgc ttgagttgta gaaagaaccg
120ctgcaacaat ctgggctatg agatcagtaa agtcagagcc aaaagaagca gcaaaatgta
180cctgaagact cgttctcaga tgccc
205121161DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 121gtccgtcaga acccatgcgg cagcaaggcc
tgccgccgcc tcttcggccc agtggacagc 60gagcagctga gacgcgactg tgatgcgcta
atggcgggct gcatccagga ggcccgtgag 120cgatggaact tcgactttgt caccgagaca
ccactggagg g 161
User Contributions:
Comment about this patent or add new information about this topic: