Patent application title: TERMINATOR SEQUENCE FOR GENE EXPRESSION IN PLANTS
Inventors:
Shane E. Abbitt (Ankeny, IA, US)
Jung Rudolf (Rupprechtstegen, DE)
Assignees:
PIONEER HI-BRED INTERNATIONAL, INC.
IPC8 Class: AC12N1582FI
USPC Class:
1 1
Class name:
Publication date: 2021-11-11
Patent application number: 20210348180
Abstract:
The present invention discloses polynucleotide sequences that can be used
to regulate gene expression in plants. Terminator sequences from Sorghum
bicolor that are functional in plants are disclosed.Claims:
1. A recombinant construct comprising a polynucleotide sequence operably
linked to a heterologous polynucleotide sequence, wherein the
polynucleotide sequence comprises: (a) a nucleotide sequence comprising
the sequence set forth in SEQ ID NO:1 or SEQ ID NO:18; (b) a nucleotide
sequence comprising a sequence with at least 95% identity to the sequence
set forth in SEQ ID NO:1 or SEQ ID NO:18; or (c) a nucleotide sequence
comprising a functional fragment of either (a) or (b); wherein the
polynucleotide sequence functions as a transcriptional terminator in a
plant cell.
2. The recombinant construct of claim 1 wherein the polynucleotide is operably linked to a promoter.
3. A plant comprising the recombinant construct of claim 1.
4. The plant of claim 3 wherein the plant is a monocot.
5. The plant of claim 4 wherein the plant is a maize plant.
6. A seed comprising the recombinant construct of claim 1.
7. The seed of claim 6 wherein the seed is from a monocot plant.
8. The seed of claim 7 wherein the seed is from a maize plant.
9. A method of expressing a heterologous polynucleotide in a plant, comprising the steps of: (a) introducing into a regenerable plant cell the recombinant construct of claim 2; (b) regenerating a transgenic plant from the regenerable plant cell of step (a), wherein the transgenic plant comprises the recombinant construct of claim 2; and (c) obtaining a progeny plant from the transgenic plant of step (b), wherein the progeny plant comprises the recombinant construct of claim 2 and exhibits expression of the heterologous polynucleotide.
10. The method of claim 9, wherein the plant is a monocot plant.
11. The method of claim 10, wherein the plant is a maize plant.
12. A plant comprising the recombinant construct of claim 2.
13. The plant of claim 12 wherein the plant is a monocot.
14. The plant of claim 13 wherein the plant is a maize plant.
15. A seed comprising the recombinant construct of claim 2.
16. The seed of claim 15 wherein the seed is from a monocot plant.
17. The seed of claim 16 wherein the seed is from a maize plant.
Description:
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the benefit of U.S. Provisional Application No. 61/514,055, filed Aug. 2, 2011, the entire content of which is herein incorporated by reference.
FIELD OF INVENTION
[0002] The present invention relates to the field of plant molecular biology and plant genetic engineering. More specifically, it relates to novel plant terminator sequences and their use to regulate gene expression in plants.
BACKGROUND
[0003] Recent advances in plant genetic engineering have opened new doors to engineer plants to have improved characteristics or traits. These transgenic plants characteristically have recombinant DNA constructs in their genome that have protein coding region operably linked to multiple regulatory regions that allow accurate expression of the transgene. A few examples of regulatory elements that help regulate gene expression in transgenic plants are promoters, introns, terminators, enhancers and silencers.
[0004] Plant genetic engineering has advanced to introducing multiple traits into commercially important plants, also known as gene stacking. This is accomplished by multigene transformation, where multiple genes are transferred to create a transgenic plant that might express a complex phenotype, or multiple phenotypes. But it is important to modulate or control the expression of each transgene optimally. The regulatory elements need to be diverse, to avoid introducing into the same transgenic plant repetitive sequences, which has been correlated with undesirable negative effects on transgene expression and stability (Peremarti et al (2010) Plant Mol Biol 73:363-378; Mette et al (1999) EMBO J 18:241-248; Mette et al (2000) EMBO J 19:5194-5201; Mourrain et al (2007) Planta 225:365-379, U.S. Pat. Nos. 7,632,982, 7,491,813, 7,674,950, PCT Application No. PCT/US2009/046968). Therefore it is important to discover and characterize novel regulatory elements that can be used to express heterologous nucleic acids in important crop species. Diverse regulatory regions can be used to control the expression of each transgene optimally.
[0005] Regulatory sequences located downstream of coding regions contain signals required for transcription termination and 3' mRNA processing, and are called terminator sequences. The terminator sequences play a key role in mRNA processing, localization, stability and translation (Proudfoot, N. (2004) Curr. Op. Cell Biol 16:272-278.; Gilmartin, 2005). The 3' regulatory sequences contained in terminator sequences can affect the level of expression of a gene. Optimal expression of a chimeric gene in plant cells has been found to be dependent on the presence of appropriate 3' sequences (Ingelbrecht, I. L. W. et al (1989) Plant Cell 1:671-680). Read through transcription through leaky terminator of a gene can cause unwanted transcription of one transgene from promoter of another one. Also, bidirectional, convergent transcription of transgenes in transgenic plants can occur due to leaky transcription termination of separate convergent genes or from genomic promoters. Convergent, overlapping transcription can decrease transgene expression, or generate antisense RNA (Bieri, S. et al (2002) Molecular Breeding 10:107-117). This underlines the importance of discovering novel and efficient transcriptional terminators.
SUMMARY
[0006] The present invention relates to regulatory sequences for modulating gene expression in plants. Specifically, the present invention relates to terminator sequences. Recombinant DNA constructs comprising terminator sequences are provided.
[0007] An embodiment of this invention is an isolated polynucleotide sequence comprising: (a) the sequence set forth in SEQ ID NO:1 or SEQ ID NO:18; (b) a sequence with at least 95% sequence identity to SEQ ID NO:1 or SEQ ID NO:18; or (c) a sequence comprising a functional fragment of (a) or (b), wherein the isolated polynucleotide sequence functions as a terminator in a plant cell. Another embodiment of this invention is a recombinant construct comprising an isolated polynucleotide sequence comprising: (a) the sequence set forth in SEQ ID NO:1 or SEQ ID NO:18; (b) a sequence with at least 95% sequence identity to SEQ ID NO:1 or SEQ ID NO:18; or (c) a sequence comprising a functional fragment of (a) or (b), wherein the isolated polynucleotide sequence functions as a terminator in a plant cell. This recombinant construct may further comprise a promoter and a heterologous polynucleotide, wherein the promoter and the heterologous polynucleotide are operably linked to the isolated polynucleotide sequence.
[0008] Another embodiment of this invention is a method of expressing a heterologous polynucleotide in a plant, comprising the steps of (a) introducing into a regenerable plant cell the recombinant DNA construct described above; (b) regenerating a transgenic plant from the regenerable plant cell of (a); and (c) obtaining a progeny plant from the transgenic plant of step (b), wherein the transgenic plant and the progeny plant comprises the recombinant DNA construct and exhibits expression of the heterologous polynucleotide.
[0009] In another embodiment, this invention concerns a vector, virus, cell, microorganism, plant, or seed comprising a recombinant DNA construct comprising the terminator sequences described in the present invention.
[0010] The invention encompasses regenerated, mature and fertile transgenic plants comprising the recombinant DNA constructs described above, transgenic seeds produced therefrom, T1 and subsequent generations. The transgenic plant cells, tissues, plants, and seeds may comprise at least one recombinant DNA construct of interest.
[0011] In another embodiment, the plant or seed comprising the terminator sequences described in the present invention is a monocotyledenous plant or seed. In another embodiment, the plant or seed comprising the terminator sequences described in the present invention is a maize plant or seed.
[0012] In another embodiment, any of the methods of expressing a heterologous polynucleotide, wherein the plant cell is a monocotyledonous plant cell, e.g., a maize plant cell.
BRIEF DESCRIPTION OF DRAWINGS AND SEQUENCE LISTING
[0013] The invention can be more fully understood from the following detailed description and the accompanying drawings and Sequence Listing which form a part of this application. The Sequence Listing contains the one letter code for nucleotide sequence characters and the three letter codes for amino acids as defined in conformity with the IUPAC-IUBMB standards described in Nucleic Acids Research 13:3021-3030 (1985) and in the Biochemical Journal 219 (No. 2): 345-373 (1984), which are herein incorporated by reference in their entirety. The symbols and format used for nucleotide and amino acid sequence data comply with the rules set forth in 37 C.F.R. .sctn. 1.822.
[0014] FIG. 1 shows the map of PHP31801, the vector used for cloning SB-GKAF terminator after amplification.
[0015] FIG. 2 shows the map of PHP34074, the vector used for testing the SB-GKAF terminator.
[0016] FIG. 3 shows the results of testing SB-GKAF terminator compared to PINII terminator in transient assays. It shows quantitative analysis of GUS reporter gene expression in BMS cells transformed with PHP34074 (SB-GKAF terminator) and PHP34005 (PINII terminator).
[0017] FIG. 4A and FIG. 4B show quantitative analysis of GUS reporter gene expression in Gaspe Flint derived maize lines stably transformed with SB-GKAF (PHP34074) and PINII (PHP34005) terminator constructs. FIG. 4A shows GUS reporter gene expression assayed at protein level, and FIG. 4B shows GUS reporter gene expression assayed with qRT-PCR.
[0018] FIG. 5 shows the results of qRT-PCR assays with stably transformed Gaspe Flint derived maize lines, using two sets of primers downstream of the SB-GKAF terminator and the PINII terminator.
[0019] FIG. 6A-6C show the alignment between the cloned SB-GKAF terminator (SEQ ID NO:1) and the nucleotides 1863 to 2322 of NCBI GI NO: 671655 (SEQ ID NO:18). The consensus sequence is show at the top, and the residues that match the consensus exactly are boxed.
[0020] SEQ ID NO:1 is the sequence of the 459 bp SB-GKAF terminator.
[0021] SEQ ID NO:2 and 3 are the sequences of the forward and reverse primers used to amplify SB-GKAF terminator.
[0022] SEQ ID NO:4 is the nucleotide sequence of PHP31801, the vector used for cloning SB-GKAF terminator after PCR amplification.
[0023] SEQ ID NO:5 is the nucleotide sequence of PHP34074, the vector used for testing SB-GKAF terminator.
[0024] SEQ ID NO:6 is the nucleotide sequence of PHP34005, the test vector used as a control with PINII terminator.
[0025] SEQ ID NOS:7-9 are the sequences of the forward primer, reverse primer and probe used for assessing GUS expression by qRT-PCR in transgenic maize plants, as described in Table 2.
[0026] SEQ ID NOS:10-17 are the sequences of the primers used for quantitating read through transcription through SB-GKAF and PINII terminators, by qRT-PCR in transgenic maize plants, as described in Table 3.
[0027] SEQ ID NO:18 corresponds to nucleotides 1863 to 2322 of NCBI GI NO: 671655.
DETAILED DESCRIPTION
[0028] The disclosure of each reference set forth herein is hereby incorporated by reference in its entirety.
[0029] As used herein and in the appended claims, the singular forms "a", "an", and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to "a plant" includes a plurality of such plants, reference to "a cell" includes one or more cells and equivalents thereof known to those skilled in the art, and so forth.
[0030] As used herein:
[0031] The terms "monocot" and "monocotyledonous plant" are used interchangeably herein. A monocot of the current invention includes the Gramineae.
[0032] The terms "dicot" and "dicotyledonous plant" are used interchangeably herein. A dicot of the current invention includes the following families: Brassicaceae, Leguminosae, and Solanaceae.
[0033] The terms "full complement" and "full-length complement" are used interchangeably herein, and refer to a complement of a given nucleotide sequence, wherein the complement and the nucleotide sequence consist of the same number of nucleotides and are 100% complementary.
[0034] "Transgenic" refers to any cell, cell line, callus, tissue, plant part or plant, the genome of which has been altered by the presence of a heterologous nucleic acid, such as a recombinant DNA construct, including those initial transgenic events as well as those created by sexual crosses or asexual propagation from the initial transgenic event. The term "transgenic" as used herein does not encompass the alteration of the genome (chromosomal or extra-chromosomal) by conventional plant breeding methods or by naturally occurring events such as random cross-fertilization, non-recombinant viral infection, non-recombinant bacterial transformation, non-recombinant transposition, or spontaneous mutation.
[0035] "Genome" as it applies to plant cells encompasses not only chromosomal DNA found within the nucleus, but organelle DNA found within subcellular components (e.g., mitochondrial, plastid) of the cell.
[0036] "Plant" includes reference to whole plants, plant organs, plant tissues, plant propagules, seeds and plant cells and progeny of same. Plant cells include, without limitation, cells from seeds, suspension cultures, embryos, meristematic regions, callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and microspores.
[0037] "Propagule" includes all products of meiosis and mitosis able to propagate a new plant, including but not limited to, seeds, spores and parts of a plant that serve as a means of vegetative reproduction, such as corms, tubers, offsets, or runners. Propagule also includes grafts where one portion of a plant is grafted to another portion of a different plant (even one of a different species) to create a living organism. Propagule also includes all plants and seeds produced by cloning or by bringing together meiotic products, or allowing meiotic products to come together to form an embryo or fertilized egg (naturally or with human intervention).
[0038] "Progeny" comprises any subsequent generation of a plant.
[0039] "Transgenic plant" includes reference to a plant which comprises within its genome a heterologous polynucleotide. For example, the heterologous polynucleotide is stably integrated within the genome such that the polynucleotide is passed on to successive generations. The heterologous polynucleotide may be integrated into the genome alone or as part of a recombinant DNA construct.
[0040] The commercial development of genetically improved germplasm has also advanced to the stage of introducing multiple traits into crop plants, often referred to as a gene stacking approach. In this approach, multiple genes conferring different characteristics of interest can be introduced into a plant. Gene stacking can be accomplished by many means including but not limited to co-transformation, retransformation, and crossing lines with different transgenes.
[0041] "Transgenic plant" also includes reference to plants which comprise more than one heterologous polynucleotide within their genome. Each heterologous polynucleotide may confer a different trait to the transgenic plant.
[0042] "Heterologous" with respect to sequence means a sequence that originates from a foreign species, or, if from the same species, is substantially modified from its native form in composition and/or genomic locus by deliberate human intervention. "Polynucleotide", "nucleic acid sequence", "nucleotide sequence", or "nucleic acid fragment" are used interchangeably to refer to a polymer of RNA or DNA that is single- or double-stranded, optionally containing synthetic, non-natural or altered nucleotide bases. Nucleotides (usually found in their 5'-monophosphate form) are referred to by their single letter designation as follows: "A" for adenylate or deoxyadenylate (for RNA or DNA, respectively), "C" for cytidylate or deoxycytidylate, "G" for guanylate or deoxyguanylate, "U" for uridylate, "T" for deoxythymidylate, "R" for purines (A or G), "Y" for pyrimidines (C or T), "K" for G or T, "H" for A or C or T, "I" for inosine, and "N" for any nucleotide.
[0043] "Polypeptide", "peptide", "amino acid sequence" and "protein" are used interchangeably herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers. The terms "polypeptide", "peptide", "amino acid sequence", and "protein" are also inclusive of modifications including, but not limited to, glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, hydroxylation and ADP-ribosylation.
[0044] "Messenger RNA (mRNA)" refers to the RNA that is without introns and that can be translated into protein by the cell.
[0045] "cDNA" refers to a DNA that is complementary to and synthesized from an mRNA template using the enzyme reverse transcriptase. The cDNA can be single-stranded or converted into the double-stranded form using the Klenow fragment of DNA polymerase I.
[0046] "Coding region" refers to the portion of a messenger RNA (or the corresponding portion of another nucleic acid molecule such as a DNA molecule) which encodes a protein or polypeptide. "Non-coding region" refers to all portions of a messenger RNA or other nucleic acid molecule that are not a coding region, including but not limited to, for example, the promoter region, 5' untranslated region ("UTR"), 3' UTR, intron and terminator. The terms "coding region" and "coding sequence" are used interchangeably herein. The terms "non-coding region" and "non-coding sequence" are used interchangeably herein.
[0047] An "Expressed Sequence Tag" ("EST") is a DNA sequence derived from a cDNA library and therefore is a sequence which has been transcribed. An EST is typically obtained by a single sequencing pass of a cDNA insert. The sequence of an entire cDNA insert is termed the "Full-Insert Sequence" ("FIS"). A "Contig" sequence is a sequence assembled from two or more sequences that can be selected from, but not limited to, the group consisting of an EST, FIS and PCR sequence. A sequence encoding an entire or functional protein is termed a "Complete Gene Sequence" ("CGS") and can be derived from an FIS or a contig.
[0048] "Mature" protein refers to a post-translationally processed polypeptide; i.e., one from which any pre- or pro-peptides present in the primary translation product have been removed.
[0049] "Precursor" protein refers to the primary product of translation of mRNA; i.e., with pre- and pro-peptides still present. Pre- and pro-peptides may be and are not limited to intracellular localization signals.
[0050] "Isolated" refers to materials, such as nucleic acid molecules and/or proteins, which are substantially free or otherwise removed from components that normally accompany or interact with the materials in a naturally occurring environment. Isolated polynucleotides may be purified from a host cell in which they naturally occur. Conventional nucleic acid purification methods known to skilled artisans may be used to obtain isolated polynucleotides. The term also embraces recombinant polynucleotides and chemically synthesized polynucleotides.
[0051] "Recombinant" refers to an artificial combination of two otherwise separated segments of sequence, e.g., by chemical synthesis or by the manipulation of isolated segments of nucleic acids by genetic engineering techniques.
[0052] "Recombinant" also includes reference to a cell or vector, that has been modified by the introduction of a heterologous nucleic acid or a cell derived from a cell so modified, but does not encompass the alteration of the cell or vector by naturally occurring events (e.g., spontaneous mutation, natural transformation/transduction/transposition) such as those occurring without deliberate human intervention.
[0053] "Recombinant DNA construct" refers to a combination of nucleic acid fragments that are not normally found together in nature. Accordingly, a recombinant DNA construct may comprise regulatory sequences and coding sequences that are derived from different sources, or regulatory sequences and coding sequences derived from the same source, but arranged in a manner different than that normally found in nature. The terms "recombinant DNA construct" and "recombinant construct" are used interchangeably herein.
[0054] The terms "entry clone" and "entry vector" are used interchangeably herein.
[0055] "Regulatory sequences" or "regulatory elements" are used interchangeably and refer to nucleotide sequences located upstream (5' non-coding sequences), within, or downstream (3' non-coding sequences) of a coding sequence, and which influence the transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences may include, but are not limited to, promoters, translation leader sequences, introns, and polyadenylation recognition sequences. The terms "regulatory sequence" and "regulatory element" are used interchangeably herein.
[0056] "Promoter" refers to a nucleic acid fragment capable of controlling transcription of another nucleic acid fragment.
[0057] "Promoter functional in a plant" is a promoter capable of controlling transcription in plant cells whether or not its origin is from a plant cell.
[0058] "Tissue-specific promoter" and "tissue-preferred promoter" are used interchangeably to refer to a promoter that is expressed predominantly but not necessarily exclusively in one tissue or organ, but that may also be expressed in one specific cell.
[0059] "Developmentally regulated promoter" refers to a promoter whose activity is determined by developmental events.
[0060] Promoters that cause a gene to be expressed in most cell types at most times are commonly referred to as "constitutive promoters".
[0061] Inducible promoters selectively express an operably linked DNA sequence in response to the presence of an endogenous or exogenous stimulus, for example by chemical compounds (chemical inducers) or in response to environmental, hormonal, chemical, and/or developmental signals. Examples of inducible or regulated promoters include, but are not limited to, promoters regulated by light, heat, stress, flooding or drought, pathogens, phytohormones, wounding, or chemicals such as ethanol, jasmonate, salicylic acid, or safeners.
[0062] "Enhancer sequences" refer to the sequences that can increase gene expression. These sequences can be located upstream, within introns or downstream of the transcribed region. The transcribed region is comprised of the exons and the intervening introns, from the promoter to the transcription termination region. The enhancement of gene expression can be through various mechanisms which include, but are not limited to, increasing transcriptional efficiency, stabilization of mature mRNA and translational enhancement.
[0063] An "intron" is an intervening sequence in a gene that is transcribed into RNA and then excised in the process of generating the mature mRNA. The term is also used for the excised RNA sequences. An "exon" is a portion of the sequence of a gene that is transcribed and is found in the mature messenger RNA derived from the gene, and is not necessarily a part of the sequence that encodes the final gene product.
[0064] "Operably linked" refers to the association of nucleic acid fragments in a single fragment so that the function of one is regulated by the other. For example, a promoter is operably linked with a nucleic acid fragment when it is capable of regulating the transcription of that nucleic acid fragment.
[0065] "Expression" refers to the production of a functional product. For example, expression of a nucleic acid fragment may refer to transcription of the nucleic acid fragment (e.g., transcription resulting in mRNA or functional RNA) and/or translation of mRNA into a precursor or mature protein.
[0066] "Overexpression" refers to the production of a gene product in transgenic organisms that exceeds levels of production in a null segregating (or non-transgenic) organism from the same experiment.
[0067] "Phenotype" means the detectable characteristics of a cell or organism.
[0068] The term "crossed" or "cross" means the fusion of gametes via pollination to produce progeny (e.g., cells, seeds or plants). The term encompasses both sexual crosses (the pollination of one plant by another) and selfing (self-pollination, e.g., when the pollen and ovule are from the same plant). The term "crossing" refers to the act of fusing gametes via pollination to produce progeny.
[0069] A "favorable allele" is the allele at a particular locus that confers, or contributes to, a desirable phenotype, e.g., increased cell wall digestibility, or alternatively, is an allele that allows the identification of plants with decreased cell wall digestibility that can be removed from a breeding program or planting ("counterselection"). A favorable allele of a marker is a marker allele that segregates with the favorable phenotype, or alternatively, segregates with the unfavorable plant phenotype, therefore providing the benefit of identifying plants.
[0070] The term "introduced" means providing a nucleic acid (e.g., expression construct) or protein into a cell. Introduced includes reference to the incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the genome of the cell, and includes reference to the transient provision of a nucleic acid or protein to the cell. Introduced includes reference to stable or transient transformation methods, as well as sexually crossing. Thus, "introduced" in the context of inserting a nucleic acid fragment (e.g., a recombinant DNA construct/expression construct) into a cell, means "transfection" or "transformation" or "transduction" and includes reference to the incorporation of a nucleic acid fragment into a eukaryotic or prokaryotic cell where the nucleic acid fragment may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
[0071] "Suppression DNA construct" is a recombinant DNA construct which when transformed or stably integrated into the genome of the plant, results in "silencing" of a target gene in the plant. The target gene may be endogenous or transgenic to the plant. "Silencing," as used herein with respect to the target gene, refers generally to the suppression of levels of mRNA or protein/enzyme expressed by the target gene, and/or the level of the enzyme activity or protein functionality. The terms "suppression", "suppressing" and "silencing", used interchangeably herein, include lowering, reducing, declining, decreasing, inhibiting, eliminating or preventing. "Silencing" or "gene silencing" does not specify mechanism and is inclusive, and not limited to, anti-sense, cosuppression, viral-suppression, hairpin suppression, stem-loop suppression, RNAi-based approaches, and small RNA-based approaches.
[0072] Standard recombinant DNA and molecular cloning techniques used herein are well known in the art and are described more fully in Sambrook, J., Fritsch, E. F. and Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor Laboratory Press: Cold Spring Harbor, 1989 (hereinafter "Sambrook").
[0073] "Transcription terminator", "termination sequences", or "terminator" refer to DNA sequences located downstream of a coding sequence, including polyadenylation recognition sequences and other sequences encoding regulatory signals capable of affecting mRNA processing or gene expression. The polyadenylation signal is usually characterized by affecting the addition of polyadenylic acid tracts to the 3' end of the mRNA precursor. The use of different 3' non-coding sequences is exemplified by Ingelbrecht, I. L., et al., Plant Cell 1:671-680 (1989). A polynucleotide sequence with "terminator activity" refers to a polynucleotide sequence that, when operably linked to the 3' end of a second polynucleotide sequence that is to be expressed, is capable of terminating transcription from the second polynucleotide sequence. Transcription termination is the process by which RNA synthesis by RNA polymerase is stopped and both the RNA and the enzyme are released from the DNA template.
[0074] Improper termination of an RNA transcript can affect the stability of the RNA, and hence can affect protein expression. Variability of transgene expression is sometimes attributed to variability of termination efficiency (Bieri et al (2002) Molecular Breeding 10: 107-117).
[0075] The terms "SB-GKAF terminator", "GKAF terminator" and "gamma-kafirin terminator" are used interchangeably herein, and each refers to the sequence encoding the 3' untranslated region (3' UTR) of the Sorghum Bicolor gamma-kafirin gene and about 300 bp of sequence downstream from the 3' UTR. The sequence of the SB-GKAF terminator is given in SEQ ID NO:1. The Sorghum bicolor gamma-kafirin gene encodes a gamma-prolamin protein, and the sequence for this gene is given in NCBI GI NO: 671655. Prolam ins are the major storage proteins of many cereals. The Sorghum gamma-Kafirin, which is the .gamma.-prolamin of Sorghum, constitutes about 2-5% of total prolamin in sorghum endosperm, and is composed of a single polypeptide of 27 kDa (de Freitas F A et al (1994) Mol Gen Genetics 245(2):177-86).
[0076] The present invention encompasses functional fragments and variants of the terminator sequences disclosed herein.
[0077] A "functional fragment" of the terminator is defined as any subset of contiguous nucleotides of the terminator sequence disclosed herein, that can perform the same, or substantially similar function as the full length terminator sequence disclosed herein. A "functional fragment" with substantially similar function to the full length terminator disclosed herein refers to a functional fragment that retains the ability to terminate transcription largely at the same level as the full-length terminator sequence. A recombinant construct comprising a heterologous polynucleotide operably linked to a "functional fragment" of the terminator sequence disclosed herein exhibits levels of heterologous polynucleotide expression substantially similar to a corresponding recombinant construct comprising a heterologous polynucleotide operably linked to the full length terminator sequence. A "variant", as used herein, is the sequence of the terminator or the sequence of a functional fragment of a terminator containing changes in which one or more nucleotides of the original sequence is deleted, added, and/or substituted, while substantially maintaining terminator function. One or more base pairs can be inserted, deleted, or substituted internally to a terminator, without affecting its activity. Fragments and variants can be obtained via methods such as site-directed mutagenesis and synthetic construction.
[0078] These terminator functional fragments may comprise at least 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425 or 450 contiguous nucleotides of the particular terminator nucleotide sequence disclosed herein. Such fragments may be obtained by use of restriction enzymes to cleave the naturally occurring terminator nucleotide sequences disclosed herein; by synthesizing a nucleotide sequence from the naturally occurring terminator DNA sequence; or may be obtained through the use of PCR technology. See particularly, Mullis et al., Methods Enzymol. 155:335-350 (1987), and Higuchi, R. In PCR Technology: Principles and Applications for DNA Amplifications; Erlich, H. A., Ed.; Stockton Press Inc.: New York, 1989. Again, variants of these terminator fragments, such as those resulting from site-directed mutagenesis, are encompassed by the compositions of the present invention.
[0079] The terms "substantially similar" and "corresponding substantially" as used herein refer to nucleic acid fragments, particularly terminator sequences, wherein changes in one or more nucleotide bases do not substantially alter the ability of the terminator to terminate transcription. These terms also refer to modifications, including deletions and variants, of the nucleic acid sequences of the instant invention by way of deletion or insertion of one or more nucleotides that do not substantially alter the functional properties of the resulting terminator relative to the initial, unmodified terminator. It is therefore understood, as those skilled in the art will appreciate, that the invention encompasses more than the specific exemplary sequences.
[0080] As will be evident to one of skill in the art, any heterologous polynucleotide of interest can be operably linked to the terminator sequences described in the current invention. Examples of polynucleotides of interest that can be operably linked to the terminator sequences described in this invention include, but are not limited to, polynucleotides comprising regulatory elements such as introns, enhancers, promoters, translation leader sequences, protein coding regions such as disease and insect resistance genes, genes conferring nutritional value, genes conferring yield and heterosis increase, genes that confer male and/or female sterility, antifungal, antibacterial or antiviral genes, and the like. Likewise, the terminator sequences described in the current invention can be used to terminate transcription of any nucleic acid that controls gene expression. Examples of nucleic acids that could be used to control gene expression include, but are not limited to, antisense oligonucleotides, suppression DNA constructs, or nucleic acids encoding transcription factors.
[0081] A recombinant DNA construct (including a suppression DNA construct) of the present invention may comprise at least one regulatory sequence. In an embodiment of the present invention, the regulatory sequences disclosed herein can be operably linked to any other regulatory sequence.
[0082] A number of promoters can be used in recombinant DNA constructs of the present invention. The promoters can be selected based on the desired outcome, and may include constitutive, tissue-specific, inducible, or other promoters for expression in the host organism.
[0083] The terms "real-time PCR", "quantitative PCR", "quantitative real-time PCR", and "QPCR" are used interchangeably herein, and represent a variation of the standard polymerase chain reaction (PCR) technique used to quantify DNA or RNA in a sample. Using sequence-specific primers and a probe, the relative number or copies of a particular DNA or RNA sequence are determined. The term relative is used since this technique compares relative copy numbers between different genes with respect to a specific reference gene. The quantification arises by measuring the amount of amplified product at each cycle during the PCR process. Quantification of amplified product is obtained using fluorescent hydrolysis probes that measure increasing fluorescence for each subsequent PCR cycle. The Ct (cycle threshold) is defined as the number of cycles required for the fluorescent signal to cross the threshold (i.e., exceeds background level). DNA/RNA from genes with higher copy numbers will appear after fewer PCR cycles; so the lower a Ct value, the more copies are present in the specific sample. To quantify RNA, QPCR or real-time PCR is preceded by the step of reverse transcribing mRNA into cDNA. This is referred to herein as "real-time RT-PCR" or "quantitative RT-PCR" or "qRT-PCR".
[0084] The Taqman method of PCR product quantification uses a fluorescent reporter probe. This is more accurate since the probe is designed to be sequence-specific and will only bind to the specific PCR product. The probe specificity allows for quantification even in the presence of non-specific DNA amplification. This allows for multiplexing, which quantitates several genes in the same tube, by using probes with different emission spectra. Breakdown of the probe by the 5' to 3' exonuclease activity of Taq polymerase removes the quencher and allows the PCR product to be detected.
[0085] When plotted on a linear scale, the fluorescent emission increase with PCR cycle number has a sigmoidal shape with an exponential phase and a plateau phase. The plateau phase is determined by the amount of primer in the master mix rather than the nucleotide template. Usually the vertical scale is plotted in a logarithmic fashion, allowing the intersection of the plot with the threshold to be linear and more easily visualized. Theoretically, the amount of DNA doubles every cycle during the exponential phase, but this is affected by the efficiency of the primers used. A positive control using a reference gene, e.g., a "housekeeping" gene that is relatively abundant in all cell types, is also performed to allow for comparisons between samples. The amount of DNA/RNA is determined by comparing the results to a standard curve produced by serial dilutions of a known concentration of DNA/RNA.
[0086] The present invention includes a polynucleotide comprising: (i) a nucleic acid sequence of at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% sequence identity, based on the Clustal V (or Clustal W) method of alignment, when compared to SEQ ID NO:1 or SEQ ID NO:18; or (ii) a nucleic acid sequence of at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity, based on the Clustal V (or Clustal W) method of alignment, when compared to a functional fragment of SEQ ID NO:1 or SEQ ID NO:18; or (iii) a full complement of the nucleic acid sequence of (i) or (ii), wherein the polynucleotide acts as a terminator in a plant cell.
[0087] Sequence alignments and percent identity calculations may be determined using a variety of comparison methods designed to detect homologous sequences including, but not limited to, the Megalign.RTM. program of the LASERGENE.RTM. bioinformatics computing suite (DNASTAR.RTM. Inc., Madison, Wis.). Unless stated otherwise, multiple alignment of the sequences provided herein were performed using the Clustal V method of alignment (Higgins and Sharp (1989) CABIOS. 5:151-153) with the default parameters (GAP PENALTY=10, GAP LENGTH PENALTY=10). Default parameters for pairwise alignments and calculation of percent identity of protein sequences using the Clustal V method are KTUPLE=1, GAP PENALTY=3, WINDOW=5 and DIAGONALS SAVED=5. For nucleic acids these parameters are KTUPLE=2, GAP PENALTY=5, WINDOW=4 and DIAGONALS SAVED=4. After alignment of the sequences, using the Clustal V program, it is possible to obtain "percent identity" and "divergence" values by viewing the "sequence distances" table on the same program; unless stated otherwise, percent identities and divergences provided and claimed herein were calculated in this manner.
[0088] Alternatively, the Clustal W method of alignment may be used. The Clustal W method of alignment (described by Higgins and Sharp, CABIOS. 5:151-153 (1989); Higgins, D. G. et al., Comput. Appl. Biosci. 8:189-191 (1992)) can be found in the MegAlign.TM. v6.1 program of the LASERGENE.RTM. bioinformatics computing suite (DNASTAR.RTM. Inc., Madison, Wis.). Default parameters for multiple alignment correspond to GAP PENALTY=10, GAP LENGTH PENALTY=0.2, Delay Divergent Sequences=30%, DNA Transition Weight=0.5, Protein Weight Matrix=Gonnet Series, DNA Weight Matrix=IUB. For pairwise alignments the default parameters are Alignment=Slow-Accurate, Gap Penalty=10.0, Gap Length=0.10, Protein Weight Matrix=Gonnet 250 and DNA Weight Matrix=IUB. After alignment of the sequences using the Clustal W program, it is possible to obtain "percent identity" and "divergence" values by viewing the "sequence distances" table in the same program.
[0089] Embodiments of the invention include:
[0090] The present invention relates to terminator sequences. Recombinant DNA constructs comprising terminator sequences are provided.
[0091] An embodiment of this invention is an isolated polynucleotide sequence comprising (a) the sequence set forth in SEQ ID NO:1 or SEQ ID NO:18; (b) a sequence with at least 95% sequence identity to SEQ ID NO:1 or SEQ ID NO:18; or (c) a sequence comprising a functional fragment of (a) or (b), wherein the isolated polynucleotide sequence functions as a terminator in a plant cell. In another aspect, this invention concerns a recombinant DNA construct comprising a promoter, at least one heterologous nucleic acid fragment, and any terminator, or combination of terminator elements, of the present invention, wherein the promoter, at least one heterologous nucleic acid fragment, and terminator(s) are operably linked.
[0092] In another embodiment, a functional fragment may comprise at least 450, 425, 400, 375, 350, 325, 300, 275, 250, 225, 200, 175 or 150 contiguous nucleotides of SEQ ID NO:1 or SEQ ID NO:18.
[0093] Recombinant DNA constructs can be constructed by operably linking the nucleic acid fragment of the invention, the terminator sequences set forth in SEQ ID NO:1, or 18 or a functional fragment of the nucleotide sequence set forth in SEQ ID NO:1, or 18, to a heterologous nucleic acid fragment.
[0094] Another embodiment is a method for transforming a cell (or microorganism) comprising transforming a cell (or microorganism) with any of the isolated polynucleotides or recombinant DNA constructs of the present invention. The cell (or microorganism) transformed by this method is also included. In particular embodiments, the cell is eukaryotic cell, e.g., a yeast, insect or plant cell, or prokaryotic, e.g., a bacterial cell. The microorganism may be Agrobacterium, e.g. Agrobacterium tumefaciens or Agrobacterium rhizogenes.
[0095] Another embodiment of this invention is a method of expressing a heterologous polynucleotide in a plant, comprising the steps of introducing into a regenerable plant cell the recombinant DNA construct described above and regenerating a transgenic plant from the transformed regenerable plant cell, wherein the transgenic plant comprises the recombinant DNA construct and exhibits expression of the heterologous polynucleotide.
[0096] Another embodiment of this invention is a method of expressing a heterologous polynucleotide in a plant, comprising the steps of introducing into a regenerable plant cell the recombinant DNA construct described above; regenerating a transgenic plant from the regenerable plant cell described above; and obtaining a progeny plant from the transgenic plant, wherein the transgenic plant and the progeny plant comprises the recombinant DNA construct and exhibits expression of the heterologous polynucleotide.
[0097] In another embodiment, any of the methods of expressing a heterologous polynucleotide, wherein the plant cell is a monocotyledonous or dicotyledonous plant cell, for example, a maize or soybean plant cell. The plant cell may also be from sunflower, sorghum, canola, wheat, alfalfa, cotton, rice, barley, millet, sugar cane or switchgrass.
[0098] In another embodiment, this invention concerns a vector, virus, cell, microorganism, plant, or seed comprising a recombinant DNA construct comprising the terminator sequences described in the present invention.
[0099] The invention encompasses regenerated, mature and fertile transgenic plants comprising the recombinant DNA constructs described above, transgenic seeds produced therefrom, T1 and subsequent generations. The transgenic plant cells, tissues, plants, and seeds may comprise at least one recombinant DNA construct of interest.
[0100] In one embodiment, the plant (or seed derived from the plant) comprising the terminator sequences described in the present invention is a monocotyledonous or dicotyledonous plant, for example, a maize or soybean plant. The plant may also be sunflower, sorghum, canola, wheat, alfalfa, cotton, rice, barley, millet, sugar cane or switchgrass. The plant may be an inbred plant or a hybrid plant.
EXAMPLES
[0101] The present invention is further illustrated in the following Examples, in which parts and percentages are by weight and degrees are Celsius, unless otherwise stated. It should be understood that these examples, while indicating embodiments of the invention, are given by way of illustration only. From the above discussion and these Examples, one skilled in the art can ascertain the essential characteristics of this invention, and without departing from the spirit and scope thereof, can make various changes and modifications of the invention to adapt it to various usages and conditions. Furthermore, various modifications of the invention in addition to those shown and described herein will be apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims.
Example 1
Amplification and Cloning of a Sorghum bicolor Gamma-Kafirin Terminator Sequence
[0102] Primers (SEQ ID NOS:2 and 3) were designed for amplifying the terminator of gamma-Kafirin gene from Sorghum bicolor (SB-GKAF) based on the Sorghum bicolor genomic sequence database. The primer sequences are given below, the underlined region is not homologous with genomic template:
TABLE-US-00001 TMS2039 (forward primer; SEQ ID NO: 2): CAGATCTGATATCGATGGGCCCACTAACTATCTATACTGTAATAATGTT GTATAG TMS2040 (reverse primer; SEQ ID NO: 3): CGGACCGGGTGACCAAGCTTAAGCGAACATATGTCCCTC
[0103] A 504 bp product comprising the 465 bp SB-GKAF terminator sequence (SEQ ID NO:1) was amplified by PCR using these primers. The product was cloned into pGEMTeasy (Promega) (PHP31801; FIG. 1; SEQ ID NO:4) and the sequence was confirmed. The cloned SB-GKAF terminator included 165 bp of the predicted 3' UTR of SB-GKAF along with about 300 bp of downstream sequence. The amplified sequence of SB-GKAF terminator (SEQ ID NO:1) was then cloned into an Agrobacterium transformation vector (PHP34074; FIG.2; SEQ ID NO:5), which had the following expression cassettes in divergent orientation:
[0104] SB-GKAF TERMINATOR: GUSINT: BSV PRO and
[0105] UBI-PRO:UBI INTRON:MOPAT:PINII TERM.
[0106] BSV PRO is Banana Streak Virus promoter, which is a strong constitutive promoter. A construct with a potato PINII terminator (Keil et al. (1986) Nucleic Acids Res. 14:5641-5650) in place of the SB-GKAF terminator was used as a control (PHP34005; SEQ ID NO:6).
Example 2
Transient Transformation to Test Efficacy of a SB-GKAF Terminator
[0107] The isolated SB-GKAF terminator sequence (SEQ ID NO:1) was tested for its ability to act efficiently as a terminator in a recombinant construct. Its efficacy as a terminator was tested by its ability to stop transcription and by its ability to increase expression of a protein. Since improper termination can lead to improper processing of the 3' end of mRNA, and hence affect RNA stability, terminators have been found to affect protein expression levels. It has been shown that different terminators can cause up to 100-fold variation in the efficiency of transgene expression (Bieri et al, (2002) Molecular Breeding 10: 107-117; An et al (1989) Plant Cell 1: 115-122; Ingelbrecht et al (1989), Plant Cell, 1:671-680; Ali and Taylor (2001) Plant Mol. Bio., 46:251-261). Hence we tested the SB-GKAF sequence (SEQ ID NO:1) for its ability to increase expression of a protein compared to the well-known PINII terminator. The Agrobacterium transformation vectors PHP34074 (SEQ ID NO:5) and PHP34005 (SEQ ID NO:6) described in Example 1 were used for transient transformation of BMS (Black Mexican Sweet) cells. The cells were harvested 5 days after transformation and sent for a quantification of the GUS activity (MUG assay). The SB-GKAF construct (PHP34074; SEQ ID NO:5) had .about.35% more expression than that of the PINII construct (PHP34005, SEQ ID NO:6) when the GUS expression was normalized to the MOPAT expression (FIG. 3; Table 1). This information was indicative of the ability of the isolated SB-GKAF sequence (SEQ ID NO:1) to act efficiently as a terminator, by allowing protein expression equal to or above that of the PINII terminator.
TABLE-US-00002 TABLE 1 Sequence Average Standard Construct Tested MUG/PAT* Deviation BSV PRO: GUSINT: PIN II 1.57 0.17 PIN II TERM TERM BSV PRO: GUSINT: SB-GKAF 2.13 0.41 SB-GKAF TERM TERM *Measured as: nmoles MU/mg total protein/hour/ppm PAT
Example 3
Stable Transformation Assays to Test SB-GKAF Terminator Activity
[0108] The Agrobacterium transformation vectors PHP34074 (SEQ ID NO:5) and PHP34005 (SEQ ID NO:6) described in Example 1, that were used for transient transformation assays as described in Example 2, were also used in Gaspe-Flint derived maize lines for stable transformation to generate transgenic maize plants.
[0109] Quantitative Reverse Transcriptase-PCR (qRT-PCR) and GUS assays were done from stably transformed plant tissues to test the ability of isolated SB-GKAF terminator sequence (SEQ ID NO:1) to stop transcription (that is prevent transcription read-through transcription) and to compare GUS expression as compared to that with PINII terminator.
GUS Expression Analysis:
[0110] The expression of the GUS gene in the transgenic plants was assessed at the protein as well as transcript levels. To assess the expression at the protein level, MUG assay was performed on seedling leaf material. To assess the expression at the transcript level, qRT-PCR was done using primers shown in Table 2.
TABLE-US-00003 TABLE 2 Primer/ qPCR Probe Type Sequence Fluor Assay GUS-1482F Forward SEQ ID NO: 7 -- Taqman GUS-1553R Reverse SEQ ID NO: 8 -- Taqman GUS-1509P Probe SEQ ID NO: 9 FAM Taqman
[0111] Plants were grown in the greenhouse and leaves were sampled at the R1 stage of development for expression analysis. Multiple plants were tested for each construct. Each plant was analyzed for expression of the GUS gene. GUS gene with the SB-GKAF terminator had GUS expression in the same range as that of PINII terminator at both the protein (FIG. 4A) and transcript (FIG. 4B) level.
Quantitative Reverse Transcriptase PCR (qRT-PCR) to Determine Read-Through Transcription Through the SB-GKAF Terminator:
[0112] The qRT-PCR assays were performed with leaf tissue from the stable transformants generated using PHP34074 and PHP34005. Each plant was tested for the presence of read-through transcript that had passed through the PIN II terminator and the SB-GKAF terminator (SEQ ID NO:1). To assess presence of products that would indicate that transcription was continuing past the terminator, amplification was targeted downstream of the terminator being tested. Two primer sets were designed downstream of the tested terminators.
[0113] Primer set Term1 .about.100 nt from the terminator
[0114] Primer set Term2.1 .about.500 nt from the terminator
[0115] Multiple plants were tested for each construct. The primers are shown in Table 3.
TABLE-US-00004 TABLE 3 Primer/ qPCR Probe Name Type Sequence Fluor Assay Term2.1.sup.1 Term2.1F fwd SEQ ID NO: 10 -- SYBR Term2.1.sup.1 Term2.1R rev SEQ ID NO: 11 -- SYBR Term1.sup.1 Term_1F fwd SEQ ID NO: 12 -- Taqman Term1.sup.1 Term_1R rev SEQ ID NO: 13 -- Taqman Term1.sup.1 Term_1P probe SEQ ID NO: 14 FAM Taqman Actin.sup.2 Actin_ fwd SEQ ID NO: 15 -- Taqman MGB_F Actin.sup.2 Actin_ rev SEQ ID NO: 16 -- Taqman MGB_R Actin.sup.2 Actin_ probe SEQ ID NO: 17 VIC Taqman VIC_P .sup.1Post-Terminator Primer Set .sup.2Reference Gene
[0116] The test plants were classified into 3 categories depending on the qRT-PCR results:
[0117] 1. Plants showing complete termination: where all GUS transcripts are completely terminated before they reached the specific primer set location;
[0118] 2. Plants showing a high degree of termination: where a large portion of the GUS transcripts are terminated before they reached the specific primer set location, also defined as:
[0119] Primer set Term1--.DELTA.CT>13
[0120] Primer set Term2.1--.DELTA.CT>9; and
[0121] 3. Plants showing poor termination.
[0122] As can be see from FIG. 5, the SB-GKAF terminator proved to have fewer "poorly terminating" plants than the PINII terminator (FIG. 5). Thus the qRT-PCR score for presence of transcripts that had proceeded through the terminator was lower for the SB-GKAF terminator than that for the PINII terminator.
Sequence CWU
1
1
191459DNASorghum bicolor 1aactatctat actgtaataa tgttgtatag ccgccggata
gctagctagt ttagtcattc 60agcggcgatg ggtaataata aagtgtcatc catccatcac
catgggtggc aacgtgagca 120atgacctgat tgaacaaatt gaaatgaaaa gaagaaatat
gttatatgtc aacgagattt 180cctcataatg ccactgacaa cgtgtgtcca agaaatgtat
cagtgatacg tatattcaca 240atttttttat gacttatact cacaatttgt ttttttacta
cttatactca caatttgttg 300tgggtaccat aacaatttcg atcgaatata tatcagaaag
ttgacgaaag taagctcact 360caaaaagtta aatgggctgc ggaagctgcg tcaggcccaa
gttttggcta ttctatccgg 420tatccacgat tttgatggct gagggacata tgttcgctt
459255DNAArtificial Sequenceforward primer
2cagatctgat atcgatgggc ccactaacta tctatactgt aataatgttg tatag
55339DNAArtificial SequenceReverse primer 3cggaccgggt gaccaagctt
aagcgaacat atgtccctc 3943521DNAArtificial
SequenceVector 4gggcgaattg ggcccgacgt cgcatgctcc cggccgccat ggcggccgcg
ggaattcgat 60tcggaccggg tgaccaagct taagcgaaca tatgtccctc agccatcaaa
atcgtggata 120ccggatagaa tagccaaaac ttgggcctga cgcagcttcc gcagcccatt
taactttttg 180agtgagctta ctttcgtcaa ctttctgata tatattcgat cgaaattgtt
atggtaccca 240caacaaattg tgagtataag tagtaaaaaa acaaattgtg agtataagtc
ataaaaaaat 300tgtgaatata cgtatcactg atacatttct tggacacacg tcgtcagtgg
cattatgagg 360aaatctcgtt gacatataac atatttcttc ttttcatttc aatttgttca
atcaggtcat 420tgctcacgtt gccacccatg gtgatggatg gatgacactt tattattacc
catcgccgct 480gaatgactaa actagctagc tatccggcgg ctatacaaca ttattacagt
atagatagtt 540agtgggccca tcgatatcag atctgaatca ctagtgaatt cgcggccgcc
tgcaggtcga 600ccatatggga gagctcccaa cgcgttggat gcatagcttg agtattctat
agtgtcacct 660aaatagcttg gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt
atccgctcac 720aattccacac aacatacgag ccggaagcat aaagtgtaaa gcctggggtg
cctaatgagt 780gagctaactc acattaattg cgttgcgctc actgcccgct ttccagtcgg
gaaacctgtc 840gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc
gtattgggcg 900ctcttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc
ggcgagcggt 960atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata
acgcaggaaa 1020gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg
cgttgctggc 1080gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct
caagtcagag 1140gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa
gctccctcgt 1200gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc
tcccttcggg 1260aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt
aggtcgttcg 1320ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg
ccttatccgg 1380taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg
cagcagccac 1440tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct
tgaagtggtg 1500gcctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc
tgaagccagt 1560taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg
ctggtagcgg 1620tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc
aagaagatcc 1680tttgatcttt tctacggggt ctgacgctca gtggaacgaa aactcacgtt
aagggatttt 1740ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa
aatgaagttt 1800taaatcaatc taaagtatat atgagtaaac ttggtctgac agttaccaat
gcttaatcag 1860tgaggcacct atctcagcga tctgtctatt tcgttcatcc atagttgcct
gactccccgt 1920cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg
caatgatacc 1980gcgagaccca cgctcaccgg ctccagattt atcagcaata aaccagccag
ccggaagggc 2040cgagcgcaga agtggtcctg caactttatc cgcctccatc cagtctatta
attgttgccg 2100ggaagctaga gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg
ccattgctac 2160aggcatcgtg gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg
gttcccaacg 2220atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct
ccttcggtcc 2280tccgatcgtt gtcagaagta agttggccgc agtgttatca ctcatggtta
tggcagcact 2340gcataattct cttactgtca tgccatccgt aagatgcttt tctgtgactg
gtgagtactc 2400aaccaagtca ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc
cggcgtcaat 2460acgggataat accgcgccac atagcagaac tttaaaagtg ctcatcattg
gaaaacgttc 2520ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga
tgtaacccac 2580tcgtgcaccc aactgatctt cagcatcttt tactttcacc agcgtttctg
ggtgagcaaa 2640aacaggaagg caaaatgccg caaaaaaggg aataagggcg acacggaaat
gttgaatact 2700catactcttc ctttttcaat attattgaag catttatcag ggttattgtc
tcatgagcgg 2760atacatattt gaatgtattt agaaaaataa acaaataggg gttccgcgca
catttccccg 2820aaaagtgcca cctgatgcgg tgtgaaatac cgcacagatg cgtaaggaga
aaataccgca 2880tcaggaaatt gtaagcgtta atattttgtt aaaattcgcg ttaaattttt
gttaaatcag 2940ctcatttttt aaccaatagg ccgaaatcgg caaaatccct tataaatcaa
aagaatagac 3000cgagataggg ttgagtgttg ttccagtttg gaacaagagt ccactattaa
agaacgtgga 3060ctccaacgtc aaagggcgaa aaaccgtcta tcagggcgat ggcccactac
gtgaaccatc 3120accctaatca agttttttgg ggtcgaggtg ccgtaaagca ctaaatcgga
accctaaagg 3180gagcccccga tttagagctt gacggggaaa gccggcgaac gtggcgagaa
aggaagggaa 3240gaaagcgaaa ggagcgggcg ctagggcgct ggcaagtgta gcggtcacgc
tgcgcgtaac 3300caccacaccc gccgcgctta atgcgccgct acagggcgcg tccattcgcc
attcaggctg 3360cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc tattacgcca
gctggcgaaa 3420gggggatgtg ctgcaaggcg attaagttgg gtaacgccag ggttttccca
gtcacgacgt 3480tgtaaaacga cggccagtga attgtaatac gactcactat a
3521550910DNAArtificial SequenceSB-GKAF terminator construct
5acgtgaccct agtcacttag gttaccagag ctggtcacct ttgtccacca agatggaact
60gcggccgctc attaattaag tcaggcgcgc ctctagttga agacacgttc atgtcttcat
120cgtaagaaga cactcagtag tcttcggcca gaatggccat ctggattcag caggcctaga
180aggccattta aatcctgagg atctggtctt cctaaggacc cgggatatcg ctatcaactt
240tgtatagaaa agttgggccg aattcgagct cggtacggcc agaatggccc ggaccgggtt
300accgaattcg agctcggtac cactagtaag cttaagcgaa catatgtccc tcagccatca
360aaatcgtgga taccggatag aatagccaaa acttgggcct gacgcagctt ccgcagccca
420tttaactttt tgagtgagct tactttcgtc aactttctga tatatattcg atcgaaattg
480ttatggtacc cacaacaaat tgtgagtata agtagtaaaa aaacaaattg tgagtataag
540tcataaaaaa attgtgaata tacgtatcac tgatacattt cttggacaca cgtcgtcagt
600ggcattatga ggaaatctcg ttgacatata acatatttct tcttttcatt tcaatttgtt
660caatcaggtc attgctcacg ttgccaccca tggtgatgga tggatgacac tttattatta
720cccatcgccg ctgaatgact aaactagcta gctatccggc ggctatacaa cattattaca
780gtatagatag ttagtgggcc catcgatatc agatcttcat tgtttgcctc cctgctgcgg
840tttttcaccg aagttcatgc cagtccagcg tttttgcagc agaaaagccg ccgacttcgg
900tttgcggtcg cgagtgaaga tccctttctt gttaccgcca acgcgcaata tgccttgcga
960ggtcgcaaaa tcggcgaaat tccatacctg ttcaccgacg acggcgctga cgcgatcaaa
1020gacgcggtga tacatatcca gccatgcaca ctgatactct tcactccaca tgtcggtgta
1080cattgagtgc agcccggcta acgtatccac gccgtattcg gtgatgataa tcggctgatg
1140cagtttctcc tgccaggcca gaagttcttt ttccagtacc ttctctgccg tttccaaatc
1200gccgctttgg acataccatc cgtaataacg gttcaggcac agcacatcaa agagatcgct
1260aatggtatcg gtgtgagcgt cgcagaacat tacattgacg caggtgatcg gacgcgtcgg
1320gtcgagttta cgcgttgctt ccgccagtgg cgcgaaatat tcccgtgcac cttgcggacg
1380ggtatccggt tcgttggcaa tactccacat caccacgctt gggtggtttt tgtcacgcgc
1440tatcagctct ttaatcgcct gtaagtgcgc ttgctgagtt tccccgttga ctgcctcttc
1500gctgtacagt tctttcggct tgttgcccgc ttcgaaacca atccctaaag agaggttaaa
1560gccgacagca gcagtttcat caatcaccac gatgccatgt tcatctgccc agtcgagcat
1620ctcttcagcg taagggtaat gcgaggtacg gtaggagttg gccccaatcc agtccattaa
1680tgcgtggtcg tgcaccatca gcacgttatc gaatcctttg ccacgcaagt ccgcatcttc
1740atgacgacca aagccagtaa agtagaacgg tttgtggtta atcaggaact gttggccctt
1800cactgccact gaccggatgc cgacgcgaag cgggtagata tcacactctg tctggctttt
1860ggctgtgacg cacagttcat agagataacc ttcacccggt tgccagaggt gcggattcac
1920cacttgcaaa gtcccgctag tgccttgtcc agttgcaacc acctgttgat ccgcatcacg
1980cagttcaacg ctgacatcac cattggccac cacctgccag tcaacagacg cgtggttaca
2040gtcttgcgcg acatgcgtca ccacggtgat atcgtccacc caggtgttcg gcgtggtgta
2100gagcattacg ctgcgatgga ttccggcata gttaaagaaa tcatggaagt aagactgctt
2160tttcttgccg ttttcgtcgg taatcaccat tcccggcggg atagtctgcc agttcagttc
2220gttgttcaca caaacggtga tacctgcaca tcaacaaatt ttggtcatat attagaaaag
2280ttataaatta aaatatacac acttataaac tacagaaaag caattgctat atactacatt
2340cttttatttt gaaaaaaata tttgaaatat tatattacta ctaattaatg ataattatta
2400tatatatatc aaaggtagaa gcagaaactt acgtacactt ttcccggcaa taacatacgg
2460cgtgacatcg gcttcaaatg gcgtatagcc gccctgatgc tccatcactt cctgattatt
2520gacccacact ttgccgtaat gagtgaccgc atcgaaacgc agcacgatac gctggcctgc
2580ccaacctttc ggtataaaga cttcgcgctg ataccagacg ttgcccgcat aattacgaat
2640atctgcatcg gcgaactgat cgttaaaact gcctggcaca gcaattgccc ggctttcttg
2700taacgcgctt tcccaccaac gctgatcaat tccacagttt tcgcgatcca gactgaatgc
2760ccacaggccg tcgagttttt tgatttcacg ggttggggtt tctacaggac ggaccatggt
2820gtcgtgtgga tccaaattgt atgcaaggtg aatgactttc ttttcgtaaa ctagatagga
2880gtactcctcc aggatgctta acccgtattg acgtacagag gtctatgatc cttttgttta
2940taaaggagct tgtagttcag tcagtcttat acttcacgat gcccatgttt ctatatagga
3000tattatcttg gctttgtaag tacttcacgc aggttatgtt ctgtttctag gatattatcc
3060tcatacatgc gaagaaccaa tttttccccc attctcttcg ggtacttttt cttgggtagg
3120catgctctct tggaccaact agcataaaac ataatcattt ttccctacag ccttgaccag
3180ctataatcga aatcatgctc atttttctaa gaaagactga atacagctcc aatttaaaca
3240atttaaatca taaacttgta actcaattag agaaaagcag agcccttcgg ctcctatcta
3300aaggaattac cccatgaaag ccataaaaac gaaccttgct ctgataccag acgggtctac
3360gctcgcggaa ctaggatctt gcgctctact cgcacaaagt gaactcgcac aaagtgtgtt
3420tcaagcacag aagtttttat ttctcaaatc aggagtaaac tcgcgttgtg gtgcgtgttt
3480gcaacctgaa tacaaggctc cttatataga gagttgtgga gctttctggc atcgttaggt
3540ggcatccacc aataatgcag ataagcatca tcacatgtct ctggcctaac aactttgcgt
3600aagaatcctg caaagttact aaaggtcatc gtgcgtgact agacaacgca caccgacaaa
3660cttaaaataa agagacatta tactttgtct cctctttaca taaagtgagt ggtatccagc
3720tcactccgca tcttatcagt cttcacaccg gttggtatca acacgtggta ggggtccgcc
3780acttccgctt cagtcatcat tactgatatc cagcagatct agagcatctt caataagata
3840ttcttgttct gcacgcagat tttcttgctc cctcagtaat tcctcccaca gtgagtcttc
3900tgatatttct tcaagtttct tctcccatct gatcttttcc tgcacaaacg agtcaatttg
3960gtctttccag acccaagtaa aacaagtgtt agtttcacag gagtaaaact ccctgtcagg
4020atttctggat gttctggaga tcttcagttt tgctggttta ttgcatccac atttgaaaac
4080cggctcttca cttagtgtta gcacattgat ttgatgcaac ctgtagcctt tgctcaacca
4140gtcttcatat ctttttacaa catcattaac tctctgtttt gcatcggtgt ttcccttgtg
4200aaatacctcc tccactgcat tgatcaacac accttcagat tgatgctttt ccggatggag
4260aataatcttt accagtcttg acagagtgtc tgctaaaacg ttgtcctttc cgtcaatgtg
4320ttcaaactta atctcaagac ctgtcccggt aatgtaatct gtgaaggcaa gccatctgac
4380tcttgatggt ttatgatcac tgcttttctt gtaaaagctc actattgctt gactgtcagt
4440tctgattatg agctctttgt aagcttggtc acccggtccg ggcctagaag gccagcttcg
4500gccgccccgg gcaactttat tatacaaagt tgatagatat cggaccgatt aaactttaat
4560tcggtccgaa gcttgcatgc ctgcagtgca gcgtgacccg gtcgtgcccc tctctagaga
4620taatgagcat tgcatgtcta agttataaaa aattaccaca tatttttttt gtcacacttg
4680tttgaagtgc agtttatcta tctttataca tatatttaaa ctttactcta cgaataatat
4740aatctatagt actacaataa tatcagtgtt ttagagaatc atataaatga acagttagac
4800atggtctaaa ggacaattga gtattttgac aacaggactc tacagtttta tctttttagt
4860gtgcatgtgt tctccttttt ttttgcaaat agcttcacct atataatact tcatccattt
4920tattagtaca tccatttagg gtttagggtt aatggttttt atagactaat ttttttagta
4980catctatttt attctatttt agcctctaaa ttaagaaaac taaaactcta ttttagtttt
5040tttatttaat aatttagata taaaatagaa taaaataaag tgactaaaaa ttaaacaaat
5100accctttaag aaattaaaaa aactaaggaa acatttttct tgtttcgagt agataatgcc
5160agcctgttaa acgccgtcga cgagtctaac ggacaccaac cagcgaacca gcagcgtcgc
5220gtcgggccaa gcgaagcaga cggcacggca tctctgtcgc tgcctctgga cccctctcga
5280gagttccgct ccaccgttgg acttgctccg ctgtcggcat ccagaaattg cgtggcggag
5340cggcagacgt gagccggcac ggcaggcggc ctcctcctcc tctcacggca ccggcagcta
5400cgggggattc ctttcccacc gctccttcgc tttcccttcc tcgcccgccg taataaatag
5460acaccccctc cacaccctct ttccccaacc tcgtgttgtt cggagcgcac acacacacaa
5520ccagatctcc cccaaatcca cccgtcggca cctccgcttc aaggtacgcc gctcgtcctc
5580cccccccccc ctctctacct tctctagatc ggcgttccgg tccatgcatg gttagggccc
5640ggtagttcta cttctgttca tgtttgtgtt agatccgtgt ttgtgttaga tccgtgctgc
5700tagcgttcgt acacggatgc gacctgtacg tcagacacgt tctgattgct aacttgccag
5760tgtttctctt tggggaatcc tgggatggct ctagccgttc cgcagacggg atcgatttca
5820tgattttttt tgtttcgttg catagggttt ggtttgccct tttcctttat ttcaatatat
5880gccgtgcact tgtttgtcgg gtcatctttt catgcttttt tttgtcttgg ttgtgatgat
5940gtggtctggt tgggcggtcg ttctagatcg gagtagaatt ctgtttcaaa ctacctggtg
6000gatttattaa ttttggatct gtatgtgtgt gccatacata ttcatagtta cgaattgaag
6060atgatggatg gaaatatcga tctaggatag gtatacatgt tgatgcgggt tttactgatg
6120catatacaga gatgcttttt gttcgcttgg ttgtgatgat gtggtgtggt tgggcggtcg
6180ttcattcgtt ctagatcgga gtagaatact gtttcaaact acctggtgta tttattaatt
6240ttggaactgt atgtgtgtgt catacatctt catagttacg agtttaagat ggatggaaat
6300atcgatctag gataggtata catgttgatg tgggttttac tgatgcatat acatgatggc
6360atatgcagca tctattcata tgctctaacc ttgagtacct atctattata ataaacaagt
6420atgttttata attattttga tcttgatata cttggatgat ggcatatgca gcagctatat
6480gtggattttt ttagccctgc cttcatacgc tatttatttg cttggtactg tttcttttgt
6540cgatgctcac cctgttgttt ggtgttactt ctgcaggtcg actttaactt agcctaggat
6600ccacacgaca ccatgtcccc cgagcgccgc cccgtcgaga tccgcccggc caccgccgcc
6660gacatggccg ccgtgtgcga catcgtgaac cactacatcg agacctccac cgtgaacttc
6720cgcaccgagc cgcagacccc gcaggagtgg atcgacgacc tggagcgcct ccaggaccgc
6780tacccgtggc tcgtggccga ggtggagggc gtggtggccg gcatcgccta cgccggcccg
6840tggaaggccc gcaacgccta cgactggacc gtggagtcca ccgtgtacgt gtcccaccgc
6900caccagcgcc tcggcctcgg ctccaccctc tacacccacc tcctcaagag catggaggcc
6960cagggcttca agtccgtggt ggccgtgatc ggcctcccga acgacccgtc cgtgcgcctc
7020cacgaggccc tcggctacac cgcccgcggc accctccgcg ccgccggcta caagcacggc
7080ggctggcacg acgtcggctt ctggcagcgc gacttcgagc tgccggcccc gccgcgcccg
7140gtgcgcccgg tgacgcagat ctgagtcgaa acctagactt gtccatcttc tggattggcc
7200aacttaatta atgtatgaaa taaaaggatg cacacatagt gacatgctaa tcactataat
7260gtgggcatca aagttgtgtg ttatgtgtaa ttactagtta tctgaataaa agagaaagag
7320atcatccata tttcttatcc taaatgaatg tcacgtgtct ttataattct ttgatgaacc
7380agatgcattt cattaaccaa atccatatac atataaatat taatcatata taattaatat
7440caattgggtt agcaaaacaa atctagtcta ggtgtgtttt gcgaatgcgg ccgataagtg
7500actagggtca cgtgacccta gtcacttagg taccgagctc gaattcattc cgattaatcg
7560tggcctcttg ctcttcagga tgaagagcta tgtttaaacg tgcaagcgct actagacaat
7620tcagtacatt aaaaacgtcc gcaatgtgtt attaagttgt ctaagcgtca atttgtttac
7680accacaatat atcctgccac cagccagcca acagctcccc gaccggcagc tcggcacaaa
7740atcaccactc gatacaggca gcccatcagt ccgggacggc gtcagcggga gagccgttgt
7800aaggcggcag actttgctca tgttaccgat gctattcgga agaacggcaa ctaagctgcc
7860gggtttgaaa cacggatgat ctcgcggagg gtagcatgtt gattgtaacg atgacagagc
7920gttgctgcct gtgatcaaat atcatctccc tcgcagagat ccgaattatc agccttctta
7980ttcatttctc gcttaaccgt gacaggctgt cgatcttgag aactatgccg acataatagg
8040aaatcgctgg ataaagccgc tgaggaagct gagtggcgct atttctttag aagtgaacgt
8100tgacgatcgt cgaccgtacc ccgatgaatt aattcggacg tacgttctga acacagctgg
8160atacttactt gggcgattgt catacatgac atcaacaatg tacccgtttg tgtaaccgtc
8220tcttggaggt tcgtatgaca ctagtggttc ccctcagctt gcgactagat gttgaggcct
8280aacattttat tagagagcag gctagttgct tagatacatg atcttcaggc cgttatctgt
8340cagggcaagc gaaaattggc catttatgac gaccaatgcc ccgcagaagc tcccatcttt
8400gccgccatag acgccgcgcc ccccttttgg ggtgtagaac atccttttgc cagatgtgga
8460aaagaagttc gttgtcccat tgttggcaat gacgtagtag ccggcgaaag tgcgagaccc
8520atttgcgcta tatataagcc tacgatttcc gttgcgacta ttgtcgtaat tggatgaact
8580attatcgtag ttgctctcag agttgtcgta atttgatgga ctattgtcgt aattgcttat
8640ggagttgtcg tagttgcttg gagaaatgtc gtagttggat ggggagtagt catagggaag
8700acgagcttca tccactaaaa caattggcag gtcagcaagt gcctgccccg atgccatcgc
8760aagtacgagg cttagaacca ccttcaacag atcgcgcata gtcttcccca gctctctaac
8820gcttgagtta agccgcgccg cgaagcggcg tcggcttgaa cgaattgtta gacattattt
8880gccgactacc ttggtgatct cgcctttcac gtagtgaaca aattcttcca actgatctgc
8940gcgcgaggcc aagcgatctt cttgtccaag ataagcctgc ctagcttcaa gtatgacggg
9000ctgatactgg gccggcaggc gctccattgc ccagtcggca gcgacatcct tcggcgcgat
9060tttgccggtt actgcgctgt accaaatgcg ggacaacgta agcactacat ttcgctcatc
9120gccagcccag tcgggcggcg agttccatag cgttaaggtt tcatttagcg cctcaaatag
9180atcctgttca ggaaccggat caaagagttc ctccgccgct ggacctacca aggcaacgct
9240atgttctctt gcttttgtca gcaagatagc cagatcaatg tcgatcgtgg ctggctcgaa
9300gatacctgca agaatgtcat tgcgctgcca ttctccaaat tgcagttcgc gcttagctgg
9360ataacgccac ggaatgatgt cgtcgtgcac aacaatggtg acttctacag cgcggagaat
9420ctcgctctct ccaggggaag ccgaagtttc caaaaggtcg ttgatcaaag ctcgccgcgt
9480tgtttcatca agccttacag tcaccgtaac cagcaaatca atatcactgt gtggcttcag
9540gccgccatcc actgcggagc cgtacaaatg tacggccagc aacgtcggtt cgagatggcg
9600ctcgatgacg ccaactacct ctgatagttg agtcgatact tcggcgatca ccgcttccct
9660catgatgttt aactcctgaa ttaagccgcg ccgcgaagcg gtgtcggctt gaatgaattg
9720ttaggcgtca tcctgtgctc ccgagaacca gtaccagtac atcgctgttt cgttcgagac
9780ttgaggtcta gttttatacg tgaacaggtc aatgccgccg agagtaaagc cacattttgc
9840gtacaaattg caggcaggta cattgttcgt ttgtgtctct aatcgtatgc caaggagctg
9900tctgcttagt gcccactttt tcgcaaattc gatgagactg tgcgcgactc ctttgcctcg
9960gtgcgtgtgc gacacaacaa tgtgttcgat agaggctaga tcgttccatg ttgagttgag
10020ttcaatcttc ccgacaagct cttggtcgat gaatgcgcca tagcaagcag agtcttcatc
10080agagtcatca tccgagatgt aatccttccg gtaggggctc acacttctgg tagatagttc
10140aaagccttgg tcggataggt gcacatcgaa cacttcacga acaatgaaat ggttctcagc
10200atccaatgtt tccgccacct gctcagggat caccgaaatc ttcatatgac gcctaacgcc
10260tggcacagcg gatcgcaaac ctggcgcggc ttttggcaca aaaggcgtga caggtttgcg
10320aatccgttgc tgccacttgt taaccctttt gccagatttg gtaactataa tttatgttag
10380aggcgaagtc ttgggtaaaa actggcctaa aattgctggg gatttcagga aagtaaacat
10440caccttccgg ctcgatgtct attgtagata tatgtagtgt atctacttga tcgggggatc
10500tgctgcctcg cgcgtttcgg tgatgacggt gaaaacctct gacacatgca gctcccggag
10560acggtcacag cttgtctgta agcggatgcc gggagcagac aagcccgtca gggcgcgtca
10620gcgggtgttg gcgggtgtcg gggcgcagcc atgacccagt cacgtagcga tagcggagtg
10680tatactggct taactatgcg gcatcagagc agattgtact gagagtgcac catatgcggt
10740gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgctct tccgcttcct
10800cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa
10860aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa
10920aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc
10980tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga
11040caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc
11100cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt
11160ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct
11220gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg
11280agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta
11340gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct
11400acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa
11460gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt
11520gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta
11580cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat
11640caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa
11700gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct
11760cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta
11820cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct
11880caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg
11940gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa
12000gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctgcaggg gggggggggg
12060gggggttcca ttgttcattc cacggacaaa aacagagaaa ggaaacgaca gaggccaaaa
12120agctcgcttt cagcacctgt cgtttccttt cttttcagag ggtattttaa ataaaaacat
12180taagttatga cgaagaagaa cggaaacgcc ttaaaccgga aaattttcat aaatagcgaa
12240aacccgcgag gtcgccgccc cgtaacctgt cggatcaccg gaaaggaccc gtaaagtgat
12300aatgattatc atctacatat cacaacgtgc gtggaggcca tcaaaccacg tcaaataatc
12360aattatgacg caggtatcgt attaattgat ctgcatcaac ttaacgtaaa aacaacttca
12420gacaatacaa atcagcgaca ctgaatacgg ggcaacctca tgtccccccc cccccccccc
12480ctgcaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc
12540aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg
12600gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag
12660cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt
12720actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt
12780caacacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac
12840gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac
12900ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag
12960caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa
13020tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga
13080gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc
13140cccgaaaagt gccacctgac gtctaagaaa ccattattat catgacatta acctataaaa
13200ataggcgtat cacgaggccc tttcgtcttc aagaattcgg agcttttgcc attctcaccg
13260gattcagtcg tcactcatgg tgatttctca cttgataacc ttatttttga cgaggggaaa
13320ttaataggtt gtattgatgt tggacgagtc ggaatcgcag accgatacca ggatcttgcc
13380atcctatgga actgcctcgg tgagttttct ccttcattac agaaacggct ttttcaaaaa
13440tatggtattg ataatcctga tatgaataaa ttgcagtttc atttgatgct cgatgagttt
13500ttctaatcag aattggttaa ttggttgtaa cactggcaga gcattacgct gacttgacgg
13560gacggcggct ttgttgaata aatcgaactt ttgctgagtt gaaggatcag atcacgcatc
13620ttcccgacaa cgcagaccgt tccgtggcaa agcaaaagtt caaaatcacc aactggtcca
13680cctacaacaa agctctcatc aaccgtggct ccctcacttt ctggctggat gatggggcga
13740ttcaggcctg gtatgagtca gcaacacctt cttcacgagg cagacctcag cgccagaagg
13800ccgccagaga ggccgagcgc ggccgtgagg cttggacgct agggcagggc atgaaaaagc
13860ccgtagcggg ctgctacggg cgtctgacgc ggtggaaagg gggaggggat gttgtctaca
13920tggctctgct gtagtgagtg ggttgcgctc cggcagcggt cctgatcaat cgtcaccctt
13980tctcggtcct tcaacgttcc tgacaacgag cctccttttc gccaatccat cgacaatcac
14040cgcgagtccc tgctcgaacg ctgcgtccgg accggcttcg tcgaaggcgt ctatcgcggc
14100ccgcaacagc ggcgagagcg gagcctgttc aacggtgccg ccgcgctcgc cggcatcgct
14160gtcgccggcc tgctcctcaa gcacggcccc aacagtgaag tagctgattg tcatcagcgc
14220attgacggcg tccccggccg aaaaacccgc ctcgcagagg aagcgaagct gcgcgtcggc
14280cgtttccatc tgcggtgcgc ccggtcgcgt gccggcatgg atgcgcgcgc catcgcggta
14340ggcgagcagc gcctgcctga agctgcgggc attcccgatc agaaatgagc gccagtcgtc
14400gtcggctctc ggcaccgaat gcgtatgatt ctccgccagc atggcttcgg ccagtgcgtc
14460gagcagcgcc cgcttgttcc tgaagtgcca gtaaagcgcc ggctgctgaa cccccaaccg
14520ttccgccagt ttgcgtgtcg tcagaccgtc tacgccgacc tcgttcaaca ggtccagggc
14580ggcacggatc actgtattcg gctgcaactt tgtcatgctt gacactttat cactgataaa
14640cataatatgt ccaccaactt atcagtgata aagaatccgc gcgttcaatc ggaccagcgg
14700aggctggtcc ggaggccaga cgtgaaaccc aacatacccc tgatcgtaat tctgagcact
14760gtcgcgctcg acgctgtcgg catcggcctg attatgccgg tgctgccggg cctcctgcgc
14820gatctggttc actcgaacga cgtcaccgcc cactatggca ttctgctggc gctgtatgcg
14880ttggtgcaat ttgcctgcgc acctgtgctg ggcgcgctgt cggatcgttt cgggcggcgg
14940ccaatcttgc tcgtctcgct ggccggcgcc actgtcgact acgccatcat ggcgacagcg
15000cctttccttt gggttctcta tatcgggcgg atcgtggccg gcatcaccgg ggcgactggg
15060gcggtagccg gcgcttatat tgccgatatc actgatggcg atgagcgcgc gcggcacttc
15120ggcttcatga gcgcctgttt cgggttcggg atggtcgcgg gacctgtgct cggtgggctg
15180atgggcggtt tctcccccca cgctccgttc ttcgccgcgg cagccttgaa cggcctcaat
15240ttcctgacgg gctgtttcct tttgccggag tcgcacaaag gcgaacgccg gccgttacgc
15300cgggaggctc tcaacccgct cgcttcgttc cggtgggccc ggggcatgac cgtcgtcgcc
15360gccctgatgg cggtcttctt catcatgcaa cttgtcggac aggtgccggc cgcgctttgg
15420gtcattttcg gcgaggatcg ctttcactgg gacgcgacca cgatcggcat ttcgcttgcc
15480gcatttggca ttctgcattc actcgcccag gcaatgatca ccggccctgt agccgcccgg
15540ctcggcgaaa ggcgggcact catgctcgga atgattgccg acggcacagg ctacatcctg
15600cttgccttcg cgacacgggg atggatggcg ttcccgatca tggtcctgct tgcttcgggt
15660ggcatcggaa tgccggcgct gcaagcaatg ttgtccaggc aggtggatga ggaacgtcag
15720gggcagctgc aaggctcact ggcggcgctc accagcctga cctcgatcgt cggacccctc
15780ctcttcacgg cgatctatgc ggcttctata acaacgtgga acgggtgggc atggattgca
15840ggcgctgccc tctacttgct ctgcctgccg gcgctgcgtc gcgggctttg gagcggcgca
15900gggcaacgag ccgatcgctg atcgtggaaa cgataggcct atgccatgcg ggtcaaggcg
15960acttccggca agctatacgc gccctaggag tgcggttgga acgttggccc agccagatac
16020tcccgatcac gagcaggacg ccgatgattt gaagcgcact cagcgtctga tccaagaaca
16080accatcctag caacacggcg gtccccgggc tgagaaagcc cagtaaggaa acaactgtag
16140gttcgagtcg cgagatcccc cggaaccaaa ggaagtaggt taaacccgct ccgatcaggc
16200cgagccacgc caggccgaga acattggttc ctgtaggcat cgggattggc ggatcaaaca
16260ctaaagctac tggaacgagc agaagtcctc cggccgccag ttgccaggcg gtaaaggtga
16320gcagaggcac gggaggttgc cacttgcggg tcagcacggt tccgaacgcc atggaaaccg
16380cccccgccag gcccgctgcg acgccgacag gatctagcgc tgcgtttggt gtcaacacca
16440acagcgccac gcccgcagtt ccgcaaatag cccccaggac cgccatcaat cgtatcgggc
16500tacctagcag agcggcagag atgaacacga ccatcagcgg ctgcacagcg cctaccgtcg
16560ccgcgacccc gcccggcagg cggtagaccg aaataaacaa caagctccag aatagcgaaa
16620tattaagtgc gccgaggatg aagatgcgca tccaccagat tcccgttgga atctgtcgga
16680cgatcatcac gagcaataaa cccgccggca acgcccgcag cagcataccg gcgacccctc
16740ggcctcgctg ttcgggctcc acgaaaacgc cggacagatg cgccttgtga gcgtccttgg
16800ggccgtcctc ctgtttgaag accgacagcc caatgatctc gccgtcgatg taggcgccga
16860atgccacggc atctcgcaac cgttcagcga acgcctccat gggctttttc tcctcgtgct
16920cgtaaacgga cccgaacatc tctggagctt tcttcagggc cgacaatcgg atctcgcgga
16980aatcctgcac gtcggccgct ccaagccgtc gaatctgagc cttaatcaca attgtcaatt
17040ttaatcctct gtttatcggc agttcgtaga gcgcgccgtg cgtcccgagc gatactgagc
17100gaagcaagtg cgtcgagcag tgcccgcttg ttcctgaaat gccagtaaag cgctggctgc
17160tgaaccccca gccggaactg accccacaag gccctagcgt ttgcaatgca ccaggtcatc
17220attgacccag gcgtgttcca ccaggccgct gcctcgcaac tcttcgcagg cttcgccgac
17280ctgctcgcgc cacttcttca cgcgggtgga atccgatccg cacatgaggc ggaaggtttc
17340cagcttgagc gggtacggct cccggtgcga gctgaaatag tcgaacatcc gtcgggccgt
17400cggcgacagc ttgcggtact tctcccatat gaatttcgtg tagtggtcgc cagcaaacag
17460cacgacgatt tcctcgtcga tcaggacctg gcaacgggac gttttcttgc cacggtccag
17520gacgcggaag cggtgcagca gcgacaccga ttccaggtgc ccaacgcggt cggacgtgaa
17580gcccatcgcc gtcgcctgta ggcgcgacag gcattcctcg gccttcgtgt aataccggcc
17640attgatcgac cagcccaggt cctggcaaag ctcgtagaac gtgaaggtga tcggctcgcc
17700gataggggtg cgcttcgcgt actccaacac ctgctgccac accagttcgt catcgtcggc
17760ccgcagctcg acgccggtgt aggtgatctt cacgtccttg ttgacgtgga aaatgacctt
17820gttttgcagc gcctcgcgcg ggattttctt gttgcgcgtg gtgaacaggg cagagcgggc
17880cgtgtcgttt ggcatcgctc gcatcgtgtc cggccacggc gcaatatcga acaaggaaag
17940ctgcatttcc ttgatctgct gcttcgtgtg tttcagcaac gcggcctgct tggcctcgct
18000gacctgtttt gccaggtcct cgccggcggt ttttcgcttc ttggtcgtca tagttcctcg
18060cgtgtcgatg gtcatcgact tcgccaaacc tgccgcctcc tgttcgagac gacgcgaacg
18120ctccacggcg gccgatggcg cgggcagggc agggggagcc agttgcacgc tgtcgcgctc
18180gatcttggcc gtagcttgct ggaccatcga gccgacggac tggaaggttt cgcggggcgc
18240acgcatgacg gtgcggcttg cgatggtttc ggcatcctcg gcggaaaacc ccgcgtcgat
18300cagttcttgc ctgtatgcct tccggtcaaa cgtccgattc attcaccctc cttgcgggat
18360tgccccgact cacgccgggg caatgtgccc ttattcctga tttgacccgc ctggtgcctt
18420ggtgtccaga taatccacct tatcggcaat gaagtcggtc ccgtagaccg tctggccgtc
18480cttctcgtac ttggtattcc gaatcttgcc ctgcacgaat accagcgacc ccttgcccaa
18540atacttgccg tgggcctcgg cctgagagcc aaaacacttg atgcggaaga agtcggtgcg
18600ctcctgcttg tcgccggcat cgttgcgcca ctcttcatta accgctatat cgaaaattgc
18660ttgcggcttg ttagaattgc catgacgtac ctcggtgtca cgggtaagat taccgataaa
18720ctggaactga ttatggctca tatcgaaagt ctccttgaga aaggagactc tagtttagct
18780aaacattggt tccgctgtca agaactttag cggctaaaat tttgcgggcc gcgaccaaag
18840gtgcgagggg cggcttccgc tgtgtacaac cagatatttt tcaccaacat ccttcgtctg
18900ctcgatgagc ggggcatgac gaaacatgag ctgtcggaga gggcaggggt ttcaatttcg
18960tttttatcag acttaaccaa cggtaaggcc aacccctcgt tgaaggtgat ggaggccatt
19020gccgacgccc tggaaactcc cctacctctt ctcctggagt ccaccgacct tgaccgcgag
19080gcactcgcgg agattgcggg tcatcctttc aagagcagcg tgccgcccgg atacgaacgc
19140atcagtgtgg ttttgccgtc acataaggcg tttatcgtaa agaaatgggg cgacgacacc
19200cgaaaaaagc tgcgtggaag gctctgacgc caagggttag ggcttgcact tccttcttta
19260gccgctaaaa cggccccttc tctgcgggcc gtcggctcgc gcatcatatc gacatcctca
19320acggaagccg tgccgcgaat ggcatcgggc gggtgcgctt tgacagttgt tttctatcag
19380aacccctacg tcgtgcggtt cgattagctg tttgtcttgc aggctaaaca ctttcggtat
19440atcgtttgcc tgtgcgataa tgttgctaat gatttgttgc gtaggggtta ctgaaaagtg
19500agcgggaaag aagagtttca gaccatcaag gagcgggcca agcgcaagct ggaacgcgac
19560atgggtgcgg acctgttggc cgcgctcaac gacccgaaaa ccgttgaagt catgctcaac
19620gcggacggca aggtgtggca cgaacgcctt ggcgagccga tgcggtacat ctgcgacatg
19680cggcccagcc agtcgcaggc gattatagaa acggtggccg gattccacgg caaagaggtc
19740acgcggcatt cgcccatcct ggaaggcgag ttccccttgg atggcagccg ctttgccggc
19800caattgccgc cggtcgtggc cgcgccaacc tttgcgatcc gcaagcgcgc ggtcgccatc
19860ttcacgctgg aacagtacgt cgaggcgggc atcatgaccc gcgagcaata cgaggtcatt
19920aaaagcgccg tcgcggcgca tcgaaacatc ctcgtcattg gcggtactgg ctcgggcaag
19980accacgctcg tcaacgcgat catcaatgaa atggtcgcct tcaacccgtc tgagcgcgtc
20040gtcatcatcg aggacaccgg cgaaatccag tgcgccgcag agaacgccgt ccaataccac
20100accagcatcg acgtctcgat gacgctgctg ctcaagacaa cgctgcgtat gcgccccgac
20160cgcatcctgg tcggtgaggt acgtggcccc gaagcccttg atctgttgat ggcctggaac
20220accgggcatg aaggaggtgc cgccaccctg cacgcaaaca accccaaagc gggcctgagc
20280cggctcgcca tgcttatcag catgcacccg gattcaccga aacccattga gccgctgatt
20340ggcgaggcgg ttcatgtggt cgtccatatc gccaggaccc ctagcggccg tcgagtgcaa
20400gaaattctcg aagttcttgg ttacgagaac ggccagtaca tcaccaaaac cctgtaagga
20460gtatttccaa tgacaacggc tgttccgttc cgtctgacca tgaatcgcgg cattttgttc
20520taccttgccg tgttcttcgt tctcgctctc gcgttatccg cgcatccggc gatggcctcg
20580gaaggcaccg gcggcagctt gccatatgag agctggctga cgaacctgcg caactccgta
20640accggcccgg tggccttcgc gctgtccatc atcggcatcg tcgtcgccgg cggcgtgctg
20700atcttcggcg gcgaactcaa cgccttcttc cgaaccctga tcttcctggt tctggtgatg
20760gcgctgctgg tcggcgcgca gaacgtgatg agcaccttct tcggtcgtgg tgccgaaatc
20820gcggccctcg gcaacggggc gctgcaccag gtgcaagtcg cggcggcgga tgccgtgcgt
20880gcggtagcgg ctggacggct cgcctaatca tggctctgcg cacgatcccc atccgtcgcg
20940caggcaaccg agaaaacctg ttcatgggtg gtgatcgtga actggtgatg ttctcgggcc
21000tgatggcgtt tgcgctgatt ttcagcgccc aagagctgcg ggccaccgtg gtcggtctga
21060tcctgtggtt cggggcgctc tatgcgttcc gaatcatggc gaaggccgat ccgaagatgc
21120ggttcgtgta cctgcgtcac cgccggtaca agccgtatta cccggcccgc tcgaccccgt
21180tccgcgagaa caccaatagc caagggaagc aataccgatg atccaagcaa ttgcgattgc
21240aatcgcgggc ctcggcgcgc ttctgttgtt catcctcttt gcccgcatcc gcgcggtcga
21300tgccgaactg aaactgaaaa agcatcgttc caaggacgcc ggcctggccg atctgctcaa
21360ctacgccgct gtcgtcgatg acggcgtaat cgtgggcaag aacggcagct ttatggctgc
21420ctggctgtac aagggcgatg acaacgcaag cagcaccgac cagcagcgcg aagtagtgtc
21480cgcccgcatc aaccaggccc tcgcgggcct gggaagtggg tggatgatcc atgtggacgc
21540cgtgcggcgt cctgctccga actacgcgga gcggggcctg tcggcgttcc ctgaccgtct
21600gacggcagcg attgaagaag agcgctcggt cttgccttgc tcgtcggtga tgtacttcac
21660cagctccgcg aagtcgctct tcttgatgga gcgcatgggg acgtgcttgg caatcacgcg
21720caccccccgg ccgttttagc ggctaaaaaa gtcatggctc tgccctcggg cggaccacgc
21780ccatcatgac cttgccaagc tcgtcctgct tctcttcgat cttcgccagc agggcgagga
21840tcgtggcatc accgaaccgc gccgtgcgcg ggtcgtcggt gagccagagt ttcagcaggc
21900cgcccaggcg gcccaggtcg ccattgatgc gggccagctc gcggacgtgc tcatagtcca
21960cgacgcccgt gattttgtag ccctggccga cggccagcag gtaggccgac aggctcatgc
22020cggccgccgc cgccttttcc tcaatcgctc ttcgttcgtc tggaaggcag tacaccttga
22080taggtgggct gcccttcctg gttggcttgg tttcatcagc catccgcttg ccctcatctg
22140ttacgccggc ggtagccggc cagcctcgca gagcaggatt cccgttgagc accgccaggt
22200gcgaataagg gacagtgaag aaggaacacc cgctcgcggg tgggcctact tcacctatcc
22260tgcccggctg acgccgttgg atacaccaag gaaagtctac acgaaccctt tggcaaaatc
22320ctgtatatcg tgcgaaaaag gatggatata ccgaaaaaat cgctataatg accccgaagc
22380agggttatgc agcggaaaag cgctgcttcc ctgctgtttt gtggaatatc taccgactgg
22440aaacaggcaa atgcaggaaa ttactgaact gaggggacag gcgagagacg atgccaaaga
22500gctacaccga cgagctggcc gagtgggttg aatcccgcgc ggccaagaag cgccggcgtg
22560atgaggctgc ggttgcgttc ctggcggtga gggcggatgt cgaggcggcg ttagcgtccg
22620gctatgcgct cgtcaccatt tgggagcaca tgcgggaaac ggggaaggtc aagttctcct
22680acgagacgtt ccgctcgcac gccaggcggc acatcaaggc caagcccgcc gatgtgcccg
22740caccgcaggc caaggctgcg gaacccgcgc cggcacccaa gacgccggag ccacggcggc
22800cgaagcaggg gggcaaggct gaaaagccgg cccccgctgc ggccccgacc ggcttcacct
22860tcaacccaac accggacaaa aaggatctac tgtaatggcg aaaattcaca tggttttgca
22920gggcaagggc ggggtcggca agtcggccat cgccgcgatc attgcgcagt acaagatgga
22980caaggggcag acacccttgt gcatcgacac cgacccggtg aacgcgacgt tcgagggcta
23040caaggccctg aacgtccgcc ggctgaacat catggccggc gacgaaatta actcgcgcaa
23100cttcgacacc ctggtcgagc tgattgcgcc gaccaaggat gacgtggtga tcgacaacgg
23160tgccagctcg ttcgtgcctc tgtcgcatta cctcatcagc aaccaggtgc cggctctgct
23220gcaagaaatg gggcatgagc tggtcatcca taccgtcgtc accggcggcc aggctctcct
23280ggacacggtg agcggcttcg cccagctcgc cagccagttc ccggccgaag cgcttttcgt
23340ggtctggctg aacccgtatt gggggcctat cgagcatgag ggcaagagct ttgagcagat
23400gaaggcgtac acggccaaca aggcccgcgt gtcgtccatc atccagattc cggccctcaa
23460ggaagaaacc tacggccgcg atttcagcga catgctgcaa gagcggctga cgttcgacca
23520ggcgctggcc gatgaatcgc tcacgatcat gacgcggcaa cgcctcaaga tcgtgcggcg
23580cggcctgttt gaacagctcg acgcggcggc cgtgctatga gcgaccagat tgaagagctg
23640atccgggaga ttgcggccaa gcacggcatc gccgtcggcc gcgacgaccc ggtgctgatc
23700ctgcatacca tcaacgcccg gctcatggcc gacagtgcgg ccaagcaaga ggaaatcctt
23760gccgcgttca aggaagagct ggaagggatc gcccatcgtt ggggcgagga cgccaaggcc
23820aaagcggagc ggatgctgaa cgcggccctg gcggccagca aggacgcaat ggcgaaggta
23880atgaaggaca gcgccgcgca ggcggccgaa gcgatccgca gggaaatcga cgacggcctt
23940ggccgccagc tcgcggccaa ggtcgcggac gcgcggcgcg tggcgatgat gaacatgatc
24000gccggcggca tggtgttgtt cgcggccgcc ctggtggtgt gggcctcgtt atgaatcgca
24060gaggcgcaga tgaaaaagcc cggcgttgcc gggctttgtt tttgcgttag ctgggcttgt
24120ttgacaggcc caagctctga ctgcgcccgc gctcgcgctc ctgggcctgt ttcttctcct
24180gctcctgctt gcgcatcagg gcctggtgcc gtcgggctgc ttcacgcatc gaatcccagt
24240cgccggccag ctcgggatgc tccgcgcgca tcttgcgcgt cgccagttcc tcgatcttgg
24300gcgcgtgaat gcccatgcct tccttgattt cgcgcaccat gtccagccgc gtgtgcaggg
24360tctgcaagcg ggcttgctgt tgggcctgct gctgctgcca ggcggccttt gtacgcggca
24420gggacagcaa gccgggggca ttggactgta gctgctgcaa acgcgcctgc tgacggtcta
24480cgagctgttc taggcggtcc tcgatgcgct ccacctggtc atgctttgcc tgcacgtaga
24540gcgcaagggt ctgctggtag gtctgctcga tgggcgcgga ttctaagagg gcctgctgtt
24600ccgtctcggc ctcctgggcc gcctgtagca aatcctcgcc gctgttgccg ctggactgct
24660ttactgccgg ggactgctgt tgccctgctc gcgccgtcgt cgcagttcgg cttgccccca
24720ctcgattgac tgcttcattt cgagccgcag cgatgcgatc tcggattgcg tcaacggacg
24780gggcagcgcg gaggtgtccg gcttctcctt gggtgagtcg gtcgatgcca tagccaaagg
24840tttccttcca aaatgcgtcc attgctggac cgtgtttctc attgatgccc gcaagcatct
24900tcggcttgac cgccaggtca agcgcgcctt catgggcggt catgacggac gccgccatga
24960ccttgccgcc gttgttctcg atgtagccgc gtaatgaggc aatggtgccg cccatcgtca
25020gcgtgtcatc gacaacgatg tacttctggc cggggatcac ctccccctcg aaagtcgggt
25080tgaacgccag gcgatgatct gaaccggctc cggttcgggc gaccttctcc cgctgcacaa
25140tgtccgtttc gacctcaagg ccaaggcggt cggccagaac gaccgccatc atggccggaa
25200tcttgttgtt ccccgccgcc tcgacggcga ggactggaac gatgcggggc ttgtcgtcgc
25260cgatcagcgt cttgagctgg gcaacagtgt cgtccgaaat caggcgctcg accaaattaa
25320gcgccgcttc cgcgtcgccc tgcttcgcag cctggtattc aggctcgttg gtcaaagaac
25380caaggtcgcc gttgcgaacc accttcggga agtctcccca cggtgcgcgc tcggctctgc
25440tgtagctgct caagacgcct ccctttttag ccgctaaaac tctaacgagt gcgcccgcga
25500ctcaacttga cgctttcggc acttacctgt gccttgccac ttgcgtcata ggtgatgctt
25560ttcgcactcc cgatttcagg tactttatcg aaatctgacc gggcgtgcat tacaaagttc
25620ttccccacct gttggtaaat gctgccgcta tctgcgtgga cgatgctgcc gtcgtggcgc
25680tgcgacttat cggccttttg ggccatatag atgttgtaaa tgccaggttt cagggccccg
25740gctttatcta ccttctggtt cgtccatgcg ccttggttct cggtctggac aattctttgc
25800ccattcatga ccaggaggcg gtgtttcatt gggtgactcc tgacggttgc ctctggtgtt
25860aaacgtgtcc tggtcgcttg ccggctaaaa aaaagccgac ctcggcagtt cgaggccggc
25920tttccctaga gccgggcgcg tcaaggttgt tccatctatt ttagtgaact gcgttcgatt
25980tatcagttac tttcctcccg ctttgtgttt cctcccactc gtttccgcgt ctagccgacc
26040cctcaacata gcggcctctt cttgggctgc ctttgcctct tgccgcgctt cgtcacgctc
26100ggcttgcacc gtcgtaaagc gctcggcctg cctggccgcc tcttgcgccg ccaacttcct
26160ttgctcctgg tgggcctcgg cgtcggcctg cgccttcgct ttcaccgctg ccaactccgt
26220gcgcaaactc tccgcttcgc gcctggtggc gtcgcgctcg ccgcgaagcg cctgcatttc
26280ctggttggcc gcgtccaggg tcttgcggct ctcttctttg aatgcgcggg cgtcctggtg
26340agcgtagtcc agctcggcgc gcagctcctg cgctcgacgc tccacctcgt cggcccgctg
26400cgtcgccagc gcggcccgct gctcggctcc tgccagggcg gtgcgtgctt cggccagggc
26460ttgccgctgg cgtgcggcca gctcggccgc ctcggcggcc tgctgctcta gcaatgtaac
26520gcgcgcctgg gcttcttcca gctcgcgggc ctgcgcctcg aaggcgtcgg ccagctcccc
26580gcgcacggct tccaactcgt tgcgctcacg atcccagccg gcttgcgctg cctgcaacga
26640ttcattggca agggcctggg cggcttgcca gagggcggcc acggcctggt tgccggcctg
26700ctgcaccgcg tccggcacct ggactgccag cggggcggcc tgcgccgtgc gctggcgtcg
26760ccattcgcgc atgccggcgc tggcgtcgtt catgttgacg cgggcggcct tacgcactgc
26820atccacggtc gggaagttct cccggtcgcc ttgctcgaac agctcgtccg cagccgcaaa
26880aatgcggtcg cgcgtctctt tgttcagttc catgttggct ccggtaattg gtaagaataa
26940taatactctt acctacctta tcagcgcaag agtttagctg aacagttctc gacttaacgg
27000caggtttttt agcggctgaa gggcaggcaa aaaaagcccc gcacggtcgg cgggggcaaa
27060gggtcagcgg gaaggggatt agcgggcgtc gggcttcttc atgcgtcggg gccgcgcttc
27120ttgggatgga gcacgacgaa gcgcgcacgc gcatcgtcct cggccctatc ggcccgcgtc
27180gcggtcagga acttgtcgcg cgctaggtcc tccctggtgg gcaccagggg catgaactcg
27240gcctgctcga tgtaggtcca ctccatgacc gcatcgcagt cgaggccgcg ttccttcacc
27300gtctcttgca ggtcgcggta cgcccgctcg ttgagcggct ggtaacgggc caattggtcg
27360taaatggctg tcggccatga gcggcctttc ctgttgagcc agcagccgac gacgaagccg
27420gcaatgcagg cccctggcac aaccaggccg acgccggggg caggggatgg cagcagctcg
27480ccaaccagga accccgccgc gatgatgccg atgccggtca accagccctt gaaactatcc
27540ggccccgaaa cacccctgcg cattgcctgg atgctgcgcc ggatagcttg caacatcagg
27600agccgtttct tttgttcgtc agtcatggtc cgccctcacc agttgttcgt atcggtgtcg
27660gacgaactga aatcgcaaga gctgccggta tcggtccagc cgctgtccgt gtcgctgctg
27720ccgaagcacg gcgaggggtc cgcgaacgcc gcagacggcg tatccggccg cagcgcatcg
27780cccagcatgg ccccggtcag cgagccgccg gccaggtagc ccagcatggt gctgttggtc
27840gccccggcca ccagggccga cgtgacgaaa tcgccgtcat tccctctgga ttgttcgctg
27900ctcggcgggg cagtgcgccg cgccggcggc gtcgtggatg gctcgggttg gctggcctgc
27960gacggccggc gaaaggtgcg cagcagctcg ttatcgaccg gctgcggcgt cggggccgcc
28020gccttgcgct gcggtcggtg ttccttcttc ggctcgcgca gcttgaacag catgatcgcg
28080gaaaccagca gcaacgccgc gcctacgcct cccgcgatgt agaacagcat cggattcatt
28140cttcggtcct ccttgtagcg gaaccgttgt ctgtgcggcg cgggtggccc gcgccgctgt
28200ctttggggat cagccctcga tgagcgcgac cagtttcacg tcggcaaggt tcgcctcgaa
28260ctcctggccg tcgtcctcgt acttcaacca ggcatagcct tccgccggcg gccgacggtt
28320gaggataagg cgggcagggc gctcgtcgtg ctcgacctgg acgatggcct ttttcagctt
28380gtccgggtcc ggctccttcg cgcccttttc cttggcgtcc ttaccgtcct ggtcgccgtc
28440ctcgccgtcc tggccgtcgc cggcctccgc gtcacgctcg gcatcagtct ggccgttgaa
28500ggcatcgacg gtgttgggat cgcggccctt ctcgtccagg aactcgcgca gcagcttgac
28560cgtgccgcgc gtgatttcct gggtgtcgtc gtcaagccac gcctcgactt cctccgggcg
28620cttcttgaag gccgtcacca gctcgttcac cacggtcacg tcgcgcacgc ggccggtgtt
28680gaacgcatcg gcgatcttct ccggcaggtc cagcagcgtg acgtgctggg tgatgaacgc
28740cggcgacttg ccgatttcct tggcgatatc gcctttcttc ttgcccttcg ccagctcgcg
28800gccaatgaag tcggcaattt cgcgcggggt cagctcgttg cgttgcaggt tctcgataac
28860ctggtcggct tcgttgtagt cgttgtcgat gaacgccggg atggacttct tgccggccca
28920cttcgagcca cggtagcggc gggcgccgtg attgatgata tagcggcccg gctgctcctg
28980gttctcgcgc accgaaatgg gtgacttcac cccgcgctct ttgatcgtgg caccgatttc
29040cgcgatgctc tccggggaaa agccggggtt gtcggccgtc cgcggctgat gcggatcttc
29100gtcgatcagg tccaggtcca gctcgatagg gccggaaccg ccctgagacg ccgcaggagc
29160gtccaggagg ctcgacaggt cgccgatgct atccaacccc aggccggacg gctgcgccgc
29220gcctgcggct tcctgagcgg ccgcagcggt gtttttcttg gtggtcttgg cttgagccgc
29280agtcattggg aaatctccat cttcgtgaac acgtaatcag ccagggcgcg aacctctttc
29340gatgccttgc gcgcggccgt tttcttgatc ttccagaccg gcacaccgga tgcgagggca
29400tcggcgatgc tgctgcgcag gccaacggtg gccggaatca tcatcttggg gtacgcggcc
29460agcagctcgg cttggtggcg cgcgtggcgc ggattccgcg catcgacctt gctgggcacc
29520atgccaagga attgcagctt ggcgttcttc tggcgcacgt tcgcaatggt cgtgaccatc
29580ttcttgatgc cctggatgct gtacgcctca agctcgatgg gggacagcac atagtcggcc
29640gcgaagaggg cggccgccag gccgacgcca agggtcgggg ccgtgtcgat caggcacacg
29700tcgaagcctt ggttcgccag ggccttgatg ttcgccccga acagctcgcg ggcgtcgtcc
29760agcgacagcc gttcggcgtt cgccagtacc gggttggact cgatgagggc gaggcgcgcg
29820gcctggccgt cgccggctgc gggtgcggtt tcggtccagc cgccggcagg gacagcgccg
29880aacagcttgc ttgcatgcag gccggtagca aagtccttga gcgtgtagga cgcattgccc
29940tgggggtcca ggtcgatcac ggcaacccgc aagccgcgct cgaaaaagtc gaaggcaaga
30000tgcacaaggg tcgaagtctt gccgacgccg cctttctggt tggccgtgac caaagttttc
30060atcgtttggt ttcctgtttt ttcttggcgt ccgcttccca cttccggacg atgtacgcct
30120gatgttccgg cagaaccgcc gttacccgcg cgtacccctc gggcaagttc ttgtcctcga
30180acgcggccca cacgcgatgc accgcttgcg acactgcgcc cctggtcagt cccagcgacg
30240ttgcgaacgt cgcctgtggc ttcccatcga ctaagacgcc ccgcgctatc tcgatggtct
30300gctgccccac ttccagcccc tggatcgcct cctggaactg gctttcggta agccgtttct
30360tcatggataa cacccataat ttgctccgcg ccttggttga acatagcggt gacagccgcc
30420agcacatgag agaagtttag ctaaacattt ctcgcacgtc aacaccttta gccgctaaaa
30480ctcgtccttg gcgtaacaaa acaaaagccc ggaaaccggg ctttcgtctc ttgccgctta
30540tggctctgca cccggctcca tcaccaacag gtcgcgcacg cgcttcactc ggttgcggat
30600cgacactgcc agcccaacaa agccggttgc cgccgccgcc aggatcgcgc cgatgatgcc
30660ggccacaccg gccatcgccc accaggtcgc cgccttccgg ttccattcct gctggtactg
30720cttcgcaatg ctggacctcg gctcaccata ggctgaccgc tcgatggcgt atgccgcttc
30780tccccttggc gtaaaaccca gcgccgcagg cggcattgcc atgctgcccg ccgctttccc
30840gaccacgacg cgcgcaccag gcttgcggtc cagaccttcg gccacggcga gctgcgcaag
30900gacataatca gccgccgact tggctccacg cgcctcgatc agctcttgca ctcgcgcgaa
30960atccttggcc tccacggccg ccatgaatcg cgcacgcggc gaaggctccg cagggccggc
31020gtcgtgatcg ccgccgagaa tgcccttcac caagttcgac gacacgaaaa tcatgctgac
31080ggctatcacc atcatgcaga cggatcgcac gaacccgctg aattgaacac gagcacggca
31140cccgcgacca ctatgccaag aatgcccaag gtaaaaattg ccggccccgc catgaagtcc
31200gtgaatgccc cgacggccga agtgaagggc aggccgccac ccaggccgcc gccctcactg
31260cccggcacct ggtcgctgaa tgtcgatgcc agcacctgcg gcacgtcaat gcttccgggc
31320gtcgcgctcg ggctgatcgc ccatcccgtt actgccccga tcccggcaat ggcaaggact
31380gccagcgctg ccatttttgg ggtgaggccg ttcgcggccg aggggcgcag cccctggggg
31440gatgggaggc ccgcgttagc gggccgggag ggttcgagaa gggggggcac cccccttcgg
31500cgtgcgcggt cacgcgcaca gggcgcagcc ctggttaaaa acaaggttta taaatattgg
31560tttaaaagca ggttaaaaga caggttagcg gtggccgaaa aacgggcgga aacccttgca
31620aatgctggat tttctgcctg tggacagccc ctcaaatgtc aataggtgcg cccctcatct
31680gtcagcactc tgcccctcaa gtgtcaagga tcgcgcccct catctgtcag tagtcgcgcc
31740cctcaagtgt caataccgca gggcacttat ccccaggctt gtccacatca tctgtgggaa
31800actcgcgtaa aatcaggcgt tttcgccgat ttgcgaggct ggccagctcc acgtcgccgg
31860ccgaaatcga gcctgcccct catctgtcaa cgccgcgccg ggtgagtcgg cccctcaagt
31920gtcaacgtcc gcccctcatc tgtcagtgag ggccaagttt tccgcgaggt atccacaacg
31980ccggcggccg cggtgtctcg cacacggctt cgacggcgtt tctggcgcgt ttgcagggcc
32040atagacggcc gccagcccag cggcgagggc aaccagcccg gtgagcgtcg gaaaggcgct
32100ggaagccccg tagcgacgcg gagaggggcg agacaagcca agggcgcagg ctcgatgcgc
32160agcacgacat agccggttct cgcaaggacg agaatttccc tgcggtgccc ctcaagtgtc
32220aatgaaagtt tccaacgcga gccattcgcg agagccttga gtccacgcta gatgagagct
32280ttgttgtagg tggaccagtt ggtgattttg aacttttgct ttgccacgga acggtctgcg
32340ttgtcgggaa gatgcgtgat ctgatccttc aactcagcaa aagttcgatt tattcaacaa
32400agccacgttg tgtctcaaaa tctctgatgt tacattgcac aagataaaaa tatatcatca
32460tgaacaataa aactgtctgc ttacataaac agtaatacaa ggggtgttat gagccatatt
32520caacgggaaa cgtcttgctc gactctagag ctcgttcctc gaggcctcga ggcctcgagg
32580aacggtacct gcggggaagc ttacaataat gtgtgttgtt aagtcttgtt gcctgtcatc
32640gtctgactga ctttcgtcat aaatcccggc ctccgtaacc cagctttggg caagctcacg
32700gatttgatcc ggcggaacgg gaatatcgag atgccgggct gaacgctgca gttccagctt
32760tccctttcgg gacaggtact ccagctgatt gattatctgc tgaagggtct tggttccacc
32820tcctggcaca atgcgaatga ttacttgagc gcgatcgggc atccaatttt ctcccgtcag
32880gtgcgtggtc aagtgctaca aggcaccttt cagtaacgag cgaccgtcga tccgtcgccg
32940ggatacggac aaaatggagc gcagtagtcc atcgagggcg gcgaaagcct cgccaaaagc
33000aatacgttca tctcgcacag cctccagatc cgatcgaggg tcttcggcgt aggcagatag
33060aagcatggat acattgcttg agagtattcc gatggactga agtatggctt ccatcttttc
33120tcgtgtgtct gcatctattt cgagaaagcc cccgatgcgg cgcaccgcaa cgcgaattgc
33180catactatcc gaaagtccca gcaggcgcgc ttgataggaa aaggtttcat actcggccga
33240tcgcagacgg gcactcacga ccttgaaccc ttcaactttc agggatcgat gctggttgat
33300ggtagtctca ctcgacgtgg ctctggtgtg ttttgacata gcttcctcca aagaaagcgg
33360aaggtctgga tactccagca cgaaatgtgc ccgggtagac ggatggaagt ctagccctgc
33420tcaatatgaa atcaacagta catttacagt caatactgaa tatacttgct acatttgcaa
33480ttgtcttata acgaatgtga aataaaaata gtgtaacaac gcttttactc atcgataatc
33540acaaaaacat ttatacgaac aaaaatacaa atgcactccg gtttcacagg ataggcggga
33600tcagaatatg caacttttga cgttttgttc tttcaaaggg ggtgctggca aaaccaccgc
33660actcatgggc ctttgcgctg ctttggcaaa tgacggtaaa cgagtggccc tctttgatgc
33720cgacgaaaac cggcctctga cgcgatggag agaaaacgcc ttacaaagca gtactgggat
33780cctcgctgtg aagtctattc cgccgacgaa atgccccttc ttgaagcagc ctatgaaaat
33840gccgagctcg aaggatttga ttatgcgttg gccgatacgc gtggcggctc gagcgagctc
33900aacaacacaa tcatcgctag ctcaaacctg cttctgatcc ccaccatgct aacgccgctc
33960gacatcgatg aggcactatc tacctaccgc tacgtcatcg agctgctgtt gagtgaaaat
34020ttggcaattc ctacagctgt tttgcgccaa cgcgtcccgg tcggccgatt gacaacatcg
34080caacgcagga tgtcagagac gctagagagc cttccagttg taccgtctcc catgcatgaa
34140agagatgcat ttgccgcgat gaaagaacgc ggcatgttgc atcttacatt actaaacacg
34200ggaactgatc cgacgatgcg cctcatagag aggaatcttc ggattgcgat ggaggaagtc
34260gtggtcattt cgaaactgat cagcaaaatc ttggaggctt gaagatggca attcgcaagc
34320ccgcattgtc ggtcggcgaa gcacggcggc ttgctggtgc tcgacccgag atccaccatc
34380ccaacccgac acttgttccc cagaagctgg acctccagca cttgcctgaa aaagccgacg
34440agaaagacca gcaacgtgag cctctcgtcg ccgatcacat ttacagtccc gatcgacaac
34500ttaagctaac tgtggatgcc cttagtccac ctccgtcccc gaaaaagctc caggtttttc
34560tttcagcgcg accgcccgcg cctcaagtgt cgaaaacata tgacaacctc gttcggcaat
34620acagtccctc gaagtcgcta caaatgattt taaggcgcgc gttggacgat ttcgaaagca
34680tgctggcaga tggatcattt cgcgtggccc cgaaaagtta tccgatccct tcaactacag
34740aaaaatccgt tctcgttcag acctcacgca tgttcccggt tgcgttgctc gaggtcgctc
34800gaagtcattt tgatccgttg gggttggaga ccgctcgagc tttcggccac aagctggcta
34860ccgccgcgct cgcgtcattc tttgctggag agaagccatc gagcaattgg tgaagaggga
34920cctatcggaa cccctcacca aatattgagt gtaggtttga ggccgctggc cgcgtcctca
34980gtcacctttt gagccagata attaagagcc aaatgcaatt ggctcaggct gccatcgtcc
35040ccccgtgcga aacctgcacg tccgcgtcaa agaaataacc ggcacctctt gctgttttta
35100tcagttgagg gcttgacgga tccgcctcaa gtttgcggcg cagccgcaaa atgagaacat
35160ctatactcct gtcgtaaacc tcctcgtcgc gtactcgact ggcaatgaga agttgctcgc
35220gcgatagaac gtcgcggggt ttctctaaaa acgcgaggag aagattgaac tcacctgccg
35280taagtttcac ctcaccgcca gcttcggaca tcaagcgacg ttgcctgaga ttaagtgtcc
35340agtcagtaaa acaaaaagac cgtcggtctt tggagcggac aacgttgggg cgcacgcgca
35400aggcaacccg aatgcgtgca agaaactctc tcgtactaaa cggcttagcg ataaaatcac
35460ttgctcctag ctcgagtgca acaactttat ccgtctcctc aaggcggtcg ccactgataa
35520ttatgattgg aatatcagac tttgccgcca gatttcgaac gatctcaagc ccatcttcac
35580gacctaaatt tagatcaaca accacgacat cgaccgtcgc ggaagagagt actctagtga
35640actgggtgct gtcggctacc gcggtcactt tgaaggcgtg gatcgtaagg tattcgataa
35700taagatgccg catagcgaca tcgtcatcga taagaagaac gtgtttcaac ggctcacctt
35760tcaatctaaa atctgaaccc ttgttcacag cgcttgagaa attttcacgt gaaggatgta
35820caatcatctc cagctaaatg ggcagttcgt cagaattgcg gctgaccgcg gatgacgaaa
35880atgcgaacca agtatttcaa ttttatgaca aaagttctca atcgttgtta caagtgaaac
35940gcttcgaggt tacagctact attgattaag gagatcgcct atggtctcgc cccggcgtcg
36000tgcgtccgcc gcgagccaga tctcgcctac ttcataaacg tcctcatagg cacggaatgg
36060aatgatgaca tcgatcgccg tagagagcat gtcaatcagt gtgcgatctt ccaagctagc
36120accttgggcg ctacttttga caagggaaaa cagtttcttg aatccttgga ttggattcgc
36180gccgtgtatt gttgaaatcg atcccggatg tcccgagacg acttcactca gataagccca
36240tgctgcatcg tcgcgcatct cgccaagcaa tatccggtcc ggccgcatac gcagacttgc
36300ttggagcaag tgctcggcgc tcacagcacc cagcccagca ccgttcttgg agtagagtag
36360tctaacatga ttatcgtgtg gaatgacgag ttcgagcgta tcttctatgg tgattagcct
36420ttcctggggg gggatggcgc tgatcaaggt cttgctcatt gttgtcttgc cgcttccggt
36480agggccacat agcaacatcg tcagtcggct gacgacgcat gcgtgcagaa acgcttccaa
36540atccccgttg tcaaaatgct gaaggatagc ttcatcatcc tgattttggc gtttccttcg
36600tgtctgccac tggttccacc tcgaagcatc ataacgggag gagacttctt taagaccaga
36660aacacgcgag cttggccgtc gaatggtcaa gctgacggtg cccgagggaa cggtcggcgg
36720cagacagatt tgtagtcgtt caccaccagg aagttcagtg gcgcagaggg ggttacgtgg
36780tccgacatcc tgctttctca gcgcgcccgc taaaatagcg atatcttcaa gatcatcata
36840agagacgggc aaaggcatct tggtaaaaat gccggcttgg cgcacaaatg cctctccagg
36900tcgattgatc gcaatttctt cagtcttcgg gtcatcgagc cattccaaaa tcggcttcag
36960aagaaagcgt agttgcggat ccacttccat ttacaatgta tcctatctct aagcggaaat
37020ttgaattcat taagagcggc ggttcctccc ccgcgtggcg ccgccagtca ggcggagctg
37080gtaaacacca aagaaatcga ggtcccgtgc tacgaaaatg gaaacggtgt caccctgatt
37140cttcttcagg gttggcggta tgttgatggt tgccttaagg gctgtctcag ttgtctgctc
37200accgttattt tgaaagctgt tgaagctcat cccgccaccc gagctgccgg cgtaggtgct
37260agctgcctgg aaggcgcctt gaacaacact caagagcata gctccgctaa aacgctgcca
37320gaagtggctg tcgaccgagc ccggcaatcc tgagcgaccg agttcgtccg cgcttggcga
37380tgttaacgag atcatcgcat ggtcaggtgt ctcggcgcga tcccacaaca caaaaacgcg
37440cccatctccc tgttgcaagc cacgctgtat ttcgccaaca acggtggtgc cacgatcaag
37500aagcacgata ttgttcgttg ttccacgaat atcctgaggc aagacacact ttacatagcc
37560tgccaaattt gtgtcgattg cggtttgcaa gatgcacgga attattgtcc cttgcgttac
37620cataaaatcg gggtgcggca agagcgtggc gctgctgggc tgcagctcgg tgggtttcat
37680acgtatcgac aaatcgttct cgccggacac ttcgccattc ggcaaggagt tgtcgtcacg
37740cttgccttct tgtcttcggc ccgtgtcgcc ctgaatggcg cgtttgctga ccccttgatc
37800gccgctgcta tatgcaaaaa tcggtgtttc ttccggccgt ggctcatgcc gctccggttc
37860gcccctcggc ggtagaggag cagcaggctg aacagcctct tgaaccgctg gaggatccgg
37920cggcacctca atcggagctg gatgaaatgg cttggtgttt gttgcgatca aagttgacgg
37980cgatgcgttc tcattcacct tcttttggcg cccacctagc caaatgaggc ttaatgataa
38040cgcgagaacg acacctccga cgatcaattt ctgagacccc gaaagacgcc ggcgatgttt
38100gtcggagacc agggatccag atgcatcaac ctcatgtgcc gcttgctgac tatcgttatt
38160catcccttcg cccccttcag gacgcgtttc acatcgggcc tcaccgtgcc cgtttgcggc
38220ctttggccaa cgggatcgta agcggtgttc cagatacata gtactgtgtg gccatccctc
38280agacgccaac ctcgggaaac cgaagaaatc tcgacatcgc tccctttaac tgaatagttg
38340gcaacagctt ccttgccatc aggattgatg gtgtagatgg agggtatgcg tacattgccc
38400ggaaagtgga ataccgtcgt aaatccattg tcgaagactt cgagtggcaa cagcgaacga
38460tcgccttggg cgacgtagtg ccaattactg tccgccgcac caagggctgt gacaggctga
38520tccaataaat tctcagcttt ccgttgatat tgtgcttccg cgtgtagtct gtccacaaca
38580gccttctgtt gtgcctccct tcgccgagcc gccgcatcgt cggcggggta ggcgaattgg
38640acgctgtaat agagatcggg ctgctcttta tcgaggtggg acagagtctt ggaacttata
38700ctgaaaacat aacggcgcat cccggagtcg cttgcggtta gcacgattac tggctgaggc
38760gtgaggacct ggcttgcctt gaaaaataga taatttcccc gcggtagggc tgctagatct
38820ttgctatttg aaacggcaac cgctgtcacc gtttcgttcg tggcgaatgt tacgaccaaa
38880gtagctccaa ccgccgtcga gaggcgcacc acttgatcgg gattgtaagc caaataacgc
38940atgcgcggat ctagcttgcc cgccattgga gtgtcttcag cctccgcacc agtcgcagcg
39000gcaaataaac atgctaaaat gaaaagtgct tttctgatca tggttcgctg tggcctacgt
39060ttgaaacggt atcttccgat gtctgatagg aggtgacaac cagacctgcc gggttggtta
39120gtctcaatct gccgggcaag ctggtcacct tttcgtagcg aactgtcgcg gtccacgtac
39180tcaccacagg cattttgccg tcaacgacga gggtcctttt atagcgaatt tgctgcgtgc
39240ttggagttac atcatttgaa gcgatgtgct cgacctccac cctgccgcgt ttgccaagaa
39300tgacttgagg cgaactggga ttgggatagt tgaagaattg ctggtaatcc tggcgcactg
39360ttggggcact gaagttcgat accaggtcgt aggcgtactg agcggtgtcg gcatcataac
39420tctcgcgcag gcgaacgtac tcccacaatg aggcgttaac gacggcctcc tcttgagttg
39480caggcaatcg cgagacagac acctcgctgt caacggtgcc gtccggccgt atccatagat
39540atacgggcac aagcctgctc aacggcacca ttgtggctat agcgaacgct tgagcaacat
39600ttcccaaaat cgcgatagct gcgacagctg caatgagttt ggagagacgt cgcgccgatt
39660tcgctcgcgc ggtttgaaag gcttctactt ccttatagtg ctcggcaagg ctttcgcgcg
39720ccactagcat ggcatattca ggccccgtca tagcgtccac ccgaattgcc gagctgaaga
39780tctgacggag taggctgcca tcgccccaca ttcagcggga agatcgggcc tttgcagctc
39840gctaatgtgt cgtttgtctg gcagccgctc aaagcgacaa ctaggcacag caggcaatac
39900ttcatagaat tctccattga ggcgaatttt tgcgcgacct agcctcgctc aacctgagcg
39960aagcgacggt acaagctgct ggcagattgg gttgcgccgc tccagtaact gcctccaatg
40020ttgccggcga tcgccggcaa agcgacaatg agcgcatccc ctgtcagaaa aaacatatcg
40080agttcgtaaa gaccaatgat cttggccgcg gtcgtaccgg cgaaggtgat tacaccaagc
40140ataagggtga gcgcagtcgc ttcggttagg atgacgatcg ttgccacgag gtttaagagg
40200agaagcaaga gaccgtaggt gataagttgc ccgatccact tagctgcgat gtcccgcgtg
40260cgatcaaaaa tatatccgac gaggatcaga ggcccgatcg cgagaagcac tttcgtgaga
40320attccaacgg cgtcgtaaac tccgaaggca gaccagagcg tgccgtaaag gacccactgt
40380gccccttgga aagcaaggat gtcctggtcg ttcatcggac cgatttcgga tgcgattttc
40440tgaaaaacgg cctgggtcac ggcgaacatt gtatccaact gtgccggaac agtctgcaga
40500ggcaagccgg ttacactaaa ctgctgaaca aagtttggga ccgtcttttc gaagatggaa
40560accacatagt cttggtagtt agcctgccca acaattagag caacaacgat ggtgaccgtg
40620atcacccgag tgataccgct acgggtatcg acttcgccgc gtatgactaa aataccctga
40680acaataatcc aaagagtgac acaggcgatc aatggcgcac tcaccgcctc ctggatagtc
40740tcaagcatcg agtccaagcc tgtcgtgaag gctacatcga agatcgtatg aatggccgta
40800aacggcgccg gaatcgtgaa attcatcgat tggacctgaa cttgactggt ttgtcgcata
40860atgttggata aaatgagctc gcattcggcg aggatgcggg cggatgaaca aatcgcccag
40920ccttagggga gggcaccaaa gatgacagcg gtcttttgat gctccttgcg ttgagcggcc
40980gcctcttccg cctcgtgaag gccggcctgc gcggtagtca tcgttaatag gcttgtcgcc
41040tgtacatttt gaatcattgc gtcatggatc tgcttgagaa gcaaaccatt ggtcacggtt
41100gcctgcatga tattgcgaga tcgggaaagc tgagcagacg tatcagcatt cgccgtcaag
41160cgtttgtcca tcgtttccag attgtcagcc gcaatgccag cgctgtttgc ggaaccggtg
41220atctgcgatc gcaacaggtc cgcttcagca tcactaccca cgactgcacg atctgtatcg
41280ctggtgatcg cacgtgccgt ggtcgacatt ggcattcgcg gcgaaaacat ttcattgtct
41340aggtccttcg tcgaaggata ctgatttttc tggttgagcg aagtcagtag tccagtaacg
41400ccgtaggccg acgtcaacat cgtaaccatc gctatagtct gagtgagatt ctccgcagtc
41460gcgagcgcag tcgcgagcgt ctcagcctcc gttgccgggt cgctaacaac aaactgcgcc
41520cgcgcgggct gaatatatag aaagctgcag gtcaaaactg ttgcaataag ttgcgtcgtc
41580ttcatcgttt cctaccttat caatcttctg cctcgtggtg acgggccatg aattcgctga
41640gccagccaga tgagttgcct tcttgtgcct cgcgtagtcg agttgcaaag cgcaccgtgt
41700tggcacgccc cgaaagcacg gcgacatatt cacgcatatc ccgcagatca aattcgcaga
41760tgacgcttcc actttctcgt ttaagaagaa acttacggct gccgaccgtc atgtcttcac
41820ggatcgcctg aaattccttt tcggtacatt tcagtccatc gacataagcc gatcgatctg
41880cggttggtga tggatagaaa atcttcgtca tacattgcgc aaccaagctg gctcctagcg
41940gcgattccag aacatgctct ggttgctgcg ttgccagtat tagcatcccg ttgttttttc
42000gaacggtcag gaggaatttg tcgacgacag tcgaaaattt agggtttaac aaataggcgc
42060gaaactcatc gcagctcatc acaaaacggc ggccgtcgat catggctcca atccgatgca
42120ggagatatgc tgcagcggga gcgcatactt cctcgtattc gagaagatgc gtcatgtcga
42180agccggtaat cgacggatct aactttactt cgtcaacttc gccgtcaaat gcccagccaa
42240gcgcatggcc ccggcaccag cgttggagcc gcgctcctgc gccttcggcg ggcccatgca
42300acaaaaattc acgtaacccc gcgattgaac gcatttgtgg atcaaacgag agctgacgat
42360ggataccacg gaccagacgg cggttctctt ccggagaaat cccaccccga ccatcactct
42420cgatgagagc cacgatccat tcgcgcagaa aatcgtgtga ggctgctgtg ttttctaggc
42480cacgcaacgg cgccaacccg ctgggtgtgc ctctgtgaag tgccaaatat gttcctcctg
42540tggcgcgaac cagcaattcg ccaccccggt ccttgtcaaa gaacacgacc gtacctgcac
42600ggtcgaccat gctctgttcg agcatggcta gaacaaacat catgagcgtc gtcttacccc
42660tcccgatagg cccgaatatt gccgtcatgc caacatcgtg ctcatgcggg atatagtcga
42720aaggcgttcc gccattggta cgaaatcggg caatcgcgtt gccccagtgg cctgagctgg
42780cgccctctgg aaagttttcg aaagagacaa accctgcgaa attgcgtgaa gtgattgcgc
42840cagggcgtgt gcgccactta aaattccccg gcaattggga ccaataggcc gcttccatac
42900caataccttc ttggacaacc acggcacctg catccgccat tcgtgtccga gcccgcgcgc
42960ccctgtcccc aagactattg agatcgtctg catagacgca aaggctcaaa tgatgtgagc
43020ccataacgaa ttcgttgctc gcaagtgcgt cctcagcctc ggataatttg ccgatttgag
43080tcacggcttt atcgccggaa ctcagcatct ggctcgattt gaggctaagt ttcgcgtgcg
43140cttgcgggcg agtcaggaac gaaaaactct gcgtgagaac aagtggaaaa tcgagggata
43200gcagcgcgtt gagcatgccc ggccgtgttt ttgcagggta ttcgcgaaac gaatagatgg
43260atccaacgta actgtctttt ggcgttctga tctcgagtcc tcgcttgccg caaatgactc
43320tgtcggtata aatcgaagcg ccgagtgagc cgctgacgac cggaaccggt gtgaaccgac
43380cagtcatgat caaccgtagc gcttcgccaa tttcggtgaa gagcacaccc tgcttctcgc
43440ggatgccaag acgatgcagg ccatacgctt taagagagcc agcgacaaca tgccaaagat
43500cttccatgtt cctgatctgg cccgtgagat cgttttccct ttttccgctt agcttggtga
43560acctcctctt taccttccct aaagccgcct gtgggtagac aatcaacgta aggaagtgtt
43620cattgcggag gagttggccg gagagcacgc gctgttcaaa agcttcgttc aggctagcgg
43680cgaaaacact acggaagtgt cgcggcgccg atgatggcac gtcggcatga cgtacgaggt
43740gagcatatat tgacacatga tcatcagcga tattgcgcaa cagcgtgttg aacgcacgac
43800aacgcgcatt gcgcatttca gtttcctcaa gctcgaatgc aacgccatca attctcgcaa
43860tggtcatgat cgatccgtct tcaagaagga cgatatggtc gctgaggtgg ccaatataag
43920ggagatagat ctcaccggat ctttcggtcg ttccactcgc gccgagcatc acaccattcc
43980tctccctcgt gggggaaccc taattggatt tgggctaaca gtagcgcccc cccaaactgc
44040actatcaatg cttcttcccg cggtccgcaa aaatagcagg acgacgctcg ccgcattgta
44100gtctcgctcc acgatgagcc gggctgcaaa ccataacggc acgagaacga cttcgtagag
44160cgggttctga acgataacga tgacaaagcc ggcgaacatc atgaataacc ctgccaatgt
44220cagtggcacc ccaagaaaca atgcgggccg tgtggctgcg aggtaaaggg tcgattcttc
44280caaacgatca gccatcaact accgccagtg agcgtttggc cgaggaagct cgccccaaac
44340atgataacaa tgccgccgac gacgccggca accagcccaa gcgaagcccg cccgaacatc
44400caggagatcc cgatagcgac aatgccgaga acagcgagtg actggccgaa cggaccaagg
44460ataaacgtgc atatattgtt aaccattgtg gcggggtcag tgccgccacc cgcagattgc
44520gctgcggcgg gtccggatga ggaaatgctc catgcaattg caccgcacaa gcttggggcg
44580cagctcgata tcacgcgcat catcgcattc gagagcgaga ggcgatttag atgtaaacgg
44640tatctctcaa agcatcgcat caatgcgcac ctccttagta taagtcgaat aagacttgat
44700tgtcgtctgc ggatttgccg ttgtcctggt gtggcggtgg cggagcgatt aaaccgccag
44760cgccatcctc ctgcgagcgg cgctgatatg acccccaaac atcccacgtc tcttcggatt
44820ttagcgcctc gtgatcgtct tttggaggct cgattaacgc gggcaccagc gattgagcag
44880ctgtttcaac ttttcgcacg tagccgtttg caaaaccgcc gatgaaatta ccggtgttgt
44940aagcggagat cgcccgacga agcgcaaatt gcttctcgtc aatcgtttcg ccgcctgcat
45000aacgactttt cagcatgttt gcagcggcag ataatgatgt gcacgcctgg agcgcaccgt
45060caggtgtcag accgagcata gaaaaatttc gagagtttat ttgcatgagg ccaacatcca
45120gcgaatgccg tgcatcgaga cggtgcctga cgacttgggt tgcttggctg tgatcttgcc
45180agtgaagcgt ttcgccggtc gtgttgtcat gaatcgctaa aggatcaaag cgactctcca
45240ccttagctat cgccgcaagc gtagatgtcg caactgatgg ggcacacttg cgagcaacat
45300ggtcaaactc agcagatgag agtggcgtgg caaggctcga cgaacagaag gagaccatca
45360aggcaagaga aagcgacccc gatctcttaa gcatacctta tctccttagc tcgcaactaa
45420caccgcctct cccgttggaa gaagtgcgtt gttttatgtt gaagattatc gggagggtcg
45480gttactcgaa aattttcaat tgcttcttta tgatttcaat tgaagcgaga aacctcgccc
45540ggcgtcttgg aacgcaacat ggaccgagaa ccgcgcatcc atgactaagc aaccggatcg
45600acctattcag gccgcagttg gtcaggtcag gctcagaacg aaaatgctcg gcgaggttac
45660gctgtctgta aacccattcg atgaacggga agcttccttc cgattgctct tggcaggaat
45720attggcccat gcctgcttgc gctttgcaaa tgctcttatc gcgttggtat catatgcctt
45780gtccgccagc agaaacgcac tctaagcgat tatttgtaaa aatgtttcgg tcatgcggcg
45840gtcatgggct tgacccgctg tcagcgcaag acggatcggt caaccgtcgg catcgacaac
45900agcgtgaatc ttggtggtca aaccgccacg ggaacgtccc atacagccat cgtcttgatc
45960ccgctgtttc ccgtcgccgc atgttggtgg acgcggacac aggaactgtc aatcatgacg
46020acattctatc gaaagccttg gaaatcacac tcagaatatg atcccagacg tctgcctcac
46080gccatcgtac aaagcgattg tagcaggttg tacaggaacc gtatcgatca ggaacgtctg
46140cccagggcgg gcccgtccgg aagcgccaca agatgacatt gatcacccgc gtcaacgcgc
46200ggcacgcgac gcggcttatt tgggaacaaa ggactgaaca acagtccatt cgaaatcggt
46260gacatcaaag cggggacggg ttatcagtgg cctccaagtc aagcctcaat gaatcaaaat
46320cagaccgatt tgcaaacctg atttatgagt gtgcggccta aatgatgaaa tcgtccttct
46380agatcgcctc cgtggtgtag caacacctcg cagtatcgcc gtgctgacct tggccaggga
46440attgactggc aagggtgctt tcacatgacc gctcttttgg ccgcgataga tgatttcgtt
46500gctgctttgg gcacgtagaa ggagagaagt catatcggag aaattcctcc tggcgcgaga
46560gcctgctcta tcgcgacggc atcccactgt cgggaacaga ccggatcatt cacgaggcga
46620aagtcgtcaa cacatgcgtt ataggcatct tcccttgaag gatgatcttg ttgctgccaa
46680tctggaggtg cggcagccgc aggcagatgc gatctcagcg caacttgcgg caaaacatct
46740cactcacctg aaaaccacta gcgagtctcg cgatcagacg aaggcctttt acttaacgac
46800acaatatccg atgtctgcat cacaggcgtc gctatcccag tcaatactaa agcggtgcag
46860gaactaaaga ttactgatga cttaggcgtg ccacgaggcc tgagacgacg cgcgtagaca
46920gttttttgaa atcattatca aagtgatggc ctccgctgaa gcctatcacc tctgcgccgg
46980tctgtcggag agatgggcaa gcattattac ggtcttcgcg cccgtacatg cattggacga
47040ttgcagggtc aatggatctg agatcatcca gaggattgcc gcccttacct tccgtttcga
47100gttggagcca gcccctaaat gagacgacat agtcgacttg atgtgacaat gccaagagag
47160agatttgctt aacccgattt ttttgctcaa gcgtaagcct attgaagctt gccggcatga
47220cgtccgcgcc gaaagaatat cctacaagta aaacattctg cacaccgaaa tgcttggtgt
47280agacatcgat tatgtgacca agatccttag cagtttcgct tggggaccgc tccgaccaga
47340aataccgaag tgaactgacg ccaatgacag gaatcccttc cgtctgcaga taggtaccat
47400cgatagatct gctgcctcgc gcgtttcggt gatgacggtg aaaacctctg acacatgcag
47460ctcccggaga cggtcacagc ttgtctgtaa gcggatgccg ggagcagaca agcccgtcag
47520ggcgcgtcag cgggtgttgg cgggtgtcgg ggcgcagcca tgacccagtc acgtagcgat
47580agcggagtgt atactggctt aactatgcgg catcagagca gattgtactg agagtgcacc
47640atatgcggtg tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcgctctt
47700ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag
47760ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca
47820tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt
47880tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc
47940gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct
48000ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg
48060tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca
48120agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta tccggtaact
48180atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca gccactggta
48240acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag tggtggccta
48300actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag ccagttacct
48360tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt agcggtggtt
48420tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa gatcctttga
48480tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg attttggtca
48540tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga agttttaaat
48600caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta atcagtgagg
48660cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc cccgtcgtgt
48720agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg ataccgcgag
48780acccacgctc accggctcca gatttatcag caataaacca gccagccgga agggccgagc
48840gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt tgccgggaag
48900ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttgccatt gctgcagggg
48960gggggggggg gggggacttc cattgttcat tccacggaca aaaacagaga aaggaaacga
49020cagaggccaa aaagcctcgc tttcagcacc tgtcgtttcc tttcttttca gagggtattt
49080taaataaaaa cattaagtta tgacgaagaa gaacggaaac gccttaaacc ggaaaatttt
49140cataaatagc gaaaacccgc gaggtcgccg ccccgtaacc tgtcggatca ccggaaagga
49200cccgtaaagt gataatgatt atcatctaca tatcacaacg tgcgtggagg ccatcaaacc
49260acgtcaaata atcaattatg acgcaggtat cgtattaatt gatctgcatc aacttaacgt
49320aaaaacaact tcagacaata caaatcagcg acactgaata cggggcaacc tcatgtcccc
49380cccccccccc cccctgcagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc
49440agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg
49500gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc
49560atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct
49620gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc
49680tcttgcccgg cgtcaacacg ggataatacc gcgccacata gcagaacttt aaaagtgctc
49740atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc
49800agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc
49860gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca
49920cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt
49980tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt
50040ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat tatcatgaca
50100ttaacctata aaaataggcg tatcacgagg ccctttcgtc ttcaagaatt ggtcgacgat
50160cttgctgcgt tcggatattt tcgtggagtt cccgccacag acccggattg aaggcgagat
50220ccagcaactc gcgccagatc atcctgtgac ggaactttgg cgcgtgatga ctggccagga
50280cgtcggccga aagagcgaca agcagatcac gcttttcgac agcgtcggat ttgcgatcga
50340ggatttttcg gcgctgcgct acgtccgcga ccgcgttgag ggatcaagcc acagcagccc
50400actcgacctt ctagccgacc cagacgagcc aagggatctt tttggaatgc tgctccgtcg
50460tcaggctttc cgacgtttgg gtggttgaac agaagtcatt atcgtacgga atgccaagca
50520ctcccgaggg gaaccctgtg gttggcatgc acatacaaat ggacgaacgg ataaaccttt
50580tcacgccctt ttaaatatcc gttattctaa taaacgctct tttctcttag gtttacccgc
50640caatatatcc tgtcaaacac tgatagttta aactgaaggc gggaaacgac aatctgatca
50700tgagcggaga attaagggag tcacgttatg acccccgccg atgacgcggg acaagccgtt
50760ttacgtttgg aactgacaga accgcaacgt tgaaggagcc actcagcaag ctggtacgat
50820tgtaatacga ctcactatag ggcgaattga gcgctgttta aacgctcttc aactggaaga
50880gcggttacta ccggttaagt gactagggtc
50910650751DNAArtificial SequencePINII terminator control vector
6acgtgaccct agtcacttag gttaccagag ctggtcacct ttgtccacca agatggaact
60gcggccgctc attaattaag tcaggcgcgc ctctagttga agacacgttc atgtcttcat
120cgtaagaaga cactcagtag tcttcggcca gaatggccat ctggattcag caggcctaga
180aggccattta aatcctgagg atctggtctt cctaaggacc cgggatatcg ctatcaactt
240tgtatagaaa agttgggccg aattcgagct cggtacggcc agaatggccc ggaccgggtt
300accgaattcg agctcggtac cactagtaag cttgccgcaa ttcgcaaaac acacctagac
360tagatttgtt ttgctaaccc aattgatatt aattatatat gattaatatt tatatgtata
420tggatttggt taatgaaatg catctggttc atcaaagaat tataaagaca cgtgacattc
480atttaggata agaaatatgg atgatctctt tctcttttat tcagataact agtaattaca
540cataacacac aactttgatg cccacattat agtgattagc atgtcactat gtgtgcatcc
600ttttatttca tacattaatt aagttggcca atccagaaga tggacaagtc tggatcttca
660ttgtttgcct ccctgctgcg gtttttcacc gaagttcatg ccagtccagc gtttttgcag
720cagaaaagcc gccgacttcg gtttgcggtc gcgagtgaag atccctttct tgttaccgcc
780aacgcgcaat atgccttgcg aggtcgcaaa atcggcgaaa ttccatacct gttcaccgac
840gacggcgctg acgcgatcaa agacgcggtg atacatatcc agccatgcac actgatactc
900ttcactccac atgtcggtgt acattgagtg cagcccggct aacgtatcca cgccgtattc
960ggtgatgata atcggctgat gcagtttctc ctgccaggcc agaagttctt tttccagtac
1020cttctctgcc gtttccaaat cgccgctttg gacataccat ccgtaataac ggttcaggca
1080cagcacatca aagagatcgc taatggtatc ggtgtgagcg tcgcagaaca ttacattgac
1140gcaggtgatc ggacgcgtcg ggtcgagttt acgcgttgct tccgccagtg gcgcgaaata
1200ttcccgtgca ccttgcggac gggtatccgg ttcgttggca atactccaca tcaccacgct
1260tgggtggttt ttgtcacgcg ctatcagctc tttaatcgcc tgtaagtgcg cttgctgagt
1320ttccccgttg actgcctctt cgctgtacag ttctttcggc ttgttgcccg cttcgaaacc
1380aatccctaaa gagaggttaa agccgacagc agcagtttca tcaatcacca cgatgccatg
1440ttcatctgcc cagtcgagca tctcttcagc gtaagggtaa tgcgaggtac ggtaggagtt
1500ggccccaatc cagtccatta atgcgtggtc gtgcaccatc agcacgttat cgaatccttt
1560gccacgcaag tccgcatctt catgacgacc aaagccagta aagtagaacg gtttgtggtt
1620aatcaggaac tgttggccct tcactgccac tgaccggatg ccgacgcgaa gcgggtagat
1680atcacactct gtctggcttt tggctgtgac gcacagttca tagagataac cttcacccgg
1740ttgccagagg tgcggattca ccacttgcaa agtcccgcta gtgccttgtc cagttgcaac
1800cacctgttga tccgcatcac gcagttcaac gctgacatca ccattggcca ccacctgcca
1860gtcaacagac gcgtggttac agtcttgcgc gacatgcgtc accacggtga tatcgtccac
1920ccaggtgttc ggcgtggtgt agagcattac gctgcgatgg attccggcat agttaaagaa
1980atcatggaag taagactgct ttttcttgcc gttttcgtcg gtaatcacca ttcccggcgg
2040gatagtctgc cagttcagtt cgttgttcac acaaacggtg atacctgcac atcaacaaat
2100tttggtcata tattagaaaa gttataaatt aaaatataca cacttataaa ctacagaaaa
2160gcaattgcta tatactacat tcttttattt tgaaaaaaat atttgaaata ttatattact
2220actaattaat gataattatt atatatatat caaaggtaga agcagaaact tacgtacact
2280tttcccggca ataacatacg gcgtgacatc ggcttcaaat ggcgtatagc cgccctgatg
2340ctccatcact tcctgattat tgacccacac tttgccgtaa tgagtgaccg catcgaaacg
2400cagcacgata cgctggcctg cccaaccttt cggtataaag acttcgcgct gataccagac
2460gttgcccgca taattacgaa tatctgcatc ggcgaactga tcgttaaaac tgcctggcac
2520agcaattgcc cggctttctt gtaacgcgct ttcccaccaa cgctgatcaa ttccacagtt
2580ttcgcgatcc agactgaatg cccacaggcc gtcgagtttt ttgatttcac gggttggggt
2640ttctacagga cggaccatgg tgtcgtgtgg atccaaattg tatgcaaggt gaatgacttt
2700cttttcgtaa actagatagg agtactcctc caggatgctt aacccgtatt gacgtacaga
2760ggtctatgat ccttttgttt ataaaggagc ttgtagttca gtcagtctta tacttcacga
2820tgcccatgtt tctatatagg atattatctt ggctttgtaa gtacttcacg caggttatgt
2880tctgtttcta ggatattatc ctcatacatg cgaagaacca atttttcccc cattctcttc
2940gggtactttt tcttgggtag gcatgctctc ttggaccaac tagcataaaa cataatcatt
3000tttccctaca gccttgacca gctataatcg aaatcatgct catttttcta agaaagactg
3060aatacagctc caatttaaac aatttaaatc ataaacttgt aactcaatta gagaaaagca
3120gagcccttcg gctcctatct aaaggaatta ccccatgaaa gccataaaaa cgaaccttgc
3180tctgatacca gacgggtcta cgctcgcgga actaggatct tgcgctctac tcgcacaaag
3240tgaactcgca caaagtgtgt ttcaagcaca gaagttttta tttctcaaat caggagtaaa
3300ctcgcgttgt ggtgcgtgtt tgcaacctga atacaaggct ccttatatag agagttgtgg
3360agctttctgg catcgttagg tggcatccac caataatgca gataagcatc atcacatgtc
3420tctggcctaa caactttgcg taagaatcct gcaaagttac taaaggtcat cgtgcgtgac
3480tagacaacgc acaccgacaa acttaaaata aagagacatt atactttgtc tcctctttac
3540ataaagtgag tggtatccag ctcactccgc atcttatcag tcttcacacc ggttggtatc
3600aacacgtggt aggggtccgc cacttccgct tcagtcatca ttactgatat ccagcagatc
3660tagagcatct tcaataagat attcttgttc tgcacgcaga ttttcttgct ccctcagtaa
3720ttcctcccac agtgagtctt ctgatatttc ttcaagtttc ttctcccatc tgatcttttc
3780ctgcacaaac gagtcaattt ggtctttcca gacccaagta aaacaagtgt tagtttcaca
3840ggagtaaaac tccctgtcag gatttctgga tgttctggag atcttcagtt ttgctggttt
3900attgcatcca catttgaaaa ccggctcttc acttagtgtt agcacattga tttgatgcaa
3960cctgtagcct ttgctcaacc agtcttcata tctttttaca acatcattaa ctctctgttt
4020tgcatcggtg tttcccttgt gaaatacctc ctccactgca ttgatcaaca caccttcaga
4080ttgatgcttt tccggatgga gaataatctt taccagtctt gacagagtgt ctgctaaaac
4140gttgtccttt ccgtcaatgt gttcaaactt aatctcaaga cctgtcccgg taatgtaatc
4200tgtgaaggca agccatctga ctcttgatgg tttatgatca ctgcttttct tgtaaaagct
4260cactattgct tgactgtcag ttctgattat gagctctttg taagcttggt cacccggtcc
4320gggcctagaa ggccagcttc ggccgccccg ggcaacttta ttatacaaag ttgatagata
4380tcggaccgat taaactttaa ttcggtccga agcttgcatg cctgcagtgc agcgtgaccc
4440ggtcgtgccc ctctctagag ataatgagca ttgcatgtct aagttataaa aaattaccac
4500atattttttt tgtcacactt gtttgaagtg cagtttatct atctttatac atatatttaa
4560actttactct acgaataata taatctatag tactacaata atatcagtgt tttagagaat
4620catataaatg aacagttaga catggtctaa aggacaattg agtattttga caacaggact
4680ctacagtttt atctttttag tgtgcatgtg ttctcctttt tttttgcaaa tagcttcacc
4740tatataatac ttcatccatt ttattagtac atccatttag ggtttagggt taatggtttt
4800tatagactaa tttttttagt acatctattt tattctattt tagcctctaa attaagaaaa
4860ctaaaactct attttagttt ttttatttaa taatttagat ataaaataga ataaaataaa
4920gtgactaaaa attaaacaaa taccctttaa gaaattaaaa aaactaagga aacatttttc
4980ttgtttcgag tagataatgc cagcctgtta aacgccgtcg acgagtctaa cggacaccaa
5040ccagcgaacc agcagcgtcg cgtcgggcca agcgaagcag acggcacggc atctctgtcg
5100ctgcctctgg acccctctcg agagttccgc tccaccgttg gacttgctcc gctgtcggca
5160tccagaaatt gcgtggcgga gcggcagacg tgagccggca cggcaggcgg cctcctcctc
5220ctctcacggc accggcagct acgggggatt cctttcccac cgctccttcg ctttcccttc
5280ctcgcccgcc gtaataaata gacaccccct ccacaccctc tttccccaac ctcgtgttgt
5340tcggagcgca cacacacaca accagatctc ccccaaatcc acccgtcggc acctccgctt
5400caaggtacgc cgctcgtcct cccccccccc cctctctacc ttctctagat cggcgttccg
5460gtccatgcat ggttagggcc cggtagttct acttctgttc atgtttgtgt tagatccgtg
5520tttgtgttag atccgtgctg ctagcgttcg tacacggatg cgacctgtac gtcagacacg
5580ttctgattgc taacttgcca gtgtttctct ttggggaatc ctgggatggc tctagccgtt
5640ccgcagacgg gatcgatttc atgatttttt ttgtttcgtt gcatagggtt tggtttgccc
5700ttttccttta tttcaatata tgccgtgcac ttgtttgtcg ggtcatcttt tcatgctttt
5760ttttgtcttg gttgtgatga tgtggtctgg ttgggcggtc gttctagatc ggagtagaat
5820tctgtttcaa actacctggt ggatttatta attttggatc tgtatgtgtg tgccatacat
5880attcatagtt acgaattgaa gatgatggat ggaaatatcg atctaggata ggtatacatg
5940ttgatgcggg ttttactgat gcatatacag agatgctttt tgttcgcttg gttgtgatga
6000tgtggtgtgg ttgggcggtc gttcattcgt tctagatcgg agtagaatac tgtttcaaac
6060tacctggtgt atttattaat tttggaactg tatgtgtgtg tcatacatct tcatagttac
6120gagtttaaga tggatggaaa tatcgatcta ggataggtat acatgttgat gtgggtttta
6180ctgatgcata tacatgatgg catatgcagc atctattcat atgctctaac cttgagtacc
6240tatctattat aataaacaag tatgttttat aattattttg atcttgatat acttggatga
6300tggcatatgc agcagctata tgtggatttt tttagccctg ccttcatacg ctatttattt
6360gcttggtact gtttcttttg tcgatgctca ccctgttgtt tggtgttact tctgcaggtc
6420gactttaact tagcctagga tccacacgac accatgtccc ccgagcgccg ccccgtcgag
6480atccgcccgg ccaccgccgc cgacatggcc gccgtgtgcg acatcgtgaa ccactacatc
6540gagacctcca ccgtgaactt ccgcaccgag ccgcagaccc cgcaggagtg gatcgacgac
6600ctggagcgcc tccaggaccg ctacccgtgg ctcgtggccg aggtggaggg cgtggtggcc
6660ggcatcgcct acgccggccc gtggaaggcc cgcaacgcct acgactggac cgtggagtcc
6720accgtgtacg tgtcccaccg ccaccagcgc ctcggcctcg gctccaccct ctacacccac
6780ctcctcaaga gcatggaggc ccagggcttc aagtccgtgg tggccgtgat cggcctcccg
6840aacgacccgt ccgtgcgcct ccacgaggcc ctcggctaca ccgcccgcgg caccctccgc
6900gccgccggct acaagcacgg cggctggcac gacgtcggct tctggcagcg cgacttcgag
6960ctgccggccc cgccgcgccc ggtgcgcccg gtgacgcaga tctgagtcga aacctagact
7020tgtccatctt ctggattggc caacttaatt aatgtatgaa ataaaaggat gcacacatag
7080tgacatgcta atcactataa tgtgggcatc aaagttgtgt gttatgtgta attactagtt
7140atctgaataa aagagaaaga gatcatccat atttcttatc ctaaatgaat gtcacgtgtc
7200tttataattc tttgatgaac cagatgcatt tcattaacca aatccatata catataaata
7260ttaatcatat ataattaata tcaattgggt tagcaaaaca aatctagtct aggtgtgttt
7320tgcgaatgcg gccgataagt gactagggtc acgtgaccct agtcacttag gtaccgagct
7380cgaattcatt ccgattaatc gtggcctctt gctcttcagg atgaagagct atgtttaaac
7440gtgcaagcgc tactagacaa ttcagtacat taaaaacgtc cgcaatgtgt tattaagttg
7500tctaagcgtc aatttgttta caccacaata tatcctgcca ccagccagcc aacagctccc
7560cgaccggcag ctcggcacaa aatcaccact cgatacaggc agcccatcag tccgggacgg
7620cgtcagcggg agagccgttg taaggcggca gactttgctc atgttaccga tgctattcgg
7680aagaacggca actaagctgc cgggtttgaa acacggatga tctcgcggag ggtagcatgt
7740tgattgtaac gatgacagag cgttgctgcc tgtgatcaaa tatcatctcc ctcgcagaga
7800tccgaattat cagccttctt attcatttct cgcttaaccg tgacaggctg tcgatcttga
7860gaactatgcc gacataatag gaaatcgctg gataaagccg ctgaggaagc tgagtggcgc
7920tatttcttta gaagtgaacg ttgacgatcg tcgaccgtac cccgatgaat taattcggac
7980gtacgttctg aacacagctg gatacttact tgggcgattg tcatacatga catcaacaat
8040gtacccgttt gtgtaaccgt ctcttggagg ttcgtatgac actagtggtt cccctcagct
8100tgcgactaga tgttgaggcc taacatttta ttagagagca ggctagttgc ttagatacat
8160gatcttcagg ccgttatctg tcagggcaag cgaaaattgg ccatttatga cgaccaatgc
8220cccgcagaag ctcccatctt tgccgccata gacgccgcgc cccccttttg gggtgtagaa
8280catccttttg ccagatgtgg aaaagaagtt cgttgtccca ttgttggcaa tgacgtagta
8340gccggcgaaa gtgcgagacc catttgcgct atatataagc ctacgatttc cgttgcgact
8400attgtcgtaa ttggatgaac tattatcgta gttgctctca gagttgtcgt aatttgatgg
8460actattgtcg taattgctta tggagttgtc gtagttgctt ggagaaatgt cgtagttgga
8520tggggagtag tcatagggaa gacgagcttc atccactaaa acaattggca ggtcagcaag
8580tgcctgcccc gatgccatcg caagtacgag gcttagaacc accttcaaca gatcgcgcat
8640agtcttcccc agctctctaa cgcttgagtt aagccgcgcc gcgaagcggc gtcggcttga
8700acgaattgtt agacattatt tgccgactac cttggtgatc tcgcctttca cgtagtgaac
8760aaattcttcc aactgatctg cgcgcgaggc caagcgatct tcttgtccaa gataagcctg
8820cctagcttca agtatgacgg gctgatactg ggccggcagg cgctccattg cccagtcggc
8880agcgacatcc ttcggcgcga ttttgccggt tactgcgctg taccaaatgc gggacaacgt
8940aagcactaca tttcgctcat cgccagccca gtcgggcggc gagttccata gcgttaaggt
9000ttcatttagc gcctcaaata gatcctgttc aggaaccgga tcaaagagtt cctccgccgc
9060tggacctacc aaggcaacgc tatgttctct tgcttttgtc agcaagatag ccagatcaat
9120gtcgatcgtg gctggctcga agatacctgc aagaatgtca ttgcgctgcc attctccaaa
9180ttgcagttcg cgcttagctg gataacgcca cggaatgatg tcgtcgtgca caacaatggt
9240gacttctaca gcgcggagaa tctcgctctc tccaggggaa gccgaagttt ccaaaaggtc
9300gttgatcaaa gctcgccgcg ttgtttcatc aagccttaca gtcaccgtaa ccagcaaatc
9360aatatcactg tgtggcttca ggccgccatc cactgcggag ccgtacaaat gtacggccag
9420caacgtcggt tcgagatggc gctcgatgac gccaactacc tctgatagtt gagtcgatac
9480ttcggcgatc accgcttccc tcatgatgtt taactcctga attaagccgc gccgcgaagc
9540ggtgtcggct tgaatgaatt gttaggcgtc atcctgtgct cccgagaacc agtaccagta
9600catcgctgtt tcgttcgaga cttgaggtct agttttatac gtgaacaggt caatgccgcc
9660gagagtaaag ccacattttg cgtacaaatt gcaggcaggt acattgttcg tttgtgtctc
9720taatcgtatg ccaaggagct gtctgcttag tgcccacttt ttcgcaaatt cgatgagact
9780gtgcgcgact cctttgcctc ggtgcgtgtg cgacacaaca atgtgttcga tagaggctag
9840atcgttccat gttgagttga gttcaatctt cccgacaagc tcttggtcga tgaatgcgcc
9900atagcaagca gagtcttcat cagagtcatc atccgagatg taatccttcc ggtaggggct
9960cacacttctg gtagatagtt caaagccttg gtcggatagg tgcacatcga acacttcacg
10020aacaatgaaa tggttctcag catccaatgt ttccgccacc tgctcaggga tcaccgaaat
10080cttcatatga cgcctaacgc ctggcacagc ggatcgcaaa cctggcgcgg cttttggcac
10140aaaaggcgtg acaggtttgc gaatccgttg ctgccacttg ttaacccttt tgccagattt
10200ggtaactata atttatgtta gaggcgaagt cttgggtaaa aactggccta aaattgctgg
10260ggatttcagg aaagtaaaca tcaccttccg gctcgatgtc tattgtagat atatgtagtg
10320tatctacttg atcgggggat ctgctgcctc gcgcgtttcg gtgatgacgg tgaaaacctc
10380tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc cgggagcaga
10440caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggcgcagc catgacccag
10500tcacgtagcg atagcggagt gtatactggc ttaactatgc ggcatcagag cagattgtac
10560tgagagtgca ccatatgcgg tgtgaaatac cgcacagatg cgtaaggaga aaataccgca
10620tcaggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc
10680gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg
10740caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt
10800tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa
10860gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct
10920ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc
10980cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg
11040tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct
11100tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag
11160cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga
11220agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga
11280agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg
11340gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag
11400aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag
11460ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta aattaaaaat
11520gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt taccaatgct
11580taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata gttgcctgac
11640tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc agtgctgcaa
11700tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac cagccagccg
11760gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag tctattaatt
11820gttgccggga agctagagta agtagttcgc cagttaatag tttgcgcaac gttgttgcca
11880ttgctgcagg gggggggggg ggggggttcc attgttcatt ccacggacaa aaacagagaa
11940aggaaacgac agaggccaaa aagctcgctt tcagcacctg tcgtttcctt tcttttcaga
12000gggtatttta aataaaaaca ttaagttatg acgaagaaga acggaaacgc cttaaaccgg
12060aaaattttca taaatagcga aaacccgcga ggtcgccgcc ccgtaacctg tcggatcacc
12120ggaaaggacc cgtaaagtga taatgattat catctacata tcacaacgtg cgtggaggcc
12180atcaaaccac gtcaaataat caattatgac gcaggtatcg tattaattga tctgcatcaa
12240cttaacgtaa aaacaacttc agacaataca aatcagcgac actgaatacg gggcaacctc
12300atgtcccccc cccccccccc cctgcaggca tcgtggtgtc acgctcgtcg tttggtatgg
12360cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca
12420aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt
12480tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat
12540gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac
12600cgagttgctc ttgcccggcg tcaacacggg ataataccgc gccacatagc agaactttaa
12660aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt
12720tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt
12780tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa
12840gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt
12900atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa
12960taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtctaagaa accattatta
13020tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtctt caagaattcg
13080gagcttttgc cattctcacc ggattcagtc gtcactcatg gtgatttctc acttgataac
13140cttatttttg acgaggggaa attaataggt tgtattgatg ttggacgagt cggaatcgca
13200gaccgatacc aggatcttgc catcctatgg aactgcctcg gtgagttttc tccttcatta
13260cagaaacggc tttttcaaaa atatggtatt gataatcctg atatgaataa attgcagttt
13320catttgatgc tcgatgagtt tttctaatca gaattggtta attggttgta acactggcag
13380agcattacgc tgacttgacg ggacggcggc tttgttgaat aaatcgaact tttgctgagt
13440tgaaggatca gatcacgcat cttcccgaca acgcagaccg ttccgtggca aagcaaaagt
13500tcaaaatcac caactggtcc acctacaaca aagctctcat caaccgtggc tccctcactt
13560tctggctgga tgatggggcg attcaggcct ggtatgagtc agcaacacct tcttcacgag
13620gcagacctca gcgccagaag gccgccagag aggccgagcg cggccgtgag gcttggacgc
13680tagggcaggg catgaaaaag cccgtagcgg gctgctacgg gcgtctgacg cggtggaaag
13740ggggagggga tgttgtctac atggctctgc tgtagtgagt gggttgcgct ccggcagcgg
13800tcctgatcaa tcgtcaccct ttctcggtcc ttcaacgttc ctgacaacga gcctcctttt
13860cgccaatcca tcgacaatca ccgcgagtcc ctgctcgaac gctgcgtccg gaccggcttc
13920gtcgaaggcg tctatcgcgg cccgcaacag cggcgagagc ggagcctgtt caacggtgcc
13980gccgcgctcg ccggcatcgc tgtcgccggc ctgctcctca agcacggccc caacagtgaa
14040gtagctgatt gtcatcagcg cattgacggc gtccccggcc gaaaaacccg cctcgcagag
14100gaagcgaagc tgcgcgtcgg ccgtttccat ctgcggtgcg cccggtcgcg tgccggcatg
14160gatgcgcgcg ccatcgcggt aggcgagcag cgcctgcctg aagctgcggg cattcccgat
14220cagaaatgag cgccagtcgt cgtcggctct cggcaccgaa tgcgtatgat tctccgccag
14280catggcttcg gccagtgcgt cgagcagcgc ccgcttgttc ctgaagtgcc agtaaagcgc
14340cggctgctga acccccaacc gttccgccag tttgcgtgtc gtcagaccgt ctacgccgac
14400ctcgttcaac aggtccaggg cggcacggat cactgtattc ggctgcaact ttgtcatgct
14460tgacacttta tcactgataa acataatatg tccaccaact tatcagtgat aaagaatccg
14520cgcgttcaat cggaccagcg gaggctggtc cggaggccag acgtgaaacc caacataccc
14580ctgatcgtaa ttctgagcac tgtcgcgctc gacgctgtcg gcatcggcct gattatgccg
14640gtgctgccgg gcctcctgcg cgatctggtt cactcgaacg acgtcaccgc ccactatggc
14700attctgctgg cgctgtatgc gttggtgcaa tttgcctgcg cacctgtgct gggcgcgctg
14760tcggatcgtt tcgggcggcg gccaatcttg ctcgtctcgc tggccggcgc cactgtcgac
14820tacgccatca tggcgacagc gcctttcctt tgggttctct atatcgggcg gatcgtggcc
14880ggcatcaccg gggcgactgg ggcggtagcc ggcgcttata ttgccgatat cactgatggc
14940gatgagcgcg cgcggcactt cggcttcatg agcgcctgtt tcgggttcgg gatggtcgcg
15000ggacctgtgc tcggtgggct gatgggcggt ttctcccccc acgctccgtt cttcgccgcg
15060gcagccttga acggcctcaa tttcctgacg ggctgtttcc ttttgccgga gtcgcacaaa
15120ggcgaacgcc ggccgttacg ccgggaggct ctcaacccgc tcgcttcgtt ccggtgggcc
15180cggggcatga ccgtcgtcgc cgccctgatg gcggtcttct tcatcatgca acttgtcgga
15240caggtgccgg ccgcgctttg ggtcattttc ggcgaggatc gctttcactg ggacgcgacc
15300acgatcggca tttcgcttgc cgcatttggc attctgcatt cactcgccca ggcaatgatc
15360accggccctg tagccgcccg gctcggcgaa aggcgggcac tcatgctcgg aatgattgcc
15420gacggcacag gctacatcct gcttgccttc gcgacacggg gatggatggc gttcccgatc
15480atggtcctgc ttgcttcggg tggcatcgga atgccggcgc tgcaagcaat gttgtccagg
15540caggtggatg aggaacgtca ggggcagctg caaggctcac tggcggcgct caccagcctg
15600acctcgatcg tcggacccct cctcttcacg gcgatctatg cggcttctat aacaacgtgg
15660aacgggtggg catggattgc aggcgctgcc ctctacttgc tctgcctgcc ggcgctgcgt
15720cgcgggcttt ggagcggcgc agggcaacga gccgatcgct gatcgtggaa acgataggcc
15780tatgccatgc gggtcaaggc gacttccggc aagctatacg cgccctagga gtgcggttgg
15840aacgttggcc cagccagata ctcccgatca cgagcaggac gccgatgatt tgaagcgcac
15900tcagcgtctg atccaagaac aaccatccta gcaacacggc ggtccccggg ctgagaaagc
15960ccagtaagga aacaactgta ggttcgagtc gcgagatccc ccggaaccaa aggaagtagg
16020ttaaacccgc tccgatcagg ccgagccacg ccaggccgag aacattggtt cctgtaggca
16080tcgggattgg cggatcaaac actaaagcta ctggaacgag cagaagtcct ccggccgcca
16140gttgccaggc ggtaaaggtg agcagaggca cgggaggttg ccacttgcgg gtcagcacgg
16200ttccgaacgc catggaaacc gcccccgcca ggcccgctgc gacgccgaca ggatctagcg
16260ctgcgtttgg tgtcaacacc aacagcgcca cgcccgcagt tccgcaaata gcccccagga
16320ccgccatcaa tcgtatcggg ctacctagca gagcggcaga gatgaacacg accatcagcg
16380gctgcacagc gcctaccgtc gccgcgaccc cgcccggcag gcggtagacc gaaataaaca
16440acaagctcca gaatagcgaa atattaagtg cgccgaggat gaagatgcgc atccaccaga
16500ttcccgttgg aatctgtcgg acgatcatca cgagcaataa acccgccggc aacgcccgca
16560gcagcatacc ggcgacccct cggcctcgct gttcgggctc cacgaaaacg ccggacagat
16620gcgccttgtg agcgtccttg gggccgtcct cctgtttgaa gaccgacagc ccaatgatct
16680cgccgtcgat gtaggcgccg aatgccacgg catctcgcaa ccgttcagcg aacgcctcca
16740tgggcttttt ctcctcgtgc tcgtaaacgg acccgaacat ctctggagct ttcttcaggg
16800ccgacaatcg gatctcgcgg aaatcctgca cgtcggccgc tccaagccgt cgaatctgag
16860ccttaatcac aattgtcaat tttaatcctc tgtttatcgg cagttcgtag agcgcgccgt
16920gcgtcccgag cgatactgag cgaagcaagt gcgtcgagca gtgcccgctt gttcctgaaa
16980tgccagtaaa gcgctggctg ctgaaccccc agccggaact gaccccacaa ggccctagcg
17040tttgcaatgc accaggtcat cattgaccca ggcgtgttcc accaggccgc tgcctcgcaa
17100ctcttcgcag gcttcgccga cctgctcgcg ccacttcttc acgcgggtgg aatccgatcc
17160gcacatgagg cggaaggttt ccagcttgag cgggtacggc tcccggtgcg agctgaaata
17220gtcgaacatc cgtcgggccg tcggcgacag cttgcggtac ttctcccata tgaatttcgt
17280gtagtggtcg ccagcaaaca gcacgacgat ttcctcgtcg atcaggacct ggcaacggga
17340cgttttcttg ccacggtcca ggacgcggaa gcggtgcagc agcgacaccg attccaggtg
17400cccaacgcgg tcggacgtga agcccatcgc cgtcgcctgt aggcgcgaca ggcattcctc
17460ggccttcgtg taataccggc cattgatcga ccagcccagg tcctggcaaa gctcgtagaa
17520cgtgaaggtg atcggctcgc cgataggggt gcgcttcgcg tactccaaca cctgctgcca
17580caccagttcg tcatcgtcgg cccgcagctc gacgccggtg taggtgatct tcacgtcctt
17640gttgacgtgg aaaatgacct tgttttgcag cgcctcgcgc gggattttct tgttgcgcgt
17700ggtgaacagg gcagagcggg ccgtgtcgtt tggcatcgct cgcatcgtgt ccggccacgg
17760cgcaatatcg aacaaggaaa gctgcatttc cttgatctgc tgcttcgtgt gtttcagcaa
17820cgcggcctgc ttggcctcgc tgacctgttt tgccaggtcc tcgccggcgg tttttcgctt
17880cttggtcgtc atagttcctc gcgtgtcgat ggtcatcgac ttcgccaaac ctgccgcctc
17940ctgttcgaga cgacgcgaac gctccacggc ggccgatggc gcgggcaggg cagggggagc
18000cagttgcacg ctgtcgcgct cgatcttggc cgtagcttgc tggaccatcg agccgacgga
18060ctggaaggtt tcgcggggcg cacgcatgac ggtgcggctt gcgatggttt cggcatcctc
18120ggcggaaaac cccgcgtcga tcagttcttg cctgtatgcc ttccggtcaa acgtccgatt
18180cattcaccct ccttgcggga ttgccccgac tcacgccggg gcaatgtgcc cttattcctg
18240atttgacccg cctggtgcct tggtgtccag ataatccacc ttatcggcaa tgaagtcggt
18300cccgtagacc gtctggccgt ccttctcgta cttggtattc cgaatcttgc cctgcacgaa
18360taccagcgac cccttgccca aatacttgcc gtgggcctcg gcctgagagc caaaacactt
18420gatgcggaag aagtcggtgc gctcctgctt gtcgccggca tcgttgcgcc actcttcatt
18480aaccgctata tcgaaaattg cttgcggctt gttagaattg ccatgacgta cctcggtgtc
18540acgggtaaga ttaccgataa actggaactg attatggctc atatcgaaag tctccttgag
18600aaaggagact ctagtttagc taaacattgg ttccgctgtc aagaacttta gcggctaaaa
18660ttttgcgggc cgcgaccaaa ggtgcgaggg gcggcttccg ctgtgtacaa ccagatattt
18720ttcaccaaca tccttcgtct gctcgatgag cggggcatga cgaaacatga gctgtcggag
18780agggcagggg tttcaatttc gtttttatca gacttaacca acggtaaggc caacccctcg
18840ttgaaggtga tggaggccat tgccgacgcc ctggaaactc ccctacctct tctcctggag
18900tccaccgacc ttgaccgcga ggcactcgcg gagattgcgg gtcatccttt caagagcagc
18960gtgccgcccg gatacgaacg catcagtgtg gttttgccgt cacataaggc gtttatcgta
19020aagaaatggg gcgacgacac ccgaaaaaag ctgcgtggaa ggctctgacg ccaagggtta
19080gggcttgcac ttccttcttt agccgctaaa acggcccctt ctctgcgggc cgtcggctcg
19140cgcatcatat cgacatcctc aacggaagcc gtgccgcgaa tggcatcggg cgggtgcgct
19200ttgacagttg ttttctatca gaacccctac gtcgtgcggt tcgattagct gtttgtcttg
19260caggctaaac actttcggta tatcgtttgc ctgtgcgata atgttgctaa tgatttgttg
19320cgtaggggtt actgaaaagt gagcgggaaa gaagagtttc agaccatcaa ggagcgggcc
19380aagcgcaagc tggaacgcga catgggtgcg gacctgttgg ccgcgctcaa cgacccgaaa
19440accgttgaag tcatgctcaa cgcggacggc aaggtgtggc acgaacgcct tggcgagccg
19500atgcggtaca tctgcgacat gcggcccagc cagtcgcagg cgattataga aacggtggcc
19560ggattccacg gcaaagaggt cacgcggcat tcgcccatcc tggaaggcga gttccccttg
19620gatggcagcc gctttgccgg ccaattgccg ccggtcgtgg ccgcgccaac ctttgcgatc
19680cgcaagcgcg cggtcgccat cttcacgctg gaacagtacg tcgaggcggg catcatgacc
19740cgcgagcaat acgaggtcat taaaagcgcc gtcgcggcgc atcgaaacat cctcgtcatt
19800ggcggtactg gctcgggcaa gaccacgctc gtcaacgcga tcatcaatga aatggtcgcc
19860ttcaacccgt ctgagcgcgt cgtcatcatc gaggacaccg gcgaaatcca gtgcgccgca
19920gagaacgccg tccaatacca caccagcatc gacgtctcga tgacgctgct gctcaagaca
19980acgctgcgta tgcgccccga ccgcatcctg gtcggtgagg tacgtggccc cgaagccctt
20040gatctgttga tggcctggaa caccgggcat gaaggaggtg ccgccaccct gcacgcaaac
20100aaccccaaag cgggcctgag ccggctcgcc atgcttatca gcatgcaccc ggattcaccg
20160aaacccattg agccgctgat tggcgaggcg gttcatgtgg tcgtccatat cgccaggacc
20220cctagcggcc gtcgagtgca agaaattctc gaagttcttg gttacgagaa cggccagtac
20280atcaccaaaa ccctgtaagg agtatttcca atgacaacgg ctgttccgtt ccgtctgacc
20340atgaatcgcg gcattttgtt ctaccttgcc gtgttcttcg ttctcgctct cgcgttatcc
20400gcgcatccgg cgatggcctc ggaaggcacc ggcggcagct tgccatatga gagctggctg
20460acgaacctgc gcaactccgt aaccggcccg gtggccttcg cgctgtccat catcggcatc
20520gtcgtcgccg gcggcgtgct gatcttcggc ggcgaactca acgccttctt ccgaaccctg
20580atcttcctgg ttctggtgat ggcgctgctg gtcggcgcgc agaacgtgat gagcaccttc
20640ttcggtcgtg gtgccgaaat cgcggccctc ggcaacgggg cgctgcacca ggtgcaagtc
20700gcggcggcgg atgccgtgcg tgcggtagcg gctggacggc tcgcctaatc atggctctgc
20760gcacgatccc catccgtcgc gcaggcaacc gagaaaacct gttcatgggt ggtgatcgtg
20820aactggtgat gttctcgggc ctgatggcgt ttgcgctgat tttcagcgcc caagagctgc
20880gggccaccgt ggtcggtctg atcctgtggt tcggggcgct ctatgcgttc cgaatcatgg
20940cgaaggccga tccgaagatg cggttcgtgt acctgcgtca ccgccggtac aagccgtatt
21000acccggcccg ctcgaccccg ttccgcgaga acaccaatag ccaagggaag caataccgat
21060gatccaagca attgcgattg caatcgcggg cctcggcgcg cttctgttgt tcatcctctt
21120tgcccgcatc cgcgcggtcg atgccgaact gaaactgaaa aagcatcgtt ccaaggacgc
21180cggcctggcc gatctgctca actacgccgc tgtcgtcgat gacggcgtaa tcgtgggcaa
21240gaacggcagc tttatggctg cctggctgta caagggcgat gacaacgcaa gcagcaccga
21300ccagcagcgc gaagtagtgt ccgcccgcat caaccaggcc ctcgcgggcc tgggaagtgg
21360gtggatgatc catgtggacg ccgtgcggcg tcctgctccg aactacgcgg agcggggcct
21420gtcggcgttc cctgaccgtc tgacggcagc gattgaagaa gagcgctcgg tcttgccttg
21480ctcgtcggtg atgtacttca ccagctccgc gaagtcgctc ttcttgatgg agcgcatggg
21540gacgtgcttg gcaatcacgc gcaccccccg gccgttttag cggctaaaaa agtcatggct
21600ctgccctcgg gcggaccacg cccatcatga ccttgccaag ctcgtcctgc ttctcttcga
21660tcttcgccag cagggcgagg atcgtggcat caccgaaccg cgccgtgcgc gggtcgtcgg
21720tgagccagag tttcagcagg ccgcccaggc ggcccaggtc gccattgatg cgggccagct
21780cgcggacgtg ctcatagtcc acgacgcccg tgattttgta gccctggccg acggccagca
21840ggtaggccga caggctcatg ccggccgccg ccgccttttc ctcaatcgct cttcgttcgt
21900ctggaaggca gtacaccttg ataggtgggc tgcccttcct ggttggcttg gtttcatcag
21960ccatccgctt gccctcatct gttacgccgg cggtagccgg ccagcctcgc agagcaggat
22020tcccgttgag caccgccagg tgcgaataag ggacagtgaa gaaggaacac ccgctcgcgg
22080gtgggcctac ttcacctatc ctgcccggct gacgccgttg gatacaccaa ggaaagtcta
22140cacgaaccct ttggcaaaat cctgtatatc gtgcgaaaaa ggatggatat accgaaaaaa
22200tcgctataat gaccccgaag cagggttatg cagcggaaaa gcgctgcttc cctgctgttt
22260tgtggaatat ctaccgactg gaaacaggca aatgcaggaa attactgaac tgaggggaca
22320ggcgagagac gatgccaaag agctacaccg acgagctggc cgagtgggtt gaatcccgcg
22380cggccaagaa gcgccggcgt gatgaggctg cggttgcgtt cctggcggtg agggcggatg
22440tcgaggcggc gttagcgtcc ggctatgcgc tcgtcaccat ttgggagcac atgcgggaaa
22500cggggaaggt caagttctcc tacgagacgt tccgctcgca cgccaggcgg cacatcaagg
22560ccaagcccgc cgatgtgccc gcaccgcagg ccaaggctgc ggaacccgcg ccggcaccca
22620agacgccgga gccacggcgg ccgaagcagg ggggcaaggc tgaaaagccg gcccccgctg
22680cggccccgac cggcttcacc ttcaacccaa caccggacaa aaaggatcta ctgtaatggc
22740gaaaattcac atggttttgc agggcaaggg cggggtcggc aagtcggcca tcgccgcgat
22800cattgcgcag tacaagatgg acaaggggca gacacccttg tgcatcgaca ccgacccggt
22860gaacgcgacg ttcgagggct acaaggccct gaacgtccgc cggctgaaca tcatggccgg
22920cgacgaaatt aactcgcgca acttcgacac cctggtcgag ctgattgcgc cgaccaagga
22980tgacgtggtg atcgacaacg gtgccagctc gttcgtgcct ctgtcgcatt acctcatcag
23040caaccaggtg ccggctctgc tgcaagaaat ggggcatgag ctggtcatcc ataccgtcgt
23100caccggcggc caggctctcc tggacacggt gagcggcttc gcccagctcg ccagccagtt
23160cccggccgaa gcgcttttcg tggtctggct gaacccgtat tgggggccta tcgagcatga
23220gggcaagagc tttgagcaga tgaaggcgta cacggccaac aaggcccgcg tgtcgtccat
23280catccagatt ccggccctca aggaagaaac ctacggccgc gatttcagcg acatgctgca
23340agagcggctg acgttcgacc aggcgctggc cgatgaatcg ctcacgatca tgacgcggca
23400acgcctcaag atcgtgcggc gcggcctgtt tgaacagctc gacgcggcgg ccgtgctatg
23460agcgaccaga ttgaagagct gatccgggag attgcggcca agcacggcat cgccgtcggc
23520cgcgacgacc cggtgctgat cctgcatacc atcaacgccc ggctcatggc cgacagtgcg
23580gccaagcaag aggaaatcct tgccgcgttc aaggaagagc tggaagggat cgcccatcgt
23640tggggcgagg acgccaaggc caaagcggag cggatgctga acgcggccct ggcggccagc
23700aaggacgcaa tggcgaaggt aatgaaggac agcgccgcgc aggcggccga agcgatccgc
23760agggaaatcg acgacggcct tggccgccag ctcgcggcca aggtcgcgga cgcgcggcgc
23820gtggcgatga tgaacatgat cgccggcggc atggtgttgt tcgcggccgc cctggtggtg
23880tgggcctcgt tatgaatcgc agaggcgcag atgaaaaagc ccggcgttgc cgggctttgt
23940ttttgcgtta gctgggcttg tttgacaggc ccaagctctg actgcgcccg cgctcgcgct
24000cctgggcctg tttcttctcc tgctcctgct tgcgcatcag ggcctggtgc cgtcgggctg
24060cttcacgcat cgaatcccag tcgccggcca gctcgggatg ctccgcgcgc atcttgcgcg
24120tcgccagttc ctcgatcttg ggcgcgtgaa tgcccatgcc ttccttgatt tcgcgcacca
24180tgtccagccg cgtgtgcagg gtctgcaagc gggcttgctg ttgggcctgc tgctgctgcc
24240aggcggcctt tgtacgcggc agggacagca agccgggggc attggactgt agctgctgca
24300aacgcgcctg ctgacggtct acgagctgtt ctaggcggtc ctcgatgcgc tccacctggt
24360catgctttgc ctgcacgtag agcgcaaggg tctgctggta ggtctgctcg atgggcgcgg
24420attctaagag ggcctgctgt tccgtctcgg cctcctgggc cgcctgtagc aaatcctcgc
24480cgctgttgcc gctggactgc tttactgccg gggactgctg ttgccctgct cgcgccgtcg
24540tcgcagttcg gcttgccccc actcgattga ctgcttcatt tcgagccgca gcgatgcgat
24600ctcggattgc gtcaacggac ggggcagcgc ggaggtgtcc ggcttctcct tgggtgagtc
24660ggtcgatgcc atagccaaag gtttccttcc aaaatgcgtc cattgctgga ccgtgtttct
24720cattgatgcc cgcaagcatc ttcggcttga ccgccaggtc aagcgcgcct tcatgggcgg
24780tcatgacgga cgccgccatg accttgccgc cgttgttctc gatgtagccg cgtaatgagg
24840caatggtgcc gcccatcgtc agcgtgtcat cgacaacgat gtacttctgg ccggggatca
24900cctccccctc gaaagtcggg ttgaacgcca ggcgatgatc tgaaccggct ccggttcggg
24960cgaccttctc ccgctgcaca atgtccgttt cgacctcaag gccaaggcgg tcggccagaa
25020cgaccgccat catggccgga atcttgttgt tccccgccgc ctcgacggcg aggactggaa
25080cgatgcgggg cttgtcgtcg ccgatcagcg tcttgagctg ggcaacagtg tcgtccgaaa
25140tcaggcgctc gaccaaatta agcgccgctt ccgcgtcgcc ctgcttcgca gcctggtatt
25200caggctcgtt ggtcaaagaa ccaaggtcgc cgttgcgaac caccttcggg aagtctcccc
25260acggtgcgcg ctcggctctg ctgtagctgc tcaagacgcc tcccttttta gccgctaaaa
25320ctctaacgag tgcgcccgcg actcaacttg acgctttcgg cacttacctg tgccttgcca
25380cttgcgtcat aggtgatgct tttcgcactc ccgatttcag gtactttatc gaaatctgac
25440cgggcgtgca ttacaaagtt cttccccacc tgttggtaaa tgctgccgct atctgcgtgg
25500acgatgctgc cgtcgtggcg ctgcgactta tcggcctttt gggccatata gatgttgtaa
25560atgccaggtt tcagggcccc ggctttatct accttctggt tcgtccatgc gccttggttc
25620tcggtctgga caattctttg cccattcatg accaggaggc ggtgtttcat tgggtgactc
25680ctgacggttg cctctggtgt taaacgtgtc ctggtcgctt gccggctaaa aaaaagccga
25740cctcggcagt tcgaggccgg ctttccctag agccgggcgc gtcaaggttg ttccatctat
25800tttagtgaac tgcgttcgat ttatcagtta ctttcctccc gctttgtgtt tcctcccact
25860cgtttccgcg tctagccgac ccctcaacat agcggcctct tcttgggctg cctttgcctc
25920ttgccgcgct tcgtcacgct cggcttgcac cgtcgtaaag cgctcggcct gcctggccgc
25980ctcttgcgcc gccaacttcc tttgctcctg gtgggcctcg gcgtcggcct gcgccttcgc
26040tttcaccgct gccaactccg tgcgcaaact ctccgcttcg cgcctggtgg cgtcgcgctc
26100gccgcgaagc gcctgcattt cctggttggc cgcgtccagg gtcttgcggc tctcttcttt
26160gaatgcgcgg gcgtcctggt gagcgtagtc cagctcggcg cgcagctcct gcgctcgacg
26220ctccacctcg tcggcccgct gcgtcgccag cgcggcccgc tgctcggctc ctgccagggc
26280ggtgcgtgct tcggccaggg cttgccgctg gcgtgcggcc agctcggccg cctcggcggc
26340ctgctgctct agcaatgtaa cgcgcgcctg ggcttcttcc agctcgcggg cctgcgcctc
26400gaaggcgtcg gccagctccc cgcgcacggc ttccaactcg ttgcgctcac gatcccagcc
26460ggcttgcgct gcctgcaacg attcattggc aagggcctgg gcggcttgcc agagggcggc
26520cacggcctgg ttgccggcct gctgcaccgc gtccggcacc tggactgcca gcggggcggc
26580ctgcgccgtg cgctggcgtc gccattcgcg catgccggcg ctggcgtcgt tcatgttgac
26640gcgggcggcc ttacgcactg catccacggt cgggaagttc tcccggtcgc cttgctcgaa
26700cagctcgtcc gcagccgcaa aaatgcggtc gcgcgtctct ttgttcagtt ccatgttggc
26760tccggtaatt ggtaagaata ataatactct tacctacctt atcagcgcaa gagtttagct
26820gaacagttct cgacttaacg gcaggttttt tagcggctga agggcaggca aaaaaagccc
26880cgcacggtcg gcgggggcaa agggtcagcg ggaaggggat tagcgggcgt cgggcttctt
26940catgcgtcgg ggccgcgctt cttgggatgg agcacgacga agcgcgcacg cgcatcgtcc
27000tcggccctat cggcccgcgt cgcggtcagg aacttgtcgc gcgctaggtc ctccctggtg
27060ggcaccaggg gcatgaactc ggcctgctcg atgtaggtcc actccatgac cgcatcgcag
27120tcgaggccgc gttccttcac cgtctcttgc aggtcgcggt acgcccgctc gttgagcggc
27180tggtaacggg ccaattggtc gtaaatggct gtcggccatg agcggccttt cctgttgagc
27240cagcagccga cgacgaagcc ggcaatgcag gcccctggca caaccaggcc gacgccgggg
27300gcaggggatg gcagcagctc gccaaccagg aaccccgccg cgatgatgcc gatgccggtc
27360aaccagccct tgaaactatc cggccccgaa acacccctgc gcattgcctg gatgctgcgc
27420cggatagctt gcaacatcag gagccgtttc ttttgttcgt cagtcatggt ccgccctcac
27480cagttgttcg tatcggtgtc ggacgaactg aaatcgcaag agctgccggt atcggtccag
27540ccgctgtccg tgtcgctgct gccgaagcac ggcgaggggt ccgcgaacgc cgcagacggc
27600gtatccggcc gcagcgcatc gcccagcatg gccccggtca gcgagccgcc ggccaggtag
27660cccagcatgg tgctgttggt cgccccggcc accagggccg acgtgacgaa atcgccgtca
27720ttccctctgg attgttcgct gctcggcggg gcagtgcgcc gcgccggcgg cgtcgtggat
27780ggctcgggtt ggctggcctg cgacggccgg cgaaaggtgc gcagcagctc gttatcgacc
27840ggctgcggcg tcggggccgc cgccttgcgc tgcggtcggt gttccttctt cggctcgcgc
27900agcttgaaca gcatgatcgc ggaaaccagc agcaacgccg cgcctacgcc tcccgcgatg
27960tagaacagca tcggattcat tcttcggtcc tccttgtagc ggaaccgttg tctgtgcggc
28020gcgggtggcc cgcgccgctg tctttgggga tcagccctcg atgagcgcga ccagtttcac
28080gtcggcaagg ttcgcctcga actcctggcc gtcgtcctcg tacttcaacc aggcatagcc
28140ttccgccggc ggccgacggt tgaggataag gcgggcaggg cgctcgtcgt gctcgacctg
28200gacgatggcc tttttcagct tgtccgggtc cggctccttc gcgccctttt ccttggcgtc
28260cttaccgtcc tggtcgccgt cctcgccgtc ctggccgtcg ccggcctccg cgtcacgctc
28320ggcatcagtc tggccgttga aggcatcgac ggtgttggga tcgcggccct tctcgtccag
28380gaactcgcgc agcagcttga ccgtgccgcg cgtgatttcc tgggtgtcgt cgtcaagcca
28440cgcctcgact tcctccgggc gcttcttgaa ggccgtcacc agctcgttca ccacggtcac
28500gtcgcgcacg cggccggtgt tgaacgcatc ggcgatcttc tccggcaggt ccagcagcgt
28560gacgtgctgg gtgatgaacg ccggcgactt gccgatttcc ttggcgatat cgcctttctt
28620cttgcccttc gccagctcgc ggccaatgaa gtcggcaatt tcgcgcgggg tcagctcgtt
28680gcgttgcagg ttctcgataa cctggtcggc ttcgttgtag tcgttgtcga tgaacgccgg
28740gatggacttc ttgccggccc acttcgagcc acggtagcgg cgggcgccgt gattgatgat
28800atagcggccc ggctgctcct ggttctcgcg caccgaaatg ggtgacttca ccccgcgctc
28860tttgatcgtg gcaccgattt ccgcgatgct ctccggggaa aagccggggt tgtcggccgt
28920ccgcggctga tgcggatctt cgtcgatcag gtccaggtcc agctcgatag ggccggaacc
28980gccctgagac gccgcaggag cgtccaggag gctcgacagg tcgccgatgc tatccaaccc
29040caggccggac ggctgcgccg cgcctgcggc ttcctgagcg gccgcagcgg tgtttttctt
29100ggtggtcttg gcttgagccg cagtcattgg gaaatctcca tcttcgtgaa cacgtaatca
29160gccagggcgc gaacctcttt cgatgccttg cgcgcggccg ttttcttgat cttccagacc
29220ggcacaccgg atgcgagggc atcggcgatg ctgctgcgca ggccaacggt ggccggaatc
29280atcatcttgg ggtacgcggc cagcagctcg gcttggtggc gcgcgtggcg cggattccgc
29340gcatcgacct tgctgggcac catgccaagg aattgcagct tggcgttctt ctggcgcacg
29400ttcgcaatgg tcgtgaccat cttcttgatg ccctggatgc tgtacgcctc aagctcgatg
29460ggggacagca catagtcggc cgcgaagagg gcggccgcca ggccgacgcc aagggtcggg
29520gccgtgtcga tcaggcacac gtcgaagcct tggttcgcca gggccttgat gttcgccccg
29580aacagctcgc gggcgtcgtc cagcgacagc cgttcggcgt tcgccagtac cgggttggac
29640tcgatgaggg cgaggcgcgc ggcctggccg tcgccggctg cgggtgcggt ttcggtccag
29700ccgccggcag ggacagcgcc gaacagcttg cttgcatgca ggccggtagc aaagtccttg
29760agcgtgtagg acgcattgcc ctgggggtcc aggtcgatca cggcaacccg caagccgcgc
29820tcgaaaaagt cgaaggcaag atgcacaagg gtcgaagtct tgccgacgcc gcctttctgg
29880ttggccgtga ccaaagtttt catcgtttgg tttcctgttt tttcttggcg tccgcttccc
29940acttccggac gatgtacgcc tgatgttccg gcagaaccgc cgttacccgc gcgtacccct
30000cgggcaagtt cttgtcctcg aacgcggccc acacgcgatg caccgcttgc gacactgcgc
30060ccctggtcag tcccagcgac gttgcgaacg tcgcctgtgg cttcccatcg actaagacgc
30120cccgcgctat ctcgatggtc tgctgcccca cttccagccc ctggatcgcc tcctggaact
30180ggctttcggt aagccgtttc ttcatggata acacccataa tttgctccgc gccttggttg
30240aacatagcgg tgacagccgc cagcacatga gagaagttta gctaaacatt tctcgcacgt
30300caacaccttt agccgctaaa actcgtcctt ggcgtaacaa aacaaaagcc cggaaaccgg
30360gctttcgtct cttgccgctt atggctctgc acccggctcc atcaccaaca ggtcgcgcac
30420gcgcttcact cggttgcgga tcgacactgc cagcccaaca aagccggttg ccgccgccgc
30480caggatcgcg ccgatgatgc cggccacacc ggccatcgcc caccaggtcg ccgccttccg
30540gttccattcc tgctggtact gcttcgcaat gctggacctc ggctcaccat aggctgaccg
30600ctcgatggcg tatgccgctt ctccccttgg cgtaaaaccc agcgccgcag gcggcattgc
30660catgctgccc gccgctttcc cgaccacgac gcgcgcacca ggcttgcggt ccagaccttc
30720ggccacggcg agctgcgcaa ggacataatc agccgccgac ttggctccac gcgcctcgat
30780cagctcttgc actcgcgcga aatccttggc ctccacggcc gccatgaatc gcgcacgcgg
30840cgaaggctcc gcagggccgg cgtcgtgatc gccgccgaga atgcccttca ccaagttcga
30900cgacacgaaa atcatgctga cggctatcac catcatgcag acggatcgca cgaacccgct
30960gaattgaaca cgagcacggc acccgcgacc actatgccaa gaatgcccaa ggtaaaaatt
31020gccggccccg ccatgaagtc cgtgaatgcc ccgacggccg aagtgaaggg caggccgcca
31080cccaggccgc cgccctcact gcccggcacc tggtcgctga atgtcgatgc cagcacctgc
31140ggcacgtcaa tgcttccggg cgtcgcgctc gggctgatcg cccatcccgt tactgccccg
31200atcccggcaa tggcaaggac tgccagcgct gccatttttg gggtgaggcc gttcgcggcc
31260gaggggcgca gcccctgggg ggatgggagg cccgcgttag cgggccggga gggttcgaga
31320agggggggca ccccccttcg gcgtgcgcgg tcacgcgcac agggcgcagc cctggttaaa
31380aacaaggttt ataaatattg gtttaaaagc aggttaaaag acaggttagc ggtggccgaa
31440aaacgggcgg aaacccttgc aaatgctgga ttttctgcct gtggacagcc cctcaaatgt
31500caataggtgc gcccctcatc tgtcagcact ctgcccctca agtgtcaagg atcgcgcccc
31560tcatctgtca gtagtcgcgc ccctcaagtg tcaataccgc agggcactta tccccaggct
31620tgtccacatc atctgtggga aactcgcgta aaatcaggcg ttttcgccga tttgcgaggc
31680tggccagctc cacgtcgccg gccgaaatcg agcctgcccc tcatctgtca acgccgcgcc
31740gggtgagtcg gcccctcaag tgtcaacgtc cgcccctcat ctgtcagtga gggccaagtt
31800ttccgcgagg tatccacaac gccggcggcc gcggtgtctc gcacacggct tcgacggcgt
31860ttctggcgcg tttgcagggc catagacggc cgccagccca gcggcgaggg caaccagccc
31920ggtgagcgtc ggaaaggcgc tggaagcccc gtagcgacgc ggagaggggc gagacaagcc
31980aagggcgcag gctcgatgcg cagcacgaca tagccggttc tcgcaaggac gagaatttcc
32040ctgcggtgcc cctcaagtgt caatgaaagt ttccaacgcg agccattcgc gagagccttg
32100agtccacgct agatgagagc tttgttgtag gtggaccagt tggtgatttt gaacttttgc
32160tttgccacgg aacggtctgc gttgtcggga agatgcgtga tctgatcctt caactcagca
32220aaagttcgat ttattcaaca aagccacgtt gtgtctcaaa atctctgatg ttacattgca
32280caagataaaa atatatcatc atgaacaata aaactgtctg cttacataaa cagtaataca
32340aggggtgtta tgagccatat tcaacgggaa acgtcttgct cgactctaga gctcgttcct
32400cgaggcctcg aggcctcgag gaacggtacc tgcggggaag cttacaataa tgtgtgttgt
32460taagtcttgt tgcctgtcat cgtctgactg actttcgtca taaatcccgg cctccgtaac
32520ccagctttgg gcaagctcac ggatttgatc cggcggaacg ggaatatcga gatgccgggc
32580tgaacgctgc agttccagct ttccctttcg ggacaggtac tccagctgat tgattatctg
32640ctgaagggtc ttggttccac ctcctggcac aatgcgaatg attacttgag cgcgatcggg
32700catccaattt tctcccgtca ggtgcgtggt caagtgctac aaggcacctt tcagtaacga
32760gcgaccgtcg atccgtcgcc gggatacgga caaaatggag cgcagtagtc catcgagggc
32820ggcgaaagcc tcgccaaaag caatacgttc atctcgcaca gcctccagat ccgatcgagg
32880gtcttcggcg taggcagata gaagcatgga tacattgctt gagagtattc cgatggactg
32940aagtatggct tccatctttt ctcgtgtgtc tgcatctatt tcgagaaagc ccccgatgcg
33000gcgcaccgca acgcgaattg ccatactatc cgaaagtccc agcaggcgcg cttgatagga
33060aaaggtttca tactcggccg atcgcagacg ggcactcacg accttgaacc cttcaacttt
33120cagggatcga tgctggttga tggtagtctc actcgacgtg gctctggtgt gttttgacat
33180agcttcctcc aaagaaagcg gaaggtctgg atactccagc acgaaatgtg cccgggtaga
33240cggatggaag tctagccctg ctcaatatga aatcaacagt acatttacag tcaatactga
33300atatacttgc tacatttgca attgtcttat aacgaatgtg aaataaaaat agtgtaacaa
33360cgcttttact catcgataat cacaaaaaca tttatacgaa caaaaataca aatgcactcc
33420ggtttcacag gataggcggg atcagaatat gcaacttttg acgttttgtt ctttcaaagg
33480gggtgctggc aaaaccaccg cactcatggg cctttgcgct gctttggcaa atgacggtaa
33540acgagtggcc ctctttgatg ccgacgaaaa ccggcctctg acgcgatgga gagaaaacgc
33600cttacaaagc agtactggga tcctcgctgt gaagtctatt ccgccgacga aatgcccctt
33660cttgaagcag cctatgaaaa tgccgagctc gaaggatttg attatgcgtt ggccgatacg
33720cgtggcggct cgagcgagct caacaacaca atcatcgcta gctcaaacct gcttctgatc
33780cccaccatgc taacgccgct cgacatcgat gaggcactat ctacctaccg ctacgtcatc
33840gagctgctgt tgagtgaaaa tttggcaatt cctacagctg ttttgcgcca acgcgtcccg
33900gtcggccgat tgacaacatc gcaacgcagg atgtcagaga cgctagagag ccttccagtt
33960gtaccgtctc ccatgcatga aagagatgca tttgccgcga tgaaagaacg cggcatgttg
34020catcttacat tactaaacac gggaactgat ccgacgatgc gcctcataga gaggaatctt
34080cggattgcga tggaggaagt cgtggtcatt tcgaaactga tcagcaaaat cttggaggct
34140tgaagatggc aattcgcaag cccgcattgt cggtcggcga agcacggcgg cttgctggtg
34200ctcgacccga gatccaccat cccaacccga cacttgttcc ccagaagctg gacctccagc
34260acttgcctga aaaagccgac gagaaagacc agcaacgtga gcctctcgtc gccgatcaca
34320tttacagtcc cgatcgacaa cttaagctaa ctgtggatgc ccttagtcca cctccgtccc
34380cgaaaaagct ccaggttttt ctttcagcgc gaccgcccgc gcctcaagtg tcgaaaacat
34440atgacaacct cgttcggcaa tacagtccct cgaagtcgct acaaatgatt ttaaggcgcg
34500cgttggacga tttcgaaagc atgctggcag atggatcatt tcgcgtggcc ccgaaaagtt
34560atccgatccc ttcaactaca gaaaaatccg ttctcgttca gacctcacgc atgttcccgg
34620ttgcgttgct cgaggtcgct cgaagtcatt ttgatccgtt ggggttggag accgctcgag
34680ctttcggcca caagctggct accgccgcgc tcgcgtcatt ctttgctgga gagaagccat
34740cgagcaattg gtgaagaggg acctatcgga acccctcacc aaatattgag tgtaggtttg
34800aggccgctgg ccgcgtcctc agtcaccttt tgagccagat aattaagagc caaatgcaat
34860tggctcaggc tgccatcgtc cccccgtgcg aaacctgcac gtccgcgtca aagaaataac
34920cggcacctct tgctgttttt atcagttgag ggcttgacgg atccgcctca agtttgcggc
34980gcagccgcaa aatgagaaca tctatactcc tgtcgtaaac ctcctcgtcg cgtactcgac
35040tggcaatgag aagttgctcg cgcgatagaa cgtcgcgggg tttctctaaa aacgcgagga
35100gaagattgaa ctcacctgcc gtaagtttca cctcaccgcc agcttcggac atcaagcgac
35160gttgcctgag attaagtgtc cagtcagtaa aacaaaaaga ccgtcggtct ttggagcgga
35220caacgttggg gcgcacgcgc aaggcaaccc gaatgcgtgc aagaaactct ctcgtactaa
35280acggcttagc gataaaatca cttgctccta gctcgagtgc aacaacttta tccgtctcct
35340caaggcggtc gccactgata attatgattg gaatatcaga ctttgccgcc agatttcgaa
35400cgatctcaag cccatcttca cgacctaaat ttagatcaac aaccacgaca tcgaccgtcg
35460cggaagagag tactctagtg aactgggtgc tgtcggctac cgcggtcact ttgaaggcgt
35520ggatcgtaag gtattcgata ataagatgcc gcatagcgac atcgtcatcg ataagaagaa
35580cgtgtttcaa cggctcacct ttcaatctaa aatctgaacc cttgttcaca gcgcttgaga
35640aattttcacg tgaaggatgt acaatcatct ccagctaaat gggcagttcg tcagaattgc
35700ggctgaccgc ggatgacgaa aatgcgaacc aagtatttca attttatgac aaaagttctc
35760aatcgttgtt acaagtgaaa cgcttcgagg ttacagctac tattgattaa ggagatcgcc
35820tatggtctcg ccccggcgtc gtgcgtccgc cgcgagccag atctcgccta cttcataaac
35880gtcctcatag gcacggaatg gaatgatgac atcgatcgcc gtagagagca tgtcaatcag
35940tgtgcgatct tccaagctag caccttgggc gctacttttg acaagggaaa acagtttctt
36000gaatccttgg attggattcg cgccgtgtat tgttgaaatc gatcccggat gtcccgagac
36060gacttcactc agataagccc atgctgcatc gtcgcgcatc tcgccaagca atatccggtc
36120cggccgcata cgcagacttg cttggagcaa gtgctcggcg ctcacagcac ccagcccagc
36180accgttcttg gagtagagta gtctaacatg attatcgtgt ggaatgacga gttcgagcgt
36240atcttctatg gtgattagcc tttcctgggg ggggatggcg ctgatcaagg tcttgctcat
36300tgttgtcttg ccgcttccgg tagggccaca tagcaacatc gtcagtcggc tgacgacgca
36360tgcgtgcaga aacgcttcca aatccccgtt gtcaaaatgc tgaaggatag cttcatcatc
36420ctgattttgg cgtttccttc gtgtctgcca ctggttccac ctcgaagcat cataacggga
36480ggagacttct ttaagaccag aaacacgcga gcttggccgt cgaatggtca agctgacggt
36540gcccgaggga acggtcggcg gcagacagat ttgtagtcgt tcaccaccag gaagttcagt
36600ggcgcagagg gggttacgtg gtccgacatc ctgctttctc agcgcgcccg ctaaaatagc
36660gatatcttca agatcatcat aagagacggg caaaggcatc ttggtaaaaa tgccggcttg
36720gcgcacaaat gcctctccag gtcgattgat cgcaatttct tcagtcttcg ggtcatcgag
36780ccattccaaa atcggcttca gaagaaagcg tagttgcgga tccacttcca tttacaatgt
36840atcctatctc taagcggaaa tttgaattca ttaagagcgg cggttcctcc cccgcgtggc
36900gccgccagtc aggcggagct ggtaaacacc aaagaaatcg aggtcccgtg ctacgaaaat
36960ggaaacggtg tcaccctgat tcttcttcag ggttggcggt atgttgatgg ttgccttaag
37020ggctgtctca gttgtctgct caccgttatt ttgaaagctg ttgaagctca tcccgccacc
37080cgagctgccg gcgtaggtgc tagctgcctg gaaggcgcct tgaacaacac tcaagagcat
37140agctccgcta aaacgctgcc agaagtggct gtcgaccgag cccggcaatc ctgagcgacc
37200gagttcgtcc gcgcttggcg atgttaacga gatcatcgca tggtcaggtg tctcggcgcg
37260atcccacaac acaaaaacgc gcccatctcc ctgttgcaag ccacgctgta tttcgccaac
37320aacggtggtg ccacgatcaa gaagcacgat attgttcgtt gttccacgaa tatcctgagg
37380caagacacac tttacatagc ctgccaaatt tgtgtcgatt gcggtttgca agatgcacgg
37440aattattgtc ccttgcgtta ccataaaatc ggggtgcggc aagagcgtgg cgctgctggg
37500ctgcagctcg gtgggtttca tacgtatcga caaatcgttc tcgccggaca cttcgccatt
37560cggcaaggag ttgtcgtcac gcttgccttc ttgtcttcgg cccgtgtcgc cctgaatggc
37620gcgtttgctg accccttgat cgccgctgct atatgcaaaa atcggtgttt cttccggccg
37680tggctcatgc cgctccggtt cgcccctcgg cggtagagga gcagcaggct gaacagcctc
37740ttgaaccgct ggaggatccg gcggcacctc aatcggagct ggatgaaatg gcttggtgtt
37800tgttgcgatc aaagttgacg gcgatgcgtt ctcattcacc ttcttttggc gcccacctag
37860ccaaatgagg cttaatgata acgcgagaac gacacctccg acgatcaatt tctgagaccc
37920cgaaagacgc cggcgatgtt tgtcggagac cagggatcca gatgcatcaa cctcatgtgc
37980cgcttgctga ctatcgttat tcatcccttc gcccccttca ggacgcgttt cacatcgggc
38040ctcaccgtgc ccgtttgcgg cctttggcca acgggatcgt aagcggtgtt ccagatacat
38100agtactgtgt ggccatccct cagacgccaa cctcgggaaa ccgaagaaat ctcgacatcg
38160ctccctttaa ctgaatagtt ggcaacagct tccttgccat caggattgat ggtgtagatg
38220gagggtatgc gtacattgcc cggaaagtgg aataccgtcg taaatccatt gtcgaagact
38280tcgagtggca acagcgaacg atcgccttgg gcgacgtagt gccaattact gtccgccgca
38340ccaagggctg tgacaggctg atccaataaa ttctcagctt tccgttgata ttgtgcttcc
38400gcgtgtagtc tgtccacaac agccttctgt tgtgcctccc ttcgccgagc cgccgcatcg
38460tcggcggggt aggcgaattg gacgctgtaa tagagatcgg gctgctcttt atcgaggtgg
38520gacagagtct tggaacttat actgaaaaca taacggcgca tcccggagtc gcttgcggtt
38580agcacgatta ctggctgagg cgtgaggacc tggcttgcct tgaaaaatag ataatttccc
38640cgcggtaggg ctgctagatc tttgctattt gaaacggcaa ccgctgtcac cgtttcgttc
38700gtggcgaatg ttacgaccaa agtagctcca accgccgtcg agaggcgcac cacttgatcg
38760ggattgtaag ccaaataacg catgcgcgga tctagcttgc ccgccattgg agtgtcttca
38820gcctccgcac cagtcgcagc ggcaaataaa catgctaaaa tgaaaagtgc ttttctgatc
38880atggttcgct gtggcctacg tttgaaacgg tatcttccga tgtctgatag gaggtgacaa
38940ccagacctgc cgggttggtt agtctcaatc tgccgggcaa gctggtcacc ttttcgtagc
39000gaactgtcgc ggtccacgta ctcaccacag gcattttgcc gtcaacgacg agggtccttt
39060tatagcgaat ttgctgcgtg cttggagtta catcatttga agcgatgtgc tcgacctcca
39120ccctgccgcg tttgccaaga atgacttgag gcgaactggg attgggatag ttgaagaatt
39180gctggtaatc ctggcgcact gttggggcac tgaagttcga taccaggtcg taggcgtact
39240gagcggtgtc ggcatcataa ctctcgcgca ggcgaacgta ctcccacaat gaggcgttaa
39300cgacggcctc ctcttgagtt gcaggcaatc gcgagacaga cacctcgctg tcaacggtgc
39360cgtccggccg tatccataga tatacgggca caagcctgct caacggcacc attgtggcta
39420tagcgaacgc ttgagcaaca tttcccaaaa tcgcgatagc tgcgacagct gcaatgagtt
39480tggagagacg tcgcgccgat ttcgctcgcg cggtttgaaa ggcttctact tccttatagt
39540gctcggcaag gctttcgcgc gccactagca tggcatattc aggccccgtc atagcgtcca
39600cccgaattgc cgagctgaag atctgacgga gtaggctgcc atcgccccac attcagcggg
39660aagatcgggc ctttgcagct cgctaatgtg tcgtttgtct ggcagccgct caaagcgaca
39720actaggcaca gcaggcaata cttcatagaa ttctccattg aggcgaattt ttgcgcgacc
39780tagcctcgct caacctgagc gaagcgacgg tacaagctgc tggcagattg ggttgcgccg
39840ctccagtaac tgcctccaat gttgccggcg atcgccggca aagcgacaat gagcgcatcc
39900cctgtcagaa aaaacatatc gagttcgtaa agaccaatga tcttggccgc ggtcgtaccg
39960gcgaaggtga ttacaccaag cataagggtg agcgcagtcg cttcggttag gatgacgatc
40020gttgccacga ggtttaagag gagaagcaag agaccgtagg tgataagttg cccgatccac
40080ttagctgcga tgtcccgcgt gcgatcaaaa atatatccga cgaggatcag aggcccgatc
40140gcgagaagca ctttcgtgag aattccaacg gcgtcgtaaa ctccgaaggc agaccagagc
40200gtgccgtaaa ggacccactg tgccccttgg aaagcaagga tgtcctggtc gttcatcgga
40260ccgatttcgg atgcgatttt ctgaaaaacg gcctgggtca cggcgaacat tgtatccaac
40320tgtgccggaa cagtctgcag aggcaagccg gttacactaa actgctgaac aaagtttggg
40380accgtctttt cgaagatgga aaccacatag tcttggtagt tagcctgccc aacaattaga
40440gcaacaacga tggtgaccgt gatcacccga gtgataccgc tacgggtatc gacttcgccg
40500cgtatgacta aaataccctg aacaataatc caaagagtga cacaggcgat caatggcgca
40560ctcaccgcct cctggatagt ctcaagcatc gagtccaagc ctgtcgtgaa ggctacatcg
40620aagatcgtat gaatggccgt aaacggcgcc ggaatcgtga aattcatcga ttggacctga
40680acttgactgg tttgtcgcat aatgttggat aaaatgagct cgcattcggc gaggatgcgg
40740gcggatgaac aaatcgccca gccttagggg agggcaccaa agatgacagc ggtcttttga
40800tgctccttgc gttgagcggc cgcctcttcc gcctcgtgaa ggccggcctg cgcggtagtc
40860atcgttaata ggcttgtcgc ctgtacattt tgaatcattg cgtcatggat ctgcttgaga
40920agcaaaccat tggtcacggt tgcctgcatg atattgcgag atcgggaaag ctgagcagac
40980gtatcagcat tcgccgtcaa gcgtttgtcc atcgtttcca gattgtcagc cgcaatgcca
41040gcgctgtttg cggaaccggt gatctgcgat cgcaacaggt ccgcttcagc atcactaccc
41100acgactgcac gatctgtatc gctggtgatc gcacgtgccg tggtcgacat tggcattcgc
41160ggcgaaaaca tttcattgtc taggtccttc gtcgaaggat actgattttt ctggttgagc
41220gaagtcagta gtccagtaac gccgtaggcc gacgtcaaca tcgtaaccat cgctatagtc
41280tgagtgagat tctccgcagt cgcgagcgca gtcgcgagcg tctcagcctc cgttgccggg
41340tcgctaacaa caaactgcgc ccgcgcgggc tgaatatata gaaagctgca ggtcaaaact
41400gttgcaataa gttgcgtcgt cttcatcgtt tcctacctta tcaatcttct gcctcgtggt
41460gacgggccat gaattcgctg agccagccag atgagttgcc ttcttgtgcc tcgcgtagtc
41520gagttgcaaa gcgcaccgtg ttggcacgcc ccgaaagcac ggcgacatat tcacgcatat
41580cccgcagatc aaattcgcag atgacgcttc cactttctcg tttaagaaga aacttacggc
41640tgccgaccgt catgtcttca cggatcgcct gaaattcctt ttcggtacat ttcagtccat
41700cgacataagc cgatcgatct gcggttggtg atggatagaa aatcttcgtc atacattgcg
41760caaccaagct ggctcctagc ggcgattcca gaacatgctc tggttgctgc gttgccagta
41820ttagcatccc gttgtttttt cgaacggtca ggaggaattt gtcgacgaca gtcgaaaatt
41880tagggtttaa caaataggcg cgaaactcat cgcagctcat cacaaaacgg cggccgtcga
41940tcatggctcc aatccgatgc aggagatatg ctgcagcggg agcgcatact tcctcgtatt
42000cgagaagatg cgtcatgtcg aagccggtaa tcgacggatc taactttact tcgtcaactt
42060cgccgtcaaa tgcccagcca agcgcatggc cccggcacca gcgttggagc cgcgctcctg
42120cgccttcggc gggcccatgc aacaaaaatt cacgtaaccc cgcgattgaa cgcatttgtg
42180gatcaaacga gagctgacga tggataccac ggaccagacg gcggttctct tccggagaaa
42240tcccaccccg accatcactc tcgatgagag ccacgatcca ttcgcgcaga aaatcgtgtg
42300aggctgctgt gttttctagg ccacgcaacg gcgccaaccc gctgggtgtg cctctgtgaa
42360gtgccaaata tgttcctcct gtggcgcgaa ccagcaattc gccaccccgg tccttgtcaa
42420agaacacgac cgtacctgca cggtcgacca tgctctgttc gagcatggct agaacaaaca
42480tcatgagcgt cgtcttaccc ctcccgatag gcccgaatat tgccgtcatg ccaacatcgt
42540gctcatgcgg gatatagtcg aaaggcgttc cgccattggt acgaaatcgg gcaatcgcgt
42600tgccccagtg gcctgagctg gcgccctctg gaaagttttc gaaagagaca aaccctgcga
42660aattgcgtga agtgattgcg ccagggcgtg tgcgccactt aaaattcccc ggcaattggg
42720accaataggc cgcttccata ccaatacctt cttggacaac cacggcacct gcatccgcca
42780ttcgtgtccg agcccgcgcg cccctgtccc caagactatt gagatcgtct gcatagacgc
42840aaaggctcaa atgatgtgag cccataacga attcgttgct cgcaagtgcg tcctcagcct
42900cggataattt gccgatttga gtcacggctt tatcgccgga actcagcatc tggctcgatt
42960tgaggctaag tttcgcgtgc gcttgcgggc gagtcaggaa cgaaaaactc tgcgtgagaa
43020caagtggaaa atcgagggat agcagcgcgt tgagcatgcc cggccgtgtt tttgcagggt
43080attcgcgaaa cgaatagatg gatccaacgt aactgtcttt tggcgttctg atctcgagtc
43140ctcgcttgcc gcaaatgact ctgtcggtat aaatcgaagc gccgagtgag ccgctgacga
43200ccggaaccgg tgtgaaccga ccagtcatga tcaaccgtag cgcttcgcca atttcggtga
43260agagcacacc ctgcttctcg cggatgccaa gacgatgcag gccatacgct ttaagagagc
43320cagcgacaac atgccaaaga tcttccatgt tcctgatctg gcccgtgaga tcgttttccc
43380tttttccgct tagcttggtg aacctcctct ttaccttccc taaagccgcc tgtgggtaga
43440caatcaacgt aaggaagtgt tcattgcgga ggagttggcc ggagagcacg cgctgttcaa
43500aagcttcgtt caggctagcg gcgaaaacac tacggaagtg tcgcggcgcc gatgatggca
43560cgtcggcatg acgtacgagg tgagcatata ttgacacatg atcatcagcg atattgcgca
43620acagcgtgtt gaacgcacga caacgcgcat tgcgcatttc agtttcctca agctcgaatg
43680caacgccatc aattctcgca atggtcatga tcgatccgtc ttcaagaagg acgatatggt
43740cgctgaggtg gccaatataa gggagataga tctcaccgga tctttcggtc gttccactcg
43800cgccgagcat cacaccattc ctctccctcg tgggggaacc ctaattggat ttgggctaac
43860agtagcgccc ccccaaactg cactatcaat gcttcttccc gcggtccgca aaaatagcag
43920gacgacgctc gccgcattgt agtctcgctc cacgatgagc cgggctgcaa accataacgg
43980cacgagaacg acttcgtaga gcgggttctg aacgataacg atgacaaagc cggcgaacat
44040catgaataac cctgccaatg tcagtggcac cccaagaaac aatgcgggcc gtgtggctgc
44100gaggtaaagg gtcgattctt ccaaacgatc agccatcaac taccgccagt gagcgtttgg
44160ccgaggaagc tcgccccaaa catgataaca atgccgccga cgacgccggc aaccagccca
44220agcgaagccc gcccgaacat ccaggagatc ccgatagcga caatgccgag aacagcgagt
44280gactggccga acggaccaag gataaacgtg catatattgt taaccattgt ggcggggtca
44340gtgccgccac ccgcagattg cgctgcggcg ggtccggatg aggaaatgct ccatgcaatt
44400gcaccgcaca agcttggggc gcagctcgat atcacgcgca tcatcgcatt cgagagcgag
44460aggcgattta gatgtaaacg gtatctctca aagcatcgca tcaatgcgca cctccttagt
44520ataagtcgaa taagacttga ttgtcgtctg cggatttgcc gttgtcctgg tgtggcggtg
44580gcggagcgat taaaccgcca gcgccatcct cctgcgagcg gcgctgatat gacccccaaa
44640catcccacgt ctcttcggat tttagcgcct cgtgatcgtc ttttggaggc tcgattaacg
44700cgggcaccag cgattgagca gctgtttcaa cttttcgcac gtagccgttt gcaaaaccgc
44760cgatgaaatt accggtgttg taagcggaga tcgcccgacg aagcgcaaat tgcttctcgt
44820caatcgtttc gccgcctgca taacgacttt tcagcatgtt tgcagcggca gataatgatg
44880tgcacgcctg gagcgcaccg tcaggtgtca gaccgagcat agaaaaattt cgagagttta
44940tttgcatgag gccaacatcc agcgaatgcc gtgcatcgag acggtgcctg acgacttggg
45000ttgcttggct gtgatcttgc cagtgaagcg tttcgccggt cgtgttgtca tgaatcgcta
45060aaggatcaaa gcgactctcc accttagcta tcgccgcaag cgtagatgtc gcaactgatg
45120gggcacactt gcgagcaaca tggtcaaact cagcagatga gagtggcgtg gcaaggctcg
45180acgaacagaa ggagaccatc aaggcaagag aaagcgaccc cgatctctta agcatacctt
45240atctccttag ctcgcaacta acaccgcctc tcccgttgga agaagtgcgt tgttttatgt
45300tgaagattat cgggagggtc ggttactcga aaattttcaa ttgcttcttt atgatttcaa
45360ttgaagcgag aaacctcgcc cggcgtcttg gaacgcaaca tggaccgaga accgcgcatc
45420catgactaag caaccggatc gacctattca ggccgcagtt ggtcaggtca ggctcagaac
45480gaaaatgctc ggcgaggtta cgctgtctgt aaacccattc gatgaacggg aagcttcctt
45540ccgattgctc ttggcaggaa tattggccca tgcctgcttg cgctttgcaa atgctcttat
45600cgcgttggta tcatatgcct tgtccgccag cagaaacgca ctctaagcga ttatttgtaa
45660aaatgtttcg gtcatgcggc ggtcatgggc ttgacccgct gtcagcgcaa gacggatcgg
45720tcaaccgtcg gcatcgacaa cagcgtgaat cttggtggtc aaaccgccac gggaacgtcc
45780catacagcca tcgtcttgat cccgctgttt cccgtcgccg catgttggtg gacgcggaca
45840caggaactgt caatcatgac gacattctat cgaaagcctt ggaaatcaca ctcagaatat
45900gatcccagac gtctgcctca cgccatcgta caaagcgatt gtagcaggtt gtacaggaac
45960cgtatcgatc aggaacgtct gcccagggcg ggcccgtccg gaagcgccac aagatgacat
46020tgatcacccg cgtcaacgcg cggcacgcga cgcggcttat ttgggaacaa aggactgaac
46080aacagtccat tcgaaatcgg tgacatcaaa gcggggacgg gttatcagtg gcctccaagt
46140caagcctcaa tgaatcaaaa tcagaccgat ttgcaaacct gatttatgag tgtgcggcct
46200aaatgatgaa atcgtccttc tagatcgcct ccgtggtgta gcaacacctc gcagtatcgc
46260cgtgctgacc ttggccaggg aattgactgg caagggtgct ttcacatgac cgctcttttg
46320gccgcgatag atgatttcgt tgctgctttg ggcacgtaga aggagagaag tcatatcgga
46380gaaattcctc ctggcgcgag agcctgctct atcgcgacgg catcccactg tcgggaacag
46440accggatcat tcacgaggcg aaagtcgtca acacatgcgt tataggcatc ttcccttgaa
46500ggatgatctt gttgctgcca atctggaggt gcggcagccg caggcagatg cgatctcagc
46560gcaacttgcg gcaaaacatc tcactcacct gaaaaccact agcgagtctc gcgatcagac
46620gaaggccttt tacttaacga cacaatatcc gatgtctgca tcacaggcgt cgctatccca
46680gtcaatacta aagcggtgca ggaactaaag attactgatg acttaggcgt gccacgaggc
46740ctgagacgac gcgcgtagac agttttttga aatcattatc aaagtgatgg cctccgctga
46800agcctatcac ctctgcgccg gtctgtcgga gagatgggca agcattatta cggtcttcgc
46860gcccgtacat gcattggacg attgcagggt caatggatct gagatcatcc agaggattgc
46920cgcccttacc ttccgtttcg agttggagcc agcccctaaa tgagacgaca tagtcgactt
46980gatgtgacaa tgccaagaga gagatttgct taacccgatt tttttgctca agcgtaagcc
47040tattgaagct tgccggcatg acgtccgcgc cgaaagaata tcctacaagt aaaacattct
47100gcacaccgaa atgcttggtg tagacatcga ttatgtgacc aagatcctta gcagtttcgc
47160ttggggaccg ctccgaccag aaataccgaa gtgaactgac gccaatgaca ggaatccctt
47220ccgtctgcag ataggtacca tcgatagatc tgctgcctcg cgcgtttcgg tgatgacggt
47280gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta agcggatgcc
47340gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg gggcgcagcc
47400atgacccagt cacgtagcga tagcggagtg tatactggct taactatgcg gcatcagagc
47460agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc gtaaggagaa
47520aataccgcat caggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc
47580ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag
47640gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa
47700aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc
47760gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc
47820ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg
47880cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtt
47940cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc
48000gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc
48060cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag
48120agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg
48180ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa
48240ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag
48300gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact
48360cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa
48420attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt
48480accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag
48540ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca
48600gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc
48660agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt
48720ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg
48780ttgttgccat tgctgcaggg gggggggggg ggggggactt ccattgttca ttccacggac
48840aaaaacagag aaaggaaacg acagaggcca aaaagcctcg ctttcagcac ctgtcgtttc
48900ctttcttttc agagggtatt ttaaataaaa acattaagtt atgacgaaga agaacggaaa
48960cgccttaaac cggaaaattt tcataaatag cgaaaacccg cgaggtcgcc gccccgtaac
49020ctgtcggatc accggaaagg acccgtaaag tgataatgat tatcatctac atatcacaac
49080gtgcgtggag gccatcaaac cacgtcaaat aatcaattat gacgcaggta tcgtattaat
49140tgatctgcat caacttaacg taaaaacaac ttcagacaat acaaatcagc gacactgaat
49200acggggcaac ctcatgtccc cccccccccc ccccctgcag gcatcgtggt gtcacgctcg
49260tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc
49320cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag
49380ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg
49440ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag
49500tgtatgcggc gaccgagttg ctcttgcccg gcgtcaacac gggataatac cgcgccacat
49560agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg
49620atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca
49680gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca
49740aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat
49800tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag
49860aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgtctaa
49920gaaaccatta ttatcatgac attaacctat aaaaataggc gtatcacgag gccctttcgt
49980cttcaagaat tggtcgacga tcttgctgcg ttcggatatt ttcgtggagt tcccgccaca
50040gacccggatt gaaggcgaga tccagcaact cgcgccagat catcctgtga cggaactttg
50100gcgcgtgatg actggccagg acgtcggccg aaagagcgac aagcagatca cgcttttcga
50160cagcgtcgga tttgcgatcg aggatttttc ggcgctgcgc tacgtccgcg accgcgttga
50220gggatcaagc cacagcagcc cactcgacct tctagccgac ccagacgagc caagggatct
50280ttttggaatg ctgctccgtc gtcaggcttt ccgacgtttg ggtggttgaa cagaagtcat
50340tatcgtacgg aatgccaagc actcccgagg ggaaccctgt ggttggcatg cacatacaaa
50400tggacgaacg gataaacctt ttcacgccct tttaaatatc cgttattcta ataaacgctc
50460ttttctctta ggtttacccg ccaatatatc ctgtcaaaca ctgatagttt aaactgaagg
50520cgggaaacga caatctgatc atgagcggag aattaaggga gtcacgttat gacccccgcc
50580gatgacgcgg gacaagccgt tttacgtttg gaactgacag aaccgcaacg ttgaaggagc
50640cactcagcaa gctggtacga ttgtaatacg actcactata gggcgaattg agcgctgttt
50700aaacgctctt caactggaag agcggttact accggttaag tgactagggt c
50751720DNAArtificial SequenceForward primer for GUS expression
7cggaagcaac gcgtaaactc
20821DNAArtificial Sequencereverse primer for GUS expression 8tgtgagcgtc
gcagaacatt a
21920DNAArtificial SequenceProbe sequence 9cgcgtccgat cacctgcgtc
201023DNAArtificial SequenceTERM
2.1F primer 10ctgtcagttc caaacgtaaa acg
231125DNAArtificial SequenceTERM 2.1R primer 11aatctgatca
tgagcggaga attaa
251220DNAArtificial SequenceTERM 1F primer 12tcccgggtcc ttaggaagac
201321DNAArtificial SequenceTERM
1R primer 13tggattcagc aggcctagaa g
211418DNAArtificial SequenceTERM 1P-probe 14tcctcaggat ttaaatgg
181520DNAArtificial
SequenceActin_Fwd primer 15cttcgaatgc ccagcaatgt
201621DNAArtificial SequenceActin _rev primer
16gttcgcccac tagcgtacaa c
211715DNAArtificial SequenceActin _probe 17tcgaggctgt tcttt
1518460DNASorghum bicolor
18aactatctat actgtaataa tgttgtatag ccgccggata gctagctagt tagtcattca
60gcggcgatgg gtaataataa agtgtcatcc atccatcacc atgggtggca acgtgagcaa
120tgacctgatt gaacaaattg aaatgaaaag aagaaatatg ttatatgtca acgagatttc
180ctcataatgc cactgacaac gtgtgtccaa gaaatgtatc agtgatacgt atattcacaa
240tttttttatg acttatactc acaatttgtt tttttactac ttatactcga acaatttgtt
300gtgggtacca taacaatttc gatcgaatat atatcagaaa gttgacgaaa gtaagctcac
360tcaaaaagtt aaatgggctg cggaagctgc gtcaggccca agttttggct attctatccg
420gtatccacga ttttgatggc tgagggacat atgttcggct
46019461DNAArtificial SequenceConsensus Sequence for FIG.
6A-6Cmisc_feature(52)..(52)n is a, c, g, or tmisc_feature(290)..(291)n is
a, c, g, or tmisc_feature(459)..(460)n is a, c, g, or t 19aactatctat
actgtaataa tgttgtatag ccgccggata gctagctagt tnagtcattc 60agcggcgatg
ggtaataata aagtgtcatc catccatcac catgggtggc aacgtgagca 120atgacctgat
tgaacaaatt gaaatgaaaa gaagaaatat gttatatgtc aacgagattt 180cctcataatg
ccactgacaa cgtgtgtcca agaaatgtat cagtgatacg tatattcaca 240atttttttat
gacttatact cacaatttgt ttttttacta cttatactcn nacaatttgt 300tgtgggtacc
ataacaattt cgatcgaata tatatcagaa agttgacgaa agtaagctca 360ctcaaaaagt
taaatgggct gcggaagctg cgtcaggccc aagttttggc tattctatcc 420ggtatccacg
attttgatgg ctgagggaca tatgttcgnn t 461
User Contributions:
Comment about this patent or add new information about this topic: