Patent application title: Light-driven energy generation using proteorhodopsin
Inventors:
Edward F. Delong (Cabmbridge, MA, US)
Oded Beja (Misgav, IL)
IPC8 Class: AC07H2104FI
USPC Class:
536 231
Class name: Nitrogen containing n-glycosides, polymers thereof, metal derivatives (e.g., nucleic acids, oligonucleotides, etc.) dna or rna fragments or modified forms thereof (e.g., genes, etc.)
Publication date: 2009-05-07
Patent application number: 20090118483
Claims:
1. A light-driven energy generator, comprising:a) a light source;b) a host
with a correctly folded, integrated proteorhodopsin protein;c) a source
of retinal to bind covalently to said integrated proteorhodopsin protein,
thereby creating a light absorbing pigment, wherein illumination of said
light absorbing pigment with said light source results in conversion of
light energy to biochemical energy; andd) a mediator, wherein said
biochemical energy is harnessed by said mediator to produce said
light-driven energy.
2. The light-driven energy generator as set forth in claim 1, wherein said host is an artificial membrane system.
3. The light-driven energy generator as set forth in claim 1, wherein said host is a cell membrane obtained from a bacterium.
4. The light-driven energy generator as set forth in claim 1, wherein said host is a cell membrane preparation obtained from a bacterium.
5. The light-driven energy generator as set forth in claim 1, wherein said host is a cell membrane obtained from a eukaryote.
6. The light-driven energy generator as set forth in claim 1, wherein said host is a cell membrane preparation obtained from a eukaryote.
7. The light-driven energy generator as set forth in claim 1, wherein said light source is a fast-pulsed light source.
8. The light-driven energy generator as set forth in claim 7, wherein said fast-pulsed light source comprises a mechanism for delivering intermittent fast-light pulses at predetermined time intervals.
9. The light-driven energy generator as set forth in claim 1, wherein said light source exhibits different predetermined wavelengths.
10. The light-driven energy generator as set forth in claim 1, wherein said light-driven energy is in the form of mechanical, chemical, or electrical energy.
11. The light-driven energy generator as set forth in claim 1, wherein said retinal is a chemical derivative of a naturally-occurring retinal.
12. The light-driven energy generator as set forth in claim 1, wherein said proteorhodopsin has a photocycle that decays in 15 ms.
13. The light-driven energy generator as set forth in claim 1, wherein said proteorhodopsin protein is selected to determine an absorption spectra of said light absorbing pigment.
14. The light-driven energy generator as set forth in claim 13, wherein said light absorbing pigment has an absorption spectra with a maximum between 490 nm and 527 nm.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001]This application is a divisional of U.S. patent application Ser. No. 09/847,513, filed May 1, 2001, which claims priority from U.S. Provisional application 60/201,602 filed May 3, 2000, both of which are hereby incorporated by reference.
SEQUENCE LISTING
[0003]This application includes a sequence listing submitted on a compact disc. The contents of this compact disc are incorporated by reference herein.
FIELD OF THE INVENTION
[0004]The present invention relates generally to gene expression of functional recombinant proteins in bacteria. More particularly, the present invention relates to proteorhodopsin genes and proteins that function as a light-driven energy generator in Escherichia coli (E. coli) and other bacteria.
BACKGROUND ART
[0005]Retinal (vitamin A aldehyde) is a chromophore that binds integral membrane proteins (opsins) to form light-absorbing pigments called rhodopsins. Rhodopsins are currently known to belong to two distinct protein families. The visual rhodopsins, found in the eye throughout the animal kingdom, are photosensory pigments. Archeal rhodopsins, found in extreme halophilic environments, function as light-driven protons pumps (bacteriorhodopsins), chloride ion pumps (halorhodopsins), or photosensory receptors (sensory rhodopsins). The two protein families show no significant sequence similarity and may have different origins. They do, however, share identical topologies characterized by seven transmembrane α-helices that form a pocket in which retinal is covalently linked, as a pronated Schiff base (helix G).
[0006]The archaeal rhodopsins are able to generate a photocycle which produces a chemiosmotic membrane potential in response to light, as such light energy is converted into biochemical energy. Recently, a protein with high sequence similarity to the archaeal rhodopsins has also been retrieved in the eukaryote Neurospora crassa (J. A. Bieszke et al., Proceedings of National Academy of Sciences USA 96:8034, 1999). The eucaryal rhodopsin formed a photochemically reactive pigment when bound to all-trans retinal and exhibited photocycle kinetics similar to those of archaeal sensory rhodopsins (J. A. Bieszke et al., Biochemistry 38:14138, 1999). To date, however, no rhodopsin-like sequences have been reported in members of the domain Bacteria, and no light-driven proton pumps based on rhodopsin have ever before been functionally expressed in E. coli.
[0007]The phototropic conversion of light energy into biochemical energy using bacteriorhodopsin can be harnessed for a variety of processes and applications, such as bio-electronic applications and bio-materials, as has been reported in U.S. Pat. No. 5,757,525 for optical devices, U.S. Pat. No. 5,854,710 for optical Fourier processing, and U.S. Pat. No. 5,470,690 for optical information storage. Bacteriorhodopsin in bio-electronic applications is aimed to interface, integrate, or substitute the silicon based microelectronics systems as well as molecular devices. Bacteriorhodopsin as a bio-material is integrated, for instance, in optical films for light mediated computer memory applications and pattern recognition.
[0008]Previously, archaeal rhodopsins capable of generating a chemiosmotic membrane potential in response to light had only been found in halophilic archaea. Therefore, rhodopsins that originate from archaea adapted to highly saline environments cannot be functionally expressed in E. coli. Finally, the isolation and cultivation of halorhodopsins is an elaborate process. At present one does not foresee an economic utilization possible for this process (e.g. U.S. Pat. No. 5,290,699).
[0009]Accordingly, as one skilled in the art might readily acknowledge, there is a strong need to retrieve and provide rhodopsin-like sequences from naturally occurring members of the domain Bacteria.
OBJECTS AND ADVANTAGES
[0010]In light of the above, it is the primary objective of the present invention to provide rhodopsin-like sequences from naturally occurring members of the domain Bacteria. More specifically, it is the objective of the present invention to provide a method to retrieve proteorhodopsin genes from DNA of naturally occurring bacteria that encodes DNA sequence for proteorhodopsin proteins.
[0011]It is another objective of the present invention to provide proteorhodopsin-specific polymerase chain reaction primers that amplify the proteorhodopsin-containing gene from a DNA sample of naturally occurring bacteria.
[0012]It is yet another objective of the present invention to produce variants of a proteorhodopsin gene using the same proteorhodopsin-specific polymerase chain reaction primers by amplifying a proteorhodopsin containing gene from of a mixed sample of naturally occurring bacteria.
[0013]It is still another objective of the present invention to provide an expression vector that produces a proteorhodopsin protein in E. coli and other bacteria.
[0014]It is another objective of the present invention to provide a light-driven energy generator in which the functional properties of proteorhodopsin are utilized. These properties include the ability to integrate within a host, for instance a cell membrane of E. coli, making an integrated proteorhodopsin protein, and the ability to bind retinal, making a light absorbing pigment.
[0015]It is another objective of the present invention to provide a light source and illuminate the light absorbing pigment to convert light energy into biochemical energy.
[0016]It is another objective of the present invention to provide a mediator and mediate the biochemical energy into electrical energy.
[0017]It is another objective of the present invention to provide methods to manipulate the kinetics of the light-driven energy generator.
[0018]The advantage of the present invention over the prior art is that it is not restricted to operate in halophilic archaea and could therefore be functionally expressed in E. coli and other bacteria.
[0019]Accordingly, another advantage of the present invention is that it provides for a fast and cheap production method that allows for mass production of functionally active proteorhodopsin.
SUMMARY
[0020]The present invention provides proteorhodopsin gene and protein sequences retrieved from samples of naturally occurring members of the domain Bacteria. More specifically, the present invention provides a method for the retrieval and amplification of proteorhodopsin genes from DNA samples of naturally occurring marine bacteria. In accordance with several exemplary embodiments of the present invention, DNA samples were obtained from naturally occurring bacteria such as, for instance, marine proteobacteria, SAR86 bacteria, or recombinant DNA libraries containing naturally occurring bacteria. The present invention provides proteorhodopsin-specific polymerase chain reaction (PCR) primers to amplify a proteorhodopsin gene from DNA samples of these marine bacteria. The present invention also provides a device and method for the placement of proteorhodopsin genes in an expression vector to produce functional proteorhodopsin proteins in E. coli and other bacteria.
[0021]Accordingly, the present invention provides a method to produce and obtain variants of proteorhodopsin genes and proteins. The same proteorhodopsin-specific polymerase chain reaction primers amplify different variants of proteorhodopsin-containing genes from a mixed sample of naturally occurring bacteria. As one skilled in the art might readily acknowledge, these variants of a proteorhodopsin gene produce functional variations in the photocycle kinetics of the proteorhodopsin protein.
[0022]Furthermore, the present invention provides a light-driven energy generator that utilizes proteorhodopsin to convert light-energy into biochemical energy. This light-driven energy generator takes advantage of the functional properties of the proteorhodopsin protein once expressed in, for example, E. coli or other bacteria as is described in exemplary embodiments. These properties include the ability to integrate within a host such as, for instance, a cell membrane of E. coli or other Bacteria, and thereby making an integrated proteorhodopsin protein or integrated cell membrane protein. These properties also include the ability to bind retinal and thereby making a light absorbing pigment. Illuminating the light absorbing pigment with a light source converts light energy into biochemical energy. Finally, the biochemical energy can be mediated into electrical energy by a mediator.
[0023]In accordance with exemplary embodiments, the present invention enables one skilled in the art to manipulate the kinetics of the proteorhodopsin protein photocycle once it is operational in the light-driven energy generator. In particular, the present invention provides examples in which the light source characteristics are manipulated. Examples are the manipulation of the delivery of fast-light pulses and/or the delivery of light at different wavelengths. The present invention also provides examples in which incremental additions of retinal influences the function of the light-driven energy generator. In addition, a proteorhodopsin gene or protein variant can be selected to determine an absorption spectra of the light absorbing pigment to change the kinetics of the light energy generator, for instance to meet a design/functional criteria of an application wherein proteorhodopsin is utilized.
BRIEF DESCRIPTION OF THE FIGURES
[0024]The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee. The objectives and advantages of the present invention will be understood by reading the following detailed description in conjunction with the drawings, in which:
[0025]FIG. 1 illustrates the phylogenetic tree of bacterial 16S rRNA gene sequences including that encoded on the 130 kb bacterioplankton BAC clone (EBAC31A8).
[0026]FIG. 2 provides a nucleotide sequence of polymerase chain reaction primer 1 (Sequence ID No:2) used to amplify a proteorhodopsin gene.
[0027]FIG. 3 provides a nucleotide sequence of polymerase chain reaction primer 2 (Sequence ID No:3) used to amplify a proteorhodopsin gene.
[0028]FIG. 4 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:4) amplified from clone EBAC31A8 (Sequence ID No:1) using PCR primers 1 (Sequence ID No:2) and 2 (Sequence ID No:3), and the deduced amino acid sequence (Sequence ID No:5) of the proteorhodopsin gene Sequence ID No:4 amplified from clone EBAC31A8 (Sequence ID No:1).
[0029]FIG. 5 provides a map of the secondary structure of the proteorhodopsin protein (Sequence ID No:7). Single letter amino acid codes are used (according to J. Sasaki and J. L. Spudich, Biophys. J. 75:2435, 1998). Predicted retinal binding pocket residues are marked in black.
[0030]FIG. 6 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:8) amplified from clone EBAC40E8 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:9) of the proteorhodopsin gene Sequence ID No:8 amplified from clone EBAC40E8.
[0031]FIG. 7 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:10) amplified from clone EBAC41B4 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:11) of the proteorhodopsin gene Sequence ID No:7 amplified from clone EBAC41B4.
[0032]FIG. 8 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:12) amplified from clone EBAC64A5 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:13) of the proteorhodopsin gene Sequence ID No:12 amplified from clone EBAC64A5.
[0033]FIG. 9 provides a variants map of the DNA sequences of the proteorhodopsin gene with Sequence ID No:4, Sequence ID No:8, Sequence ID No:10, and Sequence ID No:12 that were amplified from clone EBAC38A8, EBAC40E8, EBAC41B4 and EBAC64A5 respectively using the proteorhodopsin-specific PCR primer 1 (Sequence ID No:2) and 2 (Sequence ID No:3). Dots represent sequences having identical sequence as those in Sequence ID No:4.
[0034]FIG. 10 provides a variant map of the deduced amino acid sequences encoded by the proteorhodopsin gene with Sequence ID No:4, Sequence ID No:8, Sequence ID No:10, and Sequence ID No:12 that were amplified from respectively EBAC38A8, EBAC40E8, EBAC41B4 and EBAC64A5 using the proteorhodopsin-specific primer 1 (Sequence ID No:2) and 2 (Sequence ID No:3). Lower case represents the PCR primer sequence region. Dots represent residues having identical sequence as those in Sequence ID No:5.
[0035]FIG. 11 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:14) amplified from clone HOT0m1 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:15) of the proteorhodopsin gene Sequence ID No:14 amplified from clone HOT0m1.
[0036]FIG. 12 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:16) amplified from clone HOT75m1 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:17) of the proteorhodopsin gene Sequence ID No:16 amplified from clone HOT75m1.
[0037]FIG. 13 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:18) amplified from clone HOT75m3 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:19) of the proteorhodopsin gene Sequence ID No:18 amplified from clone HOT75m3.
[0038]FIG. 14 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:20) amplified from clone HOT75m4 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:21) of the proteorhodopsin gene Sequence ID No:20 amplified from clone HOT75m4.
[0039]FIG. 15 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:22) amplified from clone HOT75m8 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:23) of the proteorhodopsin gene Sequence ID No:22 amplified from clone HOT75m8.
[0040]FIG. 16 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:24) amplified from clone MB0m1 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:25) of the proteorhodopsin gene Sequence ID No:24 amplified from clone MB0m1.
[0041]FIG. 17 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:26) amplified from clone MB0m2 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:27) of the proteorhodopsin gene Sequence ID No:26 amplified from clone MB0m2.
[0042]FIG. 18 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:28) amplified from clone MB20m2 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:29) of the proteorhodopsin gene Sequence ID No:28 amplified from clone MB20m2.
[0043]FIG. 19 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:30) amplified from clone MB20m5 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:31) of the proteorhodopsin gene Sequence ID No:30 amplified from clone MB20m5.
[0044]FIG. 20 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:32) amplified from clone MB20 ml2 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:33) of the proteorhodopsin gene Sequence ID No:32 amplified from clone MB20m12.
[0045]FIG. 21 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:34) amplified from clone MB40 ml using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:35) of the proteorhodopsin gene Sequence ID No:34 amplified from clone MB40m1.
[0046]FIG. 22 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:36) amplified from clone MB40m5 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:37) of the proteorhodopsin gene Sequence ID No:36 amplified from clone MB40m5.
[0047]FIG. 23 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:38) amplified from clone MB40m12 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:39) of the proteorhodopsin gene Sequence ID No:38 amplified from clone MB40m12.
[0048]FIG. 24 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:40) amplified from clone MB100m5 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:41) of the proteorhodopsin gene Sequence ID No:40 amplified from clone MB100m5.
[0049]FIG. 25 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:42) amplified from clone MB100m7 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:43) of the proteorhodopsin gene Sequence ID No:42 amplified from clone MB100m7.
[0050]FIG. 26 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:44) amplified from clone MB100m9 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:45) of the proteorhodopsin gene Sequence ID No:44 amplified from clone MB100m9.
[0051]FIG. 27 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:46) amplified from clone MB100m10 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:47) of the proteorhodopsin gene Sequence ID No:46 amplified from clone MB100m10.
[0052]FIG. 28 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:48) amplified from clone PALB1 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:49) of the proteorhodopsin gene Sequence ID No:48 amplified from clone PALB1.
[0053]FIG. 29 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:50) amplified from clone PALB2 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:51) of the proteorhodopsin gene Sequence ID No:50 amplified from clone PALB2.
[0054]FIG. 30 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:52) amplified from clone PALB5 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:53) of the proteorhodopsin gene Sequence ID No:52 amplified from clone PALB5.
[0055]FIG. 31 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:54) amplified from clone PALB7 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:55) of the proteorhodopsin gene Sequence ID No:54 amplified from clone PALB7.
[0056]FIG. 32 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:56) amplified from clone PALB6 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:57) of the proteorhodopsin gene Sequence ID No:56 amplified from clone PALB6.
[0057]FIG. 33 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:58) amplified from clone PALB8 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:59) of the proteorhodopsin gene Sequence ID No:58 amplified from clone PALB8.
[0058]FIG. 34 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:60) amplified from clone PALE1 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:61) of the proteorhodopsin gene Sequence ID No:60 amplified from clone PALE1.
[0059]FIG. 35 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:62) amplified from clone PALE6 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:63) of the proteorhodopsin gene Sequence ID No:62 amplified from clone PALE6.
[0060]FIG. 36 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:64) amplified from clone PALE7 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:65) of the proteorhodopsin gene Sequence ID No:64 amplified from PALE7.
[0061]FIG. 37 illustrates a phylogenetic tree of different proteorhodopsin genes.
[0062]FIG. 38 provides an example of an alignment of proteorhodopsin amino acid sequences.
[0063]FIG. 39 provides a light-driven energy generator that utilizes proteorhodopsin.
[0064]FIG. 40 provides an example of a proteorhodopsin-expressing E. Coli cell suspension (+) compared to control cells (-), both with all-trans retinal.
[0065]FIG. 41 provides an example of absorption spectra of retinal-constituted proteorhodopsin in E. coli membranes and a negative control.
[0066]FIG. 42 provides an example of a light-driven transport of protons by a proteorhodopsin-expressing E. coli cell suspension.
[0067]FIG. 43 provides an example of a transport of [3H]TPP.sup.+ in E. coli right-side-out vesicles containing expressed proteorhodopsin, reconstituted with or without 10 μM retinal in the presence of light or in the dark.
[0068]FIG. 44 provides an example of laser flash-induced absorbance changes in suspensions of E. coli membranes containing proteorhodopsin.
[0069]FIG. 45 provides an example of absorption spectra of retinal-constituted proteorhodopsin in E. coli membranes.
DETAILED DESCRIPTION
[0070]Although the following detailed description contains many specifics for the purposes of illustration, anyone of ordinary skill in the art will appreciate that many variations and alterations to the following details are within the scope of the invention. Accordingly, the following preferred embodiment of the invention is set forth without any loss of generality to, and without imposing limitations upon, the claimed invention.
Proteorhodopsin
[0071]The present invention provides rhodopsin-like gene and protein sequences retrieved from naturally occurring members of the domain Bacteria. More specifically, the present invention provides a method for the retrieval and amplification of proteorhodopsin genes from DNA samples of naturally occurring marine bacteria. In accordance with exemplary embodiments of the present invention, DNA samples were obtained from naturally occurring marine bacteria such as bacteria from the SAR86 group. Provided as an exemplary embodiment of the SAR86 group, DNA samples were obtained from a bacterioplankton Bacterial Artificial Chromosome (BAC) clone BAC31A8 (also referred to as EBAC31A08). In general, as will be appreciated by those of ordinary skill in the art, suitable DNA samples can also be obtained from other sources, e.g., from a marine environment or from a recombinant DNA library containing genomic fragments of samples of naturally occurring bacteria.
[0072]FIG. 1 shows the phylogenetic tree of bacterial 16S rRNA gene sequences including that encoded on the EBAC31A8. FIG. 1 also shows the relationship of EBAC31A8 to the SAR86 bacteria group as well as to the gamma-proteobacteria group. A subclone shotgun library was constructed from BAC clone 31A8, and subclones were sequenced in both directions on the MegaBACE 1000 capillary array electrophoresis DNA sequencing instrument (Molecular Dynamics, Sunnyvale, Calif.). Sequence analysis of a 130-kb genomic DNA that encodes the ribosomal RNA operon from BAC31A8, reveals an open reading frame encoding a proteorhodopsin. In an exemplary embodiment, the contiguous sequence was assembled using SEQUENCHER 3.1.1 software (Gene Codes Co., Ann Arbor, Mich.). Other sequencing techniques can also be used, as will be recognized by those skilled in the art. The sequence of the proteorhodopsin-containing contig has been deposited in GenBank under accession #AF279106 and deposit date Oct. 23rd, 2000. Appendix A, hereby incorporated, shows the nucleotide sequence of the BAC clone BAC31A8 (Sequence ID No:1) which contains the 130 kilobases genomic DNA from a naturally occurring marine bacterium.
[0073]Proteorhodopsin was amplified from the 130 kilobase bacterioplankton BAC clone 31A8 (Sequence ID No:1) by polymerase chain reaction (PCR), using the proteorhodopsin-specific primers 5'-aCCATGGgtaaattattactgatattagg-3' (Sequence ID No:2 and shown in FIG. 2) and 5'-agcattagaagattctttaacagc-3' (Sequence ID No:3 and shown in FIG. 3). References for PCR are, for instance, The Polymerase Chain Reaction, Mullis et al., Ed. (Birkhauser, Boston, 1994) and U.S. Pat. Nos. 4,683,195 and 4,683,202 to Mullis et al. The proteorhodopsin-specific PCR primers include the addition of 3 nucleotides that encoded one amino acid not found in the native gene sequence of clone BAC31A8 (Sequence ID No:6), in the second amino acid position which is a glycine located on the 2nd codon ("GGT"). Therefore, compare the second amino acid position in the Sequence ID No:5 using PCR primers 1 and 2 with the native Sequence ID no:7. This addition of one non-native amino acid created a new restriction endonuclease site (NcoI site) not present in the native sequence. This allowed subcloning of the amplified fragment into the NcoI restriction site of an expression vector pBAD TOPO TA Cloning® Kit (Invitrogen, La Jolla, Calif.). The present invention is not limited to the use of this type of expression vector and other expression vectors could also be used.
[0074]FIG. 4 shows the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:4) that results from amplification of the proteorhodopsin-containing DNA in BAC31A8 using proteorhodopsin-specific PCR primers Sequence ID No:2 and Sequence No:3. FIG. 4 also shows the deduced amino acid sequences (Sequence ID No:5) encoded by the proteorhodopsin gene (Sequence ID No:4).
[0075]FIG. 5 shows an exemplary embodiment of a secondary structure of proteorhodopsin after it has been folded in a cell membrane 510 and bonded with retinal 520. FIG. 5 shows the native proteorhodopsin gene (Sequence ID No:6) obtained from clone BAC31A8 and encodes a proteorhodopsin protein of 249 amino acids with a molecular weight of 27 kD (Sequence ID No:7). In FIG. 5, 530 indicates seven transmembrane domains, a typical feature of the rhodopsin protein family, that aligned well with the corresponding helices of the archaeal rhodopsins. FIG. 5 also shows the amino acid residues that form a retinal binding pocket indicated by 520. Although the proteorhodopsin proteins shown in FIGS. 4 and 5 both originate from BAC31A8, they differ with respect to the second amino acid position. The reason is that the proteorhodopsin-specific PCR primers that were used to amplify the proteorhodopsin gene from BAC31A8 (which resulted in proteorhodopsin protein as in FIG. 4; Sequence ID No:5) included the addition of 3 nucleotides. These 3 nucleotides encoded one amino acid not found in the native gene sequence (Sequence ID No:6), in the second amino acid position which is a glycine located on the 2nd codon ("GGT"). Proteorhodopsin protein (Sequence ID No:7) as shown in FIG. 5 originates from the native gene sequence without the addition of the 3 nucleotides. As mentioned above, the addition of the 3 nucleotides created a new restriction endonuclease site (NcoI site) that was not present in the native sequence and thereby allowed the amplified fragment to be subcloned into the NcoI site of the expression vector.
[0076]In the exemplary embodiment presented above, PCR primers with Sequence ID No:2 and Sequence ID No:3 were used. In general, the present invention provides a method for designing different proteorhodopsin-specific PCR primers that are all capable of amplifying a proteorhodopsin gene from DNA samples of naturally occurring microbial populations by polymerase chain reaction. In designing these primers one first needs to determine a DNA sequence of a proteorhodopsin gene. Then one can design oligodeoxynucleotide primers with a Watson-Crick base pair complementary to 5' and 3' ends of the proteorhodopsin gene.
Variants of Proteorhodopsin
[0077]In the previous section, an exemplary embodiment is provided of a proteorhodopsin gene and protein. The present invention also provides the retrieval of genetic variations of proteorhodopsin from naturally occurring genetic variations in naturally occurring bacterial populations. These genetic variations in proteorhodopsin sequences result in functional variations in the proteorhodopsin proteins as is discussed below.
[0078]The present invention enables one skilled in the art to use the same proteorhodopsin-specific PCR primers as shown in FIGS. 2 and 3 to successfully amplify different sequence variants from DNA originating from mixed naturally occurring bacterial populations when it is compared to for instance the proteorhodopsin gene as shown in FIG. 4. As mentioned above, different proteorhodopsin-specific PCR primers could be used to amplify genetic variants of proteorhodopsin.
[0079]FIGS. 6-8 show exemplary embodiments of three different and unique variants of the proteorhodopsin gene that were retrieved from a recombinant DNA library of other naturally occurring bacteria (i.e. the bacterial artificial chromosome library (BAC)). In general, genetic variants could be obtained from different DNA libraries containing naturally occurring bacteria as well as from samples of naturally occurring bacteria. FIG. 6 shows the variant of the proteorhodopsin gene sequence (Sequence ID No:8) that is amplified from the BAC clone 40 (BAC40E8) with the same proteorhodopsin-specific PCR primers as provided in Sequence ID No:2 and 3. Accordingly, FIG. 6 also shows the deduced amino acid sequence (Sequence ID No:9) of the genetic variant of proteorhodopsin shown in FIG. 6. FIG. 7 shows the variant of the proteorhodopsin gene sequence (Sequence ID No:10) that is amplified from the BAC clone 41 (BAC41B4) with the same proteorhodopsin-specific PCR primers as provided in Sequence ID No:2 and 3. Accordingly, FIG. 7 also shows the deduced amino acid sequence (Sequence ID No:11) of the genetic variant of proteorhodopsin shown in FIG. 7. FIG. 8 shows the variant of the proteorhodopsin gene sequence (Sequence ID No:12) that is amplified from the BAC clone 64 (BAC64A5) with the same proteorhodopsin-specific PCR primers as provided in Sequence ID No:2 and 3. Accordingly, FIG. 8 also shows the deduced amino acid sequence (Sequence ID No:13) of the genetic variant of proteorhodopsin shown in FIG. 8.
[0080]FIG. 9 provides a variants map of the nucleotide sequences of the proteorhodopsin gene Sequence ID No:4, Sequence ID No:8, Sequence ID No:10, and Sequence ID No:12 amplified from respectively BAC31A8, BAC40E8, BAC41B4 and BAC64A5 using the proteorhodopsin-specific PCR primers Sequence ID No:2 and Sequence ID No:3. In FIG. 9 lower case letters represent the PCR primer sequence region. Dots represent residues having identical sequence as those in Sequence ID No:4. These proteorhodopsin gene sequences differ by as much as 31 nucleotides as is shown in FIG. 10. FIG. 10 provides a variant map of the deduced amino acid sequences of the proteorhodopsin genes shown in FIG. 9.
[0081]Using the same proteorhodopsin-specific PCR primers, as for instance shown in FIGS. 2 and 3, proteorhodopsin genes were also amplified from bacterioplankton extracts. As mentioned above, any proteorhodopsin-specific PCR primer can be used. These bacterioplankton extracts include those from the Monterey Bay (referred to as MB clones), the Southern Ocean (Palmer Station, referred to as PAL clones), and waters of the central North Pacific Ocean (Hawaii Ocean Time series station, referred to as HOT clones).
[0082]FIGS. 11-36 show exemplary embodiments of different and unique variants of proteorhodopsin that were retrieved from the MB clones, PAL clones, and HOT clones. FIGS. 11-36 each show a variant of a proteorhodopsin gene sequence that is amplified with the same proteorhodopsin-specific PCR primers as provided in Sequence ID No:2 and Sequence ID No:3 from respectively clones HOT0m1, HOT75m1, HOT75m3, HOT75m4, HOT75m8, MB0m1, MB0m2, MB20m2, MB20m5, MB20m12, MB40m1, MB40m5, MB40 m12, MB100m5, MB100m7, MB100m9, MB100m10, PALB1, PALB2, PALB5, PALB7, PALB6, PALB8, PALE1, PALE6 and PALE7. The proteorhodopsin gene sequences retrieved from clones HOT0m1, HOT75 m1, HOT75m3, HOT75m4, HOT75m8, MB0m1, MB0m2, MB20m2, MB20m5, MB20m12, MB40 ml, MB40m5, MB40 m12, MB100m5, MB100m7, MB100m9, MB100m10, PALB1, PALB2, PALB5, PALB7, PALB6, PALB8, PALE1, PALE6 and PALE7, have respectively Sequence ID Nos: 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, and 64. Accordingly, FIGS. 11-36 also show the deduced amino acid sequence of each genetic variant of proteorhodopsin. The deduced amino acid sequence encoded by the proteorhodopsin gene retrieved from clones HOT0m1, HOT75m1, HOT75m3, HOT75m4, HOT75m8, MB0m1, MB0m2, MB20m2, MB20m5, MB20m12, MB40m1, MB40m5, MB40m12, MB100m5, MB100m7, MB100m9, MB100m10, PALB1, PALB2, PALB5, PALB7, PALB6, PALB8, PALE1, PALE6 and PALE7, have respectively Sequence ID Nos: 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, and 65.
[0083]In an exemplary embodiment shown in FIG. 37, fifteen different variants of proteorhodopsin in the PCR generated MB gene library 3710 were detected, falling into three clusters. The MB gene library includes MB clones MB0m2, MB40m5, MB20m2, MB40m12, MB100m10, MB20m12, MB40m1, MB100m5, MB20m5, MB100m7, MB0m1, and MB100m9 as well as BAC clones BAC40E8, BAC31A8 and BAC64A5. FIG. 37 is based on a phylogenetic analysis of the inferred amino acids of cloned proteorhodopsin genes. Evolutionary distances calculated from 220 positions were used to infer the tree topology by the neighbor joining method using the PaupSearch program of the Wisconsin Package version 10.0 (Genetics Computer Group (GCG), Madison Wis.). Other methods could also be used. The variants of the MB library share at least 97% identity over 248 amino acids, as shown in FIG. 38, and 93% identity at the DNA level. All the PCR amplified proteorhodopsin genes from Antarctic marine bacterioplankton (e.g. the PAL clones) were different from those of Monterey Bay (e.g. the MB clones) sharing 78% identity over 248 amino acids with the Monterey clade. The changes in amino acid sequences were not restricted to the hydrophilic loops, but spread over the entire protein including changes near the retinal binding domain 3830 as shown in FIG. 38, which are predicted retinal-binding residues. FIG. 38 shows an example of a multiple alignment of proteorhodopsin amino acid sequences that were obtained from different clones 3820. The secondary structure is derived from hydropathy plots (boxes 3810 shows trans-membrane helices).
Light-Driven Energy Generator
[0084]FIG. 39 provides a light-driven energy generator 3900 that utilizes proteorhodopsin, as obtained from naturally occurring bacteria as described above, to convert light-energy into biochemical energy. Light-driven energy generator 3900 takes advantage of the functional properties of the proteorhodopsin protein once expressed in, for instance, E. coli and other bacteria. These properties include the ability of proteorhodopsin 3906 to integrate within the cell membrane 3904 of, for instance, E. coli making an integrated proteorhodopsin protein 3908 (also called an integrated cell membrane protein). These properties also include the ability of proteorhodopsin 3906 to bind retinal 3910, making a light absorbing pigment 3912. The source of retinal 3910 is not limited to chromophore retinal but could also include chemical derivatives of retinal, such as 3-methyl-5-(1-pyryl)-2E,4E-pentadienal, 3,7-dimethyl-9-(1-pyryl)-2E,4E,6E,8E-nonatetraenal, all-trans-9-(4-azido-2,3,5,6-tetrafluorophenyl)-3,7-dimethyl-2,4,6,8,-non- atetraenal and 2,3-dehydro-4-oxoretinal. Illuminating light absorbing pigment 3912 with a light source 3914 results in a chemiosmotic gradient or proton pump in which light energy 3916 is converted into biochemical energy 3918. The chemiosmotic gradient involves pumping of protons from the inside to the outside of cell membrane 3904. When the protons return to the inside of cell membrane 3904 it produces biochemical energy 3918 via a proton translocating ATP-ase. Finally, the biochemical energy 3918 is harnessed by a mediator 3920 to produce energy 3922 for a particular process. For example, since proteorhodopsin functions as a light driven proton pump, it generates energy in the form of a proton motive force across the host cell membrane upon illumination. This light-driven proton motive force can be converted to many other forms of energy, one example above being the regeneration of adenosine triphosphate (ATP), via a proton-translocating ATPase. This coupling of the proton motive force generated by proteorhodopsin, for use by proton-translocating ATPases to synthesize ATP, could be accomplished both in living cells, as well as in artificially constructed membrane systems such as liposomes. Proteorhodopsin-based systems can convert light energy to a wide variety of useful mechanical, chemical, and electrical energy forms, for many industrial and technological applications. These include, but are not limited to, use in targeted drug delivery, uses as primary or secondary energy generators for biocatalyic reactors, fuel cells and nano-machines (including molecular motors), as well as uses in molecular switching or data storage devices.
[0085]Applications that can potentially benefit from proteorhodopsin-light driven energy generation are, for instance, bio-electronics applications that are aimed to interface, integrate, or substitute the silicon based microelectronics systems as well as molecular devices. Other applications that can potentially benefit from proteorhodopsin-light driven energy generation are, for instance, in bio-materials, wherein proteorhodopsin is integrated as a bio-material in, for instance, optical films for light mediated computer memory applications, optical information storage and pattern recognition.
[0086]Alternatively, proteorhodopsin is useful for a process to enhance yield or increase the potential of recombinant protein production or converting the light induced membrane potential into cellular signals, including modulation of gene expression. The biochemical energy derived from functional proteorhodopsin exposed to light could be harnessed to support a variety of cellular processes. For instance, the energy derived from light-mediated proton pumping could be used to enhance the production of secondary metabolites, or recombinant proteins in host cells, such as E. coli. Often, production of specific compounds in the biotechnology industry is limited, since their optimal expression or production occurs in the late stationary phase of growth, when energy reserves of the host cells are low. Retinal-bound proteorhodopsin expressed in such cells would provide an ample source of biochemical energy, by simple illumination. Proteorhodopsin-mediated light driven proton production could enhance any variety of biosynthetic or physiological processes which require energy.
[0087]The biochemical energy derived from proteorhodopsin light driven proton pumping could also be converted to other generally useful energy forms, for example electricity. Microbial fuel cells currently use carbon-based compounds, such as glucose, as the primary energy source. Via specific mediators of reduction potential (e.g. electrons), these microbial fuel cells convert cellular biochemical energy to electrical potential. Unlike carbon-based microbial fuel cells, proteorhodopsin uses light as the energy source, that can then be converted into a chemiosmotic potential, and finally into cellular biochemical energy by membrane-bound proton ATP-ases. Therefore, the use of proteorhodopsin could be employed to derive energy from light as the primary or supplementary energy source, that could then be converted into electrical potential (analogous microbial fuel cells that derive their energy from glucose).
[0088]In addition to energy generation in vivo in living cells, membranes containing proteorhodopsin could be used to enhance or enable other specific processes in vitro. Polymers produced from proteorhodopsin-containing membranes may have specific properties that could be used similarly to those containing bacteriorhodopsin. One example includes the use of these light sensitive molecules for optical computing applications.
[0089]As shown in FIG. 39, the kinetics of proteorhodopsin as it is utilized in 3900 is influenced by various factors such as the type of light source 3914 and the manipulation of light source 3914 in terms of frequency and/or wavelength at which the light 3916 is delivered. Light source 3914 could be any type of light source that delivers light energy 3916 that would be absorbed by light absorbing pigment 3918. For example, the light source 3914 could be tuned to optimally excite rhodopsin variances with an absorbance maximum of 490 nm or alternatively those rhodopsins with an absorbance maximum of 520 nm. Manipulation of the light source 3914 or the light 3916 being emitted by the light source 3914, for example, involves changing the frequency of fast-light pulses or the delivery of light 3916 as individual pulses, a train of pulses, or a continuous source of light. Manipulation also involves changing the wavelength of the delivery of light 3916 at different wavelengths. In addition, as is clear for one skilled in the art, changing the frequency and/or amount of retinal that will bind within integrated cell membrane protein 3908 also varies the function of proteorhodopsin. Finally, as was mentioned in the previous section, genetic variants of proteorhodopsin result in variants of the proteorhodopsin proteins that changes the kinetics of 3600 due to a difference in absorption of light at different wavelengths. The functional expression of such variation in these proteorhodopsin proteins adds another source of variation to the kinetics of proteorhodopsin as it is utilized in 3900.
[0090]As shown in FIG. 39, the light-driven energy generator includes a host 3902. In the present invention, as a preferred embodiment, host 3902 is a cell membrane preparation of E. coli. However, the present invention is not limited to the use of E. coli and, alternatively, other bacteria or eukaryotes could be used to provide host 3902 as an intact cell (in vivo) and/or as a cell membrane preparation (in vitro). For example, but not limited to, bacteria and yeast with developed genetic systems such as Bacillus spp. Species, Saccharomyces spp., Streptomyces spp. or Pichia spp. could be used as host for the expression of proteorhodopsin. In addition, in case a cell membrane preparation (in vitro) is used, host 3902 becomes equivalent to cell membrane 3904.
[0091]The light-driven energy generator 3900, as shown in FIG. 39, further includes proteorhodopsin 3906. Proteorhodopsin is presented in the form of the earlier presented expression vector containing a proteorhodopsin gene or one of its variants. Once proteorhodopsin 3906 has been put into host 3902, the proteorhodopsin expression vector expresses the proteorhodopsin protein in host 3902. An integral cell membrane protein 3908 is created in which the proteorhodopsin protein inserts into and folds properly within the cell membrane 3904. This is accomplished in the E. coli host by virtue of the native signal sequence found in the 5' end of the proteorhodopsin gene. It could also be accomplished by replacement of native sequence with another host-specific signal sequence in non-E. coli host systems.
[0092]As shown in FIG. 39, once retinal 3910 is added to cell membrane 3904, retinal 3910 binds within integrated cell membrane protein 3908 and forms a light absorbing pigment 3912. The particular example of FIG. 40 shows an integrated proteorhodopsin protein 3908 bound to retinal 3910 in E. coli. Chemical derivatives of retinal (as discussed above) could also be used as a substitute chromophore to generate functional proteorhodopsin. For the particular example of FIG. 40, the proteorhodopsin protein was cloned with its native signal sequence and included an addition of the V5 epitope, and a polyhistidine tail in the C-terminus. The proteorhodopsin protein was expressed in host 3902, i.e. E. coli outer-membrane protease-deficient strain UT5600, and induced with 0.2% arabinose for 3 hours. Cell membranes 3904 were prepared and resuspended in 50 mM Tris-Cl (pH 8.0) and 5 mM MgCl2. FIG. 40 shows a proteorhodopsin-expressing E. coli cell suspension. After 3 hours of induction in the presence of 10 μM all-trans retinal, cells expressing the protein acquire a reddish pigmentation as indicated by 4010 and the + (plus) symbol. FIG. 40 also shows that a cell suspension using the same PCR primers (Sequence ID No:2 and 3) but now in opposite orientation as a negative control, did not acquire a reddish pigmentation as indicated by 4020 and the - (minus) symbol.
[0093]FIG. 41 shows an exemplary embodiment of the absorption spectra of light absorbing pigment 3912 upon illumination with light source 3914 as is shown in FIG. 39. As mentioned above, the light absorbing pigment is a retinal-reconstituted proteorhodopsin in E. coli. FIG. 41 shows absorption spectra of light absorbing pigment 3912 as well as a negative control. After retinal 3910 addition to integrated proteorhodopsin protein 3908, light absorbing pigment 3912 was made. The retinal 3910 addition was done at selected time points, i.e. 10, 20, 30 and 40 min, and shows a progression from low to high absorption values indicated by respectively 4110, 4120, 4130 and 4140 upon illumination with light source 3914. FIG. 41 also shows the absorption spectra of retinal 3910 addition at these similar time points but now to a negative control of retinal 3910 containing a proteorhodopsin 3906 that was created using the same PCR primers in opposite orientation. 4150, 4160, 4170 and 4180 indicate the four absorption spectra for the negative control. An absorption peak at 520 nm was observed after 10 minutes (4110) of incubation as illustrated in FIG. 41. On further addition of retinal, the peak at 520 nm increased, and had a ˜100 nm half bandwidth. The 520 nm absorption peak was generated only in membranes containing proteorhodopsin 3906, and only in the presence of retinal 3910. The red shifted λmax of retinal (λmax=370 nm in the free state) is indicative of a protonated Shiff base covalent linkage of retinal to proteorhodopsin.
[0094]FIG. 42 shows an exemplary embodiment of the light mediated proton pump of the light-driven energy generator 3900 indicating the conversion of light energy 3916 as shown in FIG. 39. The proton pump action is illustrated by measuring pH changes in the medium surrounding the host 3902, which in this particular example involves a cell suspension of E. coli, illuminated by light source 3914. The beginning and cessation of illumination (with yellow light >485 nm delivered by 3916) is indicated 4110 ("ON") and 4120 ("OFF") respectively. The cells were suspended in 10 mM NaCl, 10 mM MgSO4.7H2O and 100, M CaCl2. Net outward transport of protons was observed solely in proteorhodopsin-containing E. coli cells, and only in the presence of retinal 3910 and light 3916 and is indicated by 4210 in FIG. 42. Light-induced acidification of the medium was completely abolished by the presence of 10 μM of the protonophore CCCP.
[0095]FIG. 43 is an exemplary embodiment showing that illumination by light source 3914 generates an electrical potential at the membrane 3904 in proteorhodopsin-containing right-side-out membrane vesicles, in the presence of retinal 3910, reaching -90 mV after 2 minutes from light 3916 onset. Transport of [3H]TPP.sup.+ in E. coli right-side-out vesicles containing expressed proteorhodopsin, reconstituted with (4310 and 4320) or without (4330 and 4340) 10 μM retinal 3910 in the presence of light (4310 and 4330) delivered by the light source 3914 or in the dark (4320 and 4340). FIG. 43 shows that proteorhodopsin, in its form of 3912 as a light absorbing pigment, pumps protons from the inside to the outside of cell membrane in a physiologically relevant range. The ability of proteorhodopsin to generate a physiologically significant membrane potential, even when heterologously expressed in normative membranes, is consistent with the proton pumping function for proteorhodopsin in the native gamma proteobacteria from which it is derived.
[0096]FIG. 44 is an exemplary embodiment showing that proteorhodopsin can have a fast photocycle and can therefore be characterized as a fast and therefore efficient transporter of protons. For the particular example of FIG. 44, light absorbing pigment 3912 is induced by laser pulses delivered by light source 3914. Laser pulse-induced absorption changes are shown by 3912 in host 3902, which in this case are suspensions of E. coli membranes containing proteorhodopsin. A 532-nm pulse (6 ns duration, 40 mJ) was delivered at time 0 and absorption changes were monitored at various wavelengths in the visible range in a lab-constructed pulse photolysis system. 64 transients were collected for each wavelength. 4410 indicates transients at 3 wavelengths exhibiting maximal amplitudes. 4420 indicates absorption difference absorption spectra calculated from amplitudes at 0.5 ms (indicated by 4430) and between 0.5 ms and 5.0 ms (indicated by 4440). In 4410, transient depletion occurred near the absorption maximum of pigment 3912 (500-nm trace indicated by 4450), and transient absorption increase was detected at 400 nm (indicated by 4460) and 590 nm (indicated by 4470), indicating a functional photocyclic reaction pathway. In 4420, the absorption difference spectrum shows that within 0.5 ms an intermediate with maximal absorption near 400 nm is produced (indicated by 4430), typical of unprotonated Schiff base forms (M intermediates) of retinylidene pigments. The 5-ms minus 0.5-ms difference spectrum 4440 shows that following M decay an intermediate species red-shifted from the unphotolyzed 520-nm state appears. The decay of proteorhodopsin final intermediate is the rate limiting step in the photocycle and is fit well by a single exponential process of 15 ms, with an upward baseline shift of 13% of the initial amplitude.
[0097]As mentioned above, a proteorhodopsin gene or protein variant can be selected to determine an absorption spectra of the light absorbing pigment to change the kinetics of the light energy generator 3900, for instance to meet a design/functional criteria of an application wherein proteorhodopsin is utilized. FIG. 45 shows an exemplary embodiment of different absorption spectra of retinal-reconstituted proteorhodopsins in E. coli as a function of wavelength 4510. As shown in FIG. 45, the absorbance 4520 is different and depends on the clone from which the proteorhodopsin was amplified. In this particular example, 5 μm all-trans retinal was added to the membranes suspensions in a 100 mM phosphate buffer, with a pH 7.0, and absorption spectra were recorded. The four spectra 4530, 4540, 4550, and 4560 are respectively for the proteorhodopsin genes retrieved from clones HOT75m4, PALE6, HOT0m1, and BAC31A8 at 1 hour after retinal addition. The proteorhodopsin gene retrieved from clone HOT75m4 4530 and PALE6 4540 produced a blue (490 nm) absorption maximum. The proteorhodopsin gene retrieved from clone HOT0m1 4550 and BAC31A8 4560 produced a green (527 nm) absorption maximum. In general, a range of wavelengths could be obtained that is not limited to the range shown in the example of FIG. 45.
[0098]It will be clear to one skilled in the art that the above embodiment may be altered in many ways without departing from the scope of the invention, such as for instance by mutagenesis to change the genetic sequence of proteorhodopsin and thereby changing the kinetics of the proteorhodopsin protein once it is expressed. Accordingly, the following claims and their legal equivalents should determine the scope of the invention.
Sequence CWU
1
651105184DNAUnknownNaturally Occuring gamma protobaceterium 1ttgttatatc
agtaatggct attgctccaa taacttaata ctaatatata attagtttat 60gaataaattt
tatatatttg ggttattgtt ttttacacta aatgcatttt cttgctcaga 120tcttctagat
acagacatga gagttcttga ttccgctgag tcaagaaacc tttgcgagtt 180tgaaggaaaa
gctttactag ttgtgaatgt tgcaagtaga tgtggttaca cttatcaata 240tgctggcctt
caaaagttat atgaaagtta taaagatgaa gattttctag taattgggat 300cccatctaga
gattttcttc aagaatactc tgatgaaagc gatgttgcag aattttgttc 360tacagaatac
ggtgttgaat ttcctatgtt ctcaactgct aaagtcaaag gaaaaaaagc 420acacccattt
tataaaaaac ttattgcaga atcaggtttt actccctcat ggaactttaa 480taaatactta
atctcaaaag agggcaaggt tgtatccaca tatggatcaa aggtaaagcc 540tgattcaaaa
gagcttatat cagctataga aggcttgctg taaaattatt acttagaaac 600taatacagtt
ttaggcttgt ttgctgcaaa tattccatta tctacaactc caggaatatt 660attaatcaaa
gcttccattt cagtggggtt tgaaatatcc atattagaga tatctaaaat 720gtgattacct
tggtctgtta taaatccagt tctatatgtg ggtattccac cgatcgagat 780tatttttctt
gcaacaaggc tcctactttc aggtatcacc tctataggca gtggaaaagc 840tcccaaaaga
ttaaccatct ttgactgatc aactatacat ataaactcgt tagaggcaga 900agcaactatc
ttttctctag tatgtgcgcc accaccacct ttaataagac aattttcagg 960agacacctca
tctgcaccat ctatgtaata agctatatca actacatcat taaggctaaa 1020gacctctatc
ccattttcat ttaataattt tgatgaagca tctgaactag aaacagctcc 1080agcaaatttg
tgcctatgct cctttagttc ttctataaaa aaattaactg ttgagccggt 1140tccaatacct
aaaatcatct caggatgaag attatttttg atatattcta tagcttgttt 1200agcaacattt
atctttgagc cactcataga gttataatac aagaaaatat aggtagttaa 1260ttattttgag
actaaaaatt aaaaaaacag gttcttttaa gaattcccag aagtacctaa 1320agcttattcc
taatgcatct gtttatgacg tagcattaaa atcacctata acatttgctc 1380taaatatttc
ttcaaagctg gggaataaag ttttcctaaa aagagaggat ctgcaaccta 1440tattttcttt
taaaaacaga ggagcgtata acaagattgt aaatttatcc gatgccgaaa 1500agaagagggg
ggttattgct gcatcagcag gaaatcatgc tcaaggggta gccagtgcat 1560gtaagaaatt
aaaaattaat tgcttgatag ttatgccaat aacaactcca gaaataaaaa 1620taaaagatgt
aaaaagattt ggagccaaaa tactccaaca tggggacaac gtagatgcag 1680cattaaaaga
ggcactgttt attgcaaaga aaaaaaaatt gtcttttgtt catccttttg 1740acgaccctct
aacaattgct ggccaaggga ctataggaca agaaattctt gaagataaaa 1800ataattttga
tgttgtcttt gttccggtgg gaggaggagg tattctagct ggtgtatctg 1860cctggatagc
acagaataat aagaaaataa aaattgttgg tgttgaggtt gaggattccg 1920cttgtcttgc
tgaggccgta aaagctaata aaagagttat tttaaaagaa gtgggcctct 1980ttgctgatgg
ggtggcagta tcaagggttg gaaaaaataa ttttgatgtt attaaagagt 2040gcgtagatga
agtcattaca gttagcgttg atgaggtctg caccgctgta aaagatatct 2100ttgaagatac
aagggttcta tcagaacctg ctggggcatt agcacttgca gggttaaaag 2160cctacgcaag
gaaagttaaa aataaaaaac ttattgctat aagttctggc gctaatgtaa 2220atttccaaag
acttaatttt attgttgagc gatcagagat tggtgaaaat agagaaaaaa 2280tattaagtat
caaaatccca gagatacctg gaagttttct taagctttca aggatgtttg 2340gcagctctca
agttacagag tttaactaca ggaaatctag cttaagcgat gcatatgttt 2400tagttggtgt
tagaactaaa actgaaaaat catttgaaat cttaaagtcc aaattaaaaa 2460aagcaggctt
cacctttagc gactttactc gaaatgaaat atccaatgat catctgaggc 2520atatggttgg
tggcagaaat agtgactcag gctctcataa caatgaaaga atatttaggg 2580gagagtttcc
tgagaagccg ggcgcgctgt taaattttct agagaaattt ggaaataaat 2640ggmatatttc
cttatttcat tacaggaacc taggttcagc ttttggaaag atattaattg 2700gcatcgagag
taaggataaa gacaagctaa taaatcattt aaataagtca ggnactattt 2760ttacagaaga
aacctctaac aaggcataca aagatttttt aaaatgaaag gttaatactt 2820taatctaaat
ttaattgaaa aaagctcatc gctagggttt tcccacggct ctttgaacaa 2880ctcggattga
gatctatcat cctcctcgtc gtaaattctc ccacctttag aatagaccaa 2940aaatagatat
gacaaaggag cgagctcata tttatatcta atttggaacg aagctacgcc 3000agtattaaat
tcattaacta tattatttcc tttataaagg tatccattag catctgaaaa 3060aatactaata
ggattttttg ctttcaaagc aacaaattga ctcttaagtc tgatctcatg 3120tttattattt
ttaaaccaat ttagatcaaa agataaggta tcttgtcttg aatcatatga 3180ggcaagatta
ttattatcct gccatatcag ccattcattt tcctttctta ttctatattg 3240cgcattgatt
cttaagttat catttggaaa tattgaacct gctatcttgt aaaattttct 3300accataccca
ttcgaatccc accgattatc tttctctccc ttaaagaagc taactctcca 3360gtcatatgtc
cagaatgaat agttctttgc ctcaaagtct gctgtaatac ctattcttct 3420ttttgatttg
ataaaggggt aggcttcatt ttttcttgtg atagttgtat ttttcccaga 3480agatctaaag
ttaaaatcta attgaaattt agagttgtcc ttaaaactaa aagaattttt 3540ttgatcgatg
cctattggat ttgaattacc agctgtgtca gcatcataat ttagatcaat 3600tccataatct
atttgtttta atatcgagct attatcaaat tcatttattt ttcgatttcc 3660tccaatacca
gcatgaatcc agtctcttct ttgcagataa ccaaagtcat ttaactcgaa 3720gtcgtcttca
aaataaagaa ggcttccact tatgttagat agtttatttg gaagatatgt 3780aaactgagtc
ctatacccaa gcccattttt accatctttt tctgaggcta acagatctga 3840atatgtaatt
aatttttttg aacgaatatt gatgtaatca ataacattga ctgttgatga 3900ttcgcctgtc
atttcattct caacattcgt taccatgaag ccaagcgttt tatttccaag 3960ctttgttcga
gatcgaagag cataatagtc tcttccaact gaaaaggctt catcagcctc 4020acttgctaca
aatactccaa attcattatt attacttttt tgagtaagtc ttaatgcaaa 4080atcaatatca
gaatagtttt ttttagctgc ctcgcaacct tcttcattac tttcttctga 4140gcaattatag
ctgggggcag ctccaatcct ccgcgtattt ataaccgagt atctatcata 4200attactaata
tcaaatagtg attggttttc attgaaaaat gctctttttt ctgagtaaaa 4260agtttcttga
gcagaaaagt taataaccac atcatcactc tcagcttgtc cgaaatctgg 4320attaatagct
aaatttattt gacgaccttt tccagtgcta taaaagattt cagccccaat 4380atctgaccct
tcttggtttg taactgaatt tttatttgaa gatatatatg gaaaaaaagt 4440aagctttgat
tttgtatagt tttgtatttc taagctatct aactcttgaa agtagtcatt 4500tctactagcc
attgttccgg cactgctaac ccatgactca ttttcggcac tataacgtaa 4560tgcggtgtaa
ttaatttttc ttatatcacc atcaggctgt ttcattaacg ttacatccca 4620aggaataaaa
aactcagaga cccaataccc atcaaatttt tgtgtttttg caatccaatc 4680tccatcccag
tctgtcttaa agtctcctgc ttgcgttttt atggcatcga aaagcgagtt 4740cccaagattt
atagcaagaa tgaaagcttt gttaccatca ccatcaaagt ctatatttat 4800agagttttta
tcgctaagtg aatttatttg atctctaagc gtcttcctgg agaacataga 4860atcattactt
tgaaaattct taaatccaac atatatacca tccttatttg agaaaattaa 4920agccgttgta
agaagttcat ttttcttaag agtaaaagga gatgtttcat aaaaatctgt 4980aatttcaaat
gcattattcc actcaggctc atcaagagag ccatcaataa caattgagtt 5040tgaccaaatc
agcacagagg taagtaaaag tgataatgag gctaaagtta ttttcataga 5100tagatttaat
tcagagtatt ttaatctatg aagagtagga aatcatccga tcattctcag 5160aaaaaacata
atcatattta aggtcatgtt tttctccaaa atgcatatta caaatctggt 5220attgataaca
cagtccaacc aataaaggtc ttgatacctc ctcgtttata gagccaataa 5280ttctatcgaa
atatccagag ccatagccaa gcctgtatcc atttaaatca actcctgtca 5340taggaataaa
cattaaatca atttcattta tgttgacata atcttcactt ttaacctctt 5400tgatcccaaa
ttgatttata aagaagttag gctgctcatc cagcaaatta aaagccatca 5460tttcgtcatc
aattactttt ggaatataaa tattcttttt aagtttagta aaagcttgaa 5520ttaacaaatt
tgtattgact tcatttcgaa aaggaaaata taaggcaatg ttttgcattt 5580catgagtatt
gattttttct aatacatttt cttggattaa aaaacttata ttgctcttag 5640ataaatcaga
aatagactgc ccctgctcaa agagtgattt tcgtatttta tttttcacca 5700taaattgtgc
cgaggctaac aaaccaaatc accgccatgt gctatacctg aaccgatggg 5760tcaggtggtg
aatttcataa caaatcaggc ttccctgtaa aggtactgcg caacaatgct 5820agaggaatat
cacctattta attgtatcgg ttcaaattta ttaacacatt agcgtatgaa 5880ccagaatgca
gttaattata aaatatatag gtattaagta aaagttaatt tttagagagc 5940agactctatt
ttttgtatta gcttttcgat atctttatta tttaaagcag agtcagtatt 6000ctcggttggc
ttggataaaa gcttactagc catagttaga ccagctatca ccaaggcatt 6060attcttatca
ctgatgccat caagctcatc atttaataat tttgcagctc taatgagttg 6120gtctttctct
tctggcggac aggctaacgt cagatctcgg ccaaaaattc ttaaagatag 6180cgtttccatt
tctgacatta ttttttgagt ttagttattt ctttatttag tatgagctta 6240tcttttttcc
agtctctttc cctttgcaca taagaatcta ttattccttt ttgctcatca 6300aatcttttta
tcaaaagatc taccttatcc tctagttcaa ataatgatga ttttttttct 6360tctgacattg
tataagttta attgtactag aatgaattgg aaagtttctt taataaaaga 6420gtaaaataag
gcatggaaaa aataattttt aaaaacagaa gagattcact tataaaacac 6480ctacctaaga
attcagcctt aattgtacct ggtgcagatt tgcaatatag aaatgctgac 6540tcatcttata
atttaagaca ggaaagtagc ttctactatc tgtctggctt ttgtgagccc 6600tcttctctaa
tggttttagt taataatgga aaaagcattg attcaataat ttttgttcct 6660gaaaaagata
aacttaaaga aatctgggat ggttatcgag ctggccctga gggcgcaata 6720aatgattttc
tttttgatca agcttttgaa aataataaat cagatgcttt aatgcctgaa 6780atccttcaag
ggctagaaaa agttttttat tcaataggga agaaaaatgg ctttgatcag 6840aaagtaattg
actggacatg cgcagcaaat tctaaagata ggcacagcaa atcaattgat 6900attattgatg
gctcttcgat ggtaggaaat ttaaggctta tcaaagataa gcatgaaatt 6960gatattatga
agagagcttg tgaaatttca gctgaatcat atattgaggt catgaaatct 7020ataaagcctg
gggacaatga gcaggaaata gaggcgctat ttttatatga attcgccaaa 7080aggggaggaa
ggtttccagc ttatacacct atagttgctg gaggtgaggg tgcttgtgta 7140ttgcattata
ttgaaaatga taaagagtta gcttcatcag atttaatttt ggtagacgca 7200ggatgtgaat
acaaaatgta tgcatctgat atcacaagaa ccttcccagt aagtggaaaa 7260ttttcagatg
aacagctaca aatttataat attgtccaca aagccaatct tgctgcaatc 7320gatgctgtaa
aaactggtaa tagcataatg gagccccaaa tggtttcaga aaaagtaatt 7380actgaaggtc
ttgtagagtt gggtattcta tctggcgatg ttaatcagct tcataaaaat 7440ggtgcattca
aggactttta tatgcataag gtgggacatt ggcttggact tgatgttcat 7500gatgttggtg
actacatgga gggagatgag tttatgaagt ttaagccagg gatgataacc 7560acaatcgagc
caggcatcta tatcagtagc gcaatggatg tagatgacaa atggaaaggc 7620atcggcataa
gaatagagga cgacatcctt gtaacagatt caggcaatat taatctaaca 7680gagaaggtgc
catctaatcc tcaagaaata gaatcattga tggcttagac tatggaggtt 7740ccaattgtta
tttctggcgg agggataata ggtaattaca tttctcttag gcttgaaaaa 7800aataatatca
aaaccgttat tgtcgaaaaa gctagtagtt tcaaagccct agataagggt 7860ataagaacag
tcactctcaa tgagcattct atgcaaatgc taaaaaatat tggtatttgc 7920ccatcaattg
ctcaaatcaa cagcatcgac gtattagatg gtgagggtac aggcaaaatt 7980caatttctag
caaaggacgt aggcagcgaa aacctttcat atgtaaccta tttcaatgaa 8040ttacaaaaac
taatttctga tccatgtaaa gaaagaacct tatttaataa tgagattgat 8100tcagttcaga
atcttaatac agaatctgat ccagagatca tgcttaaaga tggcatgacc 8160ataaaaacga
atctaattgc tggatgtgat ggaagaaatt caaatattgc aaaaattgct 8220tcacttacaa
gcagcttcga tgactactta caaacagctt taacttttgt cgttgatatt 8280gataatgatt
cacatggcaa agctcaccaa gttttttctg aaaaaggaat atttgcactt 8340atgccactcc
cagaaggcaa gggtgagatg aataaatgca cagtggtttg gtcaataaaa 8400aatcaagttt
tgggagatga gcctgtatct gagtttgtaa aaaataacat ttcttttttt 8460gaatcaaagc
ttaatgttag tctcagggtt aagtcagaaa ttttaagttt taaattatcg 8520aaccatcatt
ttgaaaacta tattagcgga cctattgttc ttcttggtga tgctgctcac 8580tcaattcacc
ccttagcagg tcaaggtatt aatctaggat ttgcagatgc agatactttt 8640tgtgaagagg
taattagttc ttataaaaaa gggattgcct ttaatgagaa atcagtttta 8700aaaagatatg
agattagaag aaaaagtatg aactttttaa tgttgaagtc tatggacttt 8760tttgtggatt
tatttggttc agaaaattta tatctaaggc tgatcagaaa tttaggtatt 8820tcttcgttaa
ataaatcaaa gtttgttaaa gcattcttta taagacatgc ctctggaatg 8880aataagtttt
aaatttgtat taaacttttt gacctttagc tctaagttct ttaagaactt 8940cactaatgcc
ttttttatca atgattctca taccttttgc agatacttta agatttacga 9000acctgttctc
agattcaacc caaaatttgt gtgtgtgaag attaggaaaa aactttcttt 9060tagtcctatt
tttagcgtga gaaacattgt ttcctgactg tggtatctta cctgttactt 9120gacatatttt
actcattgaa acgcgatttt atagaacact gaggaactta gcaatactat 9180tgtgaaacaa
atttatttat tacggcatgc acaatctgac tgggagagct ctaatcagaa 9240agattttgat
agaccattag caagaaaagg cattgaagaa gcaaataaaa tatcatgtta 9300ctgcaaatct
cattcaattt tagtagataa aatattctgt agcactgcag aaagaactaa 9360gcagactttt
gatatatgca gtgatgggct taattatcca atagctgaag cagtctatac 9420tgatgagctt
tacttttctg gccctggtga gatcgttaag cttatccaaa gtttaagtga 9480attcatttcc
tctgttttaa taataggcca caatccatca atgcaaatgt acatagatgc 9540tatttcagaa
aatcctcata ttacgtattc aacatgcggg ctggcagaaa ttctcgttga 9600aagttcatgg
aaagacttat ctttaaaaaa atgtaagtta aaatctttta ttcaaccagg 9660agagctttaa
aaattgaaaa acataaaaat taagatcatt aacccactaa tgggatccaa 9720gataccctta
cctcaatatg aaacaaaggg ctcggcagga ttggatttaa gggcatgcct 9780agatagtaat
ctcagccttc aagcaggaac atctcagttg atacctattg gttttgcaat 9840gtacttagaa
gatcccggtc ttgcagcaat ggttatacct agatcaggtt taggttctaa 9900gcatggaatc
gttcttggta atctggttgg gttgattgat tcagactatc aaggagagct 9960aatggttcct
gcctggaata gatcagatac agattttgag attaatcctg gagacaggat 10020tgcacaaatg
attatagttc cagtgattca agcagatttt gaaattgtag acgagttcaa 10080tgagactcag
aggggagaaa agggttttgg aagttcaggt ataaattgat aaatttactt 10140tttcttgcca
aatctttctt caaatttctg aactcttccg ccagtatcaa taattttttg 10200cttaccggta
taaaaaggat gagaagcaga ggatatatca agagggtagt atgggtatgt 10260ttttccatct
tcccattctt tcgtttgagt cgtatctaat gttgaacgaa tgagaaagaa 10320cttatcagca
ctagcgtcat ggaataaaac ttcacggtat tcaggatgta tatctttttt 10380cataataaaa
tcaaaatttg gatgagaact ataacaaaaa acaacttctt ttcaatcaaa 10440acatgaaaat
tttttactat gacatagctg tttcattgcc tctaaggcaa tgctttactt 10500atagttctga
gcttaaaatt acaaaaggaa cccgcgtgtc agttcctttt ggtaaaagga 10560agattgtagg
agtggttata aaaaatattc aaaagccaga tttcctaaaa aaagcgggag 10620ctataaaaaa
aattatcgct gtacttgatg aatatccttt gttcgacaag ccaatttttg 10680attctatatt
gtggtcttct gattactatc atcatcctat tggtgaggtt tttaatacct 10740ttatacccac
cgaattaaga aaaattaata ataaaaaaat tgaagcttta agagaatttt 10800ctgaatattc
agtaaatgag gatgataaga aattcgattt aaccaaggat caagaaaaag 10860cagtcaaggc
gctttctaaa tctaaaggat tttcacccac tttattatat ggagttacag 10920ggtctggaaa
aacagaagtt tacttaagag ttgccgaaac ttttattaaa aataataagt 10980cagtattagt
tttagttcca gaaataaatt taacccccca attgctctcg cgatttgaga 11040atagatttaa
tggtgaaatt ggcatatatc attctaagca aacagcagct aagagattaa 11100agacttggct
aaaagctaaa tttggttcta taaaaataat agtaggaact cgatcttcgg 11160ctttagtgcc
tttagataac attggtttaa taattatcga tgaagagcat gaccaatcat 11220ttaggcagtc
agaagggttt aaattctctg ctagagactt aagtataaaa agggcacagc 11280ttgcagatat
tccaattatt ttgggatcag caaccccttc gctgcaaact ttaaaacttg 11340taaaagaaaa
taaatttata agagttgata ttcctaatcg agttgatgga aacaagcctc 11400ctaaattaat
agccttagat atcaataaca gccctttaat aggcggagtt gctaaagaga 11460caattgaagc
aatgcaatca accatagaca gaggagaaca ggttctagtt tttattaata 11520gacgaggatt
cgctccactc tatcaatgta gtagttgtgg gtgggtagca gattgtaaat 11580cttgtgatac
aaatttagtc ttccaccagg caagaaatag attaatttgt cataggtgtg 11640aatctgccta
ctctgttaat ttgtcttgtc cggcatgcaa gtctaatgac tttaatatgt 11700atggagctgg
aacagagaga gttgaagaag ttcttaaaag cagctttgta aagactccaa 11760taattagagt
tgatcatgac tcaacaaaaa aagtgggagc tatggaggct atagttaaaa 11820aaattcattc
ctcagacgca gcaattttag ttggaactca aatgcttgca aaaggacatg 11880attttcctaa
agtcacctta agcgttattt taaatgctga taatggcctt ataagcccag 11940aaattaatgc
attagagaaa atatctcaat tgcttattca ggtctctgga agagcaggaa 12000gaaataataa
tcttgcaaaa gttattattc aaacaagata tcctgatgat ataaatctta 12060ataaaattaa
gacaggagat tatatgaaat ttgcttctca atgcctaagt accaatgagc 12120aaatgaactt
acctccattt actactttat gtctgcttag gtgctcatca ccaactcaaa 12180agagtaatgt
agatttccta gagaaagctg ttttaatttt atccaatagg actgatataa 12240atgttattgg
tcccttgcct tcattagttt cgaagtcgaa aggaaattat aggcaccaag 12300tctatatcca
tgcaccaaag aagacttttt taaataaggt attaaagttt ttgacaacag 12360agtttgaaaa
atggccggaa tctaataagg ttaagtggtc tttcgacatt gatccaatag 12420acttaagcta
aatattaatc ttaattaatt gtcctgggta tattggttta ttgtttagtt 12480tattctctgt
attaatttct tctacagtca ccccaaatct tatcgctatt tctgataaga 12540catccccttt
ttgtattttg taagtcacaa agcctggatc aatactcata aaggtatttg 12600ctttaggttt
gtccttaaaa tagttatgta ttcctaggaa aacagatctt gcaatcattc 12660ttctccctgg
cttgcccttt aatctttgtg cgtcttcagg gttggttata aaccctgact 12720caaccaatac
agaaggtata tcaatagact taagaactct gaaatcagcg tactcaacat 12780tctttttatg
aatttttgtg aaagggtctc ttttaagttg atccaatatc ttagttccta 12840aaattttact
ttcagaaatt tttttcttat atatttctgg ataggtttgt cttgccgcat 12900cctcatcaaa
atcaactggc tttagatttt ttatatcagc ctgtattctt tccctttgtt 12960ttttagataa
gttcctagca acagtacttg aagcttcatc tgaccatata aaaacagaag 13020caccctttac
ggaagataat ctaaacccat cagcatggat tgaaacaaaa atatctgctc 13080catatttcct
agcatcctgg tatctattat ttaaatctaa tgtctcatct ccatctctaa 13140tcatcaccgg
tctgtaccca taagtatctc ttaaggttct ttctaactcc ttcgcaataa 13200gtaacgtcac
atctttttct aaaatattat ttgggccaac cgcaccaggg tatttaccac 13260cgtgacccgc
atctatagca acaacaatat ctctaatact tttattgagg tttttatttc 13320ttttcacctt
aagctctaat tttatatttt ctgtatttat agtctgagtt gggttttgcc 13380aatggactga
ttcatataaa tcaacaacga ttctggtaag acttccatcc tgagatgctc 13440ttactttttt
gattggatag ttgtatggaa catttatctc ggtcggaagg ctcgattgat 13500taatttccat
gacaattcta gagggatttt taaagaataa gaccttacca atgaaacttt 13560atctagacta
aagttaatac taatttcatt gttacccatg tcttgtattt catcgaagaa 13620aacatcattt
ccgctgataa aaaaagatag aaaaccaagg attgccaatc tattcataag 13680tttttaacca
ttctttaaag ccattgtcac cagaaattaa agaaacctct cttccttcag 13740gaagatggct
gaaaattatt tttagatcaa aacttctttc atgttgaagc ctttcaggcc 13800actcaattag
cactactttc ttagagttta tttttctact tagatcaaat atatcaatat 13860cttctgcttc
gttagttcta taaagatcaa tatgcaaaaa cattaaatta ttaaaatcat 13920attcctcgca
gagagtgtaa gttggacttt ttacaagatc cttccatcca caatttttta 13980taatagatct
agatataaag gttttgcctg ctcctaagtc tccctcaaga tgtatttcaa 14040tttcttgcga
agaagattta agtatctcca tagctatttt tgaccctagt tggtttgtag 14100cctcgtcatt
tataagagta agttttttca tcgatttatt aattgtctta ttataggtat 14160taaactagag
gcatttaagc ctatttcacc aatatctacc ttaaatttta aacctgcctc 14220cgaatgaaca
gcaacagcaa taatgctcgc atttcttata tcaagacctt gggcaagaag 14280agcagtcaaa
acaccagcca gaacgtcacc agttcctcct gaagcaagtt caggtccgcc 14340gcacgcgcat
ataaaagact tgttatttgt tgagtcaaag accaccgttc ccattccttt 14400taatatgaca
atagaagccc cataagtgtc tgcaatttgc ttggcagcag aaatcctatc 14460tctttgaact
tcttcaatag atatgtttag taatatagcg gcctctcctg gatggggcgt 14520catcaagatt
gttttattgg attttttgat taaagatttt gatgatgcca cgatgtgtaa 14580agcacctgca
tctaatataa tttttgaatt atttgcagac tttaatattt tacccaatat 14640agtttttgca
aaagcagtat tagcaattcc tggtccatac aatataacgc tatgatcttt 14700aatttttaaa
tctatatcga ccccttcaac catatcaaca ccaattgcca ttacctcagg 14760gtttcttaat
aaagatgggc ttacattaga cgtgtctgtg acgagagtta ctagtccaga 14820gccacaaaat
aaacttgcct cagatgcaag tattccggcg ccacccatgc ctggggagcc 14880agcgcatatc
aaaacttttc caaaattacc tttatggcta tcctttgctc tattgggcag 14940aagatttttt
aattcttgaa aggtaaaact ttgcaatata gacatgaaat tacttaagct 15000aaattatgtt
gaatgctaaa aattcattct tcagcgaaat aaccaatcct atcggaatgg 15060aatgatccca
taaaaatttt ttcgtcatga ggcactgcaa tagtgggtag gccaaaaact 15120gttctagaaa
atgaaaactt atcttctctc ttaagtgttt tttgattaag tttatgaatg 15180gagaagggta
gagaacaatt tgttttttct gcacaattac caaaatcatt agcctgtaaa 15240tctaaagaag
ttaaccaaac cgaaccgtct tttaaaaaga tattatctgg actctgaata 15300aaataactac
ctgttttagt attttgatta atatcgtaaa cagataaatt gtcaccttga 15360ttatagttaa
cataaagaag tccagagccc tcatctagca aaattccatt aggcccactt 15420ccatctgttc
catcaacttt attaaaatta ttgtcactcc aaagtactac atgccctgaa 15480atacttttaa
ataatgaatt cattaaccat tctttcatgc tgatatctct tttatacata 15540tgagatgcat
aaaaacttcc atctttttta agggcaacat cattgaaata atattggtca 15600gggaccctaa
tacagccccg ccatatcata tcccatgaag attcattttt tataatttca 15660aacatttcaa
tcgactcaaa tggcgaatga ttaattacag ctagttgata gaagccttca 15720tcattttcaa
caagatctat tccgtgggga ttgaatatgt ctagctcgcc tcttatgcat 15780gaagcgtcac
cccaagagct ttctccaaaa gtaatttttg gtaccttttt ttcataggta 15840tttaaatcca
ttaaggcaaa gtaaccaggg gtatgttctg catatggacc gatccctcca 15900aattcagaaa
taaagaagaa tttattatca ggcgtaatca caatatcttc tgggtttgag 15960aaattacaaa
taaccttgat acggtcatcg gattcacact tactaatatc catttggggg 16020cctatatagt
cagtagatac gacagttaca gaaataaata aataaagaat agaaaccggg 16080acggtaatct
tataatagtg ctttataaaa atttctaaaa tctttgaagc atgatttgga 16140agtgcgatca
gccaaacccc ttttaaaaaa gatagagccc ccataacaac aaaaattact 16200gaccatagcc
tgtcagacca ttccggctga agaagaccag ttaaaaaaag aatcatccca 16260aagaataaag
ctaaatagcc tgagattttg actttcgatc caacaaattt cgaaaagaga 16320tatgtataaa
gaggctttat aagaactagt aaaccacagg ccaagaaaaa gaatgctaag 16380tagtaattca
taagttagtt tttatataaa tgctccttaa taatactaac aagttctaag 16440ggcttgtcca
atggaacatg gtgagcagct ccaggaaccc cctcaaaagt cataatgtca 16500ccatatgtat
ttttaatatt gtccaagata cttccggagg ttaataagct gtcttcaccg 16560tggatgaaca
aagcagggca gccaaatgaa aatgtgtaac cgaataacct ttcaagactg 16620ctaaacatga
catcatcaaa tttccatctc cacccagcct caatattttt tactgagtgc 16680tcagcaatgt
atcttaagta ccaatcattc gtacaatctt gcttaggcat taacctaaac 16740cttttaataa
tatctgtctt gtcttgatag tgcttgatca ttctgagagg agaagagtgt 16800tgattcgggt
cataatccgg tggtcttata aatgtatcaa taataataat attatttatt 16860agatcctttc
tttcagatgc aacgtaacca gcaacatgtc cgccgaggga gtgtccaaca 16920ataaaaatat
ttgaaatatt ttttttatcc ttttcctttt caattacaga gacaatacat 16980tctccaaaat
ctttaatgcc atatgaatct ctaaaagaag agtcacccat gccaggaaga 17040tctattgcaa
ctatatttgc gcagtctcta aagtggggcg caataggatc ccaccatttt 17100ttatgagcac
ctgttccgtg aataagaatt attaaatctt tgctttcatc tttggagttc 17160cagctagaat
aggatatatc cccatgagga ttcttgataa tctctgagct aggcttgtcc 17220tcaatggcat
ctttgaacca ctgtggggca tgaataatgt cttgatttag attgttagtt 17280atttccataa
acagtattct aagctataaa aaataaaaat atgaataaac ttaatttaac 17340gccagcagca
actgttttag tcctaaagga ttctcctgat gggatggaag ttttgatggt 17400aaaaagatca
agtaggcctc ccttcggaga cctttttgtt ttcccgggcg gcaagattga 17460cgaaggtgat
ttcaataata agatagaaga tttttgtgag ggcgtgactg ataaagaggc 17520ctccataaat
cttggattag attctggagg tctagcatat tgggttgcat gtattagaga 17580atgctttgag
gaggttggaa ttttacttgc taaaaaaaag agtggggaag atcttgatct 17640agatggagtc
gataaacata aatatcaaaa atatagagag atgttgttaa ataatgaaat 17700tgatttatat
aaaatctgtt tagaagaaaa tttaattcta atgcctcaac aaatagcccc 17760tttctcgcat
tggataaccc ctgaaataga aactaggaga tttgatacac gtttttttat 17820tgcccacctc
cccaagcatc agaccggaga acatgatggt agtgagctca tagacagtgt 17880ttggatttca
ccaaaagaag cgctcaaaaa atctcgttcg ggtgagatgc ctatgattat 17940gcctacaata
aaaaatttgg aacaatgtgc acaatttgat tcgggctcta agcttttaga 18000aaatcagagg
aatctctcaa atgaggatat cccaccaatc ctgccaaagt tttttaaaga 18060agatggtgag
tggaggggtc tattgcctgg agataaaggg tatgaggatc attaaataat 18120atggacttaa
ttactaaaat aacagctccc aaccctggtg ttttcactgg gggtgggact 18180aatacttatt
tgattggcaa agatgatata acccttgtcg accctggtcc aaatatatct 18240gagcatctag
atgaaattat caaagcaggg gatgggaaaa taaaaagaat ctttgttact 18300catacccata
cagatcattc cccagccgca ttgcctttat caaaaactct taatgttcca 18360atgtacggaa
ggctagtaga tggtgaatcc tcatgggagg atgaaacatt tatcccagat 18420attattttaa
atgataaaga tattattgag acagacgaat atacgttaga agtaatacat 18480actcctgggc
acgcatctaa tcatttatgc tttttaataa aagatacgaa atgccttcta 18540acaggcgatc
acattatgga cgggtctacg gttgttattg ggccaccaga tggcaatatg 18600acaagctata
tcaattcatt agaaaagtta ctagattttg atattgattg ctttgcgcct 18660gggcatggaa
attatattca tgagcctgag aaaaccattc aatcaattat taggcacaga 18720ctaacaagag
aaagaaaagc tcttagaaag ctaggagagg caggaatctc atcattagat 18780aaacttacta
agcttgttta tgatgatgtg tcagagatgc tccatcctat agctaaatat 18840agtctagaag
cacatttatt aaagcttata gatgaaaaga aagttaaatt agataaagat 18900ctattcgaaa
taatttaatc ctttttattt ttatgtaaga ctttctcctc aatagcttct 18960atatcaatat
catcaattga gtcttcgtta ttatcaggta tcttttttac atctttttca 19020attttaaggt
cgataggaga ggctccaaga tcaaaagtta gctcccttac attttttgaa 19080atagtatcct
caacacagtc atcttgatcg tatgcctcac gggtctctcc tttatcatta 19140atgggaaagg
gtctttgggg agggcccatt tgcatgcatt taatagtagc aactggtgag 19200tagtaatcgt
cactagaata taacttatca agttcttttg gtgatattga gcaaccaata 19260attccaaata
gaataaatgg cgctaagagt cttttcataa actttgtttg ttttctataa 19320gagactcaac
aaccgaagga tcggcaaggg ttgtcgtgtc tcctaaatta gatagatcat 19380tctcagcaat
ctttctaaga attctgcgca taatttttcc cgatcttgtt ttaggcaagc 19440ctggagcatt
ttgaattaaa tctggttttg caatagctcc aatttcttta gcaacaaatt 19500gtttcaattc
ataactaaag ttgtcatcaa atgattcatt tatcattaaa gtaacaaaag 19560catatattcc
ttgcccctta attggatgat caaaaccaac aacagctgct tcagcaattt 19620tagggtggag
cacaagagca ctttcaattt cagctgtacc tagtctgtgg ccagaaacat 19680taagaacatc
atcaactctt ccggttatcc agaagtatcc atcctcgtct cgcctggctc 19740catcaccagt
aaagtaaata tctttataca taccaaaata ggtgtcgatc attctttggt 19800gatcaccata
aatacttcta atttgactag gccaagattg ctcaataact aaattacctg 19860cattagagcc
ttctagcgta tttccatgct cgtcatagag agatggctta actccgaaga 19920agggcagagt
tgctgaccca ggttttgttg gagtaatacc cgctattgga gagataagta 19980cagaaccagt
ttcggtttgc caccaagtat caataacctc gcaattagat ttaccgacaa 20040cactgtagta
ccaatcccac gcctctggat taattggctc tccaactgta cctaaaattc 20100ttaggctatc
tctttttgtc tttttaacag gatcatcgcc ttgggccatc agagctctaa 20160tggcagttgg
agctgtataa aaaatactaa tgtcatgctt atcgcatatc tcccaacacc 20220ttgatgctga
agggtaggtt ggcactcctt caaacataag tgttgttgct ccatttgaaa 20280gaggtccgta
caagatataa gtatgtcctg ttatccatcc cacatctgca gtacaccagt 20340atttgtcctc
tggccttatt ccaaaaagat atttgaaact aatatgagcg cctaatagat 20400aacctgcagt
agtatgtaga acaccctttg gcttgcctgt agagcctgat gtatagagaa 20460tgaaaagagg
gtcttcggaa tccataggct ctggagcaca cttattagaa acatctttaa 20520caagatcttc
ataccaaaca tcttttttat catcccaatt aatttcgcca ccagttcttt 20580ttataaccag
tgtattttta acatctggac agcccagaag agcctcatct acattagatt 20640taagtggcac
ttttttgccg ccccttaaac cttcatcagc agttataaca attttacaat 20700cagcatcaag
aattctatct ttgagtgatt ctggagaaaa gccaccaaag acaacagagt 20760gcacggcacc
tattcgcgtg caagcaagca ttgcgaacgc agtctcaatg atcataggca 20820tataaataca
aactctcgag cccttttgaa cacccaggtc ttttaaaaca ttagcgaact 20880tacatacttc
gtcatggagc tctttgtagg tcaattcttt agaatcagca gggtcatctc 20940cttcccatat
taatgctatt ttgtttggat cattttctaa atgcctatcg atgcagttta 21000agctaatatt
tgttttacca ccctcaaacc acttcgcatt attaaattga ttattgaatg 21060ttgttttgaa
gtcttccatc cagcttatgt tttcgtttgc tagattttta aaaaatttag 21120aaggatcttc
tatggattgc ttgtaaagtt ctttgtattc atcaaagtct tttatataag 21180gattacttga
atgttttggg ctataaagct ttcgaggcat tcttaaataa ttgaaggttg 21240ggggttaatg
aaattctttc ctttgggatt ggacattatt tttgtaatga gcgattcgta 21300atcgctatca
ttggtttcta aattaatatc tgcagcatta attattaata aaggggcaga 21360actatagtct
aagaaaaacc ttgagtatgc atcattcagt ctttccaggt agtcaagagt 21420tagatattgt
tcgttaatat ttcctctctt agtaatcctg tcttttaaca catcaatagg 21480tgcctgaaga
tagattacta ggtcgggtgt tggcgcgtcc agggttagat ggtcatatac 21540tttgtcatat
agatccattt cctcgttaga aagagtaacc tcagcgaata atcgatcttt 21600ttctattaaa
aaatcagcaa ccctcactgt ttcaaaaagg cttctttgtt taagatcttg 21660aatttgttgc
attctttgaa acaagaagaa aagctgagtg gctagagctg attggcttgg 21720gtttttataa
aaattcttta agaagggatt ctctgccggt tgttctaaaa aagaatcata 21780attaaatgtt
tcggctatct tatttgctaa agtagttttc ccaacaccta tcggtccttc 21840aattgcgata
tattttggaa gtggtacttc tttaatagct gggttcattt ataacgtata 21900gtttttatcc
tagattatct caactcgtta tagaagccaa atatttttgc agattcttca 21960tcactttttt
cattcatctc agcaataatg gttcctctca atgagtcatc atttttttgg 22020aaatcagtcc
accacttccc aataccactc ttgtctccag aagctctctc tcttacaagg 22080agaaaatcat
atgcagctct aaaccttgga tgtcgaagag ttttatatgg ctggctacca 22140attctgctat
gaagttttaa ttgaagtacc caaatatcct taatatagct tgaaaatttt 22200cttggtattg
ctgtgatttt ttgttgttcg cgaagtacac catccataga tcggaaaaat 22260tttctcacat
taatttctcc attcttagaa cactttttca ataacgaggg ccataacaat 22320gcagccataa
gaaagcctgg tgtaattgac tgttggtttt taaccctatc atcagtattt 22380cttagagcat
gtgtcataac attgctggca aagtcattcc tacttggatc agaaaggata 22440agatatttat
ttaaatgaaa tgagcagagt ttttcaaaat tcttttcacc cattccattc 22500aaaaatattt
tgcagaactc atcaaacaat ctagcgtttg agatgcctga taaaagatgg 22560cctttatcat
agatggcatc cttaaccaga ttatctatct taaaattgag tttgttacta 22620aatcttatag
ctcttaagct tctcactgga tcttcttcga atcgtctttg gggatctcca 22680atagatacaa
taaccttctt gtgtatatgc ttaagcccat cgttatgatc ttctattttt 22740tttgtaacag
gacagtagta aagggcattc acagtaaaat ctcttctatg acaatcttgt 22800tcaagagtgc
cccaattatt atctctaaga atctttcctg tcgaatcagt aacaatattt 22860tcaccatctt
cctgatcact tcctgatctg aaagttgcta cttcaagcaa ttcactccta 22920ttaaaaacat
gaaccaattt aaatcttttg ccaattattc ttgaagcttt aaatgttttc 22980ctgatctgct
caggcgtggc attagtggct atatcgaaat cttttggctc tagtcccgtt 23040agtgcatcgc
gaacgcaccc accaaccaga taggcctgaa aattattttt ttgaaggtct 23100tgaacgacag
atatagcaaa tttacttatc ttattattat ctatcaaatc taattatgaa 23160tttttatgat
ttaagggcat tagccattta gctgcttttc tttaatttcg tcaagtgttt 23220tgcaatgaat
gcaatgagtt gcagttggtc tagcttcaag cctcttaatt ccaatttcat 23280caccacagga
ttcgcaccaa ccataatcat cttgcttaat ttgttcaata gatagaccta 23340ttttgctgat
aagttttctc tctctatctc tggttcttaa ttcaaaagca aactcttcct 23400cttgagaagc
cctgtctact gggtctgcat aggtttcacc tttagctcta agatgatcaa 23460aagttttttg
catttcatcc ttaagatgtt ctttccaaag aagaagaacg gcaacaaaat 23520gtttcttcat
tgctgcactc atatattttt cacccttctt agatttataa ggtgcaattt 23580tagatttatt
attagctatt gttgcttttg cagacttctt ggctgcaact tttttaactg 23640gagctttttt
ggtaacagtt tttttaactg tagctttttt ggcaacagtt tttttaactg 23700gagctttttt
tactgtcttg gatttttttt cgaccatgta agattttata gaattttagg 23760gtggagaaaa
tatcagatac tgacaaaatt agctagtcat tttttaattt atttaatact 23820ttcaagtacc
catccgagct aagtcttggc ccaaatgtct caacaacctt ggaagatgca 23880tagtttgcaa
acttagcaca tgcttcaata ttattccctt gaaggtaggc atgcataaac 23940gatccggcaa
acatatcacc agctccattg gtatctattg gagttatttc ttctgcttga 24000gcatgcttct
caaccccttt atctataaca acacttccat cggcgccttt tgtaatagca 24060gtcatatagg
gcttttcttt ataaaagcta acagcatcat caaggctttc tttaccagaa 24120aaagcaacag
cttcatcatc attacagaag atcatatcta ttccatatga ctctattaaa 24180tcaaattttt
ctttaaaacc atgaacaata cctgcatcag aaagagacaa ggctttcttt 24240acgtccttgt
ctttaaggtg ctctaagact gaaataacag cattaaagtt atcgtcactt 24300gttaccatgt
agccttcgat ataaaaaatt tttgaatttt ctacaacatc aaaatctata 24360tctgatttac
caagatacgc actaactcca agcatgctgc tcatagttct cttagcgtca 24420ggagtaacta
aaattaagca tttcccagtt ggttgatctg tattttcaga gctgacacca 24480atatgtttga
ctccagccga cctgagacta tcaagatagt ttcttccatc ttcatcatca 24540gaaactctgc
atacatgatg gcaattcgaa ccataatttg ctgcagcaac aagagaattg 24600gttgcagagc
caccgcaatc agaaatcgat tcggctccca tttcaataag tttgctaatt 24660ataggtgcct
gttcttcaga agatgaaaga gtcatagagt cggctacaag gcctacactt 24720gataaaaatt
catggctgac tttatattga gtatctacta aagcatttcc aagggcgcta 24780atatcatatt
tcatgtgtta ttcctttttg tattatttgt tttactgttt ctaacgttct 24840attaatttct
tgatccttat gcattgctga aatgaagcct gcttcatatt tagagggggc 24900aaagtatatt
ccacttctaa tacatgaatt taaaaaattt gaaaatagta catcatcagt 24960ttttgcaaca
tcattaatat tattagggag ttcttctgaa aaaaagaatc cgaacattcc 25020accaattctg
tttatagaaa acggaatacc tgactcaatc attaaggttt tcatcccatc 25080caaaagcaca
gatgcatttc tttctaactc tttaaacgga ttttctttaa tcaacaattg 25140caacaaagca
gttcctccag ccatagctag tggattgcca gacaaagttc ctgcttgata 25200gacaggacca
gaaggagcta gatagttcat aatttcttct ttgcctccaa aagctccaac 25260aggaagaccg
ccacctatta ctttccccaa agcagttaag tcaggagtaa tattataaat 25320ttcttgagct
ccgcctagcg aaactctaaa gccgctcatg acctcatcaa atattaaaat 25380agaattattg
gctgaggtgg tttccctcaa taactttaag aaatcttcat gacctggaac 25440aaagcccata
ttccctgcta ctggttcgac tattacggcc gctaagtcat cttttatctc 25500attaaatatt
tctaaaaact gttctttatt attgtattcg caactaaatg tatattttgc 25560caaatctgca
ggaacccctg gagagtcagg taagccaaag gtggcaaccc cagaacccgc 25620cttaattaaa
agagagtcaa catgaccgtg ataacagcca tcaaatttaa taattttatt 25680ccttcctgta
aaacctctgg ccaatctaat cgttgtcatg gttgcttcag tacctgaatt 25740aaccattctt
attttttcaa ttgaaggaat gcatttttta attagcctgg ccacatcaga 25800ttcaagactt
gtcggggcgc cataactagt tccaagcgca acttgatttt taattgcacc 25860tacaatgtct
gggtgtgaat gacccattat catcggaccc caagacccaa tataatcaat 25920atattcatta
tgatcagcat cataaaggta tgcgccggat gctctttcaa aaaatatagg 25980attgccattg
atatttttga atgcccttac tggtgaattt acccctccag gcataagagt 26040cttggcctct
ttgaataagg ctatggattt atcaatttta ttatgtgtca aagctaattc 26100ttcttaattt
ttaatttcga tatgatatca acttaattct tattatgtat ttaattgttt 26160tgaaaattca
tccacagtat tccagttggt gaactcataa gtatttgtaa catctgtagg 26220accctttgta
atccacataa tcaatctaat catgtgttta tctataaatt tatattttgg 26280ataatctatt
ttgcctgcaa agactgctaa tttcattggg ttccaaggag acagctctaa 26340aaatttttgc
atatacgggt ttgtttcagg tgtatttttt tcaggctttc tcgcaactac 26400attaactgaa
aagaaggcat tttcttttgt ttcaagacaa gcaacatttt tttgaataaa 26460ttcataaagt
tctggtttgt gtttgccata cctaatgctc gcaccaataa taattttatc 26520aaattgatat
aaatctaact ctactgcttt tgcaatatgt attatttttg aagattcaga 26580cacatctaat
ttagagaaaa tttttttaca aatttccaga gtttgcccat cggttgtaga 26640gtagattagt
agagttgatt tcataaatat attttatatt gttaattaaa aaatttcatg 26700gcacgcgcaa
aatataaaca aaaatatagt atatttaaaa aatgattaaa tatttattta 26760ttatcgcttt
attcatcagc aactttgcat atgcaaatat tgatgcagct aggtgcgcag 26820ggattagttc
agatcaagaa agacttgatt gctatgattt aatttttaaa gctaatgatg 26880agctaccttt
agatagtaat attaaaactc tcatcacacc agctatcaag gctgtgaccc 26940cagctgattc
aataaaaatt gaaaataaag caacaaaaga aaaggatttc gggcttccaa 27000aaacaaaaat
caaaaactct gcaaagaact caataaaaac ctcggtagta aggattaaaa 27060aaacaaaaag
tggtaaatta atttttactt tagaaaatga gcaagaatgg actgctgaaa 27120cttcttatag
agcaaggaat atgtttaaac cagaaaccgc agtcatttta gaagaggccc 27180tggttagtgg
tttttatatg attaatataa gtaataaaca gaaaattaga ataaagaggt 27240tgaaatagca
atgaccatac agagtatagc aacgaccgag ggtgccatga aaagaataag 27300atctgttctg
gggtcccaag atggctcaag cttcagggtt tatgttaccg gaggtgggtg 27360ctctgggttt
caatatggct tcaagtttga taacgatatt gcattcgatg atgatgttat 27420aaattgtgga
gatttctcgc ttttaataga ctctatgtcg tacccctatc tttatggatc 27480aactctggat
tttgttgaag atctctcagg ggctaaattc gttattaaaa acccaaatgc 27540caaaacaaca
tgcgggtgtg gagagtcatt tacagtttag attttgtaat tgaacccaaa 27600agtcctttta
cattcttgct tgatgtctga actatcagag cctcattact gactctttta 27660tatcccatcc
aggcaaatgc catggactca atagcaaaaa catcatggcc taaatcactt 27720gagaggacga
tatcattact agccatctca gaaattcttt ttaccagata cttgttatgg 27780gcaccaccgc
cacaaatgac aatgtcacaa ttattatgac catttttatg gattgaattt 27840attatagatt
ttgctgaaaa ttcaaccagg gtacatagaa tatcttcagc tttctttttt 27900aagaatttct
tggataatat ttttatatta aagagctctt tgcctgttga ttttggacat 27960tttcttttaa
aaaaattatt ttgaagcaat cttcctaact caatgtgatc tacttttcct 28020ttggctgcaa
ttgcaccatt tttatcaaaa ggaatctgaa gaaaatcact acaatatgca 28080tctaaaattg
cattaccagg gcctacatca gttccccata tgtcatttct attttttaca 28140aatgagtaat
ttgaaatccc ccctatattt aaaataattc gtgggtttct agctttataa 28200aataattgat
tatggaattc cggaacaaga ggggcgcctt cgccacccag agcaatatgc 28260atatttctaa
aatcacttac caccagaagg cccgtttctt ttgcaacaat atttggatca 28320ccaatttgca
tagaaaatgg aaatctctta ttaatttcat gcctaatggt ttgtccagaa 28380atagcaacac
actctataga tgatttctta attttactaa atccaatcgc ctcattaata 28440gattttgaaa
ataagaatcc tatctcttta ttgatagtgc ctaaatctga caacgagctt 28500tcgttatttt
caataagttt ttttactttt aatcttaggg attttgggaa tttaattgaa 28560tggaaatact
caagatgtat ttttgtacca atgctaagaa atgaaatatc aatagcgtca 28620tgactagttc
ctgtcattgc tccaatatag atctttttac tcattactac ttggagaaag 28680tttattgaac
tcctccattg ataatttatt attctttaat agtgtatcaa aacttgccct 28740taaatttttt
ggtactggtt cagctgatgg taattttact tttataggat ctattctttt 28800cttacctatc
ttaaactcat aatgtaaatg aggccctgtt gccagtcctg aactcccaac 28860gaaaccaata
gtatcgccct gagaaacttt tttgcctttt cttataccct tactaaattt 28920ttctaaatga
caatatctag tcgaatactc atttgtatga tttataacta tctcattacc 28980acatccattc
ctttgaccag aaaaagaaac aatcccatcg cctgtagttc tcaccggcga 29040tcctcttttt
gctgcataat cgactccatt gtgagctctt attgtatgaa gaacaggatg 29100cattctgttt
ggattaaaat gagaactaat atatgcaaaa tctaaagggg ctcttaaaaa 29160ggccttttgc
atattatttc cattttcatc aaagtactgt tttttactgg cttcagtgaa 29220aaacctattt
gcaaaatagg tattaccatt gttaataaac ttagcaatta cgatatcgcc 29280attttttact
ttctctccat cactataagg agtgtcataa attacatgaa attcgtcacc 29340ctctctaata
tcaaaaacga agtctacatc ccagccaaag atataagcaa agtccataat 29400cacactttcg
ggaatatttg catctagagc tgcctcataa aatgaggatt gaataatccc 29460gctattgtaa
gattcaatta actcaatact tttgcttata ttcttaaata ctatttctgg 29520agtaaggcta
attgaaatag aatttatttg atctttcatg atctcaattc tggttagctc 29580ctcaccagaa
tattcaaaga gcattttttc ccctggttta atgttagcaa tgatattttt 29640ggagtctaat
ctaaaaattt tataagcagt atttaaaggc actgaaaagt tttcaaaaat 29700tattgaaaga
ttttctccat cttgcacctc atgcatctga tatgttttcg tttgctcgag 29760aggtatagag
aatttttcag taatctcaat ttcttcaact ggcaaagatt catatgtttc 29820tatatcaata
taaagcatta atattaagac aatagaaaca gcaaaagaaa caagtaccgc 29880tctttttgga
acttttttaa aacctatcat tagtcctctt gaatatttaa gagttcagtg 29940ttgccaccaa
atgctaccga gttctttgag acaacctttt cgttaagaaa ttgcagaagg 30000taattaggcc
caccagcctt ataaccgctg ccggatagat tttgtccacc aaaaggctga 30060gaaccaacaa
cagctcccac catatctcta tttatgtaaa tatttccaac attacacttg 30120tcactaaata
tgtcggccct tttttctact ctagtatgaa tacccattgt aaggccgaag 30180ccactatcat
taatattttc aattagtgca tccatttcat ttgatttaaa tctaacgata 30240tgaagtattg
gtccaaattg ctcgtctttt agatcagaaa tattatcaat ctcaattatt 30300gttggagcta
caaaattttc atctacatta tcagcagacc taaaaataga ataattctta 30360tcttcaaaac
ctttgacgta agcattgaga ctatcaagag atgttttgct aattattgga 30420ccgatatctg
tgtcaagatt ttgagggttt ccaattttta gttctttcat accaccctta 30480atcattgata
aaagatcgtc atatatttcg tcttgtacac agagaactct caaggcagag 30540catcgctgtc
cagagctatc aaatgctgaa cgaataatat catcggttgc ttgctcgaga 30600agtgcgcttg
aatcaacaat cattgaattt atgcctcctg tctctgcaat caaaggaata 30660attgattcat
ggttacttgc aaggctgctt tgaatttttt ttgctgtttt taaagacccg 30720gtgaatgcaa
cccctttgat gttattaacc tttgaaagca tgtcaccatg aatgccgtct 30780cctaggatta
aatttaaagc atcttttggg accccaatct catgaaattt gttcacaatg 30840atgtagccaa
gaattgaagt atgttctgag ggcttaactg ttactttatt tccacatgca 30900agggctgcac
ttatttgccc tataagtatt gcaacaggaa agttccatgg actaatacat 30960aaaatatggc
cttttggtga gtaagatagg gcattgatct cgccggtggg accctcaaga 31020atatggtctt
ctgtttggag gccaacagct tgttttgcat agtatctgag aaaatctata 31080gcttctctaa
tttcatcaat agtattttgt actgtttttc cggcttcatt cataaggtaa 31140taaattagct
cagaaggatt tgcttcaatg tcgtctgcaa ttttttctaa aatggatgct 31200cttttttcaa
catgcatcaa agaccattca ctaatatttt gttcttcaag gctgcccttt 31260atataatcta
gatcatcata tgaggctgta ccaatatttc ttccatctgc aagagatgag 31320atgtcatgag
tattggtttt cttataatcc tttgctttat aaatcgatga agcctttatt 31380tcttttgaat
caaactttcc aagctcctct tcaagcattt ctaaattcac cctttcactt 31440aaatcaaatc
ctttagagtt cggtctgtca ttaaatatat ttcttggcat tggtatctct 31500tttttttcat
cctcaatttt taaatggggc cctctggcaa gccaagcaga atctgtttca 31560ggatcaagta
acctattaat aaatgagctg tttgctccat tttctaatag tcttcttaca 31620agatagggta
gcaagtcttt atatttgcca atcggcgcat aaattgaggt atttttttca 31680gtatttaaga
ttttgtttgc ggacttataa agcagctctc ccattccaaa tagtctttga 31740aattcataat
ccttatttga gccaagatga tggattgcag aaatggtatg tgcattatgt 31800gtagcaaatt
tagggtaaat tttttccaca ttaaaaattc tttttgcaca ggctaaataa 31860gctaagtctg
taacagattt ttttgtatag acaggataac catcatagcc atagatttga 31920gcatgcttaa
tttcataatc ccaatatgct ccttttacga gccgaacatg cataggtgct 31980ctgttttcta
gtagctcttc taaccaatct attgtggcta tagctctttt gccatatgcc 32040tgaacagcaa
taccaaatcc tttccagttt ttaatatttg gtgaaagggc catctcttta 32100ataatctcta
aactaacggc cagtctgtct tgctcctcag catcgattgt aatttctaca 32160tctttagact
ttgcatactc tgtgagctga ataagttttg gaagtagatc agactttatg 32220tcttttagct
ttttcatttc atatctagga gataaagctg atatttttat tgagacacca 32280ttaattgtat
tttttgtcag atttatcttg ccgacttcat ctatagcatt cttataagac 32340tgataataag
tgtcggcttg ttcagcgttt cttgctgcct ctccaagcat gtcaaatgaa 32400taaatttcat
tttctatatt cttaattttt tttatgtcat caaaatctct gcccataaca 32460aattcttgac
tgagaatatg catggcacca actacagcat ttctaattgg aaactcgcca 32520gattttgaga
ttaaagaact taatagggcg cttgggtttt tggtccactc atcgggtgta 32580gaaactacct
tgcctgcaag aagaagaccc catgttgatg cattaacgaa aacactatcc 32640gctttgttta
aatgctctat ccacgcaccc tcagataact tctcagatat tattagatcc 32700cttgtttttt
tgtctgggat tcttagtatt gattcggcta aacacatcag agcaacacct 32760tctttattat
ctagcccata ttcgcttaga aatgcatcta gttttgttct ctcgctctta 32820ttctctctgc
aagcatcaat tattttattg agcatttttt gaaatcgaag ggtcatttaa 32880aaaatcagaa
ttactaataa gatctgaaac tatttcttgt tcaggataaa acttgttact 32940tgttaatgtc
atagctatta ttttaatctt atagtgccaa gtatcaactt actgattcta 33000ttaatgactt
aaaaacatta tgatagtcat atgagtgatg cacttaaatt aattaaacga 33060ggaaccgacg
agatcctcac agagtctgat ttaaaaaaga aattagattc tggaaaacag 33120ctaatcatta
aggcggggtt tgatccaaca gctccagacc ttcatttggg tcataccgta 33180ttattaaata
agctaagaca ttttcaagat cttgggcata aagtaatttt tctaattggt 33240gattttactg
gtcaaattgg agatccctcc ggtaaaaata aaactaggcc aacacttact 33300tcagaagaat
taatttcaaa tgcaaaaaca tatgagaaac aagtttttaa aattcttaaa 33360aaagaattaa
cagaagttaa atttaattct gagtggtgca acaagcttgg tgcagatggt 33420ttgattggtc
ttgcatcaaa atataatgtt gcaagaatgc tggagcgtga tgattttaat 33480aaacgttata
gcgcaaatca aagcatagct attcatgaat ttttataccc ccttgttcaa 33540ggatatgact
ctgtagccct agaggctgat gtcgaatgcg gaggaacaga tcaaaaattt 33600aatttgctag
tagggagaga gctacaaaga tcctatggtc aagaacctca ggttgtttta 33660actgtaccca
ttctagaagg cttggacgga ataaataaaa tgtccaaatc attaaataac 33720tttatagcaa
tagatgaaga gcctaatgat atgttcggta aaataatgtc tatttcagac 33780gagttaatgt
ggagatggtt tgagttactc agctttacct cagagaaaga aataggagtt 33840cttaagaaga
aaatggaaga agggaccaac ccaagagata ttaagtttct tttagcagaa 33900gagttagtag
atagattcca ttcagagggt gatggttcga aatgcaagga agcttttctt 33960caaagatttc
aaaaaggtca aatgcctgat gacattcctt ccatgtcagt tgatgttggg 34020gctgagggca
ttccattagt aaacctttta aagaattgtg agatgacatc aagtacatct 34080gaggcgatga
gacttgttaa acaaggcgga gttaaaatcg actctgtaaa aatagaagat 34140cctaaaatgc
taatttcaaa aggccaggag tctatttatc aggttggcaa aagaaaattt 34200ttaaaaatta
aaacataatg aaaaataaat tagtccaaat atttcttctt ctttttatag 34260ttgcgtgtaa
tcaggatagc cctaatatta aacaaattag taatatgcag tattttattg 34320ataatgaaat
aagggaagga atctattctg ttgaaccagg cttgcaatac tcaattatcc 34380aaaatggaga
tcaaagttct gaatcaccat tgctgcagga tacaattaca gctcattttc 34440acgggaccct
cactgatggc tcagtttttt ggagttctgt tgaaatgggt gagcccttaa 34500cagtcgaact
atcaggtcta atagttgggt gccaaaaaat aatctctatg atgaagaaag 34560gtgatgaatg
gagagtttat atcgacccaa gtatggccta tggcgatgag ggccggcctg 34620ggataccttc
aaactcaatt cttatttttg atattgagtt attagatatc caaaaaaact 34680aacccttatc
tataacggat agagcgtatc cataaacctc agcaacctgg ttaataattt 34740tatttttagg
tgttccggct ccatgaccag ctcttccttc aattctaatt aatattggat 34800tatcacaccc
ctgggactct tgaagttttg ctgcaaactt gaatgaatga gagggtacta 34860ctctatcatc
tcttttagct gtagtaatca gagtagttgg atagcactca ccttcaacaa 34920tattatgcag
aggcgaataa gctaataagt tttcaaactc atctttctta tctggagagc 34980catagtcact
ttcccaggcc cagcctatag taaatttatg aaatctaagc atgtccaaga 35040cgccaacttg
tggaattgcg actttaaata aattaggatt ttgcaacatg gtcgcagcca 35100ctagcaaacc
tccattagag ccaccttgaa tagcggttga tgatggagaa ccaatttctt 35160gtgcgtgtaa
aaatttagca gaatatgcaa agtcatcaaa aacattttgt ttattaaaaa 35220gcctgcctgc
atcatgccag ttatcaccat attcaccacc acctcttaga ttaactacag 35280caactattcc
cccttggttc atccaagtga ggtagctttt actaaagcca ggtagtcttg 35340agatattaaa
gccaccatat ccatataaaa gtattggagt attactgtct atttttaagg 35400actttttgta
actaagatgg atgggtattt gtgttccatc tttggatgga aagaatttaa 35460agtcagatgt
aaataaagtt gagtcgtgtc ctttaagatc ttctttccaa aaaagttctt 35520gtgacatatc
agttagatta attttgtata tttctcgagg agttacaaaa tttgtaaatg 35580aaaaataaga
tacctcatct tcaatttcac caccaaaacc acccattgtt ccttttcttt 35640ctgttgctag
cttattttta tatgctcctt ttaaatcaaa gaagtgaacc tcagtaaaag 35700tatcaacgag
ataagaaact acaatagaat tatttataaa gctaacacta ctaatagaat 35760tagtactttc
accaacgact tcattccaaa caaatgagcc attttttatt gtaagtgata 35820ctacttttcc
gttcgcagca ttttcagttg agtaaaacca aaaagtatca ttcttgcttt 35880ctaaaaagct
ataagcgcct attagctcat ctattaaagg aataaaaggc tgatcagggc 35940ttagctgaac
atagagtcta tttctttcat ctgtgccttc accaatagat agaaacttaa 36000ttttagaatc
ttttacaaca cttattcccc aactccatct tggcttttca ggattctcat 36060aaacaatcac
atcctcatct tgagcagtac caattttatg aaacattaac tttggagctg 36120tattaatatc
ttttaaaagc tcttccgatg gctcgtcgta tttttggtaa taaaatccag 36180aatcatcatt
ttcccatgaa gcaccagaaa atttagccca ttcaatccta tcatcaagag 36240tttttcctga
ttcaatatca agtactttcc aggttctcca atcagaccca ccatctgata 36300ttgagaaagc
tagaagagat gcatcattac taacgctggt acttgcaaga gagatagttc 36360catcttctga
gaactgattt ggatcaagta aaactctatc ttggcattct tcacagtcct 36420taatcatcag
cttgctttgt tgccatgacc catcattgaa ataataaaaa gtttttttgt 36480taacctgata
aggcatgctt atcgaatctg tatcccaaac ctcatctaaa ttcttagcaa 36540tagatttttt
atatttattt tggcctataa atttttgtgt gaaattattt tgtctctcga 36600cccagtctgt
tgagtcctca cttgtaaaat cctccatcca tctataagca tcttcgatta 36660ggtatccatg
gacttcttca ctaaaaggaa ctttatttga ttcagggtat tcaaaattat 36720ctgtttgatt
tgagcaacta actagaatta atagagctgc catgtatatg tatttattgc 36780gcataaaaag
cataatactt tacaattacc ttatgaatca attcaatcga gcatggcaac 36840ttttacgcaa
aaaattggaa gttttttatt gtacttgccg cacccatcat gatgcttgaa 36900atagcaatgg
catcaatggt aaccccaata caaaacgtta ctcagcctga ggatattcta 36960gaatttttca
atgagaatat agcctttcta ggctctgtta gtctcttagg cgtagtttta 37020agtatggctt
ttatgggagc gctttttgtt tcgtatgcat ccatagaatc agaaaatgag 37080attgagcctc
taaatgcctt atttttaggc ataagaaaat tctttccact tctgggggct 37140tatcttatag
cttcagttgg tgtctttttt ggtatcttat tattaatact cccagctttt 37200tatgtagcag
caaggctttg catttttcct gcatttatta tgcttgaaga caaaggagct 37260atagaatctc
ttaaattatc ttgggaaaag acggatgagc atggcaccac tttgtttggg 37320cttaccatta
ccttcttttc tttaacaatg atttttgcat cagttgccca atccattata 37380agtccgggat
taatgcaatt agttgttctt gcaattattg aatatgtaat agtaattcca 37440tggggctatg
tatattttag tttatacaag tcattaaaaa gatattaggc agattaacta 37500gaataaatta
atacgaagct tttcgtttac aaaaaaacac accactctta taatccttcc 37560tccgaagatt
tgcactttgt taattcatgc aaacaacttt gggtctgtag ctcagcttgg 37620ttagagcgca
cccctgataa gggtgaggtc ggtggttcga gtccacccag acccaccatt 37680ctttgaacat
taacttttat ttcctttata cttattctta actaaataag gatagtatta 37740tggctaaagg
tttagacaag caaaaaaacg acaagaagaa aggcaaaact ctaaaagaaa 37800aaagagcggc
aaaaaaagaa aaaaagaaat agttttttgt agaagactat tattatctag 37860tcgacctagg
gccatcttta aaatctattt taaagaacaa cattattccg gtcactgagc 37920ttactaaagc
taaaatagaa aatattttta aaagaaggtt gtcgatatta tctctttctt 37980tccaatccat
aatatgaaac ccccacatca gatcccatat tttccattta ttagatctaa 38040tggccacaac
ctctgctgag taaacattaa tgtaaacatt taattcagta cctttaacat 38100ttttactttt
tactctataa attggcaatg atctgcctct atattcagat cctgattttt 38160catttacaac
ttcttcggta gcaaatggca aaagagttgt ttggcttgaa actgaatcta 38220tagcatcttc
catagaaatc tttgttaaag gctgcccaag catattcaga tacttagttg 38280agccctttgt
tgtgatgata acaatctctt gacctaatct ttttttgaat ttaacttctt 38340tagctttctc
tatttcaaaa tttaacttac ttaaatcaaa agatgtttca acatgattaa 38400ggtattgctc
gcctcttacg agctcaatct tattaaaagc aaagtatatg cctgagatag 38460tccaaagtaa
taactgaaga gagataaaaa aacttaggta cttatgaatt ttcctaacta 38520aaaagttcat
cttttcataa tcttcattat ttcatcaatt tttacatcaa attcgtcatc 38580accttttaaa
gattccttaa tgcattcacg caaatgggtt tttagaattt tgccctcaac 38640agtaataatt
gaattcttaa gtgcttttat ttgattaaga atatcaacgc agtattgacc 38700ttcctcaacc
attccttgaa cgcctcgcac ctggccctct attcttttta agcttattaa 38760ttgctcttta
tgacatggat gtttcatttt ataaaaacct cggaactata tacgtaaaaa 38820ttgtagcgaa
taaaaaaatt gctacagatg catagtacag atacctagat ttttttctag 38880tttctgaaca
tgctctgcct agttctgggt cagcagggca aggcatatga taagttttgt 38940agttgatata
tcctgcaatg acaatcatta ctaatgcaaa tagtgttatg taaagcttat 39000attgagatag
ggtaattaag aatggaaata cagttaccag gcttgcaaaa cttgccccag 39060caccaagagc
aacaaaaata gctggcaagg cacaacatat caaagtagaa gaagatgcaa 39120ataatgaaaa
aaaattcgac gccttatcat tcatagttat taaatttcta ggatttgaag 39180atcagacata
ttttggccat tagaaagaat cttcttttcg atctcagaga aggtgatttc 39240ttttgcctta
gaaaaggcaa tcactacttt tccattatca agatcaacat ctattctttt 39300gacatcttta
tcttttagaa aagtttttga aatacctctt gcacagaaat cacaaaccat 39360tcccttcacg
cttacgatcg ctatatttac atttgagagt ccttcaacaa atgcatcaaa 39420cctctcaggg
tttacatcaa gatcttttcc atcaaccatg gtttcatgta aatgaccttc 39480atgagaatgc
ccatgcatac cactatggtc attctccgcc gcaataaggc cgcttgaaag 39540caaagtaatt
aataataatt ttttcatatt tttctccttt taaaatctgt acataaaatg 39600taaatcccag
ctcgtatctg cgtcatagcc aaactcaaca agggcattac cataaaaaaa 39660tttcaattct
ggatacgtag atgatttatt tgttacagta ttttttttgg ttttaaccat 39720aacccatgta
tgtaaatcat tgtagtcagc aacgtaagga gcaaatccaa gttgaatata 39780ttcttcttta
tagtctttgc taaatttact gttaatatcc tttaacccaa agcccgcata 39840ccacttccgg
gtttcccagt ctccatggat accataaaaa ttatttttaa tacttccagg 39900cacaattcct
gattgaaaat ataaattcct ttgggatgta aaagtatttt ttctattcac 39960caaataagta
aaacgaaaat agttttctgt gttcttataa aatttatttt tagccctttc 40020taggccaatg
gaatatttgt aggttggaga gtagtggaaa taaatagaat cattaaaggt 40080atctgattta
tacattatgg ttgttccgcc agagtaggat atcggtcttg catcagcatt 40140aaagaccatt
aaaaaaacta aaatgattat tttatatata cccatggggg gtatcttacc 40200ataggattat
aaattaattt caggcatctc tccaagcgcc cattttcaat gaaatcacca 40260tcagaattcc
aaaaataacc atagctatag cagatgcata aaaaatacta actagcccat 40320atgagagcca
gtaaacagct accaatgcta taacagaaaa tactataagc ctgatggccg 40380tgattataat
tggccatttc attgcattgg ctccctgaga tgcaaaatac agggaaagcc 40440catatccttg
aaatacataa catacaccaa ggatttgaat atactgcttt gtaactaata 40500aggtctctgg
atccgatgta aaaatactaa tccaaagatt tggtgttagt gctaaagcta 40560gaccgataac
agccgaaagc aatccagctg ttgttgcacc aaccatgcct attttttctg 40620ctcgctctat
atttttggca cctatatttg ttccaaccat ggctgtcata gcagtcccaa 40680tgccgaacac
aattgggatt aataagaact ctacccttga accaattcca taaccggcaa 40740tagcagacgt
tccaaattga ccaatgagtc ctgtcagtaa aagaactgtt gcaactgtca 40800ttattggcga
aagagatgcg ggtaaggcaa cagaaaaaat atcttcgaat aattctcttt 40860caattgttag
tcttttgagc ttaagcctta caggcgatga agggctcgaa atttttatta 40920aagttactaa
tgccataaaa cctgaagtaa ctatcattga ccaagcagat cccacaagcc 40980caagtttagg
tagaccgaat gatcctaata taaatccagc agagaaaaaa acttgtatac 41040ctgcacaaat
taccgttagg actgcaggaa attgcatatc tcccatccct cttagggcgg 41100cggttaagct
acctgatagc caaacaacta ttgctcccaa taaatagaca aaacaataag 41160ctaaagactc
ctctaataat gcacctgttc cacctaatat tcttaataga ggctctccaa 41220aaataaaaaa
gataattaaa aaagctaaag ctccaaagca agatatatac aaagaatgcc 41280ataataattt
ttcagcccta ggcttgtctg cggctccaag acttcttgca atagaagatg 41340tgaccgctcc
tccaagagca ccaaatgcca tctgctgagt cagcatgatt gcaggaaatg 41400ccaaagtaac
agcagctaat ggagttatgc caagttgact aataaaccag aattcagcaa 41460gaacaaccac
agcatttatt aaaaatgcaa cagtattagg tgctgacatt ttaatcagca 41520aaggaaatat
tggatctttt aaaaattgct ctgttctctt gtccataacc ttctcattta 41580ttaatattta
tttggttaaa gggtgaaaga gaacacgttc tcaggattca cgaatcttta 41640aataataata
acagctttta gttaatttaa tttttaaaaa caaccaaatc tattatctgc 41700tagattttat
tttttaaaca tattaatatc taatttaatt gcaaaggggt ggccatttcg 41760gcctgagatc
cttggaaact ctttgaacct gatccataca ataatggcgg aggaattgca 41820tgaaaatcat
taaaaatcat ttattttatt caataatctt actatatttt tttagtctag 41880agacgagctc
tcaagctata gaagaagtta ttatcaaagg agactggaga gaaactagtc 41940tgtcagcaga
agactcaagc attgcagtcc ttgatagtaa attaatagaa tcccaggctt 42000taaagcactt
tgaaaacctt tcgtacctag taccaaattt aaattttgct gcaagtgatt 42060ctagagcaag
acatttccag ataagaggaa ttggagaaag atctggctat gaaagaactc 42120caaactctgc
agtaggttta ctgattgatg atatagactt ttctgggcaa ggtggaattg 42180ccacaacttt
tgatgttgat caaattgaag ttcacagagg cccccaagga gcaagaatag 42240ggtctagcgc
aatggcgggg ctcatatata tttctacaaa agacccaaca gaaagctttg 42300aagggaaggg
tgaaatagtt atggggtcat atggaacttt taataccgga attgctgttg 42360gcggtcctgt
aaattttaat aaagacctca cttataggtt agctataaag aaagattatt 42420ctgacggatt
taggaagaat atttttctca acaagtctga tacttctaag aaagatgaga 42480gcacatttag
attaaaagtg aattgggtaa ctgataacca aaccacttat aagtttctta 42540tatctcaaat
agagttagat gatcctgcag atatatggac tatagatggg agtcttaata 42600cattatctga
taagcctgga atggactctc aaaaaagtaa tgcctatggg gtaaaaattt 42660atcatcaatt
taaaaaattt gaatttcaaa gcctatcaag cttaacgaat actgatgtca 42720ttcttagtta
tgatgctgat tggggcaacc ctgagtcgca ttcacccttt atttatgact 42780atttttcaga
aactacaaga aaaagagata cttttagtca agaatttaga cttgtatccc 42840aatttgcaga
taaaaataca gaaaaaagca tcgaatgggt cgttggggct agttttgtag 42900atataaatga
aacaaatgct aaaaaagata ctggtattta tggagatcca tcagacccat 42960atggtcctta
ttttagtaac tcttcttctt tgagtgactt ttcttcttca agctattctt 43020tatttggaaa
tattgattat ttaattaatg aaacaataaa aatttcaatt ggtggaagat 43080gggagaattt
taaatctaat tattttgatt cttatgacga atcattttca ccatcaaata 43140aaatgtctgg
tggtaagttg tcactagtta aaactcttaa taataattct aatatttatt 43200ttaatattgc
taagggctat aaccagggtg gatttaattt aggtcttggt cttgataaaa 43260attcatcaaa
tagaaattta tattatgatc cagaattttt aactaattac gaagttggaa 43320ttaatagtaa
atttttccag tcaaaattaa atcttggagc agtcctgttt tattctgatc 43380gaaaagatca
acaggtctta atttcaaccc aggttgatcc ttcagatccc aacacttttt 43440tatacttaac
ccaaaatgct gcagaaggaa ttaataatgg cttagagtta aatatagatt 43500atgcactgaa
taaatctcta ggtatatttg ctaattttgg attgctaaat acagaaataa 43560aaaattggat
ttcaagacca gatatagaag gtagagaaca ggcgcatgct ccaaaaaata 43620gtttttcaat
aggcatcaat tggaagccaa caaaccaatc ttatttatca ttgaatgttg 43680ttggtaaaag
tgagttctat tactctgatt cccataacaa tacttctgag tcatacaact 43740taacaaacat
taattatgga tatgaacatg gacaatggac ttattcatta tgggcaagaa 43800atatttttga
taagtactat tcagtaagag gtttttactt tggtaacgag gcacctgatt 43860ttatagatac
gctctacaga agacatggag atccaaggca tataggagtc atggtccaat 43920atgatttcta
actttataag tgaattttgg atggaaatag cagctgttgt atttgctatt 43980atttatttat
tacttgctgt aaaacaagac gtaaagtgtt ggtttgcggc tataattagc 44040tcaatattat
atttctttat tatgtatgac gcaggcttat atatggaagc ttacctgcaa 44100attttttata
tcatgatggc cttttatggg cttcaacagt ggagaagtgt cgatactgat 44160gctcctcaat
ttattgttag aacatgggat aaaaggatgc atattaagat cattacatta 44220atagttatca
tgactttaat ctctggattt ttattagaaa aatataccaa tgctatcttg 44280cctttcatag
atggactaac aacatgggga gcaatagttg cgacatatat ggttgccaaa 44340agacttcttg
aaaattggat atattggttt gttatagatt ttatttctat ctttttattc 44400atgtccagag
gacttttatt aacctcagga ttattcttta tttatcttgt aataatatat 44460tttggttaca
tgtcatgggt aaaaataaga gacgatatta gtgcagaatc ttcataacaa 44520tctagatata
aaaaatcata atctagagat tattaaaaca attaaatcag gccctgtatc 44580tgaaatatct
atttgtaatt ttgataacat caaagcaatc ttaagagttg atcatccatg 44640cgcacacaaa
ataaatgtag atcgagaaaa tgaaattttt atactcagcc aactaaaaat 44700tttagatttt
agtccagagg ttttatttag tgatttgtct tatggaattt tagtatggag 44760atatatcgag
ggtattgaat tttcacttgg taaggattcc aatgaagttt ttttaaaaac 44820acttggaacc
gaattaaaaa aaattcatga tattgatctc cccaaaagta agaaaaaata 44880ttttagcaat
gacataaatt tttacagaaa tttactaaag gaggttcctg aaaatataat 44940tcttcataga
ggatttgatt tatacgacaa actcaataac tctgataatt atgttctctc 45000tcataatgac
ttgaataaga caaaccttct ttggagggat aggttatttt ttttagactg 45060ggagtactcg
agttttaaca atcctttttt tgatattgcc tcgttatcga atgcttataa 45120cttatcaaaa
gttgataggg caattttatg gaaagcttat acaaataatg aatattcagt 45180attaaatgat
acaaatctta gagaatggat gcatttttgt cattatttag agtacatgtg 45240gagtatttcg
ctcatacaaa atggaaaaat tgatcagaac accctaaatt taaaaaaatt 45300agagaaaaaa
ttaaaaaata ttatttaaac aacaagtatt gcgcattctt tagttgttgt 45360tttactatta
tgtaagaaat ttacatttaa atgggggttt gatatgaaaa ttttatgtgt 45420cttatatgat
gatccaaaaa caggtatgcc agaaaggtat gcaagagatg atttaccaaa 45480gttagataag
tatcctgatg gaatgacact tccatcccca aaatctatag attttactcc 45540tggtgagtta
cttggttgtg tatctggaga actagggctt cgaaagtttc ttgaagatgc 45600tggccataca
ctagttgtta cttctgataa ggatggagat ggatgtgagg ctgataaaga 45660attagtagat
gctgatattg ttatatcaca acctttcttc ccatattatt taacaagaga 45720caagatgaaa
acagcgccta atttaaaaat ggcaattaca gctggcattg ggtctgatca 45780tgttgatctt
caggcagcca tggataatag cgtggatgtt gttgaagtta cttactgtaa 45840ttctcgttca
gtcgccgaac acattgtgat gatgatccta tcgatggttc gtgattatca 45900tacccaacat
agaattgtaa aagagggagg atggaatata gctgatgctg tacaaaggtc 45960atatgatgta
gagggcatgc atgttggtac tgtagctgca ggtagaattg gtattgatat 46020gctaagaaaa
atgaaaccct ttgatgtgca tttgcattac tttgatattc ataaactctc 46080tgatgaaata
gaagcagaac taaacctcac ctatcatgat tctgtagagt cattagttgc 46140tgtgtgtgat
gtagttaata ttagttgccc attgcatcct aaaactgagc acttattcga 46200tgatgaaatg
attagtaaaa tgaaaagagg tgcatatatc atcaatactg ctcgtggcaa 46260gatttgtgat
aaagatgcta ttgcaagagg cttagagtca ggccagctaa gtggttatgc 46320tggtgatgtt
tggttcccac aaccagctcc aaatgatcac gtatggagaa caatgcctaa 46380ccacggcatg
actcctcata cttcaggtac ttcgctatct gctcaaacaa gatatgccgc 46440tggggttaga
gaaattctag aatgttattt tgcaggcgaa ccaattagag acccatattt 46500gattgttcaa
aatggtgatc ttgcaggtat gggtgcgcac tcgtacacaa agggtacagc 46560cacagatggc
tcagaagagg ccgctaagta taaaaaataa gttttagaac ttacttagcc 46620tttccttaat
aatagagtct gcctctgcca taatgctatg cattaattct tccactgtcg 46680ggatgtcatt
aaccagtcca gcaaccattc cacatgacca ggctccaacc tccatagttc 46740cttcatgcat
aatttttgga tagactcctg caacctcatc cacaatatca gcaaaagtta 46800attcatcacc
aagagctttt tctttttcaa tcaatctctc aacagcttca ttattaagaa 46860ccctttctgt
atttgttaat gatctcatga tgagtctagt atctaactca gaagcattta 46920cgatagcctc
tttcacattt tgatgaacag gtgcatcttg agtagcaata aacctagttc 46980ccatattcat
tccctcagca cctagtgaca tggcagcaac caaacttctt ccatctgcca 47040ttccacccga
ggcaacaaat ggtatttcaa gctcatctgc tgctctaggt aagagtatga 47100aattaggaat
gtcatcttct cctgggtgtc cgccacactc aaaaccatct acagaaaccg 47160catcgcaacc
tattgcttgt gcttttaatg agtgccttac agaggtgcat ttatgaataa 47220cttttattcc
tgcttctttt aaagctggga ggtattctgc aggatttctt ccagcagtct 47280ctacaacagg
aacccctgca tcaataatta cttttatcaa accaggatag tctgggggtg 47340ttagtgatgg
taaaaatgtt aaattaacag caaatggctt attggtcatt tctttgcatc 47400tggcaatttc
atttgctaat ttctcaggcg taccctgtgt tagaccagta attgttccaa 47460gtccacccgc
atttgatact gccgctgcaa gctcggcaaa accaacatgg tgcattccac 47520cttgaatgat
tggatgttct ataccaaata attcagttat tttagttttc ataatactct 47580cctatttatc
ttccattggg gttaaaaaat cttcatattg agtttgaagt ctttgcatgc 47640cacttatcca
tctatctctg tcattacctt ttcttttaac atattctaga acctcactat 47700gaggggttac
taaaaacctc tcctcctcta tggcgttaag aacatccttt gcaacaatat 47760cagcttccat
cattccatca actcctgcta caccaggacc atttgcagtc atagcagttc 47820taactgcttg
agggcataaa caggaaacgc cgataccttt atttccatat gtaattttta 47880tccattcagc
aaaactcaca gctgcagctt ttgtaactgc atatccagcc gcacctagtt 47940gagttaaaag
accggctgct gaggatgtat tcataagata gccttcacct tgttctatca 48000tttgaggaag
cacatgtttt gcagcatgaa tatgagactg aacatttaca ccccatatca 48060tgtcccaatc
cgaagtatct gcttcaaaaa atcctggctt tccaccgata cctgcatttg 48120aacaaaatat
atcaatacca ccagaaaatt cattagcctt ttgtataaca tttataatgt 48180cgttttcttt
ggaaacatca gcacttactg caagcccatt aacactcttg gctgtttcct 48240ccgccccatt
taaattcata tcaacacata ctatagaact tgccccagat gcatagaact 48300cttcacataa
agccttacca attccactgg cagcccctgt aaccacaact cttttattat 48360taattttcat
aaattgacca cttttttaat tttttattta actttattgc ttagatagtt 48420tattacaaaa
caacaatact taatattgca aaatcgcata cccatcttta taaatttatt 48480gctattatta
gtgctagaaa atgaaatact gagatttaat atgaagaatg ttgttgttat 48540tggctcgtcc
ggggcaatag gaaaagcctt tattgatagc tatatcaaag atgatgatgt 48600tgaaaatata
ttttcatttt caagaacagg cctttccatt gaggataaaa aactccatag 48660tttttttatt
gatattgagg atgaaactag tatttgtgat gccgcagaga agatagacaa 48720gtcctcaata
gatgaaatta tcgtcgcaag tggaatactt cataataaag attttgggcc 48780agaaaaaagt
attagagatt taaatgcaga taacctttta aaggtcatta aggttaatac 48840tatcggccca
acaattgttg gaaagtattt cattccattg ctaaataaaa aagaaaaaag 48900cgtcttagca
tttttaagtg caagagtcgg cagcatttct gataataaaa caggtggttg 48960gtatgcctat
agagcgagta aaactgcact taatcaaatc attaaaagtt ttagtattga 49020attacgaaga
accaatccaa atgccattat ttttggtcta cagccaggaa cagtagatag 49080tgaattaagc
gaacctttta aaagaaatgt aaaagaaggt aatttattta ctccagaata 49140tagtgtattg
cagctaaaaa atattattga tacagcaagt ccatctgatt caggcaaact 49200aatttcttgg
gatggggaag agattcagcc atagttggtt atgaatatat tttcatatta 49260gaataaattt
ttaggggaaa aaatgagtat taaatattat gactggtcta aatttcaggc 49320caacactagg
ccaaataaag ttgccataag agagctagat aataacaaga tctatactta 49380cggagaattg
gataaaagat catcaaggct tgcatcacat ctccaaagtt caggaataaa 49440aaaaggagat
cgtattgcga tactatcgct aaattgttca gaattttttg agctagaatt 49500tgcttgcgga
aagattgggg caatagagat accattaaat tggagattaa caaaacccga 49560gctcagttat
attcttaatg atagtgagcc aaaaactcta atttatgaca atcagtttga 49620agaaatggtg
aaagagctaa aagaagaatg taatatttct gaaatcatag ctcttgatca 49680atttgaccaa
gaaagtgatt atgaaaaagt tttgagtaat gcttcaggca tttattatca 49740ggaagaagtt
gatctagaag ataacattat gattatgtat acctctggaa caaccggtca 49800ccctaagggc
gccatgatca cgcacaaaat gcagcttttt aatgttatta atttaggtat 49860ttcagcagct
gtttcccctg aatcagtcca tttagttgtc cttcctttat ttcatacagg 49920cggaatgaat
tgttattcaa atccaattct tcatgcaggt ggcgagttaa tattacttaa 49980agagtttgag
cctgggaaag ttctatcaat tatcggcagc tctgactatg gagttactca 50040tctgtttgca
gttccagccc cttatcaatt tatgatgaat catccagatt ttgaatcaac 50100aaatttatca
ggagttaagt atgctggagt tgggggcgca ccttgtgcag aggctatttt 50160gaagacttat
ataagcaagg gtgtttcgat gcagcaagga tggggtatga cagaaactag 50220tccaggtgct
actggtcttg aatcgtccga ggctgaaaga aaaataggat ctgctggaaa 50280accgcttctt
catactgagg tcaaggtggt tggagatgat gggaatgaac tgcctgctgg 50340agaagtaggc
gagatttata ttaaaggccc aaatattaca cctggctatt ggaagaaaga 50400agaggctact
agagattctt ttgaagacgg gtggttaaaa acaggtgatg ctgcttactt 50460tgatgacgag
ggttttttat acatagttga tcgatggaag gatatgtata tctcaggtgg 50520agaaaatgtt
tatccagctg aagttsaaaa tgttatctat cagttaccac aaatcgcaga 50580agttggagtt
attggtattg atagccctaa gtggggtgag actggtaaag cctttgttgc 50640tttaaagccc
gatcatgaat tgacggcaga agaagtcata gatcattgtt taaaaaatct 50700agcaaagtac
aaaattccag agaaagttga gtttattgca gctcttccaa gaaatgctac 50760aggtaaagtt
ttaaaaagaa cattaagaga tatgtaatat ttttaagcca aaaaaaaccc 50820agcctaagct
gggtttttta tttaactatt taaagttatc taatgttaag cattagaaga 50880ttctttaaca
gcaacattcc atataattaa accaaataga atcttgttaa caaagtcagc 50940aaggttatag
ataaggttta agttaagagc tgatccaccg tcacccatca ggtaacctgt 51000gaaataacct
acaggataaa tcgcccaacc aaagatgata atatacatca ttgtgttgta 51060agctgattgc
acagcaggac ttgcagtatt acatgcagat tttccttctc cagcccataa 51120ttcataaatc
atgtataccc aagctaaaca cccaataatg aatgcaggcc atgcagccat 51180gattcctgct
tcacccatgt aaccaaacac aagcataaca agagaaccaa ctagtaattt 51240cttaaataat
gatccagcaa cattagttgc agcagcaaga attaagtaga attcacatat 51300taatagagga
actgttagta accaatcaat gtatctaaat acagttggcg aatcaccagt 51360ttcaatccat
acccctctca tgtacatgta atgccagaaa gcaataccag taacaagacc 51420agatacagtt
aatgatgttt tccattttgc agaaactcta tctctttcaa caaagaaaaa 51480tacagtagat
gctaataaag cagcagtaac taaccaaaaa gaaacaccag tgtaatcact 51540agcatcaagg
tcaccaccac ctgcagcaaa tgtaggaagt gcaataacac tacctaatat 51600cagtaataat
ttcatatata actcctatat taatgacaaa gaagcataat tgcctccccc 51660ctaattaaat
atgaaattta atagtaacca atattaatag aggttacaaa taaaaagcat 51720ctattttttt
aatagaaaat atgtgtaata aaacaccata ctttaaatta atatttattt 51780tgcataagta
gacttgttta gtatcatatt taacatgaaa gttgcaattt atcctggttc 51840ctttgacccc
atcacaaatg gtcacacgga tattattgat aggggttgcg gactctttga 51900caaggttgtt
gttgcaatag ctaagagtga atcgaaaaac cctcttttta gcctagagga 51960tagaattaat
ttagcccaat ctatttttaa aggaaatgaa aaagtagagg ttgttggttt 52020tccaagaaag
ttaacagttg atcttgcaaa agactatgga gcttgtgcaa ttataagagg 52080cctacgagca
gtttctgatt ttgaatatga atttcagtta gcaacaatga ataggtcgct 52140ggctcccaat
attgaaagca tttttttaac accaaaagaa agtctcattt atgtatcttc 52200tagcttaatt
aaagaaatat cagacttaaa aggcgatata tcgaagtttg ttcatcctat 52260agtagagcag
gcacttcgag cgagtagaca cttagctctg acaagcttca caaaaaaaag 52320tagctctttg
attgacgatc gttttacaaa tagtgccttt gcatttattg cagggctcgc 52380cttctcttcc
atatacattt aatttaaatt taaaatatcc tggactacca tcagctgagt 52440agaagtcttt
taatgtagtt cctccaacct ctattgcttg ctctagtatc ttttttccag 52500cagctactaa
tcttttacaa gcatctaaat ctaactcatt ggcatttttt agcggatgaa 52560ttttagctaa
gaaaaggctt tcagacgcat aaatattgcc gataccgact acattttttt 52620gattcattaa
atagctttta atatttgttt tagagtgact gcatcctgaa aaaaaatcct 52680ttgcattaaa
atttttagaa agaggttcag gcccaaggtt ttttattaac ttatgtttat 52740cgatatcaga
agtaagatgc attgatccaa accttctggg atcattataa ataattcttt 52800cctcatcaaa
aattaattca atatgatcat gtttgataaa gaagttttca ttattttttg 52860caattctaag
actcccagac attcctaaat gaagaataat ttttttatca ttagatagtt 52920taaaaattat
atattttgct cttctttcta agctttcgac aacttgatct tttactgagg 52980tttcaaagct
atccaccact ttccatctaa ggtttctgtt gtgtatcctt gcttctttca 53040gcaaagagcc
tttaaatttt ttaattgctc tcagggttgt ttcaacctct ggaagttcag 53100gcatttttaa
ctgagtaaat tattaatttt tactatgtca gctggagtaa ttgttcccga 53160tgcgagccct
agtcttagat tggcaagtat gtaatcatat tttgcattag caagattttt 53220ttccgcgctg
tataagtttt tttctgcctg caagagatca acaacgtttc ttgttccaac 53280tctgtagcca
acttgagtcg cttccagggc actagtggcc gaaatcactg cttgtttttg 53340agcatttaca
tttgcaacta atgttaaaac atttgaaaac tgggatctga cttcttgaat 53400aatccttctt
tctgtaaata gagtattttc atttgctctt tcatactgtg aatatgcttg 53460cttccttctt
gagttaacgg cgccgccttg aaagagtggc atacttagct gaattgcata 53520attccttctt
cctgttactg atggaactgg aataccttgg ccattgatat taaaaccttc 53580atagttaaat
tggtttgttt cagattctga ctgacttcca acaatgtcta tcttaggtaa 53640atgatttgaa
gctacacttc ttgcactgct tttcgctgct ttctttctca aatatgctgc 53700ttttaactgg
tagttatttt ccaatgctaa ttcaacccat gtctcttttg aacttggtgt 53760tggcaggtca
ataagcaaac catctcccaa ttcatttaag ctgaatattt ctctaccaat 53820cagagcattt
aaagactctc ttgcagaata aagtgatcct tctgttctaa ttcttgaggc 53880tttacttaga
tcaaatgcca attgagcctc ttgaactcca gttatggctg ataacccaac 53940atcgaatctt
tgttttgctt gatcaagttg ttttttaata gctttttctt cagatattgc 54000tgcatttaga
ttatcaatag ctctaagtac gccaaaataa agctcagcag ttcttactaa 54060aagattttgc
tgctcaaatg caaagtctgc ttcagcagca tctgtaagag atttagattg 54120cctatattga
aaccatgtat ctagtctaaa gagcggctga gtaaccctgg cagatgtaga 54180aaaagagtta
tattgctgct gcagttcttt gttttgatag tattcgttcc agttagttga 54240tccactcaag
gtaatactcg ggagaagcgc agctcttcct tgaaccttaa gctctttatc 54300tgctaaatat
gaatattccg ctgctttata tgtagggtca ttctcaagcg cttcattata 54360gatatccaaa
agactttcag atgaaatatt aaatgagata aaaagtgcta aaaatgattt 54420tgtgtaaatt
ttcataactt attttaacct attaataatg tttgcagtgc aaacatttat 54480tttttttaat
tttattgaat ttattttata tttcttagag tagaatatct ctaacaagtt 54540caataatttt
ttataataaa aaggctttaa aaattggcta aaaattcata tgacgctcag 54600gcaattgaag
tcttatctgg actagatcct gtcaaaaaaa gacctgggat gtatacggat 54660acatctaacc
caaatcattt aattcaagaa gttcttgata attcggttga tgaagctctt 54720tcaggttatt
gttcaaacat aaaagtatct gttctaaaag atggctttat taaggtctct 54780gatgatggaa
ggggtatgcc aattgatgag cacccggaac ataaagtttc aggtgttgag 54840ctcatccttt
gtaaacttca tgctggagcg aaattctctg gagatgatta taatttctct 54900ggtggccttc
atggtgttgg agtttctgtt gtaaatgccc tatctgatga attagaggtt 54960agagtaaaaa
gagattctaa agaataccaa atcactttta ataatggaga taagtcttct 55020gaattaaagc
caattgggga agtggggctc agaaattctg ggacatcaat taaattcaaa 55080ccaaatccta
catattttga aactatagag attcagataa aacagcttaa gcatttatta 55140aaggccaaag
cagttctctg tcctggatta acgatagagt tcgttaatga aaaaaagact 55200gatgataaac
aaaagtggta ttttgaagat gggctcaaaa gctatttgat tgattcttcc 55260gagggagcag
acttggtttt gctagattca attgtatgct ctaaaaaatc tcaagctcaa 55320gagcttgaat
ttgcaatcaa ttggtcatta agacccccaa aaaataaact cgatgaaacc 55380tatgtgaatc
tcataccaac tgctcagggt ggctcacatt taaatggctt taaggctggg 55440cttttagatt
cattaaaaga attttgtgaa tacagaaatc tattgcctaa aggtttaaaa 55500attaatgcag
atgatgttct taataatgca atttttataa tttcatctaa gcttcagaat 55560cctcaatttg
cagggcaaac caaggaaaga ctagattcaa aagatcacat gtcgttcgtc 55620tcaagtacca
caaaagacat tttaagtatt tggcttaaca ctcatacaga agagggcgaa 55680agaatagcag
aacttgcaat tatgtctgct cagacgagag caaaagtttc caatatagtt 55740gaaagaaaga
aaacttttag aggcccagcc ttacctggaa aactttcgga ctgtaatagt 55800caggacttaa
atgaaacaga gcttttttta gttgaggggg actcagccgg agggtccgca 55860aaacaagcaa
gagaaagatc tttccaggca atcatgcctt tgagagggaa gattttaaat 55920acttgggact
tagaaagtgc agaaataata aaatctcagg agataaaaaa cctatcaact 55980gcaattgggg
ttctgccagg aaataatgac ctttcatcac taagatacgg aaaaatttgt 56040attcttgcag
atgctgattc agatggtctg catattgcaa ctttactttg tgcattgttt 56100ctaaggcatt
ataaatcttt agttcaagag ggaaggatat atatttcaat gcctcctcta 56160tatagaatcg
attctggtaa agatgttcta tatgcacttg atgataaaca gcgagatgaa 56220atagttactg
aatttaaaaa gaagaagggc aagcctaaag taaacattca aaggtttaaa 56280ggacttggtg
aaatgaatcc acctcaacta agagagactg tgatggaccc tgctactcgt 56340cagcttgttc
agctttctat cagctcaagc gataatgcaa attctatgat ggacttactt 56400ttgtccaaaa
agaacgcacc agcaagaaaa gaatggcttg aaaagaaagg gtctctagca 56460aaaatataaa
tatgaaagaa caaataacct caattagcct caagcaatat gctgaagagt 56520cttatcttaa
ctatgcaatg tatgtcattt tagatagagc tttgcctaat attggagatg 56580gccttaagcc
tgttcaaaga agaatactct atgcaatgtc agagcttggg cttgatgctg 56640gctcaaagta
caaaaaatca gcaagaactg ttggagatgt tataggaaaa tttcatcccc 56700atggagacgg
cgctgcatat gaagctatgg ttttaatggc tcaaaatttc tcattcaaat 56760acccttttgt
agatggtcaa ggtaactggg gttctcagga tgatccaaaa tcttttgctg 56820caatgaggta
tacagaatct aagttaacta aatttgcaaa tcttttaatc tctgaattga 56880agtctggaac
agtcgattgg cagcctaatt ttgatggctc tcttttagag ccagtaattt 56940ttccagccaa
actcccatct attttattga atggcacttc tggaattgct gtaggaatgg 57000caacagatat
tccatctcat aatattaatg aaattattga tgccacagta catcttattg 57060ataatccaaa
atcacagttg gttgatttac tcaagataat taatggtcct gatttctcaa 57120ataattcgcc
aataattgct agcaaagatg agctgaatga aatttattcg actggaaaag 57180gcggcttcaa
agctcaagcc caatgggcgc aggataagaa tcaaatcatt atcaacgcat 57240taccttatca
agcatctggg tctaaaattt tagagcaaat agctgatcaa atgcttaaga 57300aaaaaattcc
aatggtggtt gatcttactg atgaaggaga ccacaaggag ccagtaaggc 57360ttgtcataac
tttaaaatcg aacagagtaa atgctgaaga tgtaatgaat cacctttttg 57420catcaactga
tttacaaaaa aattatagag taaatatgaa tttgatttca ttgaaaggtg 57480gaccaaaagt
tttctcccta gttgatttat tgaaagaatg gctagtcttt agaaaagaaa 57540ctgtaataag
aaaactagaa catagactcg accaggtaaa cgataggctg catatccttg 57600aggggttatt
aattgtttat ttagatttag ataaagtaat aaagattatt agagaatcag 57660atgagccaaa
aaaagatatt attacagctt ttaaactttc tgatatccag gcaaatgcaa 57720tccttgagat
taggttaagg caactagcca agctagaaca aatcaagtta gaacaagaaa 57780gagatacttt
ggtagcagaa caaacagata ttgaaaaaat actaagctct aagacaagac 57840ttaaaacatt
aattaaaaat gagcttattg aaataaaaga tgagtttgga gaagtaagam 57900aatctccaat
aagagaagca acagaagcca aagttttttc tgaagaagaa actcttgtca 57960ctgagcctat
tacagtagtc ttgtctgcag ctggctggat cagaagtgca aaaggccatg 58020agatagaccc
cagctcactt tcctatagag gagaagatgt acttcaagat tatggaagag 58080gaaagagcaa
tcaagtttca gtttttcttg actcaaatgg gaaggcttac tcacttgcaa 58140gtcactctct
tccatctgct agaggaatgg gtgatcctat tacaggaagg gtatctgcag 58200attctggagt
aaagttcatt tcttcattga ttgggaatga tgaagataaa ttcatgatta 58260tgaatactgc
tggatatggc tatatttcag agtttaaaaa tatggtttct aataagaaat 58320caggaaaagc
atttatgaaa atccctcatg aagcagacct tcttaaagcc attaaagtaa 58380gagacgatca
tttgtatata gcagcagttt caaatattgg cagactttta atttttaaga 58440ttgatgaatt
gccaactctt ggaaagggca aagggaataa aataataaat ataccaaccg 58500ctaaatttat
agcaaaagaa gagttaatga cccatgcgca acttgtttct gaggctagct 58560ctttaaggat
tgaaagcgga aagagattcc tcactttaaa actcaaagat ctagaaaact 58620atatttctac
aagagcaaaa aggggaaata tgcttccaca aggatatagg aaggtagata 58680aaatgattga
agaggttgag ttagaagtta aagaagactg attatagatt tctcaaaaat 58740ttttaatccc
tcatcaacta gctcattttc tattattagg cttggagaaa acctaacagt 58800agatgcattc
gccttaagaa tcattaaccc attatgatgg gattttttta ttaaatcatc 58860tatttgaatt
ttgctatcct tgcttacttc cactccgacc cataaaccag cagaagttat 58920tttttcaaaa
catttgtgct tctcatttat tttattcaac aaattgataa atctaacttc 58980tttctttaaa
accttattta aaaatgattt tttagaaatg gtatctataa cttcattacc 59040tatggcacat
gcaattgggc ctcctccaaa agtcgtgcca tgagaccctg ccgacatatg 59100ttttgaaact
ttatctgaag taagaatgcc tcctagtggg aagccattag atataccttt 59160agcaaaacaa
agaatatcag gagtaatgtt aaattgctca taagcaaaga gtgttccagt 59220cctgccaatc
ccggattgaa cttcatcaat tatgacaagt gccttatgtt tttttgcaag 59280tttctttatt
tttgctataa attttttatc tgcttttgtt attccagatt gccattgaac 59340cagttctaat
ataaccgctg cagtcttatc tgaaaaaacc ttttctagat tagtaatgtc 59400attatatgga
tgatttttta tccctctagg cagaggcgca aaaccgtctg taagatgctt 59460agctttagct
aaagctatac caagcatagt tcttccatga aatgacgttg aaaatgaaat 59520gacctcattt
ttatttttat taactgttga tgagcaaaat tttcttgcta tcttaactgc 59580agcctcaatt
gattctgccc ctgaattgca aaaaaatact ttatctgcaa aagaattctt 59640gcataacttt
cttgccaaag taactgaggg ctcatttatg tagagatttg ataaatgcca 59700aagctcctca
gattgctttt taagaatttt tattagatct ttatttgaat gaccaaggtt 59760agtgacggca
atgcctgctg tgaaatcaat atatttctta ttatttaggt cccatacatg 59820cgatccactc
gcttttttta caacaaaatc tgcaggagcg taaaaaggca ccatataatt 59880agtatattct
ttaattatct ctttcataaa tatattgtat cgtggtttca gttcaaaaat 59940taaatgacca
aaaatttcta gtatctttaa aaccaaacag ctcactcatt ggctttaata 60000gaataatatt
tatatctagt atatctattg tatgcggtgg catagcattg atatttttct 60060tttttggagc
aacgctcatt ttgccttttg ctggtttaga gcttggtatt ctattcactg 60120cattctattt
aagttttaaa tggagtgata aaaaagaaaa aatatttatt tctcaagatc 60180ttgtaactat
agaaaaaggc tctaattatg ctgaatataa atgggaagag ttcaggtcat 60240ttacctcttt
ccaggtttca aaagacagaa gagatcttct taagctaagc tttaggtcta 60300aaggcgagga
tgttgaggtt ggcagctttc taaatgaaga tgataaaaat gtattaatag 60360aagagctaac
tcagatcata gatacattaa atcacgattc cttctcaaag ccagagcttt 60420aatttctttt
taacttcttt tagctttaat tcttttattt cagggatgcc atttctaaaa 60480tttggaaatg
attctccctg tattaatggc ataagatatg caattgcttt tggggtaacg 60540tccattccat
ttttagctat aaaagatttt ggtaattttt tttctaaatt agctattttt 60600gccagtggag
ctggttcaat tttccaccta tattttttag ctttacctct aactattatg 60660ggcataacac
cattcatgcc ttctactgca tactgaactg ccttggcccc aacagccata 60720gcttgctcta
ggtctgtttt ggaggcaata tgtcgtgcac ttctttgtag atagtcagaa 60780acagcccaat
gattttttag ttttaattta tcagtaatta agttggcaat gtaaggcgca 60840actccaccaa
gttgagcatg accaaatgca tctattgttg ctgattcaga aagaaatctg 60900ttcttattat
ttttcaaccc ttcggatacc acaacaacac agtagccatt ttttttaaca 60960acacttttaa
cttctgctag aaattttttt tgattgaagg ttatttcagg taataaaatt 61020atatgaggtg
catctccttt ttcttttctt gcaagagaag atgcagccgc catccagcca 61080gcatgcctgc
ccataacttc taatataaaa acttttgttg atgttgctgc cattgaccga 61140acatctaatg
aggcttctag agttgatgtt gctatatatt tagccgccga cccaaatcca 61200ggacaacagt
ctgttaatac caaatcgtta tcaacagttt ttgggatggc tatacagtta 61260attggataat
ttaatttttt acttatctgc gaaactttaa atgcagtatc agctgaatca 61320ttaccgccat
tgtaaaaaaa atatcctata ttgtgcgctt taaaaacatc aataagcctt 61380ttgtactctc
tttcgcttga ctcaagatct tttaacttaa acctacatga cccaaaagcc 61440ccaccaggcc
tatatttcaa tgactctaat gcagatattg attcttttga tgtatctatt 61500agttcctcat
tcagcgcgcc caatattcca ttcttgcctg cataaatttt accgatgtct 61560ttatgcttct
tggcctctaa aatcagagcg ccagctgtag catttataac tgcggtaacg 61620ccgccggatt
gagcataaaa cgcatttttt ttcatttttt ctccatcaaa tgtactgaaa 61680taaaaatatg
taaagtatta taactaatat gcgaatacat atcttaggga tctgtggrac 61740ctttatgggc
ggccttgcta agatacttaa agagtcaggg catgaaatat ctggatcaga 61800tattcaattt
tatcctccca tgtcagacta tcttgatagc tttgatattg aaatgatcaa 61860gggctatgac
ataaaaagca tgccagatgc tgacttgtat gtgattggta atgcactttc 61920tagaggaaat
gaatccgttg agcatatttt gtccaattct cttccgttta agtcaggccc 61980tgaaatgctt
ggagaaattt taaaaaataa aaaggtttta gctatatctg gaacacatgg 62040taaaacaacg
acttcatata tgttgactca tattatgtta gatcagggta gggatgtcgg 62100ttttctagtt
ggtggaatat caaataatat atctggctct gcatgtcttg ggtcagatgg 62160aacttttgtg
attgaggcag atgaatatga ttctgcattt tttgataaaa gatcaaaatt 62220tattcactat
tcaccaagca ctatagtcat caataatatc gaatttgatc atgctgatat 62280ttttaataat
ttagatgaca ttaaaagaca attccatcat ctaattaaaa taattccatc 62340aaatggaaat
gttgtttatt ttgctgatga taaaaatata agagatctta tcgatatggg 62400aatttggtca
aatcagatag caatcaataa taatgctcat tcaattgagg cagtttattc 62460tgataagact
ctaaaatatg aagaaagtat ttattcatta aatgagttac ctttaatagg 62520agaacataat
tttaaaaact acatttcggc tattttggcg gcaaagacag atggaattca 62580aattcaggat
tctatcaaat cattagctag ctttgatggc gtaaaaagaa gattagaata 62640taaaggaagt
tttgatggca tagaaattta tgatgatttt gctcatcatc caaccgcaat 62700agaattttct
tctaatgccc tcgtaacaca aaatccatca aaaaaaatac ttggtctcat 62760tgagctaggc
tcaaacacta tgtctggggg ttctcatggc ttgtctttgg ttgaatctgc 62820aaaatcttta
actcatgtta tctggctaga tcgcaataat gttttgtcag agaatgctag 62880cattgaatct
actaacacta ctgaagattt tatttctgca gcgatatctg ctttctcaga 62940ttatgatatt
gttattttga tgaccaataa agacagccaa aaaatattaa aacccattgt 63000agatcacttt
gaaaaataat aatttaccag tttttccttt aggaatagtc gccctcccag 63060gtagcatcca
atctcttcaa atttttgagc ctaggtatat acagatggtt aaaacatgtc 63120tatccaagaa
ccatggattt gtaattgttt ttaatgccaa taatgagtct caaggcgatt 63180tcactttttc
taagaaggga agttttgttg aaattataga ctttaataat ttgccaaatg 63240gccttcttgg
gataactgta aaatctataa ataaggtgat aatcagtaat atatgtcaat 63300tagaagatgg
actgcatatt gctgatatta aggcacagat agatccagag gtagatgatc 63360aagctgtttt
ggcagaatat cctgagatat ctagcattct ttctcagctt gtaaagcatc 63420ctaagattag
tgacctgccc atccaggttg actttggctc tgctgattca gttgcatacc 63480acttagcagg
ccttatacct ctaagctcaa atgagaaaca aaaactatta gaagcattcg 63540atgcagcaca
gcggatgaga attctttcag actatattga aagaatatct actacataaa 63600ttatttattt
taatattatt ggcggcttat tagattttgc tctaagccaa ttgatggact 63660taaaaaccgt
agggatttct acaaaccttt tttcaatata gtatttgcct ggatagttac 63720taagcattag
cccaataata atagtgaaca aacctggcca ggtaacacca gcatcatgat 63780tcccccaata
ataagggaat aacctaaaat atttttaatc agtaaaacta aataccacaa 63840caacgggtta
ttggtcttaa atttcgagac atctttcttt ataaaatagt cacttggaat 63900taatccagca
agccacctca tactaacgag actaaataaa aaaataaata aagatattga 63960gctcagccaa
agtataagaa ctttataaga ttcaaaaaaa attatgatgg catttagggt 64020attaaattcc
ataattaccc acctttttat ttaataatga tttagtatag caatgtctaa 64080aaataaattt
aaacattaaa taaatataca ataattttgc caaaaaaagc cttaaatgat 64140atttgcactg
aaaccctcag tgataataaa gccgaaaatg ttctatcact tgatatcaaa 64200ggtatttctt
cttttgctga caacatcatt attgcaactg caaattcgaa taggcatgca 64260aagtctttat
ccgaaaagtt agttgaggag ataaaagcta ataaaattag tatcatgggt 64320gtagagggca
agacagaatc aggttggata ttagtagact gtggtgaggt tgtagtaaat 64380attatgaaga
atgacataag agagttttat gatttagaag gtctttgggg tgaaaacacc 64440ctcatcgatt
cttcgaaata aatgctatta aatatcataa gtgttggaaa ctcaccttca 64500tcttgggagt
taacaggaat agaatattac acaaagcaaa tccctaaaga agtaagtcta 64560aattttatta
acgtaaaagg gcagcaacac ccaaaaagat ctacagaaga ggtcttaaaa 64620cttgaatcta
agttaatctc gtccaaaata gattctaatg gatatattgt ttgctgggac 64680tcaagtggcg
agtcgttgaa taattttgaa tttagtaaat tttttgaaaa atccatgctg 64740gaaaatatga
agctttactt tgtaattggt ggttcatttg gcatacctcg agatattctt 64800gataaatcta
ataagataat ctcgatctca agtctcaccc tacctcacag gcttttcaag 64860attgttctta
tagagcaaat ctataggtca ttttcaattc tcaaaaatct tccctaccac 64920aaatgattga
tctgaatgag aggtttgttg aaaaaagaag ctttttcaat agattgttgc 64980ttatatattt
tttctttggg gccctgtttt tattcttgct tttcaagacc tattctttac 65040aggtttctag
ctattctgac tatgaactag cagcactaaa gaataaaaca aaagaagtgc 65100tagttcagcc
tgttcgagga gtcatctatg acaggaatgg aaatattcta gttaataatg 65160tccctacata
tgacctaata atccaaccat caaaaattaa aaacttagat gatttcatta 65220ttaatatttc
taaaataata gagctatcag attcagaaat tgaaaacatt attaaaaatt 65280ttaaaaggag
tgcaacttat aatagagagc taacaattaa aaaagatctc tcgaaagaag 65340aaatagcaaa
atttgaagta agaagctacc aatttcagaa tgcatttata gatgttaggt 65400atagtagaga
aaataaatat ccttacttat tttcacacgc actcggttat gtaggtggtg 65460taagtaatga
taaggtttta tcaattttaa aaaatcaaga tttgaagcaa tctgaaacaa 65520cttttaagta
ttcaggtggt tttattgctg gaaaaacagg attggaaaat atttatgatt 65580cagctttaag
agggtccttt ggaaaaaagt tatttgaagt tgatgcaagg ggaaggcttt 65640taaaagagct
aagttttgaa aaacctatta atggaaaaag cctttttact catttagatt 65700taaattcaca
aaaaaaagcc tttgaacaaa tgaataatag aaggggtgct gttgttgcct 65760tagagcttaa
atctggttct atagttacct acctaagcac tccaagcttt aatgtaaatg 65820gtctttctaa
tggcatgtcg tcggtagagt tcgaaaaact gattaatgat gtagacaagc 65880cattttttga
tagggccggt caaggtcggt actctccagc ttctacaata aagccagcaa 65940tagcattgtt
tggtattaaa gaaaaaatag tagactggaa ctttacactt aaagatcctg 66000gattttttgt
attaccagag gatcagagga tttatagggg atggaagaaa ggaggtcatg 66060gaacaattga
tatgaagggt gcaatcatag aaagttctaa tacttttttc ttttctcttg 66120cttataaggc
tgatattaat aatttaatca gccatctttc tgagtttggt tttggcagga 66180atgtttgtaa
agattgtttt aatccagatt ctggattatt gcctacgcct gaatggaaaa 66240tgaataatct
taattttggc tggtttaaag gagataccgt taatttaggt gttggtcagg 66300gctatatgag
tgccactcca gttcaattag catattactc tgcatttctg gcaaaaaaag 66360gaaatcttca
agagctatca tttgtcgaga gtgacagcct aagcaatact gcttttataa 66420ataatttaag
catagataac tcagactggg atcaaattca ctcaagcatg attggagtta 66480ttgaagatcc
gagaggcact gctaaaagat taaaaccttt aaaatcatat gttgttgctg 66540caaaatctgg
aacggttgag cttgtcagca cacaaacaaa ggaagattac aaactagtaa 66600ggcaaaatat
tggtaacaga gaccatgcaa tcattgtggc ttttggtcca atgcctgatc 66660cagagtatgc
agtaagtgtt gttattgaaa atggtgaaag cggcggttct gttgcaggtc 66720ctgttgctat
tgctgtttta aatgagctta tcaataaatg aagaaaaaat tagactttaa 66780aaactttagc
atttattttg atcaatattt atttattgcg ataaccctgt tgtcggtcat 66840gggtttattt
tttttataca gcgcatcaca agaagatatc agcactgttg ctaagcaagc 66900tgtattcgtt
ggttttggtt tgctgttaat gtttgtagtt agccaacctg accctgattt 66960ttataataca
ttttctgggt tattttttgg ggggggastt gtattgattt ttctaactat 67020gatttttggt
aaagaaataa atggagccaa aagatggctt gatttaggat tttttaccct 67080gcaatcctct
gagattatta aaatatcatt gccaattttt ttatcatcat atttatataa 67140taagccactt
ccaataagca ctaggcatac ttttattaca ttgatattaa taggttttat 67200atttgcactt
gtagctagac aacctgactt gggaaccagt ttagtagtat ttatgtcagg 67260gggttatgta
ttatttttag caggattaag ccgccgttta ttggatctgc aatagcctta 67320tttttattat
ccttacctbt tttatggaat aactttctag agccatttca acaacagaga 67380gttttaactt
atttgaaccc agacgcagac ccttatggta ctgcatggaa tataactcaa 67440tcaaaaatag
caattggatc aggcggaatt aatggaaaag gttatcagga gggctcccaa 67500gcccatcttg
atdttttacc agagacagaa acagatttta tttttgctgt tattgctgaa 67560gagtttggtt
ttattggagt ctgtattttg ttatcagtat ttdtctttat atkactcaga 67620tgtttatatt
tagcatttaa tgcaagagat agattttgca ggttaactat aggaggccta 67680agtttagttt
ttgcctctac attatttatt aatttagcaa tggttgttgg tgtagttcct 67740gtagttggta
tgcctcttcc atttatcagc aaaggcggct catctttgct atcctkttat 67800atagcttttg
ggattataat ttctatggca acacataaaa aattaatgca aagatgaaaa 67860aaattatatt
tataacttta atattcacta tttcaattac tgcggattat tcgaatcatg 67920aagatagcca
aatggtaata aatgaactcg tcacaaaaca cggttttgag gaatcttatg 67980ttactgcaat
cttaaaaaat gcaaaaaagc gtgatgagat gcttaaatct gttgctaatc 68040ctgctgaaaa
aacaaaaaca tgggatgaat atagggctat ctttataaaa acaaaaagag 68100tttcagaagg
caaaaaattt ataaaaaaaa atattaatgc tttagagagg gctgagaaag 68160aatttggagt
tcctaaagaa ataattactg ctattttagg cgtagaaact aattatggca 68220gcaataaagg
tggatacaga gttttagata gtcttactac tttaggtttt gatgacccgc 68280gaagatctaa
tttttttaga agagaactta tagagttctt ccttttaaca agagaaaata 68340atttagatat
caaaacaaca aaaggctcat atgcgggagc catgggatat gcacaattta 68400tttcgtcaag
ctaccgagcc tatgctgtag attttgatga agatggttac gttgatttat 68460ttaattctgt
tgatgatgca attgggagta ttgcaaatta tctttatgtt catggatgga 68520agagagaagg
aaagatcgta acaaaaactt atccaaacaa tgttagaaaa ttttataaac 68580ctcatgagtc
tctaacaagg ttcatacctt taatctttaa tgaagatgga aaagatcttt 68640tttttattgg
tgatgataac tttagggcta ttgctaagta taatattagc gatgtctatg 68700caatggcggt
ttattactta tcagaagagt taaaaaaatg aaaaaattat tattcacatt 68760actatcaact
tcaatattta ttcaagcaca gagttttgtt ccagattctc ctgagttaga 68820tctgaagagc
tatatcctaa ttgagccaaa caccaatact gttattgcag aatttaattc 68880agatttggaa
atagaaccag ctagcatgac taaaattatg actagctatg ttgtggctga 68940tcagattgca
aatggtttaa tatctcttga tgacgaagta ctaattagtg aaaaagcatg 69000gagaatgcaa
gggtctaaga cctttataga agctggaaaa aaagttacgg tatctgatct 69060tttaaaaggt
attatgattc agtcaggcaa tgatgcctct gttgcaattg cagaatatgc 69120aggcggcact
gaaagaggct tcgttgactt aatgaactct tatgctgcct ctttggagat 69180gaataatacg
atctttcaga attcaacagg gcttccagat gaaaatcatt tttcatcagc 69240aaaagattta
gccaatctga cagctaacta cattaataaa tttccagaag aatatgcttt 69300atataagcag
aagcaattta cttttaacaa tattaagcaa ttaaatagaa ataagctttt 69360atggagagac
gactcagcag atggtgttaa aactggtcat acagaagctg ctggatactg 69420tttagttggc
tcagcaaaaa ggggcggtat gaggcttatt acagttgttg caggaagtaa 69480gtcagataat
gatcgttttt tatcttccca aagattactt gagtatggat ttagattttt 69540tacaacacag
aagatgttaa gtgcagaaaa agaatatcag aatattacag tttggggtgg 69600acaagaaaag
atacttggcg tgggagttct agaggatata tctattactc ttcctagaac 69660aagtttcaaa
aatgttgaaa ctgtttataa agttaataac aatatccagg ctcctatcgt 69720agttggtcaa
aaagtcggga ctctagaaat aattagtaat gatgagattg ttctagttac 69780ggatttggtt
gctttaaaaa atattgaagc taaaggtttt tttggaagaa tatggtcaaa 69840gttcgttctt
tggatattta gcttatttgg tctaacagat gaaaacacca cttaaaggcg 69900tttttaatgg
ggcttttgat actgtagata acataaaaat atcacctttt tctcgagctt 69960atactttttc
tgatagcgta tatgaagtag ttcctttctt taattcaact gcaattgctt 70020ttgatgatca
tataaaaaga ttagaatttt ctgctagtca actagcgatg gccgtagatt 70080taaaagaagt
tatatttgaa attaattctt taataacatc atctgaattt agtaatggct 70140atgtctatta
tcaagttact cgaggcgttg atccattaag gtctcatatt catgagccaa 70200atttaaaaat
agaaaccttt ggctatgcaa aggcgcactt attccaatgg aaaccattaa 70260gagtatcagt
atgtgatgat attaggtggg ggaggtgtga tattaaatca acatcattgc 70320ttggtaatgt
tatgaacatg aatgcagcaa aactagataa ttgtgatgaa gttattatgc 70380acaaagataa
tttgttgacc gagggtggcg catctaatct tttttttgta aaaaatgatt 70440caatctgcac
cccagctcta aatggaaata tacttcccgg cataaccaga gcattactaa 70500ttaatgaatt
aaaaaattat agtatagagg tcatcgaaga taacttcagg ctagaagatt 70560tatcaaatgc
ttcttgcgcg tggctcacaa gcgcaacgaa agggctggcc cccatttctg 70620aaattagtaa
cctagaatct cacttggatt tataccatcc tcttttcaaa aagagtgaag 70680aaatttttaa
taagaagttc cttagttaga agtcagttta ataactatat tatcaagctc 70740gtaccaaaaa
tctttatcac taagaccttt atttgcaaga tcaagctcaa atacattttt 70800ttggagctgt
attaagtttt gtagtttatg atttttaata aaatgcatat attcagaagt 70860tttatttttc
caaaccccag atttaattag actatctaca ggatttttac tttgtctagc 70920attggctgca
ctgttaataa ttttaccaac tatccaaact agaagaggtg cataatgatc 70980ttccgctgat
tttattgaat ggataatttt taaagcttgc ttggtattat ttgaaataat 71040tttatcttca
agctcaaatg gcataaattc tgcagactcc acagaaggtt ttctttcttg 71100gccatcacca
tctttataag tcagtcttag aagctttact tcattctgtt gggcaactaa 71160gtttccagag
ttcatttcag atatattttg cactaaagag ggcctatctt tttcagaaat 71220aaaagatagt
tgatgcttaa gccaaatctt ttcttcaaaa gatttaagtt ttccacaatc 71280aatgatgagg
gctatctcat ccatctgttt tacccactta gttgtgaaat taagcttttc 71340attgcttgaa
ttaataatta ttgcaatatt atccatatgg ttaatatttt caatttgaga 71400tattttaatt
atttggtcag gtatttttcc ttgatcatga ttaatctcaa ctattatttt 71460tgagccaaat
aaagatccag aagcattttc aataatagtc tgatctattt tatcaaaccc 71520atctttatta
ataattgttt tttctttgaa ccctttattt gataaatgtt tgagcaaaag 71580atctttagag
ttatttttta agacaacctc agatccaaaa ataaagaaga tattttgtga 71640ctgatctaaa
tatttttgag ttgtcattgc ctcacatttc aaaagcatgc acctcaaaaa 71700ttatttgatc
aattaattcc ttctgcattt cagactcaag agatttaatc atttgatctt 71760gtgcaaatgg
attaagttca ttcgatttat atctcttgta gctaacaagc tttttactta 71820tttgtttatt
tccaatcatt tttatatgta tttctaaatt aatttctcct tcaagagctc 71880ttagagaaga
gcctccatag atatcatatt tatttaaaat atagtcttgt attagaatct 71940gattttgacg
attctcagct tctttttctg tattgaagag gattgctaac ttttgctcaa 72000aagaattagg
gacactacta tcaaaccgaa aattaaatag atctttttta tctagatcta 72060tatttagttg
attaaattgg cagccagtta aagatatcaa caagcaagtg tataaaaaga 72120aaggtcttag
tttcataacc tcatagtatc tatctgaaga atatatatca aagattaatt 72180ttaaataaca
aaatttatta ttttttcttt aatataaata gtttttttaa tagccgagtt 72240gtttattgtt
gatgccacat tgtcgatagc cagtgctaaa gcttctatat ctttttgctc 72300aaggtttttg
tctatcatta ccttacctct cacctttcca tttacttgaa ctattaaatt 72360aaattcagaa
acttctaata actcctctct aagaacaggc caagaagatt caatttcttc 72420ttgtgcaaag
tcaaagtaaa aattattcca taaatgttgt gagatatgag gcgcaatagg 72480attgagagtt
ttaagaataa ttattattgc ttcattaaga caatattgat ttgagatcga 72540cgcattacct
tctttaaaag aatcaggtat aaagttaact agctccatta tcgaagcaat 72600ggctgtatta
aatgaatatc ttgtttcaaa gtcatttgta acctttttaa gagtattatg 72660cgattttctt
cggagctcta cttcttgttt cgaaggatcc ttgggctcct ctaagtcaat 72720aaattttcta
ttgctgacta agttccatat ttttttcata aatcttgagg caccttctac 72780agatgactct
gaccattcta gactttgctc agggggagct gtaaacatca tgtaaagcct 72840tacagtgtca
gcgccatact tttctatata agattgagga tctacagtat ttccttttga 72900tttggacatt
ttagccccat ctttaagaac catgccttgt gtaagaagct ttttaaaagg 72960ctcattgcct
tcaactaaat ccatatctct aagcgcctta tgaaaaaatc ttgaatatag 73020taagtgcagt
atcgcatgct caatacctcc tatatataaa tcaacaggca gccaatattt 73080tgaattttta
tcaaacattt catctgcatt gtcagatgaa gtaaatcttg cgtaatacca 73140tgatgagtcc
ataaatgtat caaaagtatc tgtctctctt tttagtcgat cagaaatatt 73200ataaaaatct
tcattttgac ttaagggaat aggtgccgag ttctttttta gctctggaag 73260cactataggc
atatcctttt catcaataac tcttggttca ccattttcat aaacaaccgg 73320aatagggcaa
ccccaatatc tttgtctgct cacaccccag tctcttaatc tgaactggat 73380taactgctca
ccaaggtttg cgtctgctaa atctttaata atttcaagag atgcctcatc 73440agagtccatg
ccatcatatt tatcagagtt tattaatttg cctttttgta caactggaag 73500ttcattatta
ttatcaaaac taataacttg aggaatctct agattatatt tagatgcaaa 73560ttcgaaatct
ctttgatcat gagcaggaac acccatcact actccagttc cataatcaag 73620aagtacaaaa
tttccaatcc atacgggtat cttcttcttt gtaattggat gaattacatg 73680cattccacta
ttaattccaa gtttttcagc tttagccata tcagcttcag cagctttcac 73740ttctttgcat
ttgtttagaa agtcttttat cgattcatta ttttttgaca accctattga 73800tatagagtga
tttggtgata ttgctaaaaa agaaacacca aaaattgtat caggtcgtgt 73860tgaaaatacc
tttaaagaat catcggaatc ctcaatcatg tatttaattt ctgctccctg 73920agatttccca
atccaattcc tttgcattgt tttaacattt tcaggccaat ccacctcatc 73980caaagatgtt
aaaagttcct cagcgtagtc tgtaattttt ataaaccatt gatcaatttc 74040ttttatttca
acttgcgccc cagacctcca tccttttcca tctataactt gctcatttgc 74100taaaactgtc
tcatcaactg gatcccagtt cactaatgat tttttcctat aaaccaaacc 74160tttatcataa
aactttttaa atattaactg ctcccactta tagtattccg gttcacatgt 74220tctaagctct
tttgaccaat catacccaag gcccaaagat ataagttgct gtttcatatg 74280ttcaatattt
tgattcgtcc aatcttttgg gctaacttta ttagcaatcg ctgcgttttc 74340agcaggaagc
ccaaatgcat cccagcccat aggctgaaaa acattaaagt tattcattct 74400tttatatcta
gatatgacat caccgatcgt atagtttctt acatggccca tatgtaattt 74460acctgatgga
tatggaaaca ttgataggca ataaaatttt tctctaccat caggattagc 74520tttaaatttg
tcttctttta accattcatc ttgaatggtt ttctctattt gacttgggtt 74580atattcagga
ttcatttttt taggaactca ttttctaagt aattaatagt atgtttattt 74640tctttaaagg
tatcatcatg cagaatcctt tgatgaagag ggtgatttgt tttaatgccc 74700tctataaaaa
attcatctaa agcactcagc attctcttaa tagccgagct tctagaattt 74760gcctgagtaa
tgatttttgc tagaagtgaa tcatagttag ggggaactct atatcctcca 74820taaatatgtg
agtcatatct tatgccaaaa ccgccaggtg tatgcatttt tgtaattgtt 74880cctggggatg
gttgaaaatt atcaggatct tctgcattaa ttctgcattc tagtgagtgg 74940ccatgaaaat
taatatcatc ttgatttagc tcaattggca tttcaagagc aattcttagt 75000tgtgctttta
ctaaatcaaa gccagttatc atttctgtta ccggatgttc aacctgaatt 75060cttgtattca
tttctataaa atagaattga ttatcttcgt ataaaaattc aatagtgcca 75120accccttcgt
aatttatttc ttcacataaa ttgacacaag ctattagtgt tttattaaga 75180gcttcttgat
ttatattaag tgctggagct tcctcaatta ttttttgatg tctcctctgc 75240atactgcaat
cccttgtacc taaatgtatc gcctttcctt taccatcacc aactatttga 75300acttcaatat
gtctagggtt cccaataaat ttttctagat aaatggtttc attgccgaat 75360ccatttttag
cttcctgcat tgttatttct gcatgtccaa taagatcttc ctctctttca 75420acaactctca
ttcctcttcc accgccacca gccgttgcct taatcataat tggatagccg 75480atatctctag
caattttttt aaactcatca ttatcagatg gaatctcatc tttataacct 75540ggaacgattt
gaattccaga tttttcagcc agtgtttttg ctgttatttt gtcgcccatt 75600ttttggattg
tttctgaggt tggcccaata aatttaaatc cacttttttc acacatttct 75660gcaaaattat
gatcttctgc aaggaaccca tatccaggat atatagcatt agcacttgta 75720agttctgcag
ctgagagaat ggcaggtata tttaggtagc tttgagttgg agatgcaggc 75780ccaatacaaa
cagtttcatc agagaatctt aaatgtttta gatctttatc agcttctgag 75840taaacagata
ctgtcttgat accaagctct ttacaagctc ttatagctcg aagagcaatc 75900tcaccacgat
tggcaataag aactttataa ctcattttaa ttgacagtta taattttttg 75960tccaaattca
accggacttc catcttccac gtcaatactt gaaatcttgc catcaaattc 76020tgatttaatt
tcattcatca ttttcatggc ttcaactata cagagcacat cacctacttt 76080tatattgtct
cctactttta caaatggatc tttttcggga cttgggcttc tataaaaagt 76140tcctactatt
ggagaggtaa caacatcccc tatgacagtt tgcttagcct catccaattg 76200atgagttgtt
ggtgagatag ctggctcatt tttgacaatg attggttttg aaacaaattc 76260ttgattagag
ccgttatctc ttgaaattct tactgactca tctccttgac ttacttctat 76320ttcttttaaa
tcagactctt gaagcatttc tataagtttt tttattttcc taatatccat 76380ttatttaccc
ctgtatttta atatgccctt aagcatcgcc tcttcataac ccttagcacc 76440aataccagtt
atcacttctt cagctatatc agatagatat gagacatgtc taaattcctc 76500tcttgtatat
atgtctgaaa ggtgaacctc atagaatggt atgttgacac caagaaatgc 76560atcccttata
gctatgctcg tatgagtata tgcagcagga tttattatta caaagtctat 76620tttttcttca
atagtgctat gtattgattc tattatttca tgctccgcat tactttgaaa 76680agatattaaa
ttgcattgat tttgatttgc taagactaat aaatcttgtt gaatatcttc 76740aagtgatttg
ctgccatata cttccttttc tctagttccg agaagattga ggttggggcc 76800atttataagc
aatatattca tataatagat tttaacagaa atttacagat tttttaggat 76860tttaaagata
tttttataag agttgatttt tttgcatggg atagcaaaaa cccgcttcag 76920agcatccctg
atagtatatt aaaacctccg ataagttcaa tgaagaattt atttcaatgc 76980ttataataaa
ttcatctcta agaatctcag tctcgccaaa aaattcatcc ctatatagat 77040ctttacttga
ctcaagcgtc ttaaatggga gcactttatc tttaaatttg agctctatac 77100tatccaagta
catataataa ccatctctta tttgccaagt tatatatgct gcttgttttt 77160caacattggt
tgttaaagca aatacttcat tggcttctgg gaccctattt gagttttcaa 77220aaagatttgc
tgaattcaat tcacctgcaa tcacattgga tagtagtata attattaaac 77280ttttgatcat
gaagacttaa gaataaaccc ctaagtactt aaaataaaga ttttattaaa 77340taacacggta
gttaaaaatt tatgttccag actttcgcac taaaaatttt cttttatatg 77400cctatttggc
ttttaaaaat aattttttat agaaaaagaa cagtaataag agggcatcaa 77460tttgatgctc
aatctgctgc attattgtct ttgcttccaa agaaagattt atctgaatta 77520ttagatggtg
aaattgcaaa agctagaatt actcttgaag aagcaagaat tcaaaataaa 77580gtttctttaa
caccgtctat acaagtcaga aaagtagatc atattttacc aaagcatgat 77640ttaattctca
gagaatacaa gccgcatcaa gaggatttaa aaaaagttat tctctatttt 77700catggaggag
gttatgtcct taactcagtc aatacacatg atgatatggt ttcatatatg 77760tcagataaaa
taggagtaaa gttttattct ctagactata gactatcgcc tgagagcaaa 77820tatcctgatt
ctcttgatga tgctctagat gctttttctt ggctcatcgg ccagggatat 77880ggaccaggtg
atatttcagt ttgtggagac agcgcaggcg ctcatctagc tgcttctttg 77940tgccattacc
tagctgaaca aaataaagat ttgcctagta gccagctact aatatatcca 78000atgtgtgatc
cgtcctgttc atctgagtct tataatttat tatcttcagg atatcttcta 78060actaaaaaaa
ctatgatttg gttttgggat aagcttagaa cttccgaagt aaacaataac 78120gattcggcct
ttaatctctt aaaatttaat tttgaaaaaa ctttaccgcc aacaattatc 78180gttacaggtg
gctttgatcc tctatgcgat gatggagaaa aatatgcata tttattacat 78240aaaggtaaac
ataatgtgaa acaattacat tatccaacaa tgtttcatgg gtttgcatca 78300atgactaaat
tgaaagcagc gcagatagcc gttgaagatt ttttaaaaga atataagaaa 78360atactatgag
taaaatttta gaagtaagtg acttaagcat caatttttca acaagggatg 78420gattgtttaa
tgcagtggat aatataagtt ttgatataga aaaaaatcaa accttggcct 78480tggttggtga
gtcaggttct ggtaagtcgg taactgctat gtcaattctt cagctccttc 78540aaaaaccaca
agcatcatat tccaaagagt cttctattaa gtttaatggc gatgagataa 78600taaatgccaa
gtatgaaaag ttactttcct tgagaggaaa tattatatct atgatatttc 78660aagagccgat
gacctcacta aacccttatc acagagtagg taatcagata actgaatcaa 78720tactacttca
ctcaaaaagc tcaaaaaaag atgcaataga tgaagcaaaa aaattaatgg 78780cacttgttga
gattgatgat gttgaaagac ggttctatgc ataccctcat gagctttctg 78840gagggcagcg
acaaagagtt atgattgcta tggcccttgt taataaacct gagctattga 78900ttgctgacga
gccaacaacc gctcttgatg taactatcca agcccagata ttagatctca 78960tgtctaagct
taaaaatgaa ctaggcatgt caatactttt tattactcat gatctaggcc 79020tcgttcaaga
attttctgac aatgtttgtg ttatgaagaa tggcaagata gttgaacaag 79080gaaatactgt
tgaggtattt aacaacccct ctcacgaata tacaaaaaaa cttttagatg 79140cagaacctca
gcccaaatta gataaccctg taagtgatga gccaataatt gagatcaatg 79200acttaaatgt
ttactactca ataccttcaa ctaatttttt taaaaagaat acttttcatg 79260ctgttaaaaa
tacttccttt agtatttata aaaatactac aattggcctg gttggggaat 79320ctggatctgg
aaagtcaact ttgggtaagg ctatagcaaa cttagtttct tataagggga 79380atattaaatt
tgagggaaga gatatcaact caaattccca aaaagaaaat aaagaattga 79440aaaaaaatgt
ccaaattgtt tttcaagatc cttatgggtc attatcacca cgaatgacag 79500tgggggagat
agttggtgaa ggtttaggtg ttcactttaa gcttacaaaa aaagaaagag 79560acgaaaggat
agataaggtt ctgtcagatg tcggtatcga aatagtagct aagaataaat 79620atccgcatga
gttttctgga ggccaaaggc agagaattgc aattgctaga tctttaataa 79680tgaatcctgc
ttttatgatt cttgatgagc caacatcagc attagatagg tctattcaaa 79740ttcaggtaat
cgatttattg aaagagatac aaaatgaata tgggcttact tatcttttta 79800taagtcatga
tttaaaggtt attagatcga tgtcagactt tatttttgtt atgaaaaatg 79860gagaaatcgt
agagtcagga ccttctcaca aggtctttga aagcccagag caagactata 79920ctaaaaaatt
actatcagct gctttaaagt atgcatctga ataattaaat atatymcmtm 79980tggcaaatag
aaartattca aaagagctcg ttgacggtcc taatcaagct gcttctagat 80040caatgcttag
aggagtaggt ttcacatctg aggmtttcac aaaaccattt gttgggattg 80100cttccacagg
agcaaaagta accccatgca atatgcacat aaatgcactt tcagagatcg 80160ttgagaaatc
agttgatagt tcaggaggaa agggtgttct ttttaatact attactgttt 80220ccgatggaat
ttctatgggt acacagggta tgaaatattc tcttgtttct cgagaggtaa 80280ttgcagattc
aatagaaact gttgtgggat gtcttggtta tgatggagtt attgctgtcg 80340gtggttgtga
taaaaatatg cctggatgca ttattggaat ggcaagatta aatagaccat 80400caatatttat
atatggtggt tctatcaaac ctagtaaaga aaataccgac tatgttactg 80460tttgtgagaa
aactggagag tactcaaaag gcgatcttaa agaatctgaa ttaattcatg 80520tagaaaaaat
ttccgtaaaa gggcctggat cttgtggggg aatgtatacg gcaaatacta 80580tggcttctgc
gattgaagct ctaggcatga gtcttcctgg aagcagcagt caagatgcaa 80640tttcacacga
caaagaagat gattgtttta aggctggcga agcgataatg aatttattag 80700aaaaagatat
taagccttca gatattatga ctaagaatgc ttttgagaat gctataacaa 80760tggtaattgc
tctaggaggt tcaactaatg cagttctgca tttattggcc atggcgcatt 80820caataggggt
tgatttagag ctagatgact ttacaagaat aggaaaaaaa acacccgtta 80880tggcagatct
taagcccttt ggttctcatt atatgtctga actcaatgct aatggcggta 80940ttcagccact
aatgaaaact ttgcttgaga agggattact acacggcaat tgtcttaccg 81000ttaccggtca
gacgcttgct gaaaatcttt ctggaataaa accttacgag cctgatcaag 81060agataattaa
atcatttgat aatccaatta aatcaaatag ccatcttaga attctgtatg 81120gcaacttagc
gaaagatggt gcagttgcaa aaattacggg taaagaagga acttcctttg 81180aaggaagtgc
tcgtgtattt gattcagaag aagaaggggt taaagcaatc ctatctaaat 81240ctataaaagc
tggagatgtt gttgtaatta gatatgaagg gccaaaagga ggtccaggca 81300tgagagaaat
gctaaaacca acatctgcca taatgggtca aggtcttggc gatcaggtag 81360cttttataac
agacggtcgt ttttcaggag gcactcatgg atttgttgtt gggcacatta 81420ctccagaggc
tgcagatggc ggcttaattg caataattaa agatggcgac tctatattaa 81480ttgatgcaga
ttctgataag ttaattctta atatttctga ggatgaaatt tcaaatagac 81540taagtagatg
ggtaaaccca aaaacgcctc ccaaaaaagg agtcttagca aaatttgcaa 81600aaagtgttaa
atcagctagt cttggagcgg taacagatta aatatgtatt ttaaaagaaa 81660atttccaaat
agtagactaa gaagaatgcg tctcaattca aacctcagag acttgcttgc 81720tgaagttagg
ttgtctacaa atgatctaat tcagccatta tttataaaag aaggcttgag 81780tggaaaagag
gctattgaaa gcatgcccaa tattaataga tatggacaag attcaatttt 81840ttcagaaata
gaagagctac tagagcatga tataaatacg attgccctat ttccagttat 81900taatgattcc
aagaaaaaca gcactggaga tgaggctatc aatgcatcta atttgatgtg 81960tgaaactatc
agtaacataa agaaacgatt ccctgaaata atcttaatat ctgatgttgc 82020tctagaccca
tataccgatc atgggcatga cggtatttta aaaaatgatt atgttgataa 82080tgatgagact
ttagctgttt taaggaagca atcactaaca ttagctcagg ctgggacaga 82140cataattgca
ccatcagata tgatggatgg aagaataggc tctataagag aggcattgga 82200cgaaatggat
tataaaaata caatcctact atcatatgca gcaaaatata attcaaagtt 82260ttatggtcca
tttagagatg ctgttaattc agcttcaagt ctaggcaagt cttcaaaatc 82320cagttatcaa
atgtcaccaa aaaatataaa tgaagctcta catgaagttg ctatggatat 82380taatgaaggt
gcagacatag tcatggttaa gccaggtatg ccttatctag atataatttc 82440aaaagtaaaa
gaaaccttca aagtacctac ctttgcatac caggttagtg gtgaatatag 82500catgcttaaa
ctggcgattg ataaaggatg gcttgaaagc gatgttatgt tagaatcatt 82560aataagtttt
aaaagagcgg gagctgatgc aattctaacg tacgcagcta aagaaatttc 82620caaggagata
actaacaaat gagcaatgtg atagaaattc gtgatgaaga aagctttaat 82680agcgacgtct
taaattcaga aaaacctgta ttggttgatt tttgggctga gtggtgtgga 82740ccttgtaaac
agcttgcacc aactgttgaa acagttgcag cagaaaaatc agaaacatta 82800aaggtttgca
aaatggatgt tgattcaaat agagagattg ctgctaaata tggaataagg 82860tcaatccctt
cattaattat atttaaaaac ggagagcctg caggagttga agtaggtgct 82920ctaaccaaac
aacaattaga ggactttata agtacagtag tttaactttg caaagacttc 82980tttgcatatt
gcaagaaata ggattatcat ttcgacttct aaggctgaaa gccactcaaa 83040acaaaaaccc
accttttcaa tcataaaaac aacaactaga acggaaataa ctaaatgaac 83100cttactgaaa
ttaaaataaa accaataaat gaacttgtag atatagctac tgagcttggg 83160cttgaggatg
ttggaaggct gaaaaagcaa gagataatat ttagaatatt taagcataag 83220gcttctgaag
gtgttgatat ctatggtggc ggagttcttg agattttaaa tgatggtttt 83280gggtttttgc
gatccccaga aggctcttat tgcgctggcg aagatgatat ctatgtttca 83340ccaagccaaa
taagaaagtt tagcctcagg aagggagact cagttgctgg gaagataagg 83400acccctaaag
ataaagagcg ctattttgca ttaatccaag ttgatactat taatggtgaa 83460gagccaagaa
agactaaaaa caagattctt tttgaaaatt taactcctct ttttcccaat 83520gaaagactaa
tccttgaaca aggaacaggg tctaatgaag atctttcatc tcgaataatt 83580gatttgattg
ctccaatagg aaaaggtcag cgtggactta tcgtttctcc acccaaggct 83640ggtaaaacct
taatgcttca aagcatagct cattctatta aaagcaataa tccagaagta 83700gagcttatag
ttcttttgat tgatgaaaga cctgaagagg taacagagat gtcaagaact 83760gtaaaaggag
aggttgtagc tagtactttt gacgagccac ctactcgaca cgttcaagta 83820gcaaatatgg
ttattgaaaa agcaaaaaga cttgtagaac ataagaaaga tgtagttatc 83880ttattagatt
ctattactcg tcttggaaga gcatataact cagttcagcc tgcatcagga 83940aagatattga
gtggtggagt tgactccaat gctcttgaaa ggccaaaaag gttttttggt 84000gctgctagaa
atcttgaaga gggtggaagt ctcactattc ttgctactgc tctagttgaa 84060acaggctcaa
agatggatga agttatttat gaggaattca aaggtacggg taatatggag 84120attcaccttg
aaagaaaaat agccgaaaaa agaatatatc ctgctattaa tattagaaga 84180tcgggaacaa
gaagagaaga tttacttact gctgaagatg aattacaaag aatgtgggtc 84240ttaaggaaaa
ttttagacga tatggaagat gctcagtcaa ttcagttcct aatagataga 84300ttgaaatctc
ataaaacaaa cgatgagttc tttacttcaa tgaaaggggg taatggcaag 84360aagactagat
aaagtttttt gccatatcaa tcatcatctt atcagttggg ctttgcgata 84420cgtgaatttc
taattcttta aactcatttt gacatctatc ctttatattt tctgaagcaa 84480caataaatat
tttttttctt aaaactgcat cgtccagatt tgttattaag aatttaagag 84540tactaaaatt
atagattaaa aaaatttcat tattatctgt aacttttggt atttgctcca 84600gcaaatagat
tacttcatag caaactatct catctagact agccttaagt ttttgttgaa 84660gaaaaccatt
tgaattttcg ccgcaaaata aaagactctt tcctagaaaa tttttctcta 84720ttaacttaag
aattccttca gatgaatggc tttgtggaaa atgagatttt atgccacttt 84780caagaagttt
attggaagtt gcgggcccaa cagacaaaaa attgtgtgga agatcatcta 84840aatcgaaaaa
agattttaaa atatcaagtc cgtacgaagc agcagcttgg ctagtaaaaa 84900ttaagtttga
atatgaatga atatttttaa ttttatcgat agcgattttt gaagggtcga 84960ttgaattaat
tttagaaaga taaatatttt taagagctat ttcttgagct tcacataaag 85020atattaggtt
tcctgataaa tttttgggcc tcgtattaat tatcattttt taaaattaaa 85080gatttcgctc
cttccaaaat aaactcatca gcaaataaca taatatctat tgcaatatct 85140tgaaaggaag
atatcttttc tttataaatt ttttcgttgc cttcataaga taatattttc 85200cctgatattt
ttatttggcc atttttatct tcacataaaa tagctattgg agacaggcaa 85260gtaccttcca
tggcagcaac aaaagatcgc tctgcactgg caagaattaa ttctttaggg 85320tcaccaatat
tttctaaaag ctcaataata tcttttttat ctgacagaca ttcaattgct 85380atatatccct
gagatgcaga aggtaacatt tcttcaattg agaattcata tgaattttgt 85440tttaaaccaa
gcctttttat agcagcctta gcaactacca gcccatcaaa cagaccatca 85500ttgagtttct
ttattctagt agctatgttg cctcttactg gaatagtttt tatatcaggt 85560cttaagttat
taatttgtgc ttttcttctt ggaccagaag taccaatagt tgaatttcta 85620gcaaattcag
aaaatgattt cccatcctta gatagaagca ggtcttctgg agattcccta 85680ctaaaaacac
ttattatttt aaactttgga tcaagctttg ctggaacatc ttttaaacta 85740tgcactgcaa
tatctgcttc atcagcctct agagaggact caagtgtaga aatgaatagg 85800ccctttccac
caatttcatg aagaggttta tctgtttggt ctccttctga tgtcatagga 85860acaagctcaa
ccttaatatt attgatctta gctaataact catctgcaac aaattttgcc 85920tgatacattg
ctagttctga ttgccttgta gctattctta ttttcatttt gtttccaata 85980aaagagcctt
cacatcgcca acactctttt ctttcaagat agtgaattcc tcaggaattt 86040ccaaggtagt
aaatttacta tactcaaggt agattttaca agatggcttt atttcattct 86100ttcgaattat
agattttaga acttttaatt catattcttc gccaaaagga ggatcaagaa 86160gaattaaatc
aaaacgagat aagtcatttt ttttaatcca actaaatgca tccttaaaaa 86220aaactttaga
tttatccttg atgccaagaa gttgaatatt ttttgctaac acagaatagt 86280tttttttatt
aagttcaacg aacacaactt tttttgactg tctggatatt gcttcaatac 86340caagcgcacc
cgttccggca aacaaatcaa gacatattaa attttcaatc tcaaattgaa 86400gccaattaaa
aattatttct tttaatttat ttgaagttgg tcttaaggaa tccttaaatt 86460cgaaaggtat
ttttttacct tttaaataac ctccagtgat tctgatattg tttttcattt 86520tttttgcaaa
atgggtatgc ctatttatta aataaatata attamaaata catttaaact 86580aagttataat
tcatccatgm gtcctacaag ataattttag aaaagcaatg aggagctata 86640tttactctgt
cagtgtgatg tcaaatgttg acgagaataa aaagtttagt gctataacag 86700tttcttcagt
tacctctgtt tctttagatc ctcctagttt gctcgtttgt atcaataaat 86760cagctggaat
tcacaactca ataaaagaag ggtcctcttt ttgcataaac cttttaaata 86820aaaatcaaga
agatatttct aatctatgca gttcatttaa gaccgaaggt gatagattta 86880atagtggtga
ctgggattta agcggcactc cgtttttaaa aagtgcccag gctaatattt 86940tttgtactgt
tgatcaatta atttcatacc acacacatac tattgtgatt ggtcatgtca 87000caaactctct
tagcgatgag aaaattaata cactgacata tgttgatggt agctatgaat 87060aaattttcaa
aaaatgtatt ttttatttta ataatactca actctttttt tcttgcctct 87120aacatttttg
ctagccagga agagtgtgaa gaaaagccaa gtgtttttat tatctctcct 87180caagacggtt
ttatctcaga atctaataat gtaaaagtct tatttggatc aaaaaatatt 87240gaaataaatc
cagctggcaa aggtgagatt gcaaaaaata aatgttttgc aagcgggcat 87300caccatcttt
tagttaatat cgaagcattg ccagagagct ttattccttt tgacaagggt 87360tatttacatt
ttggaggagg tcaaactgaa acaattcttg atcttgatcc tggaacctat 87420tctctccaac
taattcttgg atcttatgtg cataattcaa aaatgcaggt aaataacttt 87480aaaggtcaag
gacccttttt atcagaaaaa ataacaatta cagttaatta gagattagac 87540cagttataac
tttatctaga tgttcatcta agttattagc aattatattt gcactagaaa 87600taggaccaga
attatcgtca taatcatcac caactacatt cacctttctt attatcccaa 87660caagaccatg
caacatagac caaagagtta tacatttaaa ggcaatcact tcctcaggct 87720cgtctgctag
attagcaaaa ctttttctca tattatcgta tgttccattt gcagatttaa 87780gaagatctgg
gtaatcggca aagttaccaa cagctgttcc aaacatcaaa tcatatgtat 87840gtgcattttt
taaaccaaac cctatatatt tgcttgcatt agtaacaagt tttttctttg 87900taattttttt
tggattctca aaaaaaacaa gctcattaag ttttttaaag ccaacggttg 87960caacagcggc
atatacacat tcttttgttt caaaatgcct ataaggagca gtttgagaaa 88020caccactttc
ctttgcaagg gatctaatac ttagtttagt gtaaccatct ctatcacaaa 88080gcctgcatgc
gcattctata agttcttcct ttaagtttcc gtgatgatag ttattcataa 88140tttaatttta
atgttgacac tgctaacata tatcatgtta ttatgtatac accgcataca 88200ttaagttgat
tacatattaa agtcaaatac agatatatga acataacaaa aataaataac 88260accgtcatag
ctcttttatt aggcaatgcg tttctatcaa atcttgaggc tttagaagtt 88320cttgaggtta
aaatgcttga tgaatatgct gtaactagag aatttcctgg aaagctcatt 88380ccatcagacc
agtctaagct agcatttgaa atacctggaa agataaactc tattaatgtt 88440gatatcggag
atgaggtcat cttaggggat gaacttgcct cattggatga tagagaagct 88500ttagcacaat
taaatcaatc aaaggctaaa tttgatttag ctgaacaagt actagcaaga 88560tatataaatc
tcagagcaga tgggcatatt tctattcaag atcttgataa ggctgaatct 88620gatcaaatag
tagctaagtc gcagtatgat ttttatagag ttaaatttga gcaaactaag 88680ttactagctc
cttttaatgg agttattcaa aatagatttc tagatacagg atcagtaatc 88740aatgcaggtg
tccaagtttt agaaatttta ggctctagca atgttgaagc aagaatttct 88800attccaatga
actatatgaa caaggttaag attggagatg agtatgaatt tgatatcaga 88860ggaatatcta
caaaagctac gttagagaga ctggctccca tgtcaaccgg aggctccaat 88920aataggttag
caatttttag atttgatacc ttttttaatc caggatcaat agtaaagctc 88980aaactaagca
tcactgagaa agcaaaaggt acatgggttc caattaagtc actgtcccag 89040tccgaacaag
gtatttgggc catatatacc attaacgagc aacaagtagt tgttagagat 89100cttgttgatg
ttatttattt tgaagacgaa tatgcttttg tcagcggaac acttaataat 89160ggtgatttag
taattttagg cggagctcaa aaaattattg aaggaaaaat aataaaataa 89220aatgaatgtt
attaattttc taatagaaaa gcctaggata ttatttctaa ctttagcatt 89280tatattactt
tctggaattt cttctgggct ttcagttcca attcaagaaa accctgaact 89340ggctgagaga
tggggaggtg ttcgtatttt tcttcccggg gcatccccag aaagaattga 89400aacagagata
gtaaatgatc tagaaatcaa acttagagaa gttgaagaaa tcgatgagct 89460tgaatcaatt
attactcaag gtttttcaac aattgtagtt gaattaaatc aaagtgtacc 89520tcctatactt
attgaagaga cttggtccaa ggttcaagac aagctcaatc aaatagttat 89580tcctcaaggt
gcagaaatat ttcttgatag aagcagtggt ccgcctatca ctgttcagta 89640tgctgtaacc
tggaacggca gtggagatgc tccactaata atgatgtcca gactagcaag 89700ccagctaaaa
agaaaattaa gctcaatagg ctcatctcat caaactgcaa tttttggtga 89760aacagatgaa
gaaattttaa ttgaactaga ttcatcaaag ctatcttcgc ttggattatc 89820atttcaagat
atcgcaagtg ctattcaagc cctagatgca aaaaaaccta ttggtgtatc 89880ctcaaacaac
aattctgagc ttttatatag actcaaagat aatatacaga gcattcaaaa 89940actctcagaa
atacctatca aggttattaa taaatcagag atcatacagc tagatgatgt 90000ggcatttatt
tcaaaaatcc cggtttctcc tattgaagac atattcttgt ttaatggaaa 90060tgtagttatt
tctgttgctg gaaccggatc attttctcaa agagtccatg attatgtaga 90120acgcgcaaca
attgttgtag atgagatgag agaaactctg ccgactgaga tcactataga 90180tttagtttat
gacgaatctg cttacacaac taaaaaattt aatgagcttg taaaaagttt 90240ttcattagca
atattttttg ttttagcttt aagtcttttt tttcttggaa ttagatcagc 90300aataattgtt
actcttatcc tgccattttc tatttgcctg gttatgattg gttgtaggtt 90360tataggctta
cccttgcata tgacatctat cactggaatt attattgcac taggattgct 90420tatagataat
gggattattg ttgttgaaga ttataagaat agaagagcat ctggattaaa 90480tatcaatgat
tcaatttcac aaggactaaa aaacttatgg gctccattag ctgctgcaac 90540agcaacaacc
gtcttctctt ttcttcctat tgttactgga gaagggtcga gtattgaatt 90600cgtaggcggc
atggcaatga cagtcattat gtctataaca tcttcattag ttttggcgtt 90660attaatggtt
ccagttctga tgagttatat ggaaaaaatt ccgttcttta aggatgtgga 90720tattagcaag
gaagggtata gaaatgaaaa aatccttaat aaatataggg cctttttaaa 90780ctgggcgttc
ttagttccta gaagagcaat catgatatcg cttgcattgc ctgttctagg 90840attctttctt
tttaattctt tacctaaaga tttctttcct gctcaagata gagatatgtt 90900tagagttaat
atagaactgc cttctaacgc ctcatcactt actacaatgc agagagttaa 90960ggaaattaga
gaagatattc tagatagtga tttaatttca atagaaaaag attattcgtt 91020tatcggcaga
atgatgccta gagttttgat gaatgttgtt ggtggagaag aaaaacaagg 91080atccaataat
attgcgcagt ctgtattttt tgctactgat tattatgaaa tgattgaaaa 91140ccttccagat
ttatcaagaa gactggttaa aaataaccct gacattatag ttaatattga 91200tagtttctcg
tctggccccc cggttttttc agatgttagt tatgtaattt ttggagatga 91260tccagattta
ctaaaatcac ttggtgagga gctagagcta attattaaca attctcctga 91320tgtgagtctt
acgaaatctg caacttcaaa ctcaataacc aatgttgagt ttgaacttaa 91380cagctcaaat
atttcactgt ctggtcaaaa tgccaattat cttgtaaatg aaatgtttac 91440tgcaaacaat
ggaatatttg ttggcactat gttggattca aacaaagaaa taccagtcag 91500gctgaaaggg
ctgtctaata aaaacaatat tacgggaaat actagtttta taacaatgcc 91560ctctcaaggt
ggttttgagt attttgatag tttcggaaaa agctcactaa caaacaaatc 91620gtcaacaatt
actaggcttg atggccaaag aacaaatgat gttgagggct ggatttggac 91680aggtacgctc
ccgtctgcta ctgaaaaatc tattaaaaaa gatgttaaag attttgaatt 91740aagattgcca
ataggctatt cattaaaaca acttggcgag gctgaaagca ggggccaatc 91800tcaagcctca
ttatactctt cagcttttat gtatttcatt cttataatag taggcttggt 91860tatggcgctt
aattctttca gagaggctgg tctaattttg tctgttgcat tcttatcaat 91920tggactatcc
tttcttggtt tatttatagg ccagcaaaat tatggattta ttggaactat 91980aagtgcaata
gggttaattg gcttatcaat aaatgattca attattgtct tatctcacat 92040aaaagaagag
gctgagaaga aatcactaac caaagctgag cttgttgaag ttgttatcag 92100gtctacacgt
catataatca ctacctcttt gacaacactt ggtggttttc ttccacttat 92160ttttgcaagt
gtattcttca aaccgcttgc ttgggcaatg agcattggag tattaggcgc 92220gactattaca
gccttattat atattcctgc aatgtttatg ataatgagaa aggttaagta 92280ctagaacaac
tttccgagca ataatttctt cctagctttt ttacaataag tgattcatga 92340gtataggtct
cgcaagaatc acatttaatc atagagttat caatcttttt tttatgaggt 92400cctgatatga
gagatctgaa aaagaatacc gcagttatta tcaatataaa tatgaccaat 92460ggaatcaaaa
gtaatatcga ttttaataag aacatttttt agtcactaga tgctggctta 92520ctatttctta
taaaattcca agaccagcca gacttatctt cagatttttt tgattgctca 92580tcttggtagt
taatagaaat tattttttta gtatcttcaa gcaattcaat atatcccaaa 92640gattcatatg
aagcctctaa gatcttaaga gctctataat tttcactaga gttaggaata 92700ttttcaatta
cataatttgc tcttcttatt gctgctatat gcgcatcaac actaacgtaa 92760taatcagctg
ccgcaagctc atttcttgca atcatatttc ttaaatagat atttctttgt 92820ttagcatatg
tagaatattg gctatcagga aatcttgtta agaattcagt tagttctgaa 92880aatgattcct
tagctcctga gatatctcta tttgatagat cagtatctgt cattcttaca 92940ataaagctat
tatctctcgt atagctagaa aggcctttca taaaatatgc ataatcaata 93000tttggatgcc
taggatgaag tcttataaat ttttctgctg cagcatgcgc agcctcagtt 93060tcagcattca
taaattgagc ataaataagc tctacttgcg cctgttcagc atatttgcca 93120aaaggatatc
ttgattcaat tgcttctaat gaatcaatag caccaaaata attttttcca 93180gccatccttc
tttgggcttg atcgtaataa attttttcag gctgttctat ttctgggcca 93240tcagaattac
aactaaccaa taacagagtt acaattggca atactataaa tagttttaaa 93300attagtttat
ctttcattat ttgcacattc tacctgcatt atcgtaattt aagcttgaaa 93360acatctattt
agattaattt ttttataata agttcatatg attgttaaaa atgttccaaa 93420tgatctatct
tcaatgaggc tagacaaagc tacagcagaa atgtttacag attattccag 93480aactcagata
aaaaaatgga tagaagaagg cagagttctt ataaatggag aggtatccca 93540gccaagagat
aaagtttatg agaatgatca gattgaatta agccctaaag aagaacaaaa 93600agtatcatgg
gaagctcaag atatagattt tgaaattcac tttgaaaatg aagattttat 93660tataattaat
aaacctgctg gtttaataat gcatccaggt tctggttgct atgatggaac 93720tctcgcaaat
gggctcattt ataaatttcc agaattgatc aatattccaa ggtcgggaat 93780tgttcatcga
ttagacaaag atacttctgg cattctgctc gtagcaagaa atgagtcttt 93840taggaacttt
tttattaatg aaatgcagga gaggagagtt gtaaaaaaat atacgtctat 93900tgttattggt
tctacactag gaagcttttc tatagaagag ccaattggaa gagataaaaa 93960taatagaacc
aaaatggcaa ttcgagaaga tggcaaagat gctttaacat ttgtaaagct 94020taaagaaaat
attggaaact actctgtgtt ggatataaga atagagacag gaagaactca 94080tcagattagg
gtccatctat catcaaaaaa actaccaata attggagata aaacctatga 94140cccaagcagg
tctattgcaa gagatacccc tgaagagcta attaatatta tccgaggttt 94200tccaaggcag
gcattacatg caacacacct ctcattcaat gaccaaaaaa caaataatat 94260tttttctttt
gatattccca ctccaaatga tatggaggaa ctacttctag aattaagaaa 94320attgatctaa
tagtaactaa aaacttgttt tttgattaat aaaataatat aaaccttatt 94380cctaagagtt
ttttggtaag aaattgaaat tatctggcgc agacatgcta atgcaagcac 94440ttcatgatga
aggtgttgag ctaatctttg gctacccagg tggagccgcg cttcatatct 94500atgatgcaat
ttttagacaa gataaaatag atcatatttt agtaaggcat gagcaaggtg 94560caacccatgc
agcagatgga tattcaaggg cgacaggtaa gccaggagtt gtcttagtca 94620cttctggacc
tggtgcaaca aatgctatta caggaatcgc gactgcattt atggattcca 94680taccaatggt
agttatttca gggcaggttg ctagccattt aataggtact gatgcttttc 94740aagaaactga
tatgattggt gtttcaagac caattgttaa gcatagctat acagttttta 94800atgctgaaga
aatacctaag ataattaaag aagcttttta tgtcgcaact tcaggcagac 94860ctggacctgt
tgttatagat atcccaaaag acatgacagc tccggataat ctttttgatt 94920actcgtatcc
tgaagaagcc aagataagat catacaatcc tccgattgag ccagaaaaaa 94980atcaaataga
tagagcagtc gaagctatat tgatatcaaa aaaaccagtt atatatgctg 95040gtggtggggc
aattgctagt aatgccgaaa aagaattact tgaacttaat gaaattattg 95100atgctcctgt
tacaaatact ttaatgggat tgggtattta ccctgctagt catcatagat 95160ttcttgggat
gttagggatg catggaacat atcaggcaaa tatggcaatg cataatgcag 95220acttaataat
tgctattggc gccagatttg atgacaggat taccaataaa ccatcaaagt 95280ttgcacctaa
tgccaaagtg gttcatctag atgttgatca ctcatctgta tcaaaaatta 95340tagaagcaaa
tgtagctgtt tttgggcaag taaaaaattc cttaaaatta ataaaagaaa 95400ctcttgaaaa
aaaattagac tcttacgatt ctttcgctct tcagccttgg cacgatcaga 95460taaaagaatg
gaaatcacta catggtttaa attatgagct ttataaagat gaatctgatg 95520atcatcccat
tttaccccag gctgtagtcc agcatgtcca tgagattaca aatggggaag 95580catatgtgac
ttccgatgtt ggtcagcatc aaatgtttgc tgctcaatat tatcattttg 95640ataagcctag
aagatggatc aattctggtg gtctaggaac tatgggtttt ggtttgccag 95700cagcaatggg
tgtaaaactc gcttttccaa aagatgaggt tgtttgcatt actggtgagg 95760gtagtatcca
aatgtgcatc caagaattgt ctacatgtct tcaatataat ctcccaataa 95820aaataattaa
tatcaataac gaagctcttg gtatggttaa acaatggcag gatatgaatt 95880atggaggaag
gcactctgaa agtacctatc aaaactcgtt accggatttt ataaaactgg 95940ctgaatcata
tggtcatata ggaattaaaa ttacaaaaaa ttctgattta agtgaaggct 96000taaaaaaagc
ttttgaaatg aaagataaac ttgtctttgt tgatatttat gtagatcctt 96060cagagcatgt
ttatccaatg caggttgcaa atggcagcct agaaaatatg tggctatcaa 96120aggatgaaca
aacatgatta aaagaaaact aattttaatt atggaaaata aaccaggagc 96180tctagtaaga
gtagttggac tgtttcatca aagaggctac aacattgaaa cccttcatgt 96240agatactgtt
aaagactttt ctacttacaa atcgatattg aaaaaaaact tgaaaccaaa 96300tttgaggata
atcaaatatc tagactgacc atagaaacaa tggtttcaga tgaccttatg 96360aggcagattt
tgagacagct caataaatta atagatgtta tagctgtaag caatgaagag 96420acaacctatt
taaaaggagt attattagat gaaaatttat tatgacgaag atgcaaacat 96480agaaattatt
aaagggatga atgtctcaat aattggctat ggatctcaag ggaatgctca 96540tgcaaataat
cttcatgaat ctggtgtaag tgttactgta ggtttaagag aagggtcttc 96600ttcatgggca
aaggcagaag aagcaggctt aaaagttcaa acagttgctg attcggtaat 96660ccaagcagat
ttggttatga ttttggcacc agatgaattc caaaaaaata tatatgaaac 96720cgaaatcaag
ccaaacttaa aaaccagtgc aattcttgca tttgcacatg gctttaatat 96780tcattttgaa
aaaatagttc ctgaagcaac taacagcgta attatgattg caccaaaagg 96840tccaggccat
actgttagaa gtacttatac caatggtgga ggcgttccat ctctcatagc 96900tatatatgaa
gatgctttaa gtgatgaaga ttattcagca aaagatgtag ctctatctta 96960tgcaaaagca
aatggcggca cgagggctgg tgttcttgaa acatctttta aagaagaaac 97020agaaacagat
ttatttggcg aacaagcagt tttatgtggc gggcttaccg ctttaattaa 97080agctgggttt
gaaactctag ttgaggcagg ctacagtgaa gagatggcat attttgaatg 97140ccttcatgaa
acaaaactaa tcacagactt aattcaagaa ggtggcattg ctaatatgca 97200ttactcaata
tcaaatactg ctgagtatgg tgattatgtg agtggaccca aagtaattac 97260tagcgatacc
aaaaaagcaa tgaaaggaat actagaaaat atacagtcag gaaaattcgc 97320agatgacttc
ttgaatgact gtcgacaaag caatgatggc actggcggac ctgttatgaa 97380aagcaataga
gaagctacaa aaattcatcc aatagagtcg gtaggggctg agctaaggtc 97440taaaatgaag
ttcctaaatt cacaaaaatt ggtggataaa gaaattaatt aaaattaatt 97500aaaaaaaagg
tatcttcttc ggaataactt cgttaagata cgcgtccgca caaagaggcg 97560gttgttcttt
aaaaatattt ggttactcgt gtgggtgttc aaaatacgag aaaaaataat 97620ttagattttt
tataaaaatc aacaaaacat gatattaatt gaagagtttg atcatggctc 97680agattgaacg
ctggcggtag gcttaacaca tgcaagtcgt gcgagaaagt atcttcggat 97740atgagtagag
cggcggacgg gtgagtaacg cgtaggaatc tacctagtag aaggggatag 97800cccggggaaa
cccggattaa taccgtatac ctccttcggg agaaagaagg cctctctttg 97860aagctttcgc
tactagatga gcctgcgtaa gattagcttg ttggtgaggt aaaggctcac 97920caaggcgacg
atctttagct ggtctgagag gacgatcagc cacattggga ctgagacacg 97980gcccagactc
ctacgggagg cagcagtggg gaatattgga caatgggcgc aagcctgatc 98040cagccatacc
gcgtgtgtga agaaggcctt cgggttgtaa agcactttaa gcagggagaa 98100aaagttataa
gttaatacct tataaccctg atgttacctg cagaataagc accggctaat 98160tccgtgccag
cagccgcggt aatacggaag gtgcaagcgt taatcggaat tactgggcgt 98220aaagcgcgcg
taggtggttt gttaagttgg atgtgaaagc cctgggctca acctaggaac 98280tgcatccaaa
actaactcac tagagtacga tagagggagg tagaattcat agtgtagcgg 98340tggaatgcgt
agatattatg aagaatacca gtggcgaagg cggcctcctg gatctgtact 98400gacactgagg
tgcgaaagcg tgggtagcga acaggattag ataccctggt agtccacgcc 98460gtaaacgatg
acaactagct gttgggagac tatgtctttc agtggcgcag ctaacgcttt 98520aagttgtccg
cctggggagt acggccgcaa ggctaaaact caaatgaatt gacggggacc 98580cgcacaagcg
gtggagcatg tggtttaatt cgatgcaacg cgaaaaacct tacctactct 98640tgacatactt
ggaggctctt gtaatgagag tgtgcctttt ggaaccaaga tacaggtgct 98700gcatggctgt
cgtcagctcg tgtcgtgaga tgttccgtta agtcggataa cgagcgcaac 98760ccttaccctt
atttgccagc gattcggtcg ggaactataa ggggactgcc ggtgataaac 98820cggaggaagg
tgaggacgac gtcaagtcat catggccctt acgagtaggg ctacacacgt 98880gctacaatgg
gagatacaga cggacgctaa gccgcgaggt ggtgctaatc ctaaaaagtc 98940tttcgtagtc
cggattggag tctgcaactc gactccatga agtcggaatc gctagtaatc 99000gcggatcagc
atgccgcggt gaatacgttc tcgggtcttg tacacaccgc ccgtcacacc 99060atggaagtgg
attgcaccag aagtagatag tctaacctta gggagggcgt ttaccacggt 99120gtgcttcatg
actggggtga agtcgtaaca aggtagccgt aggggaacct gtggctggat 99180cacctcctta
acgataaatc gcgttttaaa cgcccacacg agtaatcaaa tattaaaaaa 99240aagaacattt
agatatgtaa aatcattggt atgtaatttt ctagtgtata catttatgta 99300tacataagat
cactgcaatt aaaaagtaac atatgcattt atgtgtatgt taaaaaagta 99360attaatatat
tttattaagt tactctcaaa aatgaagata aaacttcaaa aaaaatatgt 99420aacctttttt
aaggttatat gatcaagtaa aggaagagca caaggcggat gccttggcag 99480cataaggcga
tgaaggacgt aataacctgc gataagcctc ggggagctgg taaataagct 99540tcgatccgag
gatttccgaa tgggaaaacc caatacacat aagtgtatta tcttatactg 99600aatacatagg
tataagaggc aaacctaggg aactgaaaca tctaagtacc tagaggaaaa 99660gaaatcaaca
gagattccgg tagtagcggc gagcgaaacc ggaccagccc ttaagcttat 99720tttagtccag
caaaatattc tggaaagttt agccatagta ggtgatagcc ctgtatgcga 99780aagactaatt
taagtgaaat cgagtaggtc gggacacgag aaatcttgac tgaacatggg 99840gggaccatcc
tccaaggcta aatactctat gctgaccgat agtgaaccag taccgtgagg 99900gaaaggcgaa
aagaaccccg gcgaggggag tgaaatagaa cctgaaacct tgtgcttaca 99960agcagtcgga
gcagacttgt tctgtgacgg cgtacctttt gtataatggg tcaacgactt 100020aatttcagta
gcaagcttaa ccatttaggg taggcgtagg gaaaccgagt cttaataggg 100080cgctcagttg
ctggaattag acccgaaacc gggtgatcta tccatggcca gtgtgaaggt 100140cgagtaacat
cgactggagg cgcgaaccca cttatgttga aaaatgaggg gatgagctgt 100200ggataggagt
gaaaggctaa tcaaacccgg agatagctgg ttctcttcga aaactattta 100260ggtagtgcct
cgtgtattac tgtagggggt agagcactgt ttcggctagg gggtcatccc 100320gacttaccaa
accgatgcaa actccgaata cctacaagta tgagcacggg agacagactg 100380cgggtgctaa
cgtccgtagt cgagagggaa acaacccaga ctgtcagcta aggtcccaaa 100440ttatgattaa
gtgggaaaca atgtgggaag gcacaaacag ctaggaggtt ggcttagaag 100500cagccatcct
ttaaagaaag cgtaatagct cactagtcga gtcggcctgc gtggaagata 100560taacggggct
aaatcataaa ccgaagctac agatcttaaa tttatttaag atggtagaag 100620agcgttctgt
aagcggttga aggtaagctg agaggcgaac tggacgtatc agaagtgcga 100680atgttgacat
gagtaacgat caaagaggtg aaaaacctct tcgccgaaaa accaagggtt 100740cctgtccaac
gctaatcgag gcagggtgag gcggccccta aggcgagggc gaaagccgta 100800gtcgatggga
aacaggttaa tattcctgta ctttttataa ctgcgatggg gtgacggaga 100860aggttagact
agcacggcga cggttgtccg tgttcaaggt tgtaggctgg tgttctaggt 100920aaatccggaa
cgctaaggct gagaactgat aacgaccact ctacgagtgg gaagtagtcg 100980ataccatgct
tccaggaaaa acctctaagc ttcaggttat aagaaaccgt accctaaacc 101040gacacaggtg
gttaggtcga gtagaccaag gtgtttgaga gaactatggt gaaggaacta 101100ggcaaaatag
caccgtaact tcgggagaag gtgcgccgcg tttggtgatg agacttgctc 101160tctaagctga
acgtggtcga agataccagg tggctgcgac tgtttactaa aaacatagca 101220ctctgcaaac
tcgtaagagg aagtataggg tgtgacgcct gcccggtgcc ggaaggttaa 101280ttgatggggt
tagcttatgc gaagctcttg atcgaagccc cggtaaacgg cggccgtaac 101340tataacggtc
ctaaggtagc gaaattcctt gtcgggtaag ttccgacctg cacgaatggc 101400gtaacgatgg
ccacactgtc tccaccatag actcagtgaa attgaaatcg ctgttaagat 101460gcagtgtacc
cgcagctaga cggaaagacc ccgtgcacct ttactatagg ttcgcactgg 101520actttgacct
tacttgtgta ggataggtgg gagactttga agcagagacg ccagtctttg 101580tggagtcatc
cttgaaatac cacccttgta agattgaagt tctaacctag gtccattatc 101640tggatcaggg
acagtgcgtg ctgggtagtt tgactggggc ggtctcctcc taaagagtaa 101700cggaggagta
cgaaggtatc cttatcacgg tcggacatcg tgaggtaagt ataaaggcag 101760aaggatgctt
gactgcgaga tcgacggatc gagcaggtag gaaactaggt cttagtgatc 101820cggtggttct
gaatggaagg gccatcgctc aacggataaa aggtacgccg gggataacag 101880gctgataccg
cccaagagtt catatcgacg gcggtgtttg gcacctcgat gtcggctcat 101940cacatcctgg
ggctggagca ggtcccaagg gtatggctgt tcgccattta aagtggtacg 102000cgagctgggt
ttagaacgtc gtgagacagt tcggtcccta tctgctgtgg gcgtttggag 102060atttgaggga
agctgattct agtacgagag gaccgaattg gacgaacctc tggtgttccg 102120gttgtcacgc
cagtggcatt gccgggtagc tatgttcgga aaggataacc gctgaaagca 102180tataagcggg
aagcctctcc caagattaaa tctcccagag actttatgtc tcctaaagag 102240tcgtcataga
ctatgacgtt gataggcaag atgtgtaagc gctgcgaggc gttgagctaa 102300cttgtactaa
taactcgtga ggcttgatca tgtaacctta agcaaggttc ataatttgag 102360taaaacattg
tagtgagaat taaaaaataa aaagttacat accagtttgc ctgatgacaa 102420tagcaacttg
gaaccacctg atcccatctc gaactcagaa gtgaaacgag ttaacgccaa 102480tggtagtgca
gggtctccct gtgtgagagt aggaaatcgt caggcttttt tctttaaggc 102540ttccagttta
ctggaagcct ttttttttat ctcaagtata atacccagat gattattgga 102600ttaacaggag
gcattggttc tggcaaatct gccgctgcag acttctttat tgatttaggt 102660atatcagtct
tagatgcaga tcaagttgct aaagaagctt tatctacaaa ttctcctgga 102720tatactgatt
ttatttctca atttggtgaa gtgtatttaa ataataatcg tgaggttgat 102780aggctgaaat
tgcgcgaact tattttttca aatccttcaa aaaaaaagga tcttgagaat 102840attattcatc
ccatagttcg gtctgctatt agtaatttta ttattacatc aacatcacca 102900tattctattg
ttatggtgcc actcattttt gaaacaaatt catataaaaa ttacgataag 102960attattactg
ttgactgtga tttagaactt cagatagtaa gggcctcaag tagagatgct 103020caaaataaat
cgcagattaa gaatattatt aataagcaag cctctagaga ggagaggcta 103080agtatttctg
atgacgtact tatcaataac agcaccctat ctgatctaaa aaaacaagtt 103140aatgttttac
atactaaata tatggagtta ttaaatgagt agttgcccta gatgtgaaaa 103200acctgtcaaa
ctttctactg acaatattta tagacctttc tgctctgaga aatgcaaact 103260tatagatttt
ggtgattggg ctaatgaaga taataaaatt tcaagaccaa ttcaatctga 103320agatttttac
gaagattaaa tttaagaaag tctccattca ccactttcaa tcatcggttt 103380ggctttttta
tacttcattt cttgcgtatc ttgaccgtta gtaattttta caagatcatt 103440tctgcctaac
ttaggttcgt ttcttgtcac agtacttgtt tttactatag gagtggcttg 103500tttttcattt
tgaaagatat cagaattaat ttcctctttt tccaatttta attcttgaga 103560actattttgt
ttgtttatgc tctctagctc agattctgtc gatatttgta atgagaaaag 103620tattcttata
gtttcaacat ctatttccga aagcatagat tcaaacattg aatatgcctc 103680tcttttgaat
tcattttttg gatttttttg agcataagcc cttagaccaa cactatttct 103740taaatgatct
atttctgata aatgctcttt ccaatggaca tcaagtactt gcagcataac 103800ctgcttctca
agaagcaatc tattttcacc aagatcacta aacttttttg aatatttatt 103860ttttgcttgc
agaacaattt cttcggcaat tgtatttggt acaagttttt tattactatt 103920aattttattg
gctatatctg tttctagtcc atagctctct tttaaataat catcgagctg 103980cctacttttc
cattgagact ctattgattc ttcaggtaca tataaattag atattccttt 104040gaattgctgc
tcgatgagtg actcaatggt actgctaata tcttcctctt ctagtaattg 104100attccttaaa
gaatatatag cttgtctttg atcatttgat acatcgtcgt actctaaaag 104160atttttcctt
gcatcaaagt ttctgctctc aattcttttt tgcgcatttt ctattcctct 104220tgaaagcatt
ttagcttcaa tatgatcatc tcccatgccc agcctttcaa aagtagccct 104280tcttccatcc
gaaataaaaa gtctcaagag atcgtcttct aaagataaga agaattttga 104340ataacctgga
tcaccttgtc tgcctgatct cccccttaac tgattatcta ttcttcttga 104400ttcatgcctt
tctgttccaa gtatatgaag cccaccagat tctataactt ttttattatt 104460ttctttccac
tctatatcgc tttggtcttc tttttttccg ccaagaacta tatccgtccc 104520ccttcctgcc
atattcgtag caatagttac cattccaggt ttacctgcat tggcaattat 104580ctctgcttct
ttttcgtgat gcttggcatt taaaatttga tgaggtattt ttttgttatt 104640taaatatgct
gatacttctt ctgaagattc aaccgaaact gttccaacaa gaatgggaga 104700agattttttt
cttaattgtt caatttcttc aattagagct ttatatttcg attctgttgt 104760aagaaatact
aagtcattaa gatcagctct agccatagga acatttgttg ggatgatgat 104820gacatttagg
ccatagattt gactaaactc tactgcttca gtatctgctg ttcctgtcat 104880cccagaaagt
tttttaaata atctaaaaaa gttttggaat gtggtggatg ctagtgtttg 104940agactctctt
tggatagcaa cattttcttt gcattccagt gcctggtgaa caccttcact 105000cattcttctt
ccgggcattg ttctacctgt atgctcatca atcaaaagaa cctcaccgtt 105060cctaaccaaa
taatccacat tctttttaaa taagaagctt gctctaagtg ttgcttgaac 105120aaatttcata
atttttaaat tagaaacaga gtaagcccat ctgaggctcc aagccgattc 105180cagc
105184229DNAArtificialsynthetic construct 2accatgggta aattattact
gatattagg
29324DNAArtificialsynthetic construct 3agcattagaa gattctttaa cagc
244750DNAUnknownnaturally occuring
gamma protobacterium 4atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca
ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala
Leu Pro Thr1 5 10 15ttt
gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe
Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20
25 30tct ttt tgg tta gtt act gct gct
tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala
Leu Leu Ala Ser Thr Val Phe Phe 35 40
45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act
192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr
50 55 60gta tct ggt ctt gtt act ggt att
gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile
Ala Phe Trp His Tyr Met Tyr Met65 70 75
80aga ggg gta tgg att gaa act ggt gat tcg cca act gta
ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val
Phe Arg Tyr 85 90 95att
gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336Ile
Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110att ctt gct gct gca act aat
gtt gct gga tca tta ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn
Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120
125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa
gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu
Ala 130 135 140gga atc atg gct gca tgg
cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp
Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150
155 160gta tac atg att tat gaa tta tgg gct gga gaa
gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu
Gly Lys Ser Ala Cys 165 170
175aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat
576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190att atc atc ttt ggt tgg
gcg att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205tac ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt
atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220aac ctt gct gac ttt
gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 2505250PRTUnknownNaturally Occuring gamma
protobaceterium 5Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu
Pro Thr1 5 10 15Phe Ala
Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20
25 30Ser Phe Trp Leu Val Thr Ala Ala Leu
Leu Ala Ser Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60Val Ser Gly Leu Val Thr Gly Ile Ala
Phe Trp His Tyr Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe
Arg Tyr 85 90 95Ile Asp
Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala
Gly Ser Leu Phe Lys Lys Leu 115 120
125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140Gly Ile Met Ala Ala Trp Pro
Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150
155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly
Lys Ser Ala Cys 165 170
175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190Ile Ile Ile Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 2506747DNAUnknownNaturally occuring
gamma protobacterium 6atg aaa tta tta ctg ata tta ggt agt gtt att gca ctt
cct aca ttt 48Met Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu
Pro Thr Phe1 5 10 15gct
gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt tct 96Ala
Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val Ser 20
25 30ttt tgg tta gtt act gct gct tta
tta gca tct act gta ttt ttc ttt 144Phe Trp Leu Val Thr Ala Ala Leu
Leu Ala Ser Thr Val Phe Phe Phe 35 40
45gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act gta
192Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr Val
50 55 60tct ggt ctt gtt act ggt att gct
ttc tgg cat tac atg tac atg aga 240Ser Gly Leu Val Thr Gly Ile Ala
Phe Trp His Tyr Met Tyr Met Arg65 70 75
80ggg gta tgg att gaa act ggt gat tcg cca act gta ttt
aga tac att 288Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe
Arg Tyr Ile 85 90 95gat
tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta att 336Asp
Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu Ile
100 105 110ctt gct gct gca act aat gtt
gct gga tca tta ttt aag aaa tta cta 384Leu Ala Ala Ala Thr Asn Val
Ala Gly Ser Leu Phe Lys Lys Leu Leu 115 120
125gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca
gga 432Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
Gly 130 135 140atc atg gct gca tgg cct
gca ttc att att ggg tgt tta gct tgg gta 480Ile Met Ala Ala Trp Pro
Ala Phe Ile Ile Gly Cys Leu Ala Trp Val145 150
155 160tac atg att tat gaa tta tgg gct gga gaa gga
aaa tct gca tgt aat 528Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly
Lys Ser Ala Cys Asn 165 170
175act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat att
576Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr Ile
180 185 190atc atc ttt ggt tgg gcg
att tat cct gta ggt tat ttc aca ggt tac 624Ile Ile Phe Gly Trp Ala
Ile Tyr Pro Val Gly Tyr Phe Thr Gly Tyr 195 200
205ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc
tat aac 672Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile
Tyr Asn 210 215 220ctt gct gac ttt gtt
aac aag att cta ttt ggt tta att ata tgg aat 720Leu Ala Asp Phe Val
Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp Asn225 230
235 240gtt gct gtt aaa gaa tct tct aat gct
747Val Ala Val Lys Glu Ser Ser Asn Ala
2457249PRTUnknownNaturally Occuring gamma protobaceterium 7Met Lys
Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr Phe1 5
10 15Ala Ala Gly Gly Gly Asp Leu Asp
Ala Ser Asp Tyr Thr Gly Val Ser 20 25
30Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
Phe 35 40 45Val Glu Arg Asp Arg
Val Ser Ala Lys Trp Lys Thr Ser Leu Thr Val 50 55
60Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr
Met Arg65 70 75 80Gly
Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr Ile
85 90 95Asp Trp Leu Leu Thr Val Pro
Leu Leu Ile Cys Glu Phe Tyr Leu Ile 100 105
110Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys
Leu Leu 115 120 125Val Gly Ser Leu
Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala Gly 130
135 140Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys
Leu Ala Trp Val145 150 155
160Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys Asn
165 170 175Thr Ala Ser Pro Ala
Val Gln Ser Ala Tyr Asn Thr Met Met Tyr Ile 180
185 190Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr
Phe Thr Gly Tyr 195 200 205Leu Met
Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr Asn 210
215 220Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly
Leu Ile Ile Trp Asn225 230 235
240Val Ala Val Lys Glu Ser Ser Asn Ala
2458750DNAUnknownNaturally occuring gamma protobacterium 8atg ggt aaa tta
tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu
Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5
10 15ttt gct gca ggt ggt ggt gac ctt gat gct
agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala
Ser Asp Tyr Thr Gly Val 20 25
30tct ttt tgg tta gtt act gct gct cta tta gca tct act gta ttt ttc
144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45ttt gtt gaa aga gat aga gtt tct
gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser
Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tcg ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg
240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65
70 75 80aga ggg gta tgg att
gag act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile
Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85
90 95att gat tgg tta cta aca gtt cct cta ttg ata
tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile
Cys Glu Phe Tyr Leu 100 105
110att ctt gct gct gca aca aat gtt gct gct ggc ctg ttt aag aaa tta
384Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125ttg gtt ggt tct ctt gtt atg
ctt gtg ttt ggt tac atg ggt gag gca 432Leu Val Gly Ser Leu Val Met
Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135
140gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg
480Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145
150 155 160gta tac atg att
tat gaa cta tgg gct gga gaa ggc aag gct gca tgt 528Val Tyr Met Ile
Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 165
170 175aat act gca agt cct gct gtg caa tca gct
tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala
Tyr Asn Thr Met Met Tyr 180 185
190ata atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt
624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205tac cta atg ggt gac ggt gga
tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly
Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215
220gac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg
720Asp Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225
230 235 240aat gtt gct gtt
aaa gaa tct tct aat gct 750Asn Val Ala Val
Lys Glu Ser Ser Asn Ala 245
2509250PRTUnknownNaturally Occuring gamma protobaceterium 9Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5
10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp
Ala Ser Asp Tyr Thr Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45Phe Val Glu Arg Asp Arg Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65
70 75 80Arg Gly Val Trp
Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85
90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu
Ile Cys Glu Phe Tyr Leu 100 105
110Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125Leu Val Gly Ser Leu Val Met
Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135
140Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala
Trp145 150 155 160Val Tyr
Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175Asn Thr Ala Ser Pro Ala Val
Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185
190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe
Thr Gly 195 200 205Tyr Leu Met Gly
Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220Asp Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly
Leu Ile Ile Trp225 230 235
240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25010750DNAUnknownnaturally occuring gamma protobacterium 10atg ggt
aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly
Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5
10 15ttt gct gca ggt ggt ggt gac ctt
gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu
Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30tct ttt tgg tta gct act gct gct tta tta gca tct act gta ttt
ttc 144Ser Phe Trp Leu Ala Thr Ala Ala Leu Leu Ala Ser Thr Val Phe
Phe 35 40 45ttt gtt gaa aga gat
aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp
Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg
tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met
Tyr Met65 70 75 80aga
ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg
Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta cta aca gtt
cct cta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val
Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105
110att ctt gct gct gct act aat gtt gct gga tca tta ttt aag
aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys
Lys Leu 115 120 125cta gtt ggt tct
ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser
Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140gga atc atg gct gca tgg cct gca ttc att att ggg
tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly
Cys Leu Ala Trp145 150 155
160gta tac atg att tat gaa cta tgg gct gga gaa gga aaa tct gca tgt
528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175aat act gca agt cct
gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro
Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180
185 190att atc atc ttt ggt tgg gcg att tat cct gta ggt
tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly
Tyr Phe Thr Gly 195 200 205tac ctg
atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu
Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220aac ctt gct gat ttt gtt aac aag att cta ttt
ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile Trp225 230 235
240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25011250PRTUnknownNaturally Occuring gamma protobaceterium 11Met
Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15Phe Ala Ala Gly Gly Gly Asp
Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30Ser Phe Trp Leu Ala Thr Ala Ala Leu Leu Ala Ser Thr Val
Phe Phe 35 40 45Phe Val Glu Arg
Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr
Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile
Gly Cys Leu Ala Trp145 150 155
160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175Asn Thr Ala Ser
Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180
185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val
Gly Tyr Phe Thr Gly 195 200 205Tyr
Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile Trp225 230 235
240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25012750DNAUnknownnaturally occuring gamma protobacterium
12atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca
48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15ttt gct gca ggt ggc ggt
gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly
Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30tct ttt tgg tta gtt aca gct gct cta tta gca tct act
gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr
Val Phe Phe 35 40 45ttt gtt gaa
aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu
Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat
tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His
Tyr Met Tyr Met65 70 75
80aga gga gta tgg att gaa act ggt gat tcg cct act gta ttt aga tac
288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta cta aca
gtt cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110att ctt gct gct gca act aat gtt gcc ggc tca tta
ttt aag aaa ctt 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125cta gtt
ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140gga att atg gca gct tgg cct gca ttc att att
ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile
Gly Cys Leu Ala Trp145 150 155
160gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt
528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175aat act gca agt cct
tcg gtt caa tca gct tac aac aca atg atg gct 576Asn Thr Ala Ser Pro
Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180
185 190atc ata gtc ttc ggt tgg gca att tat cct ata ggt
tat ttc aca ggt 624Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Ile Gly
Tyr Phe Thr Gly 195 200 205tac cta
atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672Tyr Leu
Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220aac ctt gct gac ttt gtt aac aag att cta ttt
ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile Trp225 230 235
240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25013250PRTUnknownNaturally Occuring gamma protobaceterium 13Met
Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15Phe Ala Ala Gly Gly Gly Asp
Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val
Phe Phe 35 40 45Phe Val Glu Arg
Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr
Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile
Gly Cys Leu Ala Trp145 150 155
160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175Asn Thr Ala Ser
Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180
185 190Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Ile
Gly Tyr Phe Thr Gly 195 200 205Tyr
Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile Trp225 230 235
240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25014750DNAUnknownnaturally occuring gamma protobacterium
14atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca
48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15ttt gct gca ggt ggt ggt
gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly
Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30tct ttt tgg tta gtt act gct gct cta tta gca tct act
gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr
Val Phe Phe 35 40 45ttt gtt gaa
aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu
Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tcg ggt ctt gtt act ggt att gct ttc tgg cat
tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His
Tyr Met Tyr Met65 70 75
80aga ggg gta tgg att gag acc ggt gat tcg cca act gta ttt aga tac
288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta cta aca
gtt cct cta ttg ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110att ctt gct gct gca aca aat gtt gct gct ggc ctg
ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu
Phe Lys Lys Leu 115 120 125ttg gtt
ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140gga att atg aac gct tgg ggt gca ttc gtt att
ggg tgt tta gct tgg 480Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile
Gly Cys Leu Ala Trp145 150 155
160gta tac atg att tat gaa cta tgg gct gga gaa ggc aag gct gca tgt
528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175aat act gca agt cct
gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro
Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180
185 190ata atc atc ttt ggt tgg gca att tat cct gta ggt
tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly
Tyr Phe Thr Gly 195 200 205tac cta
atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu
Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220aac ctt gct gac ttt gtt aac aag att cta ttt
ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile Trp225 230 235
240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25015250PRTUnknownNaturally Occuring gamma protobaceterium 15Met
Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15Phe Ala Ala Gly Gly Gly Asp
Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val
Phe Phe 35 40 45Phe Val Glu Arg
Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr
Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu
Phe Lys Lys Leu 115 120 125Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile
Gly Cys Leu Ala Trp145 150 155
160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175Asn Thr Ala Ser
Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180
185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val
Gly Tyr Phe Thr Gly 195 200 205Tyr
Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile Trp225 230 235
240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25016753DNAUnknownnaturally occuring gamma protobacterium
16atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca
48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1
5 10 15ttt gct gct gct ggt ggc
gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly
Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25
30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act
gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr
Val Phe Phe 35 40 45ttt gta gaa
aga gac caa gtc agc gct aag tgg aaa act tca ctt gct 192Phe Val Glu
Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Ala 50
55 60gta tct ggt tta att act ggt ata gct ttt tgg cat
tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His
Tyr Leu Tyr Met65 70 75
80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat
288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta tta act
gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110att ctt gct gct tgt aca agt gtt gct gct tca tta
ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu
Phe Lys Lys Leu 115 120 125cta gct
ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala
Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140gga tta gct cct gta tta cct gct ttc att att
ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile
Gly Met Ala Gly Trp145 150 155
160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta
528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175agt act gca agt cct
gct gtt aac tct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro
Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180
185 190att att gtt gtt gga tgg gca att tat cct gct gga
tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly
Tyr Ala Ala Gly 195 200 205tac cta
atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu
Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220tat aac ctt gcc gac ctt gtt aac aag att cta
ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile225 230 235
240tgg aat gtt gct gtt aaa gaa tct tct aat gct
753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25017251PRTUnknownNaturally Occuring gamma protobaceterium
17Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1
5 10 15Phe Ala Ala Ala Gly Gly
Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr
Val Phe Phe 35 40 45Phe Val Glu
Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Ala 50
55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His
Tyr Leu Tyr Met65 70 75
80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu
Phe Lys Lys Leu 115 120 125Leu Ala
Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile
Gly Met Ala Gly Trp145 150 155
160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175Ser Thr Ala Ser
Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180
185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala
Gly Tyr Ala Ala Gly 195 200 205Tyr
Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile
Leu Phe Gly Leu Ile Ile225 230 235
240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25018753DNAUnknownnaturally occuring gamma
protobacterium 18atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt
cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu
Pro Ser1 5 10 15ttt gct
gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala
Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20
25 30tca ttc tgg ctg gtt aca gct ggt atg
tta gcg gca act gta ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met
Leu Ala Ala Thr Val Phe Phe 35 40
45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act
192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt tta att act ggt ata gct
ttt tgg cat tat ctc tac atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala
Phe Trp His Tyr Leu Tyr Met65 70 75
80aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt
aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe
Arg Tyr 85 90 95att gat
tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp
Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110att ctt gct gct tgt aca agt gtt gct
gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala
Ala Ser Leu Phe Lys Lys Leu 115 120
125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct
432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140ggt tta gct cct gta tta cct gct
ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala
Phe Ile Ile Gly Met Ala Gly Trp145 150
155 160tta tac atg att tat gag cta cat atg ggt gaa ggt
aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly
Lys Ala Ala Val 165 170
175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg aag
576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys
180 185 190att att gtt att gga tgg
gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Ile Gly Trp
Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200
205tac cta atg agt ggt gac ggt gta tac gct tca aac tta aac
ctt ata 672Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn
Leu Ile 210 215 220tat aac ctt gct gac
ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp
Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230
235 240tgg aat gtt gct gtt aaa gaa tct tct aat
gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn
Ala 245 25019251PRTUnknownNaturally
Occuring gamma protobaceterium 19Met Gly Lys Leu Leu Leu Ile Leu Gly Ser
Ala Ile Ala Leu Pro Ser1 5 10
15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30Ser Phe Trp Leu Val Thr
Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser
Leu Thr 50 55 60Val Ser Gly Leu Ile
Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70
75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr
Pro Thr Val Phe Arg Tyr 85 90
95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110Ile Leu Ala Ala Cys
Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115
120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe
Ala Gly Glu Ala 130 135 140Gly Leu Ala
Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145
150 155 160Leu Tyr Met Ile Tyr Glu Leu
His Met Gly Glu Gly Lys Ala Ala Val 165
170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn
Ala Met Met Lys 180 185 190Ile
Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195
200 205Tyr Leu Met Ser Gly Asp Gly Val Tyr
Ala Ser Asn Leu Asn Leu Ile 210 215
220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225
230 235 240Trp Asn Val Ala
Val Lys Glu Ser Ser Asn Ala 245
25020753DNAUnknownnaturally occuring gamma protobacterium 20atg ggt aaa
tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5
10 15ttt gct gct gct ggt ggc gat cta gat
ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp
Ile Ser Asp Thr Val Gly Val 20 25
30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt
144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45ttt gta gaa aga gac caa gtc
agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg
240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65
70 75 80aga ggt gtt tgg
ata gac act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp
Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85
90 95att gat tgg tta tta act gtt cca tta caa
gtg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln
Val Val Glu Phe Tyr Leu 100 105
110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt
384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125cta gct ggt tca tta gta atg
tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met
Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135
140gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg
480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145
150 155 160tta tac atg att
tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile
Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165
170 175agt act gca agt cct gct gtt aac tct gca
tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala
Tyr Asn Ala Met Met Met 180 185
190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt
624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205tac cta atg ggt ggc gaa ggt
gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly
Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215
220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att
720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225
230 235 240tgg aat gtt gct
gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala
Val Lys Glu Ser Ser Asn Ala 245
25021251PRTUnknownNaturally Occuring gamma protobaceterium 21Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5
10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp
Ile Ser Asp Thr Val Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45Phe Val Glu Arg Asp Gln Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65
70 75 80Arg Gly Val Trp
Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85
90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln
Val Val Glu Phe Tyr Leu 100 105
110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125Leu Ala Gly Ser Leu Val Met
Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135
140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly
Trp145 150 155 160Leu Tyr
Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175Ser Thr Ala Ser Pro Ala Val
Asn Ser Ala Tyr Asn Ala Met Met Met 180 185
190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala
Ala Gly 195 200 205Tyr Leu Met Gly
Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile225 230 235
240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25022753DNAUnknownnaturally occuring gamma protobacterium 22atg
ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met
Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1
5 10 15ttt gct gct gct ggt ggc gat
cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp
Leu Asp Ile Ser Asp Thr Val Gly Val 20 25
30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg
ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val
Phe Phe 35 40 45ttt gta gaa aga
gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg
Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tct ggt tta att act ggt ata gct ttt tgg cat tat
ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr
Leu Tyr Met65 70 75
80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat
288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta tta act
gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110att ctt gct gct tgt aca aat gtt gct gct tca tta
ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu
Phe Lys Lys Leu 115 120 125cta gct
ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala
Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140gga ttg gct cct gta tgg cct gct ttc att att
ggt atg gct gga tgg 480Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile
Gly Met Ala Gly Trp145 150 155
160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta
528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175agt act gca agt cct
gct gtt aac tct gca tac aac gca atg atg gtg 576Ser Thr Ala Ser Pro
Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val 180
185 190att att gtt gtt gga tgg gca att tat cct gct gga
tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly
Tyr Ala Ala Gly 195 200 205tac cta
atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu
Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220tat aac ctt gcc gac ctt gtt aac aag att cta
ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile225 230 235
240tgg aat gtt gct gtt aaa gaa tct tct aat gct
753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25023251PRTUnknownNaturally Occuring gamma protobaceterium
23Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1
5 10 15Phe Ala Ala Ala Gly Gly
Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr
Val Phe Phe 35 40 45Phe Val Glu
Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His
Tyr Leu Tyr Met65 70 75
80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu
Phe Lys Lys Leu 115 120 125Leu Ala
Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile
Gly Met Ala Gly Trp145 150 155
160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175Ser Thr Ala Ser
Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val 180
185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala
Gly Tyr Ala Ala Gly 195 200 205Tyr
Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile
Leu Phe Gly Leu Ile Ile225 230 235
240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25024750DNAUnknownnaturally occuring gamma
protobacterium 24atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt
cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu
Pro Thr1 5 10 15ttt gct
gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala
Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20
25 30tct ttt tgg tta gtt act gct gct cta
tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu
Leu Ala Ser Thr Val Phe Phe 35 40
45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act
192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt ctt gtt act ggt att gct
ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala
Phe Trp His Tyr Met Tyr Met65 70 75
80aga ggg gta tgg att gag act ggt gat tcg cca act gta ttt
aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe
Arg Tyr 85 90 95att gat
tgg tta cta aca gtt cct cta ttg ata tgt gaa ttc tac tta 336Ile Asp
Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110att ctt gct gct gca aca aat gtt gct
gct ggc ctg ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala
Ala Gly Leu Phe Lys Lys Leu 115 120
125ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca
432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140gga att atg aac gct tgg cct gca
ttc att att ggg tgt tta gct tgg 480Gly Ile Met Asn Ala Trp Pro Ala
Phe Ile Ile Gly Cys Leu Ala Trp145 150
155 160gta tac atg att tat gaa cta tat gct gga gaa gga
aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly
Lys Ser Ala Cys 165 170
175aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct
576Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190atc ata gtc ttc ggt tgg
gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Val Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt
att tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220aac ctt gct gac ttt
gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25025250PRTUnknownNaturally Occuring gamma
protobaceterium 25Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu
Pro Thr1 5 10 15Phe Ala
Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20
25 30Ser Phe Trp Leu Val Thr Ala Ala Leu
Leu Ala Ser Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60Val Ser Gly Leu Val Thr Gly Ile Ala
Phe Trp His Tyr Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe
Arg Tyr 85 90 95Ile Asp
Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala
Ala Gly Leu Phe Lys Lys Leu 115 120
125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140Gly Ile Met Asn Ala Trp Pro
Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150
155 160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly
Lys Ser Ala Cys 165 170
175Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190Ile Ile Val Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25026750DNAUnknownnaturally occuring
gamma protobacterium 26atg ggt aaa tta tta ctg ata tta ggt agt gtt att
gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile
Ala Leu Pro Thr1 5 10
15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt
96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30tct ttt tgg tta gtt act gct
gct tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala
Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40
45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta
act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu
Thr 50 55 60gta tct ggt ctt gtt act
ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr
Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70
75 80aga ggg gta tgg att gaa act ggt gat tcg cca
act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro
Thr Val Phe Arg Tyr 85 90
95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta
336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110att ctt gct gct gct act
aat gtt gct gct ggc ctg ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr
Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120
125ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt
gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly
Glu Ala 130 135 140gga att atg aac gct
tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480Gly Ile Met Asn Ala
Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150
155 160gta tac atg att tat gag ctt tgg ctt gga
gaa gga aaa gct gcg tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Leu Gly
Glu Gly Lys Ala Ala Cys 165 170
175aat aca gca agt cct gct gtt cag tca gct tac aac aca atg atg atg
576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Met
180 185 190atc atc atc ttt ggt tgg
gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205tac cta atg ggt gac ggt gga tca gca ctt aac tta aac ctt
atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220aac ctt gct gac ttt
gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25027250PRTUnknownNaturally Occuring gamma
protobaceterium 27Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu
Pro Thr1 5 10 15Phe Ala
Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20
25 30Ser Phe Trp Leu Val Thr Ala Ala Leu
Leu Ala Ser Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60Val Ser Gly Leu Val Thr Gly Ile Ala
Phe Trp His Tyr Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe
Arg Tyr 85 90 95Ile Asp
Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala
Ala Gly Leu Phe Lys Lys Leu 115 120
125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140Gly Ile Met Asn Ala Trp Gly
Ala Phe Val Ile Gly Cys Leu Ala Trp145 150
155 160Val Tyr Met Ile Tyr Glu Leu Trp Leu Gly Glu Gly
Lys Ala Ala Cys 165 170
175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Met
180 185 190Ile Ile Ile Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25028750DNAUnknownnaturally occuring
gamma protobacterium 28atg ggt aaa tta tta ctg ata tta ggt agt gtt att
gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile
Ala Leu Pro Thr1 5 10
15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt
96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30tct ttt tgg tta gtt act gct
gct tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala
Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40
45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta
act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu
Thr 50 55 60gta tct ggt ctt gtt act
ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr
Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70
75 80aga ggg gta tgg att gaa act ggt gat tcg cca
act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro
Thr Val Phe Arg Tyr 85 90
95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta
336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110att ctt gct gct gca act
aat gtt gct gct ggc ctg ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr
Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120
125ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt
gag gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly
Glu Ala 130 135 140gga att atg aac gct
tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480Gly Ile Met Asn Ala
Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150
155 160gta tac atg att tat gaa cta tgg gct gga
gaa ggc aag gct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly
Glu Gly Lys Ala Ala Cys 165 170
175aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat
576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190ata atc atc ttt ggt tgg
gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt
atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220aac ctt gct gac ttt
gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25029250PRTUnknownNaturally Occuring gamma
protobaceterium 29Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu
Pro Thr1 5 10 15Phe Ala
Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20
25 30Ser Phe Trp Leu Val Thr Ala Ala Leu
Leu Ala Ser Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60Val Ser Gly Leu Val Thr Gly Ile Ala
Phe Trp His Tyr Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe
Arg Tyr 85 90 95Ile Asp
Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala
Ala Gly Leu Phe Lys Lys Leu 115 120
125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140Gly Ile Met Asn Ala Trp Gly
Ala Phe Val Ile Gly Cys Leu Ala Trp145 150
155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly
Lys Ala Ala Cys 165 170
175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190Ile Ile Ile Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25030750DNAUnknownnaturally occuring
gamma protobacterium 30atg ggt aaa tta tta ctg ata tta ggt agt gtt att
gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile
Ala Leu Pro Thr1 5 10
15ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt
96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30tct ttt tgg tta gtt aca gct
gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala
Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40
45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta
act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu
Thr 50 55 60gta tct ggt ctt gtt act
ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr
Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70
75 80aga ggg gta tgg att gaa act ggt gat tcg cca
act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro
Thr Val Phe Arg Tyr 85 90
95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta
336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110att ctt gct gct gct act
aat gtt gct gga tca tta ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr
Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120
125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt
gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly
Glu Ala 130 135 140caa att atg gct gca
tgg cct gca ttc att att ggg tgt tta gct tgg 480Gln Ile Met Ala Ala
Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150
155 160gta tac atg att tat gaa cta tat gct gga
gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly
Glu Gly Lys Ser Ala Cys 165 170
175aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct
576Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190atc ata gtc ttc ggt tgg
gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Val Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205tac cta atg ggt gac ggt ggg tca gct ctt aac tta aac ctt
att tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220aac ctt gct gac ttt
gtt aac aag att cta ctt ggt tta att ata tgg 720Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Leu Gly Leu Ile Ile Trp225 230
235 240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25031250PRTUnknownNaturally Occuring gamma
protobaceterium 31Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu
Pro Thr1 5 10 15Phe Ala
Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20
25 30Ser Phe Trp Leu Val Thr Ala Ala Leu
Leu Ala Ser Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60Val Ser Gly Leu Val Thr Gly Ile Ala
Phe Trp His Tyr Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe
Arg Tyr 85 90 95Ile Asp
Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala
Gly Ser Leu Phe Lys Lys Leu 115 120
125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140Gln Ile Met Ala Ala Trp Pro
Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150
155 160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly
Lys Ser Ala Cys 165 170
175Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala
180 185 190Ile Ile Val Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Leu Gly Leu Ile Ile Trp225 230
235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25032750DNAUnknownnaturally occuring
gamma protobacterium 32atg ggt aaa tta tta ctg ata tta ggt agt gtt att
gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile
Ala Leu Pro Thr1 5 10
15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt
96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30tct ttt tgg tta gtt act gct
gct tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala
Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40
45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta
act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu
Thr 50 55 60gta tct ggt ctt gtt act
ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr
Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70
75 80aga ggg gta tgg att gaa act ggt gat tcg cca
act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro
Thr Val Phe Arg Tyr 85 90
95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta
336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110att ctt gct gct gca gct
aat gtt gct gga tca tta ttt aag aaa tta 384Ile Leu Ala Ala Ala Ala
Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120
125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt
gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly
Glu Ala 130 135 140gga atc atg gct gca
tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala
Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150
155 160gta tac atg att tat gaa tta tgg gct gga
gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly
Glu Gly Lys Ser Ala Cys 165 170
175aat act gca agt cct gct gtg caa tca gcc tac aac aca atg atg tat
576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190att atc atc ttt ggt tgg
gcg att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205tac ttg atg ggt gac ggt gga tca gct ctt aac tta aac ctt
atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220aac ctt gct gac ttt
gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25033250PRTUnknownNaturally Occuring gamma
protobaceterium 33Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu
Pro Thr1 5 10 15Phe Ala
Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20
25 30Ser Phe Trp Leu Val Thr Ala Ala Leu
Leu Ala Ser Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60Val Ser Gly Leu Val Thr Gly Ile Ala
Phe Trp His Tyr Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe
Arg Tyr 85 90 95Ile Asp
Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Ala Asn Val Ala
Gly Ser Leu Phe Lys Lys Leu 115 120
125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala
130 135 140Gly Ile Met Ala Ala Trp Pro
Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150
155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly
Lys Ser Ala Cys 165 170
175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190Ile Ile Ile Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25034750DNAUnknownnaturally occuring
gamma protobacterium 34atg ggt aaa tta tta ctg ata ata ggt agt gtt att
gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Ile Gly Ser Val Ile
Ala Leu Pro Thr1 5 10
15ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt
96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30tct ttt tgg tta gtt aca gct
gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala
Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40
45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta
act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu
Thr 50 55 60gta tct ggt ctt gtt act
ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr
Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70
75 80aga gga gta tgg att gaa act ggt gat tcg cca
act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro
Thr Val Phe Arg Tyr 85 90
95att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta
336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110att ctt gct gct gca act
aat gtt gcc ggc tca tta ttt aag aaa ctt 384Ile Leu Ala Ala Ala Thr
Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120
125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt
gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly
Glu Ala 130 135 140gga att atg gca gct
tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala
Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150
155 160gta tat atg att tat gaa cta tat gct gga
gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly
Glu Gly Lys Ser Ala Cys 165 170
175aat aca gca agt cct gct gtg caa tca gct tac aac aca atg atg tat
576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190att atc gtc ttt ggt tgg
gcg att tat cct gta ggt tat ttc aca ggt 624Ile Ile Val Phe Gly Trp
Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200
205tac ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt
atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu
Ile Tyr 210 215 220aac ctt gct gac ttt
gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe
Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230
235 240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25035250PRTUnknownNaturally Occurring
gamma proteobacterium 35Met Gly Lys Leu Leu Leu Ile Ile Gly Ser Val Ile
Ala Leu Pro Thr1 5 10
15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val
20 25 30Ser Phe Trp Leu Val Thr Ala
Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu
Thr 50 55 60Val Ser Gly Leu Val Thr
Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70
75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro
Thr Val Phe Arg Tyr 85 90
95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu
100 105 110Ile Leu Ala Ala Ala Thr
Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120
125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly
Glu Ala 130 135 140Gly Ile Met Ala Ala
Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150
155 160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly
Glu Gly Lys Ser Ala Cys 165 170
175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr
180 185 190Ile Ile Val Phe Gly
Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195
200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu
Asn Leu Ile Tyr 210 215 220Asn Leu Ala
Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225
230 235 240Asn Val Ala Val Lys Glu Ser
Ser Asn Ala 245
25036750DNAUnknownnaturally occuring gamma protobacterium 36atg ggt aaa
tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5
10 15ttt gct gca ggt ggt ggt gac ctt gat
gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp
Ala Ser Asp Tyr Thr Gly Val 20 25
30tct ttt tgg tta gtt act gct gct cta tta gca tct act gta ttt ttc
144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45ttt gtt gaa aga gat aga gtt
tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tcg ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg
240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65
70 75 80aga ggg gta tgg
att gag act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp
Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85
90 95att gat tgg tta cta aca gtt cct cta ttg
ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu
Ile Cys Glu Phe Tyr Leu 100 105
110att ctt gct gct gca aca aat gtt gct gct ggc ctg ttt aag aaa tta
384Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125ttg gtt ggt tct ctt gtt atg
ctt gtg ttt ggt tac atg ggt gag gca 432Leu Val Gly Ser Leu Val Met
Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135
140gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg
480Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145
150 155 160gta tac atg att
tat gaa cta tgg gct gga gaa ggc aag gct gca tgt 528Val Tyr Met Ile
Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 165
170 175aat act gca agt cct gct gtg caa tca gct
tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala
Tyr Asn Thr Met Met Tyr 180 185
190ata atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt
624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly
195 200 205tac cta atg ggt gac ggt gga
tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly
Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215
220aac ctt gct gac ttt gtt aac aag aat cta ttt ggt tta att ata tgg
720Asn Leu Ala Asp Phe Val Asn Lys Asn Leu Phe Gly Leu Ile Ile Trp225
230 235 240aat gtt gct gtt
aaa gaa tct tct aat gct 750Asn Val Ala Val
Lys Glu Ser Ser Asn Ala 245
25037250PRTUnknownNaturally Occuring gamma protobaceterium 37Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5
10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp
Ala Ser Asp Tyr Thr Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe
35 40 45Phe Val Glu Arg Asp Arg Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65
70 75 80Arg Gly Val Trp
Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85
90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu
Ile Cys Glu Phe Tyr Leu 100 105
110Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu
115 120 125Leu Val Gly Ser Leu Val Met
Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135
140Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala
Trp145 150 155 160Val Tyr
Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys
165 170 175Asn Thr Ala Ser Pro Ala Val
Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185
190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe
Thr Gly 195 200 205Tyr Leu Met Gly
Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220Asn Leu Ala Asp Phe Val Asn Lys Asn Leu Phe Gly
Leu Ile Ile Trp225 230 235
240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25038750DNAUnknownnaturally occuring gamma protobacterium 38atg ggt
aaa tta tta cgg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly
Lys Leu Leu Arg Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5
10 15ttt gct gca ggt ggc ggt gac ctt
gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu
Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt
ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe
Phe 35 40 45ttt gtt gaa aga gat
aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp
Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg
tat atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met
Tyr Met65 70 75 80aga
gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg
Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta cta aca gtt
cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val
Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105
110att ctt gct gct gca act aat gtt gct gga tca tta ttt aag
aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys
Lys Leu 115 120 125cta gtt ggt tct
ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser
Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140gga atc atg gct gca tgg cct gca ttc att att ggg
tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly
Cys Leu Ala Trp145 150 155
160gta tac atg att tat gaa cta tgg gct gga gaa gga aaa tct gca tgt
528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175aat act gca agt cct
gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro
Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180
185 190atc atc atc gtt ggt tgg gcg att tat cct gta ggt
tat ttc aca ggt 624Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val Gly
Tyr Phe Thr Gly 195 200 205tac ctg
atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu
Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220aac ctt gct gac ttt gtt aac aag att cta ttt
ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile Trp225 230 235
240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25039250PRTUnknownNaturally Occuring gamma protobaceterium 39Met
Gly Lys Leu Leu Arg Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15Phe Ala Ala Gly Gly Gly Asp
Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val
Phe Phe 35 40 45Phe Val Glu Arg
Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr
Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile
Gly Cys Leu Ala Trp145 150 155
160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175Asn Thr Ala Ser
Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180
185 190Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val
Gly Tyr Phe Thr Gly 195 200 205Tyr
Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile Trp225 230 235
240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25040750DNAUnknownnaturally occuring gamma protobacterium
40atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca
48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15ttt gct gca ggt ggc ggt
gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly
Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30tct ttt tgg tta gtt aca gct gct cta tta gca tct act
gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr
Val Phe Phe 35 40 45ttt gtt gaa
aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu
Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat
tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His
Tyr Met Tyr Met65 70 75
80aga gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac
288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta cta aca
gtt cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110att ctt gct gct gca act aat gtt gcc ggc tca tta
ttt aag aaa ctt 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125cta gtt
ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140gga att atg gca gct tgg cct gca ttc att att
ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile
Gly Cys Leu Ala Trp145 150 155
160gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt
528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175aat act gca agt cct
tcg gtt caa tca gct tac aac aca atg atg gct 576Asn Thr Ala Ser Pro
Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180
185 190atc ata gtc ttc ggt tgg gca att tat cct gta ggt
tat ttc aca ggt 624Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly
Tyr Phe Thr Gly 195 200 205tac cta
atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672Tyr Leu
Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220aac ctt gct gac ttt gtt aac aag att cta ttt
ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile Trp225 230 235
240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25041250PRTUnknownNaturally Occuring gamma protobaceterium 41Met
Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15Phe Ala Ala Gly Gly Gly Asp
Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val
Phe Phe 35 40 45Phe Val Glu Arg
Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr
Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile
Gly Cys Leu Ala Trp145 150 155
160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175Asn Thr Ala Ser
Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180
185 190Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val
Gly Tyr Phe Thr Gly 195 200 205Tyr
Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile Trp225 230 235
240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25042750DNAUnknownnaturally occuring gamma protobacterium
42atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca
48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15ttt gct gca ggt ggt ggt
gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly
Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30tct ttt tgg tta gtt act gct gct tta tta gca tct act
gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr
Val Phe Phe 35 40 45ttt gtt gaa
aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu
Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat
tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His
Tyr Met Tyr Met65 70 75
80aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac
288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta cta aca
gtt cct cta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110att ctt gct gct gct act aat gtt gcc ggc tca tta
ttt aag aaa ctt 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125cta gtt
ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140gga att atg gca gct tgg cct gca ttc att att
ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile
Gly Cys Leu Ala Trp145 150 155
160gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt
528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175aat act gca agt cct
tcg gtt caa tca gct tac aac aca atg atg gct 576Asn Thr Ala Ser Pro
Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180
185 190atc ata gtc ttc ggt tgg gca att tat cct gta ggt
tat ttc aca ggt 624Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly
Tyr Phe Thr Gly 195 200 205tac cta
atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672Tyr Leu
Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220aac ctt gct gac ttt gtt aac aag att cta ttt
ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile Trp225 230 235
240aat gct gct gtt aaa gaa tct tct aat gct
750Asn Ala Ala Val Lys Glu Ser Ser Asn Ala 245
25043250PRTUnknownNaturally Occuring gamma protobaceterium 43Met
Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15Phe Ala Ala Gly Gly Gly Asp
Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val
Phe Phe 35 40 45Phe Val Glu Arg
Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr
Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile
Gly Cys Leu Ala Trp145 150 155
160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175Asn Thr Ala Ser
Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180
185 190Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val
Gly Tyr Phe Thr Gly 195 200 205Tyr
Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile Trp225 230 235
240Asn Ala Ala Val Lys Glu Ser Ser Asn Ala 245
25044750DNAUnknownnaturally occuring gamma protobacterium
44atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca
48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15ttt gct gca ggt ggt ggt
gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly
Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30tct ttt tgg tta gtt act gct gct tta tta gca tct act
gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr
Val Phe Phe 35 40 45ttt gtt gaa
aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu
Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat
tac atg tat atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His
Tyr Met Tyr Met65 70 75
80aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac
288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95ata gat tgg tta cta aca
gtt cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110att ctt gcc gct gca act aat gtt gct gga tca tta
ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125ctt gtt
ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140gga atc atg gct gca tgg cct gca ttc att att
ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile
Gly Cys Leu Ala Trp145 150 155
160gta tac atg att tat gaa cta tgg gct gga gaa gga aaa tct gca tgt
528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175aat act gca agt cct
gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro
Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180
185 190atc atc atc ttt ggt tgg gcg att tat cct gta ggt
tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly
Tyr Phe Thr Gly 195 200 205tac ctt
atg ggt gac ggt gga tca gca ctt aac tta aac ctt att tat 672Tyr Leu
Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220aac ctt gct gac ttt gtt aac aag att cta ttt
ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile Trp225 230 235
240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25045250PRTUnknownNaturally Occuring gamma protobaceterium 45Met
Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15Phe Ala Ala Gly Gly Gly Asp
Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val
Phe Phe 35 40 45Phe Val Glu Arg
Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr
Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile
Gly Cys Leu Ala Trp145 150 155
160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175Asn Thr Ala Ser
Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180
185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val
Gly Tyr Phe Thr Gly 195 200 205Tyr
Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile Trp225 230 235
240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25046750DNAUnknownnaturally occuring gamma protobacterium
46atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca
48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15ttt gct gca ggt ggc ggt
gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly
Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30tct ttt tgg tta gtt aca gct gct cta tta gcg tct act
gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr
Val Phe Phe 35 40 45ttt gtt gaa
aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu
Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat
tac atg tat atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His
Tyr Met Tyr Met65 70 75
80aga gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac
288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta cta aca
gtt cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110att ctt gct gct gca act aat gtt gcc ggc tca tta
ttt aag aaa ctt 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125cta gtt
ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140gga ata atg gcg gct tgg cct gca ttc atc gtt
gga tgt tta gca tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Val
Gly Cys Leu Ala Trp145 150 155
160gta tat atg att tat gaa cta tgg gct ggt gaa gga aaa tct gca tgt
528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175aat act gca agt cct
gct gta cag tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro
Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180
185 190atc atc atc gtt ggt tgg gca att tat cct gta ggt
tat ttc aca ggt 624Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val Gly
Tyr Phe Thr Gly 195 200 205tac cta
atg ggt gac ggt gga tca gct ctt aat cta aac ctt att tat 672Tyr Leu
Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220aac ctt gct gac ttt gtt aac aag att cta ttt
ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile Trp225 230 235
240aat gtt gct gtt aaa gaa tct tct aat gct
750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25047250PRTUnknownNaturally Occuring gamma protobaceterium 47Met
Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1
5 10 15Phe Ala Ala Gly Gly Gly Asp
Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val
Phe Phe 35 40 45Phe Val Glu Arg
Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr
Met Tyr Met65 70 75
80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu
Phe Lys Lys Leu 115 120 125Leu Val
Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130
135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Val
Gly Cys Leu Ala Trp145 150 155
160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys
165 170 175Asn Thr Ala Ser
Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180
185 190Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val
Gly Tyr Phe Thr Gly 195 200 205Tyr
Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210
215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile Trp225 230 235
240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25048753DNAUnknownnaturally occuring gamma protobacterium
48atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca
48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1
5 10 15ttt gct gct gct ggt ggc
gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly
Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25
30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act
gta ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr
Val Phe Phe 35 40 45ttt gta gaa
aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu
Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt tta att act ggt ata gct ttt tgg cat
tat ctc tac atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His
Tyr Leu Tyr Met65 70 75
80aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt aga tat
288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg cta tta act
gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110att ctt gct gct tgt aca agt gtt gct gct tca tta
ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu
Phe Lys Lys Leu 115 120 125cta gct
ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala
Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140ggt tta gct cct gta tta cct gct ttc att ctt
ggt atg gct ggt tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Leu
Gly Met Ala Gly Trp145 150 155
160tta tac atg att tat gag cta cat atg ggt gaa ggt aag gct gct gta
528Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val
165 170 175agt act gca agt cct
gct gtt aac tct gct tac aat gca atg atg aag 576Ser Thr Ala Ser Pro
Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys 180
185 190att att gtt att gga tgg gca att tat cct gct gga
tat gct gct ggt 624Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly
Tyr Ala Ala Gly 195 200 205tac cta
atg agt ggt gac ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu
Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220tat aac ctt gct gac ttt gtt aac aag att cta
ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile225 230 235
240tgg aat gtt gct gtt aaa gaa tct tct aat gct
753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25049251PRTUnknownNaturally Occuring gamma protobaceterium
49Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1
5 10 15Phe Ala Ala Ala Gly Gly
Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr
Val Phe Phe 35 40 45Phe Val Glu
Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His
Tyr Leu Tyr Met65 70 75
80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu
Phe Lys Lys Leu 115 120 125Leu Ala
Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Leu
Gly Met Ala Gly Trp145 150 155
160Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val
165 170 175Ser Thr Ala Ser
Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys 180
185 190Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala
Gly Tyr Ala Ala Gly 195 200 205Tyr
Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile
Leu Phe Gly Leu Ile Ile225 230 235
240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25050753DNAUnknownnaturally occuring gamma
protobacterium 50atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt
cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu
Pro Ser1 5 10 15ttt gct
gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala
Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20
25 30tca ttc tgg ctg gtt aca gct ggt atg
tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met
Leu Ala Ala Thr Val Phe Phe 35 40
45ttt gta gaa aga gac caa gtc agc gct gag tgg aaa act tca ctt act
192Phe Val Glu Arg Asp Gln Val Ser Ala Glu Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt tta att act ggt ata gct
ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala
Phe Trp His Tyr Leu Tyr Met65 70 75
80aga ggt gtt tgg ata gat act ggt gat acc cca aca gta ttc
aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe
Arg Tyr 85 90 95att gat
tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp
Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110att ctt gct gct tgt aca agt gtt gct
gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala
Ala Ser Leu Phe Lys Lys Leu 115 120
125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct
432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140gga tta gct cct gta tta cct gct
ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala
Phe Ile Ile Gly Met Ala Gly Trp145 150
155 160tta tac atg att tat gag cta tat atg ggt gaa ggt
aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly
Lys Ala Ala Val 165 170
175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg
576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met
180 185 190att att gtt gtt gga tgg
gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp
Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200
205tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac
ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn
Leu Ile 210 215 220tat aac ctt gct gac
ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp
Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230
235 240tgg aat gtt gct gtt aaa gaa tct tct aat
gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn
Ala 245 25051251PRTUnknownNaturally
Occuring gamma protobaceterium 51Met Gly Lys Leu Leu Leu Ile Leu Gly Ser
Ala Ile Ala Leu Pro Ser1 5 10
15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30Ser Phe Trp Leu Val Thr
Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Gln Val Ser Ala Glu Trp Lys Thr Ser
Leu Thr 50 55 60Val Ser Gly Leu Ile
Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70
75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr
Pro Thr Val Phe Arg Tyr 85 90
95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110Ile Leu Ala Ala Cys
Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115
120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe
Ala Gly Glu Ala 130 135 140Gly Leu Ala
Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145
150 155 160Leu Tyr Met Ile Tyr Glu Leu
Tyr Met Gly Glu Gly Lys Ala Ala Val 165
170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn
Ala Met Met Met 180 185 190Ile
Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195
200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr
Ala Ser Asn Leu Asn Leu Ile 210 215
220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225
230 235 240Trp Asn Val Ala
Val Lys Glu Ser Ser Asn Ala 245
25052753DNAUnknownnaturally occuring gamma protobacterium 52atg ggt aaa
tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5
10 15ttt gct gct gct ggt ggc gat cta gat
ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp
Ile Ser Asp Thr Val Gly Val 20 25
30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt
144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45ttt gta gaa aga gac caa gtc
agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tct ggt tta att act ggt ata gcc ttt tgg cat tat ctc tat atg
240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65
70 75 80aga ggt gtt tgg
ata gac act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp
Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85
90 95att gat tgg tta tta act gtt cca tta caa
atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln
Met Val Glu Phe Tyr Leu 100 105
110att ctt gct gct tgt aca aat gtt gct gct tca tta ttt aag aag ctt
384Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125cta gct ggt tca tta gta atg
tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met
Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135
140gga tta gct cct gta tgg cct gct ttc att att ggt atg gct gga tgg
480Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145
150 155 160tta tac atg att
tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile
Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165
170 175agt act gca agt cct gct gtt aac tct gca
tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala
Tyr Asn Ala Met Met Met 180 185
190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt
624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205tac cta atg ggt ggc gaa ggt
gta tac gct tca aac cta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly
Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215
220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att
720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225
230 235 240tgg aat gtt gct
gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala
Val Lys Glu Ser Ser Asn Ala 245
25053251PRTUnknownNaturally Occuring gamma protobaceterium 53Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5
10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp
Ile Ser Asp Thr Val Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45Phe Val Glu Arg Asp Gln Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65
70 75 80Arg Gly Val Trp
Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85
90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln
Met Val Glu Phe Tyr Leu 100 105
110Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125Leu Ala Gly Ser Leu Val Met
Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135
140Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly
Trp145 150 155 160Leu Tyr
Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175Ser Thr Ala Ser Pro Ala Val
Asn Ser Ala Tyr Asn Ala Met Met Met 180 185
190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala
Ala Gly 195 200 205Tyr Leu Met Gly
Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile225 230 235
240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25054753DNAUnknownnaturally occuring gamma protobacterium 54atg
ggt aaa tta tta ctg ata tta ggt agt gct att gcg ctt cca tca 48Met
Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1
5 10 15ttt gct gct gct ggt ggc gat
cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp
Leu Asp Ile Ser Asp Thr Val Gly Val 20 25
30tca ttc tgg ctg gtt acg gct ggt atg tta gcg gca act gta
ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val
Phe Phe 35 40 45ttt gta gaa aga
gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg
Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tct ggt tta att act ggt ata gct ttt tgg cat tat
ctc tac atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr
Leu Tyr Met65 70 75
80aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt aga tat
288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta tta act
gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110att ctt gcc gct tgt aca agt gtt gct gct tca tta
ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu
Phe Lys Lys Leu 115 120 125cta gct
ggt tca ttg gta atg tta ggt gct gga tct gca ggc gaa gct 432Leu Ala
Gly Ser Leu Val Met Leu Gly Ala Gly Ser Ala Gly Glu Ala 130
135 140gga tta gct cct gta tta cct gct ttc att att
ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile
Gly Met Ala Gly Trp145 150 155
160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta
528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175agt act gca agt cct
gct gtt aac tct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro
Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180
185 190att att gtt gtt gga tgg gca att tat cct gct gga
tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly
Tyr Ala Ala Gly 195 200 205tac cta
atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctc ata 672Tyr Leu
Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220tat aac ctt gct gac ttt gtt aac aag att cta
ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile225 230 235
240tgg aat gtt gct gtt aaa gaa tct tct aat gct
753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25055251PRTUnknownNaturally Occuring gamma protobaceterium
55Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1
5 10 15Phe Ala Ala Ala Gly Gly
Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr
Val Phe Phe 35 40 45Phe Val Glu
Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His
Tyr Leu Tyr Met65 70 75
80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu
Phe Lys Lys Leu 115 120 125Leu Ala
Gly Ser Leu Val Met Leu Gly Ala Gly Ser Ala Gly Glu Ala 130
135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile
Gly Met Ala Gly Trp145 150 155
160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175Ser Thr Ala Ser
Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180
185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala
Gly Tyr Ala Ala Gly 195 200 205Tyr
Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile
Leu Phe Gly Leu Ile Ile225 230 235
240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25056753DNAUnknownnaturally occuring gamma
protobacterium 56atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt
cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu
Pro Ser1 5 10 15ttt gct
gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala
Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20
25 30tca ttc tgg ctg gtt aca gct ggt atg
tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met
Leu Ala Ala Thr Val Phe Phe 35 40
45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act
192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt tta att act ggt ata gct
ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala
Phe Trp His Tyr Leu Tyr Met65 70 75
80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc
aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe
Arg Tyr 85 90 95att gat
tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp
Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110att ctt gct gct tgt aca aat gtt gct
gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Asn Val Ala
Ala Ser Leu Phe Lys Lys Leu 115 120
125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct
432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140gga tta gct cct gta tgg cct gct
ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Trp Pro Ala
Phe Ile Ile Gly Met Ala Gly Trp145 150
155 160tta tac atg att tat gag cta tat atg ggt gaa ggt
aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly
Lys Ala Ala Val 165 170
175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg gtg
576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val
180 185 190att att gtt gtt gga tgg
gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp
Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200
205tac cta atg ggt ggc gaa ggt gta tac gct tca aac cta aac
ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn
Leu Ile 210 215 220tat aac ctt gct gac
ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp
Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230
235 240tgg aat gtt gct gtt aaa gaa tct tct aat
gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn
Ala 245 25057251PRTUnknownNaturally
Occuring gamma protobaceterium 57Met Gly Lys Leu Leu Leu Ile Leu Gly Ser
Ala Ile Ala Leu Pro Ser1 5 10
15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30Ser Phe Trp Leu Val Thr
Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser
Leu Thr 50 55 60Val Ser Gly Leu Ile
Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70
75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr
Pro Thr Val Phe Arg Tyr 85 90
95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110Ile Leu Ala Ala Cys
Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu 115
120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe
Ala Gly Glu Ala 130 135 140Gly Leu Ala
Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145
150 155 160Leu Tyr Met Ile Tyr Glu Leu
Tyr Met Gly Glu Gly Lys Ala Ala Val 165
170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn
Ala Met Met Val 180 185 190Ile
Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195
200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr
Ala Ser Asn Leu Asn Leu Ile 210 215
220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225
230 235 240Trp Asn Val Ala
Val Lys Glu Ser Ser Asn Ala 245
25058753DNAUnknownnaturally occuring gamma protobacterium 58atg ggt aaa
tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5
10 15ttt gct gct gct ggt ggc gat cta gat
ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp
Ile Ser Asp Thr Val Gly Val 20 25
30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt
144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45ttt gta gaa aga gac caa gtc
agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg
240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65
70 75 80aga ggt gtt tgg
ata gac act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp
Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85
90 95att gat tgg tta tta act gtt cca tta caa
atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln
Met Val Glu Phe Tyr Leu 100 105
110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt
384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125cta gct ggt tca tta gta atg
tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met
Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135
140gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg
480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145
150 155 160tta tac atg att
tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile
Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165
170 175agt act gca agt cct gct gtt aac tct gca
tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala
Tyr Asn Ala Met Met Met 180 185
190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt
624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205tac cta atg ggt ggc gaa ggt
gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly
Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215
220tat aac ctt gct gac ctt gtt aac aag att cta ttt ggt ttg atc att
720Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225
230 235 240tgg aat gtt gct
gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala
Val Lys Glu Ser Ser Asn Ala 245
25059251PRTUnknownNaturally Occuring gamma protobaceterium 59Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5
10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp
Ile Ser Asp Thr Val Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45Phe Val Glu Arg Asp Gln Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65
70 75 80Arg Gly Val Trp
Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85
90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln
Met Val Glu Phe Tyr Leu 100 105
110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125Leu Ala Gly Ser Leu Val Met
Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135
140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly
Trp145 150 155 160Leu Tyr
Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175Ser Thr Ala Ser Pro Ala Val
Asn Ser Ala Tyr Asn Ala Met Met Met 180 185
190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala
Ala Gly 195 200 205Tyr Leu Met Gly
Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile225 230 235
240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25060753DNAUnknownnaturally occuring gamma protobacterium 60atg
ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met
Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1
5 10 15ttt gct gct gct ggt ggc gat
cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp
Leu Asp Ile Ser Asp Thr Val Gly Val 20 25
30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg
ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val
Phe Phe 35 40 45ttt gta gaa aga
gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg
Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tct ggt tta att act ggt ata gct ttt tgg cat tat
ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr
Leu Tyr Met65 70 75
80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat
288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95att gat tgg tta tta act
gtt cca tta caa gtg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr
Val Pro Leu Gln Val Val Glu Phe Tyr Leu 100
105 110att ctt gct gct tgt aca agt gtt gct gct tca tta
ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu
Phe Lys Lys Leu 115 120 125cta gct
ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala
Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140gga tta gct cct gta tta cct gct ttc att att
ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile
Gly Met Ala Gly Trp145 150 155
160tta tac atg att tat gag cta tat atg ggt gaa ggc aag gct gct gta
528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175agt act gca agt cct
gct gtt aac cct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro
Ala Val Asn Pro Ala Tyr Asn Ala Met Met Met 180
185 190att att gtt gtt gga tgg gca att tat cct gct gga
tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly
Tyr Ala Ala Gly 195 200 205tac cta
atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu
Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220tat aac ctt gct gac ttt gtt aac aag att cta
ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu
Phe Gly Leu Ile Ile225 230 235
240tgg aat gtt gct gtt aaa gaa tct tct aat gct
753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
25061251PRTUnknownNaturally Occuring gamma protobaceterium
61Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1
5 10 15Phe Ala Ala Ala Gly Gly
Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr
Val Phe Phe 35 40 45Phe Val Glu
Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His
Tyr Leu Tyr Met65 70 75
80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr
85 90 95Ile Asp Trp Leu Leu Thr
Val Pro Leu Gln Val Val Glu Phe Tyr Leu 100
105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu
Phe Lys Lys Leu 115 120 125Leu Ala
Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile
Gly Met Ala Gly Trp145 150 155
160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175Ser Thr Ala Ser
Pro Ala Val Asn Pro Ala Tyr Asn Ala Met Met Met 180
185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala
Gly Tyr Ala Ala Gly 195 200 205Tyr
Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile
Leu Phe Gly Leu Ile Ile225 230 235
240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala
245 25062753DNAUnknownnaturally occuring gamma
protobacterium 62atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt
cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu
Pro Ser1 5 10 15ttt gct
gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala
Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20
25 30tca ttc tgg ctg gtt aca gct ggt atg
tta gcg gca act gta ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met
Leu Ala Ala Thr Val Phe Phe 35 40
45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act
192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50
55 60gta tct ggt tta att act ggt ata gct
ttt tgg cat tat ctc tac atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala
Phe Trp His Tyr Leu Tyr Met65 70 75
80aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt
aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe
Arg Tyr 85 90 95att gat
tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp
Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100
105 110att ctt gct gct tgt aca agt gtt gct
gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala
Ala Ser Leu Phe Lys Lys Leu 115 120
125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct
432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130
135 140ggt tta gct cct gta tta cct gct
ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala
Phe Ile Ile Gly Met Ala Gly Trp145 150
155 160tta tac atg att tat gag cta cat atg ggt gaa ggt
aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly
Lys Ala Ala Val 165 170
175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg aag
576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys
180 185 190att att gtt att gga tgg
gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Ile Gly Trp
Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200
205tac cta atg agt ggt gac ggt gta tac gct tca aac tta aac
ctt ata 672Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn
Leu Ile 210 215 220tat aac ctt gct gac
ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp
Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230
235 240tgg aat gtt gct gtt aaa gaa tct tct aat
gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn
Ala 245 25063251PRTUnknownNaturally
Occuring gamma protobaceterium 63Met Gly Lys Leu Leu Leu Ile Leu Gly Ser
Ala Ile Ala Leu Pro Ser1 5 10
15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val
20 25 30Ser Phe Trp Leu Val Thr
Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40
45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser
Leu Thr 50 55 60Val Ser Gly Leu Ile
Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70
75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr
Pro Thr Val Phe Arg Tyr 85 90
95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu
100 105 110Ile Leu Ala Ala Cys
Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115
120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe
Ala Gly Glu Ala 130 135 140Gly Leu Ala
Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145
150 155 160Leu Tyr Met Ile Tyr Glu Leu
His Met Gly Glu Gly Lys Ala Ala Val 165
170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn
Ala Met Met Lys 180 185 190Ile
Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195
200 205Tyr Leu Met Ser Gly Asp Gly Val Tyr
Ala Ser Asn Leu Asn Leu Ile 210 215
220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225
230 235 240Trp Asn Val Ala
Val Lys Glu Ser Ser Asn Ala 245
25064753DNAUnknownnaturally occuring gamma protobacterium 64atg ggt aaa
tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5
10 15ttt gct gct gct ggt ggc gat cta gat
ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp
Ile Ser Asp Thr Val Gly Val 20 25
30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt
144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45ttt gta gaa aga gac caa gtc
agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg
240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65
70 75 80aga ggt gtt tgg
ata gat act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp
Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85
90 95att gat tgg tta tta act gtt cca tta caa
atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln
Met Val Glu Phe Tyr Leu 100 105
110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt
384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125cta gct ggt tca tta gta atg
tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met
Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135
140gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg
480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145
150 155 160cta tac atg att
tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile
Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165
170 175agt act gca agt cct gct gtt aac tct gca
tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala
Tyr Asn Ala Met Met Met 180 185
190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt
624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly
195 200 205tac cta atg ggt ggc gaa ggc
gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly
Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215
220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att
720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225
230 235 240tgg aat gtt gct
gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala
Val Lys Glu Ser Ser Asn Ala 245
25065251PRTUnknownNaturally Occuring gamma protobaceterium 65Met Gly Lys
Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5
10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp
Ile Ser Asp Thr Val Gly Val 20 25
30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe
35 40 45Phe Val Glu Arg Asp Gln Val
Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55
60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65
70 75 80Arg Gly Val Trp
Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85
90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln
Met Val Glu Phe Tyr Leu 100 105
110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu
115 120 125Leu Ala Gly Ser Leu Val Met
Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135
140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly
Trp145 150 155 160Leu Tyr
Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val
165 170 175Ser Thr Ala Ser Pro Ala Val
Asn Ser Ala Tyr Asn Ala Met Met Met 180 185
190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala
Ala Gly 195 200 205Tyr Leu Met Gly
Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210
215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe
Gly Leu Ile Ile225 230 235
240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245
250
User Contributions:
Comment about this patent or add new information about this topic: