Patents - stay tuned to the technology

Inventors list

Assignees list

Classification tree browser

Top 100 Inventors

Top 100 Assignees

Patent application title: Light-driven energy generation using proteorhodopsin

Inventors:  Edward F. Delong (Cabmbridge, MA, US)  Oded Beja (Misgav, IL)
IPC8 Class: AC07H2104FI
USPC Class: 536 231
Class name: Nitrogen containing n-glycosides, polymers thereof, metal derivatives (e.g., nucleic acids, oligonucleotides, etc.) dna or rna fragments or modified forms thereof (e.g., genes, etc.)
Publication date: 2009-05-07
Patent application number: 20090118483



eration system using proteorhodopsin is provided. The system includes a light source, a host with a correctly folded, integrated proteorhodopsin protein, a source of retinal, and a mediator. The source of retinal binds covalently to the integrated proteorhodopsin protein, thereby creating a light absorbing pigment. Illumination of the light absorbing pigment with the light source results in conversion of light energy to biochemical energy. This biochemical energy is harnessed by the mediator to produce light-driven energy, such as mechanical, chemical or electrical energy.

Claims:

1. A light-driven energy generator, comprising:a) a light source;b) a host with a correctly folded, integrated proteorhodopsin protein;c) a source of retinal to bind covalently to said integrated proteorhodopsin protein, thereby creating a light absorbing pigment, wherein illumination of said light absorbing pigment with said light source results in conversion of light energy to biochemical energy; andd) a mediator, wherein said biochemical energy is harnessed by said mediator to produce said light-driven energy.

2. The light-driven energy generator as set forth in claim 1, wherein said host is an artificial membrane system.

3. The light-driven energy generator as set forth in claim 1, wherein said host is a cell membrane obtained from a bacterium.

4. The light-driven energy generator as set forth in claim 1, wherein said host is a cell membrane preparation obtained from a bacterium.

5. The light-driven energy generator as set forth in claim 1, wherein said host is a cell membrane obtained from a eukaryote.

6. The light-driven energy generator as set forth in claim 1, wherein said host is a cell membrane preparation obtained from a eukaryote.

7. The light-driven energy generator as set forth in claim 1, wherein said light source is a fast-pulsed light source.

8. The light-driven energy generator as set forth in claim 7, wherein said fast-pulsed light source comprises a mechanism for delivering intermittent fast-light pulses at predetermined time intervals.

9. The light-driven energy generator as set forth in claim 1, wherein said light source exhibits different predetermined wavelengths.

10. The light-driven energy generator as set forth in claim 1, wherein said light-driven energy is in the form of mechanical, chemical, or electrical energy.

11. The light-driven energy generator as set forth in claim 1, wherein said retinal is a chemical derivative of a naturally-occurring retinal.

12. The light-driven energy generator as set forth in claim 1, wherein said proteorhodopsin has a photocycle that decays in 15 ms.

13. The light-driven energy generator as set forth in claim 1, wherein said proteorhodopsin protein is selected to determine an absorption spectra of said light absorbing pigment.

14. The light-driven energy generator as set forth in claim 13, wherein said light absorbing pigment has an absorption spectra with a maximum between 490 nm and 527 nm.

Description:

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001]This application is a divisional of U.S. patent application Ser. No. 09/847,513, filed May 1, 2001, which claims priority from U.S. Provisional application 60/201,602 filed May 3, 2000, both of which are hereby incorporated by reference.

SEQUENCE LISTING

[0003]This application includes a sequence listing submitted on a compact disc. The contents of this compact disc are incorporated by reference herein.

FIELD OF THE INVENTION

[0004]The present invention relates generally to gene expression of functional recombinant proteins in bacteria. More particularly, the present invention relates to proteorhodopsin genes and proteins that function as a light-driven energy generator in Escherichia coli (E. coli) and other bacteria.

BACKGROUND ART

[0005]Retinal (vitamin A aldehyde) is a chromophore that binds integral membrane proteins (opsins) to form light-absorbing pigments called rhodopsins. Rhodopsins are currently known to belong to two distinct protein families. The visual rhodopsins, found in the eye throughout the animal kingdom, are photosensory pigments. Archeal rhodopsins, found in extreme halophilic environments, function as light-driven protons pumps (bacteriorhodopsins), chloride ion pumps (halorhodopsins), or photosensory receptors (sensory rhodopsins). The two protein families show no significant sequence similarity and may have different origins. They do, however, share identical topologies characterized by seven transmembrane α-helices that form a pocket in which retinal is covalently linked, as a pronated Schiff base (helix G).

[0006]The archaeal rhodopsins are able to generate a photocycle which produces a chemiosmotic membrane potential in response to light, as such light energy is converted into biochemical energy. Recently, a protein with high sequence similarity to the archaeal rhodopsins has also been retrieved in the eukaryote Neurospora crassa (J. A. Bieszke et al., Proceedings of National Academy of Sciences USA 96:8034, 1999). The eucaryal rhodopsin formed a photochemically reactive pigment when bound to all-trans retinal and exhibited photocycle kinetics similar to those of archaeal sensory rhodopsins (J. A. Bieszke et al., Biochemistry 38:14138, 1999). To date, however, no rhodopsin-like sequences have been reported in members of the domain Bacteria, and no light-driven proton pumps based on rhodopsin have ever before been functionally expressed in E. coli.

[0007]The phototropic conversion of light energy into biochemical energy using bacteriorhodopsin can be harnessed for a variety of processes and applications, such as bio-electronic applications and bio-materials, as has been reported in U.S. Pat. No. 5,757,525 for optical devices, U.S. Pat. No. 5,854,710 for optical Fourier processing, and U.S. Pat. No. 5,470,690 for optical information storage. Bacteriorhodopsin in bio-electronic applications is aimed to interface, integrate, or substitute the silicon based microelectronics systems as well as molecular devices. Bacteriorhodopsin as a bio-material is integrated, for instance, in optical films for light mediated computer memory applications and pattern recognition.

[0008]Previously, archaeal rhodopsins capable of generating a chemiosmotic membrane potential in response to light had only been found in halophilic archaea. Therefore, rhodopsins that originate from archaea adapted to highly saline environments cannot be functionally expressed in E. coli. Finally, the isolation and cultivation of halorhodopsins is an elaborate process. At present one does not foresee an economic utilization possible for this process (e.g. U.S. Pat. No. 5,290,699).

[0009]Accordingly, as one skilled in the art might readily acknowledge, there is a strong need to retrieve and provide rhodopsin-like sequences from naturally occurring members of the domain Bacteria.

OBJECTS AND ADVANTAGES

[0010]In light of the above, it is the primary objective of the present invention to provide rhodopsin-like sequences from naturally occurring members of the domain Bacteria. More specifically, it is the objective of the present invention to provide a method to retrieve proteorhodopsin genes from DNA of naturally occurring bacteria that encodes DNA sequence for proteorhodopsin proteins.

[0011]It is another objective of the present invention to provide proteorhodopsin-specific polymerase chain reaction primers that amplify the proteorhodopsin-containing gene from a DNA sample of naturally occurring bacteria.

[0012]It is yet another objective of the present invention to produce variants of a proteorhodopsin gene using the same proteorhodopsin-specific polymerase chain reaction primers by amplifying a proteorhodopsin containing gene from of a mixed sample of naturally occurring bacteria.

[0013]It is still another objective of the present invention to provide an expression vector that produces a proteorhodopsin protein in E. coli and other bacteria.

[0014]It is another objective of the present invention to provide a light-driven energy generator in which the functional properties of proteorhodopsin are utilized. These properties include the ability to integrate within a host, for instance a cell membrane of E. coli, making an integrated proteorhodopsin protein, and the ability to bind retinal, making a light absorbing pigment.

[0015]It is another objective of the present invention to provide a light source and illuminate the light absorbing pigment to convert light energy into biochemical energy.

[0016]It is another objective of the present invention to provide a mediator and mediate the biochemical energy into electrical energy.

[0017]It is another objective of the present invention to provide methods to manipulate the kinetics of the light-driven energy generator.

[0018]The advantage of the present invention over the prior art is that it is not restricted to operate in halophilic archaea and could therefore be functionally expressed in E. coli and other bacteria.

[0019]Accordingly, another advantage of the present invention is that it provides for a fast and cheap production method that allows for mass production of functionally active proteorhodopsin.

SUMMARY

[0020]The present invention provides proteorhodopsin gene and protein sequences retrieved from samples of naturally occurring members of the domain Bacteria. More specifically, the present invention provides a method for the retrieval and amplification of proteorhodopsin genes from DNA samples of naturally occurring marine bacteria. In accordance with several exemplary embodiments of the present invention, DNA samples were obtained from naturally occurring bacteria such as, for instance, marine proteobacteria, SAR86 bacteria, or recombinant DNA libraries containing naturally occurring bacteria. The present invention provides proteorhodopsin-specific polymerase chain reaction (PCR) primers to amplify a proteorhodopsin gene from DNA samples of these marine bacteria. The present invention also provides a device and method for the placement of proteorhodopsin genes in an expression vector to produce functional proteorhodopsin proteins in E. coli and other bacteria.

[0021]Accordingly, the present invention provides a method to produce and obtain variants of proteorhodopsin genes and proteins. The same proteorhodopsin-specific polymerase chain reaction primers amplify different variants of proteorhodopsin-containing genes from a mixed sample of naturally occurring bacteria. As one skilled in the art might readily acknowledge, these variants of a proteorhodopsin gene produce functional variations in the photocycle kinetics of the proteorhodopsin protein.

[0022]Furthermore, the present invention provides a light-driven energy generator that utilizes proteorhodopsin to convert light-energy into biochemical energy. This light-driven energy generator takes advantage of the functional properties of the proteorhodopsin protein once expressed in, for example, E. coli or other bacteria as is described in exemplary embodiments. These properties include the ability to integrate within a host such as, for instance, a cell membrane of E. coli or other Bacteria, and thereby making an integrated proteorhodopsin protein or integrated cell membrane protein. These properties also include the ability to bind retinal and thereby making a light absorbing pigment. Illuminating the light absorbing pigment with a light source converts light energy into biochemical energy. Finally, the biochemical energy can be mediated into electrical energy by a mediator.

[0023]In accordance with exemplary embodiments, the present invention enables one skilled in the art to manipulate the kinetics of the proteorhodopsin protein photocycle once it is operational in the light-driven energy generator. In particular, the present invention provides examples in which the light source characteristics are manipulated. Examples are the manipulation of the delivery of fast-light pulses and/or the delivery of light at different wavelengths. The present invention also provides examples in which incremental additions of retinal influences the function of the light-driven energy generator. In addition, a proteorhodopsin gene or protein variant can be selected to determine an absorption spectra of the light absorbing pigment to change the kinetics of the light energy generator, for instance to meet a design/functional criteria of an application wherein proteorhodopsin is utilized.

BRIEF DESCRIPTION OF THE FIGURES

[0024]The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee. The objectives and advantages of the present invention will be understood by reading the following detailed description in conjunction with the drawings, in which:

[0025]FIG. 1 illustrates the phylogenetic tree of bacterial 16S rRNA gene sequences including that encoded on the 130 kb bacterioplankton BAC clone (EBAC31A8).

[0026]FIG. 2 provides a nucleotide sequence of polymerase chain reaction primer 1 (Sequence ID No:2) used to amplify a proteorhodopsin gene.

[0027]FIG. 3 provides a nucleotide sequence of polymerase chain reaction primer 2 (Sequence ID No:3) used to amplify a proteorhodopsin gene.

[0028]FIG. 4 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:4) amplified from clone EBAC31A8 (Sequence ID No:1) using PCR primers 1 (Sequence ID No:2) and 2 (Sequence ID No:3), and the deduced amino acid sequence (Sequence ID No:5) of the proteorhodopsin gene Sequence ID No:4 amplified from clone EBAC31A8 (Sequence ID No:1).

[0029]FIG. 5 provides a map of the secondary structure of the proteorhodopsin protein (Sequence ID No:7). Single letter amino acid codes are used (according to J. Sasaki and J. L. Spudich, Biophys. J. 75:2435, 1998). Predicted retinal binding pocket residues are marked in black.

[0030]FIG. 6 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:8) amplified from clone EBAC40E8 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:9) of the proteorhodopsin gene Sequence ID No:8 amplified from clone EBAC40E8.

[0031]FIG. 7 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:10) amplified from clone EBAC41B4 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:11) of the proteorhodopsin gene Sequence ID No:7 amplified from clone EBAC41B4.

[0032]FIG. 8 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:12) amplified from clone EBAC64A5 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:13) of the proteorhodopsin gene Sequence ID No:12 amplified from clone EBAC64A5.

[0033]FIG. 9 provides a variants map of the DNA sequences of the proteorhodopsin gene with Sequence ID No:4, Sequence ID No:8, Sequence ID No:10, and Sequence ID No:12 that were amplified from clone EBAC38A8, EBAC40E8, EBAC41B4 and EBAC64A5 respectively using the proteorhodopsin-specific PCR primer 1 (Sequence ID No:2) and 2 (Sequence ID No:3). Dots represent sequences having identical sequence as those in Sequence ID No:4.

[0034]FIG. 10 provides a variant map of the deduced amino acid sequences encoded by the proteorhodopsin gene with Sequence ID No:4, Sequence ID No:8, Sequence ID No:10, and Sequence ID No:12 that were amplified from respectively EBAC38A8, EBAC40E8, EBAC41B4 and EBAC64A5 using the proteorhodopsin-specific primer 1 (Sequence ID No:2) and 2 (Sequence ID No:3). Lower case represents the PCR primer sequence region. Dots represent residues having identical sequence as those in Sequence ID No:5.

[0035]FIG. 11 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:14) amplified from clone HOT0m1 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:15) of the proteorhodopsin gene Sequence ID No:14 amplified from clone HOT0m1.

[0036]FIG. 12 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:16) amplified from clone HOT75m1 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:17) of the proteorhodopsin gene Sequence ID No:16 amplified from clone HOT75m1.

[0037]FIG. 13 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:18) amplified from clone HOT75m3 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:19) of the proteorhodopsin gene Sequence ID No:18 amplified from clone HOT75m3.

[0038]FIG. 14 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:20) amplified from clone HOT75m4 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:21) of the proteorhodopsin gene Sequence ID No:20 amplified from clone HOT75m4.

[0039]FIG. 15 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:22) amplified from clone HOT75m8 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:23) of the proteorhodopsin gene Sequence ID No:22 amplified from clone HOT75m8.

[0040]FIG. 16 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:24) amplified from clone MB0m1 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:25) of the proteorhodopsin gene Sequence ID No:24 amplified from clone MB0m1.

[0041]FIG. 17 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:26) amplified from clone MB0m2 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:27) of the proteorhodopsin gene Sequence ID No:26 amplified from clone MB0m2.

[0042]FIG. 18 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:28) amplified from clone MB20m2 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:29) of the proteorhodopsin gene Sequence ID No:28 amplified from clone MB20m2.

[0043]FIG. 19 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:30) amplified from clone MB20m5 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:31) of the proteorhodopsin gene Sequence ID No:30 amplified from clone MB20m5.

[0044]FIG. 20 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:32) amplified from clone MB20 ml2 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:33) of the proteorhodopsin gene Sequence ID No:32 amplified from clone MB20m12.

[0045]FIG. 21 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:34) amplified from clone MB40 ml using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:35) of the proteorhodopsin gene Sequence ID No:34 amplified from clone MB40m1.

[0046]FIG. 22 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:36) amplified from clone MB40m5 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:37) of the proteorhodopsin gene Sequence ID No:36 amplified from clone MB40m5.

[0047]FIG. 23 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:38) amplified from clone MB40m12 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:39) of the proteorhodopsin gene Sequence ID No:38 amplified from clone MB40m12.

[0048]FIG. 24 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:40) amplified from clone MB100m5 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:41) of the proteorhodopsin gene Sequence ID No:40 amplified from clone MB100m5.

[0049]FIG. 25 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:42) amplified from clone MB100m7 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:43) of the proteorhodopsin gene Sequence ID No:42 amplified from clone MB100m7.

[0050]FIG. 26 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:44) amplified from clone MB100m9 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:45) of the proteorhodopsin gene Sequence ID No:44 amplified from clone MB100m9.

[0051]FIG. 27 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:46) amplified from clone MB100m10 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:47) of the proteorhodopsin gene Sequence ID No:46 amplified from clone MB100m10.

[0052]FIG. 28 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:48) amplified from clone PALB1 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:49) of the proteorhodopsin gene Sequence ID No:48 amplified from clone PALB1.

[0053]FIG. 29 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:50) amplified from clone PALB2 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:51) of the proteorhodopsin gene Sequence ID No:50 amplified from clone PALB2.

[0054]FIG. 30 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:52) amplified from clone PALB5 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:53) of the proteorhodopsin gene Sequence ID No:52 amplified from clone PALB5.

[0055]FIG. 31 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:54) amplified from clone PALB7 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:55) of the proteorhodopsin gene Sequence ID No:54 amplified from clone PALB7.

[0056]FIG. 32 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:56) amplified from clone PALB6 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:57) of the proteorhodopsin gene Sequence ID No:56 amplified from clone PALB6.

[0057]FIG. 33 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:58) amplified from clone PALB8 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:59) of the proteorhodopsin gene Sequence ID No:58 amplified from clone PALB8.

[0058]FIG. 34 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:60) amplified from clone PALE1 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:61) of the proteorhodopsin gene Sequence ID No:60 amplified from clone PALE1.

[0059]FIG. 35 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:62) amplified from clone PALE6 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:63) of the proteorhodopsin gene Sequence ID No:62 amplified from clone PALE6.

[0060]FIG. 36 provides the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:64) amplified from clone PALE7 using PCR primers 1 (Sequence ID No:2) and 2 (Sequence No:3), and the deduced amino acid sequence (Sequence ID No:65) of the proteorhodopsin gene Sequence ID No:64 amplified from PALE7.

[0061]FIG. 37 illustrates a phylogenetic tree of different proteorhodopsin genes.

[0062]FIG. 38 provides an example of an alignment of proteorhodopsin amino acid sequences.

[0063]FIG. 39 provides a light-driven energy generator that utilizes proteorhodopsin.

[0064]FIG. 40 provides an example of a proteorhodopsin-expressing E. Coli cell suspension (+) compared to control cells (-), both with all-trans retinal.

[0065]FIG. 41 provides an example of absorption spectra of retinal-constituted proteorhodopsin in E. coli membranes and a negative control.

[0066]FIG. 42 provides an example of a light-driven transport of protons by a proteorhodopsin-expressing E. coli cell suspension.

[0067]FIG. 43 provides an example of a transport of [3H]TPP.sup.+ in E. coli right-side-out vesicles containing expressed proteorhodopsin, reconstituted with or without 10 μM retinal in the presence of light or in the dark.

[0068]FIG. 44 provides an example of laser flash-induced absorbance changes in suspensions of E. coli membranes containing proteorhodopsin.

[0069]FIG. 45 provides an example of absorption spectra of retinal-constituted proteorhodopsin in E. coli membranes.

DETAILED DESCRIPTION

[0070]Although the following detailed description contains many specifics for the purposes of illustration, anyone of ordinary skill in the art will appreciate that many variations and alterations to the following details are within the scope of the invention. Accordingly, the following preferred embodiment of the invention is set forth without any loss of generality to, and without imposing limitations upon, the claimed invention.

Proteorhodopsin

[0071]The present invention provides rhodopsin-like gene and protein sequences retrieved from naturally occurring members of the domain Bacteria. More specifically, the present invention provides a method for the retrieval and amplification of proteorhodopsin genes from DNA samples of naturally occurring marine bacteria. In accordance with exemplary embodiments of the present invention, DNA samples were obtained from naturally occurring marine bacteria such as bacteria from the SAR86 group. Provided as an exemplary embodiment of the SAR86 group, DNA samples were obtained from a bacterioplankton Bacterial Artificial Chromosome (BAC) clone BAC31A8 (also referred to as EBAC31A08). In general, as will be appreciated by those of ordinary skill in the art, suitable DNA samples can also be obtained from other sources, e.g., from a marine environment or from a recombinant DNA library containing genomic fragments of samples of naturally occurring bacteria.

[0072]FIG. 1 shows the phylogenetic tree of bacterial 16S rRNA gene sequences including that encoded on the EBAC31A8. FIG. 1 also shows the relationship of EBAC31A8 to the SAR86 bacteria group as well as to the gamma-proteobacteria group. A subclone shotgun library was constructed from BAC clone 31A8, and subclones were sequenced in both directions on the MegaBACE 1000 capillary array electrophoresis DNA sequencing instrument (Molecular Dynamics, Sunnyvale, Calif.). Sequence analysis of a 130-kb genomic DNA that encodes the ribosomal RNA operon from BAC31A8, reveals an open reading frame encoding a proteorhodopsin. In an exemplary embodiment, the contiguous sequence was assembled using SEQUENCHER 3.1.1 software (Gene Codes Co., Ann Arbor, Mich.). Other sequencing techniques can also be used, as will be recognized by those skilled in the art. The sequence of the proteorhodopsin-containing contig has been deposited in GenBank under accession #AF279106 and deposit date Oct. 23rd, 2000. Appendix A, hereby incorporated, shows the nucleotide sequence of the BAC clone BAC31A8 (Sequence ID No:1) which contains the 130 kilobases genomic DNA from a naturally occurring marine bacterium.

[0073]Proteorhodopsin was amplified from the 130 kilobase bacterioplankton BAC clone 31A8 (Sequence ID No:1) by polymerase chain reaction (PCR), using the proteorhodopsin-specific primers 5'-aCCATGGgtaaattattactgatattagg-3' (Sequence ID No:2 and shown in FIG. 2) and 5'-agcattagaagattctttaacagc-3' (Sequence ID No:3 and shown in FIG. 3). References for PCR are, for instance, The Polymerase Chain Reaction, Mullis et al., Ed. (Birkhauser, Boston, 1994) and U.S. Pat. Nos. 4,683,195 and 4,683,202 to Mullis et al. The proteorhodopsin-specific PCR primers include the addition of 3 nucleotides that encoded one amino acid not found in the native gene sequence of clone BAC31A8 (Sequence ID No:6), in the second amino acid position which is a glycine located on the 2nd codon ("GGT"). Therefore, compare the second amino acid position in the Sequence ID No:5 using PCR primers 1 and 2 with the native Sequence ID no:7. This addition of one non-native amino acid created a new restriction endonuclease site (NcoI site) not present in the native sequence. This allowed subcloning of the amplified fragment into the NcoI restriction site of an expression vector pBAD TOPO TA Cloning® Kit (Invitrogen, La Jolla, Calif.). The present invention is not limited to the use of this type of expression vector and other expression vectors could also be used.

[0074]FIG. 4 shows the nucleotide sequence of the proteorhodopsin gene (Sequence ID No:4) that results from amplification of the proteorhodopsin-containing DNA in BAC31A8 using proteorhodopsin-specific PCR primers Sequence ID No:2 and Sequence No:3. FIG. 4 also shows the deduced amino acid sequences (Sequence ID No:5) encoded by the proteorhodopsin gene (Sequence ID No:4).

[0075]FIG. 5 shows an exemplary embodiment of a secondary structure of proteorhodopsin after it has been folded in a cell membrane 510 and bonded with retinal 520. FIG. 5 shows the native proteorhodopsin gene (Sequence ID No:6) obtained from clone BAC31A8 and encodes a proteorhodopsin protein of 249 amino acids with a molecular weight of 27 kD (Sequence ID No:7). In FIG. 5, 530 indicates seven transmembrane domains, a typical feature of the rhodopsin protein family, that aligned well with the corresponding helices of the archaeal rhodopsins. FIG. 5 also shows the amino acid residues that form a retinal binding pocket indicated by 520. Although the proteorhodopsin proteins shown in FIGS. 4 and 5 both originate from BAC31A8, they differ with respect to the second amino acid position. The reason is that the proteorhodopsin-specific PCR primers that were used to amplify the proteorhodopsin gene from BAC31A8 (which resulted in proteorhodopsin protein as in FIG. 4; Sequence ID No:5) included the addition of 3 nucleotides. These 3 nucleotides encoded one amino acid not found in the native gene sequence (Sequence ID No:6), in the second amino acid position which is a glycine located on the 2nd codon ("GGT"). Proteorhodopsin protein (Sequence ID No:7) as shown in FIG. 5 originates from the native gene sequence without the addition of the 3 nucleotides. As mentioned above, the addition of the 3 nucleotides created a new restriction endonuclease site (NcoI site) that was not present in the native sequence and thereby allowed the amplified fragment to be subcloned into the NcoI site of the expression vector.

[0076]In the exemplary embodiment presented above, PCR primers with Sequence ID No:2 and Sequence ID No:3 were used. In general, the present invention provides a method for designing different proteorhodopsin-specific PCR primers that are all capable of amplifying a proteorhodopsin gene from DNA samples of naturally occurring microbial populations by polymerase chain reaction. In designing these primers one first needs to determine a DNA sequence of a proteorhodopsin gene. Then one can design oligodeoxynucleotide primers with a Watson-Crick base pair complementary to 5' and 3' ends of the proteorhodopsin gene.

Variants of Proteorhodopsin

[0077]In the previous section, an exemplary embodiment is provided of a proteorhodopsin gene and protein. The present invention also provides the retrieval of genetic variations of proteorhodopsin from naturally occurring genetic variations in naturally occurring bacterial populations. These genetic variations in proteorhodopsin sequences result in functional variations in the proteorhodopsin proteins as is discussed below.

[0078]The present invention enables one skilled in the art to use the same proteorhodopsin-specific PCR primers as shown in FIGS. 2 and 3 to successfully amplify different sequence variants from DNA originating from mixed naturally occurring bacterial populations when it is compared to for instance the proteorhodopsin gene as shown in FIG. 4. As mentioned above, different proteorhodopsin-specific PCR primers could be used to amplify genetic variants of proteorhodopsin.

[0079]FIGS. 6-8 show exemplary embodiments of three different and unique variants of the proteorhodopsin gene that were retrieved from a recombinant DNA library of other naturally occurring bacteria (i.e. the bacterial artificial chromosome library (BAC)). In general, genetic variants could be obtained from different DNA libraries containing naturally occurring bacteria as well as from samples of naturally occurring bacteria. FIG. 6 shows the variant of the proteorhodopsin gene sequence (Sequence ID No:8) that is amplified from the BAC clone 40 (BAC40E8) with the same proteorhodopsin-specific PCR primers as provided in Sequence ID No:2 and 3. Accordingly, FIG. 6 also shows the deduced amino acid sequence (Sequence ID No:9) of the genetic variant of proteorhodopsin shown in FIG. 6. FIG. 7 shows the variant of the proteorhodopsin gene sequence (Sequence ID No:10) that is amplified from the BAC clone 41 (BAC41B4) with the same proteorhodopsin-specific PCR primers as provided in Sequence ID No:2 and 3. Accordingly, FIG. 7 also shows the deduced amino acid sequence (Sequence ID No:11) of the genetic variant of proteorhodopsin shown in FIG. 7. FIG. 8 shows the variant of the proteorhodopsin gene sequence (Sequence ID No:12) that is amplified from the BAC clone 64 (BAC64A5) with the same proteorhodopsin-specific PCR primers as provided in Sequence ID No:2 and 3. Accordingly, FIG. 8 also shows the deduced amino acid sequence (Sequence ID No:13) of the genetic variant of proteorhodopsin shown in FIG. 8.

[0080]FIG. 9 provides a variants map of the nucleotide sequences of the proteorhodopsin gene Sequence ID No:4, Sequence ID No:8, Sequence ID No:10, and Sequence ID No:12 amplified from respectively BAC31A8, BAC40E8, BAC41B4 and BAC64A5 using the proteorhodopsin-specific PCR primers Sequence ID No:2 and Sequence ID No:3. In FIG. 9 lower case letters represent the PCR primer sequence region. Dots represent residues having identical sequence as those in Sequence ID No:4. These proteorhodopsin gene sequences differ by as much as 31 nucleotides as is shown in FIG. 10. FIG. 10 provides a variant map of the deduced amino acid sequences of the proteorhodopsin genes shown in FIG. 9.

[0081]Using the same proteorhodopsin-specific PCR primers, as for instance shown in FIGS. 2 and 3, proteorhodopsin genes were also amplified from bacterioplankton extracts. As mentioned above, any proteorhodopsin-specific PCR primer can be used. These bacterioplankton extracts include those from the Monterey Bay (referred to as MB clones), the Southern Ocean (Palmer Station, referred to as PAL clones), and waters of the central North Pacific Ocean (Hawaii Ocean Time series station, referred to as HOT clones).

[0082]FIGS. 11-36 show exemplary embodiments of different and unique variants of proteorhodopsin that were retrieved from the MB clones, PAL clones, and HOT clones. FIGS. 11-36 each show a variant of a proteorhodopsin gene sequence that is amplified with the same proteorhodopsin-specific PCR primers as provided in Sequence ID No:2 and Sequence ID No:3 from respectively clones HOT0m1, HOT75m1, HOT75m3, HOT75m4, HOT75m8, MB0m1, MB0m2, MB20m2, MB20m5, MB20m12, MB40m1, MB40m5, MB40 m12, MB100m5, MB100m7, MB100m9, MB100m10, PALB1, PALB2, PALB5, PALB7, PALB6, PALB8, PALE1, PALE6 and PALE7. The proteorhodopsin gene sequences retrieved from clones HOT0m1, HOT75 m1, HOT75m3, HOT75m4, HOT75m8, MB0m1, MB0m2, MB20m2, MB20m5, MB20m12, MB40 ml, MB40m5, MB40 m12, MB100m5, MB100m7, MB100m9, MB100m10, PALB1, PALB2, PALB5, PALB7, PALB6, PALB8, PALE1, PALE6 and PALE7, have respectively Sequence ID Nos: 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, and 64. Accordingly, FIGS. 11-36 also show the deduced amino acid sequence of each genetic variant of proteorhodopsin. The deduced amino acid sequence encoded by the proteorhodopsin gene retrieved from clones HOT0m1, HOT75m1, HOT75m3, HOT75m4, HOT75m8, MB0m1, MB0m2, MB20m2, MB20m5, MB20m12, MB40m1, MB40m5, MB40m12, MB100m5, MB100m7, MB100m9, MB100m10, PALB1, PALB2, PALB5, PALB7, PALB6, PALB8, PALE1, PALE6 and PALE7, have respectively Sequence ID Nos: 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, and 65.

[0083]In an exemplary embodiment shown in FIG. 37, fifteen different variants of proteorhodopsin in the PCR generated MB gene library 3710 were detected, falling into three clusters. The MB gene library includes MB clones MB0m2, MB40m5, MB20m2, MB40m12, MB100m10, MB20m12, MB40m1, MB100m5, MB20m5, MB100m7, MB0m1, and MB100m9 as well as BAC clones BAC40E8, BAC31A8 and BAC64A5. FIG. 37 is based on a phylogenetic analysis of the inferred amino acids of cloned proteorhodopsin genes. Evolutionary distances calculated from 220 positions were used to infer the tree topology by the neighbor joining method using the PaupSearch program of the Wisconsin Package version 10.0 (Genetics Computer Group (GCG), Madison Wis.). Other methods could also be used. The variants of the MB library share at least 97% identity over 248 amino acids, as shown in FIG. 38, and 93% identity at the DNA level. All the PCR amplified proteorhodopsin genes from Antarctic marine bacterioplankton (e.g. the PAL clones) were different from those of Monterey Bay (e.g. the MB clones) sharing 78% identity over 248 amino acids with the Monterey clade. The changes in amino acid sequences were not restricted to the hydrophilic loops, but spread over the entire protein including changes near the retinal binding domain 3830 as shown in FIG. 38, which are predicted retinal-binding residues. FIG. 38 shows an example of a multiple alignment of proteorhodopsin amino acid sequences that were obtained from different clones 3820. The secondary structure is derived from hydropathy plots (boxes 3810 shows trans-membrane helices).

Light-Driven Energy Generator

[0084]FIG. 39 provides a light-driven energy generator 3900 that utilizes proteorhodopsin, as obtained from naturally occurring bacteria as described above, to convert light-energy into biochemical energy. Light-driven energy generator 3900 takes advantage of the functional properties of the proteorhodopsin protein once expressed in, for instance, E. coli and other bacteria. These properties include the ability of proteorhodopsin 3906 to integrate within the cell membrane 3904 of, for instance, E. coli making an integrated proteorhodopsin protein 3908 (also called an integrated cell membrane protein). These properties also include the ability of proteorhodopsin 3906 to bind retinal 3910, making a light absorbing pigment 3912. The source of retinal 3910 is not limited to chromophore retinal but could also include chemical derivatives of retinal, such as 3-methyl-5-(1-pyryl)-2E,4E-pentadienal, 3,7-dimethyl-9-(1-pyryl)-2E,4E,6E,8E-nonatetraenal, all-trans-9-(4-azido-2,3,5,6-tetrafluorophenyl)-3,7-dimethyl-2,4,6,8,-non- atetraenal and 2,3-dehydro-4-oxoretinal. Illuminating light absorbing pigment 3912 with a light source 3914 results in a chemiosmotic gradient or proton pump in which light energy 3916 is converted into biochemical energy 3918. The chemiosmotic gradient involves pumping of protons from the inside to the outside of cell membrane 3904. When the protons return to the inside of cell membrane 3904 it produces biochemical energy 3918 via a proton translocating ATP-ase. Finally, the biochemical energy 3918 is harnessed by a mediator 3920 to produce energy 3922 for a particular process. For example, since proteorhodopsin functions as a light driven proton pump, it generates energy in the form of a proton motive force across the host cell membrane upon illumination. This light-driven proton motive force can be converted to many other forms of energy, one example above being the regeneration of adenosine triphosphate (ATP), via a proton-translocating ATPase. This coupling of the proton motive force generated by proteorhodopsin, for use by proton-translocating ATPases to synthesize ATP, could be accomplished both in living cells, as well as in artificially constructed membrane systems such as liposomes. Proteorhodopsin-based systems can convert light energy to a wide variety of useful mechanical, chemical, and electrical energy forms, for many industrial and technological applications. These include, but are not limited to, use in targeted drug delivery, uses as primary or secondary energy generators for biocatalyic reactors, fuel cells and nano-machines (including molecular motors), as well as uses in molecular switching or data storage devices.

[0085]Applications that can potentially benefit from proteorhodopsin-light driven energy generation are, for instance, bio-electronics applications that are aimed to interface, integrate, or substitute the silicon based microelectronics systems as well as molecular devices. Other applications that can potentially benefit from proteorhodopsin-light driven energy generation are, for instance, in bio-materials, wherein proteorhodopsin is integrated as a bio-material in, for instance, optical films for light mediated computer memory applications, optical information storage and pattern recognition.

[0086]Alternatively, proteorhodopsin is useful for a process to enhance yield or increase the potential of recombinant protein production or converting the light induced membrane potential into cellular signals, including modulation of gene expression. The biochemical energy derived from functional proteorhodopsin exposed to light could be harnessed to support a variety of cellular processes. For instance, the energy derived from light-mediated proton pumping could be used to enhance the production of secondary metabolites, or recombinant proteins in host cells, such as E. coli. Often, production of specific compounds in the biotechnology industry is limited, since their optimal expression or production occurs in the late stationary phase of growth, when energy reserves of the host cells are low. Retinal-bound proteorhodopsin expressed in such cells would provide an ample source of biochemical energy, by simple illumination. Proteorhodopsin-mediated light driven proton production could enhance any variety of biosynthetic or physiological processes which require energy.

[0087]The biochemical energy derived from proteorhodopsin light driven proton pumping could also be converted to other generally useful energy forms, for example electricity. Microbial fuel cells currently use carbon-based compounds, such as glucose, as the primary energy source. Via specific mediators of reduction potential (e.g. electrons), these microbial fuel cells convert cellular biochemical energy to electrical potential. Unlike carbon-based microbial fuel cells, proteorhodopsin uses light as the energy source, that can then be converted into a chemiosmotic potential, and finally into cellular biochemical energy by membrane-bound proton ATP-ases. Therefore, the use of proteorhodopsin could be employed to derive energy from light as the primary or supplementary energy source, that could then be converted into electrical potential (analogous microbial fuel cells that derive their energy from glucose).

[0088]In addition to energy generation in vivo in living cells, membranes containing proteorhodopsin could be used to enhance or enable other specific processes in vitro. Polymers produced from proteorhodopsin-containing membranes may have specific properties that could be used similarly to those containing bacteriorhodopsin. One example includes the use of these light sensitive molecules for optical computing applications.

[0089]As shown in FIG. 39, the kinetics of proteorhodopsin as it is utilized in 3900 is influenced by various factors such as the type of light source 3914 and the manipulation of light source 3914 in terms of frequency and/or wavelength at which the light 3916 is delivered. Light source 3914 could be any type of light source that delivers light energy 3916 that would be absorbed by light absorbing pigment 3918. For example, the light source 3914 could be tuned to optimally excite rhodopsin variances with an absorbance maximum of 490 nm or alternatively those rhodopsins with an absorbance maximum of 520 nm. Manipulation of the light source 3914 or the light 3916 being emitted by the light source 3914, for example, involves changing the frequency of fast-light pulses or the delivery of light 3916 as individual pulses, a train of pulses, or a continuous source of light. Manipulation also involves changing the wavelength of the delivery of light 3916 at different wavelengths. In addition, as is clear for one skilled in the art, changing the frequency and/or amount of retinal that will bind within integrated cell membrane protein 3908 also varies the function of proteorhodopsin. Finally, as was mentioned in the previous section, genetic variants of proteorhodopsin result in variants of the proteorhodopsin proteins that changes the kinetics of 3600 due to a difference in absorption of light at different wavelengths. The functional expression of such variation in these proteorhodopsin proteins adds another source of variation to the kinetics of proteorhodopsin as it is utilized in 3900.

[0090]As shown in FIG. 39, the light-driven energy generator includes a host 3902. In the present invention, as a preferred embodiment, host 3902 is a cell membrane preparation of E. coli. However, the present invention is not limited to the use of E. coli and, alternatively, other bacteria or eukaryotes could be used to provide host 3902 as an intact cell (in vivo) and/or as a cell membrane preparation (in vitro). For example, but not limited to, bacteria and yeast with developed genetic systems such as Bacillus spp. Species, Saccharomyces spp., Streptomyces spp. or Pichia spp. could be used as host for the expression of proteorhodopsin. In addition, in case a cell membrane preparation (in vitro) is used, host 3902 becomes equivalent to cell membrane 3904.

[0091]The light-driven energy generator 3900, as shown in FIG. 39, further includes proteorhodopsin 3906. Proteorhodopsin is presented in the form of the earlier presented expression vector containing a proteorhodopsin gene or one of its variants. Once proteorhodopsin 3906 has been put into host 3902, the proteorhodopsin expression vector expresses the proteorhodopsin protein in host 3902. An integral cell membrane protein 3908 is created in which the proteorhodopsin protein inserts into and folds properly within the cell membrane 3904. This is accomplished in the E. coli host by virtue of the native signal sequence found in the 5' end of the proteorhodopsin gene. It could also be accomplished by replacement of native sequence with another host-specific signal sequence in non-E. coli host systems.

[0092]As shown in FIG. 39, once retinal 3910 is added to cell membrane 3904, retinal 3910 binds within integrated cell membrane protein 3908 and forms a light absorbing pigment 3912. The particular example of FIG. 40 shows an integrated proteorhodopsin protein 3908 bound to retinal 3910 in E. coli. Chemical derivatives of retinal (as discussed above) could also be used as a substitute chromophore to generate functional proteorhodopsin. For the particular example of FIG. 40, the proteorhodopsin protein was cloned with its native signal sequence and included an addition of the V5 epitope, and a polyhistidine tail in the C-terminus. The proteorhodopsin protein was expressed in host 3902, i.e. E. coli outer-membrane protease-deficient strain UT5600, and induced with 0.2% arabinose for 3 hours. Cell membranes 3904 were prepared and resuspended in 50 mM Tris-Cl (pH 8.0) and 5 mM MgCl2. FIG. 40 shows a proteorhodopsin-expressing E. coli cell suspension. After 3 hours of induction in the presence of 10 μM all-trans retinal, cells expressing the protein acquire a reddish pigmentation as indicated by 4010 and the + (plus) symbol. FIG. 40 also shows that a cell suspension using the same PCR primers (Sequence ID No:2 and 3) but now in opposite orientation as a negative control, did not acquire a reddish pigmentation as indicated by 4020 and the - (minus) symbol.

[0093]FIG. 41 shows an exemplary embodiment of the absorption spectra of light absorbing pigment 3912 upon illumination with light source 3914 as is shown in FIG. 39. As mentioned above, the light absorbing pigment is a retinal-reconstituted proteorhodopsin in E. coli. FIG. 41 shows absorption spectra of light absorbing pigment 3912 as well as a negative control. After retinal 3910 addition to integrated proteorhodopsin protein 3908, light absorbing pigment 3912 was made. The retinal 3910 addition was done at selected time points, i.e. 10, 20, 30 and 40 min, and shows a progression from low to high absorption values indicated by respectively 4110, 4120, 4130 and 4140 upon illumination with light source 3914. FIG. 41 also shows the absorption spectra of retinal 3910 addition at these similar time points but now to a negative control of retinal 3910 containing a proteorhodopsin 3906 that was created using the same PCR primers in opposite orientation. 4150, 4160, 4170 and 4180 indicate the four absorption spectra for the negative control. An absorption peak at 520 nm was observed after 10 minutes (4110) of incubation as illustrated in FIG. 41. On further addition of retinal, the peak at 520 nm increased, and had a ˜100 nm half bandwidth. The 520 nm absorption peak was generated only in membranes containing proteorhodopsin 3906, and only in the presence of retinal 3910. The red shifted λmax of retinal (λmax=370 nm in the free state) is indicative of a protonated Shiff base covalent linkage of retinal to proteorhodopsin.

[0094]FIG. 42 shows an exemplary embodiment of the light mediated proton pump of the light-driven energy generator 3900 indicating the conversion of light energy 3916 as shown in FIG. 39. The proton pump action is illustrated by measuring pH changes in the medium surrounding the host 3902, which in this particular example involves a cell suspension of E. coli, illuminated by light source 3914. The beginning and cessation of illumination (with yellow light >485 nm delivered by 3916) is indicated 4110 ("ON") and 4120 ("OFF") respectively. The cells were suspended in 10 mM NaCl, 10 mM MgSO4.7H2O and 100, M CaCl2. Net outward transport of protons was observed solely in proteorhodopsin-containing E. coli cells, and only in the presence of retinal 3910 and light 3916 and is indicated by 4210 in FIG. 42. Light-induced acidification of the medium was completely abolished by the presence of 10 μM of the protonophore CCCP.

[0095]FIG. 43 is an exemplary embodiment showing that illumination by light source 3914 generates an electrical potential at the membrane 3904 in proteorhodopsin-containing right-side-out membrane vesicles, in the presence of retinal 3910, reaching -90 mV after 2 minutes from light 3916 onset. Transport of [3H]TPP.sup.+ in E. coli right-side-out vesicles containing expressed proteorhodopsin, reconstituted with (4310 and 4320) or without (4330 and 4340) 10 μM retinal 3910 in the presence of light (4310 and 4330) delivered by the light source 3914 or in the dark (4320 and 4340). FIG. 43 shows that proteorhodopsin, in its form of 3912 as a light absorbing pigment, pumps protons from the inside to the outside of cell membrane in a physiologically relevant range. The ability of proteorhodopsin to generate a physiologically significant membrane potential, even when heterologously expressed in normative membranes, is consistent with the proton pumping function for proteorhodopsin in the native gamma proteobacteria from which it is derived.

[0096]FIG. 44 is an exemplary embodiment showing that proteorhodopsin can have a fast photocycle and can therefore be characterized as a fast and therefore efficient transporter of protons. For the particular example of FIG. 44, light absorbing pigment 3912 is induced by laser pulses delivered by light source 3914. Laser pulse-induced absorption changes are shown by 3912 in host 3902, which in this case are suspensions of E. coli membranes containing proteorhodopsin. A 532-nm pulse (6 ns duration, 40 mJ) was delivered at time 0 and absorption changes were monitored at various wavelengths in the visible range in a lab-constructed pulse photolysis system. 64 transients were collected for each wavelength. 4410 indicates transients at 3 wavelengths exhibiting maximal amplitudes. 4420 indicates absorption difference absorption spectra calculated from amplitudes at 0.5 ms (indicated by 4430) and between 0.5 ms and 5.0 ms (indicated by 4440). In 4410, transient depletion occurred near the absorption maximum of pigment 3912 (500-nm trace indicated by 4450), and transient absorption increase was detected at 400 nm (indicated by 4460) and 590 nm (indicated by 4470), indicating a functional photocyclic reaction pathway. In 4420, the absorption difference spectrum shows that within 0.5 ms an intermediate with maximal absorption near 400 nm is produced (indicated by 4430), typical of unprotonated Schiff base forms (M intermediates) of retinylidene pigments. The 5-ms minus 0.5-ms difference spectrum 4440 shows that following M decay an intermediate species red-shifted from the unphotolyzed 520-nm state appears. The decay of proteorhodopsin final intermediate is the rate limiting step in the photocycle and is fit well by a single exponential process of 15 ms, with an upward baseline shift of 13% of the initial amplitude.

[0097]As mentioned above, a proteorhodopsin gene or protein variant can be selected to determine an absorption spectra of the light absorbing pigment to change the kinetics of the light energy generator 3900, for instance to meet a design/functional criteria of an application wherein proteorhodopsin is utilized. FIG. 45 shows an exemplary embodiment of different absorption spectra of retinal-reconstituted proteorhodopsins in E. coli as a function of wavelength 4510. As shown in FIG. 45, the absorbance 4520 is different and depends on the clone from which the proteorhodopsin was amplified. In this particular example, 5 μm all-trans retinal was added to the membranes suspensions in a 100 mM phosphate buffer, with a pH 7.0, and absorption spectra were recorded. The four spectra 4530, 4540, 4550, and 4560 are respectively for the proteorhodopsin genes retrieved from clones HOT75m4, PALE6, HOT0m1, and BAC31A8 at 1 hour after retinal addition. The proteorhodopsin gene retrieved from clone HOT75m4 4530 and PALE6 4540 produced a blue (490 nm) absorption maximum. The proteorhodopsin gene retrieved from clone HOT0m1 4550 and BAC31A8 4560 produced a green (527 nm) absorption maximum. In general, a range of wavelengths could be obtained that is not limited to the range shown in the example of FIG. 45.

[0098]It will be clear to one skilled in the art that the above embodiment may be altered in many ways without departing from the scope of the invention, such as for instance by mutagenesis to change the genetic sequence of proteorhodopsin and thereby changing the kinetics of the proteorhodopsin protein once it is expressed. Accordingly, the following claims and their legal equivalents should determine the scope of the invention.

Sequence CWU 1

651105184DNAUnknownNaturally Occuring gamma protobaceterium 1ttgttatatc agtaatggct attgctccaa taacttaata ctaatatata attagtttat 60gaataaattt tatatatttg ggttattgtt ttttacacta aatgcatttt cttgctcaga 120tcttctagat acagacatga gagttcttga ttccgctgag tcaagaaacc tttgcgagtt 180tgaaggaaaa gctttactag ttgtgaatgt tgcaagtaga tgtggttaca cttatcaata 240tgctggcctt caaaagttat atgaaagtta taaagatgaa gattttctag taattgggat 300cccatctaga gattttcttc aagaatactc tgatgaaagc gatgttgcag aattttgttc 360tacagaatac ggtgttgaat ttcctatgtt ctcaactgct aaagtcaaag gaaaaaaagc 420acacccattt tataaaaaac ttattgcaga atcaggtttt actccctcat ggaactttaa 480taaatactta atctcaaaag agggcaaggt tgtatccaca tatggatcaa aggtaaagcc 540tgattcaaaa gagcttatat cagctataga aggcttgctg taaaattatt acttagaaac 600taatacagtt ttaggcttgt ttgctgcaaa tattccatta tctacaactc caggaatatt 660attaatcaaa gcttccattt cagtggggtt tgaaatatcc atattagaga tatctaaaat 720gtgattacct tggtctgtta taaatccagt tctatatgtg ggtattccac cgatcgagat 780tatttttctt gcaacaaggc tcctactttc aggtatcacc tctataggca gtggaaaagc 840tcccaaaaga ttaaccatct ttgactgatc aactatacat ataaactcgt tagaggcaga 900agcaactatc ttttctctag tatgtgcgcc accaccacct ttaataagac aattttcagg 960agacacctca tctgcaccat ctatgtaata agctatatca actacatcat taaggctaaa 1020gacctctatc ccattttcat ttaataattt tgatgaagca tctgaactag aaacagctcc 1080agcaaatttg tgcctatgct cctttagttc ttctataaaa aaattaactg ttgagccggt 1140tccaatacct aaaatcatct caggatgaag attatttttg atatattcta tagcttgttt 1200agcaacattt atctttgagc cactcataga gttataatac aagaaaatat aggtagttaa 1260ttattttgag actaaaaatt aaaaaaacag gttcttttaa gaattcccag aagtacctaa 1320agcttattcc taatgcatct gtttatgacg tagcattaaa atcacctata acatttgctc 1380taaatatttc ttcaaagctg gggaataaag ttttcctaaa aagagaggat ctgcaaccta 1440tattttcttt taaaaacaga ggagcgtata acaagattgt aaatttatcc gatgccgaaa 1500agaagagggg ggttattgct gcatcagcag gaaatcatgc tcaaggggta gccagtgcat 1560gtaagaaatt aaaaattaat tgcttgatag ttatgccaat aacaactcca gaaataaaaa 1620taaaagatgt aaaaagattt ggagccaaaa tactccaaca tggggacaac gtagatgcag 1680cattaaaaga ggcactgttt attgcaaaga aaaaaaaatt gtcttttgtt catccttttg 1740acgaccctct aacaattgct ggccaaggga ctataggaca agaaattctt gaagataaaa 1800ataattttga tgttgtcttt gttccggtgg gaggaggagg tattctagct ggtgtatctg 1860cctggatagc acagaataat aagaaaataa aaattgttgg tgttgaggtt gaggattccg 1920cttgtcttgc tgaggccgta aaagctaata aaagagttat tttaaaagaa gtgggcctct 1980ttgctgatgg ggtggcagta tcaagggttg gaaaaaataa ttttgatgtt attaaagagt 2040gcgtagatga agtcattaca gttagcgttg atgaggtctg caccgctgta aaagatatct 2100ttgaagatac aagggttcta tcagaacctg ctggggcatt agcacttgca gggttaaaag 2160cctacgcaag gaaagttaaa aataaaaaac ttattgctat aagttctggc gctaatgtaa 2220atttccaaag acttaatttt attgttgagc gatcagagat tggtgaaaat agagaaaaaa 2280tattaagtat caaaatccca gagatacctg gaagttttct taagctttca aggatgtttg 2340gcagctctca agttacagag tttaactaca ggaaatctag cttaagcgat gcatatgttt 2400tagttggtgt tagaactaaa actgaaaaat catttgaaat cttaaagtcc aaattaaaaa 2460aagcaggctt cacctttagc gactttactc gaaatgaaat atccaatgat catctgaggc 2520atatggttgg tggcagaaat agtgactcag gctctcataa caatgaaaga atatttaggg 2580gagagtttcc tgagaagccg ggcgcgctgt taaattttct agagaaattt ggaaataaat 2640ggmatatttc cttatttcat tacaggaacc taggttcagc ttttggaaag atattaattg 2700gcatcgagag taaggataaa gacaagctaa taaatcattt aaataagtca ggnactattt 2760ttacagaaga aacctctaac aaggcataca aagatttttt aaaatgaaag gttaatactt 2820taatctaaat ttaattgaaa aaagctcatc gctagggttt tcccacggct ctttgaacaa 2880ctcggattga gatctatcat cctcctcgtc gtaaattctc ccacctttag aatagaccaa 2940aaatagatat gacaaaggag cgagctcata tttatatcta atttggaacg aagctacgcc 3000agtattaaat tcattaacta tattatttcc tttataaagg tatccattag catctgaaaa 3060aatactaata ggattttttg ctttcaaagc aacaaattga ctcttaagtc tgatctcatg 3120tttattattt ttaaaccaat ttagatcaaa agataaggta tcttgtcttg aatcatatga 3180ggcaagatta ttattatcct gccatatcag ccattcattt tcctttctta ttctatattg 3240cgcattgatt cttaagttat catttggaaa tattgaacct gctatcttgt aaaattttct 3300accataccca ttcgaatccc accgattatc tttctctccc ttaaagaagc taactctcca 3360gtcatatgtc cagaatgaat agttctttgc ctcaaagtct gctgtaatac ctattcttct 3420ttttgatttg ataaaggggt aggcttcatt ttttcttgtg atagttgtat ttttcccaga 3480agatctaaag ttaaaatcta attgaaattt agagttgtcc ttaaaactaa aagaattttt 3540ttgatcgatg cctattggat ttgaattacc agctgtgtca gcatcataat ttagatcaat 3600tccataatct atttgtttta atatcgagct attatcaaat tcatttattt ttcgatttcc 3660tccaatacca gcatgaatcc agtctcttct ttgcagataa ccaaagtcat ttaactcgaa 3720gtcgtcttca aaataaagaa ggcttccact tatgttagat agtttatttg gaagatatgt 3780aaactgagtc ctatacccaa gcccattttt accatctttt tctgaggcta acagatctga 3840atatgtaatt aatttttttg aacgaatatt gatgtaatca ataacattga ctgttgatga 3900ttcgcctgtc atttcattct caacattcgt taccatgaag ccaagcgttt tatttccaag 3960ctttgttcga gatcgaagag cataatagtc tcttccaact gaaaaggctt catcagcctc 4020acttgctaca aatactccaa attcattatt attacttttt tgagtaagtc ttaatgcaaa 4080atcaatatca gaatagtttt ttttagctgc ctcgcaacct tcttcattac tttcttctga 4140gcaattatag ctgggggcag ctccaatcct ccgcgtattt ataaccgagt atctatcata 4200attactaata tcaaatagtg attggttttc attgaaaaat gctctttttt ctgagtaaaa 4260agtttcttga gcagaaaagt taataaccac atcatcactc tcagcttgtc cgaaatctgg 4320attaatagct aaatttattt gacgaccttt tccagtgcta taaaagattt cagccccaat 4380atctgaccct tcttggtttg taactgaatt tttatttgaa gatatatatg gaaaaaaagt 4440aagctttgat tttgtatagt tttgtatttc taagctatct aactcttgaa agtagtcatt 4500tctactagcc attgttccgg cactgctaac ccatgactca ttttcggcac tataacgtaa 4560tgcggtgtaa ttaatttttc ttatatcacc atcaggctgt ttcattaacg ttacatccca 4620aggaataaaa aactcagaga cccaataccc atcaaatttt tgtgtttttg caatccaatc 4680tccatcccag tctgtcttaa agtctcctgc ttgcgttttt atggcatcga aaagcgagtt 4740cccaagattt atagcaagaa tgaaagcttt gttaccatca ccatcaaagt ctatatttat 4800agagttttta tcgctaagtg aatttatttg atctctaagc gtcttcctgg agaacataga 4860atcattactt tgaaaattct taaatccaac atatatacca tccttatttg agaaaattaa 4920agccgttgta agaagttcat ttttcttaag agtaaaagga gatgtttcat aaaaatctgt 4980aatttcaaat gcattattcc actcaggctc atcaagagag ccatcaataa caattgagtt 5040tgaccaaatc agcacagagg taagtaaaag tgataatgag gctaaagtta ttttcataga 5100tagatttaat tcagagtatt ttaatctatg aagagtagga aatcatccga tcattctcag 5160aaaaaacata atcatattta aggtcatgtt tttctccaaa atgcatatta caaatctggt 5220attgataaca cagtccaacc aataaaggtc ttgatacctc ctcgtttata gagccaataa 5280ttctatcgaa atatccagag ccatagccaa gcctgtatcc atttaaatca actcctgtca 5340taggaataaa cattaaatca atttcattta tgttgacata atcttcactt ttaacctctt 5400tgatcccaaa ttgatttata aagaagttag gctgctcatc cagcaaatta aaagccatca 5460tttcgtcatc aattactttt ggaatataaa tattcttttt aagtttagta aaagcttgaa 5520ttaacaaatt tgtattgact tcatttcgaa aaggaaaata taaggcaatg ttttgcattt 5580catgagtatt gattttttct aatacatttt cttggattaa aaaacttata ttgctcttag 5640ataaatcaga aatagactgc ccctgctcaa agagtgattt tcgtatttta tttttcacca 5700taaattgtgc cgaggctaac aaaccaaatc accgccatgt gctatacctg aaccgatggg 5760tcaggtggtg aatttcataa caaatcaggc ttccctgtaa aggtactgcg caacaatgct 5820agaggaatat cacctattta attgtatcgg ttcaaattta ttaacacatt agcgtatgaa 5880ccagaatgca gttaattata aaatatatag gtattaagta aaagttaatt tttagagagc 5940agactctatt ttttgtatta gcttttcgat atctttatta tttaaagcag agtcagtatt 6000ctcggttggc ttggataaaa gcttactagc catagttaga ccagctatca ccaaggcatt 6060attcttatca ctgatgccat caagctcatc atttaataat tttgcagctc taatgagttg 6120gtctttctct tctggcggac aggctaacgt cagatctcgg ccaaaaattc ttaaagatag 6180cgtttccatt tctgacatta ttttttgagt ttagttattt ctttatttag tatgagctta 6240tcttttttcc agtctctttc cctttgcaca taagaatcta ttattccttt ttgctcatca 6300aatcttttta tcaaaagatc taccttatcc tctagttcaa ataatgatga ttttttttct 6360tctgacattg tataagttta attgtactag aatgaattgg aaagtttctt taataaaaga 6420gtaaaataag gcatggaaaa aataattttt aaaaacagaa gagattcact tataaaacac 6480ctacctaaga attcagcctt aattgtacct ggtgcagatt tgcaatatag aaatgctgac 6540tcatcttata atttaagaca ggaaagtagc ttctactatc tgtctggctt ttgtgagccc 6600tcttctctaa tggttttagt taataatgga aaaagcattg attcaataat ttttgttcct 6660gaaaaagata aacttaaaga aatctgggat ggttatcgag ctggccctga gggcgcaata 6720aatgattttc tttttgatca agcttttgaa aataataaat cagatgcttt aatgcctgaa 6780atccttcaag ggctagaaaa agttttttat tcaataggga agaaaaatgg ctttgatcag 6840aaagtaattg actggacatg cgcagcaaat tctaaagata ggcacagcaa atcaattgat 6900attattgatg gctcttcgat ggtaggaaat ttaaggctta tcaaagataa gcatgaaatt 6960gatattatga agagagcttg tgaaatttca gctgaatcat atattgaggt catgaaatct 7020ataaagcctg gggacaatga gcaggaaata gaggcgctat ttttatatga attcgccaaa 7080aggggaggaa ggtttccagc ttatacacct atagttgctg gaggtgaggg tgcttgtgta 7140ttgcattata ttgaaaatga taaagagtta gcttcatcag atttaatttt ggtagacgca 7200ggatgtgaat acaaaatgta tgcatctgat atcacaagaa ccttcccagt aagtggaaaa 7260ttttcagatg aacagctaca aatttataat attgtccaca aagccaatct tgctgcaatc 7320gatgctgtaa aaactggtaa tagcataatg gagccccaaa tggtttcaga aaaagtaatt 7380actgaaggtc ttgtagagtt gggtattcta tctggcgatg ttaatcagct tcataaaaat 7440ggtgcattca aggactttta tatgcataag gtgggacatt ggcttggact tgatgttcat 7500gatgttggtg actacatgga gggagatgag tttatgaagt ttaagccagg gatgataacc 7560acaatcgagc caggcatcta tatcagtagc gcaatggatg tagatgacaa atggaaaggc 7620atcggcataa gaatagagga cgacatcctt gtaacagatt caggcaatat taatctaaca 7680gagaaggtgc catctaatcc tcaagaaata gaatcattga tggcttagac tatggaggtt 7740ccaattgtta tttctggcgg agggataata ggtaattaca tttctcttag gcttgaaaaa 7800aataatatca aaaccgttat tgtcgaaaaa gctagtagtt tcaaagccct agataagggt 7860ataagaacag tcactctcaa tgagcattct atgcaaatgc taaaaaatat tggtatttgc 7920ccatcaattg ctcaaatcaa cagcatcgac gtattagatg gtgagggtac aggcaaaatt 7980caatttctag caaaggacgt aggcagcgaa aacctttcat atgtaaccta tttcaatgaa 8040ttacaaaaac taatttctga tccatgtaaa gaaagaacct tatttaataa tgagattgat 8100tcagttcaga atcttaatac agaatctgat ccagagatca tgcttaaaga tggcatgacc 8160ataaaaacga atctaattgc tggatgtgat ggaagaaatt caaatattgc aaaaattgct 8220tcacttacaa gcagcttcga tgactactta caaacagctt taacttttgt cgttgatatt 8280gataatgatt cacatggcaa agctcaccaa gttttttctg aaaaaggaat atttgcactt 8340atgccactcc cagaaggcaa gggtgagatg aataaatgca cagtggtttg gtcaataaaa 8400aatcaagttt tgggagatga gcctgtatct gagtttgtaa aaaataacat ttcttttttt 8460gaatcaaagc ttaatgttag tctcagggtt aagtcagaaa ttttaagttt taaattatcg 8520aaccatcatt ttgaaaacta tattagcgga cctattgttc ttcttggtga tgctgctcac 8580tcaattcacc ccttagcagg tcaaggtatt aatctaggat ttgcagatgc agatactttt 8640tgtgaagagg taattagttc ttataaaaaa gggattgcct ttaatgagaa atcagtttta 8700aaaagatatg agattagaag aaaaagtatg aactttttaa tgttgaagtc tatggacttt 8760tttgtggatt tatttggttc agaaaattta tatctaaggc tgatcagaaa tttaggtatt 8820tcttcgttaa ataaatcaaa gtttgttaaa gcattcttta taagacatgc ctctggaatg 8880aataagtttt aaatttgtat taaacttttt gacctttagc tctaagttct ttaagaactt 8940cactaatgcc ttttttatca atgattctca taccttttgc agatacttta agatttacga 9000acctgttctc agattcaacc caaaatttgt gtgtgtgaag attaggaaaa aactttcttt 9060tagtcctatt tttagcgtga gaaacattgt ttcctgactg tggtatctta cctgttactt 9120gacatatttt actcattgaa acgcgatttt atagaacact gaggaactta gcaatactat 9180tgtgaaacaa atttatttat tacggcatgc acaatctgac tgggagagct ctaatcagaa 9240agattttgat agaccattag caagaaaagg cattgaagaa gcaaataaaa tatcatgtta 9300ctgcaaatct cattcaattt tagtagataa aatattctgt agcactgcag aaagaactaa 9360gcagactttt gatatatgca gtgatgggct taattatcca atagctgaag cagtctatac 9420tgatgagctt tacttttctg gccctggtga gatcgttaag cttatccaaa gtttaagtga 9480attcatttcc tctgttttaa taataggcca caatccatca atgcaaatgt acatagatgc 9540tatttcagaa aatcctcata ttacgtattc aacatgcggg ctggcagaaa ttctcgttga 9600aagttcatgg aaagacttat ctttaaaaaa atgtaagtta aaatctttta ttcaaccagg 9660agagctttaa aaattgaaaa acataaaaat taagatcatt aacccactaa tgggatccaa 9720gataccctta cctcaatatg aaacaaaggg ctcggcagga ttggatttaa gggcatgcct 9780agatagtaat ctcagccttc aagcaggaac atctcagttg atacctattg gttttgcaat 9840gtacttagaa gatcccggtc ttgcagcaat ggttatacct agatcaggtt taggttctaa 9900gcatggaatc gttcttggta atctggttgg gttgattgat tcagactatc aaggagagct 9960aatggttcct gcctggaata gatcagatac agattttgag attaatcctg gagacaggat 10020tgcacaaatg attatagttc cagtgattca agcagatttt gaaattgtag acgagttcaa 10080tgagactcag aggggagaaa agggttttgg aagttcaggt ataaattgat aaatttactt 10140tttcttgcca aatctttctt caaatttctg aactcttccg ccagtatcaa taattttttg 10200cttaccggta taaaaaggat gagaagcaga ggatatatca agagggtagt atgggtatgt 10260ttttccatct tcccattctt tcgtttgagt cgtatctaat gttgaacgaa tgagaaagaa 10320cttatcagca ctagcgtcat ggaataaaac ttcacggtat tcaggatgta tatctttttt 10380cataataaaa tcaaaatttg gatgagaact ataacaaaaa acaacttctt ttcaatcaaa 10440acatgaaaat tttttactat gacatagctg tttcattgcc tctaaggcaa tgctttactt 10500atagttctga gcttaaaatt acaaaaggaa cccgcgtgtc agttcctttt ggtaaaagga 10560agattgtagg agtggttata aaaaatattc aaaagccaga tttcctaaaa aaagcgggag 10620ctataaaaaa aattatcgct gtacttgatg aatatccttt gttcgacaag ccaatttttg 10680attctatatt gtggtcttct gattactatc atcatcctat tggtgaggtt tttaatacct 10740ttatacccac cgaattaaga aaaattaata ataaaaaaat tgaagcttta agagaatttt 10800ctgaatattc agtaaatgag gatgataaga aattcgattt aaccaaggat caagaaaaag 10860cagtcaaggc gctttctaaa tctaaaggat tttcacccac tttattatat ggagttacag 10920ggtctggaaa aacagaagtt tacttaagag ttgccgaaac ttttattaaa aataataagt 10980cagtattagt tttagttcca gaaataaatt taacccccca attgctctcg cgatttgaga 11040atagatttaa tggtgaaatt ggcatatatc attctaagca aacagcagct aagagattaa 11100agacttggct aaaagctaaa tttggttcta taaaaataat agtaggaact cgatcttcgg 11160ctttagtgcc tttagataac attggtttaa taattatcga tgaagagcat gaccaatcat 11220ttaggcagtc agaagggttt aaattctctg ctagagactt aagtataaaa agggcacagc 11280ttgcagatat tccaattatt ttgggatcag caaccccttc gctgcaaact ttaaaacttg 11340taaaagaaaa taaatttata agagttgata ttcctaatcg agttgatgga aacaagcctc 11400ctaaattaat agccttagat atcaataaca gccctttaat aggcggagtt gctaaagaga 11460caattgaagc aatgcaatca accatagaca gaggagaaca ggttctagtt tttattaata 11520gacgaggatt cgctccactc tatcaatgta gtagttgtgg gtgggtagca gattgtaaat 11580cttgtgatac aaatttagtc ttccaccagg caagaaatag attaatttgt cataggtgtg 11640aatctgccta ctctgttaat ttgtcttgtc cggcatgcaa gtctaatgac tttaatatgt 11700atggagctgg aacagagaga gttgaagaag ttcttaaaag cagctttgta aagactccaa 11760taattagagt tgatcatgac tcaacaaaaa aagtgggagc tatggaggct atagttaaaa 11820aaattcattc ctcagacgca gcaattttag ttggaactca aatgcttgca aaaggacatg 11880attttcctaa agtcacctta agcgttattt taaatgctga taatggcctt ataagcccag 11940aaattaatgc attagagaaa atatctcaat tgcttattca ggtctctgga agagcaggaa 12000gaaataataa tcttgcaaaa gttattattc aaacaagata tcctgatgat ataaatctta 12060ataaaattaa gacaggagat tatatgaaat ttgcttctca atgcctaagt accaatgagc 12120aaatgaactt acctccattt actactttat gtctgcttag gtgctcatca ccaactcaaa 12180agagtaatgt agatttccta gagaaagctg ttttaatttt atccaatagg actgatataa 12240atgttattgg tcccttgcct tcattagttt cgaagtcgaa aggaaattat aggcaccaag 12300tctatatcca tgcaccaaag aagacttttt taaataaggt attaaagttt ttgacaacag 12360agtttgaaaa atggccggaa tctaataagg ttaagtggtc tttcgacatt gatccaatag 12420acttaagcta aatattaatc ttaattaatt gtcctgggta tattggttta ttgtttagtt 12480tattctctgt attaatttct tctacagtca ccccaaatct tatcgctatt tctgataaga 12540catccccttt ttgtattttg taagtcacaa agcctggatc aatactcata aaggtatttg 12600ctttaggttt gtccttaaaa tagttatgta ttcctaggaa aacagatctt gcaatcattc 12660ttctccctgg cttgcccttt aatctttgtg cgtcttcagg gttggttata aaccctgact 12720caaccaatac agaaggtata tcaatagact taagaactct gaaatcagcg tactcaacat 12780tctttttatg aatttttgtg aaagggtctc ttttaagttg atccaatatc ttagttccta 12840aaattttact ttcagaaatt tttttcttat atatttctgg ataggtttgt cttgccgcat 12900cctcatcaaa atcaactggc tttagatttt ttatatcagc ctgtattctt tccctttgtt 12960ttttagataa gttcctagca acagtacttg aagcttcatc tgaccatata aaaacagaag 13020caccctttac ggaagataat ctaaacccat cagcatggat tgaaacaaaa atatctgctc 13080catatttcct agcatcctgg tatctattat ttaaatctaa tgtctcatct ccatctctaa 13140tcatcaccgg tctgtaccca taagtatctc ttaaggttct ttctaactcc ttcgcaataa 13200gtaacgtcac atctttttct aaaatattat ttgggccaac cgcaccaggg tatttaccac 13260cgtgacccgc atctatagca acaacaatat ctctaatact tttattgagg tttttatttc 13320ttttcacctt aagctctaat tttatatttt ctgtatttat agtctgagtt gggttttgcc 13380aatggactga ttcatataaa tcaacaacga ttctggtaag acttccatcc tgagatgctc 13440ttactttttt gattggatag ttgtatggaa catttatctc ggtcggaagg ctcgattgat 13500taatttccat gacaattcta gagggatttt taaagaataa gaccttacca atgaaacttt 13560atctagacta aagttaatac taatttcatt gttacccatg tcttgtattt catcgaagaa 13620aacatcattt ccgctgataa aaaaagatag aaaaccaagg attgccaatc tattcataag 13680tttttaacca ttctttaaag ccattgtcac cagaaattaa agaaacctct cttccttcag 13740gaagatggct gaaaattatt tttagatcaa aacttctttc atgttgaagc ctttcaggcc 13800actcaattag cactactttc ttagagttta tttttctact tagatcaaat atatcaatat 13860cttctgcttc gttagttcta taaagatcaa tatgcaaaaa cattaaatta ttaaaatcat 13920attcctcgca gagagtgtaa gttggacttt ttacaagatc cttccatcca caatttttta 13980taatagatct agatataaag gttttgcctg ctcctaagtc tccctcaaga tgtatttcaa 14040tttcttgcga agaagattta agtatctcca tagctatttt tgaccctagt tggtttgtag 14100cctcgtcatt tataagagta agttttttca tcgatttatt aattgtctta ttataggtat 14160taaactagag gcatttaagc ctatttcacc aatatctacc ttaaatttta aacctgcctc 14220cgaatgaaca gcaacagcaa taatgctcgc atttcttata tcaagacctt gggcaagaag 14280agcagtcaaa acaccagcca gaacgtcacc agttcctcct gaagcaagtt caggtccgcc 14340gcacgcgcat ataaaagact tgttatttgt tgagtcaaag accaccgttc ccattccttt 14400taatatgaca atagaagccc cataagtgtc tgcaatttgc ttggcagcag aaatcctatc 14460tctttgaact tcttcaatag atatgtttag taatatagcg gcctctcctg gatggggcgt 14520catcaagatt gttttattgg attttttgat taaagatttt gatgatgcca cgatgtgtaa 14580agcacctgca tctaatataa tttttgaatt atttgcagac tttaatattt tacccaatat 14640agtttttgca aaagcagtat tagcaattcc tggtccatac aatataacgc tatgatcttt 14700aatttttaaa tctatatcga ccccttcaac catatcaaca ccaattgcca ttacctcagg 14760gtttcttaat aaagatgggc ttacattaga cgtgtctgtg acgagagtta ctagtccaga 14820gccacaaaat aaacttgcct cagatgcaag tattccggcg ccacccatgc ctggggagcc 14880agcgcatatc aaaacttttc caaaattacc tttatggcta tcctttgctc tattgggcag 14940aagatttttt aattcttgaa aggtaaaact ttgcaatata gacatgaaat tacttaagct 15000aaattatgtt

gaatgctaaa aattcattct tcagcgaaat aaccaatcct atcggaatgg 15060aatgatccca taaaaatttt ttcgtcatga ggcactgcaa tagtgggtag gccaaaaact 15120gttctagaaa atgaaaactt atcttctctc ttaagtgttt tttgattaag tttatgaatg 15180gagaagggta gagaacaatt tgttttttct gcacaattac caaaatcatt agcctgtaaa 15240tctaaagaag ttaaccaaac cgaaccgtct tttaaaaaga tattatctgg actctgaata 15300aaataactac ctgttttagt attttgatta atatcgtaaa cagataaatt gtcaccttga 15360ttatagttaa cataaagaag tccagagccc tcatctagca aaattccatt aggcccactt 15420ccatctgttc catcaacttt attaaaatta ttgtcactcc aaagtactac atgccctgaa 15480atacttttaa ataatgaatt cattaaccat tctttcatgc tgatatctct tttatacata 15540tgagatgcat aaaaacttcc atctttttta agggcaacat cattgaaata atattggtca 15600gggaccctaa tacagccccg ccatatcata tcccatgaag attcattttt tataatttca 15660aacatttcaa tcgactcaaa tggcgaatga ttaattacag ctagttgata gaagccttca 15720tcattttcaa caagatctat tccgtgggga ttgaatatgt ctagctcgcc tcttatgcat 15780gaagcgtcac cccaagagct ttctccaaaa gtaatttttg gtaccttttt ttcataggta 15840tttaaatcca ttaaggcaaa gtaaccaggg gtatgttctg catatggacc gatccctcca 15900aattcagaaa taaagaagaa tttattatca ggcgtaatca caatatcttc tgggtttgag 15960aaattacaaa taaccttgat acggtcatcg gattcacact tactaatatc catttggggg 16020cctatatagt cagtagatac gacagttaca gaaataaata aataaagaat agaaaccggg 16080acggtaatct tataatagtg ctttataaaa atttctaaaa tctttgaagc atgatttgga 16140agtgcgatca gccaaacccc ttttaaaaaa gatagagccc ccataacaac aaaaattact 16200gaccatagcc tgtcagacca ttccggctga agaagaccag ttaaaaaaag aatcatccca 16260aagaataaag ctaaatagcc tgagattttg actttcgatc caacaaattt cgaaaagaga 16320tatgtataaa gaggctttat aagaactagt aaaccacagg ccaagaaaaa gaatgctaag 16380tagtaattca taagttagtt tttatataaa tgctccttaa taatactaac aagttctaag 16440ggcttgtcca atggaacatg gtgagcagct ccaggaaccc cctcaaaagt cataatgtca 16500ccatatgtat ttttaatatt gtccaagata cttccggagg ttaataagct gtcttcaccg 16560tggatgaaca aagcagggca gccaaatgaa aatgtgtaac cgaataacct ttcaagactg 16620ctaaacatga catcatcaaa tttccatctc cacccagcct caatattttt tactgagtgc 16680tcagcaatgt atcttaagta ccaatcattc gtacaatctt gcttaggcat taacctaaac 16740cttttaataa tatctgtctt gtcttgatag tgcttgatca ttctgagagg agaagagtgt 16800tgattcgggt cataatccgg tggtcttata aatgtatcaa taataataat attatttatt 16860agatcctttc tttcagatgc aacgtaacca gcaacatgtc cgccgaggga gtgtccaaca 16920ataaaaatat ttgaaatatt ttttttatcc ttttcctttt caattacaga gacaatacat 16980tctccaaaat ctttaatgcc atatgaatct ctaaaagaag agtcacccat gccaggaaga 17040tctattgcaa ctatatttgc gcagtctcta aagtggggcg caataggatc ccaccatttt 17100ttatgagcac ctgttccgtg aataagaatt attaaatctt tgctttcatc tttggagttc 17160cagctagaat aggatatatc cccatgagga ttcttgataa tctctgagct aggcttgtcc 17220tcaatggcat ctttgaacca ctgtggggca tgaataatgt cttgatttag attgttagtt 17280atttccataa acagtattct aagctataaa aaataaaaat atgaataaac ttaatttaac 17340gccagcagca actgttttag tcctaaagga ttctcctgat gggatggaag ttttgatggt 17400aaaaagatca agtaggcctc ccttcggaga cctttttgtt ttcccgggcg gcaagattga 17460cgaaggtgat ttcaataata agatagaaga tttttgtgag ggcgtgactg ataaagaggc 17520ctccataaat cttggattag attctggagg tctagcatat tgggttgcat gtattagaga 17580atgctttgag gaggttggaa ttttacttgc taaaaaaaag agtggggaag atcttgatct 17640agatggagtc gataaacata aatatcaaaa atatagagag atgttgttaa ataatgaaat 17700tgatttatat aaaatctgtt tagaagaaaa tttaattcta atgcctcaac aaatagcccc 17760tttctcgcat tggataaccc ctgaaataga aactaggaga tttgatacac gtttttttat 17820tgcccacctc cccaagcatc agaccggaga acatgatggt agtgagctca tagacagtgt 17880ttggatttca ccaaaagaag cgctcaaaaa atctcgttcg ggtgagatgc ctatgattat 17940gcctacaata aaaaatttgg aacaatgtgc acaatttgat tcgggctcta agcttttaga 18000aaatcagagg aatctctcaa atgaggatat cccaccaatc ctgccaaagt tttttaaaga 18060agatggtgag tggaggggtc tattgcctgg agataaaggg tatgaggatc attaaataat 18120atggacttaa ttactaaaat aacagctccc aaccctggtg ttttcactgg gggtgggact 18180aatacttatt tgattggcaa agatgatata acccttgtcg accctggtcc aaatatatct 18240gagcatctag atgaaattat caaagcaggg gatgggaaaa taaaaagaat ctttgttact 18300catacccata cagatcattc cccagccgca ttgcctttat caaaaactct taatgttcca 18360atgtacggaa ggctagtaga tggtgaatcc tcatgggagg atgaaacatt tatcccagat 18420attattttaa atgataaaga tattattgag acagacgaat atacgttaga agtaatacat 18480actcctgggc acgcatctaa tcatttatgc tttttaataa aagatacgaa atgccttcta 18540acaggcgatc acattatgga cgggtctacg gttgttattg ggccaccaga tggcaatatg 18600acaagctata tcaattcatt agaaaagtta ctagattttg atattgattg ctttgcgcct 18660gggcatggaa attatattca tgagcctgag aaaaccattc aatcaattat taggcacaga 18720ctaacaagag aaagaaaagc tcttagaaag ctaggagagg caggaatctc atcattagat 18780aaacttacta agcttgttta tgatgatgtg tcagagatgc tccatcctat agctaaatat 18840agtctagaag cacatttatt aaagcttata gatgaaaaga aagttaaatt agataaagat 18900ctattcgaaa taatttaatc ctttttattt ttatgtaaga ctttctcctc aatagcttct 18960atatcaatat catcaattga gtcttcgtta ttatcaggta tcttttttac atctttttca 19020attttaaggt cgataggaga ggctccaaga tcaaaagtta gctcccttac attttttgaa 19080atagtatcct caacacagtc atcttgatcg tatgcctcac gggtctctcc tttatcatta 19140atgggaaagg gtctttgggg agggcccatt tgcatgcatt taatagtagc aactggtgag 19200tagtaatcgt cactagaata taacttatca agttcttttg gtgatattga gcaaccaata 19260attccaaata gaataaatgg cgctaagagt cttttcataa actttgtttg ttttctataa 19320gagactcaac aaccgaagga tcggcaaggg ttgtcgtgtc tcctaaatta gatagatcat 19380tctcagcaat ctttctaaga attctgcgca taatttttcc cgatcttgtt ttaggcaagc 19440ctggagcatt ttgaattaaa tctggttttg caatagctcc aatttcttta gcaacaaatt 19500gtttcaattc ataactaaag ttgtcatcaa atgattcatt tatcattaaa gtaacaaaag 19560catatattcc ttgcccctta attggatgat caaaaccaac aacagctgct tcagcaattt 19620tagggtggag cacaagagca ctttcaattt cagctgtacc tagtctgtgg ccagaaacat 19680taagaacatc atcaactctt ccggttatcc agaagtatcc atcctcgtct cgcctggctc 19740catcaccagt aaagtaaata tctttataca taccaaaata ggtgtcgatc attctttggt 19800gatcaccata aatacttcta atttgactag gccaagattg ctcaataact aaattacctg 19860cattagagcc ttctagcgta tttccatgct cgtcatagag agatggctta actccgaaga 19920agggcagagt tgctgaccca ggttttgttg gagtaatacc cgctattgga gagataagta 19980cagaaccagt ttcggtttgc caccaagtat caataacctc gcaattagat ttaccgacaa 20040cactgtagta ccaatcccac gcctctggat taattggctc tccaactgta cctaaaattc 20100ttaggctatc tctttttgtc tttttaacag gatcatcgcc ttgggccatc agagctctaa 20160tggcagttgg agctgtataa aaaatactaa tgtcatgctt atcgcatatc tcccaacacc 20220ttgatgctga agggtaggtt ggcactcctt caaacataag tgttgttgct ccatttgaaa 20280gaggtccgta caagatataa gtatgtcctg ttatccatcc cacatctgca gtacaccagt 20340atttgtcctc tggccttatt ccaaaaagat atttgaaact aatatgagcg cctaatagat 20400aacctgcagt agtatgtaga acaccctttg gcttgcctgt agagcctgat gtatagagaa 20460tgaaaagagg gtcttcggaa tccataggct ctggagcaca cttattagaa acatctttaa 20520caagatcttc ataccaaaca tcttttttat catcccaatt aatttcgcca ccagttcttt 20580ttataaccag tgtattttta acatctggac agcccagaag agcctcatct acattagatt 20640taagtggcac ttttttgccg ccccttaaac cttcatcagc agttataaca attttacaat 20700cagcatcaag aattctatct ttgagtgatt ctggagaaaa gccaccaaag acaacagagt 20760gcacggcacc tattcgcgtg caagcaagca ttgcgaacgc agtctcaatg atcataggca 20820tataaataca aactctcgag cccttttgaa cacccaggtc ttttaaaaca ttagcgaact 20880tacatacttc gtcatggagc tctttgtagg tcaattcttt agaatcagca gggtcatctc 20940cttcccatat taatgctatt ttgtttggat cattttctaa atgcctatcg atgcagttta 21000agctaatatt tgttttacca ccctcaaacc acttcgcatt attaaattga ttattgaatg 21060ttgttttgaa gtcttccatc cagcttatgt tttcgtttgc tagattttta aaaaatttag 21120aaggatcttc tatggattgc ttgtaaagtt ctttgtattc atcaaagtct tttatataag 21180gattacttga atgttttggg ctataaagct ttcgaggcat tcttaaataa ttgaaggttg 21240ggggttaatg aaattctttc ctttgggatt ggacattatt tttgtaatga gcgattcgta 21300atcgctatca ttggtttcta aattaatatc tgcagcatta attattaata aaggggcaga 21360actatagtct aagaaaaacc ttgagtatgc atcattcagt ctttccaggt agtcaagagt 21420tagatattgt tcgttaatat ttcctctctt agtaatcctg tcttttaaca catcaatagg 21480tgcctgaaga tagattacta ggtcgggtgt tggcgcgtcc agggttagat ggtcatatac 21540tttgtcatat agatccattt cctcgttaga aagagtaacc tcagcgaata atcgatcttt 21600ttctattaaa aaatcagcaa ccctcactgt ttcaaaaagg cttctttgtt taagatcttg 21660aatttgttgc attctttgaa acaagaagaa aagctgagtg gctagagctg attggcttgg 21720gtttttataa aaattcttta agaagggatt ctctgccggt tgttctaaaa aagaatcata 21780attaaatgtt tcggctatct tatttgctaa agtagttttc ccaacaccta tcggtccttc 21840aattgcgata tattttggaa gtggtacttc tttaatagct gggttcattt ataacgtata 21900gtttttatcc tagattatct caactcgtta tagaagccaa atatttttgc agattcttca 21960tcactttttt cattcatctc agcaataatg gttcctctca atgagtcatc atttttttgg 22020aaatcagtcc accacttccc aataccactc ttgtctccag aagctctctc tcttacaagg 22080agaaaatcat atgcagctct aaaccttgga tgtcgaagag ttttatatgg ctggctacca 22140attctgctat gaagttttaa ttgaagtacc caaatatcct taatatagct tgaaaatttt 22200cttggtattg ctgtgatttt ttgttgttcg cgaagtacac catccataga tcggaaaaat 22260tttctcacat taatttctcc attcttagaa cactttttca ataacgaggg ccataacaat 22320gcagccataa gaaagcctgg tgtaattgac tgttggtttt taaccctatc atcagtattt 22380cttagagcat gtgtcataac attgctggca aagtcattcc tacttggatc agaaaggata 22440agatatttat ttaaatgaaa tgagcagagt ttttcaaaat tcttttcacc cattccattc 22500aaaaatattt tgcagaactc atcaaacaat ctagcgtttg agatgcctga taaaagatgg 22560cctttatcat agatggcatc cttaaccaga ttatctatct taaaattgag tttgttacta 22620aatcttatag ctcttaagct tctcactgga tcttcttcga atcgtctttg gggatctcca 22680atagatacaa taaccttctt gtgtatatgc ttaagcccat cgttatgatc ttctattttt 22740tttgtaacag gacagtagta aagggcattc acagtaaaat ctcttctatg acaatcttgt 22800tcaagagtgc cccaattatt atctctaaga atctttcctg tcgaatcagt aacaatattt 22860tcaccatctt cctgatcact tcctgatctg aaagttgcta cttcaagcaa ttcactccta 22920ttaaaaacat gaaccaattt aaatcttttg ccaattattc ttgaagcttt aaatgttttc 22980ctgatctgct caggcgtggc attagtggct atatcgaaat cttttggctc tagtcccgtt 23040agtgcatcgc gaacgcaccc accaaccaga taggcctgaa aattattttt ttgaaggtct 23100tgaacgacag atatagcaaa tttacttatc ttattattat ctatcaaatc taattatgaa 23160tttttatgat ttaagggcat tagccattta gctgcttttc tttaatttcg tcaagtgttt 23220tgcaatgaat gcaatgagtt gcagttggtc tagcttcaag cctcttaatt ccaatttcat 23280caccacagga ttcgcaccaa ccataatcat cttgcttaat ttgttcaata gatagaccta 23340ttttgctgat aagttttctc tctctatctc tggttcttaa ttcaaaagca aactcttcct 23400cttgagaagc cctgtctact gggtctgcat aggtttcacc tttagctcta agatgatcaa 23460aagttttttg catttcatcc ttaagatgtt ctttccaaag aagaagaacg gcaacaaaat 23520gtttcttcat tgctgcactc atatattttt cacccttctt agatttataa ggtgcaattt 23580tagatttatt attagctatt gttgcttttg cagacttctt ggctgcaact tttttaactg 23640gagctttttt ggtaacagtt tttttaactg tagctttttt ggcaacagtt tttttaactg 23700gagctttttt tactgtcttg gatttttttt cgaccatgta agattttata gaattttagg 23760gtggagaaaa tatcagatac tgacaaaatt agctagtcat tttttaattt atttaatact 23820ttcaagtacc catccgagct aagtcttggc ccaaatgtct caacaacctt ggaagatgca 23880tagtttgcaa acttagcaca tgcttcaata ttattccctt gaaggtaggc atgcataaac 23940gatccggcaa acatatcacc agctccattg gtatctattg gagttatttc ttctgcttga 24000gcatgcttct caaccccttt atctataaca acacttccat cggcgccttt tgtaatagca 24060gtcatatagg gcttttcttt ataaaagcta acagcatcat caaggctttc tttaccagaa 24120aaagcaacag cttcatcatc attacagaag atcatatcta ttccatatga ctctattaaa 24180tcaaattttt ctttaaaacc atgaacaata cctgcatcag aaagagacaa ggctttcttt 24240acgtccttgt ctttaaggtg ctctaagact gaaataacag cattaaagtt atcgtcactt 24300gttaccatgt agccttcgat ataaaaaatt tttgaatttt ctacaacatc aaaatctata 24360tctgatttac caagatacgc actaactcca agcatgctgc tcatagttct cttagcgtca 24420ggagtaacta aaattaagca tttcccagtt ggttgatctg tattttcaga gctgacacca 24480atatgtttga ctccagccga cctgagacta tcaagatagt ttcttccatc ttcatcatca 24540gaaactctgc atacatgatg gcaattcgaa ccataatttg ctgcagcaac aagagaattg 24600gttgcagagc caccgcaatc agaaatcgat tcggctccca tttcaataag tttgctaatt 24660ataggtgcct gttcttcaga agatgaaaga gtcatagagt cggctacaag gcctacactt 24720gataaaaatt catggctgac tttatattga gtatctacta aagcatttcc aagggcgcta 24780atatcatatt tcatgtgtta ttcctttttg tattatttgt tttactgttt ctaacgttct 24840attaatttct tgatccttat gcattgctga aatgaagcct gcttcatatt tagagggggc 24900aaagtatatt ccacttctaa tacatgaatt taaaaaattt gaaaatagta catcatcagt 24960ttttgcaaca tcattaatat tattagggag ttcttctgaa aaaaagaatc cgaacattcc 25020accaattctg tttatagaaa acggaatacc tgactcaatc attaaggttt tcatcccatc 25080caaaagcaca gatgcatttc tttctaactc tttaaacgga ttttctttaa tcaacaattg 25140caacaaagca gttcctccag ccatagctag tggattgcca gacaaagttc ctgcttgata 25200gacaggacca gaaggagcta gatagttcat aatttcttct ttgcctccaa aagctccaac 25260aggaagaccg ccacctatta ctttccccaa agcagttaag tcaggagtaa tattataaat 25320ttcttgagct ccgcctagcg aaactctaaa gccgctcatg acctcatcaa atattaaaat 25380agaattattg gctgaggtgg tttccctcaa taactttaag aaatcttcat gacctggaac 25440aaagcccata ttccctgcta ctggttcgac tattacggcc gctaagtcat cttttatctc 25500attaaatatt tctaaaaact gttctttatt attgtattcg caactaaatg tatattttgc 25560caaatctgca ggaacccctg gagagtcagg taagccaaag gtggcaaccc cagaacccgc 25620cttaattaaa agagagtcaa catgaccgtg ataacagcca tcaaatttaa taattttatt 25680ccttcctgta aaacctctgg ccaatctaat cgttgtcatg gttgcttcag tacctgaatt 25740aaccattctt attttttcaa ttgaaggaat gcatttttta attagcctgg ccacatcaga 25800ttcaagactt gtcggggcgc cataactagt tccaagcgca acttgatttt taattgcacc 25860tacaatgtct gggtgtgaat gacccattat catcggaccc caagacccaa tataatcaat 25920atattcatta tgatcagcat cataaaggta tgcgccggat gctctttcaa aaaatatagg 25980attgccattg atatttttga atgcccttac tggtgaattt acccctccag gcataagagt 26040cttggcctct ttgaataagg ctatggattt atcaatttta ttatgtgtca aagctaattc 26100ttcttaattt ttaatttcga tatgatatca acttaattct tattatgtat ttaattgttt 26160tgaaaattca tccacagtat tccagttggt gaactcataa gtatttgtaa catctgtagg 26220accctttgta atccacataa tcaatctaat catgtgttta tctataaatt tatattttgg 26280ataatctatt ttgcctgcaa agactgctaa tttcattggg ttccaaggag acagctctaa 26340aaatttttgc atatacgggt ttgtttcagg tgtatttttt tcaggctttc tcgcaactac 26400attaactgaa aagaaggcat tttcttttgt ttcaagacaa gcaacatttt tttgaataaa 26460ttcataaagt tctggtttgt gtttgccata cctaatgctc gcaccaataa taattttatc 26520aaattgatat aaatctaact ctactgcttt tgcaatatgt attatttttg aagattcaga 26580cacatctaat ttagagaaaa tttttttaca aatttccaga gtttgcccat cggttgtaga 26640gtagattagt agagttgatt tcataaatat attttatatt gttaattaaa aaatttcatg 26700gcacgcgcaa aatataaaca aaaatatagt atatttaaaa aatgattaaa tatttattta 26760ttatcgcttt attcatcagc aactttgcat atgcaaatat tgatgcagct aggtgcgcag 26820ggattagttc agatcaagaa agacttgatt gctatgattt aatttttaaa gctaatgatg 26880agctaccttt agatagtaat attaaaactc tcatcacacc agctatcaag gctgtgaccc 26940cagctgattc aataaaaatt gaaaataaag caacaaaaga aaaggatttc gggcttccaa 27000aaacaaaaat caaaaactct gcaaagaact caataaaaac ctcggtagta aggattaaaa 27060aaacaaaaag tggtaaatta atttttactt tagaaaatga gcaagaatgg actgctgaaa 27120cttcttatag agcaaggaat atgtttaaac cagaaaccgc agtcatttta gaagaggccc 27180tggttagtgg tttttatatg attaatataa gtaataaaca gaaaattaga ataaagaggt 27240tgaaatagca atgaccatac agagtatagc aacgaccgag ggtgccatga aaagaataag 27300atctgttctg gggtcccaag atggctcaag cttcagggtt tatgttaccg gaggtgggtg 27360ctctgggttt caatatggct tcaagtttga taacgatatt gcattcgatg atgatgttat 27420aaattgtgga gatttctcgc ttttaataga ctctatgtcg tacccctatc tttatggatc 27480aactctggat tttgttgaag atctctcagg ggctaaattc gttattaaaa acccaaatgc 27540caaaacaaca tgcgggtgtg gagagtcatt tacagtttag attttgtaat tgaacccaaa 27600agtcctttta cattcttgct tgatgtctga actatcagag cctcattact gactctttta 27660tatcccatcc aggcaaatgc catggactca atagcaaaaa catcatggcc taaatcactt 27720gagaggacga tatcattact agccatctca gaaattcttt ttaccagata cttgttatgg 27780gcaccaccgc cacaaatgac aatgtcacaa ttattatgac catttttatg gattgaattt 27840attatagatt ttgctgaaaa ttcaaccagg gtacatagaa tatcttcagc tttctttttt 27900aagaatttct tggataatat ttttatatta aagagctctt tgcctgttga ttttggacat 27960tttcttttaa aaaaattatt ttgaagcaat cttcctaact caatgtgatc tacttttcct 28020ttggctgcaa ttgcaccatt tttatcaaaa ggaatctgaa gaaaatcact acaatatgca 28080tctaaaattg cattaccagg gcctacatca gttccccata tgtcatttct attttttaca 28140aatgagtaat ttgaaatccc ccctatattt aaaataattc gtgggtttct agctttataa 28200aataattgat tatggaattc cggaacaaga ggggcgcctt cgccacccag agcaatatgc 28260atatttctaa aatcacttac caccagaagg cccgtttctt ttgcaacaat atttggatca 28320ccaatttgca tagaaaatgg aaatctctta ttaatttcat gcctaatggt ttgtccagaa 28380atagcaacac actctataga tgatttctta attttactaa atccaatcgc ctcattaata 28440gattttgaaa ataagaatcc tatctcttta ttgatagtgc ctaaatctga caacgagctt 28500tcgttatttt caataagttt ttttactttt aatcttaggg attttgggaa tttaattgaa 28560tggaaatact caagatgtat ttttgtacca atgctaagaa atgaaatatc aatagcgtca 28620tgactagttc ctgtcattgc tccaatatag atctttttac tcattactac ttggagaaag 28680tttattgaac tcctccattg ataatttatt attctttaat agtgtatcaa aacttgccct 28740taaatttttt ggtactggtt cagctgatgg taattttact tttataggat ctattctttt 28800cttacctatc ttaaactcat aatgtaaatg aggccctgtt gccagtcctg aactcccaac 28860gaaaccaata gtatcgccct gagaaacttt tttgcctttt cttataccct tactaaattt 28920ttctaaatga caatatctag tcgaatactc atttgtatga tttataacta tctcattacc 28980acatccattc ctttgaccag aaaaagaaac aatcccatcg cctgtagttc tcaccggcga 29040tcctcttttt gctgcataat cgactccatt gtgagctctt attgtatgaa gaacaggatg 29100cattctgttt ggattaaaat gagaactaat atatgcaaaa tctaaagggg ctcttaaaaa 29160ggccttttgc atattatttc cattttcatc aaagtactgt tttttactgg cttcagtgaa 29220aaacctattt gcaaaatagg tattaccatt gttaataaac ttagcaatta cgatatcgcc 29280attttttact ttctctccat cactataagg agtgtcataa attacatgaa attcgtcacc 29340ctctctaata tcaaaaacga agtctacatc ccagccaaag atataagcaa agtccataat 29400cacactttcg ggaatatttg catctagagc tgcctcataa aatgaggatt gaataatccc 29460gctattgtaa gattcaatta actcaatact tttgcttata ttcttaaata ctatttctgg 29520agtaaggcta attgaaatag aatttatttg atctttcatg atctcaattc tggttagctc 29580ctcaccagaa tattcaaaga gcattttttc ccctggttta atgttagcaa tgatattttt 29640ggagtctaat ctaaaaattt tataagcagt atttaaaggc actgaaaagt tttcaaaaat 29700tattgaaaga ttttctccat cttgcacctc atgcatctga tatgttttcg tttgctcgag 29760aggtatagag aatttttcag taatctcaat ttcttcaact ggcaaagatt catatgtttc 29820tatatcaata taaagcatta atattaagac aatagaaaca gcaaaagaaa caagtaccgc 29880tctttttgga acttttttaa aacctatcat tagtcctctt gaatatttaa gagttcagtg 29940ttgccaccaa atgctaccga gttctttgag acaacctttt cgttaagaaa ttgcagaagg 30000taattaggcc caccagcctt ataaccgctg ccggatagat tttgtccacc aaaaggctga 30060gaaccaacaa

cagctcccac catatctcta tttatgtaaa tatttccaac attacacttg 30120tcactaaata tgtcggccct tttttctact ctagtatgaa tacccattgt aaggccgaag 30180ccactatcat taatattttc aattagtgca tccatttcat ttgatttaaa tctaacgata 30240tgaagtattg gtccaaattg ctcgtctttt agatcagaaa tattatcaat ctcaattatt 30300gttggagcta caaaattttc atctacatta tcagcagacc taaaaataga ataattctta 30360tcttcaaaac ctttgacgta agcattgaga ctatcaagag atgttttgct aattattgga 30420ccgatatctg tgtcaagatt ttgagggttt ccaattttta gttctttcat accaccctta 30480atcattgata aaagatcgtc atatatttcg tcttgtacac agagaactct caaggcagag 30540catcgctgtc cagagctatc aaatgctgaa cgaataatat catcggttgc ttgctcgaga 30600agtgcgcttg aatcaacaat cattgaattt atgcctcctg tctctgcaat caaaggaata 30660attgattcat ggttacttgc aaggctgctt tgaatttttt ttgctgtttt taaagacccg 30720gtgaatgcaa cccctttgat gttattaacc tttgaaagca tgtcaccatg aatgccgtct 30780cctaggatta aatttaaagc atcttttggg accccaatct catgaaattt gttcacaatg 30840atgtagccaa gaattgaagt atgttctgag ggcttaactg ttactttatt tccacatgca 30900agggctgcac ttatttgccc tataagtatt gcaacaggaa agttccatgg actaatacat 30960aaaatatggc cttttggtga gtaagatagg gcattgatct cgccggtggg accctcaaga 31020atatggtctt ctgtttggag gccaacagct tgttttgcat agtatctgag aaaatctata 31080gcttctctaa tttcatcaat agtattttgt actgtttttc cggcttcatt cataaggtaa 31140taaattagct cagaaggatt tgcttcaatg tcgtctgcaa ttttttctaa aatggatgct 31200cttttttcaa catgcatcaa agaccattca ctaatatttt gttcttcaag gctgcccttt 31260atataatcta gatcatcata tgaggctgta ccaatatttc ttccatctgc aagagatgag 31320atgtcatgag tattggtttt cttataatcc tttgctttat aaatcgatga agcctttatt 31380tcttttgaat caaactttcc aagctcctct tcaagcattt ctaaattcac cctttcactt 31440aaatcaaatc ctttagagtt cggtctgtca ttaaatatat ttcttggcat tggtatctct 31500tttttttcat cctcaatttt taaatggggc cctctggcaa gccaagcaga atctgtttca 31560ggatcaagta acctattaat aaatgagctg tttgctccat tttctaatag tcttcttaca 31620agatagggta gcaagtcttt atatttgcca atcggcgcat aaattgaggt atttttttca 31680gtatttaaga ttttgtttgc ggacttataa agcagctctc ccattccaaa tagtctttga 31740aattcataat ccttatttga gccaagatga tggattgcag aaatggtatg tgcattatgt 31800gtagcaaatt tagggtaaat tttttccaca ttaaaaattc tttttgcaca ggctaaataa 31860gctaagtctg taacagattt ttttgtatag acaggataac catcatagcc atagatttga 31920gcatgcttaa tttcataatc ccaatatgct ccttttacga gccgaacatg cataggtgct 31980ctgttttcta gtagctcttc taaccaatct attgtggcta tagctctttt gccatatgcc 32040tgaacagcaa taccaaatcc tttccagttt ttaatatttg gtgaaagggc catctcttta 32100ataatctcta aactaacggc cagtctgtct tgctcctcag catcgattgt aatttctaca 32160tctttagact ttgcatactc tgtgagctga ataagttttg gaagtagatc agactttatg 32220tcttttagct ttttcatttc atatctagga gataaagctg atatttttat tgagacacca 32280ttaattgtat tttttgtcag atttatcttg ccgacttcat ctatagcatt cttataagac 32340tgataataag tgtcggcttg ttcagcgttt cttgctgcct ctccaagcat gtcaaatgaa 32400taaatttcat tttctatatt cttaattttt tttatgtcat caaaatctct gcccataaca 32460aattcttgac tgagaatatg catggcacca actacagcat ttctaattgg aaactcgcca 32520gattttgaga ttaaagaact taatagggcg cttgggtttt tggtccactc atcgggtgta 32580gaaactacct tgcctgcaag aagaagaccc catgttgatg cattaacgaa aacactatcc 32640gctttgttta aatgctctat ccacgcaccc tcagataact tctcagatat tattagatcc 32700cttgtttttt tgtctgggat tcttagtatt gattcggcta aacacatcag agcaacacct 32760tctttattat ctagcccata ttcgcttaga aatgcatcta gttttgttct ctcgctctta 32820ttctctctgc aagcatcaat tattttattg agcatttttt gaaatcgaag ggtcatttaa 32880aaaatcagaa ttactaataa gatctgaaac tatttcttgt tcaggataaa acttgttact 32940tgttaatgtc atagctatta ttttaatctt atagtgccaa gtatcaactt actgattcta 33000ttaatgactt aaaaacatta tgatagtcat atgagtgatg cacttaaatt aattaaacga 33060ggaaccgacg agatcctcac agagtctgat ttaaaaaaga aattagattc tggaaaacag 33120ctaatcatta aggcggggtt tgatccaaca gctccagacc ttcatttggg tcataccgta 33180ttattaaata agctaagaca ttttcaagat cttgggcata aagtaatttt tctaattggt 33240gattttactg gtcaaattgg agatccctcc ggtaaaaata aaactaggcc aacacttact 33300tcagaagaat taatttcaaa tgcaaaaaca tatgagaaac aagtttttaa aattcttaaa 33360aaagaattaa cagaagttaa atttaattct gagtggtgca acaagcttgg tgcagatggt 33420ttgattggtc ttgcatcaaa atataatgtt gcaagaatgc tggagcgtga tgattttaat 33480aaacgttata gcgcaaatca aagcatagct attcatgaat ttttataccc ccttgttcaa 33540ggatatgact ctgtagccct agaggctgat gtcgaatgcg gaggaacaga tcaaaaattt 33600aatttgctag tagggagaga gctacaaaga tcctatggtc aagaacctca ggttgtttta 33660actgtaccca ttctagaagg cttggacgga ataaataaaa tgtccaaatc attaaataac 33720tttatagcaa tagatgaaga gcctaatgat atgttcggta aaataatgtc tatttcagac 33780gagttaatgt ggagatggtt tgagttactc agctttacct cagagaaaga aataggagtt 33840cttaagaaga aaatggaaga agggaccaac ccaagagata ttaagtttct tttagcagaa 33900gagttagtag atagattcca ttcagagggt gatggttcga aatgcaagga agcttttctt 33960caaagatttc aaaaaggtca aatgcctgat gacattcctt ccatgtcagt tgatgttggg 34020gctgagggca ttccattagt aaacctttta aagaattgtg agatgacatc aagtacatct 34080gaggcgatga gacttgttaa acaaggcgga gttaaaatcg actctgtaaa aatagaagat 34140cctaaaatgc taatttcaaa aggccaggag tctatttatc aggttggcaa aagaaaattt 34200ttaaaaatta aaacataatg aaaaataaat tagtccaaat atttcttctt ctttttatag 34260ttgcgtgtaa tcaggatagc cctaatatta aacaaattag taatatgcag tattttattg 34320ataatgaaat aagggaagga atctattctg ttgaaccagg cttgcaatac tcaattatcc 34380aaaatggaga tcaaagttct gaatcaccat tgctgcagga tacaattaca gctcattttc 34440acgggaccct cactgatggc tcagtttttt ggagttctgt tgaaatgggt gagcccttaa 34500cagtcgaact atcaggtcta atagttgggt gccaaaaaat aatctctatg atgaagaaag 34560gtgatgaatg gagagtttat atcgacccaa gtatggccta tggcgatgag ggccggcctg 34620ggataccttc aaactcaatt cttatttttg atattgagtt attagatatc caaaaaaact 34680aacccttatc tataacggat agagcgtatc cataaacctc agcaacctgg ttaataattt 34740tatttttagg tgttccggct ccatgaccag ctcttccttc aattctaatt aatattggat 34800tatcacaccc ctgggactct tgaagttttg ctgcaaactt gaatgaatga gagggtacta 34860ctctatcatc tcttttagct gtagtaatca gagtagttgg atagcactca ccttcaacaa 34920tattatgcag aggcgaataa gctaataagt tttcaaactc atctttctta tctggagagc 34980catagtcact ttcccaggcc cagcctatag taaatttatg aaatctaagc atgtccaaga 35040cgccaacttg tggaattgcg actttaaata aattaggatt ttgcaacatg gtcgcagcca 35100ctagcaaacc tccattagag ccaccttgaa tagcggttga tgatggagaa ccaatttctt 35160gtgcgtgtaa aaatttagca gaatatgcaa agtcatcaaa aacattttgt ttattaaaaa 35220gcctgcctgc atcatgccag ttatcaccat attcaccacc acctcttaga ttaactacag 35280caactattcc cccttggttc atccaagtga ggtagctttt actaaagcca ggtagtcttg 35340agatattaaa gccaccatat ccatataaaa gtattggagt attactgtct atttttaagg 35400actttttgta actaagatgg atgggtattt gtgttccatc tttggatgga aagaatttaa 35460agtcagatgt aaataaagtt gagtcgtgtc ctttaagatc ttctttccaa aaaagttctt 35520gtgacatatc agttagatta attttgtata tttctcgagg agttacaaaa tttgtaaatg 35580aaaaataaga tacctcatct tcaatttcac caccaaaacc acccattgtt ccttttcttt 35640ctgttgctag cttattttta tatgctcctt ttaaatcaaa gaagtgaacc tcagtaaaag 35700tatcaacgag ataagaaact acaatagaat tatttataaa gctaacacta ctaatagaat 35760tagtactttc accaacgact tcattccaaa caaatgagcc attttttatt gtaagtgata 35820ctacttttcc gttcgcagca ttttcagttg agtaaaacca aaaagtatca ttcttgcttt 35880ctaaaaagct ataagcgcct attagctcat ctattaaagg aataaaaggc tgatcagggc 35940ttagctgaac atagagtcta tttctttcat ctgtgccttc accaatagat agaaacttaa 36000ttttagaatc ttttacaaca cttattcccc aactccatct tggcttttca ggattctcat 36060aaacaatcac atcctcatct tgagcagtac caattttatg aaacattaac tttggagctg 36120tattaatatc ttttaaaagc tcttccgatg gctcgtcgta tttttggtaa taaaatccag 36180aatcatcatt ttcccatgaa gcaccagaaa atttagccca ttcaatccta tcatcaagag 36240tttttcctga ttcaatatca agtactttcc aggttctcca atcagaccca ccatctgata 36300ttgagaaagc tagaagagat gcatcattac taacgctggt acttgcaaga gagatagttc 36360catcttctga gaactgattt ggatcaagta aaactctatc ttggcattct tcacagtcct 36420taatcatcag cttgctttgt tgccatgacc catcattgaa ataataaaaa gtttttttgt 36480taacctgata aggcatgctt atcgaatctg tatcccaaac ctcatctaaa ttcttagcaa 36540tagatttttt atatttattt tggcctataa atttttgtgt gaaattattt tgtctctcga 36600cccagtctgt tgagtcctca cttgtaaaat cctccatcca tctataagca tcttcgatta 36660ggtatccatg gacttcttca ctaaaaggaa ctttatttga ttcagggtat tcaaaattat 36720ctgtttgatt tgagcaacta actagaatta atagagctgc catgtatatg tatttattgc 36780gcataaaaag cataatactt tacaattacc ttatgaatca attcaatcga gcatggcaac 36840ttttacgcaa aaaattggaa gttttttatt gtacttgccg cacccatcat gatgcttgaa 36900atagcaatgg catcaatggt aaccccaata caaaacgtta ctcagcctga ggatattcta 36960gaatttttca atgagaatat agcctttcta ggctctgtta gtctcttagg cgtagtttta 37020agtatggctt ttatgggagc gctttttgtt tcgtatgcat ccatagaatc agaaaatgag 37080attgagcctc taaatgcctt atttttaggc ataagaaaat tctttccact tctgggggct 37140tatcttatag cttcagttgg tgtctttttt ggtatcttat tattaatact cccagctttt 37200tatgtagcag caaggctttg catttttcct gcatttatta tgcttgaaga caaaggagct 37260atagaatctc ttaaattatc ttgggaaaag acggatgagc atggcaccac tttgtttggg 37320cttaccatta ccttcttttc tttaacaatg atttttgcat cagttgccca atccattata 37380agtccgggat taatgcaatt agttgttctt gcaattattg aatatgtaat agtaattcca 37440tggggctatg tatattttag tttatacaag tcattaaaaa gatattaggc agattaacta 37500gaataaatta atacgaagct tttcgtttac aaaaaaacac accactctta taatccttcc 37560tccgaagatt tgcactttgt taattcatgc aaacaacttt gggtctgtag ctcagcttgg 37620ttagagcgca cccctgataa gggtgaggtc ggtggttcga gtccacccag acccaccatt 37680ctttgaacat taacttttat ttcctttata cttattctta actaaataag gatagtatta 37740tggctaaagg tttagacaag caaaaaaacg acaagaagaa aggcaaaact ctaaaagaaa 37800aaagagcggc aaaaaaagaa aaaaagaaat agttttttgt agaagactat tattatctag 37860tcgacctagg gccatcttta aaatctattt taaagaacaa cattattccg gtcactgagc 37920ttactaaagc taaaatagaa aatattttta aaagaaggtt gtcgatatta tctctttctt 37980tccaatccat aatatgaaac ccccacatca gatcccatat tttccattta ttagatctaa 38040tggccacaac ctctgctgag taaacattaa tgtaaacatt taattcagta cctttaacat 38100ttttactttt tactctataa attggcaatg atctgcctct atattcagat cctgattttt 38160catttacaac ttcttcggta gcaaatggca aaagagttgt ttggcttgaa actgaatcta 38220tagcatcttc catagaaatc tttgttaaag gctgcccaag catattcaga tacttagttg 38280agccctttgt tgtgatgata acaatctctt gacctaatct ttttttgaat ttaacttctt 38340tagctttctc tatttcaaaa tttaacttac ttaaatcaaa agatgtttca acatgattaa 38400ggtattgctc gcctcttacg agctcaatct tattaaaagc aaagtatatg cctgagatag 38460tccaaagtaa taactgaaga gagataaaaa aacttaggta cttatgaatt ttcctaacta 38520aaaagttcat cttttcataa tcttcattat ttcatcaatt tttacatcaa attcgtcatc 38580accttttaaa gattccttaa tgcattcacg caaatgggtt tttagaattt tgccctcaac 38640agtaataatt gaattcttaa gtgcttttat ttgattaaga atatcaacgc agtattgacc 38700ttcctcaacc attccttgaa cgcctcgcac ctggccctct attcttttta agcttattaa 38760ttgctcttta tgacatggat gtttcatttt ataaaaacct cggaactata tacgtaaaaa 38820ttgtagcgaa taaaaaaatt gctacagatg catagtacag atacctagat ttttttctag 38880tttctgaaca tgctctgcct agttctgggt cagcagggca aggcatatga taagttttgt 38940agttgatata tcctgcaatg acaatcatta ctaatgcaaa tagtgttatg taaagcttat 39000attgagatag ggtaattaag aatggaaata cagttaccag gcttgcaaaa cttgccccag 39060caccaagagc aacaaaaata gctggcaagg cacaacatat caaagtagaa gaagatgcaa 39120ataatgaaaa aaaattcgac gccttatcat tcatagttat taaatttcta ggatttgaag 39180atcagacata ttttggccat tagaaagaat cttcttttcg atctcagaga aggtgatttc 39240ttttgcctta gaaaaggcaa tcactacttt tccattatca agatcaacat ctattctttt 39300gacatcttta tcttttagaa aagtttttga aatacctctt gcacagaaat cacaaaccat 39360tcccttcacg cttacgatcg ctatatttac atttgagagt ccttcaacaa atgcatcaaa 39420cctctcaggg tttacatcaa gatcttttcc atcaaccatg gtttcatgta aatgaccttc 39480atgagaatgc ccatgcatac cactatggtc attctccgcc gcaataaggc cgcttgaaag 39540caaagtaatt aataataatt ttttcatatt tttctccttt taaaatctgt acataaaatg 39600taaatcccag ctcgtatctg cgtcatagcc aaactcaaca agggcattac cataaaaaaa 39660tttcaattct ggatacgtag atgatttatt tgttacagta ttttttttgg ttttaaccat 39720aacccatgta tgtaaatcat tgtagtcagc aacgtaagga gcaaatccaa gttgaatata 39780ttcttcttta tagtctttgc taaatttact gttaatatcc tttaacccaa agcccgcata 39840ccacttccgg gtttcccagt ctccatggat accataaaaa ttatttttaa tacttccagg 39900cacaattcct gattgaaaat ataaattcct ttgggatgta aaagtatttt ttctattcac 39960caaataagta aaacgaaaat agttttctgt gttcttataa aatttatttt tagccctttc 40020taggccaatg gaatatttgt aggttggaga gtagtggaaa taaatagaat cattaaaggt 40080atctgattta tacattatgg ttgttccgcc agagtaggat atcggtcttg catcagcatt 40140aaagaccatt aaaaaaacta aaatgattat tttatatata cccatggggg gtatcttacc 40200ataggattat aaattaattt caggcatctc tccaagcgcc cattttcaat gaaatcacca 40260tcagaattcc aaaaataacc atagctatag cagatgcata aaaaatacta actagcccat 40320atgagagcca gtaaacagct accaatgcta taacagaaaa tactataagc ctgatggccg 40380tgattataat tggccatttc attgcattgg ctccctgaga tgcaaaatac agggaaagcc 40440catatccttg aaatacataa catacaccaa ggatttgaat atactgcttt gtaactaata 40500aggtctctgg atccgatgta aaaatactaa tccaaagatt tggtgttagt gctaaagcta 40560gaccgataac agccgaaagc aatccagctg ttgttgcacc aaccatgcct attttttctg 40620ctcgctctat atttttggca cctatatttg ttccaaccat ggctgtcata gcagtcccaa 40680tgccgaacac aattgggatt aataagaact ctacccttga accaattcca taaccggcaa 40740tagcagacgt tccaaattga ccaatgagtc ctgtcagtaa aagaactgtt gcaactgtca 40800ttattggcga aagagatgcg ggtaaggcaa cagaaaaaat atcttcgaat aattctcttt 40860caattgttag tcttttgagc ttaagcctta caggcgatga agggctcgaa atttttatta 40920aagttactaa tgccataaaa cctgaagtaa ctatcattga ccaagcagat cccacaagcc 40980caagtttagg tagaccgaat gatcctaata taaatccagc agagaaaaaa acttgtatac 41040ctgcacaaat taccgttagg actgcaggaa attgcatatc tcccatccct cttagggcgg 41100cggttaagct acctgatagc caaacaacta ttgctcccaa taaatagaca aaacaataag 41160ctaaagactc ctctaataat gcacctgttc cacctaatat tcttaataga ggctctccaa 41220aaataaaaaa gataattaaa aaagctaaag ctccaaagca agatatatac aaagaatgcc 41280ataataattt ttcagcccta ggcttgtctg cggctccaag acttcttgca atagaagatg 41340tgaccgctcc tccaagagca ccaaatgcca tctgctgagt cagcatgatt gcaggaaatg 41400ccaaagtaac agcagctaat ggagttatgc caagttgact aataaaccag aattcagcaa 41460gaacaaccac agcatttatt aaaaatgcaa cagtattagg tgctgacatt ttaatcagca 41520aaggaaatat tggatctttt aaaaattgct ctgttctctt gtccataacc ttctcattta 41580ttaatattta tttggttaaa gggtgaaaga gaacacgttc tcaggattca cgaatcttta 41640aataataata acagctttta gttaatttaa tttttaaaaa caaccaaatc tattatctgc 41700tagattttat tttttaaaca tattaatatc taatttaatt gcaaaggggt ggccatttcg 41760gcctgagatc cttggaaact ctttgaacct gatccataca ataatggcgg aggaattgca 41820tgaaaatcat taaaaatcat ttattttatt caataatctt actatatttt tttagtctag 41880agacgagctc tcaagctata gaagaagtta ttatcaaagg agactggaga gaaactagtc 41940tgtcagcaga agactcaagc attgcagtcc ttgatagtaa attaatagaa tcccaggctt 42000taaagcactt tgaaaacctt tcgtacctag taccaaattt aaattttgct gcaagtgatt 42060ctagagcaag acatttccag ataagaggaa ttggagaaag atctggctat gaaagaactc 42120caaactctgc agtaggttta ctgattgatg atatagactt ttctgggcaa ggtggaattg 42180ccacaacttt tgatgttgat caaattgaag ttcacagagg cccccaagga gcaagaatag 42240ggtctagcgc aatggcgggg ctcatatata tttctacaaa agacccaaca gaaagctttg 42300aagggaaggg tgaaatagtt atggggtcat atggaacttt taataccgga attgctgttg 42360gcggtcctgt aaattttaat aaagacctca cttataggtt agctataaag aaagattatt 42420ctgacggatt taggaagaat atttttctca acaagtctga tacttctaag aaagatgaga 42480gcacatttag attaaaagtg aattgggtaa ctgataacca aaccacttat aagtttctta 42540tatctcaaat agagttagat gatcctgcag atatatggac tatagatggg agtcttaata 42600cattatctga taagcctgga atggactctc aaaaaagtaa tgcctatggg gtaaaaattt 42660atcatcaatt taaaaaattt gaatttcaaa gcctatcaag cttaacgaat actgatgtca 42720ttcttagtta tgatgctgat tggggcaacc ctgagtcgca ttcacccttt atttatgact 42780atttttcaga aactacaaga aaaagagata cttttagtca agaatttaga cttgtatccc 42840aatttgcaga taaaaataca gaaaaaagca tcgaatgggt cgttggggct agttttgtag 42900atataaatga aacaaatgct aaaaaagata ctggtattta tggagatcca tcagacccat 42960atggtcctta ttttagtaac tcttcttctt tgagtgactt ttcttcttca agctattctt 43020tatttggaaa tattgattat ttaattaatg aaacaataaa aatttcaatt ggtggaagat 43080gggagaattt taaatctaat tattttgatt cttatgacga atcattttca ccatcaaata 43140aaatgtctgg tggtaagttg tcactagtta aaactcttaa taataattct aatatttatt 43200ttaatattgc taagggctat aaccagggtg gatttaattt aggtcttggt cttgataaaa 43260attcatcaaa tagaaattta tattatgatc cagaattttt aactaattac gaagttggaa 43320ttaatagtaa atttttccag tcaaaattaa atcttggagc agtcctgttt tattctgatc 43380gaaaagatca acaggtctta atttcaaccc aggttgatcc ttcagatccc aacacttttt 43440tatacttaac ccaaaatgct gcagaaggaa ttaataatgg cttagagtta aatatagatt 43500atgcactgaa taaatctcta ggtatatttg ctaattttgg attgctaaat acagaaataa 43560aaaattggat ttcaagacca gatatagaag gtagagaaca ggcgcatgct ccaaaaaata 43620gtttttcaat aggcatcaat tggaagccaa caaaccaatc ttatttatca ttgaatgttg 43680ttggtaaaag tgagttctat tactctgatt cccataacaa tacttctgag tcatacaact 43740taacaaacat taattatgga tatgaacatg gacaatggac ttattcatta tgggcaagaa 43800atatttttga taagtactat tcagtaagag gtttttactt tggtaacgag gcacctgatt 43860ttatagatac gctctacaga agacatggag atccaaggca tataggagtc atggtccaat 43920atgatttcta actttataag tgaattttgg atggaaatag cagctgttgt atttgctatt 43980atttatttat tacttgctgt aaaacaagac gtaaagtgtt ggtttgcggc tataattagc 44040tcaatattat atttctttat tatgtatgac gcaggcttat atatggaagc ttacctgcaa 44100attttttata tcatgatggc cttttatggg cttcaacagt ggagaagtgt cgatactgat 44160gctcctcaat ttattgttag aacatgggat aaaaggatgc atattaagat cattacatta 44220atagttatca tgactttaat ctctggattt ttattagaaa aatataccaa tgctatcttg 44280cctttcatag atggactaac aacatgggga gcaatagttg cgacatatat ggttgccaaa 44340agacttcttg aaaattggat atattggttt gttatagatt ttatttctat ctttttattc 44400atgtccagag gacttttatt aacctcagga ttattcttta tttatcttgt aataatatat 44460tttggttaca tgtcatgggt aaaaataaga gacgatatta gtgcagaatc ttcataacaa 44520tctagatata aaaaatcata atctagagat tattaaaaca attaaatcag gccctgtatc 44580tgaaatatct atttgtaatt ttgataacat caaagcaatc ttaagagttg atcatccatg 44640cgcacacaaa ataaatgtag atcgagaaaa tgaaattttt atactcagcc aactaaaaat 44700tttagatttt agtccagagg ttttatttag tgatttgtct tatggaattt tagtatggag 44760atatatcgag ggtattgaat tttcacttgg taaggattcc aatgaagttt ttttaaaaac 44820acttggaacc gaattaaaaa aaattcatga tattgatctc cccaaaagta agaaaaaata 44880ttttagcaat gacataaatt tttacagaaa tttactaaag gaggttcctg aaaatataat 44940tcttcataga ggatttgatt tatacgacaa actcaataac tctgataatt atgttctctc 45000tcataatgac ttgaataaga caaaccttct ttggagggat aggttatttt ttttagactg 45060ggagtactcg agttttaaca atcctttttt tgatattgcc tcgttatcga atgcttataa 45120cttatcaaaa

gttgataggg caattttatg gaaagcttat acaaataatg aatattcagt 45180attaaatgat acaaatctta gagaatggat gcatttttgt cattatttag agtacatgtg 45240gagtatttcg ctcatacaaa atggaaaaat tgatcagaac accctaaatt taaaaaaatt 45300agagaaaaaa ttaaaaaata ttatttaaac aacaagtatt gcgcattctt tagttgttgt 45360tttactatta tgtaagaaat ttacatttaa atgggggttt gatatgaaaa ttttatgtgt 45420cttatatgat gatccaaaaa caggtatgcc agaaaggtat gcaagagatg atttaccaaa 45480gttagataag tatcctgatg gaatgacact tccatcccca aaatctatag attttactcc 45540tggtgagtta cttggttgtg tatctggaga actagggctt cgaaagtttc ttgaagatgc 45600tggccataca ctagttgtta cttctgataa ggatggagat ggatgtgagg ctgataaaga 45660attagtagat gctgatattg ttatatcaca acctttcttc ccatattatt taacaagaga 45720caagatgaaa acagcgccta atttaaaaat ggcaattaca gctggcattg ggtctgatca 45780tgttgatctt caggcagcca tggataatag cgtggatgtt gttgaagtta cttactgtaa 45840ttctcgttca gtcgccgaac acattgtgat gatgatccta tcgatggttc gtgattatca 45900tacccaacat agaattgtaa aagagggagg atggaatata gctgatgctg tacaaaggtc 45960atatgatgta gagggcatgc atgttggtac tgtagctgca ggtagaattg gtattgatat 46020gctaagaaaa atgaaaccct ttgatgtgca tttgcattac tttgatattc ataaactctc 46080tgatgaaata gaagcagaac taaacctcac ctatcatgat tctgtagagt cattagttgc 46140tgtgtgtgat gtagttaata ttagttgccc attgcatcct aaaactgagc acttattcga 46200tgatgaaatg attagtaaaa tgaaaagagg tgcatatatc atcaatactg ctcgtggcaa 46260gatttgtgat aaagatgcta ttgcaagagg cttagagtca ggccagctaa gtggttatgc 46320tggtgatgtt tggttcccac aaccagctcc aaatgatcac gtatggagaa caatgcctaa 46380ccacggcatg actcctcata cttcaggtac ttcgctatct gctcaaacaa gatatgccgc 46440tggggttaga gaaattctag aatgttattt tgcaggcgaa ccaattagag acccatattt 46500gattgttcaa aatggtgatc ttgcaggtat gggtgcgcac tcgtacacaa agggtacagc 46560cacagatggc tcagaagagg ccgctaagta taaaaaataa gttttagaac ttacttagcc 46620tttccttaat aatagagtct gcctctgcca taatgctatg cattaattct tccactgtcg 46680ggatgtcatt aaccagtcca gcaaccattc cacatgacca ggctccaacc tccatagttc 46740cttcatgcat aatttttgga tagactcctg caacctcatc cacaatatca gcaaaagtta 46800attcatcacc aagagctttt tctttttcaa tcaatctctc aacagcttca ttattaagaa 46860ccctttctgt atttgttaat gatctcatga tgagtctagt atctaactca gaagcattta 46920cgatagcctc tttcacattt tgatgaacag gtgcatcttg agtagcaata aacctagttc 46980ccatattcat tccctcagca cctagtgaca tggcagcaac caaacttctt ccatctgcca 47040ttccacccga ggcaacaaat ggtatttcaa gctcatctgc tgctctaggt aagagtatga 47100aattaggaat gtcatcttct cctgggtgtc cgccacactc aaaaccatct acagaaaccg 47160catcgcaacc tattgcttgt gcttttaatg agtgccttac agaggtgcat ttatgaataa 47220cttttattcc tgcttctttt aaagctggga ggtattctgc aggatttctt ccagcagtct 47280ctacaacagg aacccctgca tcaataatta cttttatcaa accaggatag tctgggggtg 47340ttagtgatgg taaaaatgtt aaattaacag caaatggctt attggtcatt tctttgcatc 47400tggcaatttc atttgctaat ttctcaggcg taccctgtgt tagaccagta attgttccaa 47460gtccacccgc atttgatact gccgctgcaa gctcggcaaa accaacatgg tgcattccac 47520cttgaatgat tggatgttct ataccaaata attcagttat tttagttttc ataatactct 47580cctatttatc ttccattggg gttaaaaaat cttcatattg agtttgaagt ctttgcatgc 47640cacttatcca tctatctctg tcattacctt ttcttttaac atattctaga acctcactat 47700gaggggttac taaaaacctc tcctcctcta tggcgttaag aacatccttt gcaacaatat 47760cagcttccat cattccatca actcctgcta caccaggacc atttgcagtc atagcagttc 47820taactgcttg agggcataaa caggaaacgc cgataccttt atttccatat gtaattttta 47880tccattcagc aaaactcaca gctgcagctt ttgtaactgc atatccagcc gcacctagtt 47940gagttaaaag accggctgct gaggatgtat tcataagata gccttcacct tgttctatca 48000tttgaggaag cacatgtttt gcagcatgaa tatgagactg aacatttaca ccccatatca 48060tgtcccaatc cgaagtatct gcttcaaaaa atcctggctt tccaccgata cctgcatttg 48120aacaaaatat atcaatacca ccagaaaatt cattagcctt ttgtataaca tttataatgt 48180cgttttcttt ggaaacatca gcacttactg caagcccatt aacactcttg gctgtttcct 48240ccgccccatt taaattcata tcaacacata ctatagaact tgccccagat gcatagaact 48300cttcacataa agccttacca attccactgg cagcccctgt aaccacaact cttttattat 48360taattttcat aaattgacca cttttttaat tttttattta actttattgc ttagatagtt 48420tattacaaaa caacaatact taatattgca aaatcgcata cccatcttta taaatttatt 48480gctattatta gtgctagaaa atgaaatact gagatttaat atgaagaatg ttgttgttat 48540tggctcgtcc ggggcaatag gaaaagcctt tattgatagc tatatcaaag atgatgatgt 48600tgaaaatata ttttcatttt caagaacagg cctttccatt gaggataaaa aactccatag 48660tttttttatt gatattgagg atgaaactag tatttgtgat gccgcagaga agatagacaa 48720gtcctcaata gatgaaatta tcgtcgcaag tggaatactt cataataaag attttgggcc 48780agaaaaaagt attagagatt taaatgcaga taacctttta aaggtcatta aggttaatac 48840tatcggccca acaattgttg gaaagtattt cattccattg ctaaataaaa aagaaaaaag 48900cgtcttagca tttttaagtg caagagtcgg cagcatttct gataataaaa caggtggttg 48960gtatgcctat agagcgagta aaactgcact taatcaaatc attaaaagtt ttagtattga 49020attacgaaga accaatccaa atgccattat ttttggtcta cagccaggaa cagtagatag 49080tgaattaagc gaacctttta aaagaaatgt aaaagaaggt aatttattta ctccagaata 49140tagtgtattg cagctaaaaa atattattga tacagcaagt ccatctgatt caggcaaact 49200aatttcttgg gatggggaag agattcagcc atagttggtt atgaatatat tttcatatta 49260gaataaattt ttaggggaaa aaatgagtat taaatattat gactggtcta aatttcaggc 49320caacactagg ccaaataaag ttgccataag agagctagat aataacaaga tctatactta 49380cggagaattg gataaaagat catcaaggct tgcatcacat ctccaaagtt caggaataaa 49440aaaaggagat cgtattgcga tactatcgct aaattgttca gaattttttg agctagaatt 49500tgcttgcgga aagattgggg caatagagat accattaaat tggagattaa caaaacccga 49560gctcagttat attcttaatg atagtgagcc aaaaactcta atttatgaca atcagtttga 49620agaaatggtg aaagagctaa aagaagaatg taatatttct gaaatcatag ctcttgatca 49680atttgaccaa gaaagtgatt atgaaaaagt tttgagtaat gcttcaggca tttattatca 49740ggaagaagtt gatctagaag ataacattat gattatgtat acctctggaa caaccggtca 49800ccctaagggc gccatgatca cgcacaaaat gcagcttttt aatgttatta atttaggtat 49860ttcagcagct gtttcccctg aatcagtcca tttagttgtc cttcctttat ttcatacagg 49920cggaatgaat tgttattcaa atccaattct tcatgcaggt ggcgagttaa tattacttaa 49980agagtttgag cctgggaaag ttctatcaat tatcggcagc tctgactatg gagttactca 50040tctgtttgca gttccagccc cttatcaatt tatgatgaat catccagatt ttgaatcaac 50100aaatttatca ggagttaagt atgctggagt tgggggcgca ccttgtgcag aggctatttt 50160gaagacttat ataagcaagg gtgtttcgat gcagcaagga tggggtatga cagaaactag 50220tccaggtgct actggtcttg aatcgtccga ggctgaaaga aaaataggat ctgctggaaa 50280accgcttctt catactgagg tcaaggtggt tggagatgat gggaatgaac tgcctgctgg 50340agaagtaggc gagatttata ttaaaggccc aaatattaca cctggctatt ggaagaaaga 50400agaggctact agagattctt ttgaagacgg gtggttaaaa acaggtgatg ctgcttactt 50460tgatgacgag ggttttttat acatagttga tcgatggaag gatatgtata tctcaggtgg 50520agaaaatgtt tatccagctg aagttsaaaa tgttatctat cagttaccac aaatcgcaga 50580agttggagtt attggtattg atagccctaa gtggggtgag actggtaaag cctttgttgc 50640tttaaagccc gatcatgaat tgacggcaga agaagtcata gatcattgtt taaaaaatct 50700agcaaagtac aaaattccag agaaagttga gtttattgca gctcttccaa gaaatgctac 50760aggtaaagtt ttaaaaagaa cattaagaga tatgtaatat ttttaagcca aaaaaaaccc 50820agcctaagct gggtttttta tttaactatt taaagttatc taatgttaag cattagaaga 50880ttctttaaca gcaacattcc atataattaa accaaataga atcttgttaa caaagtcagc 50940aaggttatag ataaggttta agttaagagc tgatccaccg tcacccatca ggtaacctgt 51000gaaataacct acaggataaa tcgcccaacc aaagatgata atatacatca ttgtgttgta 51060agctgattgc acagcaggac ttgcagtatt acatgcagat tttccttctc cagcccataa 51120ttcataaatc atgtataccc aagctaaaca cccaataatg aatgcaggcc atgcagccat 51180gattcctgct tcacccatgt aaccaaacac aagcataaca agagaaccaa ctagtaattt 51240cttaaataat gatccagcaa cattagttgc agcagcaaga attaagtaga attcacatat 51300taatagagga actgttagta accaatcaat gtatctaaat acagttggcg aatcaccagt 51360ttcaatccat acccctctca tgtacatgta atgccagaaa gcaataccag taacaagacc 51420agatacagtt aatgatgttt tccattttgc agaaactcta tctctttcaa caaagaaaaa 51480tacagtagat gctaataaag cagcagtaac taaccaaaaa gaaacaccag tgtaatcact 51540agcatcaagg tcaccaccac ctgcagcaaa tgtaggaagt gcaataacac tacctaatat 51600cagtaataat ttcatatata actcctatat taatgacaaa gaagcataat tgcctccccc 51660ctaattaaat atgaaattta atagtaacca atattaatag aggttacaaa taaaaagcat 51720ctattttttt aatagaaaat atgtgtaata aaacaccata ctttaaatta atatttattt 51780tgcataagta gacttgttta gtatcatatt taacatgaaa gttgcaattt atcctggttc 51840ctttgacccc atcacaaatg gtcacacgga tattattgat aggggttgcg gactctttga 51900caaggttgtt gttgcaatag ctaagagtga atcgaaaaac cctcttttta gcctagagga 51960tagaattaat ttagcccaat ctatttttaa aggaaatgaa aaagtagagg ttgttggttt 52020tccaagaaag ttaacagttg atcttgcaaa agactatgga gcttgtgcaa ttataagagg 52080cctacgagca gtttctgatt ttgaatatga atttcagtta gcaacaatga ataggtcgct 52140ggctcccaat attgaaagca tttttttaac accaaaagaa agtctcattt atgtatcttc 52200tagcttaatt aaagaaatat cagacttaaa aggcgatata tcgaagtttg ttcatcctat 52260agtagagcag gcacttcgag cgagtagaca cttagctctg acaagcttca caaaaaaaag 52320tagctctttg attgacgatc gttttacaaa tagtgccttt gcatttattg cagggctcgc 52380cttctcttcc atatacattt aatttaaatt taaaatatcc tggactacca tcagctgagt 52440agaagtcttt taatgtagtt cctccaacct ctattgcttg ctctagtatc ttttttccag 52500cagctactaa tcttttacaa gcatctaaat ctaactcatt ggcatttttt agcggatgaa 52560ttttagctaa gaaaaggctt tcagacgcat aaatattgcc gataccgact acattttttt 52620gattcattaa atagctttta atatttgttt tagagtgact gcatcctgaa aaaaaatcct 52680ttgcattaaa atttttagaa agaggttcag gcccaaggtt ttttattaac ttatgtttat 52740cgatatcaga agtaagatgc attgatccaa accttctggg atcattataa ataattcttt 52800cctcatcaaa aattaattca atatgatcat gtttgataaa gaagttttca ttattttttg 52860caattctaag actcccagac attcctaaat gaagaataat ttttttatca ttagatagtt 52920taaaaattat atattttgct cttctttcta agctttcgac aacttgatct tttactgagg 52980tttcaaagct atccaccact ttccatctaa ggtttctgtt gtgtatcctt gcttctttca 53040gcaaagagcc tttaaatttt ttaattgctc tcagggttgt ttcaacctct ggaagttcag 53100gcatttttaa ctgagtaaat tattaatttt tactatgtca gctggagtaa ttgttcccga 53160tgcgagccct agtcttagat tggcaagtat gtaatcatat tttgcattag caagattttt 53220ttccgcgctg tataagtttt tttctgcctg caagagatca acaacgtttc ttgttccaac 53280tctgtagcca acttgagtcg cttccagggc actagtggcc gaaatcactg cttgtttttg 53340agcatttaca tttgcaacta atgttaaaac atttgaaaac tgggatctga cttcttgaat 53400aatccttctt tctgtaaata gagtattttc atttgctctt tcatactgtg aatatgcttg 53460cttccttctt gagttaacgg cgccgccttg aaagagtggc atacttagct gaattgcata 53520attccttctt cctgttactg atggaactgg aataccttgg ccattgatat taaaaccttc 53580atagttaaat tggtttgttt cagattctga ctgacttcca acaatgtcta tcttaggtaa 53640atgatttgaa gctacacttc ttgcactgct tttcgctgct ttctttctca aatatgctgc 53700ttttaactgg tagttatttt ccaatgctaa ttcaacccat gtctcttttg aacttggtgt 53760tggcaggtca ataagcaaac catctcccaa ttcatttaag ctgaatattt ctctaccaat 53820cagagcattt aaagactctc ttgcagaata aagtgatcct tctgttctaa ttcttgaggc 53880tttacttaga tcaaatgcca attgagcctc ttgaactcca gttatggctg ataacccaac 53940atcgaatctt tgttttgctt gatcaagttg ttttttaata gctttttctt cagatattgc 54000tgcatttaga ttatcaatag ctctaagtac gccaaaataa agctcagcag ttcttactaa 54060aagattttgc tgctcaaatg caaagtctgc ttcagcagca tctgtaagag atttagattg 54120cctatattga aaccatgtat ctagtctaaa gagcggctga gtaaccctgg cagatgtaga 54180aaaagagtta tattgctgct gcagttcttt gttttgatag tattcgttcc agttagttga 54240tccactcaag gtaatactcg ggagaagcgc agctcttcct tgaaccttaa gctctttatc 54300tgctaaatat gaatattccg ctgctttata tgtagggtca ttctcaagcg cttcattata 54360gatatccaaa agactttcag atgaaatatt aaatgagata aaaagtgcta aaaatgattt 54420tgtgtaaatt ttcataactt attttaacct attaataatg tttgcagtgc aaacatttat 54480tttttttaat tttattgaat ttattttata tttcttagag tagaatatct ctaacaagtt 54540caataatttt ttataataaa aaggctttaa aaattggcta aaaattcata tgacgctcag 54600gcaattgaag tcttatctgg actagatcct gtcaaaaaaa gacctgggat gtatacggat 54660acatctaacc caaatcattt aattcaagaa gttcttgata attcggttga tgaagctctt 54720tcaggttatt gttcaaacat aaaagtatct gttctaaaag atggctttat taaggtctct 54780gatgatggaa ggggtatgcc aattgatgag cacccggaac ataaagtttc aggtgttgag 54840ctcatccttt gtaaacttca tgctggagcg aaattctctg gagatgatta taatttctct 54900ggtggccttc atggtgttgg agtttctgtt gtaaatgccc tatctgatga attagaggtt 54960agagtaaaaa gagattctaa agaataccaa atcactttta ataatggaga taagtcttct 55020gaattaaagc caattgggga agtggggctc agaaattctg ggacatcaat taaattcaaa 55080ccaaatccta catattttga aactatagag attcagataa aacagcttaa gcatttatta 55140aaggccaaag cagttctctg tcctggatta acgatagagt tcgttaatga aaaaaagact 55200gatgataaac aaaagtggta ttttgaagat gggctcaaaa gctatttgat tgattcttcc 55260gagggagcag acttggtttt gctagattca attgtatgct ctaaaaaatc tcaagctcaa 55320gagcttgaat ttgcaatcaa ttggtcatta agacccccaa aaaataaact cgatgaaacc 55380tatgtgaatc tcataccaac tgctcagggt ggctcacatt taaatggctt taaggctggg 55440cttttagatt cattaaaaga attttgtgaa tacagaaatc tattgcctaa aggtttaaaa 55500attaatgcag atgatgttct taataatgca atttttataa tttcatctaa gcttcagaat 55560cctcaatttg cagggcaaac caaggaaaga ctagattcaa aagatcacat gtcgttcgtc 55620tcaagtacca caaaagacat tttaagtatt tggcttaaca ctcatacaga agagggcgaa 55680agaatagcag aacttgcaat tatgtctgct cagacgagag caaaagtttc caatatagtt 55740gaaagaaaga aaacttttag aggcccagcc ttacctggaa aactttcgga ctgtaatagt 55800caggacttaa atgaaacaga gcttttttta gttgaggggg actcagccgg agggtccgca 55860aaacaagcaa gagaaagatc tttccaggca atcatgcctt tgagagggaa gattttaaat 55920acttgggact tagaaagtgc agaaataata aaatctcagg agataaaaaa cctatcaact 55980gcaattgggg ttctgccagg aaataatgac ctttcatcac taagatacgg aaaaatttgt 56040attcttgcag atgctgattc agatggtctg catattgcaa ctttactttg tgcattgttt 56100ctaaggcatt ataaatcttt agttcaagag ggaaggatat atatttcaat gcctcctcta 56160tatagaatcg attctggtaa agatgttcta tatgcacttg atgataaaca gcgagatgaa 56220atagttactg aatttaaaaa gaagaagggc aagcctaaag taaacattca aaggtttaaa 56280ggacttggtg aaatgaatcc acctcaacta agagagactg tgatggaccc tgctactcgt 56340cagcttgttc agctttctat cagctcaagc gataatgcaa attctatgat ggacttactt 56400ttgtccaaaa agaacgcacc agcaagaaaa gaatggcttg aaaagaaagg gtctctagca 56460aaaatataaa tatgaaagaa caaataacct caattagcct caagcaatat gctgaagagt 56520cttatcttaa ctatgcaatg tatgtcattt tagatagagc tttgcctaat attggagatg 56580gccttaagcc tgttcaaaga agaatactct atgcaatgtc agagcttggg cttgatgctg 56640gctcaaagta caaaaaatca gcaagaactg ttggagatgt tataggaaaa tttcatcccc 56700atggagacgg cgctgcatat gaagctatgg ttttaatggc tcaaaatttc tcattcaaat 56760acccttttgt agatggtcaa ggtaactggg gttctcagga tgatccaaaa tcttttgctg 56820caatgaggta tacagaatct aagttaacta aatttgcaaa tcttttaatc tctgaattga 56880agtctggaac agtcgattgg cagcctaatt ttgatggctc tcttttagag ccagtaattt 56940ttccagccaa actcccatct attttattga atggcacttc tggaattgct gtaggaatgg 57000caacagatat tccatctcat aatattaatg aaattattga tgccacagta catcttattg 57060ataatccaaa atcacagttg gttgatttac tcaagataat taatggtcct gatttctcaa 57120ataattcgcc aataattgct agcaaagatg agctgaatga aatttattcg actggaaaag 57180gcggcttcaa agctcaagcc caatgggcgc aggataagaa tcaaatcatt atcaacgcat 57240taccttatca agcatctggg tctaaaattt tagagcaaat agctgatcaa atgcttaaga 57300aaaaaattcc aatggtggtt gatcttactg atgaaggaga ccacaaggag ccagtaaggc 57360ttgtcataac tttaaaatcg aacagagtaa atgctgaaga tgtaatgaat cacctttttg 57420catcaactga tttacaaaaa aattatagag taaatatgaa tttgatttca ttgaaaggtg 57480gaccaaaagt tttctcccta gttgatttat tgaaagaatg gctagtcttt agaaaagaaa 57540ctgtaataag aaaactagaa catagactcg accaggtaaa cgataggctg catatccttg 57600aggggttatt aattgtttat ttagatttag ataaagtaat aaagattatt agagaatcag 57660atgagccaaa aaaagatatt attacagctt ttaaactttc tgatatccag gcaaatgcaa 57720tccttgagat taggttaagg caactagcca agctagaaca aatcaagtta gaacaagaaa 57780gagatacttt ggtagcagaa caaacagata ttgaaaaaat actaagctct aagacaagac 57840ttaaaacatt aattaaaaat gagcttattg aaataaaaga tgagtttgga gaagtaagam 57900aatctccaat aagagaagca acagaagcca aagttttttc tgaagaagaa actcttgtca 57960ctgagcctat tacagtagtc ttgtctgcag ctggctggat cagaagtgca aaaggccatg 58020agatagaccc cagctcactt tcctatagag gagaagatgt acttcaagat tatggaagag 58080gaaagagcaa tcaagtttca gtttttcttg actcaaatgg gaaggcttac tcacttgcaa 58140gtcactctct tccatctgct agaggaatgg gtgatcctat tacaggaagg gtatctgcag 58200attctggagt aaagttcatt tcttcattga ttgggaatga tgaagataaa ttcatgatta 58260tgaatactgc tggatatggc tatatttcag agtttaaaaa tatggtttct aataagaaat 58320caggaaaagc atttatgaaa atccctcatg aagcagacct tcttaaagcc attaaagtaa 58380gagacgatca tttgtatata gcagcagttt caaatattgg cagactttta atttttaaga 58440ttgatgaatt gccaactctt ggaaagggca aagggaataa aataataaat ataccaaccg 58500ctaaatttat agcaaaagaa gagttaatga cccatgcgca acttgtttct gaggctagct 58560ctttaaggat tgaaagcgga aagagattcc tcactttaaa actcaaagat ctagaaaact 58620atatttctac aagagcaaaa aggggaaata tgcttccaca aggatatagg aaggtagata 58680aaatgattga agaggttgag ttagaagtta aagaagactg attatagatt tctcaaaaat 58740ttttaatccc tcatcaacta gctcattttc tattattagg cttggagaaa acctaacagt 58800agatgcattc gccttaagaa tcattaaccc attatgatgg gattttttta ttaaatcatc 58860tatttgaatt ttgctatcct tgcttacttc cactccgacc cataaaccag cagaagttat 58920tttttcaaaa catttgtgct tctcatttat tttattcaac aaattgataa atctaacttc 58980tttctttaaa accttattta aaaatgattt tttagaaatg gtatctataa cttcattacc 59040tatggcacat gcaattgggc ctcctccaaa agtcgtgcca tgagaccctg ccgacatatg 59100ttttgaaact ttatctgaag taagaatgcc tcctagtggg aagccattag atataccttt 59160agcaaaacaa agaatatcag gagtaatgtt aaattgctca taagcaaaga gtgttccagt 59220cctgccaatc ccggattgaa cttcatcaat tatgacaagt gccttatgtt tttttgcaag 59280tttctttatt tttgctataa attttttatc tgcttttgtt attccagatt gccattgaac 59340cagttctaat ataaccgctg cagtcttatc tgaaaaaacc ttttctagat tagtaatgtc 59400attatatgga tgatttttta tccctctagg cagaggcgca aaaccgtctg taagatgctt 59460agctttagct aaagctatac caagcatagt tcttccatga aatgacgttg aaaatgaaat 59520gacctcattt ttatttttat taactgttga tgagcaaaat tttcttgcta tcttaactgc 59580agcctcaatt gattctgccc ctgaattgca aaaaaatact ttatctgcaa aagaattctt 59640gcataacttt cttgccaaag taactgaggg ctcatttatg tagagatttg ataaatgcca 59700aagctcctca gattgctttt taagaatttt tattagatct ttatttgaat gaccaaggtt 59760agtgacggca atgcctgctg tgaaatcaat atatttctta ttatttaggt cccatacatg 59820cgatccactc gcttttttta caacaaaatc tgcaggagcg taaaaaggca ccatataatt 59880agtatattct ttaattatct ctttcataaa tatattgtat cgtggtttca gttcaaaaat 59940taaatgacca aaaatttcta gtatctttaa aaccaaacag ctcactcatt ggctttaata 60000gaataatatt tatatctagt atatctattg tatgcggtgg catagcattg atatttttct 60060tttttggagc aacgctcatt ttgccttttg ctggtttaga gcttggtatt ctattcactg 60120cattctattt aagttttaaa tggagtgata aaaaagaaaa aatatttatt tctcaagatc 60180ttgtaactat

agaaaaaggc tctaattatg ctgaatataa atgggaagag ttcaggtcat 60240ttacctcttt ccaggtttca aaagacagaa gagatcttct taagctaagc tttaggtcta 60300aaggcgagga tgttgaggtt ggcagctttc taaatgaaga tgataaaaat gtattaatag 60360aagagctaac tcagatcata gatacattaa atcacgattc cttctcaaag ccagagcttt 60420aatttctttt taacttcttt tagctttaat tcttttattt cagggatgcc atttctaaaa 60480tttggaaatg attctccctg tattaatggc ataagatatg caattgcttt tggggtaacg 60540tccattccat ttttagctat aaaagatttt ggtaattttt tttctaaatt agctattttt 60600gccagtggag ctggttcaat tttccaccta tattttttag ctttacctct aactattatg 60660ggcataacac cattcatgcc ttctactgca tactgaactg ccttggcccc aacagccata 60720gcttgctcta ggtctgtttt ggaggcaata tgtcgtgcac ttctttgtag atagtcagaa 60780acagcccaat gattttttag ttttaattta tcagtaatta agttggcaat gtaaggcgca 60840actccaccaa gttgagcatg accaaatgca tctattgttg ctgattcaga aagaaatctg 60900ttcttattat ttttcaaccc ttcggatacc acaacaacac agtagccatt ttttttaaca 60960acacttttaa cttctgctag aaattttttt tgattgaagg ttatttcagg taataaaatt 61020atatgaggtg catctccttt ttcttttctt gcaagagaag atgcagccgc catccagcca 61080gcatgcctgc ccataacttc taatataaaa acttttgttg atgttgctgc cattgaccga 61140acatctaatg aggcttctag agttgatgtt gctatatatt tagccgccga cccaaatcca 61200ggacaacagt ctgttaatac caaatcgtta tcaacagttt ttgggatggc tatacagtta 61260attggataat ttaatttttt acttatctgc gaaactttaa atgcagtatc agctgaatca 61320ttaccgccat tgtaaaaaaa atatcctata ttgtgcgctt taaaaacatc aataagcctt 61380ttgtactctc tttcgcttga ctcaagatct tttaacttaa acctacatga cccaaaagcc 61440ccaccaggcc tatatttcaa tgactctaat gcagatattg attcttttga tgtatctatt 61500agttcctcat tcagcgcgcc caatattcca ttcttgcctg cataaatttt accgatgtct 61560ttatgcttct tggcctctaa aatcagagcg ccagctgtag catttataac tgcggtaacg 61620ccgccggatt gagcataaaa cgcatttttt ttcatttttt ctccatcaaa tgtactgaaa 61680taaaaatatg taaagtatta taactaatat gcgaatacat atcttaggga tctgtggrac 61740ctttatgggc ggccttgcta agatacttaa agagtcaggg catgaaatat ctggatcaga 61800tattcaattt tatcctccca tgtcagacta tcttgatagc tttgatattg aaatgatcaa 61860gggctatgac ataaaaagca tgccagatgc tgacttgtat gtgattggta atgcactttc 61920tagaggaaat gaatccgttg agcatatttt gtccaattct cttccgttta agtcaggccc 61980tgaaatgctt ggagaaattt taaaaaataa aaaggtttta gctatatctg gaacacatgg 62040taaaacaacg acttcatata tgttgactca tattatgtta gatcagggta gggatgtcgg 62100ttttctagtt ggtggaatat caaataatat atctggctct gcatgtcttg ggtcagatgg 62160aacttttgtg attgaggcag atgaatatga ttctgcattt tttgataaaa gatcaaaatt 62220tattcactat tcaccaagca ctatagtcat caataatatc gaatttgatc atgctgatat 62280ttttaataat ttagatgaca ttaaaagaca attccatcat ctaattaaaa taattccatc 62340aaatggaaat gttgtttatt ttgctgatga taaaaatata agagatctta tcgatatggg 62400aatttggtca aatcagatag caatcaataa taatgctcat tcaattgagg cagtttattc 62460tgataagact ctaaaatatg aagaaagtat ttattcatta aatgagttac ctttaatagg 62520agaacataat tttaaaaact acatttcggc tattttggcg gcaaagacag atggaattca 62580aattcaggat tctatcaaat cattagctag ctttgatggc gtaaaaagaa gattagaata 62640taaaggaagt tttgatggca tagaaattta tgatgatttt gctcatcatc caaccgcaat 62700agaattttct tctaatgccc tcgtaacaca aaatccatca aaaaaaatac ttggtctcat 62760tgagctaggc tcaaacacta tgtctggggg ttctcatggc ttgtctttgg ttgaatctgc 62820aaaatcttta actcatgtta tctggctaga tcgcaataat gttttgtcag agaatgctag 62880cattgaatct actaacacta ctgaagattt tatttctgca gcgatatctg ctttctcaga 62940ttatgatatt gttattttga tgaccaataa agacagccaa aaaatattaa aacccattgt 63000agatcacttt gaaaaataat aatttaccag tttttccttt aggaatagtc gccctcccag 63060gtagcatcca atctcttcaa atttttgagc ctaggtatat acagatggtt aaaacatgtc 63120tatccaagaa ccatggattt gtaattgttt ttaatgccaa taatgagtct caaggcgatt 63180tcactttttc taagaaggga agttttgttg aaattataga ctttaataat ttgccaaatg 63240gccttcttgg gataactgta aaatctataa ataaggtgat aatcagtaat atatgtcaat 63300tagaagatgg actgcatatt gctgatatta aggcacagat agatccagag gtagatgatc 63360aagctgtttt ggcagaatat cctgagatat ctagcattct ttctcagctt gtaaagcatc 63420ctaagattag tgacctgccc atccaggttg actttggctc tgctgattca gttgcatacc 63480acttagcagg ccttatacct ctaagctcaa atgagaaaca aaaactatta gaagcattcg 63540atgcagcaca gcggatgaga attctttcag actatattga aagaatatct actacataaa 63600ttatttattt taatattatt ggcggcttat tagattttgc tctaagccaa ttgatggact 63660taaaaaccgt agggatttct acaaaccttt tttcaatata gtatttgcct ggatagttac 63720taagcattag cccaataata atagtgaaca aacctggcca ggtaacacca gcatcatgat 63780tcccccaata ataagggaat aacctaaaat atttttaatc agtaaaacta aataccacaa 63840caacgggtta ttggtcttaa atttcgagac atctttcttt ataaaatagt cacttggaat 63900taatccagca agccacctca tactaacgag actaaataaa aaaataaata aagatattga 63960gctcagccaa agtataagaa ctttataaga ttcaaaaaaa attatgatgg catttagggt 64020attaaattcc ataattaccc acctttttat ttaataatga tttagtatag caatgtctaa 64080aaataaattt aaacattaaa taaatataca ataattttgc caaaaaaagc cttaaatgat 64140atttgcactg aaaccctcag tgataataaa gccgaaaatg ttctatcact tgatatcaaa 64200ggtatttctt cttttgctga caacatcatt attgcaactg caaattcgaa taggcatgca 64260aagtctttat ccgaaaagtt agttgaggag ataaaagcta ataaaattag tatcatgggt 64320gtagagggca agacagaatc aggttggata ttagtagact gtggtgaggt tgtagtaaat 64380attatgaaga atgacataag agagttttat gatttagaag gtctttgggg tgaaaacacc 64440ctcatcgatt cttcgaaata aatgctatta aatatcataa gtgttggaaa ctcaccttca 64500tcttgggagt taacaggaat agaatattac acaaagcaaa tccctaaaga agtaagtcta 64560aattttatta acgtaaaagg gcagcaacac ccaaaaagat ctacagaaga ggtcttaaaa 64620cttgaatcta agttaatctc gtccaaaata gattctaatg gatatattgt ttgctgggac 64680tcaagtggcg agtcgttgaa taattttgaa tttagtaaat tttttgaaaa atccatgctg 64740gaaaatatga agctttactt tgtaattggt ggttcatttg gcatacctcg agatattctt 64800gataaatcta ataagataat ctcgatctca agtctcaccc tacctcacag gcttttcaag 64860attgttctta tagagcaaat ctataggtca ttttcaattc tcaaaaatct tccctaccac 64920aaatgattga tctgaatgag aggtttgttg aaaaaagaag ctttttcaat agattgttgc 64980ttatatattt tttctttggg gccctgtttt tattcttgct tttcaagacc tattctttac 65040aggtttctag ctattctgac tatgaactag cagcactaaa gaataaaaca aaagaagtgc 65100tagttcagcc tgttcgagga gtcatctatg acaggaatgg aaatattcta gttaataatg 65160tccctacata tgacctaata atccaaccat caaaaattaa aaacttagat gatttcatta 65220ttaatatttc taaaataata gagctatcag attcagaaat tgaaaacatt attaaaaatt 65280ttaaaaggag tgcaacttat aatagagagc taacaattaa aaaagatctc tcgaaagaag 65340aaatagcaaa atttgaagta agaagctacc aatttcagaa tgcatttata gatgttaggt 65400atagtagaga aaataaatat ccttacttat tttcacacgc actcggttat gtaggtggtg 65460taagtaatga taaggtttta tcaattttaa aaaatcaaga tttgaagcaa tctgaaacaa 65520cttttaagta ttcaggtggt tttattgctg gaaaaacagg attggaaaat atttatgatt 65580cagctttaag agggtccttt ggaaaaaagt tatttgaagt tgatgcaagg ggaaggcttt 65640taaaagagct aagttttgaa aaacctatta atggaaaaag cctttttact catttagatt 65700taaattcaca aaaaaaagcc tttgaacaaa tgaataatag aaggggtgct gttgttgcct 65760tagagcttaa atctggttct atagttacct acctaagcac tccaagcttt aatgtaaatg 65820gtctttctaa tggcatgtcg tcggtagagt tcgaaaaact gattaatgat gtagacaagc 65880cattttttga tagggccggt caaggtcggt actctccagc ttctacaata aagccagcaa 65940tagcattgtt tggtattaaa gaaaaaatag tagactggaa ctttacactt aaagatcctg 66000gattttttgt attaccagag gatcagagga tttatagggg atggaagaaa ggaggtcatg 66060gaacaattga tatgaagggt gcaatcatag aaagttctaa tacttttttc ttttctcttg 66120cttataaggc tgatattaat aatttaatca gccatctttc tgagtttggt tttggcagga 66180atgtttgtaa agattgtttt aatccagatt ctggattatt gcctacgcct gaatggaaaa 66240tgaataatct taattttggc tggtttaaag gagataccgt taatttaggt gttggtcagg 66300gctatatgag tgccactcca gttcaattag catattactc tgcatttctg gcaaaaaaag 66360gaaatcttca agagctatca tttgtcgaga gtgacagcct aagcaatact gcttttataa 66420ataatttaag catagataac tcagactggg atcaaattca ctcaagcatg attggagtta 66480ttgaagatcc gagaggcact gctaaaagat taaaaccttt aaaatcatat gttgttgctg 66540caaaatctgg aacggttgag cttgtcagca cacaaacaaa ggaagattac aaactagtaa 66600ggcaaaatat tggtaacaga gaccatgcaa tcattgtggc ttttggtcca atgcctgatc 66660cagagtatgc agtaagtgtt gttattgaaa atggtgaaag cggcggttct gttgcaggtc 66720ctgttgctat tgctgtttta aatgagctta tcaataaatg aagaaaaaat tagactttaa 66780aaactttagc atttattttg atcaatattt atttattgcg ataaccctgt tgtcggtcat 66840gggtttattt tttttataca gcgcatcaca agaagatatc agcactgttg ctaagcaagc 66900tgtattcgtt ggttttggtt tgctgttaat gtttgtagtt agccaacctg accctgattt 66960ttataataca ttttctgggt tattttttgg ggggggastt gtattgattt ttctaactat 67020gatttttggt aaagaaataa atggagccaa aagatggctt gatttaggat tttttaccct 67080gcaatcctct gagattatta aaatatcatt gccaattttt ttatcatcat atttatataa 67140taagccactt ccaataagca ctaggcatac ttttattaca ttgatattaa taggttttat 67200atttgcactt gtagctagac aacctgactt gggaaccagt ttagtagtat ttatgtcagg 67260gggttatgta ttatttttag caggattaag ccgccgttta ttggatctgc aatagcctta 67320tttttattat ccttacctbt tttatggaat aactttctag agccatttca acaacagaga 67380gttttaactt atttgaaccc agacgcagac ccttatggta ctgcatggaa tataactcaa 67440tcaaaaatag caattggatc aggcggaatt aatggaaaag gttatcagga gggctcccaa 67500gcccatcttg atdttttacc agagacagaa acagatttta tttttgctgt tattgctgaa 67560gagtttggtt ttattggagt ctgtattttg ttatcagtat ttdtctttat atkactcaga 67620tgtttatatt tagcatttaa tgcaagagat agattttgca ggttaactat aggaggccta 67680agtttagttt ttgcctctac attatttatt aatttagcaa tggttgttgg tgtagttcct 67740gtagttggta tgcctcttcc atttatcagc aaaggcggct catctttgct atcctkttat 67800atagcttttg ggattataat ttctatggca acacataaaa aattaatgca aagatgaaaa 67860aaattatatt tataacttta atattcacta tttcaattac tgcggattat tcgaatcatg 67920aagatagcca aatggtaata aatgaactcg tcacaaaaca cggttttgag gaatcttatg 67980ttactgcaat cttaaaaaat gcaaaaaagc gtgatgagat gcttaaatct gttgctaatc 68040ctgctgaaaa aacaaaaaca tgggatgaat atagggctat ctttataaaa acaaaaagag 68100tttcagaagg caaaaaattt ataaaaaaaa atattaatgc tttagagagg gctgagaaag 68160aatttggagt tcctaaagaa ataattactg ctattttagg cgtagaaact aattatggca 68220gcaataaagg tggatacaga gttttagata gtcttactac tttaggtttt gatgacccgc 68280gaagatctaa tttttttaga agagaactta tagagttctt ccttttaaca agagaaaata 68340atttagatat caaaacaaca aaaggctcat atgcgggagc catgggatat gcacaattta 68400tttcgtcaag ctaccgagcc tatgctgtag attttgatga agatggttac gttgatttat 68460ttaattctgt tgatgatgca attgggagta ttgcaaatta tctttatgtt catggatgga 68520agagagaagg aaagatcgta acaaaaactt atccaaacaa tgttagaaaa ttttataaac 68580ctcatgagtc tctaacaagg ttcatacctt taatctttaa tgaagatgga aaagatcttt 68640tttttattgg tgatgataac tttagggcta ttgctaagta taatattagc gatgtctatg 68700caatggcggt ttattactta tcagaagagt taaaaaaatg aaaaaattat tattcacatt 68760actatcaact tcaatattta ttcaagcaca gagttttgtt ccagattctc ctgagttaga 68820tctgaagagc tatatcctaa ttgagccaaa caccaatact gttattgcag aatttaattc 68880agatttggaa atagaaccag ctagcatgac taaaattatg actagctatg ttgtggctga 68940tcagattgca aatggtttaa tatctcttga tgacgaagta ctaattagtg aaaaagcatg 69000gagaatgcaa gggtctaaga cctttataga agctggaaaa aaagttacgg tatctgatct 69060tttaaaaggt attatgattc agtcaggcaa tgatgcctct gttgcaattg cagaatatgc 69120aggcggcact gaaagaggct tcgttgactt aatgaactct tatgctgcct ctttggagat 69180gaataatacg atctttcaga attcaacagg gcttccagat gaaaatcatt tttcatcagc 69240aaaagattta gccaatctga cagctaacta cattaataaa tttccagaag aatatgcttt 69300atataagcag aagcaattta cttttaacaa tattaagcaa ttaaatagaa ataagctttt 69360atggagagac gactcagcag atggtgttaa aactggtcat acagaagctg ctggatactg 69420tttagttggc tcagcaaaaa ggggcggtat gaggcttatt acagttgttg caggaagtaa 69480gtcagataat gatcgttttt tatcttccca aagattactt gagtatggat ttagattttt 69540tacaacacag aagatgttaa gtgcagaaaa agaatatcag aatattacag tttggggtgg 69600acaagaaaag atacttggcg tgggagttct agaggatata tctattactc ttcctagaac 69660aagtttcaaa aatgttgaaa ctgtttataa agttaataac aatatccagg ctcctatcgt 69720agttggtcaa aaagtcggga ctctagaaat aattagtaat gatgagattg ttctagttac 69780ggatttggtt gctttaaaaa atattgaagc taaaggtttt tttggaagaa tatggtcaaa 69840gttcgttctt tggatattta gcttatttgg tctaacagat gaaaacacca cttaaaggcg 69900tttttaatgg ggcttttgat actgtagata acataaaaat atcacctttt tctcgagctt 69960atactttttc tgatagcgta tatgaagtag ttcctttctt taattcaact gcaattgctt 70020ttgatgatca tataaaaaga ttagaatttt ctgctagtca actagcgatg gccgtagatt 70080taaaagaagt tatatttgaa attaattctt taataacatc atctgaattt agtaatggct 70140atgtctatta tcaagttact cgaggcgttg atccattaag gtctcatatt catgagccaa 70200atttaaaaat agaaaccttt ggctatgcaa aggcgcactt attccaatgg aaaccattaa 70260gagtatcagt atgtgatgat attaggtggg ggaggtgtga tattaaatca acatcattgc 70320ttggtaatgt tatgaacatg aatgcagcaa aactagataa ttgtgatgaa gttattatgc 70380acaaagataa tttgttgacc gagggtggcg catctaatct tttttttgta aaaaatgatt 70440caatctgcac cccagctcta aatggaaata tacttcccgg cataaccaga gcattactaa 70500ttaatgaatt aaaaaattat agtatagagg tcatcgaaga taacttcagg ctagaagatt 70560tatcaaatgc ttcttgcgcg tggctcacaa gcgcaacgaa agggctggcc cccatttctg 70620aaattagtaa cctagaatct cacttggatt tataccatcc tcttttcaaa aagagtgaag 70680aaatttttaa taagaagttc cttagttaga agtcagttta ataactatat tatcaagctc 70740gtaccaaaaa tctttatcac taagaccttt atttgcaaga tcaagctcaa atacattttt 70800ttggagctgt attaagtttt gtagtttatg atttttaata aaatgcatat attcagaagt 70860tttatttttc caaaccccag atttaattag actatctaca ggatttttac tttgtctagc 70920attggctgca ctgttaataa ttttaccaac tatccaaact agaagaggtg cataatgatc 70980ttccgctgat tttattgaat ggataatttt taaagcttgc ttggtattat ttgaaataat 71040tttatcttca agctcaaatg gcataaattc tgcagactcc acagaaggtt ttctttcttg 71100gccatcacca tctttataag tcagtcttag aagctttact tcattctgtt gggcaactaa 71160gtttccagag ttcatttcag atatattttg cactaaagag ggcctatctt tttcagaaat 71220aaaagatagt tgatgcttaa gccaaatctt ttcttcaaaa gatttaagtt ttccacaatc 71280aatgatgagg gctatctcat ccatctgttt tacccactta gttgtgaaat taagcttttc 71340attgcttgaa ttaataatta ttgcaatatt atccatatgg ttaatatttt caatttgaga 71400tattttaatt atttggtcag gtatttttcc ttgatcatga ttaatctcaa ctattatttt 71460tgagccaaat aaagatccag aagcattttc aataatagtc tgatctattt tatcaaaccc 71520atctttatta ataattgttt tttctttgaa ccctttattt gataaatgtt tgagcaaaag 71580atctttagag ttatttttta agacaacctc agatccaaaa ataaagaaga tattttgtga 71640ctgatctaaa tatttttgag ttgtcattgc ctcacatttc aaaagcatgc acctcaaaaa 71700ttatttgatc aattaattcc ttctgcattt cagactcaag agatttaatc atttgatctt 71760gtgcaaatgg attaagttca ttcgatttat atctcttgta gctaacaagc tttttactta 71820tttgtttatt tccaatcatt tttatatgta tttctaaatt aatttctcct tcaagagctc 71880ttagagaaga gcctccatag atatcatatt tatttaaaat atagtcttgt attagaatct 71940gattttgacg attctcagct tctttttctg tattgaagag gattgctaac ttttgctcaa 72000aagaattagg gacactacta tcaaaccgaa aattaaatag atctttttta tctagatcta 72060tatttagttg attaaattgg cagccagtta aagatatcaa caagcaagtg tataaaaaga 72120aaggtcttag tttcataacc tcatagtatc tatctgaaga atatatatca aagattaatt 72180ttaaataaca aaatttatta ttttttcttt aatataaata gtttttttaa tagccgagtt 72240gtttattgtt gatgccacat tgtcgatagc cagtgctaaa gcttctatat ctttttgctc 72300aaggtttttg tctatcatta ccttacctct cacctttcca tttacttgaa ctattaaatt 72360aaattcagaa acttctaata actcctctct aagaacaggc caagaagatt caatttcttc 72420ttgtgcaaag tcaaagtaaa aattattcca taaatgttgt gagatatgag gcgcaatagg 72480attgagagtt ttaagaataa ttattattgc ttcattaaga caatattgat ttgagatcga 72540cgcattacct tctttaaaag aatcaggtat aaagttaact agctccatta tcgaagcaat 72600ggctgtatta aatgaatatc ttgtttcaaa gtcatttgta acctttttaa gagtattatg 72660cgattttctt cggagctcta cttcttgttt cgaaggatcc ttgggctcct ctaagtcaat 72720aaattttcta ttgctgacta agttccatat ttttttcata aatcttgagg caccttctac 72780agatgactct gaccattcta gactttgctc agggggagct gtaaacatca tgtaaagcct 72840tacagtgtca gcgccatact tttctatata agattgagga tctacagtat ttccttttga 72900tttggacatt ttagccccat ctttaagaac catgccttgt gtaagaagct ttttaaaagg 72960ctcattgcct tcaactaaat ccatatctct aagcgcctta tgaaaaaatc ttgaatatag 73020taagtgcagt atcgcatgct caatacctcc tatatataaa tcaacaggca gccaatattt 73080tgaattttta tcaaacattt catctgcatt gtcagatgaa gtaaatcttg cgtaatacca 73140tgatgagtcc ataaatgtat caaaagtatc tgtctctctt tttagtcgat cagaaatatt 73200ataaaaatct tcattttgac ttaagggaat aggtgccgag ttctttttta gctctggaag 73260cactataggc atatcctttt catcaataac tcttggttca ccattttcat aaacaaccgg 73320aatagggcaa ccccaatatc tttgtctgct cacaccccag tctcttaatc tgaactggat 73380taactgctca ccaaggtttg cgtctgctaa atctttaata atttcaagag atgcctcatc 73440agagtccatg ccatcatatt tatcagagtt tattaatttg cctttttgta caactggaag 73500ttcattatta ttatcaaaac taataacttg aggaatctct agattatatt tagatgcaaa 73560ttcgaaatct ctttgatcat gagcaggaac acccatcact actccagttc cataatcaag 73620aagtacaaaa tttccaatcc atacgggtat cttcttcttt gtaattggat gaattacatg 73680cattccacta ttaattccaa gtttttcagc tttagccata tcagcttcag cagctttcac 73740ttctttgcat ttgtttagaa agtcttttat cgattcatta ttttttgaca accctattga 73800tatagagtga tttggtgata ttgctaaaaa agaaacacca aaaattgtat caggtcgtgt 73860tgaaaatacc tttaaagaat catcggaatc ctcaatcatg tatttaattt ctgctccctg 73920agatttccca atccaattcc tttgcattgt tttaacattt tcaggccaat ccacctcatc 73980caaagatgtt aaaagttcct cagcgtagtc tgtaattttt ataaaccatt gatcaatttc 74040ttttatttca acttgcgccc cagacctcca tccttttcca tctataactt gctcatttgc 74100taaaactgtc tcatcaactg gatcccagtt cactaatgat tttttcctat aaaccaaacc 74160tttatcataa aactttttaa atattaactg ctcccactta tagtattccg gttcacatgt 74220tctaagctct tttgaccaat catacccaag gcccaaagat ataagttgct gtttcatatg 74280ttcaatattt tgattcgtcc aatcttttgg gctaacttta ttagcaatcg ctgcgttttc 74340agcaggaagc ccaaatgcat cccagcccat aggctgaaaa acattaaagt tattcattct 74400tttatatcta gatatgacat caccgatcgt atagtttctt acatggccca tatgtaattt 74460acctgatgga tatggaaaca ttgataggca ataaaatttt tctctaccat caggattagc 74520tttaaatttg tcttctttta accattcatc ttgaatggtt ttctctattt gacttgggtt 74580atattcagga ttcatttttt taggaactca ttttctaagt aattaatagt atgtttattt 74640tctttaaagg tatcatcatg cagaatcctt tgatgaagag ggtgatttgt tttaatgccc 74700tctataaaaa attcatctaa agcactcagc attctcttaa tagccgagct tctagaattt 74760gcctgagtaa tgatttttgc tagaagtgaa tcatagttag ggggaactct atatcctcca 74820taaatatgtg agtcatatct tatgccaaaa ccgccaggtg tatgcatttt tgtaattgtt 74880cctggggatg gttgaaaatt atcaggatct tctgcattaa ttctgcattc tagtgagtgg 74940ccatgaaaat taatatcatc ttgatttagc tcaattggca tttcaagagc aattcttagt 75000tgtgctttta ctaaatcaaa gccagttatc atttctgtta ccggatgttc aacctgaatt 75060cttgtattca tttctataaa atagaattga ttatcttcgt ataaaaattc aatagtgcca 75120accccttcgt aatttatttc ttcacataaa ttgacacaag ctattagtgt tttattaaga 75180gcttcttgat ttatattaag tgctggagct tcctcaatta ttttttgatg tctcctctgc 75240atactgcaat

cccttgtacc taaatgtatc gcctttcctt taccatcacc aactatttga 75300acttcaatat gtctagggtt cccaataaat ttttctagat aaatggtttc attgccgaat 75360ccatttttag cttcctgcat tgttatttct gcatgtccaa taagatcttc ctctctttca 75420acaactctca ttcctcttcc accgccacca gccgttgcct taatcataat tggatagccg 75480atatctctag caattttttt aaactcatca ttatcagatg gaatctcatc tttataacct 75540ggaacgattt gaattccaga tttttcagcc agtgtttttg ctgttatttt gtcgcccatt 75600ttttggattg tttctgaggt tggcccaata aatttaaatc cacttttttc acacatttct 75660gcaaaattat gatcttctgc aaggaaccca tatccaggat atatagcatt agcacttgta 75720agttctgcag ctgagagaat ggcaggtata tttaggtagc tttgagttgg agatgcaggc 75780ccaatacaaa cagtttcatc agagaatctt aaatgtttta gatctttatc agcttctgag 75840taaacagata ctgtcttgat accaagctct ttacaagctc ttatagctcg aagagcaatc 75900tcaccacgat tggcaataag aactttataa ctcattttaa ttgacagtta taattttttg 75960tccaaattca accggacttc catcttccac gtcaatactt gaaatcttgc catcaaattc 76020tgatttaatt tcattcatca ttttcatggc ttcaactata cagagcacat cacctacttt 76080tatattgtct cctactttta caaatggatc tttttcggga cttgggcttc tataaaaagt 76140tcctactatt ggagaggtaa caacatcccc tatgacagtt tgcttagcct catccaattg 76200atgagttgtt ggtgagatag ctggctcatt tttgacaatg attggttttg aaacaaattc 76260ttgattagag ccgttatctc ttgaaattct tactgactca tctccttgac ttacttctat 76320ttcttttaaa tcagactctt gaagcatttc tataagtttt tttattttcc taatatccat 76380ttatttaccc ctgtatttta atatgccctt aagcatcgcc tcttcataac ccttagcacc 76440aataccagtt atcacttctt cagctatatc agatagatat gagacatgtc taaattcctc 76500tcttgtatat atgtctgaaa ggtgaacctc atagaatggt atgttgacac caagaaatgc 76560atcccttata gctatgctcg tatgagtata tgcagcagga tttattatta caaagtctat 76620tttttcttca atagtgctat gtattgattc tattatttca tgctccgcat tactttgaaa 76680agatattaaa ttgcattgat tttgatttgc taagactaat aaatcttgtt gaatatcttc 76740aagtgatttg ctgccatata cttccttttc tctagttccg agaagattga ggttggggcc 76800atttataagc aatatattca tataatagat tttaacagaa atttacagat tttttaggat 76860tttaaagata tttttataag agttgatttt tttgcatggg atagcaaaaa cccgcttcag 76920agcatccctg atagtatatt aaaacctccg ataagttcaa tgaagaattt atttcaatgc 76980ttataataaa ttcatctcta agaatctcag tctcgccaaa aaattcatcc ctatatagat 77040ctttacttga ctcaagcgtc ttaaatggga gcactttatc tttaaatttg agctctatac 77100tatccaagta catataataa ccatctctta tttgccaagt tatatatgct gcttgttttt 77160caacattggt tgttaaagca aatacttcat tggcttctgg gaccctattt gagttttcaa 77220aaagatttgc tgaattcaat tcacctgcaa tcacattgga tagtagtata attattaaac 77280ttttgatcat gaagacttaa gaataaaccc ctaagtactt aaaataaaga ttttattaaa 77340taacacggta gttaaaaatt tatgttccag actttcgcac taaaaatttt cttttatatg 77400cctatttggc ttttaaaaat aattttttat agaaaaagaa cagtaataag agggcatcaa 77460tttgatgctc aatctgctgc attattgtct ttgcttccaa agaaagattt atctgaatta 77520ttagatggtg aaattgcaaa agctagaatt actcttgaag aagcaagaat tcaaaataaa 77580gtttctttaa caccgtctat acaagtcaga aaagtagatc atattttacc aaagcatgat 77640ttaattctca gagaatacaa gccgcatcaa gaggatttaa aaaaagttat tctctatttt 77700catggaggag gttatgtcct taactcagtc aatacacatg atgatatggt ttcatatatg 77760tcagataaaa taggagtaaa gttttattct ctagactata gactatcgcc tgagagcaaa 77820tatcctgatt ctcttgatga tgctctagat gctttttctt ggctcatcgg ccagggatat 77880ggaccaggtg atatttcagt ttgtggagac agcgcaggcg ctcatctagc tgcttctttg 77940tgccattacc tagctgaaca aaataaagat ttgcctagta gccagctact aatatatcca 78000atgtgtgatc cgtcctgttc atctgagtct tataatttat tatcttcagg atatcttcta 78060actaaaaaaa ctatgatttg gttttgggat aagcttagaa cttccgaagt aaacaataac 78120gattcggcct ttaatctctt aaaatttaat tttgaaaaaa ctttaccgcc aacaattatc 78180gttacaggtg gctttgatcc tctatgcgat gatggagaaa aatatgcata tttattacat 78240aaaggtaaac ataatgtgaa acaattacat tatccaacaa tgtttcatgg gtttgcatca 78300atgactaaat tgaaagcagc gcagatagcc gttgaagatt ttttaaaaga atataagaaa 78360atactatgag taaaatttta gaagtaagtg acttaagcat caatttttca acaagggatg 78420gattgtttaa tgcagtggat aatataagtt ttgatataga aaaaaatcaa accttggcct 78480tggttggtga gtcaggttct ggtaagtcgg taactgctat gtcaattctt cagctccttc 78540aaaaaccaca agcatcatat tccaaagagt cttctattaa gtttaatggc gatgagataa 78600taaatgccaa gtatgaaaag ttactttcct tgagaggaaa tattatatct atgatatttc 78660aagagccgat gacctcacta aacccttatc acagagtagg taatcagata actgaatcaa 78720tactacttca ctcaaaaagc tcaaaaaaag atgcaataga tgaagcaaaa aaattaatgg 78780cacttgttga gattgatgat gttgaaagac ggttctatgc ataccctcat gagctttctg 78840gagggcagcg acaaagagtt atgattgcta tggcccttgt taataaacct gagctattga 78900ttgctgacga gccaacaacc gctcttgatg taactatcca agcccagata ttagatctca 78960tgtctaagct taaaaatgaa ctaggcatgt caatactttt tattactcat gatctaggcc 79020tcgttcaaga attttctgac aatgtttgtg ttatgaagaa tggcaagata gttgaacaag 79080gaaatactgt tgaggtattt aacaacccct ctcacgaata tacaaaaaaa cttttagatg 79140cagaacctca gcccaaatta gataaccctg taagtgatga gccaataatt gagatcaatg 79200acttaaatgt ttactactca ataccttcaa ctaatttttt taaaaagaat acttttcatg 79260ctgttaaaaa tacttccttt agtatttata aaaatactac aattggcctg gttggggaat 79320ctggatctgg aaagtcaact ttgggtaagg ctatagcaaa cttagtttct tataagggga 79380atattaaatt tgagggaaga gatatcaact caaattccca aaaagaaaat aaagaattga 79440aaaaaaatgt ccaaattgtt tttcaagatc cttatgggtc attatcacca cgaatgacag 79500tgggggagat agttggtgaa ggtttaggtg ttcactttaa gcttacaaaa aaagaaagag 79560acgaaaggat agataaggtt ctgtcagatg tcggtatcga aatagtagct aagaataaat 79620atccgcatga gttttctgga ggccaaaggc agagaattgc aattgctaga tctttaataa 79680tgaatcctgc ttttatgatt cttgatgagc caacatcagc attagatagg tctattcaaa 79740ttcaggtaat cgatttattg aaagagatac aaaatgaata tgggcttact tatcttttta 79800taagtcatga tttaaaggtt attagatcga tgtcagactt tatttttgtt atgaaaaatg 79860gagaaatcgt agagtcagga ccttctcaca aggtctttga aagcccagag caagactata 79920ctaaaaaatt actatcagct gctttaaagt atgcatctga ataattaaat atatymcmtm 79980tggcaaatag aaartattca aaagagctcg ttgacggtcc taatcaagct gcttctagat 80040caatgcttag aggagtaggt ttcacatctg aggmtttcac aaaaccattt gttgggattg 80100cttccacagg agcaaaagta accccatgca atatgcacat aaatgcactt tcagagatcg 80160ttgagaaatc agttgatagt tcaggaggaa agggtgttct ttttaatact attactgttt 80220ccgatggaat ttctatgggt acacagggta tgaaatattc tcttgtttct cgagaggtaa 80280ttgcagattc aatagaaact gttgtgggat gtcttggtta tgatggagtt attgctgtcg 80340gtggttgtga taaaaatatg cctggatgca ttattggaat ggcaagatta aatagaccat 80400caatatttat atatggtggt tctatcaaac ctagtaaaga aaataccgac tatgttactg 80460tttgtgagaa aactggagag tactcaaaag gcgatcttaa agaatctgaa ttaattcatg 80520tagaaaaaat ttccgtaaaa gggcctggat cttgtggggg aatgtatacg gcaaatacta 80580tggcttctgc gattgaagct ctaggcatga gtcttcctgg aagcagcagt caagatgcaa 80640tttcacacga caaagaagat gattgtttta aggctggcga agcgataatg aatttattag 80700aaaaagatat taagccttca gatattatga ctaagaatgc ttttgagaat gctataacaa 80760tggtaattgc tctaggaggt tcaactaatg cagttctgca tttattggcc atggcgcatt 80820caataggggt tgatttagag ctagatgact ttacaagaat aggaaaaaaa acacccgtta 80880tggcagatct taagcccttt ggttctcatt atatgtctga actcaatgct aatggcggta 80940ttcagccact aatgaaaact ttgcttgaga agggattact acacggcaat tgtcttaccg 81000ttaccggtca gacgcttgct gaaaatcttt ctggaataaa accttacgag cctgatcaag 81060agataattaa atcatttgat aatccaatta aatcaaatag ccatcttaga attctgtatg 81120gcaacttagc gaaagatggt gcagttgcaa aaattacggg taaagaagga acttcctttg 81180aaggaagtgc tcgtgtattt gattcagaag aagaaggggt taaagcaatc ctatctaaat 81240ctataaaagc tggagatgtt gttgtaatta gatatgaagg gccaaaagga ggtccaggca 81300tgagagaaat gctaaaacca acatctgcca taatgggtca aggtcttggc gatcaggtag 81360cttttataac agacggtcgt ttttcaggag gcactcatgg atttgttgtt gggcacatta 81420ctccagaggc tgcagatggc ggcttaattg caataattaa agatggcgac tctatattaa 81480ttgatgcaga ttctgataag ttaattctta atatttctga ggatgaaatt tcaaatagac 81540taagtagatg ggtaaaccca aaaacgcctc ccaaaaaagg agtcttagca aaatttgcaa 81600aaagtgttaa atcagctagt cttggagcgg taacagatta aatatgtatt ttaaaagaaa 81660atttccaaat agtagactaa gaagaatgcg tctcaattca aacctcagag acttgcttgc 81720tgaagttagg ttgtctacaa atgatctaat tcagccatta tttataaaag aaggcttgag 81780tggaaaagag gctattgaaa gcatgcccaa tattaataga tatggacaag attcaatttt 81840ttcagaaata gaagagctac tagagcatga tataaatacg attgccctat ttccagttat 81900taatgattcc aagaaaaaca gcactggaga tgaggctatc aatgcatcta atttgatgtg 81960tgaaactatc agtaacataa agaaacgatt ccctgaaata atcttaatat ctgatgttgc 82020tctagaccca tataccgatc atgggcatga cggtatttta aaaaatgatt atgttgataa 82080tgatgagact ttagctgttt taaggaagca atcactaaca ttagctcagg ctgggacaga 82140cataattgca ccatcagata tgatggatgg aagaataggc tctataagag aggcattgga 82200cgaaatggat tataaaaata caatcctact atcatatgca gcaaaatata attcaaagtt 82260ttatggtcca tttagagatg ctgttaattc agcttcaagt ctaggcaagt cttcaaaatc 82320cagttatcaa atgtcaccaa aaaatataaa tgaagctcta catgaagttg ctatggatat 82380taatgaaggt gcagacatag tcatggttaa gccaggtatg ccttatctag atataatttc 82440aaaagtaaaa gaaaccttca aagtacctac ctttgcatac caggttagtg gtgaatatag 82500catgcttaaa ctggcgattg ataaaggatg gcttgaaagc gatgttatgt tagaatcatt 82560aataagtttt aaaagagcgg gagctgatgc aattctaacg tacgcagcta aagaaatttc 82620caaggagata actaacaaat gagcaatgtg atagaaattc gtgatgaaga aagctttaat 82680agcgacgtct taaattcaga aaaacctgta ttggttgatt tttgggctga gtggtgtgga 82740ccttgtaaac agcttgcacc aactgttgaa acagttgcag cagaaaaatc agaaacatta 82800aaggtttgca aaatggatgt tgattcaaat agagagattg ctgctaaata tggaataagg 82860tcaatccctt cattaattat atttaaaaac ggagagcctg caggagttga agtaggtgct 82920ctaaccaaac aacaattaga ggactttata agtacagtag tttaactttg caaagacttc 82980tttgcatatt gcaagaaata ggattatcat ttcgacttct aaggctgaaa gccactcaaa 83040acaaaaaccc accttttcaa tcataaaaac aacaactaga acggaaataa ctaaatgaac 83100cttactgaaa ttaaaataaa accaataaat gaacttgtag atatagctac tgagcttggg 83160cttgaggatg ttggaaggct gaaaaagcaa gagataatat ttagaatatt taagcataag 83220gcttctgaag gtgttgatat ctatggtggc ggagttcttg agattttaaa tgatggtttt 83280gggtttttgc gatccccaga aggctcttat tgcgctggcg aagatgatat ctatgtttca 83340ccaagccaaa taagaaagtt tagcctcagg aagggagact cagttgctgg gaagataagg 83400acccctaaag ataaagagcg ctattttgca ttaatccaag ttgatactat taatggtgaa 83460gagccaagaa agactaaaaa caagattctt tttgaaaatt taactcctct ttttcccaat 83520gaaagactaa tccttgaaca aggaacaggg tctaatgaag atctttcatc tcgaataatt 83580gatttgattg ctccaatagg aaaaggtcag cgtggactta tcgtttctcc acccaaggct 83640ggtaaaacct taatgcttca aagcatagct cattctatta aaagcaataa tccagaagta 83700gagcttatag ttcttttgat tgatgaaaga cctgaagagg taacagagat gtcaagaact 83760gtaaaaggag aggttgtagc tagtactttt gacgagccac ctactcgaca cgttcaagta 83820gcaaatatgg ttattgaaaa agcaaaaaga cttgtagaac ataagaaaga tgtagttatc 83880ttattagatt ctattactcg tcttggaaga gcatataact cagttcagcc tgcatcagga 83940aagatattga gtggtggagt tgactccaat gctcttgaaa ggccaaaaag gttttttggt 84000gctgctagaa atcttgaaga gggtggaagt ctcactattc ttgctactgc tctagttgaa 84060acaggctcaa agatggatga agttatttat gaggaattca aaggtacggg taatatggag 84120attcaccttg aaagaaaaat agccgaaaaa agaatatatc ctgctattaa tattagaaga 84180tcgggaacaa gaagagaaga tttacttact gctgaagatg aattacaaag aatgtgggtc 84240ttaaggaaaa ttttagacga tatggaagat gctcagtcaa ttcagttcct aatagataga 84300ttgaaatctc ataaaacaaa cgatgagttc tttacttcaa tgaaaggggg taatggcaag 84360aagactagat aaagtttttt gccatatcaa tcatcatctt atcagttggg ctttgcgata 84420cgtgaatttc taattcttta aactcatttt gacatctatc ctttatattt tctgaagcaa 84480caataaatat tttttttctt aaaactgcat cgtccagatt tgttattaag aatttaagag 84540tactaaaatt atagattaaa aaaatttcat tattatctgt aacttttggt atttgctcca 84600gcaaatagat tacttcatag caaactatct catctagact agccttaagt ttttgttgaa 84660gaaaaccatt tgaattttcg ccgcaaaata aaagactctt tcctagaaaa tttttctcta 84720ttaacttaag aattccttca gatgaatggc tttgtggaaa atgagatttt atgccacttt 84780caagaagttt attggaagtt gcgggcccaa cagacaaaaa attgtgtgga agatcatcta 84840aatcgaaaaa agattttaaa atatcaagtc cgtacgaagc agcagcttgg ctagtaaaaa 84900ttaagtttga atatgaatga atatttttaa ttttatcgat agcgattttt gaagggtcga 84960ttgaattaat tttagaaaga taaatatttt taagagctat ttcttgagct tcacataaag 85020atattaggtt tcctgataaa tttttgggcc tcgtattaat tatcattttt taaaattaaa 85080gatttcgctc cttccaaaat aaactcatca gcaaataaca taatatctat tgcaatatct 85140tgaaaggaag atatcttttc tttataaatt ttttcgttgc cttcataaga taatattttc 85200cctgatattt ttatttggcc atttttatct tcacataaaa tagctattgg agacaggcaa 85260gtaccttcca tggcagcaac aaaagatcgc tctgcactgg caagaattaa ttctttaggg 85320tcaccaatat tttctaaaag ctcaataata tcttttttat ctgacagaca ttcaattgct 85380atatatccct gagatgcaga aggtaacatt tcttcaattg agaattcata tgaattttgt 85440tttaaaccaa gcctttttat agcagcctta gcaactacca gcccatcaaa cagaccatca 85500ttgagtttct ttattctagt agctatgttg cctcttactg gaatagtttt tatatcaggt 85560cttaagttat taatttgtgc ttttcttctt ggaccagaag taccaatagt tgaatttcta 85620gcaaattcag aaaatgattt cccatcctta gatagaagca ggtcttctgg agattcccta 85680ctaaaaacac ttattatttt aaactttgga tcaagctttg ctggaacatc ttttaaacta 85740tgcactgcaa tatctgcttc atcagcctct agagaggact caagtgtaga aatgaatagg 85800ccctttccac caatttcatg aagaggttta tctgtttggt ctccttctga tgtcatagga 85860acaagctcaa ccttaatatt attgatctta gctaataact catctgcaac aaattttgcc 85920tgatacattg ctagttctga ttgccttgta gctattctta ttttcatttt gtttccaata 85980aaagagcctt cacatcgcca acactctttt ctttcaagat agtgaattcc tcaggaattt 86040ccaaggtagt aaatttacta tactcaaggt agattttaca agatggcttt atttcattct 86100ttcgaattat agattttaga acttttaatt catattcttc gccaaaagga ggatcaagaa 86160gaattaaatc aaaacgagat aagtcatttt ttttaatcca actaaatgca tccttaaaaa 86220aaactttaga tttatccttg atgccaagaa gttgaatatt ttttgctaac acagaatagt 86280tttttttatt aagttcaacg aacacaactt tttttgactg tctggatatt gcttcaatac 86340caagcgcacc cgttccggca aacaaatcaa gacatattaa attttcaatc tcaaattgaa 86400gccaattaaa aattatttct tttaatttat ttgaagttgg tcttaaggaa tccttaaatt 86460cgaaaggtat ttttttacct tttaaataac ctccagtgat tctgatattg tttttcattt 86520tttttgcaaa atgggtatgc ctatttatta aataaatata attamaaata catttaaact 86580aagttataat tcatccatgm gtcctacaag ataattttag aaaagcaatg aggagctata 86640tttactctgt cagtgtgatg tcaaatgttg acgagaataa aaagtttagt gctataacag 86700tttcttcagt tacctctgtt tctttagatc ctcctagttt gctcgtttgt atcaataaat 86760cagctggaat tcacaactca ataaaagaag ggtcctcttt ttgcataaac cttttaaata 86820aaaatcaaga agatatttct aatctatgca gttcatttaa gaccgaaggt gatagattta 86880atagtggtga ctgggattta agcggcactc cgtttttaaa aagtgcccag gctaatattt 86940tttgtactgt tgatcaatta atttcatacc acacacatac tattgtgatt ggtcatgtca 87000caaactctct tagcgatgag aaaattaata cactgacata tgttgatggt agctatgaat 87060aaattttcaa aaaatgtatt ttttatttta ataatactca actctttttt tcttgcctct 87120aacatttttg ctagccagga agagtgtgaa gaaaagccaa gtgtttttat tatctctcct 87180caagacggtt ttatctcaga atctaataat gtaaaagtct tatttggatc aaaaaatatt 87240gaaataaatc cagctggcaa aggtgagatt gcaaaaaata aatgttttgc aagcgggcat 87300caccatcttt tagttaatat cgaagcattg ccagagagct ttattccttt tgacaagggt 87360tatttacatt ttggaggagg tcaaactgaa acaattcttg atcttgatcc tggaacctat 87420tctctccaac taattcttgg atcttatgtg cataattcaa aaatgcaggt aaataacttt 87480aaaggtcaag gacccttttt atcagaaaaa ataacaatta cagttaatta gagattagac 87540cagttataac tttatctaga tgttcatcta agttattagc aattatattt gcactagaaa 87600taggaccaga attatcgtca taatcatcac caactacatt cacctttctt attatcccaa 87660caagaccatg caacatagac caaagagtta tacatttaaa ggcaatcact tcctcaggct 87720cgtctgctag attagcaaaa ctttttctca tattatcgta tgttccattt gcagatttaa 87780gaagatctgg gtaatcggca aagttaccaa cagctgttcc aaacatcaaa tcatatgtat 87840gtgcattttt taaaccaaac cctatatatt tgcttgcatt agtaacaagt tttttctttg 87900taattttttt tggattctca aaaaaaacaa gctcattaag ttttttaaag ccaacggttg 87960caacagcggc atatacacat tcttttgttt caaaatgcct ataaggagca gtttgagaaa 88020caccactttc ctttgcaagg gatctaatac ttagtttagt gtaaccatct ctatcacaaa 88080gcctgcatgc gcattctata agttcttcct ttaagtttcc gtgatgatag ttattcataa 88140tttaatttta atgttgacac tgctaacata tatcatgtta ttatgtatac accgcataca 88200ttaagttgat tacatattaa agtcaaatac agatatatga acataacaaa aataaataac 88260accgtcatag ctcttttatt aggcaatgcg tttctatcaa atcttgaggc tttagaagtt 88320cttgaggtta aaatgcttga tgaatatgct gtaactagag aatttcctgg aaagctcatt 88380ccatcagacc agtctaagct agcatttgaa atacctggaa agataaactc tattaatgtt 88440gatatcggag atgaggtcat cttaggggat gaacttgcct cattggatga tagagaagct 88500ttagcacaat taaatcaatc aaaggctaaa tttgatttag ctgaacaagt actagcaaga 88560tatataaatc tcagagcaga tgggcatatt tctattcaag atcttgataa ggctgaatct 88620gatcaaatag tagctaagtc gcagtatgat ttttatagag ttaaatttga gcaaactaag 88680ttactagctc cttttaatgg agttattcaa aatagatttc tagatacagg atcagtaatc 88740aatgcaggtg tccaagtttt agaaatttta ggctctagca atgttgaagc aagaatttct 88800attccaatga actatatgaa caaggttaag attggagatg agtatgaatt tgatatcaga 88860ggaatatcta caaaagctac gttagagaga ctggctccca tgtcaaccgg aggctccaat 88920aataggttag caatttttag atttgatacc ttttttaatc caggatcaat agtaaagctc 88980aaactaagca tcactgagaa agcaaaaggt acatgggttc caattaagtc actgtcccag 89040tccgaacaag gtatttgggc catatatacc attaacgagc aacaagtagt tgttagagat 89100cttgttgatg ttatttattt tgaagacgaa tatgcttttg tcagcggaac acttaataat 89160ggtgatttag taattttagg cggagctcaa aaaattattg aaggaaaaat aataaaataa 89220aatgaatgtt attaattttc taatagaaaa gcctaggata ttatttctaa ctttagcatt 89280tatattactt tctggaattt cttctgggct ttcagttcca attcaagaaa accctgaact 89340ggctgagaga tggggaggtg ttcgtatttt tcttcccggg gcatccccag aaagaattga 89400aacagagata gtaaatgatc tagaaatcaa acttagagaa gttgaagaaa tcgatgagct 89460tgaatcaatt attactcaag gtttttcaac aattgtagtt gaattaaatc aaagtgtacc 89520tcctatactt attgaagaga cttggtccaa ggttcaagac aagctcaatc aaatagttat 89580tcctcaaggt gcagaaatat ttcttgatag aagcagtggt ccgcctatca ctgttcagta 89640tgctgtaacc tggaacggca gtggagatgc tccactaata atgatgtcca gactagcaag 89700ccagctaaaa agaaaattaa gctcaatagg ctcatctcat caaactgcaa tttttggtga 89760aacagatgaa gaaattttaa ttgaactaga ttcatcaaag ctatcttcgc ttggattatc 89820atttcaagat atcgcaagtg ctattcaagc cctagatgca aaaaaaccta ttggtgtatc 89880ctcaaacaac aattctgagc ttttatatag actcaaagat aatatacaga gcattcaaaa 89940actctcagaa atacctatca aggttattaa taaatcagag atcatacagc tagatgatgt 90000ggcatttatt tcaaaaatcc cggtttctcc tattgaagac atattcttgt ttaatggaaa 90060tgtagttatt tctgttgctg gaaccggatc attttctcaa agagtccatg attatgtaga 90120acgcgcaaca attgttgtag atgagatgag agaaactctg ccgactgaga tcactataga 90180tttagtttat gacgaatctg cttacacaac taaaaaattt aatgagcttg taaaaagttt 90240ttcattagca atattttttg ttttagcttt aagtcttttt tttcttggaa ttagatcagc 90300aataattgtt

actcttatcc tgccattttc tatttgcctg gttatgattg gttgtaggtt 90360tataggctta cccttgcata tgacatctat cactggaatt attattgcac taggattgct 90420tatagataat gggattattg ttgttgaaga ttataagaat agaagagcat ctggattaaa 90480tatcaatgat tcaatttcac aaggactaaa aaacttatgg gctccattag ctgctgcaac 90540agcaacaacc gtcttctctt ttcttcctat tgttactgga gaagggtcga gtattgaatt 90600cgtaggcggc atggcaatga cagtcattat gtctataaca tcttcattag ttttggcgtt 90660attaatggtt ccagttctga tgagttatat ggaaaaaatt ccgttcttta aggatgtgga 90720tattagcaag gaagggtata gaaatgaaaa aatccttaat aaatataggg cctttttaaa 90780ctgggcgttc ttagttccta gaagagcaat catgatatcg cttgcattgc ctgttctagg 90840attctttctt tttaattctt tacctaaaga tttctttcct gctcaagata gagatatgtt 90900tagagttaat atagaactgc cttctaacgc ctcatcactt actacaatgc agagagttaa 90960ggaaattaga gaagatattc tagatagtga tttaatttca atagaaaaag attattcgtt 91020tatcggcaga atgatgccta gagttttgat gaatgttgtt ggtggagaag aaaaacaagg 91080atccaataat attgcgcagt ctgtattttt tgctactgat tattatgaaa tgattgaaaa 91140ccttccagat ttatcaagaa gactggttaa aaataaccct gacattatag ttaatattga 91200tagtttctcg tctggccccc cggttttttc agatgttagt tatgtaattt ttggagatga 91260tccagattta ctaaaatcac ttggtgagga gctagagcta attattaaca attctcctga 91320tgtgagtctt acgaaatctg caacttcaaa ctcaataacc aatgttgagt ttgaacttaa 91380cagctcaaat atttcactgt ctggtcaaaa tgccaattat cttgtaaatg aaatgtttac 91440tgcaaacaat ggaatatttg ttggcactat gttggattca aacaaagaaa taccagtcag 91500gctgaaaggg ctgtctaata aaaacaatat tacgggaaat actagtttta taacaatgcc 91560ctctcaaggt ggttttgagt attttgatag tttcggaaaa agctcactaa caaacaaatc 91620gtcaacaatt actaggcttg atggccaaag aacaaatgat gttgagggct ggatttggac 91680aggtacgctc ccgtctgcta ctgaaaaatc tattaaaaaa gatgttaaag attttgaatt 91740aagattgcca ataggctatt cattaaaaca acttggcgag gctgaaagca ggggccaatc 91800tcaagcctca ttatactctt cagcttttat gtatttcatt cttataatag taggcttggt 91860tatggcgctt aattctttca gagaggctgg tctaattttg tctgttgcat tcttatcaat 91920tggactatcc tttcttggtt tatttatagg ccagcaaaat tatggattta ttggaactat 91980aagtgcaata gggttaattg gcttatcaat aaatgattca attattgtct tatctcacat 92040aaaagaagag gctgagaaga aatcactaac caaagctgag cttgttgaag ttgttatcag 92100gtctacacgt catataatca ctacctcttt gacaacactt ggtggttttc ttccacttat 92160ttttgcaagt gtattcttca aaccgcttgc ttgggcaatg agcattggag tattaggcgc 92220gactattaca gccttattat atattcctgc aatgtttatg ataatgagaa aggttaagta 92280ctagaacaac tttccgagca ataatttctt cctagctttt ttacaataag tgattcatga 92340gtataggtct cgcaagaatc acatttaatc atagagttat caatcttttt tttatgaggt 92400cctgatatga gagatctgaa aaagaatacc gcagttatta tcaatataaa tatgaccaat 92460ggaatcaaaa gtaatatcga ttttaataag aacatttttt agtcactaga tgctggctta 92520ctatttctta taaaattcca agaccagcca gacttatctt cagatttttt tgattgctca 92580tcttggtagt taatagaaat tattttttta gtatcttcaa gcaattcaat atatcccaaa 92640gattcatatg aagcctctaa gatcttaaga gctctataat tttcactaga gttaggaata 92700ttttcaatta cataatttgc tcttcttatt gctgctatat gcgcatcaac actaacgtaa 92760taatcagctg ccgcaagctc atttcttgca atcatatttc ttaaatagat atttctttgt 92820ttagcatatg tagaatattg gctatcagga aatcttgtta agaattcagt tagttctgaa 92880aatgattcct tagctcctga gatatctcta tttgatagat cagtatctgt cattcttaca 92940ataaagctat tatctctcgt atagctagaa aggcctttca taaaatatgc ataatcaata 93000tttggatgcc taggatgaag tcttataaat ttttctgctg cagcatgcgc agcctcagtt 93060tcagcattca taaattgagc ataaataagc tctacttgcg cctgttcagc atatttgcca 93120aaaggatatc ttgattcaat tgcttctaat gaatcaatag caccaaaata attttttcca 93180gccatccttc tttgggcttg atcgtaataa attttttcag gctgttctat ttctgggcca 93240tcagaattac aactaaccaa taacagagtt acaattggca atactataaa tagttttaaa 93300attagtttat ctttcattat ttgcacattc tacctgcatt atcgtaattt aagcttgaaa 93360acatctattt agattaattt ttttataata agttcatatg attgttaaaa atgttccaaa 93420tgatctatct tcaatgaggc tagacaaagc tacagcagaa atgtttacag attattccag 93480aactcagata aaaaaatgga tagaagaagg cagagttctt ataaatggag aggtatccca 93540gccaagagat aaagtttatg agaatgatca gattgaatta agccctaaag aagaacaaaa 93600agtatcatgg gaagctcaag atatagattt tgaaattcac tttgaaaatg aagattttat 93660tataattaat aaacctgctg gtttaataat gcatccaggt tctggttgct atgatggaac 93720tctcgcaaat gggctcattt ataaatttcc agaattgatc aatattccaa ggtcgggaat 93780tgttcatcga ttagacaaag atacttctgg cattctgctc gtagcaagaa atgagtcttt 93840taggaacttt tttattaatg aaatgcagga gaggagagtt gtaaaaaaat atacgtctat 93900tgttattggt tctacactag gaagcttttc tatagaagag ccaattggaa gagataaaaa 93960taatagaacc aaaatggcaa ttcgagaaga tggcaaagat gctttaacat ttgtaaagct 94020taaagaaaat attggaaact actctgtgtt ggatataaga atagagacag gaagaactca 94080tcagattagg gtccatctat catcaaaaaa actaccaata attggagata aaacctatga 94140cccaagcagg tctattgcaa gagatacccc tgaagagcta attaatatta tccgaggttt 94200tccaaggcag gcattacatg caacacacct ctcattcaat gaccaaaaaa caaataatat 94260tttttctttt gatattccca ctccaaatga tatggaggaa ctacttctag aattaagaaa 94320attgatctaa tagtaactaa aaacttgttt tttgattaat aaaataatat aaaccttatt 94380cctaagagtt ttttggtaag aaattgaaat tatctggcgc agacatgcta atgcaagcac 94440ttcatgatga aggtgttgag ctaatctttg gctacccagg tggagccgcg cttcatatct 94500atgatgcaat ttttagacaa gataaaatag atcatatttt agtaaggcat gagcaaggtg 94560caacccatgc agcagatgga tattcaaggg cgacaggtaa gccaggagtt gtcttagtca 94620cttctggacc tggtgcaaca aatgctatta caggaatcgc gactgcattt atggattcca 94680taccaatggt agttatttca gggcaggttg ctagccattt aataggtact gatgcttttc 94740aagaaactga tatgattggt gtttcaagac caattgttaa gcatagctat acagttttta 94800atgctgaaga aatacctaag ataattaaag aagcttttta tgtcgcaact tcaggcagac 94860ctggacctgt tgttatagat atcccaaaag acatgacagc tccggataat ctttttgatt 94920actcgtatcc tgaagaagcc aagataagat catacaatcc tccgattgag ccagaaaaaa 94980atcaaataga tagagcagtc gaagctatat tgatatcaaa aaaaccagtt atatatgctg 95040gtggtggggc aattgctagt aatgccgaaa aagaattact tgaacttaat gaaattattg 95100atgctcctgt tacaaatact ttaatgggat tgggtattta ccctgctagt catcatagat 95160ttcttgggat gttagggatg catggaacat atcaggcaaa tatggcaatg cataatgcag 95220acttaataat tgctattggc gccagatttg atgacaggat taccaataaa ccatcaaagt 95280ttgcacctaa tgccaaagtg gttcatctag atgttgatca ctcatctgta tcaaaaatta 95340tagaagcaaa tgtagctgtt tttgggcaag taaaaaattc cttaaaatta ataaaagaaa 95400ctcttgaaaa aaaattagac tcttacgatt ctttcgctct tcagccttgg cacgatcaga 95460taaaagaatg gaaatcacta catggtttaa attatgagct ttataaagat gaatctgatg 95520atcatcccat tttaccccag gctgtagtcc agcatgtcca tgagattaca aatggggaag 95580catatgtgac ttccgatgtt ggtcagcatc aaatgtttgc tgctcaatat tatcattttg 95640ataagcctag aagatggatc aattctggtg gtctaggaac tatgggtttt ggtttgccag 95700cagcaatggg tgtaaaactc gcttttccaa aagatgaggt tgtttgcatt actggtgagg 95760gtagtatcca aatgtgcatc caagaattgt ctacatgtct tcaatataat ctcccaataa 95820aaataattaa tatcaataac gaagctcttg gtatggttaa acaatggcag gatatgaatt 95880atggaggaag gcactctgaa agtacctatc aaaactcgtt accggatttt ataaaactgg 95940ctgaatcata tggtcatata ggaattaaaa ttacaaaaaa ttctgattta agtgaaggct 96000taaaaaaagc ttttgaaatg aaagataaac ttgtctttgt tgatatttat gtagatcctt 96060cagagcatgt ttatccaatg caggttgcaa atggcagcct agaaaatatg tggctatcaa 96120aggatgaaca aacatgatta aaagaaaact aattttaatt atggaaaata aaccaggagc 96180tctagtaaga gtagttggac tgtttcatca aagaggctac aacattgaaa cccttcatgt 96240agatactgtt aaagactttt ctacttacaa atcgatattg aaaaaaaact tgaaaccaaa 96300tttgaggata atcaaatatc tagactgacc atagaaacaa tggtttcaga tgaccttatg 96360aggcagattt tgagacagct caataaatta atagatgtta tagctgtaag caatgaagag 96420acaacctatt taaaaggagt attattagat gaaaatttat tatgacgaag atgcaaacat 96480agaaattatt aaagggatga atgtctcaat aattggctat ggatctcaag ggaatgctca 96540tgcaaataat cttcatgaat ctggtgtaag tgttactgta ggtttaagag aagggtcttc 96600ttcatgggca aaggcagaag aagcaggctt aaaagttcaa acagttgctg attcggtaat 96660ccaagcagat ttggttatga ttttggcacc agatgaattc caaaaaaata tatatgaaac 96720cgaaatcaag ccaaacttaa aaaccagtgc aattcttgca tttgcacatg gctttaatat 96780tcattttgaa aaaatagttc ctgaagcaac taacagcgta attatgattg caccaaaagg 96840tccaggccat actgttagaa gtacttatac caatggtgga ggcgttccat ctctcatagc 96900tatatatgaa gatgctttaa gtgatgaaga ttattcagca aaagatgtag ctctatctta 96960tgcaaaagca aatggcggca cgagggctgg tgttcttgaa acatctttta aagaagaaac 97020agaaacagat ttatttggcg aacaagcagt tttatgtggc gggcttaccg ctttaattaa 97080agctgggttt gaaactctag ttgaggcagg ctacagtgaa gagatggcat attttgaatg 97140ccttcatgaa acaaaactaa tcacagactt aattcaagaa ggtggcattg ctaatatgca 97200ttactcaata tcaaatactg ctgagtatgg tgattatgtg agtggaccca aagtaattac 97260tagcgatacc aaaaaagcaa tgaaaggaat actagaaaat atacagtcag gaaaattcgc 97320agatgacttc ttgaatgact gtcgacaaag caatgatggc actggcggac ctgttatgaa 97380aagcaataga gaagctacaa aaattcatcc aatagagtcg gtaggggctg agctaaggtc 97440taaaatgaag ttcctaaatt cacaaaaatt ggtggataaa gaaattaatt aaaattaatt 97500aaaaaaaagg tatcttcttc ggaataactt cgttaagata cgcgtccgca caaagaggcg 97560gttgttcttt aaaaatattt ggttactcgt gtgggtgttc aaaatacgag aaaaaataat 97620ttagattttt tataaaaatc aacaaaacat gatattaatt gaagagtttg atcatggctc 97680agattgaacg ctggcggtag gcttaacaca tgcaagtcgt gcgagaaagt atcttcggat 97740atgagtagag cggcggacgg gtgagtaacg cgtaggaatc tacctagtag aaggggatag 97800cccggggaaa cccggattaa taccgtatac ctccttcggg agaaagaagg cctctctttg 97860aagctttcgc tactagatga gcctgcgtaa gattagcttg ttggtgaggt aaaggctcac 97920caaggcgacg atctttagct ggtctgagag gacgatcagc cacattggga ctgagacacg 97980gcccagactc ctacgggagg cagcagtggg gaatattgga caatgggcgc aagcctgatc 98040cagccatacc gcgtgtgtga agaaggcctt cgggttgtaa agcactttaa gcagggagaa 98100aaagttataa gttaatacct tataaccctg atgttacctg cagaataagc accggctaat 98160tccgtgccag cagccgcggt aatacggaag gtgcaagcgt taatcggaat tactgggcgt 98220aaagcgcgcg taggtggttt gttaagttgg atgtgaaagc cctgggctca acctaggaac 98280tgcatccaaa actaactcac tagagtacga tagagggagg tagaattcat agtgtagcgg 98340tggaatgcgt agatattatg aagaatacca gtggcgaagg cggcctcctg gatctgtact 98400gacactgagg tgcgaaagcg tgggtagcga acaggattag ataccctggt agtccacgcc 98460gtaaacgatg acaactagct gttgggagac tatgtctttc agtggcgcag ctaacgcttt 98520aagttgtccg cctggggagt acggccgcaa ggctaaaact caaatgaatt gacggggacc 98580cgcacaagcg gtggagcatg tggtttaatt cgatgcaacg cgaaaaacct tacctactct 98640tgacatactt ggaggctctt gtaatgagag tgtgcctttt ggaaccaaga tacaggtgct 98700gcatggctgt cgtcagctcg tgtcgtgaga tgttccgtta agtcggataa cgagcgcaac 98760ccttaccctt atttgccagc gattcggtcg ggaactataa ggggactgcc ggtgataaac 98820cggaggaagg tgaggacgac gtcaagtcat catggccctt acgagtaggg ctacacacgt 98880gctacaatgg gagatacaga cggacgctaa gccgcgaggt ggtgctaatc ctaaaaagtc 98940tttcgtagtc cggattggag tctgcaactc gactccatga agtcggaatc gctagtaatc 99000gcggatcagc atgccgcggt gaatacgttc tcgggtcttg tacacaccgc ccgtcacacc 99060atggaagtgg attgcaccag aagtagatag tctaacctta gggagggcgt ttaccacggt 99120gtgcttcatg actggggtga agtcgtaaca aggtagccgt aggggaacct gtggctggat 99180cacctcctta acgataaatc gcgttttaaa cgcccacacg agtaatcaaa tattaaaaaa 99240aagaacattt agatatgtaa aatcattggt atgtaatttt ctagtgtata catttatgta 99300tacataagat cactgcaatt aaaaagtaac atatgcattt atgtgtatgt taaaaaagta 99360attaatatat tttattaagt tactctcaaa aatgaagata aaacttcaaa aaaaatatgt 99420aacctttttt aaggttatat gatcaagtaa aggaagagca caaggcggat gccttggcag 99480cataaggcga tgaaggacgt aataacctgc gataagcctc ggggagctgg taaataagct 99540tcgatccgag gatttccgaa tgggaaaacc caatacacat aagtgtatta tcttatactg 99600aatacatagg tataagaggc aaacctaggg aactgaaaca tctaagtacc tagaggaaaa 99660gaaatcaaca gagattccgg tagtagcggc gagcgaaacc ggaccagccc ttaagcttat 99720tttagtccag caaaatattc tggaaagttt agccatagta ggtgatagcc ctgtatgcga 99780aagactaatt taagtgaaat cgagtaggtc gggacacgag aaatcttgac tgaacatggg 99840gggaccatcc tccaaggcta aatactctat gctgaccgat agtgaaccag taccgtgagg 99900gaaaggcgaa aagaaccccg gcgaggggag tgaaatagaa cctgaaacct tgtgcttaca 99960agcagtcgga gcagacttgt tctgtgacgg cgtacctttt gtataatggg tcaacgactt 100020aatttcagta gcaagcttaa ccatttaggg taggcgtagg gaaaccgagt cttaataggg 100080cgctcagttg ctggaattag acccgaaacc gggtgatcta tccatggcca gtgtgaaggt 100140cgagtaacat cgactggagg cgcgaaccca cttatgttga aaaatgaggg gatgagctgt 100200ggataggagt gaaaggctaa tcaaacccgg agatagctgg ttctcttcga aaactattta 100260ggtagtgcct cgtgtattac tgtagggggt agagcactgt ttcggctagg gggtcatccc 100320gacttaccaa accgatgcaa actccgaata cctacaagta tgagcacggg agacagactg 100380cgggtgctaa cgtccgtagt cgagagggaa acaacccaga ctgtcagcta aggtcccaaa 100440ttatgattaa gtgggaaaca atgtgggaag gcacaaacag ctaggaggtt ggcttagaag 100500cagccatcct ttaaagaaag cgtaatagct cactagtcga gtcggcctgc gtggaagata 100560taacggggct aaatcataaa ccgaagctac agatcttaaa tttatttaag atggtagaag 100620agcgttctgt aagcggttga aggtaagctg agaggcgaac tggacgtatc agaagtgcga 100680atgttgacat gagtaacgat caaagaggtg aaaaacctct tcgccgaaaa accaagggtt 100740cctgtccaac gctaatcgag gcagggtgag gcggccccta aggcgagggc gaaagccgta 100800gtcgatggga aacaggttaa tattcctgta ctttttataa ctgcgatggg gtgacggaga 100860aggttagact agcacggcga cggttgtccg tgttcaaggt tgtaggctgg tgttctaggt 100920aaatccggaa cgctaaggct gagaactgat aacgaccact ctacgagtgg gaagtagtcg 100980ataccatgct tccaggaaaa acctctaagc ttcaggttat aagaaaccgt accctaaacc 101040gacacaggtg gttaggtcga gtagaccaag gtgtttgaga gaactatggt gaaggaacta 101100ggcaaaatag caccgtaact tcgggagaag gtgcgccgcg tttggtgatg agacttgctc 101160tctaagctga acgtggtcga agataccagg tggctgcgac tgtttactaa aaacatagca 101220ctctgcaaac tcgtaagagg aagtataggg tgtgacgcct gcccggtgcc ggaaggttaa 101280ttgatggggt tagcttatgc gaagctcttg atcgaagccc cggtaaacgg cggccgtaac 101340tataacggtc ctaaggtagc gaaattcctt gtcgggtaag ttccgacctg cacgaatggc 101400gtaacgatgg ccacactgtc tccaccatag actcagtgaa attgaaatcg ctgttaagat 101460gcagtgtacc cgcagctaga cggaaagacc ccgtgcacct ttactatagg ttcgcactgg 101520actttgacct tacttgtgta ggataggtgg gagactttga agcagagacg ccagtctttg 101580tggagtcatc cttgaaatac cacccttgta agattgaagt tctaacctag gtccattatc 101640tggatcaggg acagtgcgtg ctgggtagtt tgactggggc ggtctcctcc taaagagtaa 101700cggaggagta cgaaggtatc cttatcacgg tcggacatcg tgaggtaagt ataaaggcag 101760aaggatgctt gactgcgaga tcgacggatc gagcaggtag gaaactaggt cttagtgatc 101820cggtggttct gaatggaagg gccatcgctc aacggataaa aggtacgccg gggataacag 101880gctgataccg cccaagagtt catatcgacg gcggtgtttg gcacctcgat gtcggctcat 101940cacatcctgg ggctggagca ggtcccaagg gtatggctgt tcgccattta aagtggtacg 102000cgagctgggt ttagaacgtc gtgagacagt tcggtcccta tctgctgtgg gcgtttggag 102060atttgaggga agctgattct agtacgagag gaccgaattg gacgaacctc tggtgttccg 102120gttgtcacgc cagtggcatt gccgggtagc tatgttcgga aaggataacc gctgaaagca 102180tataagcggg aagcctctcc caagattaaa tctcccagag actttatgtc tcctaaagag 102240tcgtcataga ctatgacgtt gataggcaag atgtgtaagc gctgcgaggc gttgagctaa 102300cttgtactaa taactcgtga ggcttgatca tgtaacctta agcaaggttc ataatttgag 102360taaaacattg tagtgagaat taaaaaataa aaagttacat accagtttgc ctgatgacaa 102420tagcaacttg gaaccacctg atcccatctc gaactcagaa gtgaaacgag ttaacgccaa 102480tggtagtgca gggtctccct gtgtgagagt aggaaatcgt caggcttttt tctttaaggc 102540ttccagttta ctggaagcct ttttttttat ctcaagtata atacccagat gattattgga 102600ttaacaggag gcattggttc tggcaaatct gccgctgcag acttctttat tgatttaggt 102660atatcagtct tagatgcaga tcaagttgct aaagaagctt tatctacaaa ttctcctgga 102720tatactgatt ttatttctca atttggtgaa gtgtatttaa ataataatcg tgaggttgat 102780aggctgaaat tgcgcgaact tattttttca aatccttcaa aaaaaaagga tcttgagaat 102840attattcatc ccatagttcg gtctgctatt agtaatttta ttattacatc aacatcacca 102900tattctattg ttatggtgcc actcattttt gaaacaaatt catataaaaa ttacgataag 102960attattactg ttgactgtga tttagaactt cagatagtaa gggcctcaag tagagatgct 103020caaaataaat cgcagattaa gaatattatt aataagcaag cctctagaga ggagaggcta 103080agtatttctg atgacgtact tatcaataac agcaccctat ctgatctaaa aaaacaagtt 103140aatgttttac atactaaata tatggagtta ttaaatgagt agttgcccta gatgtgaaaa 103200acctgtcaaa ctttctactg acaatattta tagacctttc tgctctgaga aatgcaaact 103260tatagatttt ggtgattggg ctaatgaaga taataaaatt tcaagaccaa ttcaatctga 103320agatttttac gaagattaaa tttaagaaag tctccattca ccactttcaa tcatcggttt 103380ggctttttta tacttcattt cttgcgtatc ttgaccgtta gtaattttta caagatcatt 103440tctgcctaac ttaggttcgt ttcttgtcac agtacttgtt tttactatag gagtggcttg 103500tttttcattt tgaaagatat cagaattaat ttcctctttt tccaatttta attcttgaga 103560actattttgt ttgtttatgc tctctagctc agattctgtc gatatttgta atgagaaaag 103620tattcttata gtttcaacat ctatttccga aagcatagat tcaaacattg aatatgcctc 103680tcttttgaat tcattttttg gatttttttg agcataagcc cttagaccaa cactatttct 103740taaatgatct atttctgata aatgctcttt ccaatggaca tcaagtactt gcagcataac 103800ctgcttctca agaagcaatc tattttcacc aagatcacta aacttttttg aatatttatt 103860ttttgcttgc agaacaattt cttcggcaat tgtatttggt acaagttttt tattactatt 103920aattttattg gctatatctg tttctagtcc atagctctct tttaaataat catcgagctg 103980cctacttttc cattgagact ctattgattc ttcaggtaca tataaattag atattccttt 104040gaattgctgc tcgatgagtg actcaatggt actgctaata tcttcctctt ctagtaattg 104100attccttaaa gaatatatag cttgtctttg atcatttgat acatcgtcgt actctaaaag 104160atttttcctt gcatcaaagt ttctgctctc aattcttttt tgcgcatttt ctattcctct 104220tgaaagcatt ttagcttcaa tatgatcatc tcccatgccc agcctttcaa aagtagccct 104280tcttccatcc gaaataaaaa gtctcaagag atcgtcttct aaagataaga agaattttga 104340ataacctgga tcaccttgtc tgcctgatct cccccttaac tgattatcta ttcttcttga 104400ttcatgcctt tctgttccaa gtatatgaag cccaccagat tctataactt ttttattatt 104460ttctttccac tctatatcgc tttggtcttc tttttttccg ccaagaacta tatccgtccc 104520ccttcctgcc atattcgtag caatagttac cattccaggt ttacctgcat tggcaattat 104580ctctgcttct ttttcgtgat gcttggcatt taaaatttga tgaggtattt ttttgttatt 104640taaatatgct gatacttctt ctgaagattc aaccgaaact gttccaacaa gaatgggaga 104700agattttttt cttaattgtt caatttcttc aattagagct ttatatttcg attctgttgt 104760aagaaatact aagtcattaa gatcagctct agccatagga acatttgttg ggatgatgat 104820gacatttagg ccatagattt gactaaactc tactgcttca gtatctgctg ttcctgtcat 104880cccagaaagt tttttaaata atctaaaaaa gttttggaat gtggtggatg ctagtgtttg 104940agactctctt tggatagcaa cattttcttt gcattccagt gcctggtgaa caccttcact 105000cattcttctt ccgggcattg ttctacctgt atgctcatca atcaaaagaa cctcaccgtt 105060cctaaccaaa taatccacat tctttttaaa taagaagctt gctctaagtg ttgcttgaac 105120aaatttcata atttttaaat tagaaacaga gtaagcccat ctgaggctcc aagccgattc 105180cagc 105184229DNAArtificialsynthetic construct 2accatgggta aattattact gatattagg

29324DNAArtificialsynthetic construct 3agcattagaa gattctttaa cagc 244750DNAUnknownnaturally occuring gamma protobacterium 4atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca act aat gtt gct gga tca tta ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa tta tgg gct gga gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190att atc atc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 2505250PRTUnknownNaturally Occuring gamma protobaceterium 5Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 2506747DNAUnknownNaturally occuring gamma protobacterium 6atg aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca ttt 48Met Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr Phe1 5 10 15gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt tct 96Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val Ser 20 25 30ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc ttt 144Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe Phe 35 40 45gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act gta 192Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr Val 50 55 60tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg aga 240Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met Arg65 70 75 80ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac att 288Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr Ile 85 90 95gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta att 336Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu Ile 100 105 110ctt gct gct gca act aat gtt gct gga tca tta ttt aag aaa tta cta 384Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu Leu 115 120 125gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca gga 432Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala Gly 130 135 140atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg gta 480Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp Val145 150 155 160tac atg att tat gaa tta tgg gct gga gaa gga aaa tct gca tgt aat 528Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys Asn 165 170 175act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat att 576Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr Ile 180 185 190atc atc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt tac 624Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly Tyr 195 200 205ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat aac 672Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr Asn 210 215 220ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg aat 720Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp Asn225 230 235 240gtt gct gtt aaa gaa tct tct aat gct 747Val Ala Val Lys Glu Ser Ser Asn Ala 2457249PRTUnknownNaturally Occuring gamma protobaceterium 7Met Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr Phe1 5 10 15Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val Ser 20 25 30Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe Phe 35 40 45Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr Val 50 55 60Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met Arg65 70 75 80Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr Ile 85 90 95Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu Ile 100 105 110Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu Leu 115 120 125Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala Gly 130 135 140Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp Val145 150 155 160Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys Asn 165 170 175Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr Ile 180 185 190Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly Tyr 195 200 205Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr Asn 210 215 220Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp Asn225 230 235 240Val Ala Val Lys Glu Ser Ser Asn Ala 2458750DNAUnknownNaturally occuring gamma protobacterium 8atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt act gct gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tcg ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gag act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta ttg ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca aca aat gtt gct gct ggc ctg ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tgg gct gga gaa ggc aag gct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 165 170 175aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190ata atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220gac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asp Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 2509250PRTUnknownNaturally Occuring gamma protobaceterium 9Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asp Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25010750DNAUnknownnaturally occuring gamma protobacterium 10atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gct act gct gct tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Ala Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gct act aat gtt gct gga tca tta ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tgg gct gga gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190att atc atc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gat ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25011250PRTUnknownNaturally Occuring gamma protobaceterium 11Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25

30Ser Phe Trp Leu Ala Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25012750DNAUnknownnaturally occuring gamma protobacterium 12atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga gga gta tgg att gaa act ggt gat tcg cct act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca act aat gtt gcc ggc tca tta ttt aag aaa ctt 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga att atg gca gct tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct 576Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180 185 190atc ata gtc ttc ggt tgg gca att tat cct ata ggt tat ttc aca ggt 624Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Ile Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25013250PRTUnknownNaturally Occuring gamma protobaceterium 13Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180 185 190Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Ile Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25014750DNAUnknownnaturally occuring gamma protobacterium 14atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt act gct gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tcg ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gag acc ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta ttg ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca aca aat gtt gct gct ggc ctg ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tgg gct gga gaa ggc aag gct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 165 170 175aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190ata atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25015250PRTUnknownNaturally Occuring gamma protobaceterium 15Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25016753DNAUnknownnaturally occuring gamma protobacterium 16atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt gct 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Ala 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gcc gac ctt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25017251PRTUnknownNaturally Occuring gamma protobaceterium 17Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Ala 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25018753DNAUnknownnaturally occuring gamma protobacterium 18atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gta ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tac atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140ggt tta gct cct gta tta cct gct

ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta cat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg aag 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys 180 185 190att att gtt att gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg agt ggt gac ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25019251PRTUnknownNaturally Occuring gamma protobaceterium 19Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys 180 185 190Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25020753DNAUnknownnaturally occuring gamma protobacterium 20atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa gtg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Val Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25021251PRTUnknownNaturally Occuring gamma protobaceterium 21Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Val Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25022753DNAUnknownnaturally occuring gamma protobacterium 22atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca aat gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140gga ttg gct cct gta tgg cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg gtg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val 180 185 190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gcc gac ctt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25023251PRTUnknownNaturally Occuring gamma protobaceterium 23Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val 180 185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25024750DNAUnknownnaturally occuring gamma protobacterium 24atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt act gct gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gag act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta ttg ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca aca aat gtt gct gct ggc ctg ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga att atg aac gct tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Asn Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct 576Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180 185 190atc ata gtc ttc ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25025250PRTUnknownNaturally Occuring gamma protobaceterium 25Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Asn Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180 185 190Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25026750DNAUnknownnaturally occuring gamma protobacterium 26atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile

Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gct act aat gtt gct gct ggc ctg ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gag ctt tgg ctt gga gaa gga aaa gct gcg tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Leu Gly Glu Gly Lys Ala Ala Cys 165 170 175aat aca gca agt cct gct gtt cag tca gct tac aac aca atg atg atg 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Met 180 185 190atc atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt gga tca gca ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25027250PRTUnknownNaturally Occuring gamma protobaceterium 27Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Leu Gly Glu Gly Lys Ala Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Met 180 185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25028750DNAUnknownnaturally occuring gamma protobacterium 28atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca act aat gtt gct gct ggc ctg ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tgg gct gga gaa ggc aag gct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 165 170 175aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190ata atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25029250PRTUnknownNaturally Occuring gamma protobaceterium 29Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25030750DNAUnknownnaturally occuring gamma protobacterium 30atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gct act aat gtt gct gga tca tta ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140caa att atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480Gln Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct 576Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180 185 190atc ata gtc ttc ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt ggg tca gct ctt aac tta aac ctt att tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ctt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Leu Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25031250PRTUnknownNaturally Occuring gamma protobaceterium 31Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gln Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180 185 190Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Leu Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25032750DNAUnknownnaturally occuring gamma protobacterium 32atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca gct aat gtt gct gga tca tta ttt aag aaa tta 384Ile Leu Ala Ala Ala Ala Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa tta tgg gct gga gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct gct gtg caa tca gcc tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190att atc atc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac ttg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25033250PRTUnknownNaturally Occuring gamma protobaceterium 33Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala

Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Ala Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25034750DNAUnknownnaturally occuring gamma protobacterium 34atg ggt aaa tta tta ctg ata ata ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Ile Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca act aat gtt gcc ggc tca tta ttt aag aaa ctt 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga att atg gca gct tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tat atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat aca gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190att atc gtc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25035250PRTUnknownNaturally Occurring gamma proteobacterium 35Met Gly Lys Leu Leu Leu Ile Ile Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25036750DNAUnknownnaturally occuring gamma protobacterium 36atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt act gct gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tcg ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gag act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta ttg ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca aca aat gtt gct gct ggc ctg ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125ttg gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gag gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga att atg aac gct tgg ggt gca ttc gtt att ggg tgt tta gct tgg 480Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tgg gct gga gaa ggc aag gct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 165 170 175aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190ata atc atc ttt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag aat cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Asn Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25037250PRTUnknownNaturally Occuring gamma protobaceterium 37Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Ala Gly Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Asn Ala Trp Gly Ala Phe Val Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ala Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Asn Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25038750DNAUnknownnaturally occuring gamma protobacterium 38atg ggt aaa tta tta cgg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Arg Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tat atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca act aat gtt gct gga tca tta ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tgg gct gga gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190atc atc atc gtt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac ctg atg ggt gac ggt gga tca gct ctt aac tta aac ctt atc tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25039250PRTUnknownNaturally Occuring gamma protobaceterium 39Met Gly Lys Leu Leu Arg Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25040750DNAUnknownnaturally occuring gamma protobacterium 40atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt aca gct gct cta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca act aat gtt gcc ggc tca tta ttt aag aaa ctt 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga att atg gca gct tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt

528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct 576Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180 185 190atc ata gtc ttc ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25041250PRTUnknownNaturally Occuring gamma protobaceterium 41Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180 185 190Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25042750DNAUnknownnaturally occuring gamma protobacterium 42atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tac atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct cta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gct act aat gtt gcc ggc tca tta ttt aag aaa ctt 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga att atg gca gct tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tat gct gga gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct tcg gtt caa tca gct tac aac aca atg atg gct 576Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180 185 190atc ata gtc ttc ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt gga tca gct ctt aac tta aac ctt att tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gct gct gtt aaa gaa tct tct aat gct 750Asn Ala Ala Val Lys Glu Ser Ser Asn Ala 245 25043250PRTUnknownNaturally Occuring gamma protobaceterium 43Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Tyr Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ser Val Gln Ser Ala Tyr Asn Thr Met Met Ala 180 185 190Ile Ile Val Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Ala Ala Val Lys Glu Ser Ser Asn Ala 245 25044750DNAUnknownnaturally occuring gamma protobacterium 44atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggt ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt act gct gct tta tta gca tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tat atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga ggg gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95ata gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gcc gct gca act aat gtt gct gga tca tta ttt aag aaa tta 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125ctt gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga atc atg gct gca tgg cct gca ttc att att ggg tgt tta gct tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160gta tac atg att tat gaa cta tgg gct gga gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct gct gtg caa tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190atc atc atc ttt ggt tgg gcg att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac ctt atg ggt gac ggt gga tca gca ctt aac tta aac ctt att tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25045250PRTUnknownNaturally Occuring gamma protobaceterium 45Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Ile Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Ile Phe Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25046750DNAUnknownnaturally occuring gamma protobacterium 46atg ggt aaa tta tta ctg ata tta ggt agt gtt att gca ctt cct aca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15ttt gct gca ggt ggc ggt gac ctt gat gct agt gat tac act ggt gtt 96Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30tct ttt tgg tta gtt aca gct gct cta tta gcg tct act gta ttt ttc 144Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45ttt gtt gaa aga gat aga gtt tct gca aaa tgg aaa aca tca tta act 192Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt ctt gtt act ggt att gct ttc tgg cat tac atg tat atg 240Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80aga gga gta tgg att gaa act ggt gat tcg cca act gta ttt aga tac 288Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta cta aca gtt cct tta tta ata tgt gaa ttc tac tta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110att ctt gct gct gca act aat gtt gcc ggc tca tta ttt aag aaa ctt 384Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125cta gtt ggt tct ctt gtt atg ctt gtg ttt ggt tac atg ggt gaa gca 432Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140gga ata atg gcg gct tgg cct gca ttc atc gtt gga tgt tta gca tgg 480Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Val Gly Cys Leu Ala Trp145 150 155 160gta tat atg att tat gaa cta tgg gct ggt gaa gga aaa tct gca tgt 528Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175aat act gca agt cct gct gta cag tca gct tac aac aca atg atg tat 576Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190atc atc atc gtt ggt tgg gca att tat cct gta ggt tat ttc aca ggt 624Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205tac cta atg ggt gac ggt gga tca gct ctt aat cta aac ctt att tat 672Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220aac ctt gct gac ttt gtt aac aag att cta ttt ggt tta att ata tgg 720Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240aat gtt gct gtt aaa gaa tct tct aat gct 750Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25047250PRTUnknownNaturally Occuring gamma protobaceterium 47Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Val Ile Ala Leu Pro Thr1 5 10 15Phe Ala Ala Gly Gly Gly Asp Leu Asp Ala Ser Asp Tyr Thr Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Ala Leu Leu Ala Ser Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Arg Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Val Thr Gly Ile Ala Phe Trp His Tyr Met Tyr Met65 70 75 80Arg Gly Val Trp Ile Glu Thr Gly Asp Ser Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Leu Ile Cys Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Ala Thr Asn Val Ala Gly Ser Leu Phe Lys Lys Leu 115 120 125Leu Val Gly Ser Leu Val Met Leu Val Phe Gly Tyr Met Gly Glu Ala 130 135 140Gly Ile Met Ala Ala Trp Pro Ala Phe Ile Val Gly Cys Leu Ala Trp145 150 155 160Val Tyr Met Ile Tyr Glu Leu Trp Ala Gly Glu Gly Lys Ser Ala Cys 165 170 175Asn Thr Ala Ser Pro Ala Val Gln Ser Ala Tyr Asn Thr Met Met Tyr 180 185 190Ile Ile Ile Val Gly Trp Ala Ile Tyr Pro Val Gly Tyr Phe Thr Gly 195 200 205Tyr Leu Met Gly Asp Gly Gly Ser Ala Leu Asn Leu Asn Leu Ile Tyr 210 215 220Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile Trp225 230 235 240Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25048753DNAUnknownnaturally occuring gamma protobacterium 48atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act

gta ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tac atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg cta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140ggt tta gct cct gta tta cct gct ttc att ctt ggt atg gct ggt tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Leu Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta cat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gct tac aat gca atg atg aag 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys 180 185 190att att gtt att gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg agt ggt gac ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25049251PRTUnknownNaturally Occuring gamma protobaceterium 49Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Leu Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys 180 185 190Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25050753DNAUnknownnaturally occuring gamma protobacterium 50atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct gag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Glu Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gat act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25051251PRTUnknownNaturally Occuring gamma protobaceterium 51Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Glu Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25052753DNAUnknownnaturally occuring gamma protobacterium 52atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gcc ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca aat gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140gga tta gct cct gta tgg cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg ggt ggc gaa ggt gta tac gct tca aac cta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25053251PRTUnknownNaturally Occuring gamma protobaceterium 53Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25054753DNAUnknownnaturally occuring gamma protobacterium 54atg ggt aaa tta tta ctg ata tta ggt agt gct att gcg ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt acg gct ggt atg tta gcg gca act gta ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tac atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gcc gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca ttg gta atg tta ggt gct gga tct gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Ser Ala Gly Glu Ala 130 135 140gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctc ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25055251PRTUnknownNaturally Occuring gamma protobaceterium 55Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr

Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Ser Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25056753DNAUnknownnaturally occuring gamma protobacterium 56atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca aat gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140gga tta gct cct gta tgg cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg gtg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val 180 185 190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg ggt ggc gaa ggt gta tac gct tca aac cta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25057251PRTUnknownNaturally Occuring gamma protobaceterium 57Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Asn Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Trp Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Val 180 185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25058753DNAUnknownnaturally occuring gamma protobacterium 58atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ctt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25059251PRTUnknownNaturally Occuring gamma protobaceterium 59Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Leu Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25060753DNAUnknownnaturally occuring gamma protobacterium 60atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gac act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa gtg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Val Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta tat atg ggt gaa ggc aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac cct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Pro Ala Tyr Asn Ala Met Met Met 180 185 190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg ggt ggc gaa ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25061251PRTUnknownNaturally Occuring gamma protobaceterium 61Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Val Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Pro Ala Tyr Asn Ala Met Met Met 180 185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25062753DNAUnknownnaturally occuring gamma protobacterium 62atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gta ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tac atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gat act ggt gat aca cca aca gta ttt aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140ggt tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160tta tac atg att tat gag cta cat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg aag

576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys 180 185 190att att gtt att gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg agt ggt gac ggt gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25063251PRTUnknownNaturally Occuring gamma protobaceterium 63Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu His Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Lys 180 185 190Ile Ile Val Ile Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Ser Gly Asp Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25064753DNAUnknownnaturally occuring gamma protobacterium 64atg ggt aaa tta tta ctg ata tta ggt agt gct att gca ctt cca tca 48Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15ttt gct gct gct ggt ggc gat cta gat ata agt gat act gtt ggt gtt 96Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30tca ttc tgg ctg gtt aca gct ggt atg tta gcg gca act gtg ttc ttt 144Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45ttt gta gaa aga gac caa gtc agc gct aag tgg aaa act tca ctt act 192Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60gta tct ggt tta att act ggt ata gct ttt tgg cat tat ctc tat atg 240Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80aga ggt gtt tgg ata gat act ggt gat acc cca aca gta ttc aga tat 288Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95att gat tgg tta tta act gtt cca tta caa atg gtt gag ttc tat cta 336Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110att ctt gct gct tgt aca agt gtt gct gct tca tta ttt aag aag ctt 384Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125cta gct ggt tca tta gta atg tta ggt gct gga ttt gca ggc gaa gct 432Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140gga tta gct cct gta tta cct gct ttc att att ggt atg gct gga tgg 480Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160cta tac atg att tat gag cta tat atg ggt gaa ggt aag gct gct gta 528Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175agt act gca agt cct gct gtt aac tct gca tac aac gca atg atg atg 576Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190att att gtt gtt gga tgg gca att tat cct gct gga tat gct gct ggt 624Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205tac cta atg ggt ggc gaa ggc gta tac gct tca aac tta aac ctt ata 672Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220tat aac ctt gct gac ttt gtt aac aag att cta ttt ggt ttg atc att 720Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240tgg aat gtt gct gtt aaa gaa tct tct aat gct 753Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 25065251PRTUnknownNaturally Occuring gamma protobaceterium 65Met Gly Lys Leu Leu Leu Ile Leu Gly Ser Ala Ile Ala Leu Pro Ser1 5 10 15Phe Ala Ala Ala Gly Gly Asp Leu Asp Ile Ser Asp Thr Val Gly Val 20 25 30Ser Phe Trp Leu Val Thr Ala Gly Met Leu Ala Ala Thr Val Phe Phe 35 40 45Phe Val Glu Arg Asp Gln Val Ser Ala Lys Trp Lys Thr Ser Leu Thr 50 55 60Val Ser Gly Leu Ile Thr Gly Ile Ala Phe Trp His Tyr Leu Tyr Met65 70 75 80Arg Gly Val Trp Ile Asp Thr Gly Asp Thr Pro Thr Val Phe Arg Tyr 85 90 95Ile Asp Trp Leu Leu Thr Val Pro Leu Gln Met Val Glu Phe Tyr Leu 100 105 110Ile Leu Ala Ala Cys Thr Ser Val Ala Ala Ser Leu Phe Lys Lys Leu 115 120 125Leu Ala Gly Ser Leu Val Met Leu Gly Ala Gly Phe Ala Gly Glu Ala 130 135 140Gly Leu Ala Pro Val Leu Pro Ala Phe Ile Ile Gly Met Ala Gly Trp145 150 155 160Leu Tyr Met Ile Tyr Glu Leu Tyr Met Gly Glu Gly Lys Ala Ala Val 165 170 175Ser Thr Ala Ser Pro Ala Val Asn Ser Ala Tyr Asn Ala Met Met Met 180 185 190Ile Ile Val Val Gly Trp Ala Ile Tyr Pro Ala Gly Tyr Ala Ala Gly 195 200 205Tyr Leu Met Gly Gly Glu Gly Val Tyr Ala Ser Asn Leu Asn Leu Ile 210 215 220Tyr Asn Leu Ala Asp Phe Val Asn Lys Ile Leu Phe Gly Leu Ile Ile225 230 235 240Trp Asn Val Ala Val Lys Glu Ser Ser Asn Ala 245 250



Patent applications in class DNA or RNA fragments or modified forms thereof (e.g., genes, etc.)

Patent applications in all subclasses DNA or RNA fragments or modified forms thereof (e.g., genes, etc.)


User Contributions:

Comment about this patent or add new information about this topic:

CAPTCHA
Images included with this patent application:
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and imageLight-driven energy generation using proteorhodopsin diagram and image
Light-driven energy generation using proteorhodopsin diagram and image
Similar patent applications:
DateTitle
2010-09-23Ab initio generation of single copy genomic probes
2010-02-04Trioxyethylene gold nanoclusters functionalized with a single dna
2011-09-29Helicobacter pylori bacterium proliferation inhibitor
2011-12-01Metal enhanced fluorescence-based sensing methods
2012-07-05Microprocessing in heterogeneous reaction mixtures
New patent applications in this class:
DateTitle
2022-05-05Methods for purification of messenger rna
2019-05-16Novel minimal utr sequences
2019-05-16Dna gridiron compositions and methods
2019-05-16Methods and compositions for nucleoside triphosphate and ribonucleic acid production
2019-05-16Trityl-mono-galnac compounds and their use
Top Inventors for class "Organic compounds -- part of the class 532-570 series"
RankInventor's name
1William Marshall
2Anastasia Khvorova
3Eric E. Swayze
4Devin Leake
5Stephen Scaringe
Website © 2025 Advameg, Inc.