Patents - stay tuned to the technology

Inventors list

Assignees list

Classification tree browser

Top 100 Inventors

Top 100 Assignees

Patent application title: GENETICALLY ENGINEERED BACTERIUM FOR THE PRODUCTION OF 3-HYDROXYBUTYRATE

Inventors:  Michael Koepke (Skokie, IL, US)  Michael Koepke (Skokie, IL, US)  Rasmus Overgaard Jensen (Skokie, IL, US)  James Bruce Yarnton Haycock Behrendorff (Copenhagen, DK)  Ryan Edward Hill (Stockholm, SE)  Darmawi Juminaga (Skokie, IL, US)  Alexander Paul Mueller (Skokie, IL, US)
IPC8 Class: AC12P742FI
USPC Class: 1 1
Class name:
Publication date: 2018-04-19
Patent application number: 20180105847



Abstract:

The invention relates to a genetically engineered bacterium having an enzyme that converts acetyl-CoA to acetoacetyl-CoA, an enzyme that converts acetoacetyl-CoA to 3-hydroxybutyryl-CoA, and an enzyme that converts 3-hydroxybutyryl-CoA to 3-hydroxybutyrate. The bacterium may also have enzymes to produce other downstream products, such as 3-hydroxybutyryaldehyde, and 1,3-butanediol. Typically, the bacterium is capable of producing these products from a gaseous substrate, such as syngas or an industrial waste gas.

Claims:

1. A genetically engineered C1-fixing bacterium comprising: (a) an enzyme that converts acetyl-CoA to acetoacetyl-CoA, (b) an enzyme that converts acetoacetyl-CoA to 3-hydroxybutyryl-CoA, and (c) an enzyme that converts 3-hydroxybutyryl-CoA to 3-hydroxybutyrate, wherein at least one of the enzymes is exogenous to the bacterium.

2. The bacterium of claim 1, wherein the enzyme that converts acetyl-CoA to acetoacetyl-CoA is thiolase (EC 2.3.1.9).

3. The bacterium of claim 1, wherein the enzyme that converts acetoacetyl-CoA to 3-hydroxybutyryl-CoA is 3-hydroxybutyryl-CoA dehydrogenase (EC 1.1.1.157) or acetoacetyl-CoA reductase (EC 4.2.1.36).

4. The bacterium of claim 1, wherein the enzyme that converts 3-hydroxybutyryl-CoA to 3-hydroxybutyrate is thioesterase (EC 3.1.2.20), phosphate butyryltransferase (EC 2.3.1.19) and butyrate kinase (EC 2.7.2.7), or CoA-transferase (EC 2.8.3.9).

5. The bacterium of claim 1, wherein the enzyme that converts 3-hydroxybutyryl-CoA to 3-hydroxybutyrate is stereospecific.

6. The bacterium of claim 1, wherein the 3-hydroxybutyrate is (R)-3-hydroxybutyrate, (S)-3-hydroxybutyrate, or a combination thereof.

7. The bacterium of claim 1, wherein the bacterium further comprises an isomerase that interconverts (R)-3-hydroxybutyrate and (S)-3-hydroxybutyrate.

8. The bacterium of claim 1, wherein the bacterium further comprises an enzyme that converts 3 -hydroxybutyrate to 3-hydroxybutyryaldehyde.

9. The bacterium of claim 8, wherein the enzyme that converts 3-hydroxybutyrate to 3-hydroxybutyryaldehyde is aldehyde:ferredoxin oxidoreductase (EC 1.2.7.5).

10. The bacterium of claim 8, wherein the bacterium further comprises an enzyme that converts 3-hydroxybutyryaldehyde to 1,3-butanediol.

11. The bacterium of claim 10, wherein the enzyme that converts 3-hydroxybutyryaldehyde to 1,3-butanediol is alcohol dehydrogenase (EC 1.1.1.1. or 1.1.1.2.).

12. The bacterium of claim 1, wherein the bacterium is derived from a parental bacterium selected from the group consisting of Acetobacterium woodii, Alkalibaculum bacchii, Blautia product, Butyribacterium methylotrophicum, Clostridium aceticum, Clostridium autoethanogenum, Clostridium carboxidivorans, Clostridium coskatii, Clostridium drakei, Clostridium formicoaceticum, Clostridium ljungdahlii, Clostridium magnum, Clostridium ragsdalei, Clostridium scatologenes, Eubacterium limosum, Moorella thermautotrophica, Moorella thermoacetica, Oxobacter pfennigii, Sporomusa ovata, Sporomusa silvacetica, Sporomusa sphaeroides, and Thermoanaerobacter kiuvi.

13. The bacterium of claim 1, wherein the bacterium further comprises exogenous or endogenous aldehyde:ferredoxin oxidoreductase (AOR).

14. The bacterium of claim 1, wherein the bacterium further comprises a disruptive mutation in a phosphotransacetylase (Pta) and an acetate kinase (Ack).

15. The bacterium of claim 1, wherein the bacterium further comprises a disruptive mutation in a thioesterase.

16. A method of producing 3-hydroxybutyrate comprising culturing the bacterium of claim 1 in the presence of a substrate, whereby the bacterium produces 3-hydroxybutyrate.

17. The method of claim 16, wherein the substrate is a gaseous substrate comprising one or more of CO, CO.sub.2, and H.sub.2.

18. The method of claim 16, wherein the substrate comprises syngas or industrial waste gas.

19. A method of producing 3-hydroxybutyryaldehyde comprising culturing the bacterium of claim 8 in the presence of a substrate, whereby the bacterium produces 3-hydroxybutyryaldehyde.

20. A method of producing 1,3-butanediol comprising culturing the bacterium of claim 10 in the presence of a substrate, whereby the bacterium produces 1,3-butanediol.

Description:

CROSS REFERENCE TO RELATED APPLICATIONS

[0001] This application is a continuation of U.S. patent application Ser. No. 15/293,191 filed Oct. 13, 2016, which claims the benefit of U.S. Provisional Patent Application No. 62/240,850 filed Oct. 13, 2015, the entireties of which are incorporated herein by reference.

BACKGROUND OF THE INVENTION

[0002] With recent advances in fermentation and metabolic engineering, fermentation routes to various products have been identified and developed (Clomburg, Appl Microbiol Biotechnol, 86: 419-434, 2010; Peralta-Yahya, Biotechnol J, 5: 147-162, 2010; Cho, Biotechnol Adv, pii: S0734-9750(14)00181-5, 2014. However, all of these fermentation routes are energy (ATP)-consuming or, at best, energy (ATP)-neutral, which restricts product yield in energy-limited systems and uncouples product production from microorganism growth. The present invention provides energy (ATP)-generating pathways that overcome these limitations by providing novel fermentation routes and pathways to a variety of products, including acids, alkenes, aldehydes, alcohols, and diols. These pathways are directly coupled to microorganism growth and offer high product yields.

[0003] In particular, the invention relates to fermentation pathways involving Ptb-Buk. Phosphate butyryltransferase (Ptb) (EC 2.3.1.19) natively catalyzes the reaction of butanoyl-CoA and phosphate to form CoA and butanoyl phosphate. Butyrate kinase (Buk) (EC 2.7.2.7) natively catalyzes the reaction of butanoyl phosphate and ADP to form butyrate (butanoate) and ATP. Accordingly, these enzymes together (Ptb-Buk) natively catalyze the conversion of butanoyl-CoA to butyrate and generate one ATP via substrate level phosphorylation (SLP).

[0004] The inventors have discovered that Ptb is promiscuous and is capable of accepting a variety of acyl-CoAs and enoyl-CoAs as substrates, such that Ptb-Buk may be used to convert a number of acyl-CoAs and enoyl-CoAs to their corresponding acids or alkenates, respectively, while simultaneously generating ATP via substrate level phosphorylation.

[0005] Furthermore, in combination with an aldehyde ferredoxin oxidoreductase (AOR) and an alcohol dehydrogenase, acids formed via the Ptb-Buk system can be further converted to their respective aldehydes, alcohols, or diols. AOR (EC 1.2.7.5) catalyzes the reaction of an acid and reduced ferredoxin (which can, for example, be generated from oxidation of CO or hydrogen) to form an aldehyde and oxidized ferredoxin. Alcohol dehydrogenase (EC 1.1.1.1 and EC 1.1.1.2) can convert an aldehyde and NAD(P)H to an alcohol and NAD(P).

[0006] Introduction of Ptb-Buk and/or AOR into a heterologous species, therefore, provides a novel, alternate route to the formation of native and non-native products, such as as acids, alkenes, ketones, aldehydes, alcohols, and diols at high yields, thus overcoming limitations of the current state of the art.

SUMMARY OF THE INVENTION

[0007] The invention provides a genetically engineered bacterium comprising exogenous phosphate butyryltransferase (Ptb) and exogenous butyrate kinase (Buk) (Ptb-Buk). Generally, the Ptb-Buk acts on a non-native substrate, e.g., a substrate other than butanoyl-CoA and/or butanoyl phosphate, and produces a non-native product, e.g., a product other than butanoyl phosphate or butyrate. In certain embodiments, the Ptb-Buk converts acetoacetyl-CoA to acetoacetate, 3-hydroxyisovaleryl-CoA to 3-hydroxyisovalerate, 3-hydroxybutyryl-CoA to 3-hydroxybutyrate, or 2-hydroxyisobutyryl-CoA to 2-hydroxyisobutyrate.

[0008] The bacterium may produce one or more of an acid, an alkene, a ketone, an aldehyde, an alcohol, or a diol. More specifically, the bacterium may produce one or more of acetone or a precursor thereof, isopropanol or a precursor thereof, isobutylene or a precursor thereof, 3-hydroxybutyrate or a precursor thereof, 1,3-butanediol or a precursor thereof, 2-hydroxyisobutyrate or a precursor thereof, adipic acid or a precursor thereof, 1,3-hexanediol or a precursor thereof, 3-methyl-2-butanol or a precursor thereof, 2-buten-1-ol or a precursor thereof, isovalerate or a precursor thereof, or isoamyl alcohol or a precursor thereof. The bacterium does not typically produce butanol.

[0009] The bacterium may further comprise a disruptive mutation in a phosphotransacetylase (Pta) and an acetate kinase (Ack). The bacterium may further comprise a disruptive mutation in a thioesterase. In another embodiment, the invention provides a genetically engineered bacterium comprising exogenous Ptb-Buk and exogenous or endogenous aldehyde:ferredoxin oxidoreductase.

[0010] The invention further provides a method of producing a product comprising culturing the bacterium of any of the aforementioned embodiments in the presence of a substrate. The product may be, for example, acetone or a precursor thereof, isopropanol or a precursor thereof, isobutylene or a precursor thereof, 3-hydroxybutyrate or a precursor thereof, 1,3-butanediol or a precursor thereof, 2-hydroxyisobutyrate or a precursor thereof, adipic acid or a precursor thereof, 1,3-hexanediol or a precursor thereof, 3-methyl-2-butanol or a precursor thereof, 2-buten-1-ol or a precursor thereof, isovalerate or a precursor thereof, or isoamyl alcohol or a precursor thereof. Typically, the substrate is a gaseous substrate comprising, for example, one or more of CO, CO.sub.2, and H.sub.2. In one embodiment, the gaseous substrate is syngas. In another embodiment, the gaseous substrate is an industrial waste gas.

BRIEF DESCRIPTION OF THE DRAWINGS

[0011] FIG. 1 is a diagram of metabolic pathways for the production of various products, including acetone, isopropanol, isobutylene, 3-hydroxybutyrate, 1,3-butanediol, and 2-hydroxyisobutyrate from acetyl-CoA. Acetyl-CoA may be generated from any suitable substrate, such as a carbohydrate (e.g., sugar) substrate or a gaseous substrate. In the present invention, acetyl-CoA is often generated from a gaseous substrate. Bold arrows indicate steps that may be catalyzed by Ptb-Buk.

[0012] FIG. 2 is a diagram showing the reactions natively catalyzed by Ptb-Buk, namely the conversion of butanoyl-CoA to butyrate and the generation of one ATP.

[0013] FIG. 3 is a diagram comparing the activities of CoA-transferase, thioesterase, and Ptb-Buk.

[0014] FIG. 4 is a graph showing average acetone production in E. coli BL21 (D3) modified with plasmids comprising exogenous genes. This data demonstrates the ability of Ptb-Buk to convert acetoacetyl-CoA to acetoacetate in E. coli in vivo.

[0015] FIG. 5 is a graph showing the effect of induction of E. coli BL21 (DE3) carrying both the pACYC-ptb-buk and pCOLA-thlA-adc plasmids (expressing thiolase, Ptb-Buk, and acetoacetate decarboxylase).

[0016] FIG. 6 is a diagram of a pathway designed to use Ptb-Buk for acetone production, while recycling the reducing equivalents produced in the production of (R)-3-hydroxybutyryl-CoA and the ATP generated by Ptb-Buk.

[0017] FIG. 7 is a diagram showing the role of aldehyde:ferredoxin oxidoreductase (AOR), ferredoxin, and Adh in the production of 1,3-butanediol in C. autoethanogenum. More generally, AOR may be used to catalyze the conversion of an acid to an aldehyde and Adh may be used to catalyze the conversion of the aldehyde to an alcohol/diol.

[0018] FIG. 8 is a diagram showing the stereospecificity of Ptb-Buk for the production of (R)-3-hydroxybutyrate and 2-hydroxyisobutyrate. The term "native" in FIG. 8 refers to native thioesterase.

[0019] FIG. 9 is a diagram showing the production of isobutene via Ptb-Buk conversion of 3-hydroxyisovaleryl-CoA and 3-hydroxyisovalerate using alternative pathway 1.

[0020] FIG. 10 is a diagram showing the production of isobutene via Ptb-Buk conversion of 3-hydroxyisovaleryl-CoA and 3-hydroxyisovalerate using alternative pathway 2.

[0021] FIG. 11 is a diagram showing the production of 1,3-butanediol via 3-butyraldehyde dehydrogenase (Bld).

[0022] FIG. 12 is a graph showing isopropanol production in C. autoethanogenum using the Ptb-Buk system over a control. .smallcircle. pMTL85147-thlA-adc, .circle-solid. pMTL85147-thlA-ptb-buk-adc.

[0023] FIGS. 13A-F are graphs showing production of 3-hydroxybutyrate, acetate, ethanol, and acetone with modular plasmids in E. coli with different concentrations of inducer IPTG (0, 50, 100 .mu.M). FIG. 13A: pACYC-ptb-buk, pCOLA-thlA-adc, pCDF-phaB. FIG. 13B: pACYC-ptb-buk, pCOLA-thlA-adc, pCDF-phaB-bdh1. FIG. 13C: pCOLA-thlA-adc, pCDF-phaB-bdh1. FIG. 13D: pCOLA-thlA-adc. FIG. 13E: pCDF-phaB-bdh1. FIG. 13F: pCDF-phaB.

[0024] FIG. 14 is a plasmid map of plasmid pMTL8225-budA::thlA-phaB.

[0025] FIG. 15 is a gel image of PCR verification of replacement of acetolactate synthase (budA) genes with thiolase (thlA) and 3-hydroxybutyryl-CoA dehydrogenase (phaB) genes in C. autoethanogenum for 4 clones (1, 4, 7, 9) compared to wild-type (W). All clones are positive as seen by a larger PCR fragment size compared to wild-type.

[0026] FIG. 16 is a graph showing fermentation profile of a batch fermentation C. autoethanogenum budA::thlAphaB strain and demonstrating 3-hydroxybutyrate and 1,3-butanediol formation from gas.

[0027] FIG. 17A is a graph showing production of 1,3-BDO via thiolase, 3-hydroxybutyryl-CoA dehydrogenase (Bld), and butyraldehyde dehydrogenase. FIG. 17B is a graph showing the impact of bld expression on growth.

[0028] FIG. 18A is a graph showing the formation of 3-hydroxybutyrate and 1,3-butanediol from gaseous substrate in C. autoethanogenum pMTL8315-Pfdx-hbd1-thlA. FIG. 18B is a graph showing the reduction of acetate to ethanol in the same culture.

[0029] FIG. 19 is a graph showing the fermentation profile for strain C. autoethanogenum pMTL8315-Pfdx-hbd1-thlA demonstrating formation of 3-hydroxybutyrate and 1,3-butanediol from gaseous substrate in continuous culture (where indicated, media was replenished continuously with given dilution rate D).

[0030] FIG. 20A and FIG. 20B are graphs showing increased CoA hydrolysis activity on a range of acyl-CoAs (acetoacetyl-CoA, 3-hydroxybutyryl-CoA and 2-hydroxyisobutyryl-CoA) in C. autoethanogenum expressing the Ptb-Buk system from plasmid pMTL82256-ptb-buk compared to wild-type (WT).

[0031] FIG. 21A and FIG. 21B are graphs showing reduced acyl-CoA hydrolysis activity of C. autoethanogenum strains with inactivated thioesterases (CT2640=thioesterase 1, CT 1524=thioesterase 2, CT1780=thioesterase 3) compared to activity found in C. autoethanogenum LZ1560 or LZ1561.

[0032] FIG. 22 is a graph showing increased specific isopropanol production in a C. autoethanogenum strain with disrupted thioesterase 3 CAETHG_1780 compared to wild-type C. autoethanogenum.

[0033] FIGS. 23A-D are graphs showing growth (FIG. 23A) and isopropanol (FIG. 23B), acetate (FIG. 23C), and ethanol (FIG. 23D) production profiles of C. autoethanogenum wild-type and strain with disrupted thioesterase 3 (CAETHG_1780) compared to wild-type C. autoethanogenum.

[0034] FIG. 24 is a plasmid map of pMTL8225-pta-ack::ptb-buk.

[0035] FIG. 25 is a gel image indicating the replacement of pta and ack genes replaced with ptb and buk genes and ermB cassette.

[0036] FIG. 26 is a graph showing increased conversion 3-hydroxybutyrate to 1,3-BDO by overexpression of the aldehyde:ferredoxin oxidoreductase gene aor1.

[0037] FIG. 27 is a graph showing the activity of thioesterase TesB, Pta-Ack, and Ptb-Buk system on CoA hydrolysis of acetoacetyl-CoA, 3-hydroxybutyryl-CoA and 2-hydroxyisobutyryl-CoA compared to control (BL21 strain). Ptb-Buk shows highest activity, while Pta-Ack shows no activity.

[0038] FIGS. 28A and 28B are graphs showing production of 3-hydroxybutyrate via Ptb-Buk in combination with an (S)-specific (Hbd) (FIG. 28A) or (R)-specific 3-hydroxybutyrate (PhaB) (FIG. 28B) dehydrogenase.

[0039] FIGS. 29A-D are graphs showing LC-MS/MS detection of 2-hydroxyisobutyric acid (2-HIB) and 2-hydroxybutyrate (2-HB). FIG. 29A: 1 mM 2-HIB standard. FIG. 29B: 1 mM 2-HB standard. FIG. 29C: 0.5 mM 2-HB and 2-HIB standard. FIG. 29D: duplicate of C. autoethanogenum sample showing 2-HIB and 2-HB production from gas.

[0040] FIG. 30 is a set of graphs showing GC-MS confirmation of 2-hydroxyisobutyric acid (8.91 min) production. First panel: C. autoethanogenum+pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB+pMTL82256-tesB. Second panel: C. autoethanogenum+pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB+pMTL82256-ptb-buk (spectrum). Third panel: E. coli+pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB+pMTL82256-tesB. Fourth panel: E. coli+pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB+pMTL82256-ptb-buk.

[0041] FIG. 31 is a set of graphs of real time PCR showing expression of genes of the 2-HIBA pathway (thlA, hba, meaBhcmA, hcmB from pta-ack promoter and respectively Wood-Ljungdahl operon promoter) in E. coli, C. autoethanogenum LZ1561 at 30.degree. C., and C. autoethanogenum LZ1561 at 37.degree. C.

[0042] FIG. 32 is a diagram showing the production of various products in a microorganism comprising Ptb-Buk, AOR, and Adh.

[0043] FIG. 33 is a diagram showing the coupling firefly luciferase (Luc) to the Ptb-Buk system to characterize Ptb-Buk variants.

[0044] FIG. 34 is a diagram of metabolic pathways for the production of various products, including adipic acid. Bold arrows indicate steps that may be catalyzed by Ptb-Buk.

[0045] FIG. 35 is a diagram of metabolic pathways for the production of various products, including 1,3-hexanediol, 2-methyl-2-butanol, and 2-buten-1-ol. Bold arrows indicate steps that may be catalyzed by Ptb-Buk.

[0046] FIG. 36 is a diagram of metabolic pathways for the production of various products, including isovalerate and isoamyl alcohol. Bold arrows indicate steps that may be catalyzed by Ptb-Buk.

[0047] FIG. 37 is a graph of 3-HB production in C. autoethanogenum containing plasmid pMTL82256-thlA-ctfAB at various points of growth.

[0048] FIG. 38A is a graph showing the growth and ethanol and 2,3-butanediol production profile of strain C. autoethanogenum pta-ack::ptb-buk+pMTL85147-thlA-ptb-buk-adc. FIG. 38B is a graph showing the isopropanol and 3-HB production profile of strain C. autoethanogenum pta-ack::ptb-buk+pMTL85147-thlA-ptb-buk-adc.

[0049] FIG. 39 is a diagram of a pathway scheme for producing a range of C.sub.4, C.sub.6, C.sub.8, C.sub.10, C.sub.12, C14 alcohols, ketones, enols or diols via combining known chain elongation pathway (Hbd, Crt, Bcd-EtfAB, Thl) with Ptb-Buk+AOR/Adc-Adh.

[0050] FIG. 40 is a graph showing production of 3-HB and 1,3-BDO by C. autoethanogenum transformed with plasmid pMTL83159-phaB-thlA at various points of growth.

[0051] FIG. 41 is a graph showing production of 3-HB and 1,3-BDO by C. autoethanogenum comprising budA knockout and pMTL-HBD-ThlA at various points of growth.

[0052] FIG. 42A is a graph showing production of 3-HB in a C. autoethanogenum pMTL83159-phaB-thlA+pMTL82256 fermentation. FIG. 42B is a graph showing production of 3-HB in a C. autoethanogenum pMTL83159-phaB-thlA+pMTL82256-buk-ptb fermentation.

[0053] FIG. 43 is a graph showing the production of 3-HB in a C. autoethanogenum strain with thioesterase knockout (.DELTA.CAETHG_1524) expressing plasmid pMTL83156-phaB-thlA with and without Ptb-Buk expression plasmid pMTL82256-buk-ptb.

[0054] FIG. 44 is a graph showing showing ethanol and 1,3-BDO production in a C. autoethanogenum strain expressing plasmid pMTL82256-hbd-thlA (2 pf) with and without AOR overexpression plasmid pMTL83159-aor1 (+aor1).

DETAILED DESCRIPTION OF THE INVENTION

Metabolic Pathways of FIGS. 1 and 34-36

[0055] FIGS. 1 and 34-36 are diagrams of metabolic pathways for the production of various acid, alkene, ketone, aldehyde, alcohol, and diol products, including acetone, isopropanol, isobutylene, 3-hydroxybutyrate (R- and S-isomers), 1,3-butanediol, 2-hydroxyisobutyrate, adipic acid, 1,3-hexanediol, 2-methyl-2-butanol, 2-buten-1-ol, isovalerate, and isoamyl alcohol from a substrate. Bold arrows indicate steps that may be catalyzed by Ptb-Buk. Exemplary enzymes are provided for each of the steps and enzymatic pathways detailed in FIGS. 1 and 34-36. However, additional suitable enzymes may be known to a person of ordinary skill in the art.

[0056] Step 1 shows the conversion of acetyl-CoA to acetoacetyl-CoA. This step may be catalyzed by thiolase (i.e., acetyl-CoA acetyltransferase) (EC 2.3.1.9). The thiolase may be, for example, ThlA from Clostridium acetobutylicum (WP_010966157.1) (SEQ ID NO: 1), PhaA from Cupriavidus necator (WP_013956452.1) (SEQ ID NO: 2), BktB from Cupriavidus necator (WP_011615089.1) (SEQ ID NO: 3), or AtoB from Escherichia coli (NP_416728.1) (SEQ ID NO: 4). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli has native activity for this step.

[0057] Step 2 shows the conversion of acetoacetyl-CoA to acetoacetate. This step may be catalyzed by CoA-transferase (i.e., acetyl-CoA:acetoacetyl-CoA transferase) (EC 2.8.3.9). The CoA-transferase may be, for example, CtfAB, a heterodimer comprising subunits CtfA and CtfB, from Clostridium beijerinckii (CtfA, WP_012059996.1) (SEQ ID NO: 5) (CtfB, WP_012059997.1) (SEQ ID NO: 6). This step may also be catalyzed by thioesterase (EC 3.1.2.20). The thioesterase may be, for example, TesB from Escherichia coli (NP_414986.1) (SEQ ID NO: 7). This step may also be catalyzed by a putative thioesterase, e.g., from Clostridium autoethanogenum or Clostridium ljungdahlii. In particular, three putative thioesterases have been identified in Clostridium autoethanogenum: (1) "thioesterase 1" (AGY74947.1; annotated as palmitoyl-CoA hydrolase; SEQ ID NO: 8), (2) "thioesterase 2" (AGY75747.1; annotated as 4-hydroxybenzoyl-CoA thioesterase; SEQ ID NO: 9), and (3) "thioesterase 3" (AGY75999.1; annotated as putative thioesterase; SEQ ID NO: 10). Three putative thioesterases have also been identified in Clostridium ljungdahlii: (1) "thioesterase 1" (ADK15695.1; annotated as predicted acyl-CoA thioesterase 1; SEQ ID NO: 11), (2) "thioesterase 2" (ADK16655.1; annotated as predicted thioesterase; SEQ ID NO: 12), and (3) "thioesterase 3" (ADK16959.1; annotated as predicted thioesterase; SEQ ID NO: 13). This step may also be catalyzed by phosphate butyryltransferase (EC 2.3.1.19)+butyrate kinase (EC 2.7.2.7). Exemplary sources for phosphate butyryltransferase and butyrate kinase are described elsewhere in this application. Native enzymes in Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (or Escherichia coli), such as thioesterases from Clostridium autoethanogenum, may catalyze this step and result in the production of some amount of downstream products. However, introduction of an exogenous enzyme or overexpression of an endogenous enzyme may be required to produce downstream products at desirable levels. Additionally, in certain embodiments, a disruptive mutation may be introduced to an endogenous enzyme, such as an endogenous thioesterase, to reduce or eliminate competition with introduced Ptb-Buk.

[0058] Step 3 shows the conversion of acetoacetate to acetone. This step may be catalyzed by an acetoacetate decarboxylase (EC 4.1.1.4). The acetoacetate decarboxylase may be, for example, Adc from Clostridium beijerinckii (WP_012059998.1) (SEQ ID NO: 14). This step may also be catalyzed by an alpha-ketoisovalerate decarboxylase (EC 4.1.1.74). The alpha-ketoisovalerate decarboxylase may be, for example, KivD from Lactococcus lactis (SEQ ID NO: 15). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Additionally, Escherichia coli does not have known native activity for this step. Rarely, conversion of acetoacetate to acetone may occur spontaneously. However, spontaneous conversion is highly inefficient and unlikely to result in the production of downstream products at desirable levels.

[0059] Step 4 shows the conversion of acetone to isopropanol. This step may be catalyzed by a primary:secondary alcohol dehydrogenase (EC 1.1.1.2). The primary:secondary alcohol dehydrogenase may be, for example, SecAdh from Clostridium autoethanogenum (AGY74782.1) (SEQ ID NO: 16), SecAdh from Clostridium ljungdahlii (ADK15544.1) (SEQ ID NO: 17), SecAdh from Clostridium ragsdalei (WP_013239134.1) (SEQ ID NO: 18), or SecAdh from Clostridium beijerinckii (WP_026889046.1) (SEQ ID NO: 19). This step may also be catalyzed by a primary:secondary alcohol dehydrogenase (EC 1.1.1.80), such as SecAdh from Thermoanaerobacter brokii (3FSR_A) (SEQ ID NO: 20). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step (Kopke, Appl Environ Microbiol, 80: 3394-3403, 2014). However, Escherichia coli does not have known native activity for this step. Knocking down or knocking out this enzyme in Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei results in the production and accumulation of acetone rather than isopropanol (WO 2015/085015).

[0060] Step 5 shows the conversion of acetone to 3-hydroxyisovalerate. This step may be catalyzed by a hydroxyisovalerate synthase, such as hydroxymethylglutaryl-CoA synthase (HMG-CoA synthase) (EC 2.3.3.10) from Mus musculus (SEQ ID NO: 21) (US 2012/0110001). The hydroxymethylglutaryl-CoA synthase may be engineered to improve activity. Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0061] Step 6 shows the conversion of 3-hydroxyisovalerate to isobutylene (isobutene). This step may be catalyzed by a hydroxyisovalerate phosphorylase/decarboxylase. This step may also be catalyzed by mevalonate diphosphate decarboxylase (hydroxyisovalerate decarboxylase) (EC 4.1.1.33). The mevalonate diphosphate decarboxylase may be, for example, Mdd from Saccharomyces cerevisiae (CAA96324.1) (SEQ ID NO: 22) or Mdd from Picrophilus torridus (WP_011178157.1) (SEQ ID NO: 23) (US 2011/0165644; van Leeuwen, Appl Microbiol Biotechnol, 93: 1377-1387, 2012). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step

[0062] Step 7 shows the conversion of acetone to 3-hydroxyisovaleryl-CoA. This step may be catalyzed by a 3-hydroxyisovaleryl-CoA synthase. Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step

[0063] Step 8 shows the conversion of 3-hydroxyisovaleryl-CoA to 3-hydroxyisovalerate. This step may be catalyzed by CoA-transferase (i.e., acetyl-CoA:acetoacetyl-CoA transferase) (EC 2.8.3.9). The CoA-transferase may be, for example, CtfAB, a heterodimer comprising subunits CtfA and CtfB, from Clostridium beijerinckii (CtfA, WP_012059996.1) (SEQ ID NO: 5) (CtfB, WP_012059997.1) (SEQ ID NO: 6). This step may also be catalyzed by thioesterase (EC 3.1.2.20). The thioesterase may be, for example, TesB from Escherichia coli (NP_414986.1) (SEQ ID NO: 7). This step may also be catalyzed by a putative thioesterase, e.g., from Clostridium autoethanogenum or Clostridium ljungdahlii. In particular, three putative thioesterases have been identified in Clostridium autoethanogenum: (1) "thioesterase 1" (AGY74947.1; annotated as palmitoyl-CoA hydrolase; SEQ ID NO: 8), (2) "thioesterase 2" (AGY75747.1; annotated as 4-hydroxybenzoyl-CoA thioesterase; SEQ ID NO: 9), and (3) "thioesterase 3" (AGY75999.1; annotated as putative thioesterase; SEQ ID NO: 10). Three putative thioesterases have also been identified in Clostridium ljungdahlii: (1) "thioesterase 1" (ADK15695.1; annotated as predicted acyl-CoA thioesterase 1; SEQ ID NO: 11), (2) "thioesterase 2" (ADK16655.1; annotated as predicted thioesterase; SEQ ID NO: 12), and (3) "thioesterase 3" (ADK16959.1; annotated as predicted thioesterase; SEQ ID NO: 13). This step may also be catalyzed by phosphate butyryltransferase (EC 2.3.1.19)+butyrate kinase (EC 2.7.2.7). Exemplary sources for phosphate butyryltransferase and butyrate kinase are described elsewhere in this application. Native enzymes in Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (or Escherichia coli), such as thioesterases from Clostridium autoethanogenum, may catalyze this step and result in the production of some amount of downstream products. However, introduction of an exogenous enzyme or overexpression of an endogenous enzyme may be required to produce downstream products at desirable levels. Additionally, in certain embodiments, a disruptive mutation may be introduced to an endogenous enzyme, such as an endogenous thioesterase, to reduce or eliminate competition with introduced Ptb-Buk.

[0064] Step 9 shows the conversion of acetyl-CoA to 3-methyl-2-oxopentanoate. This step encompasses a number of enzymatic reactions involved in the isoleucine biosynthesis pathway, which is natively present in many bacteria, including Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (and Escherichia coli). Enzymes involved in the conversion of acetyl-CoA to 3-methyl-2-oxopentanoate may include citramalate synthase (EC 2.3.1.182), 3-isopropylmalate dehydratase (EC 4.2.1.35), 3-isopropylmalate dehydrogenase (EC 1.1.1.85), acetolactate synthase (EC 2.2.1.6), ketol-acid reductoisomerase (EC 1.1.1.86), and/or dihydroxyacid dehydratase (EC 4.2.1.9). The citramalate synthase may be, for example, CimA from Clostridium autoethanogenum (AGY76958.1) (SEQ ID NO: 24) or CimA from Methanocaldococcus jannaschii (NP_248395.1) (SEQ ID NO: 25). The 3-isopropylmalate dehydratase may be, for example, LeuCD from Clostridium autoethanogenum (WP_023162955.1, LeuC; AGY77204.1, LeuD) (SEQ ID NOs: 26 and 27, respectively) or LeuCD from Escherichia coli (NP_414614.1, LeuC; NP_414613.1, LeuD) (SEQ ID NOs: 28 and 29, respectively). The 3-isopropylmalate dehydrogenase may be, for example, LeuB from Clostridium autoethanogenum (WP_023162957.1) (SEQ ID NO: 30) or LeuB from Escherichia coli (NP_414615.4) (SEQ ID NO: 31). The acetolactate synthase may be, for example, IlvBN from Clostridium autoethanogenum (AGY74359.1, IlvB; AGY74635.1, IlvB; AGY74360.1, IlvN) (SEQ ID NOs: 32, 33, and 34, respectively) or IlvBN from Escherichia coli (NP_418127.1, IlvB; NP_418126.1, IlvN) (SEQ ID NOs: 35 and 36, respectively). The ketol-acid reductoisomerase may be, for example, IlvC from Clostridium autoethanogenum (WP_013238693.1) (SEQ ID NO: 37) or IlvC from Escherichia coli (NP_418222.1) (SEQ ID NO: 38). The dihydroxyacid dehydratase may be, for example, IlvD from Clostridium autoethanogenum (WP_013238694.1) (SEQ ID NO: 39) or IlvD from Escherichia coli (YP_026248.1) (SEQ ID NO: 40). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step.

[0065] Step 10 shows the conversion of 3-methyl-2-oxopentoate to 2-methylbutanoyl-CoA. This step may be catalyzed by ketoisovalerate oxidoreductase (EC 1.2.7.7). The ketoisovalerate oxidoreductase may be, for example, the VorABCD from Methanothermobacter thermautotrophicus (WP_010876344.1, VorA; WP_010876343.1, VorB; WP_010876342.1, VorC; WP_010876341.1, VorD) (SEQ ID NOs: 41-44, respectively) or VorABCD from Pyrococcus furiosus (WP_011012106.1, VorA; WP_011012105.1, VorB; WP_011012108.1, VorC; WP_011012107.1, VorD) (SEQ ID NOs: 45-48, respectively). VorABCD is a 4-subunit enzyme. Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0066] Step 11 shows the conversion of 2-methylbutanoyl-CoA to 2-methylcrotonyl-CoA. This step may be catalyzed by 2-methylbutanoyl-CoA dehydrogenase (EC 1.3.99.12). The 2-methylbutanoyl-CoA dehydrogenase may be, for example, AcdH from Streptomyces avermitilis (AAD44196.1 or BAB69160.1) (SEQ ID NO: 49) or AcdH from Streptomyces coelicolor (AAD44195.1) (SEQ ID NO: 50). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0067] Step 12 shows the conversion of 2-methylcrotonyl-CoA to 3-hydroxyisovaleryl-CoA. This step may be catalyzed by crotonase/3-hydroxybutyryl-CoA dehydratase (EC 4.2.1.55). The crotonase/3-hydroxybutyryl-CoA dehydratase may be, for example, Crt from Clostridium beijerinckii (ABR34202.1) (SEQ ID NO: 51), Crt from Clostridium acetobutylicum (NP_349318.1) (SEQ ID NO: 52), or LiuC from Myxococcus xanthus (WP_011553770.1). This step may also be catalyzed by crotonyl-CoA carboxylase-reductase (EC 1.3.1.86). The crotonyl-CoA carboxylase-reductase may be, for example, Ccr from Treponema denticola (NP_971211.1) (SEQ ID NO: 53). This step may also be catalyzed by crotonyl-CoA reductase (EC 1.3.1.44). The crotonyl-CoA reductase may be, for example, Ter from Euglena gracilis (AAW66853.1) (SEQ ID NO: 54). This step may also be catalyzed by a 3-hydroxypropionyl-CoA dehydratase (EC 4.2.1.116). This 3-hydroxypropionyl-CoA dehydratase may be, for example, Msed_2001 from Metallosphaera sedula (WP_012021928.1). This step may also be catalyzed by a enoyl-CoA hydratase. This enoyl-CoA hydratase (4.2.1.17) may be, for example, YngF from Bacillus anthracis (WP_000787371.1). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0068] Step 13 shows the conversion of acetoacetyl-CoA to 3-hydroxybutyryl-CoA. This step may be catalyzed by 3-hydroxybutyryl-CoA dehydrogenase (EC 1.1.1.157). The 3-hydroxybutyryl-CoA dehydrogenase may be, for example, Hbd from Clostridium beijerinckii (WP_011967675.1) (SEQ ID NO: 55), Hbd from Clostridium acetobutylicum (NP_349314.1) (SEQ ID NO: 56), or Hbd1 from Clostridium kluyveri (WP_011989027.1) (SEQ ID NO: 57). This step may also be catalyzed by acetoacetyl-CoA reductase (EC 4.2.1.36). The acetoacetyl-CoA reductase may be, for example, PhaB from Cupriavidus necator (WP_010810131.1) (SEQ ID NO: 58). This step may also be catalyzed by acetoacetyl-CoA hydratase (EC 4.2.1.119). Of note, PhaB is R-specific and Hbd is S-specific. Additionally, Hbd1 from Clostridium kluyveri is NADPH-dependent and Hbd from Clostridium acetobutylicum and Clostridium beijerinckii are NADH-dependent. Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0069] Step 14 shows the conversion of 3-hydroxybutyryl-CoA to 3-hydroxybutyrate. This step may be catalyzed by thioesterase (EC 3.1.2.20). The thioesterase may be, for example, TesB from Escherichia coli (NP_414986.1) (SEQ ID NO: 7). This step may also be catalyzed by a putative thioesterase, e.g., from Clostridium autoethanogenum or Clostridium ljungdahlii. In particular, three putative thioesterases have been identified in Clostridium autoethanogenum: (1) "thioesterase 1" (AGY74947.1; annotated as palmitoyl-CoA hydrolase; SEQ ID NO: 8), (2) "thioesterase 2" (AGY75747.1; annotated as 4-hydroxybenzoyl-CoA thioesterase; SEQ ID NO: 9), and (3) "thioesterase 3" (AGY75999.1; annotated as putative thioesterase; SEQ ID NO: 10). Three putative thioesterases have also been identified in Clostridium ljungdahlii: (1) "thioesterase 1" (ADK15695.1; annotated as predicted acyl-CoA thioesterase 1; SEQ ID NO: 11), (2) "thioesterase 2" (ADK16655.1; annotated as predicted thioesterase; SEQ ID NO: 12), and (3) "thioesterase 3" (ADK16959.1; annotated as predicted thioesterase; SEQ ID NO: 13). This step may also be catalyzed by phosphate butyryltransferase (EC 2.3.1.19)+butyrate kinase (EC 2.7.2.7). Exemplary sources for phosphate butyryltransferase and butyrate kinase are described elsewhere in this application. Native enzymes in Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (or Escherichia coli), such as thioesterases from Clostridium autoethanogenum, may catalyze this step and result in the production of some amount of downstream products. However, introduction of an exogenous enzyme or overexpression of an endogenous enzyme may be required to produce downstream products at desirable levels. Additionally, in certain embodiments, a disruptive mutation may be introduced to an endogenous enzyme, such as an endogenous thioesterase, to reduce or eliminate competition with introduced Ptb-Buk.

[0070] Step 15 shows the conversion of 3-hydroxybutyrate to acetoacetate. This step may be catalyzed by 3-hydroxybutyrate dehydrogenase (EC 1.1.1.30). The 3-hydroxybutyrate dehydrogenase may be, for example, Bdh1 from Ralstonia pickettii (BAE72684.1) (SEQ ID NO: 60) or Bdh2 from Ralstonia pickettii (BAE72685.1) (SEQ ID NO: 61). The reverse reaction, the conversion of acetoacetate to 3-hydroxybutyrate, may be catalyzed by different 3-hydroxybutyrate dehydrogenase (EC 1.1.1.30) enzymes. For example, the conversion of acetoacetate to 3-hydroxybutyrate may be catalyzed by Bdh from Clostridium autoethanogenum (AGY75962) (SEQ ID NO: 62). Clostridium ljungdahlii and Clostridium ragsdalei likely have enzymes with similar activity. Escherichia coli does not have known native activity for this step.

[0071] Step 16 shows the conversion of 3-hydroxybutyrate to 3-hydroxybutyrylaldehyde. This step may be catalyzed by aldehyde:ferredoxin oxidoreductase (EC 1.2.7.5). The aldehyde:ferredoxin oxidoreductase (AOR) may be, for example, AOR from Clostridium autoethanogenum (WP_013238665.1; WP_013238675.1) (SEQ ID NOs: 63 and 64, respectively) or AOR from Clostridium ljungdahlii (ADK15073.1; ADK15083.1) (SEQ ID NOs: 65 and 66, respectively). In further embodiments, the aldehyde:ferredoxin oxidoreductase may be or may be derived, for example, from any of the following sources, the sequences of which are publically available:

TABLE-US-00001 Description Microrganism Accession GeneID aldehyde:ferredoxin oxidoreductase Acidilobus saccharovorans 345-15 NC_014374.1 9498931 aldehyde:ferredoxin oxidoreductase Acidilobus saccharovorans 345-15 NC_014374.1 9499504 aldehyde:ferredoxin oxidoreductase Acidilobus saccharovorans 345-15 NC_014374.1 9499550 aldehyde:ferredoxin oxidoreductase Acidilobus saccharovorans 345-15 NC_014374.1 9498997 aldehyde:ferredoxin oxidoreductase Aciduliprofundum boonei T469 NC_013926.1 8828075 aldehyde:ferredoxin oxidoreductase Aciduliprofundum boonei T469 NC_013926.1 8828511 aldehyde:ferredoxin oxidoreductase Aciduliprofundum boonei T469 NC_013926.1 8828305 aldehyde:ferredoxin oxidoreductase Aciduliprofundum boonei T469 NC_013926.1 8827762 aldehyde:ferredoxin oxidoreductase Aciduliprofundum boonei T469 NC_013926.1 8827370 aldehyde:ferredoxin oxidoreductase Aciduliprofundum sp. MAR08-339 NC_019942.1 14306579 aldehyde:ferredoxin oxidoreductase Aciduliprofundum sp. MAR08-339 NC_019942.1 14306982 aldehyde:ferredoxin oxidoreductase Aciduliprofundum sp. MAR08-339 NC_019942.1 14306639 aldehyde:ferredoxin oxidoreductase Aciduliprofundum sp. MAR08-339 NC_019942.1 14307339 aldehyde:ferredoxin oxidoreductase Aeropyrum pernix K1 NC_000854.2 1444491 aldehyde:ferredoxin oxidoreductase Archaeoglobus fulgidus DSM 4304 NC_000917.1 1483287 aldehyde:ferredoxin oxidoreductase Archaeoglobus fulgidus DSM 4304 NC_000917.1 1483233 aldehyde:ferredoxin oxidoreductase Archaeoglobus fulgidus DSM 4304 NC_000917.1 1483554 aldehyde:ferredoxin oxidoreductase Archaeoglobus fulgidus DSM 4304 NC_000917.1 1485513 aldehyde:ferredoxin oxidoreductase Archaeoglobus profundus DSM NC_013741.1 8738726 5631 aldehyde:ferredoxin oxidoreductase Archaeoglobus profundus DSM NC_013741.1 8740019 5631 aldehyde:ferredoxin oxidoreductase Archaeoglobus sulfaticallidus NC_021169.1 15392228 PM70-1 aldehyde:ferredoxin oxidoreductase Archaeoglobus sulfaticallidus NC_021169.1 15393814 PM70-1 aldehyde:ferredoxin oxidoreductase Archaeoglobus sulfaticallidus NC_021169.1 15391826 PM70-1 aldehyde:ferredoxin oxidoreductase Archaeoglobus sulfaticallidus NC_021169.1 15393763 PM70-1 aldehyde:ferredoxin oxidoreductase Archaeoglobus sulfaticallidus NC_021169.1 15393491 PM70-1 aldehyde:ferredoxin oxidoreductase Archaeoglobus veneficus SNP6 NC_015320.1 10393142 aldehyde:ferredoxin oxidoreductase Archaeoglobus veneficus SNP6 NC_015320.1 10395048 aldehyde:ferredoxin oxidoreductase Caldisphaera lagunensis DSM NC_019791.1 14212403 15908 aldehyde:ferredoxin oxidoreductase Caldisphaera lagunensis DSM NC_019791.1 14211524 15908 aldehyde:ferredoxin oxidoreductase Caldisphaera lagunensis DSM NC_019791.1 14212092 15908 aldehyde:ferredoxin oxidoreductase Caldisphaera lagunensis DSM NC_019791.1 14212561 15908 aldehyde:ferredoxin oxidoreductase Caldivirga maquilingensis IC-167 NC_009954.1 5710116 aldehyde:ferredoxin oxidoreductase Caldivirga maquilingensis IC-167 NC_009954.1 5710117 aldehyde:ferredoxin oxidoreductase Caldivirga maquilingensis IC-167 NC_009954.1 5709088 aldehyde:ferredoxin oxidoreductase Caldivirga maquilingensis IC-167 NC_009954.1 5708891 aldehyde:ferredoxin oxidoreductase Caldivirga maquilingensis IC-167 NC_009954.1 5710478 aldehyde:ferredoxin oxidoreductase Caldivirga maquilingensis IC-167 NC_009954.1 5710457 aldehyde:ferredoxin oxidoreductase Caldivirga maquilingensis IC-167 NC_009954.1 5709696 aldehyde:ferredoxin oxidoreductase Candidatus Caldiarchaeum NC_022786.1 17602865 subterraneum aldehyde:ferredoxin oxidoreductase Candidatus Korarchaeum NC_010482.1 6094361 cryptofilum OPF8 aldehyde:ferredoxin oxidoreductase Candidatus Korarchaeum NC_010482.1 6094198 cryptofilum OPF8 aldehyde:ferredoxin oxidoreductase Candidatus Korarchaeum NC_010482.1 6093546 cryptofilum OPF8 aldehyde:ferredoxin oxidoreductase Candidatus Korarchaeum NC_010482.1 6093319 cryptofilum OPF8 aldehyde:ferredoxin oxidoreductase Candidatus Korarchaeum NC_010482.1 6094057 cryptofilum OPF8 aldehyde:ferredoxin oxidoreductase Candidatus Korarchaeum NC_010482.1 6093563 cryptofilum OPF8 aldehyde:ferredoxin oxidoreductase Chloroflexus aurantiacus J-10-fl NC_010175.1 5828639 aldehyde:ferredoxin oxidoreductase Clostridium acetobutylicum ATCC NC_003030 .1 1118201 824 aldehyde:ferredoxin oxidoreductase Clostridium botulinum A sfr. ATCC NC_009495.1 5187636 3502 aldehyde:ferredoxin oxidoreductase Clostridium botulinum A str. Hall NC_009698.1 5400593 aldehyde:ferredoxin oxidoreductase Desulfovibrio vulgaris str. NC_002937.3 2796664 Hildenborough aldehyde:ferredoxin oxidoreductase Desulfovibrio vulgaris str. NC_002937.3 2795337 Hildenborough aldehyde:ferredoxin oxidoreductase Desulfurococcus fermentans DSM NC_018001.1 13061477 16532 aldehyde:ferredoxin oxidoreductase Desulfurococcus fermentans DSM NC_018001.1 13061068 16532 aldehyde:ferredoxin oxidoreductase Desulfurococcus fermentans DSM NC_018001.1 13062247 16532 aldehyde:ferredoxin oxidoreductase Desulfurococcus kamchatkensis NC_011766.1 7171099 1221n aldehyde:ferredoxin oxidoreductase Desulfurococcus kamchatkensis NC_011766.1 7171759 1221n aldehyde:ferredoxin oxidoreductase Desulfurococcus kamchatkensis NC_011766.1 7170725 1221n aldehyde:ferredoxin oxidoreductase Desulfurococcus mucosus DSM NC_014961.1 10152801 2162 aldehyde:ferredoxin oxidoreductase Ferroglobus placidus DSM 10642 NC_013849.1 8778536 aldehyde:ferredoxin oxidoreductase Ferroglobus placidus DSM 10642 NC_013849.1 8779007 aldehyde:ferredoxin oxidoreductase Ferroglobus placidus DSM 10642 NC_013849.1 8778940 aldehyde:ferredoxin oxidoreductase Ferroglobus placidus DSM 10642 NC_013849.1 8779639 aldehyde:ferredoxin oxidoreductase Ferroglobus placidus DSM 10642 NC_013849.1 8778820 aldehyde:ferredoxin oxidoreductase Ferroglobus placidus DSM 10642 NC_013849.1 8778745 aldehyde:ferredoxin oxidoreductase Ferroglobus placidus DSM 10642 NC_013849.1 8779874 aldehyde:ferredoxin oxidoreductase Fervidicoccus fontis Kam940 NC_017461.1 12449263 aldehyde:ferredoxin oxidoreductase Fervidicoccus fontis Kam940 NC_017461.1 12449994 aldehyde:ferredoxin oxidoreductase Fervidicoccus fontis Kam940 NC_017461.1 12449294 aldehyde:ferredoxin oxidoreductase Fervidicoccus fontis Kam940 NC_017461.1 12449682 aldehyde:ferredoxin oxidoreductase Geobacter sulfurreducens PCA NC_002939.5 2685730 aldehyde:ferredoxin oxidoreductase Geobacter sulfurreducens PCA NC_002939.5 2687039 aldehyde:ferredoxin oxidoreductase Halalkalicoccus jeotgali B3 NC_014297.1 9418623 aldehyde:ferredoxin oxidoreductase Halalkalicoccus jeotgali B3 NC_014297.1 9418760 aldehyde:ferredoxin oxidoreductase Halalkalicoccus jeotgali B3 NC_014297.1 9420819 aldehyde:ferredoxin oxidoreductase Halalkalicoccus jeotgali B3 NC_014297.1 9418748 aldehyde:ferredoxin oxidoreductase Haloarcula hispanica ATCC 33960 NC_015948.1 11051410 aldehyde:ferredoxin oxidoreductase Haloarcula hispanica ATCC 33960 NC_015948.1 11050783 aldehyde:ferredoxin oxidoreductase Haloarcula hispanica ATCC 33960 NC_015948.1 11051433 aldehyde:ferredoxin oxidoreductase Haloarcula hispanica N601 NC_023013.1 23805333 aldehyde:ferredoxin oxidoreductase Haloarcula hispanica N601 NC_023013.1 23805138 aldehyde:ferredoxin oxidoreductase Haloarcula hispanica N601 NC_023013.1 23804665 aldehyde:ferredoxin oxidoreductase Haloarcula marismortui ATCC NC_006396.1 3127969 43049 aldehyde:ferredoxin oxidoreductase Haloarcula marismortui ATCC NC_006396.1 3129232 43049 aldehyde:ferredoxin oxidoreductase Haloferax mediterranei ATCC NC_017941.2 13028168 33500 aldehyde:ferredoxin oxidoreductase Haloferax mediterranei ATCC NC_017941.2 13028399 33500 aldehyde:ferredoxin oxidoreductase Haloferax volcanii DS2 NC_013964.1 8919329 aldehyde:ferredoxin oxidoreductase Haloferax volcanii DS2 NC_013964.1 8919033 aldehyde:ferredoxin oxidoreductase Haloferax volcanii DS2 NC_013967.1 8926544 aldehyde:ferredoxin oxidoreductase Halogeomefricum borinquense DSM NC_014735.1 9989054 11551 aldehyde:ferredoxin oxidoreductase Halogeomefricum borinquense DSM NC_014729.1 9994424 11551 aldehyde:ferredoxin oxidoreductase Halogeomefricum borinquense DSM NC_014729.1 9992444 11551 aldehyde:ferredoxin oxidoreductase halophilic archaeon DL31 NC_015954.1 11095016 aldehyde:ferredoxin oxidoreductase halophilic archaeon DL31 NC_015954.1 11095541 aldehyde:ferredoxin oxidoreductase halophilic archaeon DL31 NC_015954.1 11094595 aldehyde:ferredoxin oxidoreductase halophilic archaeon DL31 NC_015954.1 11096497 aldehyde:ferredoxin oxidoreductase halophilic archaeon DL31 NC_015954.1 11094563 aldehyde:ferredoxin oxidoreductase halophilic archaeon DL31 NC_015954.1 11095602 aldehyde:ferredoxin oxidoreductase Halopiger xanaduensis SH-6 NC_015666.1 10799161 aldehyde:ferredoxin oxidoreductase Halopiger xanaduensis SH-6 NC_015658.1 10795465 aldehyde:ferredoxin oxidoreductase Halopiger xanaduensis SH-6 NC_015666.1 10798686 aldehyde:ferredoxin oxidoreductase Halopiger xanaduensis SH-6 NC_015666.1 10796679 aldehyde:ferredoxin oxidoreductase Halorubrum lacusprofundi ATCC NC_012029.1 7400122 49239 aldehyde:ferredoxin oxidoreductase Halorubrum lacusprofundi ATCC NC_012029.1 7400291 49239 aldehyde:ferredoxin oxidoreductase Halorubrum lacusprofundi ATCC NC_012029.1 7400689

49239 aldehyde:ferredoxin oxidoreductase Haloterrigena turkmenica DSM NC_013744.1 8744461 5511 aldehyde:ferredoxin oxidoreductase Haloterrigena turkmenica DSM NC_013744.1 8744695 5511 aldehyde:ferredoxin oxidoreductase Haloterrigena turkmenica DSM NC_013743.1 8740954 5511 aldehyde:ferredoxin oxidoreductase Haloterrigena turkmenica DSM NC_013745.1 8745418 5511 aldehyde:ferredoxin oxidoreductase Haloterrigena turkmenica DSM NC_013743.1 8742968 5511 aldehyde:ferredoxin oxidoreductase Haloterrigena turkmenica DSM NC_013743.1 8741246 5511 aldehyde:ferredoxin oxidoreductase Haloterrigena turkmenica DSM NC_013743.1 8741269 5511 aldehyde:ferredoxin oxidoreductase Haloterrigena turkmenica DSM NC_013745.1 8745313 5511 aldehyde:ferredoxin oxidoreductase Hyperthermus butylicus DSM 5456 NC_008818.1 4781896 aldehyde:ferredoxin oxidoreductase Hyperthermus butylicus DSM 5456 NC_008818.1 4782266 aldehyde:ferredoxin oxidoreductase Hyperthermus butylicus DSM 5456 NC_008818.1 4782804 aldehyde:ferredoxin oxidoreductase Hyperthermus butylicus DSM 5456 NC_008818.1 4781774 aldehyde:ferredoxin oxidoreductase Ignicoccus hospitalis KIN4/I NC_009776.1 5562477 aldehyde:ferredoxin oxidoreductase Ignicoccus hospitalis KIN4/I NC_009776.1 5562774 aldehyde:ferredoxin oxidoreductase Ignisphaera aggregans DSM 17230 NC_014471.1 9716798 aldehyde:ferredoxin oxidoreductase Methanocaldococcus jannaschii NC_000909.1 1452083 DSM 2661 aldehyde:ferredoxin oxidoreductase Methanocella arvoryzae MRE50 NC_009464.1 5142690 aldehyde:ferredoxin oxidoreductase Methanocella arvoryzae MRE50 NC_009464.1 5143773 aldehyde:ferredoxin oxidoreductase Methanocella conradii HZ254 NC_017034.1 11972399 aldehyde:ferredoxin oxidoreductase Methanocella conradii HZ254 NC_017034.1 11971349 aldehyde:ferredoxin oxidoreductase Methanocella paludicola SANAE NC_013665.1 8680711 aldehyde:ferredoxin oxidoreductase Methanocella paludicola SANAE NC_013665.1 8680676 aldehyde:ferredoxin oxidoreductase Methanocorpusculum labreanum Z NC_008942.1 4795790 aldehyde:ferredoxin oxidoreductase Methanoculleus marisnigri JR1 NC_009051.1 4847673 aldehyde:ferredoxin oxidoreductase Methanohalobium evestigatum Z- NC_014253.1 9347460 7303 aldehyde:ferredoxin oxidoreductase Methanohalobium evestigatum Z- NC_014253.1 9347022 7303 aldehyde:ferredoxin oxidoreductase Methanolobus psychrophilus R15 NC_018876.1 13845119 aldehyde:ferredoxin oxidoreductase Methanomethylovorans hollandica NC_019977.1 14408029 DSM 15978 aldehyde:ferredoxin oxidoreductase Methanosaeta harundinacea 6Ac NC_017527.1 12511443 aldehyde:ferredoxin oxidoreductase Methanosaeta thermophila PT NC_008553.1 4462364 aldehyde:ferredoxin oxidoreductase Methanosalsum zhilinae DSM 4017 NC_015676.1 10822365 aldehyde:ferredoxin oxidoreductase Methanosarcina acetivorans C2A NC_003552.1 1475882 aldehyde:ferredoxin oxidoreductase Methanosarcina acetivorans C2A NC_003552.1 1474856 aldehyde:ferredoxin oxidoreductase Methanosarcina acetivorans C2A NC_003552.1 1473602 aldehyde:ferredoxin oxidoreductase Methanosarcina barkeri str. Fusaro NC_007355.1 3625763 aldehyde:ferredoxin oxidoreductase Methanosarcina mazei Go1 NC_003901.1 1479263 aldehyde:ferredoxin oxidoreductase Methanosarcina mazei Go1 NC_003901.1 1481668 aldehyde:ferredoxin oxidoreductase Methanosarcina mazei Go1 NC_003901.1 1480987 aldehyde:ferredoxin oxidoreductase Methanosarcina mazei Tuc01 NC_020389.1 14656065 aldehyde:ferredoxin oxidoreductase Methanosarcina mazei Tuc01 NC_020389.1 14656771 aldehyde:ferredoxin oxidoreductase Methanosarcina mazei Tuc01 NC_020389.1 14654304 aldehyde:ferredoxin oxidoreductase Methanosphaerula palustris E1-9c NC_011832.1 7271108 aldehyde:ferredoxin oxidoreductase Methanospirillum hungatei JF-1 NC_007796.1 3924565 aldehyde:ferredoxin oxidoreductase Methylomicrobium alcaliphilum 20Z NC_016112.1 11361147 aldehyde:ferredoxin oxidoreductase Moorella thermoacetica ATCC NC_007644.1 3831332 39073 aldehyde:ferredoxin oxidoreductase Moorella thermoacetica ATCC NC_007644.1 3830998 39073 aldehyde:ferredoxin oxidoreductase Moorella thermoacetica ATCC NC_007644.1 3831866 39073 aldehyde:ferredoxin oxidoreductase Natrialba magadii ATCC 43099 NC_013922.1 8824961 aldehyde:ferredoxin oxidoreductase Natrialba magadii ATCC 43099 NC_013922.1 8823392 aldehyde:ferredoxin oxidoreductase Natrialba magadii ATCC 43099 NC_013923.1 8826737 aldehyde:ferredoxin oxidoreductase Natrialba magadii ATCC 43099 NC_013922.1 8825516 aldehyde:ferredoxin oxidoreductase Natrinema pellirubrum DSM 15624 NC_019962.1 14335278 aldehyde:ferredoxin oxidoreductase Natrinema pellirubrum DSM 15624 NC_019962.1 14333050 aldehyde:ferredoxin oxidoreductase Natrinema pellirubrum DSM 15624 NC_019962.1 14333754 aldehyde:ferredoxin oxidoreductase Natrinema sp. J7-2 NC_018224.1 13349954 aldehyde:ferredoxin oxidoreductase Natronobacterium gregoryi SP2 NC_019792.1 14210296 aldehyde:ferredoxin oxidoreductase Natronobacterium gregoryi SP2 NC_019792.1 14207133 aldehyde:ferredoxin oxidoreductase Natronobacterium gregoryi SP2 NC_019792.1 14209682 aldehyde:ferredoxin oxidoreductase Natronobacterium gregoryi SP2 NC_019792.1 14207576 aldehyde:ferredoxin oxidoreductase Natronobacterium gregoryi SP2 NC_019792.1 14206941 aldehyde:ferredoxin oxidoreductase Natronobacterium gregoryi SP2 NC_019792.1 14206532 aldehyde:ferredoxin oxidoreductase Natronococcus occultus SP4 NC_019974.1 14403316 aldehyde:ferredoxin oxidoreductase Natronococcus occultus SP4 NC_019974.1 14405255 aldehyde:ferredoxin oxidoreductase Natronococcus occultus SP4 NC_019974.1 14403781 aldehyde:ferredoxin oxidoreductase Natronococcus occultus SP4 NC_019974.1 14402014 aldehyde:ferredoxin oxidoreductase Natronomonas moolapensis 8.8.11 NC_020388.1 14651997 aldehyde:ferredoxin oxidoreductase Natronomonas moolapensis 8.8.11 NC_020388.1 14652892 aldehyde:ferredoxin oxidoreductase Natronomonas moolapensis 8.8.11 NC_020388.1 14651999 aldehyde:ferredoxin oxidoreductase Natronomonas pharaonis DSM 2160 NC_007427.1 3694680 aldehyde:ferredoxin oxidoreductase Natronomonas pharaonis DSM 2160 NC_007426.1 3702508 aldehyde:ferredoxin oxidoreductase Natronomonas pharaonis DSM 2160 NC_007426.1 3702507 aldehyde:ferredoxin oxidoreductase Natronomonas pharaonis DSM 2160 NC_007426.1 3702509 aldehyde:ferredoxin oxidoreductase Pyrobaculum aerophilum str. IM2 NC_003364.1 1464236 aldehyde:ferredoxin oxidoreductase Pyrobaculum aerophilum str. IM2 NC_003364 .1 1464102 aldehyde:ferredoxin oxidoreductase Pyrobaculum aerophilum str. IM2 NC_003364.1 1465126 aldehyde:ferredoxin oxidoreductase Pyrobaculum aerophilum str. IM2 NC_003364.1 1465445 aldehyde:ferredoxin oxidoreductase Pyrobaculum arsenaticum DSM NC_009376.1 5055904 13514 aldehyde:ferredoxin oxidoreductase Pyrobaculum arsenaticum DSM NC_009376.1 5055700 13514 aldehyde:ferredoxin oxidoreductase Pyrobaculum arsenaticum DSM NC_009376.1 5054881 13514 aldehyde:ferredoxin oxidoreductase Pyrobaculum arsenaticum DSM NC_009376.1 5054644 13514 aldehyde:ferredoxin oxidoreductase Pyrobaculum arsenaticum DSM NC_009376.1 5054547 13514 aldehyde:ferredoxin oxidoreductase Pyrobaculum calidifontis JCM NC_009073.1 4910224 11548 aldehyde:ferredoxin oxidoreductase Pyrobaculum calidifontis JCM NC_009073.1 4908822 11548 aldehyde:ferredoxin oxidoreductase Pyrobaculum calidifontis JCM NC_009073.1 4909927 11548 aldehyde:ferredoxin oxidoreductase Pyrobaculum calidifontis JCM NC_009073.1 4910099 11548 aldehyde:ferredoxin oxidoreductase Pyrobaculum islandicum DSM 4184 NC_008701.1 4617364 aldehyde:ferredoxin oxidoreductase Pyrobaculum islandicum DSM 4184 NC_008701.1 4616724 aldehyde:ferredoxin oxidoreductase Pyrobaculum islandicum DSM 4184 NC_008701.1 4617494 aldehyde:ferredoxin oxidoreductase Pyrobaculum neutrophilum V24Sta NC_010525.1 6165427 aldehyde:ferredoxin oxidoreductase Pyrobaculum neutrophilum V24Sta NC_010525.1 6164958 aldehyde:ferredoxin oxidoreductase Pyrobaculum neutrophilum V24Sta NC_010525.1 6164976 aldehyde:ferredoxin oxidoreductase Pyrobaculum oguniense TE7 NC_016885.1 11853778 aldehyde:ferredoxin oxidoreductase Pyrobaculum oguniense TE7 NC_016885.1 11854024 aldehyde:ferredoxin oxidoreductase Pyrobaculum oguniense TE7 NC_016885.1 11856490 aldehyde:ferredoxin oxidoreductase Pyrobaculum oguniense TE7 NC_016885.1 11856176 aldehyde:ferredoxin oxidoreductase Pyrobaculum oguniense TE7 NC_016885.1 11854908 aldehyde:ferredoxin oxidoreductase Pyrobaculum sp. 1860 NC_016645.1 11594868 aldehyde:ferredoxin oxidoreductase Pyrobaculum sp. 1860 NC_016645.1 11596631 aldehyde:ferredoxin oxidoreductase Pyrobaculum sp. 1860 NC_016645.1 11594049 aldehyde:ferredoxin oxidoreductase Pyrococcus abyssi GE5 NC_000868.1 1496313 aldehyde:ferredoxin oxidoreductase Pyrococcus abyssi GE5 NC_000868.1 1495669 aldehyde:ferredoxin oxidoreductase Pyrococcus abyssi GE5 NC_000868.1 1496580 aldehyde:ferredoxin oxidoreductase Pyrococcus abyssi GE5 NC_000868.1 1495287 aldehyde:ferredoxin oxidoreductase Pyrococcus furiosus COM1 NC_018092.1 13302148 aldehyde:ferredoxin oxidoreductase Pyrococcus furiosus COM1 NC_018092.1 13301806 aldehyde:ferredoxin oxidoreductase Pyrococcus furiosus COM1 NC_018092.1 13301219 aldehyde:ferredoxin oxidoreductase Pyrococcus furiosus COM1 NC_018092.1 13300785 aldehyde:ferredoxin oxidoreductase Pyrococcus furiosus DSM 3638 NC_003413.1 1468181 aldehyde:ferredoxin oxidoreductase Pyrococcus furiosus DSM 3638 NC_003413.1 1469073 aldehyde:ferredoxin oxidoreductase Pyrococcus furiosus DSM 3638 NC_003413.1 1469843 aldehyde:ferredoxin oxidoreductase Pyrococcus horikoshii OT3 NC_000961.1 1443218 aldehyde:ferredoxin oxidoreductase Pyrococcus horikoshii OT3 NC_000961.1 1443341 aldehyde:ferredoxin oxidoreductase Pyrococcus horikoshii OT3 NC_000961.1 1443932

aldehyde:ferredoxin oxidoreductase Pyrococcus horikoshii OT3 NC_000961.1 1443598 aldehyde:ferredoxin oxidoreductase Pyrococcus sp. NA2 NC_015474.1 10555029 aldehyde:ferredoxin oxidoreductase Pyrococcus sp. NA2 NC_015474.1 10554020 aldehyde:ferredoxin oxidoreductase Pyrococcus sp. NA2 NC_015474.1 10555341 aldehyde:ferredoxin oxidoreductase Pyrococcus sp. ST04 NC_017946.1 13022107 aldehyde:ferredoxin oxidoreductase Pyrococcus sp. ST04 NC_017946.1 13022436 aldehyde:ferredoxin oxidoreductase Pyrococcus sp. ST04 NC_017946.1 13021314 aldehyde:ferredoxin oxidoreductase Pyrococcus yayanosii CH1 NC_015680.1 10837518 aldehyde:ferredoxin oxidoreductase Pyrococcus yayanosii CH1 NC_015680.1 10837112 aldehyde:ferredoxin oxidoreductase Pyrococcus yayanosii CH1 NC_015680.1 10837264 aldehyde:ferredoxin oxidoreductase Pyrolobus fumarii 1A NC_015931.1 11138144 aldehyde:ferredoxin oxidoreductase Pyrolobus fumarii 1A NC_015931.1 11138776 aldehyde:ferredoxin oxidoreductase Pyrolobus fumarii 1A NC_015931.1 11139127 aldehyde:ferredoxin oxidoreductase Rhodospirillum rubrum ATCC NC_007643.1 3833668 11170 aldehyde:ferredoxin oxidoreductase Staphylothermus hellenicus DSM NC_014205.1 9234557 12710 aldehyde:ferredoxin oxidoreductase Staphylothermus hellenicus DSM NC_014205.1 9233414 12710 aldehyde:ferredoxin oxidoreductase Staphylothermus hellenicus DSM NC_014205.1 9234134 12710 aldehyde:ferredoxin oxidoreductase Staphylothermus hellenicus DSM NC_014205.1 9234110 12710 aldehyde:ferredoxin oxidoreductase Staphylothermus marinus F1 NC_009033.1 4907444 aldehyde:ferredoxin oxidoreductase Staphylothermus marinus F1 NC_009033.1 4907343 aldehyde:ferredoxin oxidoreductase Thermanaerovibrio NC_013522.1 8630284 acidaminovorans DSM 6589 aldehyde:ferredoxin oxidoreductase Thermanaerovibrio NC_013522.1 8630027 acidaminovorans DSM 6589 aldehyde:ferredoxin oxidoreductase Thermanaerovibrio NC_013522.1 8630623 acidaminovorans DSM 6589 aldehyde:ferredoxin oxidoreductase Thermoanaerobacter wiegelii NC_015958.1 11082596 Rt8.B1 aldehyde:ferredoxin oxidoreductase Thermococcus barophilus MP NC_014804.1 10041639 aldehyde:ferredoxin oxidoreductase Thermococcus barophilus MP NC_014804.1 10041106 aldehyde:ferredoxin oxidoreductase Thermococcus barophilus MP NC_014804.1 10042460 aldehyde:ferredoxin oxidoreductase Thermococcus cleftensis NC_018015.1 13037745 aldehyde:ferredoxin oxidoreductase Thermococcus cleftensis NC_018015.1 13038896 aldehyde:ferredoxin oxidoreductase Thermococcus cleftensis NC_018015.1 13037242 aldehyde:ferredoxin oxidoreductase Thermococcus gammatolerans EJ3 NC_012804.1 7988317 aldehyde:ferredoxin oxidoreductase Thermococcus gammatolerans EJ3 NC_012804.1 7987451 aldehyde:ferredoxin oxidoreductase Thermococcus kodakarensis KOD1 NC_006624.1 3233851 aldehyde:ferredoxin oxidoreductase Thermococcus kodakarensis KOD1 NC_006624.1 3233735 aldehyde:ferredoxin oxidoreductase Thermococcus litoralis DSM 5473 NC_022084.1 16550741 aldehyde:ferredoxin oxidoreductase Thermococcus litoralis DSM 5473 NC_022084.1 16548761 aldehyde:ferredoxin oxidoreductase Thermococcus litoralis DSM 5473 NC_022084.1 16550885 aldehyde:ferredoxin oxidoreductase Thermococcus onnurineus NA1 NC_011529.1 7018383 aldehyde:ferredoxin oxidoreductase Thermococcus onnurineus NA1 NC_011529.1 7016739 aldehyde:ferredoxin oxidoreductase Thermococcus onnurineus NA1 NC_011529.1 7017051 aldehyde:ferredoxin oxidoreductase Thermococcus onnurineus NA1 NC_011529.1 7017476 aldehyde:ferredoxin oxidoreductase Thermococcus sibiricus MM 739 NC_012883.1 8096638 aldehyde:ferredoxin oxidoreductase Thermococcus sibiricus MM 739 NC_012883.1 8096005 aldehyde:ferredoxin oxidoreductase Thermococcus sibiricus MM 739 NC_012883.1 8096629 aldehyde:ferredoxin oxidoreductase Thermococcus sibiricus MM 739 NC_012883.1 8095463 aldehyde:ferredoxin oxidoreductase Thermococcus sibiricus MM 739 NC_012883.1 8096131 aldehyde:ferredoxin oxidoreductase Thermococcus sibiricus MM 739 NC_012883.1 8096636 aldehyde:ferredoxin oxidoreductase Thermococcus sp. 4557 NC_015865.1 11015504 aldehyde:ferredoxin oxidoreductase Thermococcus sp. 4557 NC_015865.1 11015249 aldehyde:ferredoxin oxidoreductase Thermococcus sp. 4557 NC_015865.1 11015571 aldehyde:ferredoxin oxidoreductase Thermococcus sp. AM4 NC_016051.1 7419050 aldehyde:ferredoxin oxidoreductase Thermococcus sp. AM4 NC_016051.1 7418514 aldehyde:ferredoxin oxidoreductase Thermococcus sp. AM4 NC_016051.1 7420292 aldehyde:ferredoxin oxidoreductase Thermodesulfovibrio yellowstonii NC_011296.1 6941429 DSM 11347 aldehyde:ferredoxin oxidoreductase Thermodesulfovibrio yellowstonii NC_011296.1 6943174 DSM 11347 aldehyde:ferredoxin oxidoreductase Thermodesulfovibrio yellowstonii NC_011296.1 6941905 DSM 11347 aldehyde:ferredoxin oxidoreductase Thermofilum pendens Hrk 5 NC_008698.1 4602054 aldehyde:ferredoxin oxidoreductase Thermofilum pendens Hrk 5 NC_008698.1 4601386 aldehyde:ferredoxin oxidoreductase Thermofilum pendens Hrk 5 NC_008698.1 4600878 aldehyde:ferredoxin oxidoreductase Thermofilum pendens Hrk 5 NC_008698.1 4600730 aldehyde:ferredoxin oxidoreductase Thermofilum sp. 1910b NC_022093.1 16572780 aldehyde:ferredoxin oxidoreductase Thermofilum sp. 1910b NC_022093.1 16572926 aldehyde:ferredoxin oxidoreductase Thermofilum sp. 1910b NC_022093.1 16573009 aldehyde:ferredoxin oxidoreductase Thermofilum sp. 1910b NC_022093.1 16574342 aldehyde:ferredoxin oxidoreductase Thermogladius cellulolyticus 1633 NC_017954.1 13012904 aldehyde:ferredoxin oxidoreductase Thermoplasma acidophilum DSM NC_002578.1 1456355 1728 aldehyde:ferredoxin oxidoreductase Thermoplasma acidophilum DSM NC_002578.1 1456646 1728 aldehyde:ferredoxin oxidoreductase Thermoplasma vokanium GSS1 NC_002689.2 1441901 aldehyde:ferredoxin oxidoreductase Thermoplasma vokanium GSS1 NC_002689.2 1441379 aldehyde:ferredoxin oxidoreductase Thermoproteus tenax Kra 1 NC_016070.1 11262174 aldehyde:ferredoxin oxidoreductase Thermoproteus tenax Kra 1 NC_016070.1 11262275 aldehyde:ferredoxin oxidoreductase Thermoproteus tenax Kra 1 NC_016070.1 11262652 aldehyde:ferredoxin oxidoreductase Thermoproteus tenax Kra 1 NC_016070.1 11262926 aldehyde:ferredoxin oxidoreductase Thermoproteus uzoniensis 768-20 NC_015315.1 10361668 aldehyde:ferredoxin oxidoreductase Thermoproteus uzoniensis 768-20 NC_015315.1 10361250 aldehyde:ferredoxin oxidoreductase Thermoproteus uzoniensis 768-20 NC_015315.1 10360972 aldehyde:ferredoxin oxidoreductase Thermosphaera aggregans DSM NC_014160.1 9165115 11486 aldehyde:ferredoxin oxidoreductase Thermosphaera aggregans DSM NC_014160.1 9165462 11486 aldehyde:ferredoxin oxidoreductase Thermus thermophilus HB8 NC_006461.1 3168554 aldehyde:ferredoxin oxidoreductase Thermus thermophilus HB8 NC_006461.1 3168612 aldehyde:ferredoxin oxidoreductase Vukanisaeta disfributa DSM 14429 NC_014537.1 9753145 aldehyde:ferredoxin oxidoreductase Vukanisaeta disfributa DSM 14429 NC_014537.1 9750947 aldehyde:ferredoxin oxidoreductase Vukanisaeta disfributa DSM 14429 NC_014537.1 9750989 aldehyde:ferredoxin oxidoreductase Vukanisaeta disfributa DSM 14429 NC_014537.1 9753486 aldehyde:ferredoxin oxidoreductase Vukanisaeta disfributa DSM 14429 NC_014537.1 9751414 aldehyde:ferredoxin oxidoreductase Vukanisaeta moutnovskia 768-28 NC_015151.1 10288238 aldehyde:ferredoxin oxidoreductase Vukanisaeta moutnovskia 768-28 NC_015151.1 10288894 aldehyde:ferredoxin oxidoreductase Vukanisaeta moutnovskia 768-28 NC_015151.1 10288574 aldehyde:ferredoxin oxidoreductase Vukanisaeta moutnovskia 768-28 NC_015151.1 10288827 aldehyde:ferredoxin oxidoreductase Vukanisaeta moutnovskia 768-28 NC_015151.1 10288607 aldehyde:ferredoxin oxidoreductase Vukanisaeta moutnovskia 768-28 NC_015151.1 10288523 aldehyde:ferredoxin oxidoreductase Vukanisaeta moutnovskia 768-28 NC_015151.1 10288815

[0072] AOR catalyzes the reaction of an acid and reduced ferredoxin to form an aldehyde and oxidized ferredoxin. In acetogens, this reaction can be coupled to oxidation CO (via CO dehydrogenase, EC 1.2.7.4) or hydrogen (via ferredoxin-dependent hydrogenase, EC 1.12.7.2 or 1.12.1.4) that both yield reduced ferredoxin (Kopke, Curr Opin Biotechnol 22: 320-325, 2011; Kopke, PNAS USA, 107: 13087-13092, 2010). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step. However, overexpression of endogenous AOR or introduction of an exogenous AOR in Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei may be desirable to enhance product yields. Alternatively, exogenous AOR may be introduced into a microorganism that does not natively comprise AOR, e.g., E. coli. In particular, the co-expression of Ptb-Buk and AOR (and, optionally, Adh) may enable such a microorganism to produce new non-native products.

[0073] Step 17 shows the conversion of 3-hydroxybutyrylaldehyde to 1,3-butanediol. This step may be catalyzed by alcohol dehydrogenase (EC 1.1.1.1. or 1.1.1.2.). Alcohol dehydrogenase can convert an aldehyde and NAD(P)H to an alcohol and NAD(P). The alcohol dehydrogenase may be, for example, Adh from Clostridium autoethanogenum (AGY76060.1) (SEQ ID NO: 67), Clostridium ljungdahlii (ADK17019.1) (SEQ ID NO: 68), or Clostridium ragsdalei, BdhB from Clostridium acetobutylicum (NP_349891.1) (SEQ ID NO: 69), Bdh from Clostridium beijerinckii (WP_041897187.1) (SEQ ID NO: 70), Bdh1 from Clostridium ljungdahlii (YP_003780648.1) (SEQ ID NO: 71), Bdh1 from Clostridium autoethanogenum (AGY76060.1) (SEQ ID NO: 72), Bdh2 from Clostridium ljungdahlii (YP_003782121.1) (SEQ ID NO: 73), Bdh2 from Clostridium autoethanogenum (AGY74784.1) (SEQ ID NO: 74), AdhE1 from Clostridium acetobutylicum (NP_149325.1) (SEQ ID NO: 75), AdhE2 from Clostridium acetobutylicum (NP_149199.1) (SEQ ID NO: 76), AdhE from Clostridium beijerinckii (WP_041893626.1) (SEQ ID NO: 77), AdhE1 from Clostridium autoethanogenum (WP_023163372.1) (SEQ ID NO: 78), or AdhE2 from Clostridium autoethanogenum (WP_023163373.1) (SEQ ID NO: 79). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step. However, overexpression of endogenous alcohol dehydrogenase or introduction of an exogenous alcohol dehydrogenase in Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei may be desirable to enhance product yields. Escherichia coli likely does not have native activity for this step.

[0074] Step 18 shows the conversion of 3-hydroxybutyryl-CoA to 3-hydroxybutyrylaldehyde. This step may be catalyzed by butyraldehyde dehydrogenase (EC 1.2.1.57). The butyraldehyde dehydrogenase may be, for example, Bld from Clostridium saccharoperbutylacetonicum (AAP42563.1) (SEQ ID NO: 80). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0075] Step 19 shows the conversion of 3-hydroxybutyryl-CoA to 2-hydroxyisobutyryl-CoA. This step may be catalyzed by 2-hydroxyisobutyryl-CoA mutase (EC 5.4.99.-). The 2-hydroxyisobutyryl-CoA mutase may be, for example, HcmAB from Aquincola tertiaricarbonis (AFK77668.1, large subunit; AFK77665.1, small subunit) (SEQ ID NOs: 81 and 82, respectively) or HcmAB from Kyrpidia tusciae (WP_013074530.1, large subunit; WP_013074531.1, small subunit) (SEQ ID NOs: 83 and 84, respectively). Chaperone MeaB (AFK77667.1, Aquincola tertiaricarbonis; WP_013074529.1, Kyrpidia tusciae) (SEQ ID NOs: 85 and 86, respectively) has been described to improve activity of HcmAB by reactivating HcmAB, although MeaB is not required for HcmAB function (Yaneva, J Biol Chem, 287: 15502-15511, 2012). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0076] Step 20 shows the conversion of 2-hydroxyisobutyryl-CoA to 2-hydroxyisobutyrate. This step may be catalyzed by phosphate butyryltransferase (EC 2.3.1.19)+butyrate kinase (EC 2.7.2.7). Exemplary sources for phosphate butyryltransferase and butyrate kinase are described elsewhere in this application. Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0077] Step 21 shows the conversion of acetyl-CoA to succinyl-CoA. This step encompasses a number of enzymatic reactions involved in the reductive TCA pathway, which is natively present in many bacteria, including Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (and Escherichia coli) (Brown, Biotechnol Biofuels, 7: 40, 2014; U.S. Pat. No. 9,297,026). Enzymes involved in the conversion of acetyl-CoA to succinyl-CoA may include pyruvate:ferredoxin oxidoreductase (PFOR) (EC 1.2.7.1), pyruvate carboxylase (PYC) (EC 6.4.1.1), malic enzyme/malate dehydrogenase (EC 1.1.1.38, EC 1.1.1.40), pyruvate phosphate dikinase (PPDK) (EC:2.7.9.1), PEP carboxykinase (PCK) (EC 4.1.1.49), fumarate hydratase/fumerase (EC 4.2.1.2), fumarate reductase (EC 1.3.5.1)/succinate dehydrogenase (EC 1.3.5.4), and succinyl-CoA synthetase (EC 6.2.1.5). The pyruvate:ferredoxin oxidoreductase may be, for example, from Clostridium autoethanogenum (AGY75153, AGY77232) or Escherichia coli (NP_415896). The pyruvate carboxylase may be, for example, from Clostridium autoethanogenum (AGY75817). The malic enzyme/malate dehydrogenase may be, for example, from Clostridium autoethanogenum (AGY76687) or Escherichia coli (NP_416714, NP_417703). The pyruvate phosphate dikinase (PPDK) may be, for example, from Clostridium autoethanogenum (AGY76274, AGY77114). The PEP carboxykinase (PCK) may be, for example, from Clostridium autoethanogenum (AGY76928) or Escherichia coli (NP_417862). The fumarate hydratase/fumerase may be, for example, from Clostridium autoethanogenum (AGY76121, AGY76122) or Escherichia coli (NP_416128, NP_416129, NP_418546). The fumarate reductase/succinate dehydrogenase may be, for example, from Clostridium autoethanogenum (AGY74573, AGY74575, AGY75257, AGY77166) or Escherichia coli (NP_415249, NP_415250, NP_415251, NP_415252, NP_418575, NP_418576, NP_418577, NP_418578). The succinyl-CoA synthetase may be, for example, from Escherichia coli (NP_415256, NP_415257).

[0078] Step 22 shows shows the conversion of acetyl-CoA and succinyl-CoA to 3-oxo-adipyl-CoA. This step may be catalyzed by .beta.-ketoadipyl-CoA thiolase (EC 2.3.1.16). The ketoisovalerate oxidoreductase may be, for example, PaaJ from Escherichia coli (WP_001206190.1). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0079] Step 23 shows the conversion of 3-oxo-adipyl-CoA to 3-hydroxyadipyl-CoA. This step may be catalyzed by 3-hydroxybutyryl-CoA dehydrogenase (EC 1.1.1.157) or acetoacetyl-CoA hydratase (EC 4.2.1.119). The 3-hydroxybutyryl-CoA dehydrogenase or acetoacetyl-CoA hydratase may be, for example, Hbd from Clostridium beijerinckii (WP_011967675.1) (SEQ ID NO: 55), Hbd from Clostridium acetobutylicum (NP_349314.1) (SEQ ID NO: 56), Hbd1 from Clostridium kluyveri (WP_011989027.1) (SEQ ID NO: 57), or PaaH1 from Cupriavidus necator (WP_010814882.1). Of note, PhaB is R-specific and Hbd is S-specific. Additionally, Hbd1 from Clostridium kluyveri is NADPH-dependent and Hbd from Clostridium acetobutylicum and Clostridium beijerinckii are NADH-dependent. Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0080] Step 24 shows the conversion of 3-hydroxyadipyl-CoA to 2,3-dehydroadipyl-CoA. This step may be catalyzed by an enoyl-CoA hydratase (EC: 4.2.1.17) or enoyl-CoA reductase (EC: 1.3.1.38). The enoyl-CoA hydratase or enoyl-CoA reductase may be, for example, Crt from C. acetobutylicum (NP_349318.1) or PhaJ from Aeromonas caviae (O32472) (Seq. ID No. 52). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0081] Step 25 shows the conversion of 2,3-dehydroadipyl-CoA to adipyl-CoA. This step may be catalyzed by trans-2-enoyl-CoA reductase (EC 1.3.8.1, EC 1.3.1.86, EC 1.3.1.85, EC 1.3.1.44). The trans-2-enoyl-CoA reductase may be, for example, Bcd from C. acetobutylicum (NP_349317.1) that forms a complex with electron flavoproteins EtfAB (NP_349315, NP_349316), Ccr from Streptomyces collinus (AAA92890), Ccr from Rhodobacter sphaeroides (YP_354044.1), Ter from Treponema denticola (NP_971211.1), or Ter from Euglena gracilis (AY741582.1). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0082] Step 26 shows the conversion of adipyl-CoA to adipic acid. This step may be catalyzed by phosphate butyryltransferase (EC 2.3.1.19)+butyrate kinase (EC 2.7.2.7). Exemplary sources for phosphate butyryltransferase and butyrate kinase are described elsewhere in this application. Native enzymes in Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (or Escherichia coli), such as thioesterases from Clostridium autoethanogenum, may catalyze this step and result in the production of some amount of downstream products. However, introduction of an exogenous enzyme or overexpression of an endogenous enzyme may be required to produce downstream products at desirable levels. Additionally, in certain embodiments, a disruptive mutation may be introduced to an endogenous enzyme, such as an endogenous thioesterase, to reduce or eliminate competition with introduced Ptb-Buk.

[0083] Step 27 shows the conversion of shows the conversion of 3-hydroxbutyryl-CoA to crotonyl-CoA. This step may be catalyzed by a crotonyl-CoA hydratase (crotonase) (EC 4.2.1.17) or crotonyl-CoA reductase (EC 1.3.1.38). The crotonyl-CoA hydratase (crotonase) or crotonyl-CoA reductase may be, for example, Crt from C. acetobutylicum (NP_349318.1) (SEQ ID NO: 52) or PhaJ from Aeromonas caviae (O32472). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0084] Step 28 shows the conversion of crotonyl-CoA to crotonate. This step may be catalyzed by phosphate butyryltransferase (EC 2.3.1.19)+butyrate kinase (EC 2.7.2.7). Exemplary sources for phosphate butyryltransferase and butyrate kinase are described elsewhere in this application. Native enzymes in Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (or Escherichia coli), such as thioesterases from Clostridium autoethanogenum, may catalyze this step and result in the production of some amount of downstream products. However, introduction of an exogenous enzyme or overexpression of an endogenous enzyme may be required to produce downstream products at desirable levels. Additionally, in certain embodiments, a disruptive mutation may be introduced to an endogenous enzyme, such as an endogenous thioesterase, to reduce or eliminate competition with introduced Ptb-Buk.

[0085] Step 29 shows the conversion of crotonate to crotonaldehyde. This step may be catalyzed by aldehyde:ferredoxin oxidoreductase (EC 1.2.7.5). Exemplary sources for aldehyde:ferredoxin oxidoreductases are described elsewhere in this application. AOR catalyzes the reaction of an acid and reduced ferredoxin to form an aldehyde and oxidized ferredoxin. In acetogens, this reaction can be coupled to oxidation CO (via CO dehydrogenase, EC 1.2.7.4) or hydrogen (via ferredoxin-dependent hydrogenase, EC 1.12.7.2 or 1.12.1.4) that both yield reduced ferredoxin (Kopke, Curr Opin Biotechnol 22: 320-325, 2011; Kopke, PNAS USA, 107: 13087-13092, 2010). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step. However, overexpression of endogenous AOR or introduction of an exogenous AOR in Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei may be desirable to enhance product yields. AOR of Pyrococcus furiosus has been demonstrated activity converting crotonaldehyde and crotonate (Loes, J Bacteriol, 187: 7056-7061, 2005). Alternatively, exogenous AOR may be introduced into a microorganism that does not natively comprise AOR, e.g., E. coli. In particular, the co-expression of Ptb-Buk and AOR (and, optionally, Adh) may enable such a microorganism to produce new non-native products.

[0086] Step 30 shows the conversion of crotonaldehyde to 2-buten-1-ol. This step may be catalyzed by alcohol dehydrogenase (EC 1.1.1.1. or 1.1.1.2.). Alcohol dehydrogenase can convert an aldehyde and NAD(P)H to an alcohol and NAD(P). The alcohol dehydrogenase may be, for example, Adh from Clostridium autoethanogenum (AGY76060.1) (SEQ ID NO: 67), Clostridium ljungdahlii (ADK17019.1) (SEQ ID NO: 68), or Clostridium ragsdalei, BdhB from Clostridium acetobutylicum (NP_349891.1) (SEQ ID NO: 69), Bdh from Clostridium beijerinckii (WP_041897187.1) (SEQ ID NO: 70), Bdh1 from Clostridium ljungdahlii (YP_003780648.1) (SEQ ID NO: 71), Bdh1 from Clostridium autoethanogenum (AGY76060.1) (SEQ ID NO: 72), Bdh2 from Clostridium ljungdahlii (YP_003782121.1) (SEQ ID NO: 73), Bdh2 from Clostridium autoethanogenum (AGY74784.1) (SEQ ID NO: 74), AdhE1 from Clostridium acetobutylicum (NP_149325.1) (SEQ ID NO: 75), AdhE2 from Clostridium acetobutylicum (NP_149199.1) (SEQ ID NO: 76), AdhE from Clostridium beijerinckii (WP_041893626.1) (SEQ ID NO: 77), AdhE1 from Clostridium autoethanogenum (WP_023163372.1) (SEQ ID NO: 78), or AdhE2 from Clostridium autoethanogenum (WP_023163373.1) (SEQ ID NO: 79). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step. However, overexpression of endogenous alcohol dehydrogenase or introduction of an exogenous alcohol dehydrogenase in Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei may be desirable to enhance product yields. Escherichia coli likely does not have native activity for this step.

[0087] Step 31 shows the conversion of crotonyl-CoA to butyryl-CoA. This step may be catalyzed by butyryl-CoA dehydrogenase or trans-2-enoyl-CoA reductase (EC 1.3.8.1, EC 1.3.1.86, EC 1.3.1.85, EC 1.3.1.44). The butyryl-CoA dehydrogenase or trans-2-enoyl-CoA reductase may be, for example, Bcd from C. acetobutylicum (NP_349317.1) that forms a complex with electron flavoproteins EtfAB (NP_349315, NP_349316), Ccr from Streptomyces collinus (AAA92890), Ccr from Rhodobacter sphaeroides (YP_354044.1), Ter from Treponema denticola (NP_971211.1), or Ter from Euglena gracilis (AY741582.1). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0088] Step 32 shows the conversion of butyryl-CoA to acetobutyryl-CoA. This step may be catalyzed by thiolase or acyl-CoA acetyltransferase (EC 2.3.1.9). The thiolase may be, for example, ThlA from Clostridium acetobutylicum (WP_010966157.1) (SEQ ID NO: 1), ThlA1 from Clostridium kluyveri (EDK35681), ThlA2 from Clostridium kluyveri (EDK35682), ThlA3 from Clostridium kluyveri (EDK35683), PhaA from Cupriavidus necator (WP_013956452.1) (SEQ ID NO: 2), BktB from Cupriavidus necator (WP_011615089.1) (SEQ ID NO: 3), or AtoB from Escherichia coli (NP_416728.1) (SEQ ID NO: 4). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli has native activity for this step.

[0089] Step 33 shows the conversion of acetobutyryl-CoA to acetobutyrate. This step may be catalyzed by phosphate butyryltransferase (EC 2.3.1.19)+butyrate kinase (EC 2.7.2.7). Exemplary sources for phosphate butyryltransferase and butyrate kinase are described elsewhere in this application. Native enzymes in Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (or Escherichia coli), such as thioesterases from Clostridium autoethanogenum, may catalyze this step and result in the production of some amount of downstream products. However, introduction of an exogenous enzyme or overexpression of an endogenous enzyme may be required to produce downstream products at desirable levels. Additionally, in certain embodiments, a disruptive mutation may be introduced to an endogenous enzyme, such as an endogenous thioesterase, to reduce or eliminate competition with introduced Ptb-Buk.

[0090] Step 34 shows the conversion of acetobutyrate to acetylacetone. This step may be catalyzed by an acetoacetate decarboxylase (EC 4.1.1.4). The acetoacetate decarboxylase may be, for example, Adc from Clostridium beijerinckii (WP_012059998.1) (SEQ ID NO: 14). This step may also be catalyzed by an alpha-ketoisovalerate decarboxylase (EC 4.1.1.74). The alpha-ketoisovalerate decarboxylase may be, for example, KivD from Lactococcus lactis (SEQ ID NO: 15). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Additionally, Escherichia coli does not have known native activity for this step. Rarely, conversion of acetoacetate to acetone may occur spontaneously. However, spontaneous conversion is highly inefficient and unlikely to result in the production of downstream products at desirable levels.

[0091] Step 35 shows the conversion of acetylacetone to 3-methyl-2-butanol. This step may be catalyzed by a primary:secondary alcohol dehydrogenase (EC 1.1.1.2). The primary:secondary alcohol dehydrogenase may be, for example, SecAdh from Clostridium autoethanogenum (AGY74782.1) (SEQ ID NO: 16), SecAdh from Clostridium ljungdahlii (ADK15544.1) (SEQ ID NO: 17), SecAdh from Clostridium ragsdalei (WP_013239134.1) (SEQ ID NO: 18), or SecAdh from Clostridium beijerinckii (WP_026889046.1) (SEQ ID NO: 19). This step may also be catalyzed by a primary:secondary alcohol dehydrogenase (EC 1.1.1.80), such as SecAdh from Thermoanaerobacter brokii (3FSR_A) (SEQ ID NO: 20). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step (Kopke, Appl Environ Microbiol, 80: 3394-3403, 2014). However, Escherichia coli does not have known native activity for this step. Knocking down or knocking out this enzyme in Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei results in the production and accumulation of acetylacetone rather than 3-methyl-2-butanol (WO 2015/085015).

[0092] Step 36 shows the conversion of acetobutyryl-CoA to 3-hydroxyhexanoyl-CoA. This step may be catalyzed by 3-hydroxybutyryl-CoA dehydrogenase (EC 1.1.1.157) or acetoacetyl-CoA hydratase (EC 4.2.1.119). The 3-hydroxybutyryl-CoA dehydrogenase or acetoacetyl-CoA hydratase may be, for example, Hbd from Clostridium beijerinckii (WP_011967675.1) (SEQ ID NO: 55), Hbd from Clostridium acetobutylicum (NP_349314.1) (SEQ ID NO: 56), Hbd1 from Clostridium kluyveri (WP_011989027.1) (SEQ ID NO: 57), Hbd2 from Clostridium kluyveri (EDK34807), or PaaH1 from Cupriavidus necator (WP_010814882.1). Of note, PhaB is R-specific and Hbd is S-specific. Additionally, Hbd1 from Clostridium kluyveri is NADPH-dependent and Hbd from Clostridium acetobutylicum and Clostridium beijerinckii are NADH-dependent. Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0093] Step 37 shows the conversion of 3-hydroxyhexanoyl-CoA to 3-hydroxyhexanoate. This step may be catalyzed by phosphate butyryltransferase (EC 2.3.1.19)+butyrate kinase (EC 2.7.2.7). Exemplary sources for phosphate butyryltransferase and butyrate kinase are described elsewhere in this application. Native enzymes in Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (or Escherichia coli), such as thioesterases from Clostridium autoethanogenum, may catalyze this step and result in the production of some amount of downstream products. However, introduction of an exogenous enzyme or overexpression of an endogenous enzyme may be required to produce downstream products at desirable levels. Additionally, in certain embodiments, a disruptive mutation may be introduced to an endogenous enzyme, such as an endogenous thioesterase, to reduce or eliminate competition with introduced Ptb-Buk.

[0094] Step 38 shows the conversion of 3-hydroxyhexanoate to 1,3-hexaldehyde. This step may be catalyzed by aldehyde:ferredoxin oxidoreductase (EC 1.2.7.5). Exemplary sources for aldehyde:ferredoxin oxidoreductases are described elsewhere in this application. AOR catalyzes the reaction of an acid and reduced ferredoxin to form an aldehyde and oxidized ferredoxin. In acetogens, this reaction can be coupled to oxidation CO (via CO dehydrogenase, EC 1.2.7.4) or hydrogen (via ferredoxin-dependent hydrogenase, EC 1.12.7.2 or 1.12.1.4) that both yield reduced ferredoxin (Kopke, Curr Opin Biotechnol 22: 320-325, 2011; Kopke, PNAS USA, 107: 13087-13092, 2010). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step. However, overexpression of endogenous AOR or introduction of an exogenous AOR in Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei may be desirable to enhance product yields. Alternatively, exogenous AOR may be introduced into a microorganism that does not natively comprise AOR, e.g., E. coli. In particular, the co-expression of Ptb-Buk and AOR (and, optionally, Adh) may enable such a microorganism to produce new non-native products.

[0095] Step 39 shows the conversion of 1,3-hexaldehyde to 1,3-hexanediol. This step may be catalyzed by alcohol dehydrogenase (EC 1.1.1.1. or 1.1.1.2.). Alcohol dehydrogenase can convert an aldehyde and NAD(P)H to an alcohol and NAD(P). The alcohol dehydrogenase may be, for example, Adh from Clostridium autoethanogenum (AGY76060.1) (SEQ ID NO: 67), Clostridium ljungdahlii (ADK17019.1) (SEQ ID NO: 68), or Clostridium ragsdalei, BdhB from Clostridium acetobutylicum (NP_349891.1) (SEQ ID NO: 69), Bdh from Clostridium beijerinckii (WP_041897187.1) (SEQ ID NO: 70), Bdh1 from Clostridium ljungdahlii (YP_003780648.1) (SEQ ID NO: 71), Bdh1 from Clostridium autoethanogenum (AGY76060.1) (SEQ ID NO: 72), Bdh2 from Clostridium ljungdahlii (YP_003782121.1) (SEQ ID NO: 73), Bdh2 from Clostridium autoethanogenum (AGY74784.1) (SEQ ID NO: 74), AdhE1 from Clostridium acetobutylicum (NP_149325.1) (SEQ ID NO: 75), AdhE2 from Clostridium acetobutylicum (NP_149199.1) (SEQ ID NO: 76), AdhE from Clostridium beijerinckii (WP_041893626.1) (SEQ ID NO: 77), AdhE1 from Clostridium autoethanogenum (WP_023163372.1) (SEQ ID NO: 78), or AdhE2 from Clostridium autoethanogenum (WP_023163373.1) (SEQ ID NO: 79). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step. However, overexpression of endogenous alcohol dehydrogenase or introduction of an exogenous alcohol dehydrogenase in Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei may be desirable to enhance product yields. Escherichia coli likely does not have native activity for this step.

[0096] Step 40 shows the conversion of acetoacetyl-CoA to 3-hydroxy-3-methylglutaryl-CoA. This step may be catalyzed by a hydroxymethylglutaryl-CoA synthase (HMG-CoA synthase) (EC 2.3.3.10). HMG-CoA synthases are widespread across many genera and kingdoms of life and include, e.g., MvaS from Staphylococcus aureus (WP_053014863.1), ERG13 from Saccharomyces cerevisiae (NP_013580.1), HMGCS2 from Mus musculus (NP_032282.2), and many other members of the EC 2.3.3.10 group of enzymes. Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0097] Step 41 shows the conversion of 3-hydroxy-3-methylglutanoyl-CoA to 3-methylgluconyl-CoA. This step may be catalyzed by a 3-hydroxybutyryl-CoA dehydratase (EC 4.2.1.55). The 3-hydroxybutyryl-CoA dehydratase may be, for example, LiuC from Myxococcus xanthus (WP_011553770.1). This step may also be catalyzed by a short-chain-enoyl-CoA hydratase (EC 4.2.1.150) or an enoyl-CoA hydratase (EC 4.2.1.17). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0098] Step 42 shows the conversion of 3-methylgluconyl-CoA to 2-methylcrotonyl-CoA. This step may be catalyzed by a methylcrotonyl-CoA decarboxylase (with high structural similarity to glutaconate-CoA transferase (EC 2.8.3.12)), e.g., aibAB from Myxococcus xanthus (WP_011554267.1 and WP_011554268.1). This step may also be catalyzed by a methylcrotonoyl-CoA carboxylase (EC 6.4.1.4), e.g., LiuDB from Pseudomonas aeruginosa (NP_250702.1 and NP_250704.1) or MCCA and MCCB from Arabidopsis thaliana (NP_563674.1 and NP_567950.1). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0099] Step 43 shows the conversion of methylcrotonyl-CoA to isovaleryl-CoA. This step may be catalyzed by an oxidoreductase, zinc-binding dehydrogenase. This oxidoreductase, zinc-binding dehydrogenase may be, for example, AibC from Myxococcus xanthus (WP_011554269.1). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not have known native activity for this step. Escherichia coli does not have known native activity for this step.

[0100] Step 44 shows the conversion of isovaleryl-CoA to isovalerate. This step may be catalyzed by CoA-transferase (i.e., acetyl-CoA:acetoacetyl-CoA transferase) (EC 2.8.3.9). The CoA-transferase may be, for example, CtfAB, a heterodimer comprising subunits CtfA and CtfB, from Clostridium beijerinckii (CtfA, WP_012059996.1) (SEQ ID NO: 5) (CtfB, WP_012059997.1) (SEQ ID NO: 6). This step may also be catalyzed by thioesterase (EC 3.1.2.20). The thioesterase may be, for example, TesB from Escherichia coli (NP_414986.1) (SEQ ID NO: 7). This step may also be catalyzed by a putative thioesterase, e.g., from Clostridium autoethanogenum or Clostridium ljungdahlii. In particular, three putative thioesterases have been identified in Clostridium autoethanogenum: (1) "thioesterase 1" (AGY74947.1; annotated as palmitoyl-CoA hydrolase; SEQ ID NO: 8), (2) "thioesterase 2" (AGY75747.1; annotated as 4-hydroxybenzoyl-CoA thioesterase; SEQ ID NO: 9), and (3) "thioesterase 3" (AGY75999.1; annotated as putative thioesterase; SEQ ID NO: 10). Three putative thioesterases have also been identified in Clostridium ljungdahlii: (1) "thioesterase 1" (ADK15695.1; annotated as predicted acyl-CoA thioesterase 1; SEQ ID NO: 11), (2) "thioesterase 2" (ADK16655.1; annotated as predicted thioesterase; SEQ ID NO: 12), and (3) "thioesterase 3" (ADK16959.1; annotated as predicted thioesterase; SEQ ID NO: 13). This step may also be catalyzed by phosphate butyryltransferase (EC 2.3.1.19)+butyrate kinase (EC 2.7.2.7). Exemplary sources for phosphate butyryltransferase and butyrate kinase are described elsewhere in this application. Native enzymes in Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (or Escherichia coli), such as thioesterases from Clostridium autoethanogenum, may catalyze this step and result in the production of some amount of downstream products. However, introduction of an exogenous enzyme or overexpression of an endogenous enzyme may be required to produce downstream products at desirable levels. Additionally, in certain embodiments, a disruptive mutation may be introduced to an endogenous enzyme, such as an endogenous thioesterase, to reduce or eliminate competition with introduced Ptb-Buk.

[0101] Step 45 shows the conversion of isovalerate to isovaleraldehyde. This step may be catalyzed by aldehyde:ferredoxin oxidoreductase (EC 1.2.7.5). The aldehyde:ferredoxin oxidoreductase (AOR) may be, for example, AOR from Clostridium autoethanogenum (WP_013238665.1; WP_013238675.1) (SEQ ID NOs: 63 and 64, respectively) or AOR from Clostridium ljungdahlii (ADK15073.1; ADK15083.1) (SEQ ID NOs: 65 and 66, respectively). Further exemplary sources for aldehyde:ferredoxin oxidoreductases are described elsewhere in this application. Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step. However, overexpression of endogenous AOR or introduction of an exogenous AOR in Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei may be desirable to enhance product yields. Alternatively, exogenous AOR may be introduced into a microorganism that does not natively comprise AOR, e.g., E. coli. In particular, the co-expression of Ptb-Buk and AOR (and, optionally, Adh) may enable such a microorganism to produce new non-native products.

[0102] Step 46 shows the conversion of isovaleraldehyde to isoamyl alcohol. This step may be catalyzed by alcohol dehydrogenase (EC 1.1.1.1. or 1.1.1.2.). Alcohol dehydrogenase can convert an aldehyde and NAD(P)H to an alcohol and NAD(P). The alcohol dehydrogenase may be, for example, Adh from Clostridium autoethanogenum (AGY76060.1) (SEQ ID NO: 67), Clostridium ljungdahlii (ADK17019.1) (SEQ ID NO: 68), or Clostridium ragsdalei, BdhB from Clostridium acetobutylicum (NP_349891.1) (SEQ ID NO: 69), Bdh from Clostridium beijerinckii (WP_041897187.1) (SEQ ID NO: 70), Bdh1 from Clostridium ljungdahlii (YP_003780648.1) (SEQ ID NO: 71), Bdh1 from Clostridium autoethanogenum (AGY76060.1) (SEQ ID NO: 72), Bdh2 from Clostridium ljungdahlii (YP_003782121.1) (SEQ ID NO: 73), Bdh2 from Clostridium autoethanogenum (AGY74784.1) (SEQ ID NO: 74), AdhE1 from Clostridium acetobutylicum (NP_149325.1) (SEQ ID NO: 75), AdhE2 from Clostridium acetobutylicum (NP_149199.1) (SEQ ID NO: 76), AdhE from Clostridium beijerinckii (WP_041893626.1) (SEQ ID NO: 77), AdhE1 from Clostridium autoethanogenum (WP_023163372.1) (SEQ ID NO: 78), or AdhE2 from Clostridium autoethanogenum (WP_023163373.1) (SEQ ID NO: 79). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei have native activity for this step. However, overexpression of endogenous alcohol dehydrogenase or introduction of an exogenous alcohol dehydrogenase in Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei may be desirable to enhance product yields. Escherichia coli likely does not have native activity for this step.

[0103] Step 47 shows the conversion of isovaleryl-CoA to isovaleraldehyde. This step may be catalyzed by butyraldehyde dehydrogenase (EC 1.2.1.57). The butyraldehyde dehydrogenase may be, for example, Bld from Clostridium saccharoperbutylacetonicum (AAP42563.1) (SEQ ID NO: 80). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei likely do not have native activity for this step. Escherichia coli does not have known native activity for this step.

Overview of Ptb-Buk

[0104] The invention provides new pathways utilizing the Ptb-Buk enzyme system. In nature, this enzyme system is found in a range of butyrate producing microorganisms, such as butyrate-producing Clostridia or Butyrivibrio. In particular, phosphate butyryltransferase (Ptb) (EC 2.3.1.19) natively catalyzes the reaction of butanoyl-CoA+phosphate to form CoA+butanoyl phosphate and butyrate kinase (Buk) (EC 2.7.2.7) natively catalyzes the reaction of butanoyl phosphate and ADP to form butyrate (butanoate) and ATP. Accordingly, these enzymes together (Ptb-Buk) natively catalyze the conversion of butanoyl-CoA to butyrate and generate one ATP via substrate level-phosphorylation (FIG. 2). However, the inventors have discovered that Ptb is promiscuous and is capable of accepting a variety of acyl-CoAs and enoyl-CoAs as substrates, such that Ptb-Buk may be used to convert a number of acyl-CoAs and enoyl-CoAs to their corresponding acids or alkenates, respectively, while simultaneously generating ATP. It has been reported Ptb is active on a range of acyl-CoAs including acetoacetyl-CoA, in vitro (Thompson, Appl Environ Microbiol, 56: 607-613, 1990). It has not previously been shown that acetoacetyl-phosphate could be a substrate for Buk. Although Buk is known to accept a broad substrate range (Liu, Appl Microbiol Biotechnol, 53: 545-552, 2000), no activity has been shown in vivo.

[0105] Additionally, the inventors have discovered that the introduction of exogenous Ptb-Buk enables certain microorganisms to produce useful products, including acetone, isopropanol, isobutylene, 3-hydroxybutyrate, 1,3-butanediol, and 2-hydroxyisobutyrate, as well as other products such as propionate, caproate, and octonate.

[0106] New pathways that rely on Ptb-Buk offer several major advantages over other known and existing pathway routes for production of products that rely on a CoA-transferase--as in the classic Clostridial acetone-butanol-ethanol (ABE) fermentation pathway--or a thioesterase (Jones, Microbiol Rev, 50: 484-524, 1986; Matsumoto, Appl Microbiol Biotechnol, 97: 205-210, 2013; May, Metabol Eng, 15: 218-225, 2013) (FIG. 3). In particular, these new pathways (1) are not dependent on the presence or production of particular molecules, such as organic acids, e.g., butyrate or acetate, required for the CoA-transferase reaction and (2) allow for generation of ATP via substrate level phosphorylation that would not be conserved in a thioesterase or CoA-transferase reaction. The same advantages also apply when using the Ptb-Buk system for other reactions, such as the conversion of 3-hydroxybutyryl-CoA to 3-hydroxybutyrate. Thus, these new pathways have the potential to yield much higher production titers and rates by generating additional energy and producing target products without co-production of undesired byproducts, such as acetate.

[0107] Particularly on a commercial scale, it is not desirable for microorganisms to produce acetate (or other organic acids required for the CoA transferase reaction) as byproduct, since acetate diverts carbon away from target products and thus affects the efficiency and yield of target products. Additionally, acetate may be toxic to microorganisms and/or may serve as a substrate for the growth of contaminating microorganisms. Furthermore, the presence of acetate makes it more difficult to recover and separate target products and to control fermentation conditions to favor the production of target products.

[0108] ATP generation through substrate level phosphorylation can be used as a driving force for product synthesis, especially in ATP-limited systems. In particular, acetogenic bacteria are known to live on the thermodynamic edge of life (Schuchmann, Nat Rev Microbiol, 12: 809-821, 2014). As such, all acetogenic microorganisms isolated to date have been described to produce acetate (Drake, Acetogenic Prokaryotes, In: The Prokaryotes, 3.sup.rd edition, pages 354-420, New York, N.Y., Springer, 2006) since the production of acetate provides the microorganism with an option to directly generate ATP from substrate level phosphorylation via Pta (phosphotransacetylase) (EC 2.3.1.8) and Ack (acetate kinase) (EC 2.7.2.1). Although mechanisms such as membrane gradients and electro bifurcation enzymes coupled to ion or proton translocating systems, e.g., the Rnf complex (Schuchmann, Nat Rev Microbiol, 12: 809-821, 2014), conserve ATP in these microorganisms, direct ATP generation remains critical for their survival. As a result, when introducing heterologous pathways that do not allow for ATP generation, acetate is produced as a byproduct (Schiel-Bengelsdorf, FEBS Lett, 586: 2191-2198, 2012). The Ptb-Buk pathways described herein, however, provide an alternative mechanism for the microorganism to generate ATP via substrate level phosphorylation and, therefore, avoid acetate production. In particular, acetate-forming enzymes, such as Pta-Ack, that would otherwise be essential (Nagarajan, Microb Cell Factories, 12: 118, 2013) can be replaced with Ptb-Buk as an alternative means of ATP generation. Since the microorganism can then rely on ATP generation via Ptb-Buk, this system provides a driving force that ensures maximum flux through the new pathways that use Ptb-Buk. The generation of ATP may also be crucial for downstream pathways that require ATP. For example, fermentative production of isobutylene from acetone requires ATP. While the complete pathway from acetyl-CoA to isobutylene is ATP-consuming when using a CoA-transferase or a thioesterase, the pathway is energy neutral when using Ptb-Buk.

[0109] Exemplary sources for Ptb and Buk are provided. However, it should be appreciated that other suitable sources for Ptb and Buk may be available. Additionally, Ptb and Buk may be engineered to improve activity and/or genes encoding Ptb-Buk may be codon-optimized for expression in particular host microorganisms.

[0110] The phosphate butyryltransferase may be or may be derived, for example, from any of the following sources, the sequences of which are publically available:

TABLE-US-00002 Description Microorganism Accession phosphate butyryltransferase Closfridium sp. EKQ52186 phosphate butyryltransferase Closfridium sp. WP_009167896 phosphate butyryltransferase Closfridium WP_015390396 saccharoperbutylacetonicum phosphate butyryltransferase Closfridium WP_022743598 saccharobutylicum phosphate butyryltransferase Closfridium beijerinckii WP_026886639 phosphate butyryltransferase Closfridium beijerinckii WP_041893500 phosphate butyryltransferase Closfridium butyricum WP_003410761 phosphate butyryltransferase Clostridium sp. CDB14331 phosphate butyryltransferase Closfridium botulinum WP_049180512 phosphate butyryltransferase Closfridium sp. CDB74819 phosphate butyryltransferase Closfridium paraputrificum WP_027098882 phosphate butyryltransferase Closfridium sp. WP_024615655 phosphate butyryltransferase Closfridium celatum WP_005211129 phosphate butyryltransferase Clostridium baratii WP_039312969 phosphate butyryltransferase Closfridium intestinale WP_021800215 phosphate butyryltransferase Closfridium sp. WP_042402499 phosphate butyryltransferase Closfridium sp. WP_032117069 phosphate butyryltransferase Closfridium perfringens ABG85761 phosphate butyryltransferase Closfridium botulinum WP_003374233 phosphate butyryltransferase Closfridium perfringens WP_004460499 phosphate butyryltransferase Closfridium perfringens WP_003454254 phosphate butyryltransferase Closfridium perfringens WP_041707926 phosphate butyryltransferase Closfridium perfringens BAB82054 phosphate butyryltransferase Clostridium sp. WP_008681116 phosphate butyryltransferase Closfridium chauvoei WP_021876993 phosphate butyryltransferase Closfridium colicanis WP_002598839 phosphate butyryltransferase Closfridium cadaveris WP_027637778 phosphate butyryltransferase Closfridium acetobutylicum WP_010966357 phosphate butyryltransferase Closfridium pasteurianum WP_015617430 phosphate butyryltransferase Closfridium arbusti WP_010238988 phosphate butyryltransferase Closfridium pasteurianum WP_003445696 phosphate butyryltransferase Clostridium scatologenes WP_029160341 phosphate butyryltransferase Closfridium sp. WP_032120461 phosphate butyryltransferase Closfridium drakei WP_032078800 phosphate butyryltransferase Closfridium sp. WP_021281241 phosphate butyryltransferase Closfridium argentinense WP_039635970 phosphate butyryltransferase Closfridium akagii WP_026883231 phosphate butyryltransferase Closfridium sp. WP_053242611 phosphate butyryltransferase Clostridium carboxidivorans WP_007063154 phosphate butyryltransferase Closfridium sp. WP_035292411 phosphate butyryltransferase Closfridium sulfidigenes WP_035133394 phosphate butyryltransferase Closfridium tetanomorphum WP_035147564 phosphate butyryltransferase Closfridium WP_027633206 hydrogeniformans phosphate butyryltransferase Closfridium sp. WP_040212965 phosphate butyryltransferase Candidatus Clostridium WP_040327613 phosphate butyryltransferase Closfridium sp. WP_040192242 phosphate butyryltransferase Closfridium sp. WP_050606427 phosphate butyryltransferase Closfridium lundense WP_027625137 phosphate butyryltransferase Closfridium algidicarnis WP_029451333 phosphate butyryltransferase Closfridium sp. WP_035306567 phosphate butyryltransferase Closfridium acetobutylicum AAA75486 phosphate butyryltransferase Closfridium botulinum WP_025775938 phosphate butyryltransferase Closfridium botulinum WP_045541062 phosphate butyryltransferase Closfridium botulinum WP_003357252 phosphate butyryltransferase Closfridium botulinum WP_030037192 phosphate butyryltransferase Closfridium bornimense WP_044039341 phosphate butyryltransferase Closfridium botulinum WP_041346554 phosphate butyryltransferase Closfridium sp. WP_053468896 phosphate butyryltransferase Closfridiales bacterium WP_034572261 phosphate butyryltransferase Closfridium tetani WP_023439553 phosphate butyryltransferase Closfridiales bacterium ERI95297 phosphate butyryltransferase Closfridium botulinum WP_047403027 phosphate butyryltransferase Closfridium tetani WP_011100667 phosphate butyryltransferase Closfridium tetani WP_035111554 phosphate butyryltransferase Closfridium senegalense WP_010295062 phosphate butyryltransferase Caloramator sp. WP_027307587 phosphate butyryltransferase Thermobrachium celere WP_018661036 phosphate butyryltransferase Closfridium cellulovorans WP_010073683 phosphate butyryltransferase Coprococcus comes CDB84786 phosphate butyryltransferase Coprococcus comes WP_008371924 phosphate butyryltransferase Eubacterium sp. CCZ03827 phosphate butyryltransferase Closfridium sp. CCZ05442 phosphate butyryltransferase Caloramator australicus WP_008907395 phosphate butyryltransferase Closfridium sp. CCY59505 phosphate butyryltransferase Lachnospiraceae bacterium WP_035626368 phosphate butyryltransferase Lachnospiraceae bacterium WP_027440767 phosphate butyryltransferase Fervidicella metallireducens WP_035381340 phosphate butyryltransferase Closfridium sp. CCX89274 phosphate butyryltransferase Eubacterium xylanophilum WP_026834525 phosphate butyryltransferase Roseburia sp. CDF44203 phosphate butyryltransferase Butyrivibrio crossotus WP_005600912 phosphate butyryltransferase Lachnospiraceae bacterium WP_027117626 phosphate butyryltransferase Closfridium sp. CDA68345 phosphate butyryltransferase Peptosfreptococcaceae WP_026899905 bacterium phosphate butyryltransferase Butyrivibrio crossotus CCY77124 phosphate butyryltransferase Closfridium sp. CDE44914 phosphate butyryltransferase Coprococcus eutactus WP_004853197 phosphate butyryltransferase Firmicutes bacterium CCY23248 phosphate butyryltransferase Lachnospiraceae bacterium WP_027111007 phosphate butyryltransferase Lachnospiraceae bacterium WP_016293387 phosphate butyryltransferase Closfridium sp. WP_046822491

[0111] In a preferred embodiment, the phosphate butyryltransferase is Ptb from Clostridium acetobutylicum (WP_010966357; SEQ ID NO: 87) or Clostridium beijerinckii (WP_026886639; SEQ ID NO: 88) (WP_041893500; SEQ ID NO: 89). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not natively contain phosphate butyryltransferase.

[0112] The butyrate kinase may be or may be derived, for example, from any of the following sources, the sequences of which are publically available:

TABLE-US-00003 Description Microorganism Accession butyrate kinase Closfridium pasteurianum ALB48406 butyrate kinase Closfridium sp. CDB14330 butyrate kinase Closfridium sp. CDB74820 butyrate kinase Closfridium sp. EKQ52187 butyrate kinase Closfridium perfringens Q0SQKO butyrate kinase Closfridium sp. WP_002582660 butyrate kinase Closfridium colicanis WP_002598838 butyrate kinase Closfridium botuhnum WP_003371719 butyrate kinase Closfridium perfringens WP_003454444 butyrate kinase Closfridium perfringens WP_004459180 butyrate kinase Closfridium celatum WP_005211128 butyrate kinase Closfridium sp. WP_008681112 butyrate kinase Closfridium sp. WP_008681114 butyrate kinase Closfridium sp. WP_009167897 butyrate kinase Closfridium perfringens WP_011010889 butyrate kinase Closfridium beijerinckii WP_011967556 butyrate kinase Closfridium botuhnum WP_012422882 butyrate kinase Closfridium botuhnum WP_012450845 butyrate kinase Closfridium saccharoperbutylacetonicum WP_015390397 butyrate kinase Closfridium beijerinckii WP_017209677 butyrate kinase Closfridium botuhnum WP_017825911 butyrate kinase Closfridium chauvoei WP_021876994 butyrate kinase Closfridium saccharobutylicum WP_022743599 butyrate kinase Closfridium sp. WP_024615656 butyrate kinase Closfridium perfringens WP_025648345 butyrate kinase Closfridium beijerinckii WP_026886638 butyrate kinase Closfridium paraputrificum WP_027098883 butyrate kinase Closfridium sp. WP_032117070 butyrate kinase Closfridium botulinum WP_035786166 butyrate kinase Closfridium baratii WP_039312972 butyrate kinase Closfridium diolis WP_039772701 butyrate kinase Closfridium botulinum WP_041082388 butyrate kinase Closfridium beijerinckii WP_041893502 butyrate kinase Closfridium sp. WP_042402497 butyrate kinase Closfridium baratii WP_045725505 butyrate kinase Closfridium perfringens WP_049039634 butyrate kinase Closfridium botulinum WP_049180514 butyrate kinase Closfridium botulinum WP_053341511 butyrate kinase Closfridium butyricum ABU40948 butyrate kinase Closfridium sp. CDE44915 butyrate kinase Closfridium senegalense WP_010295059 butyrate kinase Closfridium intestinale WP_021800216 butyrate kinase Eubacterium venfriosum WP_005363839 butyrate kinase Closfridiales bacterium WP_021657038 butyrate kinase Closfridium sp. WP_021281242 butyrate kinase Clostridium sporogenes WP_045520059 butyrate kinase Closfridium sp. WP_050606428 butyrate kinase Closfridium botulinum WP_012048334 butyrate kinase Closfridium botulinum WP_012343352 butyrate kinase Closfridium botulinum WP_003401518 butyrate kinase Closfridium argentinense WP_039635972 butyrate kinase Closfridium botulinum WP_003357547 butyrate kinase Closfridium hydrogeniformans WP_027633205 butyrate kinase Closfridium botulinum WP_033066487 butyrate kinase Roseburia sp. CDF44202 butyrate kinase Lachnospiraceae bacterium WP_027111008 butyrate kinase Closfridium sp. CDA68344 butyrate kinase Lachnospiraceae bacterium WP_022782491 butyrate kinase Closfridium botulinum WP_012101111 butyrate kinase Closfridium carboxidivorans WP_007063155 butyrate kinase Closfridium botulinum WP_041346556 butyrate kinase Closfridium drakei WP_032078801 butyrate kinase Closfridium sp. WP_032120462 butyrate kinase Closfridium sp. WP_053468897 butyrate kinase Firmicutes bacterium CCZ27888 butyrate kinase Closfridium sp. WP_035306569 butyrate kinase Coprococcus comes CDB84787 butyrate kinase Closfridium sp. WP_035292410 butyrate kinase Closfridium sp. CCX89275 butyrate kinase Closfridium sp. WP_040212963 butyrate kinase Closfridium pasteurianum WP_003445697 butyrate kinase Closfridium sp. WP_053242610 butyrate kinase Lachnospiraceae bacterium WP_016299320 butyrate kinase Lachnospiraceae bacterium WP_022785085 butyrate kinase Lachnospiraceae bacterium WP_016281561 butyrate kinase Eubacterium sp. CDA28786 butyrate kinase Clostridium scatologenes WP_029160342 butyrate kinase Lachnospiraceae bacterium WP_016228168 butyrate kinase Closfridium pasteurianum WP_015617429 butyrate kinase Closfridium algidicarnis WP_029451332 butyrate kinase Lachnospiraceae bacterium WP_016293388 butyrate kinase Closfridium sulfidigenes WP_035133396 butyrate kinase Closfridium tetani WP_011100666 butyrate kinase Closfridium tetanomorphum WP_035147567 butyrate kinase Subdoligranulum variabile WP_007045828 butyrate kinase Eubacterium sp. CCZ03826 butyrate kinase Firmicutes bacterium CDF07483 butyrate kinase Eubacterium sp. CDB13677 butyrate kinase Closfridium sp. WP_008400594 butyrate kinase Closfridium tetani WP_023439552 butyrate kinase Closfridiales bacterium WP_022787536 butyrate kinase Lachnospiraceae bacterium WP_027434709 butyrate kinase Firmicutes bacterium CCY23249 butyrate kinase Closfridium acetobutylicum WP_010966356

[0113] In a preferred embodiment, the butyrate kinase is Buk from Clostridium acetobutylicum (WP_010966356; SEQ ID NO: 90) or Clostridium beijerinckii (WP_011967556; SEQ ID NO: 91) (WP_017209677; SEQ ID NO: 92) (WP_026886638; SEQ ID NO: 93) (WP_041893502; SEQ ID NO: 94). Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei do not natively contain butyrate kinase.

[0114] Since Ptb-Buk has been shown to function on a broad range of substrates it is reasonable to assume that if Ptb-Buk does not exhibit any activity and a desired substrate it can be engineered to achieve activity on the substrate in question. A strategy could be (but would not be limited to) rational design based on available crystal structures of Ptb and Buk with and without a bound substrate where the binding pocket would be changed to accommodate the new substrate or through saturation mutagenesis. When activity is obtained, it can be further improved through iterative cycles of enzyme engineering. These engineering efforts would be combined with assays to test enzyme activity. These types of strategies have previously proven effective (see, e.g., Huang, Nature, 537: 320-327, 2016; Khoury, Trends Biotechnol, 32: 99-109, 2014; Packer, Nature Rev Genetics, 16: 379-394, 2015; Privett, PNAS USA, 109: 3790-3795, 2012).

[0115] To improve substrate specificity of Ptb-Buk towards a specific acyl-CoA substrate, Ptb-Buk variants from public databases or generated Ptb-Buk mutants (for example, from directed evolution) can be screened using a high throughput assay, namely overexpressing Ptb-Buk enzyme pairs in E. coli, adding a test substrate, and screening for ATP production with a bioluminescence assay. The assay can use the well-established practice of correlating ATP concentration with firefly luciferase enzyme bioluminescence. The amenability of this assay to multi-well plate formats would facilitate efficient screening of substrate preference across new Ptb-Buk combinations (FIG. 33).

[0116] By screening for ATP production rather than depletion of substrate or accumulation of product, the assay avoids measuring spontaneous hydrolysis of the CoA group. However, an alternative approach described in literature, is to use free CoA can be measured using the established assay using Ellman's reagent (5,5'-dithiobis-(2-nitrobenzoic acid) or DTNB) (Thompson, Appl Environ Microbiol, 56: 607-613, 1990.) in order to estimate the coupling efficiency of the Ptb-Buk reactions (FIG. 33). Acyl-CoAs and corresponding free acids and phospho-intermediates can also be measured during the validation phase using LC-MS/MS.

[0117] In a high-throughput screening approach, it is difficult gather kinetic data due to the labor involved in protein quantification. Instead, for each preparation of E. coli lysate containing Ptb-Buk enzymes, the activity against each substrate of interest (measured as luminescence per unit time) can be compared to the activity against the positive control substrate (butyryl-CoA) and against acetyl-CoA (the physiological substrate that will likely provide the greatest competition for enzyme active sites against target acyl-CoA).

[0118] In order to ensure that the assay is not biased due to native phosphotransacetylase (Pta) and/or acetate kinase (Ack) activity, the assay can also be evaluated in an E. coli strain where pta and/or ack genes have been knocked out.

Production of Acetone and Isopropanol

[0119] Acetone and isopropanol are important industrial solvents with a combined market size of 8 million tons and a global market value of $8.5-11 billion. In addition, acetone and isopropanol are precursors to valuable downstream products, including polymethyl methacrylate (PMMA), which has a global market value of $7 billion, isobutylene, which has a global market value of $25-29 billion, and propylene, which has a global market value of $125 billion. Additionally, a route from acetone to jet fuel has recently been reported. Currently, industrial acetone production is directly linked to petrochemical phenol production, as it is a by-product of the cumene process. Around 92% of acetone output by volume is a co-product of phenol production from cumene. This has significant implications on both environment and market. In the cumene process, per mol phenol produced one mol of sodium sulfite accumulates posing a serious waste management problem and a challenge to natural environments and human health. The world market demand for phenol is expected to stagnate or decline, while the demand for acetone is predicted to rise. Alternative phenol production routes from direct oxidation of benzene are in development and expected to commercialize soon; this could result in a complete elimination of acetone production.

[0120] Acetone has been produced at industrial scale for almost 100 years, as a by-product of butanol in ABE fermentation. While industrial ABE fermentation declined in the second half of the 20.sup.th century due to low oil prices and high sugar costs, it has recently revived, with several commercial plants built during the last few years. Multiple groups have also demonstrated acetone production from sugar in heterologous hosts that express the corresponding enzymes from ABE fermentation organisms, in particular E. coli and yeast through metabolic engineering and synthetic biology approaches by several academic groups. However, low yields and high costs associated the pre-treatment needed to release the polysaccharide-component of biomass make the production of acetone via standard fermentation uneconomic as current biochemical conversion technologies do not utilize the lignin component of biomass, which can constitute up to 40% of this material.

[0121] The invention provides a microorganism capable of producing acetone or precursors thereof from a substrate. The invention further provides a method of producing acetone or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of acetone may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0122] Acetone via steps 1, 2, and 3: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 2, and 3, whereby the microorganism is capable of producing acetone or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 2 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 2, and 3 are described elsewhere in this application. If the microorganism is derived from a parental microorganism that natively contains a primary:secondary alcohol dehydrogenase capable of converting acetone to isopropanol (step 4) (e.g., Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei), the microorganism may be modified to knock down or knock out the expression of primary:secondary alcohol dehydrogenase (e.g., by disrupting the gene encoding the primary:secondary alcohol dehydrogenase), such that the microorganism produces acetone without converting it to isopropanol (WO 2015/085015).

[0123] Acetone via steps 1, 13, 14, 15, and 3: In one embodiment, the invention provides a microorganism comprising exogenous enzymes for steps 1, 13, 14, 15, and 3, whereby the microorganism is capable of producing acetone or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 14 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 13, 14, 15, and 3 are described elsewhere in this application. If the microorganism is derived from a parental microorganism that natively contains a primary:secondary alcohol dehydrogenase capable of converting acetone to isopropanol (step 4) (e.g., Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei), the microorganism may be modified to knock down or knock out the expression of primary: secondary alcohol dehydrogenase (e.g., by disrupting the gene encoding the primary:secondary alcohol dehydrogenase), such that the microorganism produces acetone without converting it to isopropanol (WO 2015/085015).

[0124] In one embodiment, the microorganism may comprise more than one pathway for the production of acetone.

[0125] The invention provides a microorganism capable of producing isopropanol or precursors thereof from a substrate. The invention further provides a method of producing isopropanol or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of isopropanol may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0126] Isopropanol via steps 1, 2, 3, and 4: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 2, 3, and 4, whereby the microorganism is capable of producing isopropanol or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 2 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 2, 3, and 4 are described elsewhere in this application. If the microorganism is derived from a parental microorganism that natively contains a primary:secondary alcohol dehydrogenase capable of converting acetone to isopropanol (step 4) (e.g., Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei), introduction of an exogenous enzyme for step 4 is not required to produce isopropanol. However, modification of the microorganism, for example, to overexpress a native primary:secondary alcohol dehydrogenase may result in enhanced production of isopropanol.

[0127] Isopropanol via steps 1, 13, 14, 15, 3, and 4: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 13, 14, 15, 3, and 4, whereby the microorganism is capable of producing isopropanol or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 14 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 13, 14, 15, 3, and 4 are described elsewhere in this application. If the microorganism is derived from a parental microorganism that natively contains a primary:secondary alcohol dehydrogenase capable of converting acetone to isopropanol (step 4) (e.g., Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei), introduction of an exogenous enzyme for step 4 is not required to produce isopropanol. However, modification of the microorganism, for example, to overexpress a native primary:secondary alcohol dehydrogenase may result in enhanced production of isopropanol.

[0128] In one embodiment, the microorganism may comprise more than one pathway for the production of isopropanol.

Production of Isobutylene

[0129] Isobutylene is a major chemical building block with a market size of over 15 million tons and a global market value of $25-29 billion. Beyond its use in chemistry and as a fuel additive (15 Mt/yr), isobutylene may be converted to isooctane, a high performance, drop-in fuel for gasoline cars. Global Bioenergies has filed patent applications on the fermentative production of isobutene (i.e., isobutylene) from acetone, but none of the disclosed routes involve Ptb-Buk (WO 2010/001078; EP 2295593; WO 2011/076691; van Leeuwen, Appl Microbiol Biotechnol, 93: 1377-1387, 2012).

[0130] The invention provides a microorganism capable of producing isobutylene or precursors thereof from a substrate. The invention further provides a method of producing isobutylene or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of isobutylene may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0131] FIG. 1 shows two alternative routes to isobutylene. The first involves the production of isobutylene via steps 1, 2, 3, 5, and 6. The second involves the production of isobutylene via steps 1, 2, 3, 7, 8, and 6. Steps 2 and 8 may be catalyzed by Ptb-Buk. Accordingly, each route may involve Ptb-Buk.

[0132] Isobutylene via steps 1, 2, 3, 5, and 6: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 2, 3, 5, and 6, whereby the microorganism is capable of producing isobutylene or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 2 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 2, 3, 5, and 6 are described elsewhere in this application. If the microorganism is derived from a parental microorganism that natively contains a primary: secondary alcohol dehydrogenase capable of converting acetone to isopropanol (step 4) (e.g., Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei), the microorganism may be modified to knock down or knock out the expression of primary:secondary alcohol dehydrogenase (e.g., by disrupting the gene encoding the primary:secondary alcohol dehydrogenase) to prevent the conversion of acetone to isopropanol and maximize the conversion of acetone to isobutylene.

[0133] Isobutylene via steps 1, 2, 3, 7, 8, and 6: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 2, 3, 7, 8, and 6, whereby the microorganism is capable of producing isobutylene or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 2 and/or step 8 are catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 2, 3, 7, 8, and 6 are described elsewhere in this application. If the microorganism is derived from a parental microorganism that natively contains a primary:secondary alcohol dehydrogenase capable of converting acetone to isopropanol (step 4) (e.g., Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei), the microorganism may be modified to knock down or knock out the expression of primary: secondary alcohol dehydrogenase (e.g., by disrupting the gene encoding the primary:secondary alcohol dehydrogenase) to prevent the conversion of acetone to isopropanol and maximize the conversion of acetone to isobutylene.

Production of 3-Hydroxybutyrate

[0134] 3-Hydroxybutyrate (3-HB) is a four carbon carboxylic acid in the family of betahydroxy acids. 3-hydroxybutyrate is a cosmetic ingredient for oily skin clarification, an intermediate for anti-aging cream formulations, an intermediate for polyhydroxybutyrate (PHB), a biodegradable polymer resin, and co-monomer with other polyhydroxy acids for novel bioplastics. Additionally, 3-hydroxybutyrate has specialty applications in biocompatible and biodegradable nanocomposites, particularly for medical implants, intermediate for C3/C4 chemicals, chiral building blocks, and fine chemicals. Although the production of (R)- and (S)-3-hydroxybutyrate by recombinant E. coli grown on glucose, the production of 3-hydroxybutyrate has not been demonstrated from microorganisms grown on gaseous substrates (Tseng, Appl Environ Microbiol, 75: 3137-3145, 2009). Notably, the system previously demonstrated in E. coli was not directly transferrable to acetogens, including C. autoethanogenum, due to the presence of native thioesterases in acetogens. Although E. coli also has a thioesterase TesB that can act on 3-HB-CoA, Tseng showed that background activity is minimal (<0.1 g/L). While in E. coli production of stereopure isomers were reported, the inventors surprisingly found that a mix of isomers were produced in C. autoethanogenum. Without being bound to this theory, this is likely a result of native isomerase activity. This enables the combination of an (S)-specific 3-hydroxybutyryl-CoA dehydrogenase (Hbd) to be combined with the (R)-specific Ptb-Buk for optimized production. To produce stereopure isomers, this activity can be knocked-out. Taken together, it this invention enables to produce several g/L of 3-HB compared to low production in E. coli and using Ptb-Buk any combination of (R)- or (S)-specific 3-hydroxybutyryl-CoA dehydrogenase and native Clostridium autoethanogenum thioesterase.

[0135] The invention provides a microorganism capable of producing 3-hydroxybutyrate or precursors thereof from a substrate. The invention further provides a method of producing 3-hydroxybutyrate or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of 3-hydroxybutyrate may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0136] FIG. 1 shows two alternative routes to 3-hydroxybutyrate. The first involves the production of 3-hydroxybutyrate via steps 1, 2, and 15. The second involves the production of 3-hydroxybutyrate via steps 1, 13, and 14. Steps 2 and 14 may be catalyzed by Ptb-Buk. Accordingly, each route may involve Ptb-Buk. In one embodiment, the microorganism may comprise more than one pathway for the production of 3-hydroxybutyrate, wherein Ptb-Buk may catalyze more than one step (e.g., steps 2 and 14).

[0137] 3-Hydroxybutyrate via steps 1, 2, and 15: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 2, and 15, whereby the microorganism is capable of producing 3-hydroxybutyrate or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 2 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 2, and 15 are described elsewhere in this application.

[0138] 3-Hydroxybutyrate via steps 1, 13, and 14: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 13, and 14, whereby the microorganism is capable of producing 3-hydroxybutyrate or precursors thereof from substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 14 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 13, and 14 are described elsewhere in this application.

Production of 1, 3-Butanediol

[0139] 1,3-Butanediol (1,3-BDO) is commonly used as a solvent for food flavoring agents and is a co-monomer used in certain polyurethane and polyester resins. More importantly, 1,3-butanediol may be catalytically converted to 1,3-butadiene (Makshina, Chem Soc Rev, 43: 7917-7953, 2014). Butadiene is used to produce rubber, plastics, lubricants, latex, and other products. While much of the butadiene produced today is used for the rubber in automobile tires, it can also be used to produce adiponitrile, which can be used in the manufacture of nylon 6,6. Global demand for butadiene is on the rise. In 2011, there was an estimated 10.5 million tons of demand, valued at $40 billion.

[0140] The invention provides a microorganism capable of producing 1,3-butanediol or precursors thereof from a substrate. The invention further provides a method of producing 1,3-butanediol or precursors thereof by culturing such a microorganism in the presence of substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of 1,3-butanediol may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0141] In certain embodiments, the microorganism may produce 1,3-butanediol without co-production of ethanol (or with production of only a small amount of ethanol, e.g., less than 0.1-1.0 g/L ethanol or less than 1-10 g/L ethanol).

[0142] FIG. 1 shows three alternative routes to 1,3-butanediol. The first involves the production of 1,3-butanediol via steps 1, 2, 15, 16, and 17. The second involves the production of 1,3-butanediol via steps 1, 13, 14, 16, and 17. The third involves the production of 1,3-butanediol via steps 1, 13, 18, and 17. Steps 2 and 14 may be catalyzed by Ptb-Buk. Accordingly, at least the first and second routes may involve Ptb-Buk. In one embodiment, the microorganism may comprise more than one pathway for the production of 1,3-butanediol. In a related embodiment, the Ptb-Buk may catalyze more than one step (e.g., steps 2 and 14).

[0143] 1,3-Butanediol via steps 1, 2, 15, 16, and 17: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 2, 15, 16, and 17, whereby the microorganism is capable of producing 1,3-butanediol or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 2 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 2, 15, 16, and 17 are described elsewhere in this application.

[0144] 1,3-Butanediol via steps 1, 13, 14, 16, and 17: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 13, 14, 16, and 17, whereby the microorganism is capable of producing 1,3-butanediol or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 14 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 13, 14, 16, and 17 are described elsewhere in this application.

[0145] 1,3-Butanediol via steps 1, 13, 18, and 17: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 13, 18, and 17, whereby the microorganism is capable of producing 1,3-butanediol or precursors thereof from a substrate, such as a gaseous substrate (FIG. 11). Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. Exemplary types and sources of enzymes for steps 1, 13, 18, and 17 are described elsewhere in this application. A similar route has been demonstrated in E. coli, but not in acetogens such as Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei (Kataoka, J Biosci Bioeng, 115: 475-480, 2013). Although the use of Ptb-Buk results in the production of (R)-1,3-butanediol, this route, which does not require the use of Ptb-Buk, may result in the production of (S)-1,3-butanediol.

Production of 2-Hydroxyisobutyrate

[0146] 2-Hydroxyisobutyrate (2-HIB) is a four carbon carboxylic acid that may serve as a building block for many types of polymers. The methyl ester of methacrylic acid, which can be synthesized by dehydration of 2-hydroxyisobutyrate or via the corresponding amide, is polymerized to polymethylmethacrylate (PMMA) for the production of acrylic glass, durable coatings, and inks. For this compound alone, the global market exceeds 3 million tons. Other branched C4 carboxylic acids, e.g., chloro- and amino-derivatives of 2-hydroxyisobutyrate, as well as isobutylene glycol and its oxide, are also used in polymers and for many other applications.

[0147] The stereospecificity of the Ptb-Buk system is particularly useful in overcoming the limitations of the current state of art with respect to the production of 2-hydroxyisobutyrate. Both Ptb-Buk and thioesterases are promiscuous, such that side activity with 3-hydroxybutyryl-CoA may divert resources away from target pathways for the production of 2-hydroxyisobutyryl-CoA (see, e.g., FIG. 1 and FIG. 8). However, Ptb-Buk is able to distinguish between stereoisomers and will act on (R)-3-hydroxybutyryl-CoA, but not on (S)-3-hydroxybutyryl-CoA. In contrast, thioesterases are not able to distinguish between 3-hydroxybutyryl-CoA stereoisomers. In a preferred embodiment, an (S)-specific acetoacetyl-CoA hydratase (EC 4.2.1.119) (step 13) is chosen in combination with the Ptb-Buk (step 20) to avoid losses to 3-hydroxybutyrate and maximize 2-hydroxyisobutyrate yield (FIG. 8). The (S)-specific form of 3-hydroxybutyryl-CoA is also the preferred substrate for the 2-hydroxyisobutyryl-CoA mutase (EC 5.4.99.-) (step 19) (Yaneva, J Biol Chem, 287: 15502-15511, 2012).

[0148] The invention provides a microorganism capable of producing 2-hydroxyisobutyrate or precursors thereof from a substrate. The invention further provides a method of producing 2-hydroxyisobutyrate or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of 2-hydroxyisobutyrate may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0149] 2-Hydroxyisobutyrate via steps 1, 13, 19, and 20: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 13, 19, and 20, whereby the microorganism is capable of producing 2-hydroxyisobutyrate or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 20 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 13, 19, and 20 are described elsewhere in this application.

[0150] In certain embodiments, the invention also provides a microorganism capable of producing 2-hydroxybutyrate (2-HB) or precursors thereof from a substrate. The invention further provides a method of producing 2-hydroxybutyrate or precursors thereof by culturing such a microorganism in the presence of a substrate. Without wishing to be bound by any particular theory, the inventors believe the observed production of 2-hydroxybutyrate is attributable to nonspecific mutase activity in microorganisms such as Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei.

Production of Adipic Acid

[0151] Adipic acid is the most important dicarboxylic acid with an estimated market of greater US $4.5 billion with about 2.5 billion kgs produced annually. Over 60% of produced adipic acid is being used as monomer precursor for the production of nylon and the global market for adipic acid is expected to reach US $7.5 billion by 2019. Currently, adipic acid is almost exclusively produced petrochemically, e.g. by carbonylation of butadiene.

[0152] The invention provides a microorganism capable of producing adipic acid or precursors thereof from a substrate (FIG. 34). The invention further provides a method of producing adipic acid or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of adipic acid may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0153] Adipic acid via steps 22, 23, 24, 25, and 26: In one embodiment, the invention provides a microorganism comprising enzymes for steps 22, 23, 24, 25, and 26, whereby the microorganism is capable of producing adipic acid or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 26 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 22, 23, 24, 25, and 26 are described elsewhere in this application.

[0154] Adipic acid via steps 21, 22, 23, 24, 25, and 26: In one embodiment, the invention provides a microorganism comprising enzymes for steps 21, 22, 23, 24, 25, and 26, whereby the microorganism is capable of producing adipic acid or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 26 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 21, 22, 23, 24, 25, and 26 are described elsewhere in this application.

[0155] In one embodiment, the microorganism may comprise more than one pathway for the production of adipic acid.

Production of 1, 3-Hexanediol

[0156] The invention provides a microorganism capable of producing 1,3-hexanediol or precursors thereof from a substrate (FIG. 35). The invention further provides a method of producing 1,3-hexanediol or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of 1,3-hexanediol may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0157] The pathways depicted in FIG. 35 begin with 3-hydroxybutyryl-CoA, which may be produced via steps 1 and 13, as depicted in FIG. 1.

[0158] 1,3-Hexanediol via steps 1, 13, 27, 31, 32, 36, 37, 38, and 39: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 13, 27, 31, 32, 36, 37, 38, and 39, whereby the microorganism is capable of producing 1,3-hexanediol or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 37 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 13, 27, 31, 32, 36, 37, 38, and 39 are described elsewhere in this application.

Production of 3-Methyl-2-butanol

[0159] The invention provides a microorganism capable of producing 3-methyl-2-butanol or precursors thereof from a substrate (FIG. 35). The invention further provides a method of producing 3-methyl-2-butanol or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of 3-methyl-2-butanol may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0160] The pathways depicted in FIG. 35 begin with 3-hydroxybutyryl-CoA, which may be produced via steps 1 and 13, as depicted in FIG. 1.

[0161] 3-Methyl-2-butanol via steps 1, 13, 27, 31, 32, 33, 34, and 35: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 13, 27, 31, 32, 33, 34, and 35, whereby the microorganism is capable of producing 3-methyl-2-butanol or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 33 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 13, 27, 31, 32, 33, 34, and 35 are described elsewhere in this application.

Production of 2-Buten-1-ol

[0162] The invention provides a microorganism capable of producing 2-buten-1-ol or precursors thereof from a substrate (FIG. 35). The invention further provides a method of producing 2-buten-1-ol or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of 2-buten-1-ol may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0163] The pathways depicted in FIG. 35 begin with 3-hydroxybutyryl-CoA, which may be produced via steps 1 and 13, as depicted in FIG. 1.

[0164] 2-Buten-1-ol via steps 1, 13, 27, 28, 29, and 30: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 13, 27, 28, 29, and 30, whereby the microorganism is capable of producing 2-buten-1-ol or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 28 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 13, 27, 28, 29, and 30 are described elsewhere in this application.

Production of Isovalerate

[0165] The invention provides a microorganism capable of producing isovalerate or precursors thereof from a substrate (FIG. 36). The invention further provides a method of producing isovalerate or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of isovalerate may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0166] Isovalerate via steps 1, 40, 41, 42, 43, and 44: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 40, 41, 42, 43, and 44, whereby the microorganism is capable of producing isovalerate or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 44 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 40, 41, 42, 43, and 44 are described elsewhere in this application.

Production of Isoamyl Alcohol

[0167] The invention provides a microorganism capable of producing isoamyl alcohol or precursors thereof from a substrate (FIG. 36). The invention further provides a method of producing isoamyl alcohol or precursors thereof by culturing such a microorganism in the presence of a substrate. In preferred embodiments, the microorganism is derived from a parental microorganism selected from the group consisting of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. However, the microorganism may also be derived from an entirely different microorganism, e.g., Escherichia coli. The enzymatic pathways described for the production of isoamyl alcohol may comprise endogenous enzymes and, where endogenous enzyme activity is absent or low, exogenous enzymes.

[0168] Isoamyl alcohol via steps 1, 40, 41, 42, 43, 44, 45, and 46: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 40, 41, 42, 43, 44, 45, and 46, whereby the microorganism is capable of producing isoamyl alcohol or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. In a preferred embodiment, step 44 is catalyzed by Ptb-Buk. Exemplary types and sources of enzymes for steps 1, 40, 41, 42, 43, 44, 45, and 46 are described elsewhere in this application.

[0169] Isoamyl alcohol via steps 1, 40, 41, 42, 43, 47 and 46: In one embodiment, the invention provides a microorganism comprising enzymes for steps 1, 40, 41, 42, 43, 47 and 46, whereby the microorganism is capable of producing isoamyl alcohol or precursors thereof from a substrate, such as a gaseous substrate. Typically, at least one of the enzymes in this pathway is exogenous to the microorganism. Exemplary types and sources of enzymes for steps 1, 40, 41, 42, 43, 47 and 46 are described elsewhere in this application.

[0170] In one embodiment, the microorganism may comprise more than one pathway for the production of isoamyl alcohol.

Production of Additional Products

[0171] The invention provides a microorganism comprising exogenous Ptb-Buk and exogenous or endogenous aldehyde:ferredoxin oxidoreductase (AOR). Such a microorganism may produce, for example, 1-propanol, 1-butanol, 1-hexanol, and 1-octanol or precursors thereof from acetyl-CoA generated, for example, from a gaseous substrate (FIG. 32). The invention further provides a method of producing 1-propanol, 1-butanol, 1-hexanol, and 1-octanol or precursors thereof by culturing such a microorganism in the presence of a gaseous substrate. Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei natively comprise AOR. However, AOR may be overexpressed in such microorganisms in combination with expression of exogenous Ptb-Buk. Alternatively, exogenous AOR and exogenous Ptb-Buk may be expressed in a microorganism other than Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei, such as Escherichia coli.

Production of Precursors and Intermediates

[0172] The pathways depicted in FIGS. 1, 34, 35, and 36 may be modified to produce precursors or intermediates of the aforementioned products. In particular, partial enzymatic pathways for any of the pathways described herein may be inserted in a host microorganism to obtain production of precursors or intermediates.

Definitions and Background

[0173] The term "genetic modification" or "genetic engineering" broadly refers to manipulation of the genome or nucleic acids of a microorganism. Likewise, the term "genetically engineered" refers to a microorganism comprising a manipulated genome or nucleic acids. Methods of genetic modification of include, for example, heterologous gene expression, gene or promoter insertion or deletion, nucleic acid mutation, altered gene expression or inactivation, enzyme engineering, directed evolution, knowledge-based design, random mutagenesis methods, gene shuffling, and codon optimization.

[0174] "Recombinant" indicates that a nucleic acid, protein, or microorganism is the product of genetic modification, engineering, or recombination. Generally, the term "recombinant" refers to a nucleic acid, protein, or microorganism that contains or is encoded by genetic material derived from multiple sources, such as two or more different strains or species of microorganisms. As used herein, the term "recombinant" may also be used to describe a microorganism that comprises a mutated nucleic acid or protein, including a mutated form of an endogenous nucleic acid or protein.

[0175] "Endogenous" refers to a nucleic acid or protein that is present or expressed in the wild-type or parental microorganism from which the microorganism of the invention is derived. For example, an endogenous gene is a gene that is natively present in the wild-type or parental microorganism from which the microorganism of the invention is derived. In one embodiment, the expression of an endogenous gene may be controlled by an exogenous regulatory element, such as an exogenous promoter.

[0176] "Exogenous" refers to a nucleic acid or protein that is not present in the wild-type or parental microorganism from which the microorganism of the invention is derived. In one embodiment, an exogenous gene or enzyme may be derived from a heterologous (i.e., different) strain or species and introduced to or expressed in the microorganism of the invention. In another embodiment, an exogenous gene or enzyme may be artificially or recombinantly created and introduced to or expressed in the microorganism of the invention. Exogenous nucleic acids may be adapted to integrate into the genome of the microorganism of the invention or to remain in an extra-chromosomal state in the microorganism of the invention, for example, in a plasmid.

[0177] "Enzyme activity," or simply "activity," refers broadly to enzymatic activity, including, but not limited, to the activity of an enzyme, the amount of an enzyme, or the availability of an enzyme to catalyze a reaction. Accordingly, "increasing" enzyme activity includes increasing the activity of an enzyme, increasing the amount of an enzyme, or increasing the availability of an enzyme to catalyze a reaction. Similarly, "decreasing" enzyme activity includes decreasing the activity of an enzyme, decreasing the amount of an enzyme, or decreasing the availability of an enzyme to catalyze a reaction.

[0178] With respect to enzyme activity, a "substrate" is a molecule upon which an enzyme acts and a "product" is a molecule produced by the action of an enzyme. A "native substrate," therefore, is a molecule upon which an enzyme natively acts in a wild-type microorganism and a "native product" is a molecule natively produced by the action of the enzyme in the wild-type microorganism. For example, butanoyl-CoA is the native substrate of Ptb and butanoyl phosphate and is the native substrate of Buk. Additionally, butanoyl phosphate is the native product of Ptb and butyrate (butanoate) is the native product of Buk. Likewise, a "non-native substrate" is a molecule upon which an enzyme does not natively act in a wild-type microorganism and a "non-native product" is a molecule not natively produced by the action of the enzyme in the wild-type microorganism. An enzyme that is capable of acting on multiple different substrates, whether native or non-native, is typically referred to as a "promiscuous" enzyme. The inventors have discovered that Ptb is promiscuous and is capable of accepting a variety of acyl-CoAs and enoyl-CoAs as substrates, such that Ptb-Buk may be used to convert a number of acyl-CoAs and enoyl-CoAs to their corresponding acids or alkenates, respectively, while simultaneously generating ATP. Thus, in preferred embodiments, the Ptb-Buk of the invention acts on non-native substrates (i.e., substrates other than butanoyl-CoA and/or butanoyl phosphate) to produce non-native products (i.e., products other than butanoyl phosphate and/or butyrate (butanoate)).

[0179] The term "butyryl-CoA" may be used interchangeably herein with "butanoyl-CoA."

[0180] The term "energy-generating" or the like may be used interchangeably herein with "energy-conserving" or the like. Both of these terms are commonly used in the literature.

[0181] "Mutated" refers to a nucleic acid or protein that has been modified in the microorganism of the invention compared to the wild-type or parental microorganism from which the microorganism of the invention is derived. In one embodiment, the mutation may be a deletion, insertion, or substitution in a gene encoding an enzyme. In another embodiment, the mutation may be a deletion, insertion, or substitution of one or more amino acids in an enzyme.

[0182] In particular, a "disruptive mutation" is a mutation that reduces or eliminates (i.e., "disrupts") the expression or activity of a gene or enzyme. The disruptive mutation may partially inactivate, fully inactivate, or delete the gene or enzyme. The disruptive mutation may be a knockout (KO) mutation. The disruptive mutation may be any mutation that reduces, prevents, or blocks the biosynthesis of a product produced by an enzyme. The disruptive mutation may include, for example, a mutation in a gene encoding an enzyme, a mutation in a genetic regulatory element involved in the expression of a gene encoding an enzyme, the introduction of a nucleic acid which produces a protein that reduces or inhibits the activity of an enzyme, or the introduction of a nucleic acid (e.g., antisense RNA, siRNA, CRISPR) or protein which inhibits the expression of an enzyme. The disruptive mutation may be introduced using any method known in the art.

[0183] Introduction of a disruptive mutation results in a microorganism of the invention that produces no target product or substantially no target product or a reduced amount of target product compared to the parental microorganism from which the microorganism of the invention is derived. For example, the microorganism of the invention may produce no target product or at least about 1%, 3%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 95% less target product than the parental microorganism. For example, the microorganism of the invention may produce less than about 0.001, 0.01, 0.10, 0.30, 0.50, or 1.0 g/L target product.

[0184] "Codon optimization" refers to the mutation of a nucleic acid, such as a gene, for optimized or improved translation of the nucleic acid in a particular strain or species. Codon optimization may result in faster translation rates or higher translation accuracy. In a preferred embodiment, the genes of the invention are codon optimized for expression in Clostridium, particularly Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. In a further preferred embodiment, the genes of the invention are codon optimized for expression in Clostridium autoethanogenum LZ1561, which is deposited under DSMZ accession number DSM23693.

[0185] "Overexpressed" refers to an increase in expression of a nucleic acid or protein in the microorganism of the invention compared to the wild-type or parental microorganism from which the microorganism of the invention is derived. Overexpression may be achieved by any means known in the art, including modifying gene copy number, gene transcription rate, gene translation rate, or enzyme degradation rate.

[0186] The term "variants" includes nucleic acids and proteins whose sequence varies from the sequence of a reference nucleic acid and protein, such as a sequence of a reference nucleic acid and protein disclosed in the prior art or exemplified herein. The invention may be practiced using variant nucleic acids or proteins that perform substantially the same function as the reference nucleic acid or protein. For example, a variant protein may perform substantially the same function or catalyze substantially the same reaction as a reference protein. A variant gene may encode the same or substantially the same protein as a reference gene. A variant promoter may have substantially the same ability to promote the expression of one or more genes as a reference promoter.

[0187] Such nucleic acids or proteins may be referred to herein as "functionally equivalent variants." By way of example, functionally equivalent variants of a nucleic acid may include allelic variants, fragments of a gene, mutated genes, polymorphisms, and the like. Homologous genes from other microorganisms are also examples of functionally equivalent variants. These include homologous genes in species such as Clostridium acetobutylicum, Clostridium beijerinckii, or Clostridium ljungdahlii, the details of which are publicly available on websites such as Genbank or NCBI. Functionally equivalent variants also include nucleic acids whose sequence varies as a result of codon optimization for a particular microorganism. A functionally equivalent variant of a nucleic acid will preferably have at least approximately 70%, approximately 80%, approximately 85%, approximately 90%, approximately 95%, approximately 98%, or greater nucleic acid sequence identity (percent homology) with the referenced nucleic acid. A functionally equivalent variant of a protein will preferably have at least approximately 70%, approximately 80%, approximately 85%, approximately 90%, approximately 95%, approximately 98%, or greater amino acid identity (percent homology) with the referenced protein. The functional equivalence of a variant nucleic acid or protein may be evaluated using any method known in the art.

[0188] Nucleic acids may be delivered to a microorganism of the invention using any method known in the art. For example, nucleic acids may be delivered as naked nucleic acids or may be formulated with one or more agents, such as liposomes. The nucleic acids may be DNA, RNA, cDNA, or combinations thereof, as is appropriate. Restriction inhibitors may be used in certain embodiments. Additional vectors may include plasmids, viruses, bacteriophages, cosmids, and artificial chromosomes. In a preferred embodiment, nucleic acids are delivered to the microorganism of the invention using a plasmid. By way of example, transformation (including transduction or transfection) may be achieved by electroporation, ultrasonication, polyethylene glycol-mediated transformation, chemical or natural competence, protoplast transformation, prophage induction, or conjugation. In certain embodiments having active restriction enzyme systems, it may be necessary to methylate a nucleic acid before introduction of the nucleic acid into a microorganism.

[0189] Furthermore, nucleic acids may be designed to comprise a regulatory element, such as a promoter, to increase or otherwise control expression of a particular nucleic acid. The promoter may be a constitutive promoter or an inducible promoter. Ideally, the promoter is a Wood-Ljungdahl pathway promoter, a ferredoxin promoter, a pyruvate:ferredoxin oxidoreductase promoter, an Rnf complex operon promoter, an ATP synthase operon promoter, or a phosphotransacetylase/acetate kinase operon promoter.

[0190] A "microorganism" is a microscopic organism, especially a bacterium, archea, virus, or fungus. The microorganism of the invention is typically a bacterium. As used herein, recitation of "microorganism" should be taken to encompass "bacterium."

[0191] A "parental microorganism" is a microorganism used to generate a microorganism of the invention. The parental microorganism may be a naturally-occurring microorganism (i.e., a wild-type microorganism) or a microorganism that has been previously modified (i.e., a mutant or recombinant microorganism). The microorganism of the invention may be modified to express or overexpress one or more enzymes that were not expressed or overexpressed in the parental microorganism. Similarly, the microorganism of the invention may be modified to contain one or more genes that were not contained by the parental microorganism. The microorganism of the invention may also be modified to not express or to express lower amounts of one or more enzymes that were expressed in the parental microorganism. In one embodiment, the parental microorganism is Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. In a preferred embodiment, the parental microorganism is Clostridium autoethanogenum LZ1561, which is deposited under DSMZ accession number DSM23693.

[0192] The term "derived from" indicates that a nucleic acid, protein, or microorganism is modified or adapted from a different (e.g., a parental or wild-type) nucleic acid, protein, or microorganism, so as to produce a new nucleic acid, protein, or microorganism. Such modifications or adaptations typically include insertion, deletion, mutation, or substitution of nucleic acids or genes. Generally, the microorganism of the invention is derived from a parental microorganism. In one embodiment, the microorganism of the invention is derived from Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. In a preferred embodiment, the microorganism of the invention is derived from Clostridium autoethanogenum LZ1561, which is deposited under DSMZ accession number DSM23693.

[0193] The microorganism of the invention may be further classified based on functional characteristics. For example, the microorganism of the invention may be or may be derived from a C1-fixing microorganism, an anaerobe, an acetogen, an ethanologen, a carboxydotroph, and/or a methanotroph. Table 1 provides a representative list of microorganisms and identifies their functional characteristics.

TABLE-US-00004 TABLE 1 C1- Anaer- Ace- Ethanol- Auto- Carboxy- Meth- fixing obe togen ogen troph dotroph anotroph Acetobacterium woodii + + + +/- .sup.1 - - - Alkalibaculum bacchii + + + + + + - Blautia producta + + + - + + - Butyri bacterium methylotrophicum + + + + + + - Clostridium aceticum + + + - + + - Clostridium autoethanogenum + + + + + + - Clostridium carboxidivorans + + + + + + - Clostridium coskatii + + + + + + - Clostridium drakei + + + - + + - Clostridium formicoaceticum + + + - + + - Clostridium ljungdahlii + + + + + + - Clostridium magnum + + + - + +/- .sup.2 - Clostridium ragsdalei + + + + + + - Clostridium scatologenes + + + - + + - Eubacterium limosum + + + - + + - Moorella thermautotrophica + + + + + + - Moorella thermoacetica (formerly + + + - .sup.3 + + - Clostridium thermoaceticum) Oxobacter pfennigii + + + - + + - Sporomusa ovata + + + - + +/- .sup.4 - Sporomusa silvacetica + + + - + +/- .sup.5 - Sporomusa sphaeroides + + + - + +/- .sup.6 - Thermoanaerobacter kiuvi + + + - + - - .sup.1 Acetobacterium woodi can produce ethanol from fructose, but not from gas. .sup.2 It has not been investigated whether Clostridium magnum can grow on CO. .sup.3 One strain of Moorella thermoacetica, Moorella sp. HUC22-1, has been reported to produce ethanol from gas. .sup.4 It has not been investigated whether Sporomusa ovata can grow on CO. .sup.5 It has not been investigated whether Sporomusa silvacetica can grow on CO. .sup.6 It has not been investigated whether Sporomusa sphaeroides can grow on CO.

[0194] "C1" refers to a one-carbon molecule, for example, CO, CO.sub.2, CH.sub.4, or CH.sub.3OH. "C1-oxygenate" refers to a one-carbon molecule that also comprises at least one oxygen atom, for example, CO, CO.sub.2, or CH.sub.3OH. "C1-carbon source" refers a one carbon-molecule that serves as a partial or sole carbon source for the microorganism of the invention. For example, a C1-carbon source may comprise one or more of CO, CO.sub.2, CH.sub.4, CH.sub.3OH, or CH.sub.2O.sub.2. Preferably, the C1-carbon source comprises one or both of CO and CO.sub.2. A "C1-fixing microorganism" is a microorganism that has the ability to produce one or more products from a C1-carbon source. Typically, the microorganism of the invention is a C1-fixing bacterium. In a preferred embodiment, the microorganism of the invention is derived from a C1-fixing microorganism identified in Table 1.

[0195] An "anaerobe" is a microorganism that does not require oxygen for growth. An anaerobe may react negatively or even die if oxygen is present above a certain threshold. Typically, the microorganism of the invention is an anaerobe. In a preferred embodiment, the microorganism of the invention is derived from an anaerobe identified in Table 1.

[0196] An "acetogen" is a microorganism that produces or is capable of producing acetate (or acetic acid) as a product of anaerobic respiration. Typically, acetogens are obligately anaerobic bacteria that use the Wood-Ljungdahl pathway as their main mechanism for energy conservation and for synthesis of acetyl-CoA and acetyl-CoA-derived products, such as acetate (Ragsdale, Biochim Biophys Acta, 1784: 1873-1898, 2008). Acetogens use the acetyl-CoA pathway as a (1) mechanism for the reductive synthesis of acetyl-CoA from CO.sub.2, (2) terminal electron-accepting, energy conserving process, (3) mechanism for the fixation (assimilation) of CO.sub.2 in the synthesis of cell carbon (Drake, Acetogenic Prokaryotes, In: The Prokaryotes, 3.sup.rd edition, p. 354, New York, N.Y., 2006). All naturally occurring acetogens are C1-fixing, anaerobic, autotrophic, and non-methanotrophic. Typically, the microorganism of the invention is an acetogen. In a preferred embodiment, the microorganism of the invention is derived from an acetogen identified in Table 1.

[0197] An "ethanologen" is a microorganism that produces or is capable of producing ethanol. Typically, the microorganism of the invention is an ethanologen. In a preferred embodiment, the microorganism of the invention is derived from an ethanologen identified in Table 1.

[0198] An "autotroph" is a microorganism capable of growing in the absence of organic carbon. Instead, autotrophs use inorganic carbon sources, such as CO and/or CO.sub.2. Typically, the microorganism of the invention is an autotroph. In a preferred embodiment, the microorganism of the invention is derived from an autotroph identified in Table 1.

[0199] A "carboxydotroph" is a microorganism capable of utilizing CO as a sole source of carbon. Typically, the microorganism of the invention is a carboxydotroph. In a preferred embodiment, the microorganism of the invention is derived from a carboxydotroph identified in Table 1.

[0200] A "methanotroph" is a microorganism capable of utilizing methane as a sole source of carbon and energy. In certain embodiments, the microorganism of the invention is derived from a methanotroph.

[0201] More broadly, the microorganism of the invention may be derived from any genus or species identified in Table 1.

[0202] In a preferred embodiment, the microorganism of the invention is derived from the cluster of Clostridia comprising the species Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei. These species were first reported and characterized by Abrini, Arch Microbiol, 161: 345-351, 1994 (Clostridium autoethanogenum), Tanner, Int J System Bacteriol, 43: 232-236, 1993 (Clostridium ljungdahlii), and Huhnke, WO 2008/028055 (Clostridium ragsdalei).

[0203] These three species have many similarities. In particular, these species are all C1-fixing, anaerobic, acetogenic, ethanologenic, and carboxydotrophic members of the genus Clostridium. These species have similar genotypes and phenotypes and modes of energy conservation and fermentative metabolism. Moreover, these species are clustered in clostridial rRNA homology group I with 16S rRNA DNA that is more than 99% identical, have a DNA G+C content of about 22-30 mol %, are gram-positive, have similar morphology and size (logarithmic growing cells between 0.5-0.7.times.3-5 .mu.m), are mesophilic (grow optimally at 30-37.degree. C.), have similar pH ranges of about 4-7.5 (with an optimal pH of about 5.5-6), lack cytochromes, and conserve energy via an Rnf complex. Also, reduction of carboxylic acids into their corresponding alcohols has been shown in these species (Perez, Biotechnol Bioeng, 110:1066-1077, 2012). Importantly, these species also all show strong autotrophic growth on CO-containing gases, produce ethanol and acetate (or acetic acid) as main fermentation products, and produce small amounts of 2,3-butanediol and lactic acid under certain conditions.

[0204] However, these three species also have a number of differences. These species were isolated from different sources: Clostridium autoethanogenum from rabbit gut, Clostridium ljungdahlii from chicken yard waste, and Clostridium ragsdalei from freshwater sediment. These species differ in utilization of various sugars (e.g., rhamnose, arabinose), acids (e.g., gluconate, citrate), amino acids (e.g., arginine, histidine), and other substrates (e.g., betaine, butanol). Moreover, these species differ in auxotrophy to certain vitamins (e.g., thiamine, biotin). These species have differences in nucleic and amino acid sequences of Wood-Ljungdahl pathway genes and proteins, although the general organization and number of these genes and proteins has been found to be the same in all species (Kopke, Curr Opin Biotechnol, 22: 320-325, 2011).

[0205] Thus, in summary, many of the characteristics of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei are not specific to that species, but are rather general characteristics for this cluster of C1-fixing, anaerobic, acetogenic, ethanologenic, and carboxydotrophic members of the genus Clostridium. However, since these species are, in fact, distinct, the genetic modification or manipulation of one of these species may not have an identical effect in another of these species. For instance, differences in growth, performance, or product production may be observed.

[0206] The microorganism of the invention may also be derived from an isolate or mutant of Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. Isolates and mutants of Clostridium autoethanogenum include JA1-1 (DSM10061) (Abrini, Arch Microbiol, 161: 345-351, 1994), LBS1560 (DSM19630) (WO 2009/064200), and LZ1561 (DSM23693). Isolates and mutants of Clostridium ljungdahlii include ATCC 49587 (Tanner, Int J Syst Bacteriol, 43: 232-236, 1993), PETCT (DSM13528, ATCC 55383), ERI-2 (ATCC 55380) (U.S. Pat. No. 5,593,886), C-01 (ATCC 55988) (U.S. Pat. No. 6,368,819), O-52 (ATCC 55989) (U.S. Pat. No. 6,368,819), and OTA-1 (Tirado-Acevedo, Production of bioethanol from synthesis gas using Clostridium ljungdahlii, PhD thesis, North Carolina State University, 2010). Isolates and mutants of Clostridium ragsdalei include PI 1 (ATCC BAA-622, ATCC PTA-7826) (WO 2008/028055).

[0207] In some embodiments, however, the microorganism of the invention is a microorganism other than Clostridium autoethanogenum, Clostridium ljungdahlii, or Clostridium ragsdalei. For example, the microorganism may be selected from the group consisting of Escherichia coli, Saccharomyces cerevisiae, Clostridium acetobutylicum, Clostridium beijerinckii, Clostridium saccharbutyricum, Clostridium saccharoperbutylacetonicum, Clostridium butyricum, Clostridium diolis, Clostridium kluyveri, Clostridium pasterianium, Clostridium novyi, Clostridium difficile, Clostridium thermocellum, Clostridium cellulolyticum, Clostridium cellulovorans, Clostridium phytofermentans, Lactococcus lactis, Bacillus subtilis, Bacillus licheniformis, Zymomonas mobilis, Klebsiella oxytoca, Klebsiella pneumonia, Corynebacterium glutamicum, Trichoderma reesei, Cupriavidus necator, Pseudomonas putida, Lactobacillus plantarum, and Methylobacterium extorquens.

[0208] "Substrate" refers to a carbon and/or energy source for the microorganism of the invention. Typically, the substrate is gaseous and comprises a C1-carbon source, for example, CO, CO.sub.2, and/or CH.sub.4. Preferably, the substrate comprises a C1-carbon source of CO or CO+CO.sub.2. The substrate may further comprise other non-carbon components, such as H.sub.2, N.sub.2, or electrons.

[0209] The substrate generally comprises at least some amount of CO, such as about 1, 2, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, or 100 mol % CO. The substrate may comprise a range of CO, such as about 20-80, 30-70, or 40-60 mol % CO. Preferably, the substrate comprises about 40-70 mol % CO (e.g., steel mill or blast furnace gas), about 20-30 mol % CO (e.g., basic oxygen furnace gas), or about 15-45 mol % CO (e.g., syngas). In some embodiments, the substrate may comprise a relatively low amount of CO, such as about 1-10 or 1-20 mol % CO. The microorganism of the invention typically converts at least a portion of the CO in the substrate to a product. In some embodiments, the substrate comprises no or substantially no CO.

[0210] The substrate may comprise some amount of H.sub.2. For example, the substrate may comprise about 1, 2, 5, 10, 15, 20, or 30 mol % H.sub.2. In some embodiments, the substrate may comprise a relatively high amount of H.sub.2, such as about 60, 70, 80, or 90 mol % H.sub.2. In further embodiments, the substrate comprises no or substantially no H.sub.2.

[0211] The substrate may comprise some amount of CO.sub.2. For example, the substrate may comprise about 1-80 or 1-30 mol % CO.sub.2. In some embodiments, the substrate may comprise less than about 20, 15, 10, or 5 mol % CO.sub.2. In another embodiment, the substrate comprises no or substantially no CO.sub.2.

[0212] Although the substrate is typically gaseous, the substrate may also be provided in alternative forms. For example, the substrate may be dissolved in a liquid saturated with a CO-containing gas using a microbubble dispersion generator. By way of further example, the substrate may be adsorbed onto a solid support.

[0213] The substrate and/or C1-carbon source may be a waste gas obtained as a byproduct of an industrial process or from some other source, such as from automobile exhaust fumes or biomass gasification. In certain embodiments, the industrial process is selected from the group consisting of ferrous metal products manufacturing, such as a steel mill manufacturing, non-ferrous products manufacturing, petroleum refining processes, coal gasification, electric power production, carbon black production, ammonia production, methanol production, and coke manufacturing. In these embodiments, the substrate and/or C1-carbon source may be captured from the industrial process before it is emitted into the atmosphere, using any convenient method.

[0214] The substrate and/or C1-carbon source may be syngas, such as syngas obtained by gasification of coal or refinery residues, gasification of biomass or lignocellulosic material, or reforming of natural gas. In another embodiment, the syngas may be obtained from the gasification of municipal solid waste or industrial solid waste.

[0215] The composition of the substrate may have a significant impact on the efficiency and/or cost of the reaction. For example, the presence of oxygen (O.sub.2) may reduce the efficiency of an anaerobic fermentation process. Depending on the composition of the substrate, it may be desirable to treat, scrub, or filter the substrate to remove any undesired impurities, such as toxins, undesired components, or dust particles, and/or increase the concentration of desirable components.

[0216] The microorganism of the invention may be cultured to produce one or more products. For instance, Clostridium autoethanogenum produces or can be engineered to produce ethanol (WO 2007/117157), acetate (WO 2007/117157), butanol (WO 2008/115080 and WO 2012/053905), butyrate (WO 2008/115080), 2,3-butanediol (WO 2009/151342), lactate (WO 2011/112103), butene (WO 2012/024522), butadiene (WO 2012/024522), methyl ethyl ketone (2-butanone) (WO 2012/024522 and WO 2013/185123), ethylene (WO 2012/026833), acetone (WO 2012/115527), isopropanol (WO 2012/115527), lipids (WO 2013/036147), 3-hydroxypropionate (3-HP) (WO 2013/180581), isoprene (WO 2013/180584), fatty acids (WO 2013/191567), 2-butanol (WO 2013/185123), 1,2-propanediol (WO 2014/0369152), and 1-propanol (WO 2014/0369152). In addition to one or more target products, the microorganism of the invention may also produce ethanol, acetate, and/or 2,3-butanediol. In certain embodiments, microbial biomass itself may be considered a product.

[0217] A "native product" is a product produced by a genetically unmodified microorganism. For example, ethanol, acetate, and 2,3-butanediol are native products of Clostridium autoethanogenum, Clostridium ljungdahlii, and Clostridium ragsdalei. A "non-native product" is a product that is produced by a genetically modified microorganism, but is not produced by a genetically unmodified microorganism from which the genetically modified microorganism is derived.

[0218] The terms "intermediate" and "precursor," which may be referred to interchangeably herein, refer to a molecular entity in an enzymatic pathway upstream of an observed or target product.

[0219] "Selectivity" refers to the ratio of the production of a target product to the production of all fermentation products produced by a microorganism. The microorganism of the invention may be engineered to produce products at a certain selectivity or at a minimum selectivity. In one embodiment, a target product account for at least about 5%, 10%, 15%, 20%, 30%, 50%, or 75% of all fermentation products produced by the microorganism of the invention. In one embodiment, the target product accounts for at least 10% of all fermentation products produced by the microorganism of the invention, such that the microorganism of the invention has a selectivity for the target product of at least 10%. In another embodiment, the target product accounts for at least 30% of all fermentation products produced by the microorganism of the invention, such that the microorganism of the invention has a selectivity for the target product of at least 30%.

[0220] "Increasing the efficiency," "increased efficiency," and the like include, but are not limited to, increasing growth rate, product production rate or volume, product volume per volume of substrate consumed, or product selectivity. Efficiency may be measured relative to the performance of parental microorganism from which the microorganism of the invention is derived.

[0221] Typically, the culture is performed in a bioreactor. The term "bioreactor" includes a culture/fermentation device consisting of one or more vessels, towers, or piping arrangements, such as a continuous stirred tank reactor (CSTR), immobilized cell reactor (ICR), trickle bed reactor (TBR), bubble column, gas lift fermenter, static mixer, or other vessel or other device suitable for gas-liquid contact. In some embodiments, the bioreactor may comprise a first growth reactor and a second culture/fermentation reactor. The substrate may be provided to one or both of these reactors. As used herein, the terms "culture" and "fermentation" are used interchangeably. These terms encompass both the growth phase and product biosynthesis phase of the culture/fermentation process.

[0222] The culture is generally maintained in an aqueous culture medium that contains nutrients, vitamins, and/or minerals sufficient to permit growth of the microorganism. Preferably the aqueous culture medium is an anaerobic microbial growth medium, such as a minimal anaerobic microbial growth medium. Suitable media are well known in the art.

[0223] The culture/fermentation should desirably be carried out under appropriate conditions for production of the target product. Typically, the culture/fermentation is performed under anaerobic conditions. Reaction conditions to consider include pressure (or partial pressure), temperature, gas flow rate, liquid flow rate, media pH, media redox potential, agitation rate (if using a continuous stirred tank reactor), inoculum level, maximum gas substrate concentrations to ensure that gas in the liquid phase does not become limiting, and maximum product concentrations to avoid product inhibition. In particular, the rate of introduction of the substrate may be controlled to ensure that the concentration of gas in the liquid phase does not become limiting, since products may be consumed by the culture under gas-limited conditions.

[0224] Operating a bioreactor at elevated pressures allows for an increased rate of gas mass transfer from the gas phase to the liquid phase. Accordingly, it is generally preferable to perform the culture/fermentation at pressures higher than atmospheric pressure. Also, since a given gas conversion rate is, in part, a function of the substrate retention time and retention time dictates the required volume of a bioreactor, the use of pressurized systems can greatly reduce the volume of the bioreactor required and, consequently, the capital cost of the culture/fermentation equipment. This, in turn, means that the retention time, defined as the liquid volume in the bioreactor divided by the input gas flow rate, can be reduced when bioreactors are maintained at elevated pressure rather than atmospheric pressure. The optimum reaction conditions will depend partly on the particular microorganism used. However, in general, it is preferable to operate the fermentation at a pressure higher than atmospheric pressure. Also, since a given gas conversion rate is in part a function of substrate retention time and achieving a desired retention time in turn dictates the required volume of a bioreactor, the use of pressurized systems can greatly reduce the volume of the bioreactor required, and consequently the capital cost of the fermentation equipment.

[0225] Target products may be separated or purified from a fermentation broth using any method or combination of methods known in the art, including, for example, fractional distillation, evaporation, pervaporation, gas stripping, phase separation, and extractive fermentation, including for example, liquid-liquid extraction. In certain embodiments, target products are recovered from the fermentation broth by continuously removing a portion of the broth from the bioreactor, separating microbial cells from the broth (conveniently by filtration), and recovering one or more target products from the broth. Alcohols and/or acetone may be recovered, for example, by distillation. Acids may be recovered, for example, by adsorption on activated charcoal. Separated microbial cells are preferably returned to the bioreactor. The cell-free permeate remaining after target products have been removed is also preferably returned to the bioreactor. Additional nutrients (such as B vitamins) may be added to the cell-free permeate to replenish the medium before it is returned to the bioreactor.

EXAMPLES

[0226] The following examples further illustrate the invention but, of course, should not be construed to limit its scope in any way.

Example 1

[0227] This example demonstrates the ability of Ptb-Buk to convert acetoacetyl-CoA to acetoacetate in E. coli in vivo and its use in production of acetone, isopropanol, 3-hydroxybutyrate, and isobutylene

[0228] Pathways that rely on the Ptb-Buk system for acetoacetate production from acetoacetyl-CoA were designed and constructed. This was done in a modular fashion using a pDUET vector system (Novagen). One module contained ptb-buk genes from C. beijerinckii NCIMB8052 (GenBank NC_009617, position 232027..234147; Cbei_0203-204; NCBI-GeneID 5291437-38) on plasmid pACYC. Another module contained the thiolase gene thlA of C. acetobutylicum (Genbank NC_001988, position 82040..83218; CA_P0078; NCBI-GeneID 1116083) and the acetoacetate decarboxylase gene adc of C. beijerinckii NCIMB8052 (Genbank NC_009617, position 4401916..4402656; Cbei_3835; NCBI-GeneID 5294996) on plasmid pCOLA. Ptb and buk genes were amplified from genomic DNA of C. beijerinckii NCIMB8052 and thlA and adc genes from an existing acetone plasmid pMTL85147-thlA-ctfAB-adc (WO 2012/115527) and cloned under control of the T7 promoter present in the pDUET vectors via restriction independent cloning with the circular polymerase extension cloning (CPEC) method (Quan, PloS One, 4:e6441, 2009).

[0229] Oligonucleotides used for amplification of ptb and buk genes:

TABLE-US-00005 SEQ ID NO: Name Sequence Direction 95 pACYCDuet-ptb- AAGTTTTTACTCATATGTAT reverse buk-pACYC-ptb-R1 ATCTCCTTCTTATACTTAAC 96 pACYCDuet-ptb- AGAAGGAGATATACATATGA forward buk-ptb-pACYC-F1 GTAAAAACTTTGATGAGTTA 97 pACYCDuet-ptb- ACCAGACTCGAGGGTACCTA reverse buk-buk-pACYC-R1 GTAAACCTTAGCTTGTTC 98 pACYCDuet-ptb- TAAGGTTTACTAGGTACCCT forward buk-pACYC-buk-F1 CGAGTCTGGTAAAGAAAC

[0230] Oligonucleotides used for amplification of thlA and adc genes:

TABLE-US-00006 SEQ ID NO: Name Sequence Direction 99 pCOLADuet-thlA- ACATATGTATATCTCCTTC reverse adc-thlA-adc-R1 TTACTAGCACTTTTCTAGC AATATTG 100 pCOLADuet-thlA- AGTAAGAAGGAGATATACA forward adc-adc-Th1A-F1 TATGTTAGAAAGTGAAGTA TCTAAAC 101 pCOLADuet-thlA- CAGACTCGAGGGTACCTTA reverse adc-adc-pCOLA-R1 TTTTACTGAAAGATAATCA TGTAC 102 pCOLADuet-thlA- TCTTTCAGTAAAATAAGGT forward adc-pCOLA-adc-F1 ACCCTCGAGTCTGGTAAAG AAAC 103 pCOLADuet-thlA- GAAGGAGATATACATATGA forward adc-thlA-pCOLA- AAGAAGTTGTAATAGCTAG F1 TG 104 pCOLADuet-thlA- ACAACTTCTTTCATATGTA reverse adc-pCOLA-thlA- TATCTCCTTCTTATACTTA R1 AC

[0231] After the plasmids pACYC-ptb-buk (SEQ ID NO: 105) and pCOLA-thlA-adc (SEQ ID NO: 106) were constructed, they were transformed individually and together into E. coli BL21 (DE3) (Novagen) and growth experiments carried out in quadruplicates in 1.5 mL cultures in 12-well plates at 28.degree. C. with 160 rpm orbital shaking using M9 minimal medium (Sambrook, Molecular Cloning: A Laboratory Manual, Vol 3, Cold Spring Harbour Press, 1989) with glucose (FIG. 4). The cultures were inoculated at an OD 600 nm of 0.1 and induced with different concentrations of IPTG (0, 50, 100 .mu.M) after 2 h of growth (FIG. 5). The plates were sealed using plate tape strips and each well was pierced with a green tipped needle to provide micro-aerobic conditions. Growth was carried out for another 64 h of induction. The experiment was repeated in triplicate.

[0232] Acetone concentrations, as well as the concentrations of other metabolites such as isobutylene, were measured using gas chromatography (GC) analysis, employing an Agilent 6890N headspace GC equipped with a Supelco polyethylene glycol (PEG) 60-.mu.m solid-phase microextraction fiber, a Restek Rtx-1 (30 m.times.0.32 .mu.m.times.5 .mu.m) column, and a flame ionization detector (FID). Samples (4 ml) were transferred into a 20-ml headspace vial, upon which the fiber was incubated (exposed) for 10 min at 50.degree. C. The sample was desorbed in the injector at 250.degree. C. for 9 min. Chromatography was performed with an oven program of 40.degree. C. (5-min hold) and 10.degree. C./min to 200.degree. C., followed by a 5-min hold at 220.degree. C. The column flow rate was 1 ml/min, with hydrogen as the carrier gas. The FID was kept at 250.degree. C., with hydrogen at 40 ml/min, air at 450 ml/min, and nitrogen at 15 ml/min as the makeup gas.

[0233] It was immediately obvious that acetone was produced in the strain carrying both the pACYC-ptb-buk and pCOLA-thlA-adc plasmids (expressing thiolase, Ptb-Buk, and acetoacetate decarboxylase). Average final acetone production of 0.19 g/L was measured, whereas no acetone was produced in a no plasmid control, media control, and single plasmid controls pACYC-ptb-buk (expressing Ptb-Buk) or pCOLA-thlA-adc plasmid (expressing thiolase and acetoacetate decarboxylase) (below reliable detection limit). The uninduced culture of the strain carrying both the pACYC-ptb-buk and pCOLA-thlA-adc plasmids (expressing thiolase, Ptb-Buk, and acetoacetate decarboxylase) did not produce appreciable amounts of acetone.

[0234] Average acetone production in E. coli BL21 (DE3):

TABLE-US-00007

[0234] Strain Acetone (g/L) Thl + Ptb-Buk + Adc [E. coli BL21 (DE3) + 0.19 .+-. 0.04 pACYC-ptb-buk + pCOLA-thlA-adc] Thl + Adc alone [E. coli BL21 (DE3) + pCOLA-thlA-adc] 0 .04 .+-. 0.01 Ptb-Buk alone [E. coli BL21 (DE3) + pACYC-ptb-buk] 0.03 .+-. 0.01 No plasmid control [E. coli BL21 (DE3)] 0.04 .+-. 0.01 Media control 0.03 .+-. 0.01

[0235] This experiment clearly demonstrates that Ptb-Buk is able to perform the conversion of acetoacetyl-CoA to acetoacetate can be used in place of a CoA-transferase or a thioesterase for the production of acetone, exemplified using a route that comprises steps 1, 2, and 3 of FIG. 1.

[0236] It is well known that isopropanol can be produced from acetone by addition of a primary:secondary alcohol dehydrogenase (Kopke, Appl Environ Microbiol, 80: 3394-3403, 2014) (step 4 in FIG. 1) and that isobutylene can be produced from acetone via addition of a hydroxyisovalerate synthase (step 5 in FIG. 1) and decarboxylase (step 6 in FIG. 1) (van Leeuwen, Appl Microbiol Biotechnol, 93: 1377-1387, 2012). A pathway can be constructed that includes the above-demonstrated acetone route via Ptb-Buk with the genes thlA, ptb-buk, and adc and a primary:secondary alcohol dehydrogenase gene (e.g., Genbank accession number NC_022592, pos. 609711..610766; CAETHG_0553; NCBI-GeneID: 17333984) that would allow isopropanol production via the Ptb-Buk system in E. coli comprising steps 1, 2, 3, and 4 of FIG. 1. Similarly, a pathway can be constructed that includes the above-demonstrated acetone route via Ptb-Buk conversion of acetoacetyl-CoA to acetoacetate with the genes thlA, ptb-buk, and adc and genes for a hydroxyisovalerate synthase and decarboxylase that would allow isobutylene production via the Ptb-Buk system in E. coli comprising of steps 1, 2, 3, 5, and 6 of FIG. 1. Acetoacetate can also be converted to 3-hydroxybutyrate via a 3-hydroxybutyrate dehydrogenase Bdh. This can be combined with Ptb-Buk conversion of acetoacetyl-CoA to acetoacetate for 3-hydroxybutyrate production in a strain expressing genes thlA, ptb-buk, and bdh resulting in a pathway comprising steps 1, 2, and 15 of FIG. 1.

Example 2

[0237] This example demonstrates the ability of Ptb-Buk to convert acetoacetyl-CoA to acetoacetate in C. autoethanogenum in vivo and the use of Ptb-Buk in the production of acetone, isopropanol, 3-hydroxybutyrate, and isobutylene from a gaseous substrate.

[0238] To demonstrate that the Ptb-Buk system also allows acetone, isopropanol, or isobutylene synthesis from gaseous substrates, a plasmid was constructed that contains the same genes as in Example 1, thl+ptb-buk+adc under control of a clostridial promoter on a shuttle vector that allows expression in acetogens such as C. autoethanogenum, C. ljungdahlii or C. ragsdalei.

[0239] The pMTL plasmid is a shuttle plasmid system for introducing circular dna into Clostridia via E. coli conjugation (Heap, J Microbiol Methods, 78: 79-85, 2009. The genes of interest (i.e., hbd, phaB, thlA, ptb, buk, and aor1) were cloned into the lacZ region of the plasmids using common techniques in molecular biology including dna restriction digestion followed by ligation, and the golden gate dna assembly technology when more than one pieces of dna fragments were to be cloned simultaneously into the plasmid. The constructed plasmids are verified by DNA sequencing.

[0240] Production of acetone and isopropanol was previously demonstrated in C. autoethanogenum using a plasmid pMTL85147-thlA-ctfAB-adc encoding thl+ctfAB+adc (WO 2012/115527) under the control of a clostridial promoter from the Wood-Ljungdahl gene cluster. In this plasmid the ctfAB genes encoding the CoA transferase were replaced directly with ptb-buk genes encoding the Ptb-Buk system. This was done as described in Example 1 using the CPEC method. The resulting plasmid is pMTL85147-thlA-ptb-buk-adc.

[0241] Oligonucleotides used for the amplification of ptb-buk and cloning into pMTL8317-thl-ptb-buk-adc are described below.

TABLE-US-00008 SEQ ID NO: Name Sequence Direction 107 thlA-ptb-R1 ATTTCCTCCCTTTCTAGCACTTT reverse TCTAGCAATATTG 108 adc-buk-F1 TAAGGTTTACTAAGGAGGTTGT forward TTTATGTTAGAAAG 109 thlA-ptb-F1 GCTAGAAAAGTGCTAGAAAGG forward GAGGAAATGAACATG 110 Buk-adc-R1 AAAACAACCTCCTTAGTAAACC reverse TTAGCTTGTTCTTC

[0242] C. autoethanogenum DSM10061 and DSM23693 (a derivate of DSM10061) were sourced from DSMZ (The German Collection of Microorganisms and Cell Cultures, Inhoffenstra e 7 B, 38124 Braunschweig, Germany).

[0243] Strains were grown at 37.degree. C. in PETC medium at pH 5.6 using standard anaerobic techniques (Hungate, Meth Microbiol, 3B: 117-132, 1969; Wolfe, Adv Microb Physiol, 6: 107-146, 1971). 30 psi CO-containing steel mill gas (collected from New Zealand Steel site in Glenbrook, NZ) or a synthetic gas blend with same composition of 44% CO, 32% N.sub.2, 22% CO.sub.2, 2% H.sub.2 was used as substrate for autotrophic growth. For solid media, 1.2% bacto agar (BD, Franklin Lakes, N.J. 07417, USA) was added.

[0244] The construct was synthesized and then transformed into C. autoethanogenum via conjugation. For this, the expression vector was first introduced into the conjugative donor strain E. coli HB101+R702 (CA434) (Williams, J Gen Microbiol, 1136: 819-826, 1990) (the donor) using standard heat shock transformation. Donor cells were recovered in SOC medium (Sambrook, Molecular Cloning: A Laboratory Manual, Vol 3, Cold Spring Harbour Press, 1989) at 37.degree. C. for 1 h before being plated on to LB medium (Sambrook, Molecular Cloning: A Laboratory Manual, Vol 3, Cold Spring Harbour Press, 1989) plates containing 100 .mu.g/ml spectinomycin and 25 .mu.g/ml chloramphenicol. LB plates were incubated at 37.degree. C. overnight. The next day, 5 ml LB aliquots containing 100 .mu.g/ml spectinomycin and 25 .mu.g/ml chloramphenicol were inoculated with several donor colonies and incubated at 37.degree. C., shaking for approximately 4 h, or until the culture was visibly dense but had not yet entered stationary phase. 1.5 ml of the donor culture was harvested in a microcentrifuge tube at room temperature by centrifugation at 4000 rpm for 2 min, and the supernatant was discarded. The donor cells were gently resuspended in 500 .mu.l sterile PBS buffer (Sambrook, Molecular Cloning: A Laboratory Manual, Vol 3, Cold Spring Harbour Press, 1989) and centrifuged at 4000 rpm for 2 min and the PBS supernatant was discarded. The pellet was introduced into an anaerobic chamber and gently resuspended in 200 .mu.l during late exponential phase C. autoethanogenum culture (the recipient). The conjugation mixture (the mix of donor and recipient cells) was spotted onto PETC-MES+fructose agar plates and left to dry. When the spots were no longer visibly wet, the plates were introduced into a pressure jar, pressurized with syngas to 25-30 psi and incubated at 37.degree. C. for .about.24 h. After 24 h incubation, the conjugation mixture was removed from the plates by gently scraping it off using a 10 .mu.l inoculation loop. The removed mixture was suspended in 200-300 .mu.l PETC medium. 100 .mu.l aliquots of the conjugation mixture were plated on to PETC medium agar plates supplemented 15 .mu.g/ml thiamphenicol to select for transformants bearing the plasmid, which confers resistance to thiamphenicol via expression of chloramphenicol acetyl-transferase.

[0245] Three distinct colonies of C. autoethanogenum bearing the pMTL85147-thlA-ptb-buk-adc plasmid were inoculated into 2 mL of PETC-MES medium with 15 .mu.g/ml thiamphenicol and grown autotrophically at 37.degree. C. with 100 rpm orbital shaking for three days. Cultures were diluted to OD.sub.600 nm=0.05 in 10 mL PETC-MES medium with 15 .mu.g/ml thiamphenicol in serum bottles and grown autotrophically at 37.degree. C. with 100 rpm orbital shaking for five days, sampling daily to measure biomass and metabolites. In parallel a control strain was examined where the expression plasmid encoded only thl and adc under the control of the Wood-Ljungdahl cluster promoter, with no ctfAB or ptb-buk genes to catalyse the formation of acetoacetate from acetoacetyl-CoA (pMTL85147-thlA-adc). Cultures were sampled for five days in order to monitor metabolites and biomass accumulation.

[0246] Isopropanol concentrations as well as concentrations of ethanol, acetic acid, 2,3-butanediol and lactic acid were measured by high-performance liquid chromatography (HPLC) on an Agilent LC with refractive index (RI) detection at 35.degree. C. Samples were prepared by diluting 400 .mu.L with 100 .mu.L of 5-sulfosalicylic acid solution (1% w/v in 1 M sulphuric acid), followed by a 3 minute centrifugation at 14,000 rpm; the supernatant was transferred to a glass vial for analysis. Separation was carried out with a 10 .mu.L injection on to an Alltech IOA-2000 column (150 mm.times.6.5 mm.times.8 .mu.m) at 0.7 mL/min and 65.degree. C. under isocratic conditions, using 5 mM sulphuric acid mobile phase.

[0247] In some instances, a longer HPLC method was used to improve peak separation. In this method, isopropanol, ethanol, acetate, 2,3-butanediol, and also 3-hydroxybutyrate (which is not separated using the shorter method) concentrations were measured by high-performance liquid chromatography (HPLC) on an Agilent 1260 Infinity LC with refractive index (RI) detection at 35.degree. C. Samples were prepared by diluting 400 .mu.L with 100 .mu.L of 5-sulfosalicylic acid solution (1% w/v in 1 M sulphuric acid), followed by a 3 minute centrifugation at 14,000 rpm; the supernatant was transferred to a glass vial for analysis. Separation was carried out with a 10 .mu.L injection on to an Aminex HPX-87H column (300 mm.times.7.8 mm.times.9 .mu.m) at 0.6 mL/min and 35.degree. C. under isocratic conditions, using 5 mM sulphuric acid mobile phase.

[0248] C. autoethanogenum bearing the pMTL85147-thlA-ptb-buk-adc produced isopropanol up to 0.804 g IPA/g of biomass, whereas control strain C. autoethanogenum with pMTL85147-thlA-adc that does not contain Ptb-Buk produced no IPA (FIG. 12).

[0249] This experiment clearly demonstrates that Ptb-Buk is able to perform the conversion of acetoacetyl-CoA to acetoacetate in the isopropanol pathway when using a gaseous substrate. Ptb-Buk can be used in place of a CoA transferase or a thioesterase in a gas-fermenting acetogen such as C. autoethanogenum, exemplified using a route that comprises steps 1, 2, 3, and 4 of FIG. 1.

[0250] C. autoethanogenum contains a native primary:secondary alcohol dehydrogenase that converts acetone to isopropanol (Kopke, Appl Environ Microbiol, 80: 3394-3403, 2014). It has been demonstrated that knock-out of this gene eliminates conversion of acetone to isopropanol in C. autoethanogenum (WO 2015/085015). In background of this knock-out, it becomes possible to produce acetone (rather than isopropanol) via the Ptb-Buk system from a gaseous feedstock, using the same genes comprising steps 1, 2, and 3 of FIG. 1. Addition of hydroxyisovalerate synthase and decarboxylase genes (van Leeuwen, Appl Microbiol Biotechnol, 93: 1377-1387, 2012) to this strain would enable isobutylene production from gas in C. autoethanogenum or similar bacteria comprising of steps 1, 2, 3, 5, and 6 of FIG. 1.

[0251] Acetoacetate can also be converted to 3-hydroxybutyrate via a 3-hydroxybutyrate dehydrogenase Bdh. A 3-hydroxybutyrate dehydrogenase was identified in the genome of C. autoethanogenum (AGY75962) and other acetogens as C. ljungdahlii (ADK16920.1). This activity can be combined with Ptb-Buk (or CoA transferase) conversion of acetoacetyl-CoA to acetoacetate for 3-hydroxybutyrate production in a strain expressing genes thlA, ptb-buk (or ctfAB) and bdh resulting a pathway comprising steps 1, 2, and 15 of FIG. 1. Low levels of 3-hydroxybutyrate formation (up to 2 g/L) via this route have been demonstrated in C. autoethanogenum. These levels could be enhanced by overexpressing the Bdh gene that is only expressed in at low levels natively.

[0252] In one experiment, C. autoethanogenum was transformed with plasmid pMTL82256-thlA-ctfAB as described in Example 2. The production was monitored for 10 days from six biological replicates under autotrophic conditions as described in Example 2. The average of 3-HB after 10 days was 1.86.+-.0.14 g/L. At day 10, 1,3-butanediol was produced (from 3-HB) at an average titer of 0.38.+-.0.05 g/L (FIG. 37). No acetone or isopropanol was formed. This demonstrates that 3-HB can be produced efficiently via acetoacetate through native enzymes.

[0253] In certain embodiments, it may be desirable to knock out or knock down expression of 3-hydroxybutyrate dehydrogenases, such as Bdh, to prevent carbon drain to 3-HB and therefore boost production of products such as acetone, isopropanol, and isobutylene.

Example 3

[0254] This example demonstrates the ability of Ptb-Buk to convert (R)-3-hydroxybutyryl-CoA to (R)-3-hydroxybutyryrate in E. coli in vivo for production of (R)-hydroxybutyrate, acetone, isopropanol, or isobutylene.

[0255] Pathways were designed and constructed that rely on the Ptb-Buk system for (R)-3-hydroxybutyrate production from (R)-3-hydroxybutyryl-CoA. Additionally, a 3-hydroxybutyrate dehydrogenase (Bdh) was utilized for conversion of (R)-3-HB to acetoacetate. It has been reported that Ralstonia pickettii have two 3-hydroxybutyrate dehydrogenases Bdh1 and Bdh2 that are able to convert 3-hydroxybutyrate to acetoacetate in vitro (Takanashi, J Biosci Bioeng, 101: 501-507, 2006). One pathway was designed making use of this enzyme for acetone production (steps 1, 13, 14, 15, 3 of FIG. 1), while recycling the reducing equivalents produced in the production of (R)-3-hydroxybutyryl-CoA and the ATP generated by Ptb-Buk (FIG. 6).

[0256] The pathways were constructed in a modular fashion using the pDUET vector system (Novagen). The two modules described in example above (pACYC-ptb-buk for expression of Ptb-Buk and pCOLA-thlA-adc for expression of thiolase and acetoacetate decarboxylase) were used together with two additional modules containing either (R)-specific 3-hydroxybutyrate dehydrogenase phaB of Cupravidus necator (WP_010810131.1) alone (pCDF-phaB) and one with 3-hydroxybutyrate dehydrogenase bdh1 gene of Ralstonia pickettii (BAE72684.1) (pCDF-phaB-bdh1) in vector pCDF. Both phaB and bdh1 gene were synthesized from GeneArt and cloned under control of the T7 promoter present in via restriction independent cloning with the circular polymerase extension cloning (CPEC) method (Quan, PloS One, 4:e6441, 2009).

[0257] Oligonucleotides used for amplification of bdh1 gene:

TABLE-US-00009 SEQ ID NO: Name Sequence Direction 111 pDuet-insert2-R1 CATATGTATATCTCCTTCTTA forward TACTTAAC 112 insert2-pDuet-F1 GTTAAGTATAAGAAGGAGATA forward TACATATG 113 pDuet-insert2-F1 CCTCGAGTCTGGTAAAGAAAC forward 114 insert2-pDuet-R1 GTTTCTTTACCAGACTCGAGG forward

[0258] Oligonucleotides used for amplification of phaB gene:

TABLE-US-00010 SEQ ID NO: Name Sequence Direction 115 pCDF-phaB-pACYC- CTATTCTTTGTGTCATGGTA forward phaB-R1 TATCTCCTTATTAAAG 116 pCDF-phaB-phaB- ATAAGGAGATATACCATGAC forward pACYC-F1 ACAAAGAATAGCATAC 117 pCDF-phaB-pACYC- TGGTTTACACATGGGATAAG forward phaB-F1 ATCCGAATTCGAGCTC 118 pCDF-phaB-phaB- AGCTCGAATTCGGATCTTAT forward pACYC-R1 CCCATGTGTAAACCAC

[0259] After the plasmids pACYC-ptb-buk (SEQ ID NO: 105), pCOLA-thlA-adc (SEQ ID NO: 106), pCDF-phaB (SEQ ID NO: 119) and pCDF-phaB-bdh1 (SEQ ID NO: 120) were constructed, they were transformed individually and in combinations into E. coli BL21 (DE3) (Novagen) and growth experiments were carried out in quadruplicate in 1.5 mL cultures in 12-well plates at 28.degree. C. with 160 rpm orbital shaking using M9 minimal medium with glucose. The cultures were inoculated at an OD 600 nm of 0.1 and after 2 h of growth induced with different concentrations of IPTG (0, 50, 100 .mu.M). The plates were sealed using BioRad plate tape strips and each well pierced with a green tipped needle to provide micro-aerobic conditions. Growth was carried out for another 64 h of induction. The experiment was repeated 3 times. Metabolites were measured as described in previous examples.

[0260] Cultures containing a combination of plasmids pACYC-ptb-buk, pCOLA-thlA-adc and pCDF-phaB produced between 1.65-2.4 g/L (R)-3-hydroxybutyrate (depending on level of inducer), with only very small amounts of byproducts (FIGS. 13A-F), demonstrating the efficiency of the Ptb-Buk system to convert (R)-3-hydroxybutyryl-CoA to (R)-3-hydroxybutyryrate and support growth (FIG. 13A-F). In cultures that also expressed bdh1 (containing a combination of plasmids pACYC-ptb-buk, pCOLA-thlA-adc, and pCDF-phaB-bdh1) only small amounts of (R)-3-hydroxybutyryrate were found in the culture media, while between 0.89-1.16 g/L acetone was found (depending on level of inducer), indicating that bdh1 gene is efficient in converting (R)-3-hydroxybutyrate to acetoacetate and further to acetone. In all plasmid combinations that lack Ptb-Buk, no 3-hydroxybutyrate or acetone was found (FIG. 13A-F). In these cultures, acetate levels were significantly higher.

[0261] This experiment clearly demonstrates that Ptb-Buk is able to perform the conversion of (R)-3-hydroxybutyrate-CoA to 3-hydroxybutyrate and also that Bdh1 is able in vivo to convert 3-hydroxybutyrate further to acetoacetate by recycling the reducing equivalents produced in the production of (R)-3-hydroxybutyryl-CoA. The experiment also highlights that Ptb-Buk is able to support growth and therefore acetate production becomes unnecessary. Production of (R)-3-hydroxybutyrate formation was exemplified in a strain that comprises steps 1, 13, and 14 of FIG. 1. Production of acetone was exemplified via a route that comprises steps 1, 13, 14, 15, and 3 of FIG. 1.

[0262] It is well known that isopropanol can be produced from acetone by addition of a primary:secondary alcohol dehydrogenase (step 4 in FIG. 1) (Kopke, Appl Environ Microbiol, 80: 3394-3403, 2014) and that isobutylene can be produced from acetone via addition of a hydroxyisovalerate synthase (step 5 in FIG. 1) and decarboxylase (step 6 in FIG. 1) (van Leeuwen, Appl Microbiol Biotechnol, 93: 1377-1387, 2012). A pathway can be constructed that includes the above-demonstrated acetone route via Ptb-Buk with the genes thlA, ptb-buk, and adc and a primary:secondary alcohol dehydrogenase gene (e.g., Genbank NC_022592, pos. 609711..610766; CAETHG_0553; NCBI-GeneID: 17333984) that would allow isopropanol production via the Ptb-Buk system in E. coli (steps 1, 13, 14, 15, 3, and 4 of FIG. 1). Similarly, a pathway can be constructed that includes the above-demonstrated acetone route via Ptb-Buk with the genes thlA, ptb-buk, and adc and genes for a hydroxyisovalerate synthase and decarboxylase that would allow isobutylene production via the Ptb-Buk system in E. coli (steps 1, 13, 14, 15, 3, 5, and 6 of FIG. 1).

Example 4

[0263] This example demonstrates the production of (R)-3-hydroxybutyrate and 1,3-butanediol in C. autoethanogenum. It also demonstrates production of 1,3-butanediol in absence of 2,3-butanediol.

[0264] A strain of C. autoethanogenum was constructed in which the native pathway for 2,3-butanediol production was inactivated and replaced with genes for (R)-3-hydroxybutyryl-CoA formation. This was achieved by replacing the acetolactate decarboxylase gene (budA) on genome of C. autoethanogenum with genes for thiolase (thlA of C. acetobutylicum; GenBank NC_001988, position 82040..83218; CA_P0078; NCBI-GeneID 1116083) and (R)-specific 3-hydroxybutyrate dehydrogenase (phaB of Cupravidus necator; GenBank WP_010810131.1) resulting in strain C. autoethanogenum budA::thlAphaB.

[0265] To replace budA gene with thlA and phaB genes a plasmid, pMTL8225-budA::thlA-phaB (FIG. 14), with E. coli toxin gene mazF under tet3n0 tetracycline inducible promoter (for counter selection), .about.1 kb upstream homology arm of budA gene, thlA, phaB, ermB cassette flanked by loxP sites and .about.1 kb downstream homology arm of budA gene were assembled on plasmid pMTL-tet3no.

[0266] The .about.1 kb upstream and downstream homology arms of budA were PCR amplified from C. autoethanogenum with primers SN01/SN02 and SN07/SN08. thlA and phaB genes were PCR amplified from genomic DNA of Cupriavidus necator using primers SN03/SN04mod. The ermB cassette flanked with loxP sites was PCR amplified using primers SN05mod/SN06. tet3no promoter flanked by FseI and PmeI was synthesized and treated with restriction enzymes FseI and PmeI and cleaned. The PCR products and digested vector were assembled using GeneArt Seamless cloning kit from Life Technologies and plasmid pMTL8225-budA::thlA-phaB (SEQ ID NO: 121) with no mutations in the inserted fragments was used to transform C. autoethanogenum by conjugation as described in previous examples.

[0267] Following conjugation and selection on trimethoprim and clarithromycin, 9 colonies were streaked twice on PETC-MES agar plates with clarithromycin and anhydrotetracycline to induce the expression of mazF genes. The colonies from clarithromycin and anhydrotetracycline should have the budA genes replaced with thlA and phaB genes and ermB cassette. This was verified by PCR using primers Og31f/Og32r flanking the homology arms and KAPA polymerase (FIG. 15).

[0268] While a band of .about.3.3 kb is amplified from the wild type strain, bands of .about.5.7 kb were amplified from colonies 1, 4, 7 and 9 indicating the replacement of budA gene with thlA, phaB and ermB cassette. The above event was further confirmed by sequencing the PCR products of all 4 clones. With the resulting modification the expression of thlA and phaB genes is driven by the promoter upstream of budA gene.

TABLE-US-00011 SEQ ID NO: Description Sequence 122 SN01 ATTTACAAATTCGGCCGGCCTACCTCCTCGTA TAAATAAGATG 123 SN02 CTAGCTATTACAACTTCTTTCATATTACATTC ACCTCTATGTC 124 SN03 GACATAGAGGTGAATGTAATATGAAAGAAGTT GTAATAGCTAG 125 SN04mod GTATAGCATACATTATACGAACGGTATTATCC CATGTGTAAACCACCGT 126 SN05mod TTCGTATAATGTATGCTATACGAAGTTATCCT TAGAAGCAAACTTAAG 127 SN06 GTCTAGTGTTTTTTTCTATCAATACTCTAGAT ACCGTTCGTATAGC 128 SN07 TGTATGCTATACGAACGGTAAGTATTGATAGA AAAAAACACTAGAC 129 SN08 CAAAAAGGAGTTTAAACAAAAAGTCATAAACC TGGATAAC 130 Og31f CCGTTTCTCACAACAACAATACCAG 131 Og32r AAACCACCTTGACGATGAAACCATA

[0269] A fermentation with C. autoethanogenum budA::thlA-phaB strain was carried out. The culture was grown at 37.degree. C. under synthetic gas (50% CO, 18% CO.sub.2, 2% H.sub.2, and 30% N.sub.2) that was continuously fed into the bioreactor. The gas flow was initially set at 50 ml/min, increasing to 400 ml/min over the course of the experiment, while the agitation was increased from 200 rpm to 500 rpm. The fermentation was carried out for close to 5 days. Metabolites were measured as described in examples above.

[0270] The concentration of 1,3-butanediol and other metabolites, such as 2-hydroxyisobutyric acid, were measured using gas chromatography (GC) analysis, employing an Agilent 6890N GC equipped a Agilent CP-SIL 5CB-MS (50 m.times.0.25 .mu.m.times.0.25 .mu.m) column, autosampler and a flame ionization detector (FID). Samples were prepared by diluting 400 .mu.L of sample with 400 .mu.L of acetonitrile, followed by a 3 minute centrifugation at 14,000 rpm; the supernatant was transferred to a glass vial and the sample was dried in a Thermo SpeedVac. Once dry, the samples were then suspended in a solution of 400 .mu.L of N,O-Bistrifluoroacetamide (BSTFA) and pyridine (3:1 ratio) and heated in a sealed glass vial for 60 minutes at 60.degree. C. Samples were transferred to an autosampler for analysis using a 1 .mu.L injection, a split ration of 30 to 1, and an inlet temperature of 250.degree. C. Chromatography was performed with an oven program of 70.degree. C. (no hold) to a ramp of 3.degree. C./min to 110.degree. C. to a ramp of 15.degree. C./min to 230.degree. C., followed by a final ramp of 40.degree. C./min to 310.degree. C. with a 3-min hold. The column flow rate was 1.8 ml/min, with helium as the carrier gas. The FID was kept at 320.degree. C., with hydrogen at 40 ml/min, air at 400 ml/min, and helium at 20 ml/min as the makeup gas.

[0271] Surprisingly, up to 1.55 g/L 3-hydroxybutyrate was produced from gas in a C. autoethanogenum budA::thlA-phaB strain expressing thlA and phaB (FIG. 16). A native thioesterase may convert the formed 3-hydroxybutyryl-CoA to 3-hydroxybutyrate. In the genome sequence, three putative thioesterases were identified.

[0272] Even more surprising, it was also found that, along 3-hydroxybutyrate formation, there was also 1,3-butanediol formation of up to 150 mg/L (FIG. 16). This may be due to native aldehyde:ferredoxin oxidoreductase (AOR) and alcohol dehydrogenase activity. Two AOR genes and several alcohol dehydrogenases are present in the genome of C. autoethanogenum (Mock, J Bacteriol, 197: 2965-2980, 2015). This reduction of 3-hydroxybutyrate is powered by reduced ferredoxin and thus can be directly coupled to CO oxidation, which provides reduced ferredoxin (CO+Fd.sub.ox.fwdarw.CO.sub.2+Fd.sub.red) (FIG. 7).

[0273] 1,3-BDO production was also demonstrated from gas via an alternative route using a butyraldehyde dehydrogenase Bld from Clostridium saccharoperbutylacetonicum (AAP42563.1) (SEQ ID NO: 80). The bld gene was synthesized and cloned together with the same thiolase (thlA of C. acetobutylicum) and (R)-specific 3-hydroxybutyrate dehydrogenase (phaB of Cupravidus necator) into a plasmid pMTL8315-Pfdx-thlA-phaB-bld (SEQ ID NO: 132). Bld and phaB genes were amplified from the above plasmid via primers in table below and cloned into existing plasmid pMTL85147-thlA (WO 2012/115527).

TABLE-US-00012 SEQ ID NO: Primer Sequence Direction 133 bld-phaB-F1 ACATGGGATAAG forward AAGGAGATATAC ATATGATAAAAG 134 bld-pMTL-R1 CGTCGACTCTAG forward ATTAACCTGCTA AAACACATCTTC 135 pMTL-bld-F1 GTGTTTTAGCAG forward GTTAATCTAGAG TCGACGTCACGC

[0274] The resulting construct was transformed into C. autoethanogenum as described above and a growth experiment was conducted in serum bottles with 50-mL PETC media and pressurized at 30 psi with CO-containing steel mill gas (collected from New Zealand Steel site in Glenbrook, NZ) or a synthetic gas blend with same composition of 44% CO, 32% N.sub.2, 22% CO.sub.2, 2% H.sub.2.

[0275] 1,3-BDO production was demonstrated via this route from gas (FIG. 17A), but production was less (up to 67 mg/L 1,3-BDO) than via the AOR route and, in contrast to the AOR route, growth was impacted when expressing the bld gene comparing to the C. autoethanogenum wild-type (FIG. 17B).

[0276] In another experiment, C. autoethanogenum transformed with plasmid pMTL83159-phaB-thlA as described in Example 2 produced 0.33 and 0.46 g/L of 3-HB and 1,3-BDO, respectively, in a bottle experiment under autotrophic conditions as described in Example 2 (FIG. 40).

Example 5

[0277] This example demonstrates the production of (S)-3-hydroxybutyrate and 1,3-butanediol in C. autoethanogenum.

[0278] A plasmid was constructed that expresses a thiolase (thlA from C. acetobutylicum; SEQ ID NO: 136) and an (S)-specific 3-hydroxybutyrate dehydrogenase (hbd1 from C. kluyveri; SEQ ID NO: 137) under either a ferredoxin promoter (P.sub.fdx isolated from C. autoethanogenum; SEQ ID NO: 138) or a pyruvate-ferredoxin oxidoreductase promoter (P.sub.pfor isolated from C. autoethanogenum; SEQ ID NO: 139). The plasmid was constructed as follows: P-hbd1-rbs2-thlA and pieced together and cloned into the pMTL83151 vector (Heap, J Microbiol Meth, 78: 79-85, 2009) by routine methods in molecular cloning, including restrictive enzyme digestion followed by ligation, overlap extension polymerase chain reaction, seamless cloning (Thermo Fisher Scientific), and GeneArt Type IIs (Thermo Fisher Scientific). The operon P-hbd1-rbs2-thlA was cloned in between restriction sites NotI and XhoI found in the multiple cloning region of the plasmid. P is the constitutive promoter which contains an intact ribosome binding site (rbs). rbs2 (SEQ ID NO: 140) is the ribosome binding site for expressing thlA. The stepwise procedures were amplification of the P, hbd1, and thlA from existing templates with primers listed below.

TABLE-US-00013 SEQ ID NO: Name Sequence Direction 141 Pfdx-F1 AAAGGTCTCCGGCCGCGCTCACTATCT forward GCGGAACC 142 Pfdx-R1 TTTGGTCTCGAATTCTGTAACACCTCC reverse TTAATTTTTAG 143 Ppfor-F1 AAAGGTCTCCGGCCGCAAAATAGTTGA forward TAATAATGCAGAG 144 Ppfor-R1 TTTGGTCTCGAATTCCTCTCCTTTTCA reverse AGCATATA 145 hbd1-F1 AAAGGTCTCGAATTCAAAGATCTATGT forward CTATTAAATCAGTTGCAG 146 hbd1-R1 TTTGGTCTCCCTCCTTTCTATTTCTAA reverse TATGCGAAAAATCCTTTACC 147 thlA-F1 AAAGGTCTCAGGAGGTGTTACATATGA forward AAGAAGTTGTAATAGCTAGTGC 148 thlA-R1 TTTGGTCTCCTCGAGTATGGATCCCTA reverse GCACTTTTCTAGCAATATTGC

[0279] The polymerase chain reactions were performed as follow using Kapa Taq PCR Kit (Kapa Biosystems). Set annealing temperature at 56.degree. C., and extension for 1 minute. Repeat PCR reaction for 30 cycles. Afterwards, PCR products were desalted using the DNA Clean & Concentrator Kit (Zymo Research Corporation).

[0280] pMTL83151 plasmid backbone was prepared by carrying out the NotI/XhoI double digestion using the FastDigest NotI and FastDigest XhoI (Thermo Fisher Scientific) following the protocol provided, followed by treatment with alkaline phosphate, using the FastAP Alkaline Phosphatase (Thermo Fisher Scientific) and the protocols provided. The digested backbone was then desalted with the DNA Clean & Concentrator Kit (Zymo Research Corporation).

[0281] The assembly of the PCR products and the plasmid backbone was carried out using the GeneArt Type IIs Kit (Thermo Fisher Scientific). The resulting plasmid was then isolated from the E. coli plasmid expression host using the QIAprep Spin Miniprep Kit (Qiagen).

[0282] To introduce the assembled plasmids pMTL8315-Pfdx-hbd1-thlA and pMTL8315-Ppfor-hbd1-thlA consisting of the operons, the plasmid was first introduced into the E. coli CA434 strain by chemical transformation. Afterwards, conjugation was performed by mixing the transformed CA434 strain with a C. autoethanogenum production host on a solid LB-agar media, and incubation in an anaerobic environment under pressure with a mix consisting of carbon monoxide and hydrogen as described in Example 2. C. autoethanogenum, after conjugation, was selected by successive growth on the solid media containing the proper antibiotic and trimethroprim to remove the remaining E. coli CA434 strain, under the anaerobic conditions.

[0283] The C. autoethanogenum strains carrying the introduced pMTL8315-Pfdx-hbd1-thlA or pMTL8315-Ppfor-hbd1-thlA plasmids consisting of the operon P-hbd1-rbs2-thlA were grown in a 10-mL PETC media in a 250-mL Schott bottle, sealed tight with rubber septum and cap, and pressurized at 30 psi with CO-containing steel mill gas (collected from New Zealand Steel site in Glenbrook, NZ) or a synthetic gas blend with same composition of 44% CO, 32% N.sub.2, 22% CO.sub.2, 2% H.sub.2. Metabolites were measured as described in previous examples.

[0284] Surprisingly, there was 3-hydroxybutyrate produced from gas in C. autoethanogenum cultures expressing thlA and hbd1 (FIG. 18A). A native thioesterase may convert the formed 3-hydroxybutyryl-CoA to 3-hydroxybutyrate. In the genome sequence, three putative thioesterases were identified. In the strain carrying pMTL8315-Pfdx-hbd1-thlA up to 2.55 g/L 3-hydroxybutyrate was found (FIG. 18A).

[0285] Even more surprising, it was also found that 3-hydroxybutyrate is over time converted to 1,3-butanediol, at the end of growth up to 1.1 g/L 1,3-butanediol was produced in strain carrying plasmid pMTL8315-Pfdx-hbd1-thlA (FIG. 18A). This may be due to native aldehyde:ferredoxin oxidoreductase (AOR) and alcohol dehydrogenase activity. Two AOR genes and several alcohol dehydrogenases are present in the genome of C. autoethanogenum (Mock, J Bacteriol, 197: 2965-2980, 2015). This reduction of 3-hydroxybutyrate (and reduction of acetate to ethanol; FIG. 18B) is powered by reduced ferredoxin and thus can be directly coupled to CO oxidation, which provides reduced ferredoxin (CO+Fd.sub.ox.fwdarw.CO.sub.2+Fd.sub.red) (FIG. 7).

[0286] The same strain of C. autoethanogenum carrying plasmid pMTL8315-Pfdx-hbd1-thlA was also tested in continuous fermentation. Fermentation was carried out as described in previous example, but the culture was turned continuous with a dilution rate with fresh media of around 0.05 at day 2 and then increased to 1.0 at day 3. High 3-hydroxybutyrate production of up to 7 g/L was observed with 1,3-BDO production of 0.5 g/L.

[0287] To improve production of (S)-3-hydroxybutyrate and 1,3-butanediol and avoid synthesis of another form of butanediol (2,3-butanediol), plasmid pMTL-HBD-ThlA was introduced into a strain that has an inactivated 2,3-butanediol pathway where the acetolactate decarboxylase gene BudA has been deleted (U.S. Pat. No. 9,297,026). This budA knockout eliminated the major pathway to 2,3-BDO, increasing the specificity for 3-HB and 1,3-BDO production. When pMTL-HBD-ThlA was expressed in the budA deletion strain, a total of 15% C-mol was achieved for both 3-HB and 1,3-BDO (FIG. 41).

TABLE-US-00014 Selectivity (C-mol %) Acetate 14.7 Ethanol 64.9 2,3-BDO 1.3 Biomass 3.7 3-HB 10.4 1,3-BDO 5.0

[0288] As a comparison, in a strain expressing the same plasmid, pMTL83159-hbd-thlA without budA knockout, the total specificity for the production of 3-HB and 1,3-BDO at the steady state was only 6.9%

TABLE-US-00015 Selectivity (C-mol %) Acetate 0.4 Ethanol 84.3 2,3-BDO 6.2 Biomass 2.2 3-HB 3.5 1,3-BDO 3.4

Example 6

[0289] This example demonstrates that the Ptb-Buk system is efficient in C. autoethanogenum on a range of acyl-CoAs including acetoacetyl-CoA, 3-hydroxybutyryl-CoA, and 2-hydroxyisobutyryl-CoA

[0290] The Ptb-Buk system was expressed from a plasmid in C. autoethanogenum and its activity measured using a CoA hydrolysis assay. For this, ptb-buk genes from C. beijerinckii NCIMB8052 (GenBank NC_009617, position 232027..234147; Cbei_0203-204; NCBI-GeneID 5291437-38) were amplified from genomic DNA of C. beijerinckii NCIMB8052 and cloned under control of a pyruvate-ferredoxin oxidoreductase promoter (P.sub.pfor isolated from C. autoethanogenum; SEQ ID NO: 139) into pMTL82251 vector ((Heap, J Microbiol Meth, 78: 79-85, 2009) by routine methods in molecular cloning, including restrictive enzyme digestion followed by ligation, overlap extension polymerase chain reaction, seamless cloning (Thermo Fisher Scientific), and GeneArt Type IIs (Thermo Fisher Scientific) as described in Example 5. Oligonucleotides are described below.

TABLE-US-00016 SEQ ID NO: Name Sequence Direction 149 Ppfor-F2 aaacagctatgaccgcGGCCGCAAAA forward TAGT 150 Ppfor-R2 ttactcatTGGATTCCTCTCCTTT reverse 151 Ptb-Buk-F2 ggaatccaATGAGTAAAAACTTTGAT forward GAG 152 Ptb-Buk-R2 caggcctcgagatctcCTAGTAAACC reverse TTAGCTTGTTC

[0291] The resulting plasmid pMTL82256-ptb-buk (SEQ ID NO: 153) was introduced into C. autoethanogenum as described in previous examples.

[0292] Acyl-CoA hydrolysis assays were performed as follows. C. autoethanogenum cells were harvested at OD 2 (late exponential phase) by centrifugation (14,000 rpm for 1 min at 4.degree. C.). Cells were re-suspended in 500 .mu.l lysis buffer (potassium phosphate buffer, pH 8). Cells were lysed using a freeze thaw cycle (optional), sonication 6.times.30 s at amplitude 20 on ice. Samples were centrifuged for 10 min at 14,000 rpm at 4.degree. C. and the supernatant with soluble proteins was removed. The protein concentration was measured, e.g., with a Bradford assay.

[0293] The assay mix contained: 484 .mu.l of potassium phosphate buffer pH 8.0, 1 .mu.l of DTNB (final concentration of 0.1 mM), 10 .mu.l of cell lysate, and 5 .mu.l of CoA (final concentration of 500 .mu.M). All the components were mixed in a quartz cuvette (1 ml cuvette with a read length of 1 cm) except the protein. The assay was started by adding the cell lysate and following the reaction in a spectrophotometer at 405 nm, 30.degree. C. for 3 min. A control without lysate was run to measure autolysis of the acyl-CoA.

[0294] To determine activity, slope on the linear part of the curve (usually in the first 30 s), was calculated. The protein amount was normalized and slope was divided by protein amount. An extinction coefficient (14,150 M.sup.-1 cm.sup.-1) was used to calculate the specific activity in M/s/mg. The activity of the negative control was subtracted.

[0295] The assay was performed with acetoacetyl-CoA, a racemic mix of 3-hydroxybutyryl-CoA (3-HB-CoA) and 2-hydroxyisobutyryl-CoA (2-HIB-CoA). The possibility of artificially low hydrolysis rates for 3-HB-CoA and 2-HIB-CoA due to potential substrate limitation was addressed by repeating the hydrolysis assays for C. autoethanogenum lysates using different concentrations of acyl-CoA, 500 .mu.M and 200 .mu.M.

[0296] The results of the assay show significantly increased CoA hydrolysis in lysates of C. autoethanogenum carrying plasmid pMTL82256-ptb-buk expressing the Ptb-Buk system on a range of acyl-CoAs including acetoacetyl-CoA, 3-hydroxybutyryl-CoA and 2-hydroxyisobutyryl-CoA (FIGS. 20A-B). Notably, there is also CoA hydrolysis for acyl-CoAs as 2-hydroxyisobutyryl-CoA that are not hydrolysed by the C. autoethanogenum wild-type. With acetoacetyl-CoA and 3-hydroxybutyryl-CoA some native CoA hydrolysis activity was observed.

Example 7

[0297] This example demonstrates the disruption of identified native thioesterase genes improve efficiency of the Ptb-Buk and CoA transferase system by increasing the pool of available acyl-CoAs such as acetoacetyl-CoA, 3-hydroxybutyryl-CoA or 2-hydroxyisobutyryl-CoA.

[0298] In contrast to the Ptb-Buk system, where energy is conserved in the form of ATP during conversion of acyl-CoAs to their respective acids, no energy is conserved if the CoAs are simply hydrolyzed.

[0299] In hydrolase assays it was found that there is native hydrolysis activity for acetoacetyl-CoA and 3-hydroxybutyryl-CoA in C. autoethanogenum.

[0300] Acyl-CoA hydrolysis assays with acetoacetyl-CoA, a racemic mix of 3-hydroxybutyryl-CoA (3-HB-CoA) and 2-hydroxyisobutyryl-CoA (2-HIB-CoA were performed as described in previous example. The results of the assay show cleavage of acetoacetyl-CoA and 3-HB-CoA, but not 2-HIB-CoA, and confirm native activity is present in C. autoethanogenum (FIG. 11).

[0301] An analysis of the genome of C. autoethanogenum led to identification of three putative CoA-thioesterases (thioester-hydrolases) that could be responsible for to the cleavage of acetoacetyl-CoA or 3-hydroxybutyryl-CoA thioester bond. These are also present in other acetogens such as C. ljungdahlii.

TABLE-US-00017 SEQ SEQ C. ID C. ID Description Annotation autoethanogenum NO: ljungdahlii NO: thioesterase 1 Palmitoyl-CoA AGY74947.1 154 ADK15695.1 157 (CAETHG_0718) hydrolase thioesterase 2 4-Hydroxybenzoyl- AGY75747.1 155 ADK16655.1 158 (CAETHG_1524) CoA thioesterase thioesterase 3 Putative AGY75999.1 156 ADK16959.1 159 (CAETHG_1780) Thioesterase

[0302] Inactivation of these three putative CoA-thioesterases lead to higher product titers, improving efficiency of the Ptb-Buk system. The three putative thioesterases were inactivated using ClosTron technology. In brief, the targeting domain of the type II Ltr was reprogrammed using the ClosTron website and the retargeted ClosTron plasmids were ordered from DNA 2.0. The ClosTron knock out vectors pMTL007C-E2-Cau-2640-571s targeting the thioesterase 1 (CAETHG_0718), pMTL007C-E2-PBor3782-166s targeting the thioesterase 2 (CAETHG_1524), and pMTL007C-E2-PBor4039-199s targeting the thioesterase 3 (CAETHG_1780) were introduced into C. autoethanogenum using conjugation.

[0303] Selection for integration was done by selecting PETC supplemented with 5 .mu.g/ml clarithromycin and successful inactivation by integration of the type II intron was confirmed by PCR across the insertion site.

[0304] The CoA hydrolase activity on acetoacetyl-CoA of both wild type C. autoethanogenum and each of the C. autoethanogenum with one of the putative genes inactivated was measured using the assay described above. It was shown that all three strains with the inactivated putative thioesterases showed less hydrolysis activity on acetoacetyl-CoA and 3-hydroxybutyryl-CoA (FIGS. 21A-B).

[0305] To demonstrate that the decreased CoA hydrolase activity, and thus an increased pool in acetoacetyl-CoA, is beneficial for production of acetoacetyl-CoA derived products, the isopropanol plasmid pMTL85147-thlA-ctfAB-adc encoding thl+ctfAB+adc (WO 2012/115527) was introduced into the C. autoethanogenum wild-type strain and the strain with inactivated thioesterase 1. A growth experiment was carried out 40 ml PETC medium in 1 L Schott bottles in technical triplicates with Co gas at 37.degree. C. at 110 rpm shaking. Synthetic gas (50% CO, 18% CO.sub.2, 2% H.sub.2, and 30% N.sub.2) was used as sole energy and carbon source. Headspace exchanged once and gassed to 21 psi (1.5 bar) at 37.degree. C. under synthetic gas (50% CO, 18% CO.sub.2, 2% H.sub.2, and 30% N.sub.2). Samples for OD and analytics were taken twice a day.

[0306] The strain with inactivated thioesterase 3 CAETHG_1780 produced significantly higher levels of isopropanol than the wild-type (FIG. 22 and FIGS. 23A-D).

[0307] Similarly, knockout of thioesterases in C. autoethanogenum would increase the pool of 3-hydroxybutyryl-CoA, allowing more efficient utilization of 3-hydroxybutyryl-CoA by Ptb-Buk and leading to higher production of acetone, isopropanol, isobutylene, (R)-3-hydroxybutyrate, 1,3-butanediol, and/or 2-hydroxyisobutyric acid. When plasmid pMTL8315-Pfdx-hbd1-thlA of Example 5 was introduced into C. autoethanogenum strain with interrupted thioesterase 2 CAETHG_1524, 3-hydroxybutyrate synthesis was abolished (compared to the up to 2.55 g/L 3-hydroxybutyrate that were found when expressing this plasmid in the C. autoethanogenum wild type strain). No competing activity for 3-hydroxybutyryl-CoA is present in this strain.

[0308] These results demonstrate that by reducing thioesterase activity, a higher CoA pool for the Ptb-Buk system and product synthesis is available.

[0309] Additionally, the production of 3-HB and 1,3-BDO can be increased by overexpression of ptb-buk. In a control experiment, whereby C. autoethanogenum as described in Example 2 was transformed with plasmids pMTL83159-phaB-thlA from Example 4 plus pMTL82256 (Heap, J Microbiol Methods, 78: 79-85, 2009), in which the latter is an empty plasmid used as a background control, the fermentation of such strain resulted in a production of 3-HB with highest titer at 1.68 g/L at day 10 (FIG. 42A). When pMTL82256-buk-ptb, instead of the empty plasmid pMTL82256, was coexpressed with pMTL83159-phaB-thlA in C. autoethanogenum, the fermentation resulted in a higher titter of 3-HB, at 4.76 g/L, at an earlier time, day 4 (FIG. 42B).

[0310] Deletion of native thioesterases enhances the efficiency of the ptb-buk system, which has preference for (R)-3-HB-CoA. The locus of the thioesterase gene in the genome was deleted and replaced with the buk-ptb dna fragment via the common molecular biology technique known as homologous recombination. The substitution of the thioesterase gene by the buk-ptb was confirmed by PCR, followed by agarose gel electrophoresis and dna sequencing.

[0311] In a bottle experiment, when pMTL83156-phaB-thlA was expressed without ptb-buk in the thioesterase deletion mutant, described above, the average maximum titer of 3-HB produced was 0.50.+-.0.05 g/L, similar to the titer obtained using an unmodified C. autoethanogenum strain. When pMTL82256-buk-ptb was coexpressed with the pMTL83156-phaB-thlA plasmid in a thioesterase knockout strain, the production of 3-HB increased to 1.29.+-.0.10 g/L (FIG. 43).

Example 8

[0312] This example demonstrates that it is possible to eliminate acetate production system in an acetogen C. autoethanogenum with the Ptb-buk system.

[0313] All acetogenic microorganisms are described to produce acetate (Drake, Acetogenic Prokaryotes, In: The Prokaryotes, 3.sup.rd edition, pages 354-420, New York, N.Y., Springer, 2006) as the production of acetate provides the microorganism with an option to directly generate ATP from substrate level phosphorylation via Pta (phosphotransacetylase) and Ack (phosphotransacetylase-acetate kinase). Native acetate-forming enzymes such as Pta-Ack are therefore considered to be essential in acetogens (Nagarajan, Microb Cell Factories, 12: 118, 2013). Since Ptb-Buk provides an alternative means for energy generation, it becomes possible to replace the native Pta-Ack system with Ptb-Buk.

[0314] The pta and ack genes in C. autoethanogenum are in one operon. To replace pta and ack genes with ptb and buk genes a plasmid, pMTL8225-pta-ack::ptb-buk (FIG. 24), with mazF counter selection marker that is under tetracycline inducible promoter, .about.1 kb upstream homology arm, ptb, buk, ermB cassette flanked by loxP sites and .about.1 kb downstream homology arm was assembled (SEQ ID NO: 160).

[0315] The .about.1 kb upstream and downstream homology arms were PCR amplified from C. autoethanogenum with primers SN22f/SN23r and SN28f/SN29r. Ptb and buk genes were PCR amplified from pIPA_16 plasmid using primers SN24f/SN25r. The ermB cassette with loxP sites was PCR amplified using primers SN26f/SN27r. The plasmid backbone was PCR amplified with primers SN30f/SN31r. KAPA polymerase was used for all PCR amplifications. The PCR products were assembled using GeneArt Seamless cloning kit from Life Technologies and plasmid with no mutations in the insert fragments was used to transform C. autoethanogenum by conjugation as described earlier.

[0316] Following conjugation and selection on trimethoprim and clarithromycin, 7 colonies were streaked twice on PETC-MES agar plates with clarithromycin and anhydrotetracycline to induce the expression of mazF genes. The colonies from clarithromycin and anhydrotetracycline should have the pta and ack genes replaced with ptb and buk genes and ermB cassette. This was verified by PCR using primers Og29f/Og30r flanking the homology arms and KAPA polymerase (FIG. 25). While a band of .about.4.6 kb is amplified from the wildtype strain, bands of .about.5.7 kb was amplified from colonies 1 and 4-7, indicating the replacement of pta and ack genes replaced with ptb and buk genes and ermB cassette. The above event was further confirmed by sequencing the PCR products from clones 4-7.

[0317] With the resulting modification the expression of ptb and buk genes is driven by the promoter upstream of pta gene.

TABLE-US-00018 SEQ ID NO: Name Sequence 161 SN22f TTTACAAATTCGGCCGGCCAAAGATTGCTCTATGTTTAAGCT 162 SN23r CATCAAAGTTTTTACTCATCAATTTCATGTTCATTTCCTCCC T 163 SN24f AGGGAGGAAATGAACATGAAATTGATGAGTAAAAACTTTGAT GAGT 164 SN25r GTATAGCATACATTATACGAACGGTACTAGTAAACCTTAGCT TGTTCTTC 165 SN26f GAAGAACAAGCTAAGGTTTACTAGTACCGTTCGTATAATGTA TGCTATAC 166 SN27r AGAGATGAGCATTAAAAGTCAAGTCTACCGTTCGTATAGCAT ACA 167 SN28f TGTATGCTATACGAACGGTAGACTTGACTTTTAATGCTCATC TCT 168 SN29r CATGAGATTATCAAAAAGGAGTTTAAATATCTATTTTGTCCT TAGGA 169 SN30f TCCTAAGGACAAAATAGATATTTAAACTCCTTTTTGATAATC TCATG 170 SN31r AGCTTAAACATAGAGCAATCTTTGGCCGGCCGAATTTGTAAA 171 Og29f AGCCACATCCAGTAGATTGAACTTT 172 Og30r AATTCGCCCTACGATTAAAGTGGAA

[0318] The resulting strain C. autoethanogenum pta-ack::ptb-buk, in which the pta-ack operon was replaced by the ptb-buk operon was transformed as described above with the isopropanol production plasmid pMTL85147-thlA-adc from Example 2. A growth study was carried out under autotrophic conditions and analyzed for metabolic end products. No acetate production was observed, while isopropanol (up to 0.355 g/L) and 3-HB (up to 0.29 g/L) was still produced alongside ethanol and 2,3-butanediol (FIGS. 39A and 39B). This demonstrates that it is possible to produce isopropanol and 3-HB without acetate production from gaseous substrates CO and/or CO.sub.2 and H.sub.2 using the Ptb-Buk system.

[0319] If acetone rather than isopropanol is the target product, the primary:secondary alcohol dehydrogenase gene (SEQ ID NO: 17) can be further knocked out this strain C. autoethanogenum pta-ack::ptb-buk using methods described above and in detail in WO 2015/085015. Introducing plasmid pMTL85147-thlA-adc into this strain results in production of acetone at similar levels as described above for isopropanol without co-production of acetate. Ethanol, 2,3-butanediol and 3-HB may be further products.

[0320] By further knock-outs it is possible to eliminate these products as well, e.g., knock-out of the acetolactate decarboxylase gene BudA results in a strain unable to produce 2,3-butanediol (U.S. Pat. No. 9,297,026). 3-HB production may be reduced or eliminated by deletion of 3-hydroxybutyrate dehydrogenase gene Bdh (SEQ ID NO: 62).

Example 9

[0321] This example demonstrates improvement of conversion of 3-hydroxybutyrate to 1,3-BDO by overexpression of the aldehyde:ferredoxin oxidoreductase gene aor1.

[0322] The pMTL82251 plasmid backbone was used for overexpression of the C. autoethanogenum aor1 gene. The pMTL82251 plasmid was selected since it has a different replication origin and antibiotic marker, but could be co-expressed with, the plasmid used in Example 5 that contained hbd1 and thlA. Preparation of the plasmid backbone and the assembly reaction were carried out following the procedures listed above, first generating plasmid pMTL82256 by introducing the C. autoethanogenum ferredoxin promoter into plasmid pMTL82251 and then adding the aor1 genes to form plasmid pMTL82256-aor1. The following primers were used.

TABLE-US-00019 SEQ ID NO: Name Sequence Direction 173 Pfdx-F1 AAAGGTCTCCGGCCGCGCTCACTATC forward TGCGGAACC 174 Pfdx-R1 TTTGGTCTCGAATTCTGTAACACCTC reverse CTTAATTTTTAG 175 aor1-F1 AAAGGTCTCGAATTCAAAGATCTATG forward TATGGTTATGATGGTAAAGTATTAAG 176 aor1-R1 TTTGGTCTCCTCGAGTATGGATCCCTA reverse GAACTTACCTATATATTCATCTAATCC

[0323] After transforming the resulting plasmid pMTL82256-aor1 into the E. coli CA434 strain, conjugation was performed on the previous C. autoethanogenum 1,3-BDO production host. Thus, the resulting C. autoethanogenum strain carried two plasmids, one for overexpressing hbd1 and thlA, and another for aor1, under different replication origins and selection marker. The production for 1,3-BDO was characterized and quantified following the procedures above.

[0324] The results clearly show that 1,3-BDO production can be improved by overexpressing aor1. Likewise other aldehyde:ferredoxin oxidoreductase genes could be expressed in C. autoethanogenum to facilitate conversion of 3-hydroxybutyrate to 1,3-butanediol.

[0325] To improve of 1,3-BDO production, AOR was overexpressed to improve conversion of 3-HB to 3-HB-aldehyde. To do this, pMTL82256-hbd-thlA and pMTL83159-aor1 were coexpressed in C. autoethanogenum. As compared to the strain that carried pMTL82256-hbd-thlA alone, the aor1-coexpressed strain produced higher ethanol and 1,3-BDO (FIG. 44).

Example 10

[0326] This example demonstrates the stereospecificity of Ptb-Buk that allows for the production of 2-hydroxyisobutyric acid without the production of unwanted byproducts.

[0327] 2-hydroxyisobutyric acid can be produced in E. coli and C. autoethanogenum by introduction of a thiolase and a 3-hydroxybutyryl-CoA dehydrogenase to convert acetyl-CoA to 3-hydroxybutyryl-CoA, a 2-hydroxyisobutyryl-CoA mutase enzyme for conversion of 3-hydroxybutyryl-CoA to 2-hydroxyisobutyryl-CoA and an enzyme that can hydrolyse the CoA to form 2-hydroxyisobutyric acid. The 3-hydroxybutyryl-CoA dehydrogenase can either be (R)- or (S)-specific and the enzyme converting 2-hydroxyisobutyryl-CoA to 2-hydroxybutyrate according to steps 1, 13, 19, and 20 of FIG. 1. This last step can either be done via a thioesterase or the Ptb-Buk system.

[0328] Three potential candidate genes, E. coli thioesterase type II TesB, the C. autoethanogenum phosphate acetyltransferase/acetate kinase pair and the C. beijerinckii butyryltransferase/butyrate kinase pair were cloned into E. coli pDUET T7 expression vectors via methods described above and primers below.

TABLE-US-00020 SEQ ID NO: Primer Sequence 177 pETDuet-pta-ack- GGGTACCTTATTTATTTTCAACTATTTC ack-DuetI2-R1 TTTTGTATC 178 pETDuet-pta-ack- TTGAAAATAAATAAGGTACCCTCGAGTC DuetI2-ack-F1 TGGTAAAG 179 pETDuet-pta-ack- TTTTTTCCATATGTATATCTCCTTCTTA DuetI2-pta-R1 TACTTAAC 180 pETDuet-pta-ack- AGGAGATATACATATGGAAAAAATTTGG pta-DuetI2-F 1 AGTAAGGC 181 pETDuet-tesB- GAAATCATAATTAAGGTACCCTCGAGTC DuetI2-tesB-F1 TGGTAAAG 182 pETDuet-tesB- CCTGACTCATATGTATATCTCCTTCTTA DuetI2-tesB-R1 TACTTAAC 183 pETDuet-tesB- AAGAAGGAGATATACATATGAGTCAGGC tesB-DuetI2-F1 ACTTAAAA 184 pETDuet-tesB- AGGGTACCTTAATTATGATTTCTCATAA testB-DuetI2-R1 CACCTTC

[0329] The obtained plasmids pDUET-pta-ack (SEQ ID NO: 185), pDUET-ptb-buk (SEQ ID NO: 186), pDUET-tesB (SEQ ID NO: 187) and introduced into E. coli BL21 (DE3) for expression and then assayed for their activity on acetoacetyl-CoA, 3-hydroxybutyryl-CoA and 2-hydroxyisobutyryl-CoA. The results are shown in FIG. 27. E. coli BL21 has a small but measurable amount of activity on all three substrates. Pta-Ack resulted in no activity above background, while both thioesterase TesB and Ptb-Buk showed high activity on all three substrates, including 2-hydroxyisobutyryl-CoA.

[0330] The activity of both thioesterase TesB and Ptb-Buk was higher on linear acetoacetyl-CoA, 3-hydroxybutyryl-CoA than on branched 2-hydroxyisobutyryl-CoA. This creates a problem in the pathway as it results in early termination of the pathway at 3-hydroxybutyryl-CoA, in particular as activities are higher than activities on the 2-hydroxyisobutyryl-CoA mutase enzyme.

[0331] However, Ptb-Buk in contrast to thioesterases is able to distinguish between stereoisomers and will only (or preferentially) act on (R)-3-hydroxybutyryl-CoA but not on (S)-3-hydroxybutyryl-CoA. This was demonstrated by expressing the Ptb-Buk system either with ThlA and (S)-specific Hbd (FIG. 28A) or (R)-specific phaB (FIG. 28B) in the pDuet system in E. coli. The constructs were constructed as described in Examples 1 and 3. Growth studies confirmed that appreciable amounts of 3-hydroxybutyrate were only formed when Ptb-Buk was expressed in combination with the (S)-specific Hbd but not the (R)-specific phaB.

[0332] Therefore, a route via an (S)-specific 3-hydroxybutyryl-CoA dehydrogenase and the Ptb-Buk provides significant advantages, as the Ptb-Buk system (unlike thioesterases) is not active on (S)-3-hydroxybutyryl-CoA but (S)-3-hydroxybutyryl-CoA is also the preferred isomer of the 2-hydroxyisobutyryl-CoA mutase (Yaneva, J Biol Chem, 287: 15502-15511, 2012). The produced 2-hydroxyisobutyryl-CoA can then be used via the Ptb-Buk to produce 2-hydroxyisobutyric acid and (unlike thioesterases) 2-hydroxyisobutyryl-CoA hydrolysis provides additional energy (FIG. 8).

[0333] Modular constructs were designed to compare performance of the pathway. A gene cassette containing the Wood-Ljungdahl promoter in front of the genes meaB, hcmA and hcmB was codon optimized and synthesized (SEQ ID NO: 188). HcmA and hcmB encode a 2-hydroxyisobutyryl-CoA mutase and meaB a chaperon from Aquincola tertiaricarbonis, in the construct hcmA and meaB genes were fused together as one protein as described (SEQ ID NO: 189) (Yaneva, J Biol Chem, 287: 15502-15511, 2012). The gene cassette was cloned into either a plasmid containing thiolase (thlA from C. acetobutylicum; SEQ ID NO: 136) and an (S)-specific 3-hydroxybutyrate dehydrogenase (hbd from C. acetobutylicum; SEQ ID NO: 190) (pMTL83155-thlA-hbd) or an (R)-specific 3-hydroxybutyrate dehydrogenase (phaB from R. eutropha) (pMTL83155-thlA-phaB) using the restriction enzymes KpnI and NcoI to form plasmids pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB (SEQ ID NO: 191) and pMTL83155-thlA-phaB-Pwl-meaBhcmA-hcmB (SEQ ID NO: 192), respectively. Sub-cloning of the codon optimized 2-hydroxyisobutyryl-CoA mutase casette in E. coli Top-10 was only successful after some initial cloning complications; it was found that the 2-hydroxyisobutyryl-CoA mutase casette could only be cloned into the plasmid at a lower temperature (28.degree. C.).

[0334] Vector pMTL83155-thlA-hbd and pMTL83155-thlA-phaB were created by first amplifying a promoter region of the phosphate acetyltransferase of C. autoethanogenum (SEQ ID NO: 193) and cloning into vector pMTL83151 (FJ797647.1; Heap, J Microbiol Meth, 78: 79-85, 2009) using NotI and NdeI restriction sites before introducing genes thlA and hbd or respectively phaB via NdeI and KpnI in a double ligation reaction.

[0335] In addition, compatible plasmid modules for expressing ptb-buk or tesB were built. For this, the respective genes were amplified from genomic DNA and introduced into plasmid pMTL82256 described in Example 9 and then introducing either ptb-buk or phaB using NdeI and NcoI and Seamless Cloning kit (Life technologies) to form plasmids pMTL82256-ptb-buk (SEQ ID NO: 194) and pMTL82256-tesB (SEQ ID NO: 195).

[0336] Plasmids pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB, pMTL83155-thlA-phaB-Pwl-meaBhcmA-hcmB, pMTL82256-ptb-buk and pMTL82256-tesB were introduced into E. coli Top-10 (all steps at 28.degree. C.) and C. autoethanogenum by transformation as described in previous examples in the following combinations: pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB+pMTL82256-ptb-buk, pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB+pMTL82256-tesB, pMTL83155-thlA-phaB-Pwl-meaBhcmA-hcmB+pMTL82256-ptb-buk and pMTL83155-thlA-phaB-Pwl-meaBhcmA-hcmB+pMTL82256-tesB.

[0337] Growth experiments were carried out with E. coli in LB medium at 30.degree. C. for 4 days and C. autoethanogenum in PETC medium with 30 psi CO-containing steel mill gas (collected from New Zealand Steel site in Glenbrook, NZ) at 30.degree. C. and 37.degree. C. for 6 days. Metabolites were measured as described above. In addition to measurement by GC-MS, 2-Hydroxyisobutyric acid production was also confirmed using liquid chromatography tandem mass spectrometry (LC-MS/MS) and .sup.1H nuclear magnetic resonance (NMR) spectroscopy.

[0338] Liquid chromatography tandem mass spectrometry (LC-MS/MS) data was acquired on a Dionex UltiMate 3000 liquid chromatography system (Dionex, California, USA) coupled to an ABSciex 4000 QTRAP mass spectrometer (ABSciex, Concord, Canada). The liquid chromatography system was controlled by Chromeleon software (Dionex), and chromatographic separation was achieved by injecting 10 .mu.L onto a Gemini-NX C18 150 mm.times.2 mm I.D., 3 .mu.m 110 .ANG. particle column (Phenomenex, Aschaffenburg, Germany) equipped with a pre-column Security Guard Gemini-NX C18 4 mm.times.2 mm I.D. cartridge. The column oven temperature was controlled and maintained at 55.degree. C. throughout the acquisition and the mobile phases were as follows: 7.5 mM aqueous tributylamine adjusted to pH 4.95 (.+-.0.05) with glacial acetic acid (eluent A) and acetonitrile (eluent B). The mobile phase flow rate was maintained at 300 .mu.L/min throughout a gradient profile and was introduced directly into the mass spectrometer with no split. The mass spectrometer was controlled by Analyst 1.5.2 software (ABSciex) and was equipped with a TurboV electrospray source operated in negative ionisation mode. The following previously optimized (and therefore general) parameters were used to acquire scheduled Multiple Reaction Monitoring (MRM) data: ionspray voltage -4500V, nebulizer (GS1), auxiliary (GS2), curtain (CUR) and collision (CAD) gases were 60, 60, 20 and medium (arbitrary units), respectively, generated via a N300DR nitrogen generator (Peak Scientific, Massachusetts, USA). The auxiliary gas temperature was maintained at 350.degree. C. The entrance potential (EP) was -10 volts. This method is also able to detect and separate 2-hydroxybutyric acid.

[0339] .sup.1H nuclear magnetic resonance (NMR) spectroscopy at a field strength of 400 MHz. Samples were prepared by diluting 400 .mu.L of sample with 400 .mu.L of 20 mM phosphate buffer prepared with D.sub.2O and containing trimethylsilyl proprionic acid (TMSP) as internal standard (pH of 7). The samples were then transferred glass NMR tube (5 mm.times.8 inches) and analysed by .sup.1H NMR using presaturation for water suppression with a 30.degree. excitation pulse, 15 second relaxation delay and 64 scans at a temperature of 27.degree. C. Once acquired the spectrum was transformed, flattened and integrated using Agilent VnmrJ software. The known concentration of TMSP was used for quantitation of 2-hydroxyisobutyric using the resonance at 1.36 ppm (singlet).

[0340] In both E. coli growing heterotrophically as well as C. autoethanogenum growing autotrophically, 2-hydroxyisobutyric acid could be detected in constructs pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB+pMTL82256-tesB (1.5 mg/L in LC-MS/MS method and 8 mg/L in GC-MS in C. autoethanogenum; 0.5 mg/L in LC-MS/MS method and 2 mg/L in GC-MS in E. coli) and pMTL83155-thlA-phaB-Pwl-meaBhcmA-hcmB+pMTL82256-ptb-buk (15 mg/L in LC-MS/MS method and 75 mg/L in GC-MS in C. autoethanogenum; 1.1 mg/L in LC-MS/MS method and 8.5 mg/L in GC-MS in E. coli), but not in constructs all other constructs including the control. By far the highest production occurred in strain carrying plasmid pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB+pMTL82256-ptb-buk (10.times. higher than all other routes), that has the optimal pathway with thiolase, (S)-specific (S)-specific 3-hydroxybutyryl-CoA dehydrogenase, the 2-hydroxyisobutyryl-CoA mutase, and the Ptb-Buk system (FIGS. 29A-D). Surprisingly, also production of 2-hydroxybutyrate (2-HB) (up to 64 mg/L by LC-MS/MS and 50 mg/L by GC-MS in C. autoethanogenum; 12 mg/L by LC-MS/MS and 9.5 mg/L by GC-MS in E. coli) was found in this strain, indicating unspecific mutase activity (FIG. 30). This was also found in the tesB strain, but again at significant lower levels (18 mg/L in LC-MS-MS and 9 mg/L in GC-MS in C. autoethanogenum). Production of 2-hydroxyisobutyric acid was also confirmed by NMR.

[0341] In addition, also qRT-PCR was carried out to confirm expression of the genes thlA, hbd, meaBhcmA and hcmB (FIG. 31).

[0342] The RT-PCR graphs show that thlA gene product is expressed to slightly higher levels with the P.sub.pta-ack promoter than hbd (as expected with a second gene in an operon) and that hmcB shows slightly lower expression levels than meaBhcmA. Also there is lower expression in C. autoethanogenum at 30.degree. C. than at 37.degree. C. and E. coli at 30.degree. C. For specific cycle numbers see below.

TABLE-US-00021 Condition Target Cq Mean Cq Std Dev E. coli/30.degree. C. thlA 18.26 0.243 hbd 20.6 0.603 meaBhcmA 16.20 0.108 hmcB 18.30 0.666 C. autoethanogenum/30.degree. C. thlA 26.10 0.169 Hbd 27.54 0.415 meaBhcmA 20.63 0.604 hmcB 22.64 0.697 C. autoethanogenum/37.degree. C. thlA 18.48 0.069 hbd 21.85 0.222 meaBhcmA 16.72 0.119 hmcB 19.62 0.173

[0343] The ratio of (S)-3-hydroxybutyric acid to (R)-3-hydroxybutyric acid was measured by high-performance liquid chromatography (HPLC) on an Agilent 1260 Infinity LC with UV detection at 210 nm. Samples were prepared by centrifugation at 14,000 rpm for 3 minutes, followed by evaporation of 200 .mu.L of supernatant to dryness. The pellet was then re-suspended in 100% Isopropanol and sonicated under heat for 1 hour. Centrifugation was repeated and the supernatant transferred to an HPLC vial for analysis. Separation was achieved with a 5 .mu.L injection on to a TCI Chiral MB-S column (250 mm.times.4.6 mm.times.3 .mu.m) at 1.5 mL/min and 40.degree. C. under isocratic conditions, using 95-5 hexane-isopropanol mobile phase containing 0.1% trifluoracetic acid.

[0344] A stereospecific analysis of produce 3-HB has been performed. Surprisingly it was found that in C. autoethanogenum, a mix of isomers was produced. Enzymes Hbd and PhaB are described to be stereospecific, PhaB is R-specific and Hbd is S-specific and when expressing these enzymes in E. coli a stereopure product has been observed (Tseng, Appl Environ Microbiol, 75: 3137-3145, 2009).

[0345] The following table indicates the distribution of (R)- and (S)-form of 3-HB at equilibrium produced via three different routes in C. autoethanogenum. These data suggest the presence of isomerase in the C. autoethanogenum.

TABLE-US-00022 Route % R-form % S-form ThlA--PhaB 55 .+-. 7 53 .+-. 5 ThlA--HBD 12 .+-. 3 88 .+-. 3 ThlA--ctfAB 16 .+-. 7 84 .+-. 7

[0346] Knockout of native isomerases may prevent interconversion of (R) and (S) forms of 3-HB. Alternatively, expression or overexpression of isomerases could enable new ptb-buk routes. For example, Hbd could be used to generate (S)-3-HB, isomerase could convert (S)-3-HB to (R)-3-HB, and ptb-buk could act on (R)-3-HB to produce products of interest.

Example 11

[0347] This example demonstrates the production of isobutylene via Ptb-Buk conversion of 3-hydroxyisovaleryl-CoA and 3-hydroxyisovalerate.

[0348] Different routes for production of isobutylene have been described, for example the conversion of acetone to isobutylene via a hydroxyisovalerate synthase and decarboxylase (van Leeuwen, Appl Microbiol Biotechnol, 93: 1377-1387, 2012). However, the hydroxyisovalerate decarboxylase step is an ATP requiring step and kinetics of this enzyme may not be ideal. Two alternative routes to isobutylene using the Ptb-Buk system have been identified through 3-hydroxyisovaleryl-CoA which has been shown in vitro to be a viable substrate for the Ptb-Buk system (Liu, Appl Microbiol Biotechnol, 53: 545-552, 2000).

[0349] Alternative pathway 1 consists of a synthase that converts acetone into 3-hydroxyisovaleryl-CoA (FIG. 9).

[0350] Alternative pathway 2 proceeds via known intermediate 3-methyl-2-oxopentanoate of the isoleucine biosynthesis that is common to bacteria such as E. coli or C. autoethanogenum (FIG. 10).

Example 12

[0351] This example describes methods for characterizing Ptb-Buk variants.

[0352] Given the substrate promiscuity of Ptb-Buk, it is likely that Ptb-Buk systems of varying amino acid sequences will possess varying preferences for given substrates. In order to identify a Ptb-Buk system that favors a desired substrate (e.g. acetoacetyl-CoA, 3-hydroxybutyryl-CoA, 2-hydroxyisobutyryl-CoA, acetyl-CoA, and/or butyryl-CoA), a high-throughput screen is desirable. Such a screen can be accomplished by coupling firefly luciferase (Luc) to the Ptb-Buk system (FIG. 33). Luc reacts with D-luciferin, generating oxyluciferin, carbon dioxide, and light. In addition to magnesium and molecular oxygen, Luc requires ATP for the reaction to proceed. ATP is a product generated by Ptb-Buk when provided an appropriate acyl-CoA or enoyl-CoA substrate. Therefore, Ptb-Buk reaction rates and preferences can be compared for varying substrates by quantifying the amount of light generated by a reaction containing Ptb-Buk, Luc, d-luciferin, magnesium, molecular oxygen, phosphate, ADP, and an acyl-CoA or enoyl-CoA.

Example 13

[0353] This example uses genome-scale modeling to demonstrate that high non-native product selectivities can be achieved using Ptb-Buk. Furthermore, it shows that the use of Ptb-Buk could permit the coupling of cellular growth with product production, allowing the construction of stable and high-yielding fermentation strains.

[0354] A genome-scale metabolic model of C. autoethanogenum similar to the one described by Marcellin, Green Chem, 18: 3020-3028, 2006 was utilized. Variants of this model were created that incorporate additional metabolic reactions, each one representing a different genetically modified microorganism for non-native product formation. Three model versions were created for each non-native product pathway, incorporating either a thioesterase, acetate CoA-transferase or Ptb-Buk reaction.

[0355] Maximum selectivities were calculated using flux balance analysis (FBA), using scripts from the COBRA Toolbox v2.0 in MATLAB R2014a (The Mathworks, Inc.) with Gurobi version 6.0.4 as the solver (Gurobi Optimization, Inc.). Exchange reactions were constrained to represent a chemically defined minimal growth medium with CO as the source of carbon and energy. An evolutionary algorithm was used to search for the existence of strain designs incorporating up to ten gene knockouts that couple target non-native chemical production with growth.

[0356] FBA predicts that pathways using Ptb-Buk or CoA transferase offer the highest product selectivities due to ATP gain through substrate level phosphorylation. The results are illustrated in Table 2. However, it should be noted that one limitation of Genome-scale models and FBA analysis is that enzyme kinetics are not captured. The CoA transferase reaction requires a certain base level of acetate for functionality, therefore in reality the maximum selectivity using a CoA transferase would be less than 100% due to a base level of acetate required to be present.

TABLE-US-00023 Maximum selectivity % (C in target product/C in all fermentation products) Non-native product Thioesterase CoA-transferase Ptb-Buk Acetone 82.0 100 100 Isopropanol 82.1 100 100 Isobutylene 55.9 80.2 80.2 3-Hydroxybutyrate 86.0 100 100 1,3-Butanediol 88.6 100 100 2-Hydroxyisobutyrate 86.0 100 100

[0357] Table 2. Flux balance analysis (FBA) showing the maximum possible non-native product selectivities in C. autoethanogenum for a set of products and candidate enzymes.

[0358] It is desirable to construct strains where the target non-native chemical must be produced for cell growth. FBA predicts that in most cases it would be difficult to couple target chemical production with growth when using a thioesterase or a CoA transferase; instead, native products acetate and ethanol would be favored. However, when using Ptb-Buk, many growth-coupled chemical production strain designs exist, often incorporating a disruption of the phosphotransacetylase-acetate kinase reactions. Table 3 summarizes the growth coupling ability of each strain.

TABLE-US-00024 Ability to couple non-native chemical production with growth Non-native product Thioesterase CoA-transferase Ptb-Buk Acetone No No Yes Isopropanol No No Yes Isobutylene No No No 3-Hydroxybutyrate No No Yes 1,3-Butanediol No Yes Yes 2-Hydroxyisobutyrate No No Yes

[0359] Table 3. Potential to couple non-native chemical production with growth in C. autoethanogenum during growth on CO when reconfiguring the metabolic network with up to ten gene knockouts.

[0360] While both Ptb-Buk and CoA transferase can support high selectivities, flux balance analysis predicts that in most cases, only Ptb-Buk would allow the construction of stable, high-yielding fermentation strains that couple non-native chemical production with growth.

Example 14

[0361] This example demonstrates the production of adipic acid via Ptb-Buk from gaseous feedstock.

[0362] Production of adipic acid in E. coli from sugar has been described by a pathway utilizing Ptb-Buk (Yu, Biotechnol Bioeng, 111: 2580-2586, 2014). However production was low, in the .mu.g/L range. Without wishing to be bound by any particular theory, the inventors believe that this is likely a function of lacking driving force in forms of reducing power and surplus ATP. Using a reduced gaseous substrate as CO and H.sub.2 and an acetogenic bacterium such as C. autoethanogenum, this current limitation can be overcome. CO and H.sub.2 oxidation provide sufficient driving force for reduction of 3-oxo-adipyl-CoA to 3-hydroxyadipyl-CoA by 3-hydroxybutyryl-CoA dehydrogenase or acetoacetyl-CoA hydratase and 2,3-dehydroadipyl-CoA to adipyl-CoA by enoyl-CoA hydrolase or enoyl-CoA reductase (FIG. 34, steps 23 and 25), in contrast to E. coli growing heterotrophically on more oxidized sugars. Acetogenic bacteria live on the energetic limit of life and therefore ATP generating reactions like the Ptb-Buk system have a strong driving force, ensuring efficient conversion of adipyl-CoA to adipic acid (FIG. 34, step 26), in contrast to E. coli growing heterotrophically on sugars generating surplus ATP from glycolysis.

[0363] To produce adipic acid from gas in C. autoethanogenum, genes encoding a succinyl-CoA synthetase from E. coli (NP_415256, NP_415257), a ketoisovalerate oxidoreductase PaaJ from E. coli (WP_001206190.1), a 3-hydroxybutyryl-CoA dehydrogenase Hbd from Clostridium beijerinckii (WP_011967675.1), a trans-2-enoyl-CoA reductase Crt from C. acetobutylicum (NP_349318.1), trans-2-enoyl-CoA reductase Bcd from C. acetobutylicum (NP_349317.1) and electron flavoproteins EtfAB (NP_349315, NP_349316) are cloned on an expression plasmid and then transformed as described above in C. autoethanogenum strains pta-ack::ptb-buk or CAETHG_1524::ptb-buk from previous examples. Adipic acid is produce according to the steps depicted in FIG. 34.

Example 15

[0364] This example demonstrates the production of various products including 2-buten-1-ol, 3-methyl-2-butanol, 1,3-hexanediol (HDO) via Ptb-Buk and AOR.

[0365] As demonstrated in Example 6, Ptb-Buk is highly promiscuous and acts on a wide range of CoAs as substrates or can be engineered to use a range of non-natural CoAs as substrates. Likewise AOR enzyme has been shown to act on a wide range of substrates. Together these two enzymes can convert a wide range of CoAs via their acids into aldehydes, which then can be further converted to alcohols, ketones or enols via alcohol dehdydrogeneses, for which a wide variety exists in nature. While under standard conditions the reduction of acids with ferredoxin to aldehydes via the AOR is endergonic (Thauer, Bacteriol Rev, 41: 100-180, 1977) and as such not feasible, it surprisingly is in carboxydotrophic acetogens such as C. autoethanogenum that operate at low pH and with CO or H2 as substrate (Mock, J Bacteriol, 197: 2965-2980, 2015). One common limitation working with acetogens is that they are ATP-limited, living on the thermodynamic edge of life (Schuchmann, Nat Rev Microbiol, 12: 809-821, 2014), which can be overcome by coupling this acid reduction to ATP-linked formation of acids from CoAs via the Ptb-Buk system.

[0366] The Ptb-Buk system and AOR system has been demonstrated in above examples for several different products, but can be extended to further products, for example production of 2-buten-1-ol, 3-methyl-2-butanol, 1,3-hexanediol (HDO). 2-Buten-1-ol can be produced via Ptb-Buk, AOR and an alcohol dehydrogenase from crotonyl-CoA (FIG. 35). 1,3-Hexanediol can be produced via Ptb-Buk, AOR and an alcohol dehydrogenase from 3-hydroxy-hexanoyl-CoA (FIG. 35). By combining Ptb-Buk, Adc and an alcohol dehydrogenase (such as native primary: secondary alcohol dehydrogenase), 3-methyl-2-butanol can be formed from acetobutyryl-CoA.

[0367] All of these precursors, crotonyl-CoA, 3-hydroxy-hexanoyl-CoA, or acetobutyryl-CoA can be formed by reduction and elongation of acetyl-CoA, acetoacetyl-CoA and 3-HB-CoA which are described in previous examples via known fermentation pathways of, for example, Clostridium kluyveri (Barker, PNAS USA, 31: 373-381, 1945; Seedorf, PNAS USA, 105: 2128-2133, 2008) and other Clostridia. Involved enzymes include crotonyl-CoA hydratase (crotonase) or crotonyl-CoA reductase, butyryl-CoA dehydrogenase or trans-2-enoyl-CoA reductase, thiolase or acyl-CoA acetyltransferase and 3-hydroxybutyryl-CoA dehydrogenase or acetoacetyl-CoA hydratase (FIG. 35). Respective genes from C. kluyveri or other Clostridia have be cloned on an expression plasmid (U.S. 2011/0236941) and and then transformed as described above in C. autoethanogenum strains pta-ack::ptb-buk or CAETHG_1524::ptb-buk from previous examples for production of 2-buten-1-ol, 3-methyl-2-butanol, 1,3-hexanediol (HDO). 2-Buten-1-ol, 3-methyl-2-butanol, and 1,3-hexanediol (HDO) may be precursors for further downstream products.

[0368] While these are only a few examples, it should be clear that this pathway can be further extended using the same enzymes or engineered variants thereof that have specificity for higher chain length to produce a range of C4, C6, C8, C10, C12, C14 alcohols, ketones, enols or diols (FIG. 39). Different type of molecules can be obtained also by using primer or extender units different than acetyl-CoA in the thiolase step as been described elsewhere (Cheong, Nature Biotechnol, 34: 556-561, 2016).

[0369] All references, including publications, patent applications, and patents, cited herein are hereby incorporated by reference to the same extent as if each reference were individually and specifically indicated to be incorporated by reference and were set forth in its entirety herein. The reference to any prior art in this specification is not, and should not be taken as, an acknowledgement that that prior art forms part of the common general knowledge in the field of endeavour in any country.

[0370] The use of the terms "a" and "an" and "the" and similar referents in the context of describing the invention (especially in the context of the following claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. The terms "comprising," "having," "including," and "containing" are to be construed as open-ended terms (i.e., meaning "including, but not limited to") unless otherwise noted. Recitation of ranges of values herein are merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g., "such as") provided herein, is intended merely to better illuminate the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. No language in the specification should be construed as indicating any non-claimed element as essential to the practice of the invention.

[0371] Preferred embodiments of this invention are described herein. Variations of those preferred embodiments may become apparent to those of ordinary skill in the art upon reading the foregoing description. The inventors expect skilled artisans to employ such variations as appropriate, and the inventors intend for the invention to be practiced otherwise than as specifically described herein. Accordingly, this invention includes all modifications and equivalents of the subject matter recited in the claims appended hereto as permitted by applicable law. Moreover, any combination of the above-described elements in all possible variations thereof is encompassed by the invention unless otherwise indicated herein or otherwise clearly contradicted by context.

Sequence CWU 1 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 195 <210> SEQ ID NO 1 <211> LENGTH: 392 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: ThlA, WP_010966157.1 <400> SEQUENCE: 1 Met Lys Glu Val Val Ile Ala Ser Ala Val Arg Thr Ala Ile Gly Ser 1 5 10 15 Tyr Gly Lys Ser Leu Lys Asp Val Pro Ala Val Asp Leu Gly Ala Thr 20 25 30 Ala Ile Lys Glu Ala Val Lys Lys Ala Gly Ile Lys Pro Glu Asp Val 35 40 45 Asn Glu Val Ile Leu Gly Asn Val Leu Gln Ala Gly Leu Gly Gln Asn 50 55 60 Pro Ala Arg Gln Ala Ser Phe Lys Ala Gly Leu Pro Val Glu Ile Pro 65 70 75 80 Ala Met Thr Ile Asn Lys Val Cys Gly Ser Gly Leu Arg Thr Val Ser 85 90 95 Leu Ala Ala Gln Ile Ile Lys Ala Gly Asp Ala Asp Val Ile Ile Ala 100 105 110 Gly Gly Met Glu Asn Met Ser Arg Ala Pro Tyr Leu Ala Asn Asn Ala 115 120 125 Arg Trp Gly Tyr Arg Met Gly Asn Ala Lys Phe Val Asp Glu Met Ile 130 135 140 Thr Asp Gly Leu Trp Asp Ala Phe Asn Asp Tyr His Met Gly Ile Thr 145 150 155 160 Ala Glu Asn Ile Ala Glu Arg Trp Asn Ile Ser Arg Glu Glu Gln Asp 165 170 175 Glu Phe Ala Leu Ala Ser Gln Lys Lys Ala Glu Glu Ala Ile Lys Ser 180 185 190 Gly Gln Phe Lys Asp Glu Ile Val Pro Val Val Ile Lys Gly Arg Lys 195 200 205 Gly Glu Thr Val Val Asp Thr Asp Glu His Pro Arg Phe Gly Ser Thr 210 215 220 Ile Glu Gly Leu Ala Lys Leu Lys Pro Ala Phe Lys Lys Asp Gly Thr 225 230 235 240 Val Thr Ala Gly Asn Ala Ser Gly Leu Asn Asp Cys Ala Ala Val Leu 245 250 255 Val Ile Met Ser Ala Glu Lys Ala Lys Glu Leu Gly Val Lys Pro Leu 260 265 270 Ala Lys Ile Val Ser Tyr Gly Ser Ala Gly Val Asp Pro Ala Ile Met 275 280 285 Gly Tyr Gly Pro Phe Tyr Ala Thr Lys Ala Ala Ile Glu Lys Ala Gly 290 295 300 Trp Thr Val Asp Glu Leu Asp Leu Ile Glu Ser Asn Glu Ala Phe Ala 305 310 315 320 Ala Gln Ser Leu Ala Val Ala Lys Asp Leu Lys Phe Asp Met Asn Lys 325 330 335 Val Asn Val Asn Gly Gly Ala Ile Ala Leu Gly His Pro Ile Gly Ala 340 345 350 Ser Gly Ala Arg Ile Leu Val Thr Leu Val His Ala Met Gln Lys Arg 355 360 365 Asp Ala Lys Lys Gly Leu Ala Thr Leu Cys Ile Gly Gly Gly Gln Gly 370 375 380 Thr Ala Ile Leu Leu Glu Lys Cys 385 390 <210> SEQ ID NO 2 <211> LENGTH: 393 <212> TYPE: PRT <213> ORGANISM: Cupriavidus necator <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: PhaA, WP_013956452.1 <400> SEQUENCE: 2 Met Thr Asp Val Val Ile Val Ser Ala Ala Arg Thr Ala Val Gly Lys 1 5 10 15 Phe Gly Gly Ser Leu Ala Lys Ile Pro Ala Pro Glu Leu Gly Ala Val 20 25 30 Val Ile Lys Ala Ala Leu Glu Arg Ala Gly Val Lys Pro Glu Gln Val 35 40 45 Ser Glu Val Ile Met Gly Gln Val Leu Thr Ala Gly Ser Gly Gln Asn 50 55 60 Pro Ala Arg Gln Ala Ala Ile Lys Ala Gly Leu Pro Ala Met Val Pro 65 70 75 80 Ala Met Thr Ile Asn Lys Val Cys Gly Ser Gly Leu Lys Ala Val Met 85 90 95 Leu Ala Ala Asn Ala Ile Met Ala Gly Asp Ala Glu Ile Val Val Ala 100 105 110 Gly Gly Gln Glu Asn Met Ser Ala Ala Pro His Val Leu Pro Gly Ser 115 120 125 Arg Asp Gly Phe Arg Met Gly Asp Ala Lys Leu Val Asp Thr Met Ile 130 135 140 Val Asp Gly Leu Trp Asp Val Tyr Asn Gln Tyr His Met Gly Ile Thr 145 150 155 160 Ala Glu Asn Val Ala Lys Glu Tyr Gly Ile Thr Arg Glu Ala Gln Asp 165 170 175 Glu Leu Ala Val Gly Ser Gln Asn Lys Ala Glu Ala Ala Gln Lys Ala 180 185 190 Gly Lys Phe Asp Glu Glu Ile Val Pro Val Leu Ile Pro Gln Arg Lys 195 200 205 Gly Asp Pro Val Ala Phe Lys Thr Asp Glu Phe Val Arg Gln Gly Ala 210 215 220 Thr Leu Asp Ser Met Ser Gly Leu Lys Pro Ala Phe Asp Lys Ala Gly 225 230 235 240 Thr Val Thr Ala Ala Asn Ala Ser Gly Leu Asn Asp Gly Ala Ala Ala 245 250 255 Val Val Val Met Ser Ala Ala Lys Ala Lys Glu Leu Gly Leu Thr Pro 260 265 270 Leu Ala Thr Ile Lys Ser Tyr Ala Asn Ala Gly Val Asp Pro Lys Val 275 280 285 Met Gly Met Gly Pro Val Pro Ala Ser Lys Arg Ala Leu Ser Arg Ala 290 295 300 Glu Trp Thr Pro Gln Asp Leu Asp Leu Met Glu Ile Asn Glu Ala Phe 305 310 315 320 Ala Ala Gln Ala Leu Ala Val His Gln Gln Met Gly Trp Asp Thr Ser 325 330 335 Lys Val Asn Val Asn Gly Gly Ala Ile Ala Ile Gly His Pro Ile Gly 340 345 350 Ala Ser Gly Cys Arg Ile Leu Val Thr Leu Leu His Glu Met Lys Arg 355 360 365 Arg Asp Ala Lys Lys Gly Leu Ala Ser Leu Cys Ile Gly Gly Gly Met 370 375 380 Gly Val Ala Leu Ala Val Glu Arg Lys 385 390 <210> SEQ ID NO 3 <211> LENGTH: 394 <212> TYPE: PRT <213> ORGANISM: Cupriavidus necator <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: BktB, WP_011615089.1 <400> SEQUENCE: 3 Met Thr Arg Glu Val Val Val Val Ser Gly Val Arg Thr Ala Ile Gly 1 5 10 15 Thr Phe Gly Gly Ser Leu Lys Asp Val Ala Pro Ala Glu Leu Gly Ala 20 25 30 Leu Val Val Arg Glu Ala Leu Ala Arg Ala Gln Val Ser Gly Asp Asp 35 40 45 Val Gly His Val Val Phe Gly Asn Val Ile Gln Thr Glu Pro Arg Asp 50 55 60 Met Tyr Leu Gly Arg Val Ala Ala Val Asn Gly Gly Val Thr Ile Asn 65 70 75 80 Ala Pro Ala Leu Thr Val Asn Arg Leu Cys Gly Ser Gly Leu Gln Ala 85 90 95 Ile Val Ser Ala Ala Gln Thr Ile Leu Leu Gly Asp Thr Asp Val Ala 100 105 110 Ile Gly Gly Gly Ala Glu Ser Met Ser Arg Ala Pro Tyr Leu Ala Pro 115 120 125 Ala Ala Arg Trp Gly Ala Arg Met Gly Asp Ala Gly Leu Val Asp Met 130 135 140 Met Leu Gly Ala Leu His Asp Pro Phe His Arg Ile His Met Gly Val 145 150 155 160 Thr Ala Glu Asn Val Ala Lys Glu Tyr Asp Ile Ser Arg Ala Gln Gln 165 170 175 Asp Glu Ala Ala Leu Glu Ser His Arg Arg Ala Ser Ala Ala Ile Lys 180 185 190 Ala Gly Tyr Phe Lys Asp Gln Ile Val Pro Val Val Ser Lys Gly Arg 195 200 205 Lys Gly Asp Val Thr Phe Asp Thr Asp Glu His Val Arg His Asp Ala 210 215 220 Thr Ile Asp Asp Met Thr Lys Leu Arg Pro Val Phe Val Lys Glu Asn 225 230 235 240 Gly Thr Val Thr Ala Gly Asn Ala Ser Gly Leu Asn Asp Ala Ala Ala 245 250 255 Ala Val Val Met Met Glu Arg Ala Glu Ala Glu Arg Arg Gly Leu Lys 260 265 270 Pro Leu Ala Arg Leu Val Ser Tyr Gly His Ala Gly Val Asp Pro Lys 275 280 285 Ala Met Gly Ile Gly Pro Val Pro Ala Thr Lys Ile Ala Leu Glu Arg 290 295 300 Ala Gly Leu Gln Val Ser Asp Leu Asp Val Ile Glu Ala Asn Glu Ala 305 310 315 320 Phe Ala Ala Gln Ala Cys Ala Val Thr Lys Ala Leu Gly Leu Asp Pro 325 330 335 Ala Lys Val Asn Pro Asn Gly Ser Gly Ile Ser Leu Gly His Pro Ile 340 345 350 Gly Ala Thr Gly Ala Leu Ile Thr Val Lys Ala Leu His Glu Leu Asn 355 360 365 Arg Val Gln Gly Arg Tyr Ala Leu Val Thr Met Cys Ile Gly Gly Gly 370 375 380 Gln Gly Ile Ala Ala Ile Phe Glu Arg Ile 385 390 <210> SEQ ID NO 4 <211> LENGTH: 394 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AtoB, NP_416728.1 <400> SEQUENCE: 4 Met Lys Asn Cys Val Ile Val Ser Ala Val Arg Thr Ala Ile Gly Ser 1 5 10 15 Phe Asn Gly Ser Leu Ala Ser Thr Ser Ala Ile Asp Leu Gly Ala Thr 20 25 30 Val Ile Lys Ala Ala Ile Glu Arg Ala Lys Ile Asp Ser Gln His Val 35 40 45 Asp Glu Val Ile Met Gly Asn Val Leu Gln Ala Gly Leu Gly Gln Asn 50 55 60 Pro Ala Arg Gln Ala Leu Leu Lys Ser Gly Leu Ala Glu Thr Val Cys 65 70 75 80 Gly Phe Thr Val Asn Lys Val Cys Gly Ser Gly Leu Lys Ser Val Ala 85 90 95 Leu Ala Ala Gln Ala Ile Gln Ala Gly Gln Ala Gln Ser Ile Val Ala 100 105 110 Gly Gly Met Glu Asn Met Ser Leu Ala Pro Tyr Leu Leu Asp Ala Lys 115 120 125 Ala Arg Ser Gly Tyr Arg Leu Gly Asp Gly Gln Val Tyr Asp Val Ile 130 135 140 Leu Arg Asp Gly Leu Met Cys Ala Thr His Gly Tyr His Met Gly Ile 145 150 155 160 Thr Ala Glu Asn Val Ala Lys Glu Tyr Gly Ile Thr Arg Glu Met Gln 165 170 175 Asp Glu Leu Ala Leu His Ser Gln Arg Lys Ala Ala Ala Ala Ile Glu 180 185 190 Ser Gly Ala Phe Thr Ala Glu Ile Val Pro Val Asn Val Val Thr Arg 195 200 205 Lys Lys Thr Phe Val Phe Ser Gln Asp Glu Phe Pro Lys Ala Asn Ser 210 215 220 Thr Ala Glu Ala Leu Gly Ala Leu Arg Pro Ala Phe Asp Lys Ala Gly 225 230 235 240 Thr Val Thr Ala Gly Asn Ala Ser Gly Ile Asn Asp Gly Ala Ala Ala 245 250 255 Leu Val Ile Met Glu Glu Ser Ala Ala Leu Ala Ala Gly Leu Thr Pro 260 265 270 Leu Ala Arg Ile Lys Ser Tyr Ala Ser Gly Gly Val Pro Pro Ala Leu 275 280 285 Met Gly Met Gly Pro Val Pro Ala Thr Gln Lys Ala Leu Gln Leu Ala 290 295 300 Gly Leu Gln Leu Ala Asp Ile Asp Leu Ile Glu Ala Asn Glu Ala Phe 305 310 315 320 Ala Ala Gln Phe Leu Ala Val Gly Lys Asn Leu Gly Phe Asp Ser Glu 325 330 335 Lys Val Asn Val Asn Gly Gly Ala Ile Ala Leu Gly His Pro Ile Gly 340 345 350 Ala Ser Gly Ala Arg Ile Leu Val Thr Leu Leu His Ala Met Gln Ala 355 360 365 Arg Asp Lys Thr Leu Gly Leu Ala Thr Leu Cys Ile Gly Gly Gly Gln 370 375 380 Gly Ile Ala Met Val Ile Glu Arg Leu Asn 385 390 <210> SEQ ID NO 5 <211> LENGTH: 217 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: CtfA, WP_012059996.1 <400> SEQUENCE: 5 Met Asn Lys Leu Val Lys Leu Thr Asp Leu Lys Arg Ile Phe Lys Asp 1 5 10 15 Gly Met Thr Ile Met Val Gly Gly Phe Leu Asp Cys Gly Thr Pro Glu 20 25 30 Asn Ile Ile Asp Met Leu Val Asp Leu Asn Ile Lys Asn Leu Thr Ile 35 40 45 Ile Ser Asn Asp Thr Ala Phe Pro Asn Lys Gly Ile Gly Lys Leu Ile 50 55 60 Val Asn Gly Gln Val Ser Lys Val Ile Ala Ser His Ile Gly Thr Asn 65 70 75 80 Pro Glu Thr Gly Lys Lys Met Ser Ser Gly Glu Leu Lys Val Glu Leu 85 90 95 Ser Pro Gln Gly Thr Leu Ile Glu Arg Ile Arg Ala Ala Gly Ser Gly 100 105 110 Leu Gly Gly Val Leu Thr Pro Thr Gly Leu Gly Thr Ile Val Glu Glu 115 120 125 Gly Lys Lys Lys Val Thr Ile Asp Gly Lys Glu Tyr Leu Leu Glu Leu 130 135 140 Pro Leu Ser Ala Asp Val Ser Leu Ile Lys Gly Ser Ile Val Asp Glu 145 150 155 160 Phe Gly Asn Thr Phe Tyr Arg Ala Ala Thr Lys Asn Phe Asn Pro Tyr 165 170 175 Met Ala Met Ala Ala Lys Thr Val Ile Val Glu Ala Glu Asn Leu Val 180 185 190 Lys Cys Glu Asp Leu Lys Arg Asp Ala Ile Met Thr Pro Gly Val Leu 195 200 205 Val Asp Tyr Ile Val Lys Glu Ala Ala 210 215 <210> SEQ ID NO 6 <211> LENGTH: 221 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: CtfB, WP_012059997.1 <400> SEQUENCE: 6 Met Ile Val Asp Lys Val Leu Ala Lys Glu Ile Ile Ala Lys Arg Val 1 5 10 15 Ala Lys Glu Leu Lys Lys Asp Gln Leu Val Asn Leu Gly Ile Gly Leu 20 25 30 Pro Thr Leu Val Ala Asn Tyr Val Pro Lys Glu Met Asn Ile Thr Phe 35 40 45 Glu Ser Glu Asn Gly Met Val Gly Met Ala Gln Met Ala Ser Ser Gly 50 55 60 Glu Asn Asp Pro Asp Ile Ile Asn Ala Gly Gly Glu Tyr Val Thr Leu 65 70 75 80 Leu Pro Gln Gly Ser Phe Phe Asp Ser Ser Met Ser Phe Ala Leu Ile 85 90 95 Arg Gly Gly His Val Asp Val Ala Val Leu Gly Ala Leu Glu Val Asp 100 105 110 Glu Lys Gly Asn Leu Ala Asn Trp Ile Val Pro Asn Lys Ile Val Pro 115 120 125 Gly Met Gly Gly Ala Met Asp Leu Ala Ile Gly Ala Lys Lys Ile Ile 130 135 140 Val Ala Met Gln His Thr Gly Lys Ser Lys Pro Lys Ile Val Lys Lys 145 150 155 160 Cys Thr Leu Pro Leu Thr Ala Lys Ala Gln Val Asp Leu Ile Val Thr 165 170 175 Glu Leu Cys Val Ile Asp Val Thr Asn Asp Gly Leu Leu Leu Lys Glu 180 185 190 Ile His Lys Asp Thr Thr Ile Asp Glu Ile Lys Phe Leu Thr Asp Ala 195 200 205 Asp Leu Ile Ile Pro Asp Asn Leu Lys Ile Met Asp Ile 210 215 220 <210> SEQ ID NO 7 <211> LENGTH: 286 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: TesB, NP_414986.1 <400> SEQUENCE: 7 Met Ser Gln Ala Leu Lys Asn Leu Leu Thr Leu Leu Asn Leu Glu Lys 1 5 10 15 Ile Glu Glu Gly Leu Phe Arg Gly Gln Ser Glu Asp Leu Gly Leu Arg 20 25 30 Gln Val Phe Gly Gly Gln Val Val Gly Gln Ala Leu Tyr Ala Ala Lys 35 40 45 Glu Thr Val Pro Glu Glu Arg Leu Val His Ser Phe His Ser Tyr Phe 50 55 60 Leu Arg Pro Gly Asp Ser Lys Lys Pro Ile Ile Tyr Asp Val Glu Thr 65 70 75 80 Leu Arg Asp Gly Asn Ser Phe Ser Ala Arg Arg Val Ala Ala Ile Gln 85 90 95 Asn Gly Lys Pro Ile Phe Tyr Met Thr Ala Ser Phe Gln Ala Pro Glu 100 105 110 Ala Gly Phe Glu His Gln Lys Thr Met Pro Ser Ala Pro Ala Pro Asp 115 120 125 Gly Leu Pro Ser Glu Thr Gln Ile Ala Gln Ser Leu Ala His Leu Leu 130 135 140 Pro Pro Val Leu Lys Asp Lys Phe Ile Cys Asp Arg Pro Leu Glu Val 145 150 155 160 Arg Pro Val Glu Phe His Asn Pro Leu Lys Gly His Val Ala Glu Pro 165 170 175 His Arg Gln Val Trp Ile Arg Ala Asn Gly Ser Val Pro Asp Asp Leu 180 185 190 Arg Val His Gln Tyr Leu Leu Gly Tyr Ala Ser Asp Leu Asn Phe Leu 195 200 205 Pro Val Ala Leu Gln Pro His Gly Ile Gly Phe Leu Glu Pro Gly Ile 210 215 220 Gln Ile Ala Thr Ile Asp His Ser Met Trp Phe His Arg Pro Phe Asn 225 230 235 240 Leu Asn Glu Trp Leu Leu Tyr Ser Val Glu Ser Thr Ser Ala Ser Ser 245 250 255 Ala Arg Gly Phe Val Arg Gly Glu Phe Tyr Thr Gln Asp Gly Val Leu 260 265 270 Val Ala Ser Thr Val Gln Glu Gly Val Met Arg Asn His Asn 275 280 285 <210> SEQ ID NO 8 <211> LENGTH: 436 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 1, AGY74947.1 <400> SEQUENCE: 8 Met Asn Asn Asp Asn Cys Thr Ile Lys Ile Thr Pro Glu Val Ser Arg 1 5 10 15 Val Asp Glu Pro Val Asp Ile Lys Ile Asn Gly Leu Pro Lys Asn Glu 20 25 30 Lys Val Ile Ile Arg Ala Val Ser Ser Asp Tyr Tyr Cys Ile Asn Ala 35 40 45 Ser Ile Leu Glu Ile Gly Asp Asn Thr Leu Trp Glu Ser Tyr Ala Val 50 55 60 Phe Glu Thr Asp Glu Cys Gly Asn Ile Asn Phe Glu Asn Ala Val Pro 65 70 75 80 Val Asp Gly Thr Tyr Ser Asn Cys Asp Lys Met Gly Leu Phe Tyr Ser 85 90 95 Met Arg Pro Lys Gln Ile Arg Lys Ser Lys Leu Ile Gln Lys Leu Ser 100 105 110 Ser Ile Asn Glu Asn Arg Lys Tyr Lys Ile Thr Phe Thr Val Glu Lys 115 120 125 Asn Gly Lys Ile Ile Gly Ser Lys Glu His Thr Arg Val Tyr Cys Asp 130 135 140 Asp Thr Ile Lys Ser Ile Asp Val Val Glu Lys Asn Leu Leu Ala Arg 145 150 155 160 Tyr Phe Thr Ser Lys Asp Asn Ile Lys His Pro Ala Ile Ile Val Leu 165 170 175 Ser Gly Ser Asp Gly Arg Ile Glu Lys Ala Gln Ala Ile Ala Glu Leu 180 185 190 Phe Ala Met Arg Gly Tyr Ser Ala Leu Ala Val Cys Tyr Phe Gly Leu 195 200 205 Glu Gly Thr Pro Glu Asp Leu Asn Met Ile Pro Leu Glu Tyr Val Glu 210 215 220 Asn Ala Val Lys Trp Leu Lys Arg Gln Asp Thr Val Asp Glu Asn Lys 225 230 235 240 Ile Ala Ile Tyr Gly Arg Ser Lys Gly Gly Glu Leu Val Leu Leu Ala 245 250 255 Ala Ser Met Phe Lys Asp Ile Ala Cys Val Ile Ala Asn Thr Pro Ser 260 265 270 Cys Tyr Val Tyr Glu Gly Ile Lys Ser Asn Lys Leu Pro Ser His His 275 280 285 Ser Ser Trp Met Tyr Arg Gly Arg Glu Ile Pro Tyr Leu Lys Phe Asn 290 295 300 Phe His Ile Ile Leu Arg Leu Ile Ile Lys Met Met Lys Lys Glu Lys 305 310 315 320 Gly Ala Leu Ala Trp Met Tyr Lys Lys Leu Ile Glu Glu Gly Asp Arg 325 330 335 Asp Lys Ala Thr Ile Ala Leu Asp Lys Ile Asn Gly Ser Val Leu Met 340 345 350 Ile Ser Ser Ala Ala Asp Glu Ile Trp Pro Ser Lys Met His Ser Glu 355 360 365 Thr Val Cys Ser Ile Phe Glu Lys Ser His Phe Lys His Glu Tyr Lys 370 375 380 His Ile Thr Phe Ala Lys Ser Gly His Ile Leu Thr Val Pro Phe Gln 385 390 395 400 Ser Ile Tyr Pro Ser Glu Lys Tyr Pro Tyr Asp Val Glu Ser Trp Ala 405 410 415 Lys Ala Asn Met Asp Ser Trp Asn Glu Thr Ile Lys Phe Leu Glu Lys 420 425 430 Trp Ala Ser Lys 435 <210> SEQ ID NO 9 <211> LENGTH: 137 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 2, AGY75747.1 <400> SEQUENCE: 9 Met Tyr Ile Asn Glu Thr Lys Val Val Val Arg Tyr Ala Glu Thr Asp 1 5 10 15 Lys Met Gly Ile Val His His Ser Asn Tyr Tyr Ile Tyr Phe Glu Glu 20 25 30 Ala Arg Thr Gln Phe Ile Lys Lys Thr Gly Ile Ser Tyr Ser Gln Met 35 40 45 Glu Lys Asp Gly Ile Met Phe Pro Leu Val Glu Ser Asn Cys Arg Tyr 50 55 60 Leu Gln Gly Ala Lys Tyr Glu Asp Glu Leu Leu Ile Lys Thr Trp Ile 65 70 75 80 Lys Glu Leu Thr Pro Val Lys Ala Glu Phe Asn Tyr Ser Val Ile Arg 85 90 95 Glu Asn Asp Gln Lys Glu Ile Ala Lys Gly Ser Thr Leu His Ala Phe 100 105 110 Val Asn Asn Asn Phe Lys Ile Ile Asn Leu Lys Lys Asn His Thr Glu 115 120 125 Leu Phe Lys Lys Leu Gln Ser Leu Ile 130 135 <210> SEQ ID NO 10 <211> LENGTH: 128 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 3, AGY75999.1 <400> SEQUENCE: 10 Met Asp Phe Ser Lys Leu Phe Lys Val Gly Ser Thr Tyr Val Ser Glu 1 5 10 15 Tyr Ile Val Lys Pro Glu Asp Thr Ala Asn Phe Ile Gly Asn Asn Gly 20 25 30 Val Val Met Leu Ser Thr Pro Ala Met Ile Lys Tyr Met Glu Tyr Thr 35 40 45 Thr Leu His Ile Val Asp Asn Val Ile Pro Lys Asn Tyr Arg Pro Val 50 55 60 Gly Thr Lys Ile Asp Val Glu His Ile Lys Pro Ile Pro Ala Asn Met 65 70 75 80 Lys Val Val Val Lys Val Thr Leu Ile Ser Ile Glu Gly Lys Lys Leu 85 90 95 Arg Tyr Asn Val Glu Ala Phe Asn Glu Lys Asn Cys Lys Val Gly Phe 100 105 110 Gly Ile Tyr Glu Gln Gln Ile Val Asn Leu Glu Gln Phe Leu Asn Arg 115 120 125 <210> SEQ ID NO 11 <211> LENGTH: 436 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 1, ADK15695.1 <400> SEQUENCE: 11 Met Asn Asn Asp Asn Cys Thr Ile Lys Ile Thr Pro Glu Val Ser Arg 1 5 10 15 Val Asp Glu Pro Val Asp Ile Lys Ile Asn Gly Leu Pro Lys Asn Glu 20 25 30 Lys Val Ile Ile Arg Ala Val Ser Ser Asp Tyr Tyr Cys Ile Asn Ala 35 40 45 Ser Ile Leu Glu Ile Gly Asp Asn Thr Leu Trp Glu Ser Tyr Ala Val 50 55 60 Phe Glu Thr Asp Glu Cys Gly Asn Ile Asn Phe Glu Asn Ala Val Pro 65 70 75 80 Val Asp Gly Thr Tyr Ser Asn Cys Asp Lys Met Gly Leu Phe Tyr Ser 85 90 95 Met Arg Pro Lys Gln Ile Arg Lys Ser Lys Leu Ile Gln Lys Leu Ser 100 105 110 Ser Ile Asn Glu Asn Arg Lys Tyr Lys Ile Thr Phe Thr Val Glu Lys 115 120 125 Asn Gly Lys Ile Ile Gly Ser Lys Glu His Thr Arg Val Tyr Cys Asp 130 135 140 Asp Thr Ile Lys Ser Ile Asp Val Val Glu Lys Asn Leu Leu Ala Arg 145 150 155 160 Tyr Phe Thr Ser Lys Asp Asn Ile Lys His Pro Ala Ile Ile Val Leu 165 170 175 Ser Gly Ser Asp Gly Arg Ile Glu Lys Ala Gln Ala Ile Ala Glu Leu 180 185 190 Phe Ala Met Arg Gly Tyr Ser Ala Leu Ala Val Cys Tyr Phe Gly Leu 195 200 205 Glu Gly Thr Pro Glu Asp Leu Asn Met Ile Pro Leu Glu Tyr Val Glu 210 215 220 Asn Ala Val Lys Trp Leu Lys Arg Gln Asp Thr Val Asp Glu Asn Lys 225 230 235 240 Ile Ala Ile Tyr Gly Arg Ser Lys Gly Gly Glu Leu Val Leu Leu Ala 245 250 255 Ala Ser Met Phe Lys Asp Ile Ala Cys Val Ile Ala Asn Thr Pro Ser 260 265 270 Cys Tyr Val Tyr Glu Gly Ile Lys Ser Asn Lys Leu Pro Ser His His 275 280 285 Ser Ser Trp Met Tyr Arg Gly Arg Glu Ile Pro Tyr Leu Lys Phe Asn 290 295 300 Phe His Ile Ile Leu Arg Leu Ile Ile Lys Met Met Lys Lys Glu Lys 305 310 315 320 Gly Ala Leu Ala Trp Met Tyr Lys Lys Leu Ile Glu Glu Gly Asp Arg 325 330 335 Asp Lys Ala Thr Ile Ala Leu Asp Lys Ile Asn Gly Ser Val Leu Met 340 345 350 Ile Ser Ser Ala Ala Asp Glu Ile Trp Pro Ser Lys Met His Ser Glu 355 360 365 Thr Val Cys Ser Ile Phe Glu Lys Ser His Phe Lys His Glu Tyr Lys 370 375 380 His Ile Thr Phe Ala Lys Ser Gly His Ile Leu Thr Val Pro Phe Gln 385 390 395 400 Ser Ile Tyr Pro Ser Glu Lys Tyr Pro Tyr Asp Val Glu Ser Trp Ala 405 410 415 Lys Ala Asn Met Asp Ser Trp Asn Glu Thr Ile Lys Phe Leu Glu Lys 420 425 430 Trp Ala Ser Lys 435 <210> SEQ ID NO 12 <211> LENGTH: 137 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 2, ADK16655.1 <400> SEQUENCE: 12 Met Tyr Ile Asn Glu Thr Lys Val Val Val Arg Tyr Ala Glu Thr Asp 1 5 10 15 Lys Met Gly Ile Val His His Ser Asn Tyr Tyr Ile Tyr Phe Glu Glu 20 25 30 Ala Arg Thr Gln Phe Ile Lys Lys Thr Gly Ile Ser Tyr Ser Gln Met 35 40 45 Glu Lys Asp Gly Ile Met Phe Pro Leu Val Glu Ser Asn Cys Arg Tyr 50 55 60 Leu Gln Gly Ala Lys Tyr Glu Asp Glu Leu Leu Ile Lys Thr Trp Ile 65 70 75 80 Lys Glu Leu Thr Pro Val Lys Ala Glu Phe Asn Tyr Ser Val Ile Arg 85 90 95 Glu Asn Asp Gln Lys Glu Ile Ala Lys Gly Ser Thr Leu His Ala Phe 100 105 110 Val Asn Asn Asn Phe Lys Ile Ile Asn Leu Lys Lys Asn His Thr Glu 115 120 125 Leu Phe Lys Lys Leu Gln Ser Leu Ile 130 135 <210> SEQ ID NO 13 <211> LENGTH: 128 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 3, ADK16959.1 <400> SEQUENCE: 13 Met Asp Phe Ser Lys Leu Phe Lys Val Gly Ser Thr Tyr Val Ser Glu 1 5 10 15 Tyr Ile Val Lys Pro Glu Asp Thr Ala Asn Phe Ile Gly Asn Asn Gly 20 25 30 Val Val Met Leu Ser Thr Pro Ala Met Ile Lys Tyr Met Glu Tyr Thr 35 40 45 Thr Leu His Ile Val Asp Asn Val Ile Pro Lys Asn Tyr Arg Pro Val 50 55 60 Gly Thr Lys Ile Asp Val Glu His Ile Lys Pro Ile Pro Ala Asn Met 65 70 75 80 Lys Val Val Val Lys Val Thr Leu Ile Ser Ile Glu Gly Lys Lys Leu 85 90 95 Arg Tyr Asn Val Glu Ala Phe Asn Glu Lys Asn Cys Lys Val Gly Phe 100 105 110 Gly Ile Tyr Glu Gln Gln Ile Val Asn Leu Glu Gln Phe Leu Asn Arg 115 120 125 <210> SEQ ID NO 14 <211> LENGTH: 246 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Adc, WP_012059998.1 <400> SEQUENCE: 14 Met Leu Glu Ser Glu Val Ser Lys Gln Ile Thr Thr Pro Leu Ala Ala 1 5 10 15 Pro Ala Phe Pro Arg Gly Pro Tyr Arg Phe His Asn Arg Glu Tyr Leu 20 25 30 Asn Ile Ile Tyr Arg Thr Asp Leu Asp Ala Leu Arg Lys Ile Val Pro 35 40 45 Glu Pro Leu Glu Leu Asp Arg Ala Tyr Val Arg Phe Glu Met Met Ala 50 55 60 Met Pro Asp Thr Thr Gly Leu Gly Ser Tyr Thr Glu Cys Gly Gln Ala 65 70 75 80 Ile Pro Val Lys Tyr Asn Gly Val Lys Gly Asp Tyr Leu His Met Met 85 90 95 Tyr Leu Asp Asn Glu Pro Ala Ile Ala Val Gly Arg Glu Ser Ser Ala 100 105 110 Tyr Pro Lys Lys Leu Gly Tyr Pro Lys Leu Phe Val Asp Ser Asp Thr 115 120 125 Leu Val Gly Thr Leu Lys Tyr Gly Thr Leu Pro Val Ala Thr Ala Thr 130 135 140 Met Gly Tyr Lys His Glu Pro Leu Asp Leu Lys Glu Ala Tyr Ala Gln 145 150 155 160 Ile Ala Arg Pro Asn Phe Met Leu Lys Ile Ile Gln Gly Tyr Asp Gly 165 170 175 Lys Pro Arg Ile Cys Glu Leu Ile Cys Ala Glu Asn Thr Asp Ile Thr 180 185 190 Ile His Gly Ala Trp Thr Gly Ser Ala Arg Leu Gln Leu Phe Ser His 195 200 205 Ala Leu Ala Pro Leu Ala Asp Leu Pro Val Leu Glu Ile Val Ser Ala 210 215 220 Ser His Ile Leu Thr Asp Leu Thr Leu Gly Thr Pro Lys Val Val His 225 230 235 240 Asp Tyr Leu Ser Val Lys 245 <210> SEQ ID NO 15 <211> LENGTH: 548 <212> TYPE: PRT <213> ORGANISM: Lactococcus lactis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: KivD <400> SEQUENCE: 15 Met Tyr Thr Val Gly Asp Tyr Leu Leu Asp Arg Leu His Glu Leu Gly 1 5 10 15 Ile Glu Glu Ile Phe Gly Val Pro Gly Asp Tyr Asn Leu Gln Phe Leu 20 25 30 Asp Gln Ile Ile Ser His Lys Asp Met Lys Trp Val Gly Asn Ala Asn 35 40 45 Glu Leu Asn Ala Ser Tyr Met Ala Asp Gly Tyr Ala Arg Thr Lys Lys 50 55 60 Ala Ala Ala Phe Leu Thr Thr Phe Gly Val Gly Glu Leu Ser Ala Val 65 70 75 80 Asn Gly Leu Ala Gly Ser Tyr Ala Glu Asn Leu Pro Val Val Glu Ile 85 90 95 Val Gly Ser Pro Thr Ser Lys Val Gln Asn Glu Gly Lys Phe Val His 100 105 110 His Thr Leu Ala Asp Gly Asp Phe Lys His Phe Met Lys Met His Glu 115 120 125 Pro Val Thr Ala Ala Arg Thr Leu Leu Thr Ala Glu Asn Ala Thr Val 130 135 140 Glu Ile Asp Arg Val Leu Ser Ala Leu Leu Lys Glu Arg Lys Pro Val 145 150 155 160 Tyr Ile Asn Leu Pro Val Asp Val Ala Ala Ala Lys Ala Glu Lys Pro 165 170 175 Ser Leu Pro Leu Lys Lys Glu Asn Ser Thr Ser Asn Thr Ser Asp Gln 180 185 190 Glu Ile Leu Asn Lys Ile Gln Glu Ser Leu Lys Asn Ala Lys Lys Pro 195 200 205 Ile Val Ile Thr Gly His Glu Ile Ile Ser Phe Gly Leu Glu Lys Thr 210 215 220 Val Thr Gln Phe Ile Ser Lys Thr Lys Leu Pro Ile Thr Thr Leu Asn 225 230 235 240 Phe Gly Lys Ser Ser Val Asp Glu Ala Leu Pro Ser Phe Leu Gly Ile 245 250 255 Tyr Asn Gly Thr Leu Ser Glu Pro Asn Leu Lys Glu Phe Val Glu Ser 260 265 270 Ala Asp Phe Ile Leu Met Leu Gly Val Lys Leu Thr Asp Ser Ser Thr 275 280 285 Gly Ala Phe Thr His His Leu Asn Glu Asn Lys Met Ile Ser Leu Asn 290 295 300 Ile Asp Glu Gly Lys Ile Phe Asn Glu Arg Ile Gln Asn Phe Asp Phe 305 310 315 320 Glu Ser Leu Ile Ser Ser Leu Leu Asp Leu Ser Glu Ile Glu Tyr Lys 325 330 335 Gly Lys Tyr Ile Asp Lys Lys Gln Glu Asp Phe Val Pro Ser Asn Ala 340 345 350 Leu Leu Ser Gln Asp Arg Leu Trp Gln Ala Val Glu Asn Leu Thr Gln 355 360 365 Ser Asn Glu Thr Ile Val Ala Glu Gln Gly Thr Ser Phe Phe Gly Ala 370 375 380 Ser Ser Ile Phe Leu Lys Ser Lys Ser His Phe Ile Gly Gln Pro Leu 385 390 395 400 Trp Gly Ser Ile Gly Tyr Thr Phe Pro Ala Ala Leu Gly Ser Gln Ile 405 410 415 Ala Asp Lys Glu Ser Arg His Leu Leu Phe Ile Gly Asp Gly Ser Leu 420 425 430 Gln Leu Thr Val Gln Glu Leu Gly Leu Ala Ile Arg Glu Lys Ile Asn 435 440 445 Pro Ile Cys Phe Ile Ile Asn Asn Asp Gly Tyr Thr Val Glu Arg Glu 450 455 460 Ile His Gly Pro Asn Gln Ser Tyr Asn Asp Ile Pro Met Trp Asn Tyr 465 470 475 480 Ser Lys Leu Pro Glu Ser Phe Gly Ala Thr Glu Asp Arg Val Val Ser 485 490 495 Lys Ile Val Arg Thr Glu Asn Glu Phe Val Ser Val Met Lys Glu Ala 500 505 510 Gln Ala Asp Pro Asn Arg Met Tyr Trp Ile Glu Leu Ile Leu Ala Lys 515 520 525 Glu Gly Ala Pro Lys Val Leu Lys Lys Met Gly Lys Leu Phe Ala Glu 530 535 540 Gln Asn Lys Ser 545 <210> SEQ ID NO 16 <211> LENGTH: 351 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: SecAdh, AGY74782.1 <400> SEQUENCE: 16 Met Lys Gly Phe Ala Met Leu Gly Ile Asn Lys Leu Gly Trp Ile Glu 1 5 10 15 Lys Lys Asn Pro Val Pro Gly Pro Tyr Asp Ala Ile Val His Pro Leu 20 25 30 Ala Val Ser Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala 35 40 45 Leu Gly Asn Arg Glu Asn Met Ile Leu Gly His Glu Ala Val Gly Glu 50 55 60 Ile Ala Glu Val Gly Ser Glu Val Lys Asp Phe Lys Val Gly Asp Arg 65 70 75 80 Val Ile Val Pro Cys Thr Thr Pro Asp Trp Arg Ser Leu Glu Val Gln 85 90 95 Ala Gly Phe Gln Gln His Ser Asn Gly Met Leu Ala Gly Trp Lys Phe 100 105 110 Ser Asn Phe Lys Asp Gly Val Phe Ala Asp Tyr Phe His Val Asn Asp 115 120 125 Ala Asp Met Asn Leu Ala Ile Leu Pro Asp Glu Ile Pro Leu Glu Ser 130 135 140 Ala Val Met Met Thr Asp Met Met Thr Thr Gly Phe His Gly Ala Glu 145 150 155 160 Leu Ala Asp Ile Lys Met Gly Ser Ser Val Val Val Ile Gly Ile Gly 165 170 175 Ala Val Gly Leu Met Gly Ile Ala Gly Ser Lys Leu Arg Gly Ala Gly 180 185 190 Arg Ile Ile Gly Val Gly Ser Arg Pro Val Cys Val Glu Thr Ala Lys 195 200 205 Phe Tyr Gly Ala Thr Asp Ile Val Asn Tyr Lys Asn Gly Asp Ile Val 210 215 220 Glu Gln Ile Met Asp Leu Thr His Gly Lys Gly Val Asp Arg Val Ile 225 230 235 240 Met Ala Gly Gly Gly Ala Glu Thr Leu Ala Gln Ala Val Thr Met Val 245 250 255 Lys Pro Gly Gly Val Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp 260 265 270 Thr Leu Pro Ile Pro Arg Val Gln Trp Gly Cys Gly Met Ala His Lys 275 280 285 Thr Ile Arg Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Met 290 295 300 Leu Arg Asp Leu Val Leu Tyr Lys Arg Val Asp Leu Ser Lys Leu Val 305 310 315 320 Thr His Val Phe Asp Gly Ala Glu Asn Ile Glu Lys Ala Leu Leu Leu 325 330 335 Met Lys Asn Lys Pro Lys Asp Leu Ile Lys Ser Val Val Thr Phe 340 345 350 <210> SEQ ID NO 17 <211> LENGTH: 351 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: SecAdh, ADK15544.1 <400> SEQUENCE: 17 Met Lys Gly Phe Ala Met Leu Gly Ile Asn Lys Leu Gly Trp Ile Glu 1 5 10 15 Lys Lys Asn Pro Val Pro Gly Pro Tyr Asp Ala Ile Val His Pro Leu 20 25 30 Ala Val Ser Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala 35 40 45 Leu Gly Asn Arg Glu Asn Met Ile Leu Gly His Glu Ala Val Gly Glu 50 55 60 Ile Ala Glu Val Gly Ser Glu Val Lys Asp Phe Lys Val Gly Asp Arg 65 70 75 80 Val Ile Val Pro Cys Thr Thr Pro Asp Trp Arg Ser Leu Glu Val Gln 85 90 95 Ala Gly Phe Gln Gln His Ser Asn Gly Met Leu Ala Gly Trp Lys Phe 100 105 110 Ser Asn Phe Lys Asp Gly Val Phe Ala Asp Tyr Phe His Val Asn Asp 115 120 125 Ala Asp Met Asn Leu Ala Ile Leu Pro Asp Glu Ile Pro Leu Glu Ser 130 135 140 Ala Val Met Met Thr Asp Met Met Thr Thr Gly Phe His Gly Ala Glu 145 150 155 160 Leu Ala Asp Ile Lys Met Gly Ser Ser Val Val Val Ile Gly Ile Gly 165 170 175 Ala Val Gly Leu Met Gly Ile Ala Gly Ser Lys Leu Arg Gly Ala Gly 180 185 190 Arg Ile Ile Gly Val Gly Ser Arg Pro Val Cys Val Glu Thr Ala Lys 195 200 205 Phe Tyr Gly Ala Thr Asp Ile Val Asn Tyr Lys Asn Gly Asp Ile Val 210 215 220 Glu Gln Ile Met Asp Leu Thr His Gly Lys Gly Val Asp Arg Val Ile 225 230 235 240 Met Ala Gly Gly Gly Ala Glu Thr Leu Ala Gln Ala Val Thr Met Val 245 250 255 Lys Pro Gly Gly Val Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp 260 265 270 Thr Leu Pro Ile Pro Arg Val Gln Trp Gly Cys Gly Met Ala His Lys 275 280 285 Thr Ile Arg Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Met 290 295 300 Leu Arg Asp Leu Val Leu Tyr Lys Arg Val Asp Leu Ser Lys Leu Val 305 310 315 320 Thr His Val Phe Asp Gly Ala Glu Asn Ile Glu Lys Ala Leu Leu Leu 325 330 335 Met Lys Asn Lys Pro Lys Asp Leu Ile Lys Ser Val Val Thr Phe 340 345 350 <210> SEQ ID NO 18 <211> LENGTH: 351 <212> TYPE: PRT <213> ORGANISM: Clostridium ragsdalei <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: SecAdh, WP_013239134.1 <400> SEQUENCE: 18 Met Lys Gly Phe Ala Met Leu Gly Ile Asn Lys Leu Gly Trp Ile Glu 1 5 10 15 Lys Lys Asn Pro Val Pro Gly Pro Tyr Asp Ala Ile Val His Pro Leu 20 25 30 Ala Val Ser Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala 35 40 45 Leu Gly Asn Arg Glu Asn Met Ile Leu Gly His Glu Ala Val Gly Glu 50 55 60 Ile Ala Glu Val Gly Ser Glu Val Lys Asp Phe Lys Val Gly Asp Arg 65 70 75 80 Val Ile Val Pro Cys Thr Thr Pro Asp Trp Arg Ser Leu Glu Val Gln 85 90 95 Ala Gly Phe Gln Gln His Ser Asn Gly Met Leu Ala Gly Trp Lys Phe 100 105 110 Ser Asn Phe Lys Asp Gly Val Phe Ala Asp Tyr Phe His Val Asn Asp 115 120 125 Ala Asp Met Asn Leu Ala Ile Leu Pro Asp Glu Ile Pro Leu Glu Ser 130 135 140 Ala Val Met Met Thr Asp Met Met Thr Thr Gly Phe His Gly Ala Glu 145 150 155 160 Leu Ala Asp Ile Lys Met Gly Ser Ser Val Val Val Ile Gly Ile Gly 165 170 175 Ala Val Gly Leu Met Gly Ile Ala Gly Ser Lys Leu Arg Gly Ala Gly 180 185 190 Arg Ile Ile Gly Val Gly Ser Arg Pro Val Cys Val Glu Thr Ala Lys 195 200 205 Phe Tyr Gly Ala Thr Asp Ile Val Asn Tyr Lys Asn Gly Asp Ile Val 210 215 220 Glu Gln Ile Met Asp Leu Thr His Gly Lys Gly Val Asp Arg Val Ile 225 230 235 240 Met Ala Gly Gly Gly Ala Glu Thr Leu Ala Gln Ala Val Thr Met Val 245 250 255 Lys Pro Gly Gly Val Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp 260 265 270 Thr Leu Pro Ile Pro Arg Val Gln Trp Gly Cys Gly Met Ala His Lys 275 280 285 Thr Ile Arg Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Met 290 295 300 Leu Arg Asp Leu Val Leu Tyr Lys Arg Val Asp Leu Ser Lys Leu Val 305 310 315 320 Thr His Val Phe Asp Gly Ala Glu Asn Ile Glu Lys Ala Leu Leu Leu 325 330 335 Met Lys Asn Lys Pro Lys Asp Leu Ile Lys Ser Val Val Thr Phe 340 345 350 <210> SEQ ID NO 19 <211> LENGTH: 351 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: SecAdh, WP_026889046.1 <400> SEQUENCE: 19 Met Lys Gly Phe Ala Met Leu Gly Ile Asn Lys Leu Gly Trp Ile Glu 1 5 10 15 Lys Glu Arg Pro Val Ala Gly Ser Tyr Asp Ala Ile Val Arg Pro Leu 20 25 30 Ala Val Ser Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala 35 40 45 Leu Gly Asp Arg Lys Asn Met Ile Leu Gly His Glu Ala Val Gly Glu 50 55 60 Val Val Glu Val Gly Ser Glu Val Lys Asp Phe Lys Pro Gly Asp Arg 65 70 75 80 Val Ile Val Pro Cys Thr Thr Pro Asp Trp Arg Ser Leu Glu Val Gln 85 90 95 Ala Gly Phe Gln Gln His Ser Asn Gly Met Leu Ala Gly Trp Lys Phe 100 105 110 Ser Asn Phe Lys Asp Gly Val Phe Gly Glu Tyr Phe His Val Asn Asp 115 120 125 Ala Asp Met Asn Leu Ala Ile Leu Pro Lys Asp Met Pro Leu Glu Asn 130 135 140 Ala Val Met Ile Thr Asp Met Met Thr Thr Gly Phe His Gly Ala Glu 145 150 155 160 Leu Ala Asp Ile Gln Met Gly Ser Ser Val Val Val Ile Gly Ile Gly 165 170 175 Ala Val Gly Leu Met Gly Ile Ala Gly Ala Lys Leu Arg Gly Ala Gly 180 185 190 Arg Ile Ile Gly Val Gly Ser Arg Pro Val Cys Val Glu Thr Ala Lys 195 200 205 Phe Tyr Gly Ala Thr Asp Ile Leu Asn Tyr Lys Asn Gly His Ile Val 210 215 220 Asp Gln Val Met Lys Leu Thr Asn Gly Lys Gly Val Asp Arg Val Ile 225 230 235 240 Met Ala Gly Gly Gly Ser Glu Thr Leu Ser Gln Ala Val Ser Met Val 245 250 255 Lys Pro Gly Gly Ile Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp 260 265 270 Ala Leu Leu Ile Pro Arg Val Glu Trp Gly Cys Gly Met Ala His Lys 275 280 285 Thr Ile Lys Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Ala Glu Met 290 295 300 Leu Arg Asp Met Val Val Tyr Asn Arg Val Asp Leu Ser Lys Leu Val 305 310 315 320 Thr His Val Tyr His Gly Phe Asp His Ile Glu Glu Ala Leu Leu Leu 325 330 335 Met Lys Asp Lys Pro Lys Asp Leu Ile Lys Ala Val Val Ile Leu 340 345 350 <210> SEQ ID NO 20 <211> LENGTH: 352 <212> TYPE: PRT <213> ORGANISM: Thermoanaerobacter brokii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: SecAdh, 3FSR_A <400> SEQUENCE: 20 Met Lys Gly Phe Ala Met Leu Ser Ile Gly Lys Val Gly Trp Ile Glu 1 5 10 15 Lys Glu Lys Pro Ala Pro Gly Pro Phe Asp Ala Ile Val Arg Pro Leu 20 25 30 Ala Val Ala Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala 35 40 45 Ile Gly Glu Arg His Asn Met Ile Leu Gly His Glu Ala Val Gly Glu 50 55 60 Val Val Glu Val Gly Ser Glu Val Lys Asp Phe Lys Pro Gly Asp Arg 65 70 75 80 Val Val Val Pro Ala Ile Thr Pro Asp Trp Arg Thr Ser Glu Val Gln 85 90 95 Arg Gly Tyr His Gln His Ser Gly Gly Met Leu Ala Gly Trp Lys Phe 100 105 110 Ser Asn Val Lys Asp Gly Val Phe Gly Glu Phe Phe His Val Asn Asp 115 120 125 Ala Asp Met Asn Leu Ala His Leu Pro Lys Glu Ile Pro Leu Glu Ala 130 135 140 Ala Val Met Ile Pro Asp Met Met Thr Thr Gly Phe His Gly Ala Glu 145 150 155 160 Leu Ala Asp Ile Gln Met Gly Ser Ser Val Val Val Ile Gly Ile Gly 165 170 175 Ala Val Gly Leu Met Gly Ile Ala Gly Ala Lys Leu Arg Gly Ala Gly 180 185 190 Arg Ile Ile Gly Val Gly Ser Arg Pro Ile Cys Val Glu Ala Ala Lys 195 200 205 Phe Tyr Gly Ala Thr Asp Ile Leu Asn Tyr Lys Asn Gly His Ile Val 210 215 220 Asp Gln Val Met Lys Leu Thr Asn Gly Lys Gly Val Asp Arg Val Ile 225 230 235 240 Met Ala Gly Gly Gly Ser Glu Thr Leu Ser Gln Ala Val Ser Met Val 245 250 255 Lys Pro Gly Gly Ile Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp 260 265 270 Ala Leu Leu Ile Pro Arg Val Glu Trp Gly Cys Gly Met Ala His Lys 275 280 285 Thr Ile Lys Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Arg 290 295 300 Leu Ile Asp Leu Val Phe Tyr Lys Arg Val Asp Pro Ser Lys Leu Val 305 310 315 320 Thr His Val Phe Arg Gly Phe Asp Asn Ile Glu Lys Ala Phe Met Leu 325 330 335 Met Lys Asp Lys Pro Lys Asp Leu Ile Lys Pro Val Val Ile Leu Ala 340 345 350 <210> SEQ ID NO 21 <211> LENGTH: 520 <212> TYPE: PRT <213> ORGANISM: Mus musculus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: HMG-CoA synthase <400> SEQUENCE: 21 Met Pro Gly Ser Leu Pro Leu Asn Ala Glu Ala Cys Trp Pro Lys Asp 1 5 10 15 Val Gly Ile Val Ala Leu Glu Ile Tyr Phe Pro Ser Gln Tyr Val Asp 20 25 30 Gln Ala Glu Leu Glu Lys Tyr Asp Gly Val Asp Ala Gly Lys Tyr Thr 35 40 45 Ile Gly Leu Gly Gln Ala Arg Met Gly Phe Cys Thr Asp Arg Glu Asp 50 55 60 Ile Asn Ser Leu Cys Leu Thr Val Val Gln Lys Leu Met Glu Arg His 65 70 75 80 Ser Leu Ser Tyr Asp Cys Ile Gly Arg Leu Glu Val Gly Thr Glu Thr 85 90 95 Ile Ile Asp Lys Ser Lys Ser Val Lys Ser Lys Leu Met Gln Leu Phe 100 105 110 Glu Glu Ser Gly Asn Thr Asp Ile Glu Gly Ile Asp Thr Thr Asn Ala 115 120 125 Cys Tyr Gly Gly Thr Ala Ala Val Phe Asn Ala Val Asn Trp Val Glu 130 135 140 Ser Ser Ser Trp Asp Gly Arg Tyr Ala Leu Val Val Ala Gly Asp Ile 145 150 155 160 Ala Ile Tyr Ala Thr Gly Asn Ala Arg Pro Thr Gly Gly Val Gly Ala 165 170 175 Val Ala Leu Leu Ile Gly Pro Asn Ala Pro Leu Ile Phe Asp Arg Gly 180 185 190 Leu Arg Gly Thr His Met Gln His Ala Tyr Asp Phe Tyr Lys Pro Asp 195 200 205 Met Leu Ser Glu Tyr Pro Val Val Asp Gly Lys Leu Ser Ile Gln Cys 210 215 220 Tyr Leu Ser Ala Leu Asp Arg Cys Tyr Ser Val Tyr Arg Lys Lys Ile 225 230 235 240 Arg Ala Gln Trp Gln Lys Glu Gly Lys Asp Lys Asp Phe Thr Leu Asn 245 250 255 Asp Phe Gly Phe Met Ile Phe His Ser Pro Tyr Cys Lys Leu Val Gln 260 265 270 Lys Ser Leu Ala Arg Met Phe Leu Asn Asp Phe Leu Asn Asp Gln Asn 275 280 285 Arg Asp Lys Asn Ser Ile Tyr Ser Gly Leu Glu Ala Phe Gly Asp Val 290 295 300 Lys Leu Glu Asp Thr Tyr Phe Asp Arg Asp Val Glu Lys Ala Phe Met 305 310 315 320 Lys Ala Ser Ser Glu Leu Phe Asn Gln Lys Thr Lys Ala Ser Leu Leu 325 330 335 Val Ser Asn Gln Asn Gly Asn Met Tyr Thr Ser Ser Val Tyr Gly Ser 340 345 350 Leu Ala Ser Val Leu Ala Gln Tyr Ser Pro Gln Gln Leu Ala Gly Lys 355 360 365 Arg Val Gly Val Phe Ser Tyr Gly Ser Gly Leu Ala Ala Thr Leu Tyr 370 375 380 Ser Leu Lys Val Thr Gln Asp Ala Thr Pro Gly Ser Ala Leu Asp Lys 385 390 395 400 Ile Thr Ala Ser Leu Cys Asp Leu Lys Ser Arg Leu Asp Ser Arg Thr 405 410 415 Cys Val Ala Pro Asp Val Phe Ala Glu Asn Met Lys Leu Arg Glu Asp 420 425 430 Thr His His Leu Ala Asn Tyr Ile Pro Gln Cys Ser Ile Asp Ser Leu 435 440 445 Phe Glu Gly Thr Trp Tyr Leu Val Arg Val Asp Glu Lys His Arg Arg 450 455 460 Thr Tyr Ala Arg Arg Pro Phe Thr Asn Asp His Ser Leu Asp Glu Gly 465 470 475 480 Met Gly Leu Val His Ser Asn Thr Ala Thr Glu His Ile Pro Ser Pro 485 490 495 Ala Lys Lys Val Pro Arg Leu Pro Ala Thr Ser Ala Glu Ser Glu Ser 500 505 510 Ala Val Ile Ser Asn Gly Glu His 515 520 <210> SEQ ID NO 22 <211> LENGTH: 396 <212> TYPE: PRT <213> ORGANISM: Saccharomyces cerevisiae <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Mdd, CAA96324.1 <400> SEQUENCE: 22 Met Thr Val Tyr Thr Ala Ser Val Thr Ala Pro Val Asn Ile Ala Thr 1 5 10 15 Leu Lys Tyr Trp Gly Lys Arg Asp Thr Lys Leu Asn Leu Pro Thr Asn 20 25 30 Ser Ser Ile Ser Val Thr Leu Ser Gln Asp Asp Leu Arg Thr Leu Thr 35 40 45 Ser Ala Ala Thr Ala Pro Glu Phe Glu Arg Asp Thr Leu Trp Leu Asn 50 55 60 Gly Glu Pro His Ser Ile Asp Asn Glu Arg Thr Gln Asn Cys Leu Arg 65 70 75 80 Asp Leu Arg Gln Leu Arg Lys Glu Met Glu Ser Lys Asp Ala Ser Leu 85 90 95 Pro Thr Leu Ser Gln Trp Lys Leu His Ile Val Ser Glu Asn Asn Phe 100 105 110 Pro Thr Ala Ala Gly Leu Ala Ser Ser Ala Ala Gly Phe Ala Ala Leu 115 120 125 Val Ser Ala Ile Ala Lys Leu Tyr Gln Leu Pro Gln Ser Thr Ser Glu 130 135 140 Ile Ser Arg Ile Ala Arg Lys Gly Ser Gly Ser Ala Cys Arg Ser Leu 145 150 155 160 Phe Gly Gly Tyr Val Ala Trp Glu Met Gly Lys Ala Glu Asp Gly His 165 170 175 Asp Ser Met Ala Val Gln Ile Ala Asp Ser Ser Asp Trp Pro Gln Met 180 185 190 Lys Ala Cys Val Leu Val Val Ser Asp Ile Lys Lys Asp Val Ser Ser 195 200 205 Thr Gln Gly Met Gln Leu Thr Val Ala Thr Ser Glu Leu Phe Lys Glu 210 215 220 Arg Ile Glu His Val Val Pro Lys Arg Phe Glu Val Met Arg Lys Ala 225 230 235 240 Ile Val Glu Lys Asp Phe Ala Thr Phe Ala Lys Glu Thr Met Met Asp 245 250 255 Ser Asn Ser Phe His Ala Thr Cys Leu Asp Ser Phe Pro Pro Ile Phe 260 265 270 Tyr Met Asn Asp Thr Ser Lys Arg Ile Ile Ser Trp Cys His Thr Ile 275 280 285 Asn Gln Phe Tyr Gly Glu Thr Ile Val Ala Tyr Thr Phe Asp Ala Gly 290 295 300 Pro Asn Ala Val Leu Tyr Tyr Leu Ala Glu Asn Glu Ser Lys Leu Phe 305 310 315 320 Ala Phe Ile Tyr Lys Leu Phe Gly Ser Val Pro Gly Trp Asp Lys Lys 325 330 335 Phe Thr Thr Glu Gln Leu Glu Ala Phe Asn His Gln Phe Glu Ser Ser 340 345 350 Asn Phe Thr Ala Arg Glu Leu Asp Leu Glu Leu Gln Lys Asp Val Ala 355 360 365 Arg Val Ile Leu Thr Gln Val Gly Ser Gly Pro Gln Glu Thr Asn Glu 370 375 380 Ser Leu Ile Asp Ala Lys Thr Gly Leu Pro Lys Glu 385 390 395 <210> SEQ ID NO 23 <211> LENGTH: 324 <212> TYPE: PRT <213> ORGANISM: Picrophilus torridus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Mdd, WP_011178157.1 <400> SEQUENCE: 23 Met Glu Asn Tyr Asn Val Lys Thr Arg Ala Phe Pro Thr Ile Gly Ile 1 5 10 15 Ile Leu Leu Gly Gly Ile Ser Asp Lys Lys Asn Arg Ile Pro Leu His 20 25 30 Thr Thr Ala Gly Ile Ala Tyr Thr Gly Ile Asn Asn Asp Val Tyr Thr 35 40 45 Glu Thr Lys Leu Tyr Val Ser Lys Asp Glu Lys Cys Tyr Ile Asp Gly 50 55 60 Lys Glu Ile Asp Leu Asn Ser Asp Arg Ser Pro Ser Lys Val Ile Asp 65 70 75 80 Lys Phe Lys His Glu Ile Leu Met Arg Val Asn Leu Asp Asp Glu Asn 85 90 95 Asn Leu Ser Ile Asp Ser Arg Asn Phe Asn Ile Leu Ser Gly Ser Ser 100 105 110 Asp Ser Gly Ala Ala Ala Leu Gly Glu Cys Ile Glu Ser Ile Phe Glu 115 120 125 Tyr Asn Ile Asn Ile Phe Thr Phe Glu Asn Asp Leu Gln Arg Ile Ser 130 135 140 Glu Ser Val Gly Arg Ser Leu Tyr Gly Gly Leu Thr Val Asn Tyr Ala 145 150 155 160 Asn Gly Arg Glu Ser Leu Thr Glu Pro Leu Leu Glu Pro Glu Ala Phe 165 170 175 Asn Asn Phe Thr Ile Ile Gly Ala His Phe Asn Ile Asp Arg Lys Pro 180 185 190 Ser Asn Glu Ile His Glu Asn Ile Ile Lys His Glu Asn Tyr Arg Glu 195 200 205 Arg Ile Lys Ser Ala Glu Arg Lys Ala Lys Lys Leu Glu Glu Leu Ser 210 215 220 Arg Asn Ala Asn Ile Lys Gly Ile Phe Glu Leu Ala Glu Ser Asp Thr 225 230 235 240 Val Glu Tyr His Lys Met Leu His Asp Val Gly Val Asp Ile Ile Asn 245 250 255 Asp Arg Met Glu Asn Leu Ile Glu Arg Val Lys Glu Met Lys Asn Asn 260 265 270 Phe Trp Asn Ser Tyr Ile Val Thr Gly Gly Pro Asn Val Phe Val Ile 275 280 285 Thr Glu Lys Lys Asp Val Asp Lys Ala Met Glu Gly Leu Asn Asp Leu 290 295 300 Cys Asp Asp Ile Arg Leu Leu Lys Val Ala Gly Lys Pro Gln Val Ile 305 310 315 320 Ser Lys Asn Phe <210> SEQ ID NO 24 <211> LENGTH: 460 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: CimA, AGY76958.1 <400> SEQUENCE: 24 Met Lys Lys Ser Ser Tyr Glu Tyr Lys Leu Asn Asn Val Asp Ser Pro 1 5 10 15 Asn Phe Tyr Lys Asn Ile Phe Pro Tyr Asp Glu Ile Pro Lys Ile Asn 20 25 30 Phe Asn Gly Val Gln Ile Pro Lys Asp Leu Pro Glu Asn Ile Tyr Ile 35 40 45 Thr Asp Thr Thr Phe Arg Asp Gly Gln Gln Ser Met Pro Pro Tyr Thr 50 55 60 Thr Glu Gln Ile Ile Arg Ile Phe Asp Tyr Leu His Asn Leu Asp Asn 65 70 75 80 Asn Ser Gly Ile Ile Lys Gln Thr Glu Phe Phe Leu Tyr Thr Glu Lys 85 90 95 Asp Arg Lys Ala Ala Gln Val Cys Met Glu Arg Gly Tyr Glu Phe Pro 100 105 110 Glu Val Thr Ser Trp Ile Arg Ala Asn Lys Glu Asp Phe Lys Leu Val 115 120 125 Lys Gln Met Gly Ile Lys Glu Thr Gly Met Leu Met Ser Cys Ser Asp 130 135 140 Tyr His Ile Phe Lys Lys Leu Arg Lys Thr Arg Lys Glu Thr Met Asp 145 150 155 160 Met Tyr Leu Gly Ile Val Lys Glu Ala Leu Asp Asn Gly Ile Arg Pro 165 170 175 Arg Cys His Leu Glu Asp Ile Thr Arg Ala Asp Phe Tyr Gly Phe Val 180 185 190 Val Pro Leu Val Asn Lys Leu Met Glu Leu Ser Lys Gln Ser Gly Ile 195 200 205 Pro Ile Lys Ile Arg Ala Cys Asp Thr Leu Gly Leu Gly Val Ser Tyr 210 215 220 Ser Gly Val Glu Leu Pro Arg Ser Val Gln Ala Ile Met Tyr Gly Leu 225 230 235 240 Arg Asn Asn Cys Gly Val Pro Ser Glu Cys Ile Glu Trp His Gly His 245 250 255 Asn Asp Phe Tyr Ala Val Val Asn Asn Ser Thr Thr Ala Trp Leu Tyr 260 265 270 Gly Ala Ser Ala Val Asn Thr Ser Phe Leu Gly Ile Gly Glu Arg Thr 275 280 285 Gly Asn Cys Pro Leu Glu Ala Met Ile Phe Glu Tyr Gly Gln Ile Lys 290 295 300 Gly Asn Thr Lys Asn Met Lys Leu Glu Val Ile Thr Glu Leu Ser Glu 305 310 315 320 Tyr Phe Lys Lys Glu Met Glu Tyr Ala Val Pro Pro Arg Thr Pro Phe 325 330 335 Val Gly Lys Glu Phe Asn Val Thr Arg Ala Gly Ile His Ala Asp Gly 340 345 350 Ile Leu Lys Asp Glu Glu Ile Tyr Asn Ile Phe Asp Thr Asp Lys Ile 355 360 365 Leu Gly Arg Pro Val Val Val Ala Val Asn Gln Tyr Ser Gly His Ala 370 375 380 Gly Ile Ala Ala Trp Ile Asn Thr Tyr Tyr Arg Leu Lys Asp Glu Glu 385 390 395 400 Lys Ile Asp Lys Trp Asp Thr Arg Ile Ala Lys Ile Lys Glu Trp Val 405 410 415 Asp Glu Gln Tyr Lys Ala Gly Arg Thr Ser Ile Ile Gly Asn Asp Glu 420 425 430 Leu Glu Leu Leu Val Asp Lys Met Leu Pro Asp Ile Ser Gln Lys Lys 435 440 445 Lys Lys Glu Leu Ala Arg Val Asp Thr Arg Phe Ile 450 455 460 <210> SEQ ID NO 25 <211> LENGTH: 491 <212> TYPE: PRT <213> ORGANISM: Methanocaldococcus jannaschii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: CimA, NP_248395.1 <400> SEQUENCE: 25 Met Met Val Arg Ile Phe Asp Thr Thr Leu Arg Asp Gly Glu Gln Thr 1 5 10 15 Pro Gly Val Ser Leu Thr Pro Asn Asp Lys Leu Glu Ile Ala Lys Lys 20 25 30 Leu Asp Glu Leu Gly Val Asp Val Ile Glu Ala Gly Ser Ala Ile Thr 35 40 45 Ser Lys Gly Glu Arg Glu Gly Ile Lys Leu Ile Thr Lys Glu Gly Leu 50 55 60 Asn Ala Glu Ile Cys Ser Phe Val Arg Ala Leu Pro Val Asp Ile Asp 65 70 75 80 Ala Ala Leu Glu Cys Asp Val Asp Ser Val His Leu Val Val Pro Thr 85 90 95 Ser Pro Ile His Met Lys Tyr Lys Leu Arg Lys Thr Glu Asp Glu Val 100 105 110 Leu Glu Thr Ala Leu Lys Ala Val Glu Tyr Ala Lys Glu His Gly Leu 115 120 125 Ile Val Glu Leu Ser Ala Glu Asp Ala Thr Arg Ser Asp Val Asn Phe 130 135 140 Leu Ile Lys Leu Phe Asn Glu Gly Glu Lys Val Gly Ala Asp Arg Val 145 150 155 160 Cys Val Cys Asp Thr Val Gly Val Leu Thr Pro Gln Lys Ser Gln Glu 165 170 175 Leu Phe Lys Lys Ile Thr Glu Asn Val Asn Leu Pro Val Ser Val His 180 185 190 Cys His Asn Asp Phe Gly Met Ala Thr Ala Asn Thr Cys Ser Ala Val 195 200 205 Leu Gly Gly Ala Val Gln Cys His Val Thr Val Asn Gly Ile Gly Glu 210 215 220 Arg Ala Gly Asn Ala Ser Leu Glu Glu Val Val Ala Ala Leu Lys Ile 225 230 235 240 Leu Tyr Gly Tyr Asp Thr Lys Ile Lys Met Glu Lys Leu Tyr Glu Val 245 250 255 Ser Arg Ile Val Ser Arg Leu Met Lys Leu Pro Val Pro Pro Asn Lys 260 265 270 Ala Ile Val Gly Asp Asn Ala Phe Ala His Glu Ala Gly Ile His Val 275 280 285 Asp Gly Leu Ile Lys Asn Thr Glu Thr Tyr Glu Pro Ile Lys Pro Glu 290 295 300 Met Val Gly Asn Arg Arg Arg Ile Ile Leu Gly Lys His Ser Gly Arg 305 310 315 320 Lys Ala Leu Lys Tyr Lys Leu Asp Leu Met Gly Ile Asn Val Ser Asp 325 330 335 Glu Gln Leu Asn Lys Ile Tyr Glu Arg Val Lys Glu Phe Gly Asp Leu 340 345 350 Gly Lys Tyr Ile Ser Asp Ala Asp Leu Leu Ala Ile Val Arg Glu Val 355 360 365 Thr Gly Lys Leu Val Glu Glu Lys Ile Lys Leu Asp Glu Leu Thr Val 370 375 380 Val Ser Gly Asn Lys Ile Thr Pro Ile Ala Ser Val Lys Leu His Tyr 385 390 395 400 Lys Gly Glu Asp Ile Thr Leu Ile Glu Thr Ala Tyr Gly Val Gly Pro 405 410 415 Val Asp Ala Ala Ile Asn Ala Val Arg Lys Ala Ile Ser Gly Val Ala 420 425 430 Asp Ile Lys Leu Val Glu Tyr Arg Val Glu Ala Ile Gly Gly Gly Thr 435 440 445 Asp Ala Leu Ile Glu Val Val Val Lys Leu Arg Lys Gly Thr Glu Ile 450 455 460 Val Glu Val Arg Lys Ser Asp Ala Asp Ile Ile Arg Ala Ser Val Asp 465 470 475 480 Ala Val Met Glu Gly Ile Asn Met Leu Leu Asn 485 490 <210> SEQ ID NO 26 <211> LENGTH: 421 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuC, WP_023162955.1 <400> SEQUENCE: 26 Met Gly Met Thr Met Thr Gln Lys Ile Leu Ala His His Ala Lys Met 1 5 10 15 Asp Glu Val Lys Ala Gly Gln Leu Ile Lys Val Lys Leu Asp Leu Val 20 25 30 Leu Gly Asn Asp Ile Thr Thr Pro Val Ala Ile Asn Glu Phe Asn Lys 35 40 45 Ile Gly Leu Asn Asn Val Phe Asp Lys Asn Lys Ile Ala Ile Val Pro 50 55 60 Asp His Phe Thr Pro Asn Lys Asp Ile Lys Ser Ala Glu Gln Cys Lys 65 70 75 80 Tyr Val Arg Glu Phe Val Lys Lys Met Glu Ile Lys Asn Tyr Phe Glu 85 90 95 Val Gly Arg Met Gly Ile Glu His Ala Leu Ile Pro Glu Lys Gly Leu 100 105 110 Ala Val Cys Gly Asp Val Val Ile Gly Ala Asp Ser His Thr Cys Thr 115 120 125 Tyr Gly Ala Leu Gly Ala Phe Ser Thr Gly Ile Gly Ser Thr Asp Met 130 135 140 Ala Ala Gly Met Ala Thr Gly Glu Ala Trp Phe Lys Val Pro Glu Ala 145 150 155 160 Ile Lys Phe Val Leu Lys Gly Lys Leu Thr Lys Trp Val Ser Gly Lys 165 170 175 Asp Val Ile Leu His Ile Ile Gly Met Ile Gly Val Asp Gly Ala Leu 180 185 190 Tyr Lys Ser Met Glu Phe Thr Gly Glu Gly Val Ser Ser Leu Thr Met 195 200 205 Asp Asp Arg Phe Thr Ile Cys Asn Met Ala Ile Glu Ala Gly Ala Lys 210 215 220 Asn Gly Ile Phe Pro Val Asp Glu Asn Thr Ile Asn Tyr Val Lys Glu 225 230 235 240 His Ser Lys Lys Asn Tyr Thr Val Tyr Glu Ala Asp Ser Asp Ala Glu 245 250 255 Tyr Ser Gln Val Ile Glu Ile Asp Leu Ser Lys Ile Arg Pro Thr Val 260 265 270 Ala Phe Pro His Ile Pro Glu Asn Thr Lys Thr Ile Asp Glu Val Gly 275 280 285 Asp Ile Arg Ile Asp Gln Val Val Ile Gly Ser Cys Thr Asn Gly Arg 290 295 300 Ile Gly Asp Leu Arg Ala Ala Ala Ser Ile Leu Lys Gly Arg Lys Val 305 310 315 320 Asn Glu Asn Val Arg Ala Ile Ile Phe Pro Ala Thr Gln Ala Ile Tyr 325 330 335 Leu Gln Ala Met Lys Glu Gly Leu Ile Glu Ile Phe Ile Glu Ala Gly 340 345 350 Ala Val Val Ser Thr Pro Thr Cys Gly Pro Cys Leu Gly Gly His Met 355 360 365 Gly Ile Leu Ala Glu Gly Glu Arg Ala Val Ser Thr Thr Asn Arg Asn 370 375 380 Phe Val Gly Arg Met Gly His Val Lys Ser Glu Val Tyr Leu Ala Ser 385 390 395 400 Pro Glu Val Ala Ala Ala Ser Ala Val Thr Gly Lys Ile Ser Ser Pro 405 410 415 Glu Glu Val Val Lys 420 <210> SEQ ID NO 27 <211> LENGTH: 164 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuD, AGY77204.1 <400> SEQUENCE: 27 Met Ile Lys Gly Lys Ala Ile Lys Tyr Gly Asp Asn Val Asp Thr Asp 1 5 10 15 Val Ile Ile Pro Ala Arg Tyr Leu Asn Thr Ser Asp His Lys Glu Leu 20 25 30 Ala Ser His Cys Met Glu Asp Ile Asp Lys Asp Phe Ser Lys Lys Ile 35 40 45 Ser Lys Gly Asp Ile Met Ile Ala Gly Lys Asn Phe Gly Cys Gly Ser 50 55 60 Ser Arg Glu His Ala Pro Ile Ala Ile Lys Ala Ser Gly Ile Ser Cys 65 70 75 80 Ile Ile Ala Glu Thr Phe Ala Arg Ile Phe Phe Arg Asn Ser Ile Asn 85 90 95 Ile Gly Leu Pro Ile Met Glu Cys Glu Glu Ala Ala Lys Asp Ile Asp 100 105 110 Glu Lys Asp Glu Val Ser Val Asp Thr Val Ser Gly Val Ile Thr Asn 115 120 125 Ile Thr Lys Asn Lys Thr Tyr Lys Ala Val Pro Phe Pro Glu Phe Met 130 135 140 His Lys Ile Ile Lys Ser Glu Gly Leu Ile Asn Tyr Ile Lys Glu Glu 145 150 155 160 Val Glu Asn Lys <210> SEQ ID NO 28 <211> LENGTH: 466 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuC, NP_414614.1 <400> SEQUENCE: 28 Met Ala Lys Thr Leu Tyr Glu Lys Leu Phe Asp Ala His Val Val Tyr 1 5 10 15 Glu Ala Glu Asn Glu Thr Pro Leu Leu Tyr Ile Asp Arg His Leu Val 20 25 30 His Glu Val Thr Ser Pro Gln Ala Phe Asp Gly Leu Arg Ala His Gly 35 40 45 Arg Pro Val Arg Gln Pro Gly Lys Thr Phe Ala Thr Met Asp His Asn 50 55 60 Val Ser Thr Gln Thr Lys Asp Ile Asn Ala Cys Gly Glu Met Ala Arg 65 70 75 80 Ile Gln Met Gln Glu Leu Ile Lys Asn Cys Lys Glu Phe Gly Val Glu 85 90 95 Leu Tyr Asp Leu Asn His Pro Tyr Gln Gly Ile Val His Val Met Gly 100 105 110 Pro Glu Gln Gly Val Thr Leu Pro Gly Met Thr Ile Val Cys Gly Asp 115 120 125 Ser His Thr Ala Thr His Gly Ala Phe Gly Ala Leu Ala Phe Gly Ile 130 135 140 Gly Thr Ser Glu Val Glu His Val Leu Ala Thr Gln Thr Leu Lys Gln 145 150 155 160 Gly Arg Ala Lys Thr Met Lys Ile Glu Val Gln Gly Lys Ala Ala Pro 165 170 175 Gly Ile Thr Ala Lys Asp Ile Val Leu Ala Ile Ile Gly Lys Thr Gly 180 185 190 Ser Ala Gly Gly Thr Gly His Val Val Glu Phe Cys Gly Glu Ala Ile 195 200 205 Arg Asp Leu Ser Met Glu Gly Arg Met Thr Leu Cys Asn Met Ala Ile 210 215 220 Glu Met Gly Ala Lys Ala Gly Leu Val Ala Pro Asp Glu Thr Thr Phe 225 230 235 240 Asn Tyr Val Lys Gly Arg Leu His Ala Pro Lys Gly Lys Asp Phe Asp 245 250 255 Asp Ala Val Ala Tyr Trp Lys Thr Leu Gln Thr Asp Glu Gly Ala Thr 260 265 270 Phe Asp Thr Val Val Thr Leu Gln Ala Glu Glu Ile Ser Pro Gln Val 275 280 285 Thr Trp Gly Thr Asn Pro Gly Gln Val Ile Ser Val Asn Asp Asn Ile 290 295 300 Pro Asp Pro Ala Ser Phe Ala Asp Pro Val Glu Arg Ala Ser Ala Glu 305 310 315 320 Lys Ala Leu Ala Tyr Met Gly Leu Lys Pro Gly Ile Pro Leu Thr Glu 325 330 335 Val Ala Ile Asp Lys Val Phe Ile Gly Ser Cys Thr Asn Ser Arg Ile 340 345 350 Glu Asp Leu Arg Ala Ala Ala Glu Ile Ala Lys Gly Arg Lys Val Ala 355 360 365 Pro Gly Val Gln Ala Leu Val Val Pro Gly Ser Gly Pro Val Lys Ala 370 375 380 Gln Ala Glu Ala Glu Gly Leu Asp Lys Ile Phe Ile Glu Ala Gly Phe 385 390 395 400 Glu Trp Arg Leu Pro Gly Cys Ser Met Cys Leu Ala Met Asn Asn Asp 405 410 415 Arg Leu Asn Pro Gly Glu Arg Cys Ala Ser Thr Ser Asn Arg Asn Phe 420 425 430 Glu Gly Arg Gln Gly Arg Gly Gly Arg Thr His Leu Val Ser Pro Ala 435 440 445 Met Ala Ala Ala Ala Ala Val Thr Gly His Phe Ala Asp Ile Arg Asn 450 455 460 Ile Lys 465 <210> SEQ ID NO 29 <211> LENGTH: 201 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuD, NP_414613.1 <400> SEQUENCE: 29 Met Ala Glu Lys Phe Ile Lys His Thr Gly Leu Val Val Pro Leu Asp 1 5 10 15 Ala Ala Asn Val Asp Thr Asp Ala Ile Ile Pro Lys Gln Phe Leu Gln 20 25 30 Lys Val Thr Arg Thr Gly Phe Gly Ala His Leu Phe Asn Asp Trp Arg 35 40 45 Phe Leu Asp Glu Lys Gly Gln Gln Pro Asn Pro Asp Phe Val Leu Asn 50 55 60 Phe Pro Gln Tyr Gln Gly Ala Ser Ile Leu Leu Ala Arg Glu Asn Phe 65 70 75 80 Gly Cys Gly Ser Ser Arg Glu His Ala Pro Trp Ala Leu Thr Asp Tyr 85 90 95 Gly Phe Lys Val Val Ile Ala Pro Ser Phe Ala Asp Ile Phe Tyr Gly 100 105 110 Asn Ser Phe Asn Asn Gln Leu Leu Pro Val Lys Leu Ser Asp Ala Glu 115 120 125 Val Asp Glu Leu Phe Ala Leu Val Lys Ala Asn Pro Gly Ile His Phe 130 135 140 Asp Val Asp Leu Glu Ala Gln Glu Val Lys Ala Gly Glu Lys Thr Tyr 145 150 155 160 Arg Phe Thr Ile Asp Ala Phe Arg Arg His Cys Met Met Asn Gly Leu 165 170 175 Asp Ser Ile Gly Leu Thr Leu Gln His Asp Asp Ala Ile Ala Ala Tyr 180 185 190 Glu Ala Lys Gln Pro Ala Phe Met Asn 195 200 <210> SEQ ID NO 30 <211> LENGTH: 354 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuB, WP_023162957.1 <400> SEQUENCE: 30 Met Lys Ile Ala Ile Ile Pro Gly Asp Gly Ile Gly Lys Glu Ile Ile 1 5 10 15 Glu Gln Ala Lys Lys Val Leu Lys Ala Ala Ser Ala Lys Tyr Asn Phe 20 25 30 Asp Phe Glu Cys Glu Glu Val Leu Leu Gly Gly Ala Ala Val Asp Ala 35 40 45 Thr Gly Val Pro Leu Pro Asp Lys Thr Val Glu Val Cys Lys Lys Ser 50 55 60 Asp Ala Val Leu Leu Gly Ala Val Gly Gly Pro Lys Trp Asp Ser Leu 65 70 75 80 Pro Ser Lys Leu Arg Pro Glu Ala Gly Leu Leu Gly Ile Arg Lys Ala 85 90 95 Leu Gly Val Phe Ala Asn Leu Arg Pro Ala Ile Leu Phe Pro Glu Leu 100 105 110 Ile Ala Ala Ser Asn Leu Lys Pro Glu Val Leu Gly Gly Gly Leu Asp 115 120 125 Ile Met Ile Val Arg Glu Leu Ile Gly Gly Ala Tyr Phe Gly Glu Lys 130 135 140 Asn Arg Ile Asp Ile Glu Gly Gly Lys Lys Ala Trp Asp Thr Ile Ser 145 150 155 160 Tyr Thr Ser Phe Glu Ile Asp Arg Ile Thr Arg Lys Ala Phe Glu Ile 165 170 175 Ala Arg Lys Arg Ser Asn Arg Leu Thr Leu Val Asp Lys Ala Asn Val 180 185 190 Leu Glu Ser Ser Lys Leu Trp Arg Glu Val Val Gly Asn Ile Ala Lys 195 200 205 Glu Tyr Glu Asp Val Glu Ile Asn Tyr Met Tyr Val Asp Asn Ala Ser 210 215 220 Met Gln Leu Ile Arg Asp Pro Lys Gln Phe Asp Val Ile Leu Thr Glu 225 230 235 240 Asn Met Phe Gly Asp Ile Leu Ser Asp Glu Ala Ser Met Leu Thr Gly 245 250 255 Ser Leu Gly Met Leu Pro Ser Ala Ser Val Arg Gly Asp Ser Phe Gly 260 265 270 Leu Tyr Glu Pro Val His Gly Ser Ala Pro Asp Ile Ala Gly Gln Asn 275 280 285 Lys Ala Asn Pro Ile Gly Thr Ile Met Ser Val Ala Met Met Leu Lys 290 295 300 Tyr Ser Phe Asp Met Glu Gln Ala Tyr Val Asp Ile Lys Asn Ala Ile 305 310 315 320 Ser Lys Val Leu Lys Glu Gly Tyr Arg Thr Gly Asp Ile Ala Lys Glu 325 330 335 Asp Ser Lys Leu Val Gly Thr Glu Glu Met Gly Asp Leu Ile Val Lys 340 345 350 Asn Leu <210> SEQ ID NO 31 <211> LENGTH: 363 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuB, NP_414615.4 <400> SEQUENCE: 31 Met Ser Lys Asn Tyr His Ile Ala Val Leu Pro Gly Asp Gly Ile Gly 1 5 10 15 Pro Glu Val Met Thr Gln Ala Leu Lys Val Leu Asp Ala Val Arg Asn 20 25 30 Arg Phe Ala Met Arg Ile Thr Thr Ser His Tyr Asp Val Gly Gly Ala 35 40 45 Ala Ile Asp Asn His Gly Gln Pro Leu Pro Pro Ala Thr Val Glu Gly 50 55 60 Cys Glu Gln Ala Asp Ala Val Leu Phe Gly Ser Val Gly Gly Pro Lys 65 70 75 80 Trp Glu His Leu Pro Pro Asp Gln Gln Pro Glu Arg Gly Ala Leu Leu 85 90 95 Pro Leu Arg Lys His Phe Lys Leu Phe Ser Asn Leu Arg Pro Ala Lys 100 105 110 Leu Tyr Gln Gly Leu Glu Ala Phe Cys Pro Leu Arg Ala Asp Ile Ala 115 120 125 Ala Asn Gly Phe Asp Ile Leu Cys Val Arg Glu Leu Thr Gly Gly Ile 130 135 140 Tyr Phe Gly Gln Pro Lys Gly Arg Glu Gly Ser Gly Gln Tyr Glu Lys 145 150 155 160 Ala Phe Asp Thr Glu Val Tyr His Arg Phe Glu Ile Glu Arg Ile Ala 165 170 175 Arg Ile Ala Phe Glu Ser Ala Arg Lys Arg Arg His Lys Val Thr Ser 180 185 190 Ile Asp Lys Ala Asn Val Leu Gln Ser Ser Ile Leu Trp Arg Glu Ile 195 200 205 Val Asn Glu Ile Ala Thr Glu Tyr Pro Asp Val Glu Leu Ala His Met 210 215 220 Tyr Ile Asp Asn Ala Thr Met Gln Leu Ile Lys Asp Pro Ser Gln Phe 225 230 235 240 Asp Val Leu Leu Cys Ser Asn Leu Phe Gly Asp Ile Leu Ser Asp Glu 245 250 255 Cys Ala Met Ile Thr Gly Ser Met Gly Met Leu Pro Ser Ala Ser Leu 260 265 270 Asn Glu Gln Gly Phe Gly Leu Tyr Glu Pro Ala Gly Gly Ser Ala Pro 275 280 285 Asp Ile Ala Gly Lys Asn Ile Ala Asn Pro Ile Ala Gln Ile Leu Ser 290 295 300 Leu Ala Leu Leu Leu Arg Tyr Ser Leu Asp Ala Asp Asp Ala Ala Cys 305 310 315 320 Ala Ile Glu Arg Ala Ile Asn Arg Ala Leu Glu Glu Gly Ile Arg Thr 325 330 335 Gly Asp Leu Ala Arg Gly Ala Ala Ala Val Ser Thr Asp Glu Met Gly 340 345 350 Asp Ile Ile Ala Arg Tyr Val Ala Glu Gly Val 355 360 <210> SEQ ID NO 32 <211> LENGTH: 536 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvB, AGY74359.1 <400> SEQUENCE: 32 Met Lys Ala Ala Glu Ala Val Ile Gln Cys Leu Lys Lys Glu Asn Val 1 5 10 15 Asn Met Val Phe Gly Tyr Pro Gly Ala Ala Val Val Pro Ile Tyr Glu 20 25 30 Ala Leu Arg Lys Ser Asp Val Lys His Ile Leu Val Arg Gln Glu Gln 35 40 45 Ala Ala Gly His Ser Ala Ser Gly Tyr Ala Arg Ser Thr Gly Glu Val 50 55 60 Gly Val Cys Ile Val Thr Ser Gly Pro Gly Ala Thr Asn Leu Ile Thr 65 70 75 80 Ala Ile Ala Ala Ala Tyr Met Asp Ser Ile Pro Leu Val Val Ile Thr 85 90 95 Gly Gln Val Lys Ser Thr Leu Ile Gly Arg Asp Val Phe Gln Glu Leu 100 105 110 Asp Ile Thr Gly Ala Thr Glu Ser Phe Thr Lys Tyr Asn Phe Leu Val 115 120 125 Arg Asp Ala Lys Ser Ile Pro Lys Thr Ile Lys Glu Ala Phe Tyr Ile 130 135 140 Ala Glu Thr Gly Arg Lys Gly Pro Val Leu Val Asp Ile Pro Met Asp 145 150 155 160 Ile Met Glu Glu Asp Ile Asp Phe Glu Tyr Pro Glu Ser Val Asn Ile 165 170 175 Arg Gly Tyr Lys Pro Thr Val Lys Gly His Ser Gly Gln Ile Lys Lys 180 185 190 Ile Ile Asp Arg Ile Lys Val Ser Lys Arg Pro Leu Ile Cys Ala Gly 195 200 205 Gly Gly Val Ile Leu Ala Asn Ala Gln Lys Glu Leu Glu Gln Phe Val 210 215 220 Lys Lys Ser His Ile Pro Val Val His Thr Leu Met Gly Lys Gly Cys 225 230 235 240 Ile Asn Glu Asn Ser Asp Tyr Tyr Val Gly Leu Ile Gly Thr His Gly 245 250 255 Phe Ala Tyr Ala Asn Lys Val Val Gln Asn Ala Asp Val Leu Ile Leu 260 265 270 Ile Gly Ala Arg Ala Ser Asp Arg Thr Val Ser Gly Val Lys Ser Phe 275 280 285 Ala Lys Asp Ala Asp Ile Ile His Ile Asp Ile Asp Pro Ala Glu Ile 290 295 300 Gly Lys Ile Leu Asn Thr Tyr Ile Pro Val Val Gly Asp Cys Gly Ser 305 310 315 320 Val Leu Ser Asp Leu Asn Lys Glu Ile Val Ala Pro Gln Thr Glu Lys 325 330 335 Trp Met Glu Glu Ile Lys Asn Trp Lys Lys Asp Leu Tyr Ile Glu Arg 340 345 350 Lys Pro Thr Asp Lys Val Asn Pro Lys Tyr Val Leu Lys Thr Val Ser 355 360 365 Asp Thr Leu Gly Glu Glu Val Ile Leu Thr Ala Asp Val Gly Gln Asn 370 375 380 Gln Leu Trp Cys Ala Arg Asn Phe Arg Met Thr Gly Asn Arg Lys Phe 385 390 395 400 Leu Thr Ser Gly Gly Leu Gly Thr Met Gly Tyr Ser Leu Pro Ala Ala 405 410 415 Ile Gly Ala Lys Ile Ala Cys Pro Asp Lys Gln Val Ile Ala Phe Ala 420 425 430 Gly Asp Gly Gly Phe Gln Met Ser Leu Phe Glu Leu Gly Thr Ile Ala 435 440 445 Glu Asn Asn Leu Asn Ile Ile Ile Val Leu Phe Asn Asn Ser Gly Leu 450 455 460 Gly Met Val Arg Glu Ile Gln Asp Asn Lys Tyr Ser Gly Glu Phe Gly 465 470 475 480 Val Asn Phe Arg Thr Asn Pro Asp Phe Val Lys Leu Ala Glu Ala Tyr 485 490 495 Gly Leu Lys Ala Lys Arg Val Glu Asn Asp Ser Glu Phe Asn Gly Val 500 505 510 Phe Arg Glu Ala Leu Asp Ser Ser Lys Ala Phe Leu Ile Glu Cys Ile 515 520 525 Val Asp Pro His Glu Arg Thr Phe 530 535 <210> SEQ ID NO 33 <211> LENGTH: 558 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvB, AGY74635.1 <400> SEQUENCE: 33 Met Lys Ile Lys Gly Ala Glu Val Leu Leu Lys Cys Met Met Glu Gln 1 5 10 15 Gly Val Asp Thr Val Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro Ile 20 25 30 Tyr Asp Ala Leu Tyr Ala Ala Lys Gly Lys Ile Thr His Ile Ser Thr 35 40 45 Ser His Glu Gln Gly Ala Ala His Ala Ala Asp Gly Tyr Ala Arg Ser 50 55 60 Thr Gly Lys Val Gly Val Val Ile Ala Thr Ser Gly Pro Gly Ala Thr 65 70 75 80 Asn Thr Val Thr Ala Ile Ala Thr Ala Tyr Met Asp Ser Val Pro Ile 85 90 95 Val Val Phe Thr Gly Gln Val Ala Arg Ser Leu Leu Gly Lys Asp Ser 100 105 110 Phe Gln Glu Val Asn Ile Lys Asp Ile Thr Ala Ser Ile Thr Lys Lys 115 120 125 Ser Cys Ile Val Glu Lys Val Glu Asp Leu Ala Asp Thr Val Arg Glu 130 135 140 Ala Phe Gln Ile Ala Val Ser Gly Arg Pro Gly Pro Val Val Val Asp 145 150 155 160 Ile Pro Lys Asp Val Gln Ser Ala Glu Val Glu Tyr Glu Pro Phe Arg 165 170 175 Ser Lys Leu Ser Glu Ile Lys Glu Lys Lys Tyr Phe Asn Leu Asn Glu 180 185 190 Tyr Gly Asp Ser Leu Asn Lys Ala Ile Asp Met Ile Asn Arg Ser Glu 195 200 205 Arg Pro Val Ile Tyr Ser Gly Gly Gly Thr Val Thr Ser Gly Ala Gln 210 215 220 Asn Glu Leu Met Glu Leu Val Glu Lys Ile Asp Ser Pro Ile Thr Cys 225 230 235 240 Ser Leu Met Gly Ile Gly Ala Phe Pro Gly Asn Asn Glu Tyr Tyr Met 245 250 255 Gly Met Val Gly Met His Gly Ser Arg Cys Ser Asn Tyr Ala Val Ser 260 265 270 Asn Cys Asp Leu Leu Ile Ala Ile Gly Ala Arg Phe Ser Asp Arg Val 275 280 285 Ile Ser Lys Val Ser Ala Phe Ala Pro Lys Ala Arg Ile Ile His Ile 290 295 300 Asp Ile Asp Pro Lys Glu Phe Gly Lys Asn Val Asp Ile Asp Val Ala 305 310 315 320 Ile Lys Gly Asp Val Lys Glu Val Leu Gln Lys Ile Asn Cys Lys Leu 325 330 335 Glu Lys Ala Asp His Arg Asp Trp Met Glu Lys Ile Lys Gln Trp Lys 340 345 350 Ser Glu Gln Cys Glu Pro Phe Lys Glu Cys Lys Leu Ser Pro Lys Phe 355 360 365 Ile Met Asp Thr Leu Tyr Asn Leu Thr Gly Gly Glu Cys Ile Ile Thr 370 375 380 Thr Glu Val Gly Gln Asn Gln Ile Trp Thr Ala Gln Tyr Phe Lys Phe 385 390 395 400 Leu Lys Pro Arg Thr Phe Val Ser Ser Gly Gly Leu Gly Thr Met Gly 405 410 415 Phe Gly Leu Gly Ala Ser Ile Gly Ala Ser Met Gly Asn Pro Gly Lys 420 425 430 Lys Val Ile Asn Val Ala Gly Asp Gly Ser Phe Lys Met Asn Ser Thr 435 440 445 Glu Leu Ala Thr Val Ala Lys Tyr Lys Leu Pro Ile Val Gln Leu Leu 450 455 460 Leu Asn Asn Arg Ala Leu Gly Met Val Tyr Gln Trp Gln Asp Met Phe 465 470 475 480 Tyr Gly Lys Arg Phe Ser Asn Thr Glu Leu Gly Pro Asp Val Asp Phe 485 490 495 Met Lys Leu Gly Glu Ala Tyr Gly Ile Lys Thr Phe Lys Ile Glu Asp 500 505 510 Asn Ser Gln Val Glu Lys Cys Leu Lys Glu Ala Leu Asp Leu Asn Glu 515 520 525 Pro Val Ile Ile Glu Cys Asp Ile Asp Arg Lys Glu Lys Val Phe Pro 530 535 540 Ile Val Pro Pro Gly Ala Ala Ile Ser Asp Leu Val Glu Glu 545 550 555 <210> SEQ ID NO 34 <211> LENGTH: 158 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvN, AGY74360.1 <400> SEQUENCE: 34 Met Ser Val Leu Val Glu Asn His Ser Gly Val Leu Ser Lys Val Ala 1 5 10 15 Gly Leu Phe Ser Arg Arg Gly Tyr Asn Ile His Ser Leu Thr Val Gly 20 25 30 Val Thr Gly Asp Pro Glu Ile Ser Arg Met Thr Ile Val Ser Ile Gly 35 40 45 Asp Asp Tyr Met Phe Glu Gln Ile Ser Lys Gln Leu Asn Lys Leu Ile 50 55 60 Glu Val Ile Lys Val Ile Glu Leu Asn Pro Asp Ala Ser Val Tyr Arg 65 70 75 80 Glu Leu Ser Leu Ile Lys Val Ser Ala Glu Ser Asn Asn Lys Leu Leu 85 90 95 Ile Met Glu Ser Val Asn Thr Phe Arg Gly Lys Ile Val Asp Met Asn 100 105 110 Glu Lys Ser Met Ile Ile Glu Ile Thr Gly Asn Glu Lys Lys Ile Ser 115 120 125 Ala Phe Ile Glu Leu Met Lys Pro Tyr Gly Ile Lys Glu Ile Ile Arg 130 135 140 Thr Gly Leu Thr Ala Leu Gln Arg Gly Ser Lys Leu Glu Asp 145 150 155 <210> SEQ ID NO 35 <211> LENGTH: 562 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvB, NP_418127.1 <400> SEQUENCE: 35 Met Ala Ser Ser Gly Thr Thr Ser Thr Arg Lys Arg Phe Thr Gly Ala 1 5 10 15 Glu Phe Ile Val His Phe Leu Glu Gln Gln Gly Ile Lys Ile Val Thr 20 25 30 Gly Ile Pro Gly Gly Ser Ile Leu Pro Val Tyr Asp Ala Leu Ser Gln 35 40 45 Ser Thr Gln Ile Arg His Ile Leu Ala Arg His Glu Gln Gly Ala Gly 50 55 60 Phe Ile Ala Gln Gly Met Ala Arg Thr Asp Gly Lys Pro Ala Val Cys 65 70 75 80 Met Ala Cys Ser Gly Pro Gly Ala Thr Asn Leu Val Thr Ala Ile Ala 85 90 95 Asp Ala Arg Leu Asp Ser Ile Pro Leu Ile Cys Ile Thr Gly Gln Val 100 105 110 Pro Ala Ser Met Ile Gly Thr Asp Ala Phe Gln Glu Val Asp Thr Tyr 115 120 125 Gly Ile Ser Ile Pro Ile Thr Lys His Asn Tyr Leu Val Arg His Ile 130 135 140 Glu Glu Leu Pro Gln Val Met Ser Asp Ala Phe Arg Ile Ala Gln Ser 145 150 155 160 Gly Arg Pro Gly Pro Val Trp Ile Asp Ile Pro Lys Asp Val Gln Thr 165 170 175 Ala Val Phe Glu Ile Glu Thr Gln Pro Ala Met Ala Glu Lys Ala Ala 180 185 190 Ala Pro Ala Phe Ser Glu Glu Ser Ile Arg Asp Ala Ala Ala Met Ile 195 200 205 Asn Ala Ala Lys Arg Pro Val Leu Tyr Leu Gly Gly Gly Val Ile Asn 210 215 220 Ala Pro Ala Arg Val Arg Glu Leu Ala Glu Lys Ala Gln Leu Pro Thr 225 230 235 240 Thr Met Thr Leu Met Ala Leu Gly Met Leu Pro Lys Ala His Pro Leu 245 250 255 Ser Leu Gly Met Leu Gly Met His Gly Val Arg Ser Thr Asn Tyr Ile 260 265 270 Leu Gln Glu Ala Asp Leu Leu Ile Val Leu Gly Ala Arg Phe Asp Asp 275 280 285 Arg Ala Ile Gly Lys Thr Glu Gln Phe Cys Pro Asn Ala Lys Ile Ile 290 295 300 His Val Asp Ile Asp Arg Ala Glu Leu Gly Lys Ile Lys Gln Pro His 305 310 315 320 Val Ala Ile Gln Ala Asp Val Asp Asp Val Leu Ala Gln Leu Ile Pro 325 330 335 Leu Val Glu Ala Gln Pro Arg Ala Glu Trp His Gln Leu Val Ala Asp 340 345 350 Leu Gln Arg Glu Phe Pro Cys Pro Ile Pro Lys Ala Cys Asp Pro Leu 355 360 365 Ser His Tyr Gly Leu Ile Asn Ala Val Ala Ala Cys Val Asp Asp Asn 370 375 380 Ala Ile Ile Thr Thr Asp Val Gly Gln His Gln Met Trp Thr Ala Gln 385 390 395 400 Ala Tyr Pro Leu Asn Arg Pro Arg Gln Trp Leu Thr Ser Gly Gly Leu 405 410 415 Gly Thr Met Gly Phe Gly Leu Pro Ala Ala Ile Gly Ala Ala Leu Ala 420 425 430 Asn Pro Asp Arg Lys Val Leu Cys Phe Ser Gly Asp Gly Ser Leu Met 435 440 445 Met Asn Ile Gln Glu Met Ala Thr Ala Ser Glu Asn Gln Leu Asp Val 450 455 460 Lys Ile Ile Leu Met Asn Asn Glu Ala Leu Gly Leu Val His Gln Gln 465 470 475 480 Gln Ser Leu Phe Tyr Glu Gln Gly Val Phe Ala Ala Thr Tyr Pro Gly 485 490 495 Lys Ile Asn Phe Met Gln Ile Ala Ala Gly Phe Gly Leu Glu Thr Cys 500 505 510 Asp Leu Asn Asn Glu Ala Asp Pro Gln Ala Ser Leu Gln Glu Ile Ile 515 520 525 Asn Arg Pro Gly Pro Ala Leu Ile His Val Arg Ile Asp Ala Glu Glu 530 535 540 Lys Val Tyr Pro Met Val Pro Pro Gly Ala Ala Asn Thr Glu Met Val 545 550 555 560 Gly Glu <210> SEQ ID NO 36 <211> LENGTH: 96 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvN, NP_418126.1 <400> SEQUENCE: 36 Met Gln Asn Thr Thr His Asp Asn Val Ile Leu Glu Leu Thr Val Arg 1 5 10 15 Asn His Pro Gly Val Met Thr His Val Cys Gly Leu Phe Ala Arg Arg 20 25 30 Ala Phe Asn Val Glu Gly Ile Leu Cys Leu Pro Ile Gln Asp Ser Asp 35 40 45 Lys Ser His Ile Trp Leu Leu Val Asn Asp Asp Gln Arg Leu Glu Gln 50 55 60 Met Ile Ser Gln Ile Asp Lys Leu Glu Asp Val Val Lys Val Gln Arg 65 70 75 80 Asn Gln Ser Asp Pro Thr Met Phe Asn Lys Ile Ala Val Phe Phe Gln 85 90 95 <210> SEQ ID NO 37 <211> LENGTH: 337 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvC, WP_013238693.1 <400> SEQUENCE: 37 Met Glu Lys Leu Lys Val Tyr Tyr Asp Glu Asp Ala Asp Leu Asn Leu 1 5 10 15 Leu Lys Gly Lys Lys Ile Ala Ile Leu Gly Phe Gly Ser Gln Gly His 20 25 30 Ala His Ala Leu Asn Leu Lys Glu Ser Gly Leu Asp Val Ile Val Gly 35 40 45 Leu Tyr Lys Gly Ser Lys Ser Trp Lys Lys Ala Glu Asp Tyr Gly Phe 50 55 60 Lys Val Tyr Glu Ile Ala Glu Ala Val Lys Gln Ala Asp Ile Ile Thr 65 70 75 80 Val Leu Leu Pro Asp Glu Lys Gln Lys Gln Ile Tyr Asp Glu Ser Ile 85 90 95 Lys Asp Asn Leu Ser Glu Gly Asn Ala Leu Phe Phe Ala His Gly Phe 100 105 110 Asn Ile His Phe Asn Gln Ile Val Pro Pro Lys Phe Val Asp Val Leu 115 120 125 Met Ile Ala Pro Lys Gly Pro Gly His Ile Val Arg Arg Glu Tyr Thr 130 135 140 Leu Gly Asn Gly Val Pro Cys Leu Tyr Ala Val Tyr Gln Asp Tyr Ser 145 150 155 160 Gly Lys Gly Lys Glu Ile Ala Leu Ala Tyr Gly Lys Gly Ile Gly Gly 165 170 175 Thr Arg Ala Gly Val Met Thr Thr Thr Phe Lys Val Glu Thr Glu Thr 180 185 190 Asp Leu Phe Gly Glu Gln Val Val Leu Cys Gly Gly Val Ala Glu Leu 195 200 205 Ile Lys Ala Gly Phe Asp Thr Leu Val Glu Ala Gly Tyr Ala Pro Glu 210 215 220 Asn Ala Tyr Phe Glu Cys Leu His Glu Met Lys Leu Ile Val Asp Leu 225 230 235 240 Ile Tyr Glu Gly Gly Leu Ala Arg Met Arg Tyr Ser Val Ser Asp Thr 245 250 255 Ala Glu Tyr Gly Asp Tyr Lys Ile Gly Lys Arg Ile Ile Asn Asp Asn 260 265 270 Thr Arg Ala Glu Met Lys Lys Val Leu Thr Glu Ile Gln Asp Gly Thr 275 280 285 Phe Ala Arg Glu Trp Leu Leu Glu Asn Gln Thr Gly Arg Pro Gly Phe 290 295 300 Thr Ala Arg Arg Arg Met Glu Lys Asp Ala Pro Ile Glu Lys Val Gly 305 310 315 320 Lys Glu Leu Arg Ser Met Met Ser Trp Ile Asn Glu Asn Pro Asp Asn 325 330 335 Glu <210> SEQ ID NO 38 <211> LENGTH: 491 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvC, NP_418222.1 <400> SEQUENCE: 38 Met Ala Asn Tyr Phe Asn Thr Leu Asn Leu Arg Gln Gln Leu Ala Gln 1 5 10 15 Leu Gly Lys Cys Arg Phe Met Gly Arg Asp Glu Phe Ala Asp Gly Ala 20 25 30 Ser Tyr Leu Gln Gly Lys Lys Val Val Ile Val Gly Cys Gly Ala Gln 35 40 45 Gly Leu Asn Gln Gly Leu Asn Met Arg Asp Ser Gly Leu Asp Ile Ser 50 55 60 Tyr Ala Leu Arg Lys Glu Ala Ile Ala Glu Lys Arg Ala Ser Trp Arg 65 70 75 80 Lys Ala Thr Glu Asn Gly Phe Lys Val Gly Thr Tyr Glu Glu Leu Ile 85 90 95 Pro Gln Ala Asp Leu Val Ile Asn Leu Thr Pro Asp Lys Gln His Ser 100 105 110 Asp Val Val Arg Thr Val Gln Pro Leu Met Lys Asp Gly Ala Ala Leu 115 120 125 Gly Tyr Ser His Gly Phe Asn Ile Val Glu Val Gly Glu Gln Ile Arg 130 135 140 Lys Asp Ile Thr Val Val Met Val Ala Pro Lys Cys Pro Gly Thr Glu 145 150 155 160 Val Arg Glu Glu Tyr Lys Arg Gly Phe Gly Val Pro Thr Leu Ile Ala 165 170 175 Val His Pro Glu Asn Asp Pro Lys Gly Glu Gly Met Ala Ile Ala Lys 180 185 190 Ala Trp Ala Ala Ala Thr Gly Gly His Arg Ala Gly Val Leu Glu Ser 195 200 205 Ser Phe Val Ala Glu Val Lys Ser Asp Leu Met Gly Glu Gln Thr Ile 210 215 220 Leu Cys Gly Met Leu Gln Ala Gly Ser Leu Leu Cys Phe Asp Lys Leu 225 230 235 240 Val Glu Glu Gly Thr Asp Pro Ala Tyr Ala Glu Lys Leu Ile Gln Phe 245 250 255 Gly Trp Glu Thr Ile Thr Glu Ala Leu Lys Gln Gly Gly Ile Thr Leu 260 265 270 Met Met Asp Arg Leu Ser Asn Pro Ala Lys Leu Arg Ala Tyr Ala Leu 275 280 285 Ser Glu Gln Leu Lys Glu Ile Met Ala Pro Leu Phe Gln Lys His Met 290 295 300 Asp Asp Ile Ile Ser Gly Glu Phe Ser Ser Gly Met Met Ala Asp Trp 305 310 315 320 Ala Asn Asp Asp Lys Lys Leu Leu Thr Trp Arg Glu Glu Thr Gly Lys 325 330 335 Thr Ala Phe Glu Thr Ala Pro Gln Tyr Glu Gly Lys Ile Gly Glu Gln 340 345 350 Glu Tyr Phe Asp Lys Gly Val Leu Met Ile Ala Met Val Lys Ala Gly 355 360 365 Val Glu Leu Ala Phe Glu Thr Met Val Asp Ser Gly Ile Ile Glu Glu 370 375 380 Ser Ala Tyr Tyr Glu Ser Leu His Glu Leu Pro Leu Ile Ala Asn Thr 385 390 395 400 Ile Ala Arg Lys Arg Leu Tyr Glu Met Asn Val Val Ile Ser Asp Thr 405 410 415 Ala Glu Tyr Gly Asn Tyr Leu Phe Ser Tyr Ala Cys Val Pro Leu Leu 420 425 430 Lys Pro Phe Met Ala Glu Leu Gln Pro Gly Asp Leu Gly Lys Ala Ile 435 440 445 Pro Glu Gly Ala Val Asp Asn Gly Gln Leu Arg Asp Val Asn Glu Ala 450 455 460 Ile Arg Ser His Ala Ile Glu Gln Val Gly Lys Lys Leu Arg Gly Tyr 465 470 475 480 Met Thr Asp Met Lys Arg Ile Ala Val Ala Gly 485 490 <210> SEQ ID NO 39 <211> LENGTH: 552 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvD, WP_013238694.1 <400> SEQUENCE: 39 Met Lys Ser Asp Ser Val Lys Lys Gly Ile Lys Ala Ala Pro Ala Arg 1 5 10 15 Ala Leu Met Tyr Gly Met Gly Tyr Thr Lys Glu Glu Ile Glu Arg Pro 20 25 30 Leu Ile Gly Ile Val Asn Ser Gln Asn Glu Ile Val Ala Gly His Met 35 40 45 His Leu Asp Glu Ile Ala Lys Ala Ala Lys Leu Gly Val Ala Met Ser 50 55 60 Gly Gly Thr Pro Ile Glu Phe Pro Ala Ile Ala Val Cys Asp Gly Ile 65 70 75 80 Ala Met Gly His Val Gly Met Lys Tyr Ser Leu Ala Ser Arg Glu Leu 85 90 95 Ile Ala Asp Ser Ile Glu Ala Met Ala Thr Ala His Gly Phe Asp Gly 100 105 110 Leu Val Leu Ile Pro Asn Cys Asp Lys Ile Val Pro Gly Met Leu Met 115 120 125 Ala Ala Ala Arg Leu Asn Ile Pro Ala Val Val Val Ser Gly Gly Pro 130 135 140 Met Arg Ala Gly Lys Leu Asn Asn Lys Ala Leu Asp Phe Ser Thr Cys 145 150 155 160 Ile Glu Lys Val Ala Ala Cys Ser Asp Gly Lys Val Thr Glu Glu Glu 165 170 175 Leu Glu Glu Glu Ala Lys Arg Ala Cys Pro Gly Cys Gly Ser Cys Ser 180 185 190 Gly Leu Phe Thr Ala Asn Ser Met Asn Ser Leu Thr Glu Val Leu Gly 195 200 205 Met Gly Leu Pro Leu Asn Gly Ser Ala Leu Ala Gln Thr Gly Glu Arg 210 215 220 Asn Gln Leu Ala Lys Tyr Ala Gly Met Tyr Val Met Asp Cys Val Lys 225 230 235 240 Asn Asp Arg Arg Pro Arg Asp Ile Leu Thr Leu Asp Ala Phe Lys Asn 245 250 255 Ala Ile Thr Val Asp Met Ala Met Ala Gly Ser Thr Asn Thr Val Leu 260 265 270 His Leu Pro Ala Ile Ala His Glu Ala Gly Ile Glu Leu Asn Leu Asp 275 280 285 Leu Phe His Glu Ile Ser Lys His Thr Pro Cys Leu Thr Lys Leu Ser 290 295 300 Pro Ser Gly Lys His His Met Glu Asp Leu His Leu Ala Gly Gly Ile 305 310 315 320 Pro Ala Leu Met Asn Glu Leu Ser Lys Lys Gly Leu Ile Asn Glu Asp 325 330 335 Ala Leu Thr Val Thr Gly Lys Thr Val Gly Glu Thr Ile Lys Asp Phe 340 345 350 Lys Val Leu Asp Tyr Glu Val Ile Arg Ser Val Asp Asn Ala Tyr Ser 355 360 365 Ser Glu Gly Gly Ile Ala Ile Leu Arg Gly Asn Leu Ala Pro Asp Gly 370 375 380 Ala Val Val Lys Glu Ser Ala Val Ser Lys Glu Met Met Val His Glu 385 390 395 400 Gly Pro Ala Arg Val Tyr Asn Ser Glu Glu Ala Ala Val Lys Ala Ile 405 410 415 Phe Gly Asn Glu Ile Asn Lys Gly Asp Val Ile Val Ile Arg Tyr Glu 420 425 430 Gly Pro Lys Gly Gly Pro Gly Met Arg Glu Met Leu Ser Pro Thr Ser 435 440 445 Ala Ile Ala Gly Met Gly Leu Asp Lys Asp Val Ala Leu Leu Thr Asp 450 455 460 Gly Arg Phe Ser Gly Ala Thr Arg Gly Ala Ser Ile Gly His Val Ser 465 470 475 480 Pro Glu Ala Met Glu Gly Gly Leu Ile Gly Leu Val Glu Glu Gly Asp 485 490 495 Thr Ile Phe Val Asp Ile Thr Asn Lys Lys Leu Glu Leu Lys Val Ser 500 505 510 Glu Glu Glu Leu Glu Lys Arg Arg Lys Asn Tyr Val Lys Pro Glu Pro 515 520 525 Lys Ile Lys Thr Gly Tyr Leu Ser Arg Tyr Ala Lys Leu Val Thr Ser 530 535 540 Ala Asn Thr Gly Ala Val Leu Lys 545 550 <210> SEQ ID NO 40 <211> LENGTH: 616 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvD, YP_026248.1 <400> SEQUENCE: 40 Met Pro Lys Tyr Arg Ser Ala Thr Thr Thr His Gly Arg Asn Met Ala 1 5 10 15 Gly Ala Arg Ala Leu Trp Arg Ala Thr Gly Met Thr Asp Ala Asp Phe 20 25 30 Gly Lys Pro Ile Ile Ala Val Val Asn Ser Phe Thr Gln Phe Val Pro 35 40 45 Gly His Val His Leu Arg Asp Leu Gly Lys Leu Val Ala Glu Gln Ile 50 55 60 Glu Ala Ala Gly Gly Val Ala Lys Glu Phe Asn Thr Ile Ala Val Asp 65 70 75 80 Asp Gly Ile Ala Met Gly His Gly Gly Met Leu Tyr Ser Leu Pro Ser 85 90 95 Arg Glu Leu Ile Ala Asp Ser Val Glu Tyr Met Val Asn Ala His Cys 100 105 110 Ala Asp Ala Met Val Cys Ile Ser Asn Cys Asp Lys Ile Thr Pro Gly 115 120 125 Met Leu Met Ala Ser Leu Arg Leu Asn Ile Pro Val Ile Phe Val Ser 130 135 140 Gly Gly Pro Met Glu Ala Gly Lys Thr Lys Leu Ser Asp Gln Ile Ile 145 150 155 160 Lys Leu Asp Leu Val Asp Ala Met Ile Gln Gly Ala Asp Pro Lys Val 165 170 175 Ser Asp Ser Gln Ser Asp Gln Val Glu Arg Ser Ala Cys Pro Thr Cys 180 185 190 Gly Ser Cys Ser Gly Met Phe Thr Ala Asn Ser Met Asn Cys Leu Thr 195 200 205 Glu Ala Leu Gly Leu Ser Gln Pro Gly Asn Gly Ser Leu Leu Ala Thr 210 215 220 His Ala Asp Arg Lys Gln Leu Phe Leu Asn Ala Gly Lys Arg Ile Val 225 230 235 240 Glu Leu Thr Lys Arg Tyr Tyr Glu Gln Asn Asp Glu Ser Ala Leu Pro 245 250 255 Arg Asn Ile Ala Ser Lys Ala Ala Phe Glu Asn Ala Met Thr Leu Asp 260 265 270 Ile Ala Met Gly Gly Ser Thr Asn Thr Val Leu His Leu Leu Ala Ala 275 280 285 Ala Gln Glu Ala Glu Ile Asp Phe Thr Met Ser Asp Ile Asp Lys Leu 290 295 300 Ser Arg Lys Val Pro Gln Leu Cys Lys Val Ala Pro Ser Thr Gln Lys 305 310 315 320 Tyr His Met Glu Asp Val His Arg Ala Gly Gly Val Ile Gly Ile Leu 325 330 335 Gly Glu Leu Asp Arg Ala Gly Leu Leu Asn Arg Asp Val Lys Asn Val 340 345 350 Leu Gly Leu Thr Leu Pro Gln Thr Leu Glu Gln Tyr Asp Val Met Leu 355 360 365 Thr Gln Asp Asp Ala Val Lys Asn Met Phe Arg Ala Gly Pro Ala Gly 370 375 380 Ile Arg Thr Thr Gln Ala Phe Ser Gln Asp Cys Arg Trp Asp Thr Leu 385 390 395 400 Asp Asp Asp Arg Ala Asn Gly Cys Ile Arg Ser Leu Glu His Ala Tyr 405 410 415 Ser Lys Asp Gly Gly Leu Ala Val Leu Tyr Gly Asn Phe Ala Glu Asn 420 425 430 Gly Cys Ile Val Lys Thr Ala Gly Val Asp Asp Ser Ile Leu Lys Phe 435 440 445 Thr Gly Pro Ala Lys Val Tyr Glu Ser Gln Asp Asp Ala Val Glu Ala 450 455 460 Ile Leu Gly Gly Lys Val Val Ala Gly Asp Val Val Val Ile Arg Tyr 465 470 475 480 Glu Gly Pro Lys Gly Gly Pro Gly Met Gln Glu Met Leu Tyr Pro Thr 485 490 495 Ser Phe Leu Lys Ser Met Gly Leu Gly Lys Ala Cys Ala Leu Ile Thr 500 505 510 Asp Gly Arg Phe Ser Gly Gly Thr Ser Gly Leu Ser Ile Gly His Val 515 520 525 Ser Pro Glu Ala Ala Ser Gly Gly Ser Ile Gly Leu Ile Glu Asp Gly 530 535 540 Asp Leu Ile Ala Ile Asp Ile Pro Asn Arg Gly Ile Gln Leu Gln Val 545 550 555 560 Ser Asp Ala Glu Leu Ala Ala Arg Arg Glu Ala Gln Asp Ala Arg Gly 565 570 575 Asp Lys Ala Trp Thr Pro Lys Asn Arg Glu Arg Gln Val Ser Phe Ala 580 585 590 Leu Arg Ala Tyr Ala Ser Leu Ala Thr Ser Ala Asp Lys Gly Ala Val 595 600 605 Arg Asp Lys Ser Lys Leu Gly Gly 610 615 <210> SEQ ID NO 41 <211> LENGTH: 477 <212> TYPE: PRT <213> ORGANISM: Methanothermobacter thermautotrophicus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorA, WP_010876344.1 <400> SEQUENCE: 41 Met Thr Lys Lys Val Ile Arg Lys Pro Asp Ser Leu His Asp Val Phe 1 5 10 15 Glu Arg Lys Gly Gly Ser Ala Pro Thr Ala Thr His Tyr Cys Ala Gly 20 25 30 Cys Gly His Gly Ile Leu His Lys Leu Ile Gly Glu Ala Met Asp Glu 35 40 45 Leu Gly Ile Gln Glu Arg Ala Val Met Ile Ser Pro Val Gly Cys Ala 50 55 60 Val Phe Ala Tyr Tyr Tyr Phe Asp Cys Gly Asn Val Gln Val Ala His 65 70 75 80 Gly Arg Ala Pro Ala Val Gly Thr Gly Ile Ser Arg Ala Glu Asp Asp 85 90 95 Ala Val Val Ile Leu Tyr Gln Gly Asp Gly Asp Leu Ala Ser Ile Gly 100 105 110 Leu Asn Glu Thr Ile Gln Ala Ala Asn Arg Gly Glu Lys Leu Ala Val 115 120 125 Phe Phe Val Asn Asn Thr Val Tyr Gly Met Thr Gly Gly Gln Met Ala 130 135 140 Pro Thr Thr Leu Val Gly Glu Val Thr Val Thr Cys Pro Thr Gly Arg 145 150 155 160 Asp Pro Arg Tyr Ala Gly Tyr Pro Leu His Met Cys Glu Leu Leu Asp 165 170 175 Asn Leu Gln Ala Pro Val Phe Ile Glu Arg Val Ser Leu Ala Asp Pro 180 185 190 Lys Arg Ile Arg Arg Ala Arg Arg Ala Ile Lys Arg Ala Leu Glu Ile 195 200 205 Gln Arg Asp Gly Lys Gly Tyr Ala Phe Val Glu Val Leu Ser Pro Cys 210 215 220 Pro Thr Asn Leu Arg Gln Asp Ala Glu Gly Ala Glu Arg Phe Leu Lys 225 230 235 240 Glu Glu Met Glu Lys Glu Phe Pro Val Lys Asn Phe Arg Asp Arg Ser 245 250 255 Ala Glu Thr Glu Pro Leu Ile Arg Ser Glu Ser Asp Phe Ser Arg Glu 260 265 270 Ser Leu Asp Arg Ile Phe Gln Ile Arg Glu Asp Ser Val Pro Asp Pro 275 280 285 Val Asp Asp Pro Glu Phe Pro Glu Val Arg Val Lys Ile Ala Gly Phe 290 295 300 Gly Gly Gln Gly Val Leu Ser Met Gly Leu Thr Leu Ala Gln Ala Ala 305 310 315 320 Cys Ser Glu Gly Arg His Thr Ser Trp Tyr Pro Ala Tyr Gly Pro Glu 325 330 335 Gln Arg Gly Gly Thr Ser Ser Cys Gly Val Val Ile Ser Gly Glu Arg 340 345 350 Val Gly Ser Pro Ala Val Asp Thr Pro Asp Val Leu Val Ala Leu Asn 355 360 365 Gln Pro Ser Leu Asp Glu Phe Ala Asp Asp Val Ala Asp Gly Gly Ile 370 375 380 Ile Leu Tyr Asp Ser Thr Thr Ala Ser Phe Ser Gly Gly Ala Val Arg 385 390 395 400 Ala Met Gly Val Pro Ala Leu Glu Ile Ala Arg Lys His Gly Thr Ala 405 410 415 Arg Ala Ala Asn Thr Val Met Leu Gly Val Met Met Ala Leu Gly Leu 420 425 430 Thr Gly Leu Asp Glu Glu Ser Phe Arg Glu Ala Ile Lys Phe Thr Phe 435 440 445 Ala Gly Lys Glu Lys Ile Ile Asp Met Asn Leu Arg Ile Leu Glu Ala 450 455 460 Gly Ala Glu Trp Ala Arg Glu Asn Ile Glu Gly Glu Leu 465 470 475 <210> SEQ ID NO 42 <211> LENGTH: 352 <212> TYPE: PRT <213> ORGANISM: Methanothermobacter thermautotrophicus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorB, WP_010876343.1 <400> SEQUENCE: 42 Met Ala Thr Gln Met Val Lys Gly Asn Thr Ala Val Ile Ile Gly Ala 1 5 10 15 Met Tyr Ala Gly Cys Asp Cys Tyr Phe Gly Tyr Pro Ile Thr Pro Ala 20 25 30 Ser Glu Ile Leu His Glu Ala Ser Arg Tyr Phe Pro Met Val Gly Arg 35 40 45 Lys Phe Val Gln Ala Glu Ser Glu Glu Ala Ala Ile Asn Met Val Tyr 50 55 60 Gly Ala Ala Ala Ala Gly His Arg Val Met Thr Ala Ser Ser Gly Pro 65 70 75 80 Gly Ile Ser Leu Lys Gln Glu Gly Ile Ser Phe Leu Ala Gly Ala Glu 85 90 95 Leu Pro Ala Val Ile Val Asp Val Met Arg Ala Gly Pro Gly Leu Gly 100 105 110 Asn Ile Gly Pro Glu Gln Gly Asp Tyr Asn Gln Ile Val Lys Gly Gly 115 120 125 Gly His Gly Asn Tyr Arg Asn Met Val Leu Ala Pro Ser Ser Val Gln 130 135 140 Glu Met Cys Asp Leu Thr Met Glu Ala Phe Glu Leu Ala Asp Lys Tyr 145 150 155 160 Arg Asn Pro Val Val Val Leu Thr Asp Ala Val Leu Gly Gln Met Ala 165 170 175 Glu Pro Leu Arg Phe Pro Glu Glu Ala Val Glu His Arg Pro Asp Thr 180 185 190 Ser Trp Ala Val Cys Gly Asn Arg Glu Thr Met Lys Asn Leu Val Thr 195 200 205 Ser Ile Phe Leu Asp Phe Asp Glu Leu Glu Glu Phe Asn Phe Tyr Leu 210 215 220 Gln Glu Lys Tyr Ala Arg Ile Glu Glu Asn Glu Val Arg Tyr Glu Glu 225 230 235 240 Tyr Leu Val Asp Asp Ala Glu Ile Val Met Val Ala Tyr Gly Ile Ser 245 250 255 Ser Arg Val Ala Arg Ser Ala Val Glu Thr Ala Arg Ala Glu Gly Ile 260 265 270 Asn Val Gly Leu Leu Arg Pro Ile Thr Leu Phe Pro Phe Pro Ser Asp 275 280 285 Arg Ile Arg Glu Leu Ala Asp Gly Gly Cys Arg Phe Ile Ser Val Glu 290 295 300 Met Ser Ser Gly Gln Met Arg Glu Asp Ile Arg Met Ala Ser Gly Cys 305 310 315 320 Arg Asp Val Glu Leu Val Asn Arg Met Gly Gly Asn Leu Ile Glu Leu 325 330 335 Arg Asp Val Leu Glu Lys Ile Arg Glu Val Ala Gly Asp Ser Ser Asp 340 345 350 <210> SEQ ID NO 43 <211> LENGTH: 79 <212> TYPE: PRT <213> ORGANISM: Methanothermobacter thermautotrophicus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorC, WP_010876342.1 <400> SEQUENCE: 43 Met Lys Lys Ala Tyr Pro Val Ile Asn Ser Val Glu Cys Lys Ala Cys 1 5 10 15 Glu Arg Cys Ile Ile Ala Cys Pro Arg Lys Val Leu Gln Met Ser Ser 20 25 30 Lys Ile Asn Glu Arg Gly Tyr His Tyr Val Glu Tyr Arg Gly Glu Gly 35 40 45 Cys Asn Gly Cys Gly Asn Cys Tyr Tyr Thr Cys Pro Glu Ile Asn Ala 50 55 60 Ile Glu Val His Ile Glu Arg Cys Glu Asp Gly Asn Thr Asp Gly 65 70 75 <210> SEQ ID NO 44 <211> LENGTH: 124 <212> TYPE: PRT <213> ORGANISM: Methanothermobacter thermautotrophicus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorD, WP_010876341.1 <400> SEQUENCE: 44 Met Asp Glu Asp Gly Tyr Met Trp Phe Val Gly Arg Thr Asp Asp Ile 1 5 10 15 Ile Lys Ser Ser Gly Tyr Arg Ile Gly Pro Phe Glu Val Glu Ser Ala 20 25 30 Ile Ile Ser His Pro Ser Val Leu Glu Cys Ala Val Thr Gly Tyr Pro 35 40 45 Asp Pro Ile Arg Gly Gln Val Val Lys Ala Thr Ile Val Leu Ala Arg 50 55 60 Gly Tyr Glu Pro Ser Glu Glu Leu Lys Lys Glu Ile Gln Asp His Val 65 70 75 80 Lys Arg Val Thr Ala Pro Tyr Lys Tyr Pro Arg Ile Val Glu Phe Val 85 90 95 Asp Glu Leu Pro Lys Thr Ile Ser Gly Lys Ile Arg Arg Val Glu Ile 100 105 110 Arg Glu His Asp Leu Glu Gly Asp Gly Glu Asn Pro 115 120 <210> SEQ ID NO 45 <211> LENGTH: 394 <212> TYPE: PRT <213> ORGANISM: Pyrococcus furiosus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorA, WP_011012106.1 <400> SEQUENCE: 45 Met Glu Tyr Lys Pro Ile Arg Lys Val Val Ser Gly Asn Tyr Ala Ala 1 5 10 15 Ala Tyr Ala Ala Leu His Ala Arg Val Gln Val Val Ala Ala Tyr Pro 20 25 30 Ile Thr Pro Gln Thr Ser Ile Ile Glu Lys Ile Ala Glu Phe Ile Ala 35 40 45 Asn Gly Glu Ala Asp Ile Gln Tyr Ile Pro Val Glu Ser Glu His Ser 50 55 60 Ala Met Ala Ala Cys Ile Gly Ala Ser Ala Thr Gly Ala Arg Thr Phe 65 70 75 80 Thr Ala Thr Ser Ala Gln Gly Leu Ala Leu Met His Glu Met Leu His 85 90 95 Trp Ala Ala Gly Ala Arg Leu Pro Ile Val Met Val Asp Val Asn Arg 100 105 110 Ala Met Ala Pro Pro Trp Ser Val Trp Asp Asp Gln Thr Asp Ser Leu 115 120 125 Ser Gln Arg Asp Thr Gly Trp Met Gln Phe Tyr Ala Glu Asn Asn Gln 130 135 140 Glu Val Tyr Asp Gly Val Leu Met Ala Tyr Lys Val Ala Glu Thr Val 145 150 155 160 Asn Val Pro Ala Met Val Val Glu Ser Ala Phe Ile Leu Ser His Thr 165 170 175 Tyr Asp Val Val Glu Met Ile Pro Gln Glu Leu Val Asp Glu Phe Leu 180 185 190 Pro Pro Arg Lys Pro Leu Tyr Ser Leu Ala Asn Phe Asp Glu Pro Ile 195 200 205 Ala Val Gly Ala Leu Ala Thr Pro Asn Asp Tyr Tyr Glu Phe Arg Tyr 210 215 220 Lys Leu Ala Lys Ala His Glu Glu Ala Lys Lys Val Ile Lys Glu Val 225 230 235 240 Gly Lys Glu Phe Gly Glu Arg Phe Gly Arg Asp Tyr Ser Gln Met Ile 245 250 255 Glu Thr Gly Tyr Ile Asp Asp Ala Asp Phe Val Phe Met Gly Met Gly 260 265 270 Ser Leu Met Gly Thr Val Lys Glu Ala Val Asp Leu Leu Arg Lys Glu 275 280 285 Gly Tyr Lys Val Gly Tyr Ala Lys Val Arg Trp Phe Arg Pro Phe Pro 290 295 300 Lys Glu Glu Leu Val Glu Ile Ala Glu Ser Val Lys Gly Ile Ala Val 305 310 315 320 Leu Asp Arg Asn Phe Ser Phe Gly Gln Glu Gly Ile Leu Phe Thr Glu 325 330 335 Ser Lys Gly Ala Leu Tyr Asn Ser Ser Ala His Pro Leu Met Lys Asn 340 345 350 Tyr Ile Val Gly Leu Gly Gly Arg Asp Val Thr Val Lys Asp Ile Lys 355 360 365 Ala Ile Ala Asp Asp Met Lys Lys Val Ile Glu Ser Gly Lys Val Asp 370 375 380 Lys Glu Val Val Trp Tyr His Leu Lys Arg 385 390 <210> SEQ ID NO 46 <211> LENGTH: 311 <212> TYPE: PRT <213> ORGANISM: Pyrococcus furiosus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorB, WP_011012105.1 <400> SEQUENCE: 46 Met Glu Val Pro Glu Asn Ile Lys Lys Arg Val Thr Ile Pro Phe Glu 1 5 10 15 Glu His Phe Tyr Ala Gly His Thr Ala Cys Gln Gly Cys Gly Ala Ser 20 25 30 Leu Gly Leu Arg Tyr Val Leu Lys Ala Tyr Gly Lys Lys Thr Ile Leu 35 40 45 Val Ile Pro Ala Cys Cys Ser Thr Ile Ile Ala Gly Pro Trp Pro Tyr 50 55 60 Ser Ala Ile Asp Ala Asn Leu Phe His Thr Ala Phe Glu Thr Thr Gly 65 70 75 80 Ala Val Ile Ser Gly Ile Glu Ala Ala Leu Lys Ala Met Gly Tyr Lys 85 90 95 Val Lys Gly Glu Asp Gly Ile Met Val Val Gly Trp Ala Gly Asp Gly 100 105 110 Gly Thr Ala Asp Ile Gly Leu Gln Ala Leu Ser Gly Phe Leu Glu Arg 115 120 125 Gly His Asp Ala Val Tyr Ile Met Tyr Asp Asn Glu Ala Tyr Met Asn 130 135 140 Thr Gly Ile Gln Arg Ser Ser Ser Thr Pro Tyr Gly Ala Trp Thr Thr 145 150 155 160 Asn Thr Pro Gly Gly Arg Arg His Phe Leu Glu Lys Arg His Lys Lys 165 170 175 Lys Val Ile Asp Ile Val Ile Ala His Arg Ile Pro Tyr Ala Ala Thr 180 185 190 Ala Ser Ile Ala Tyr Pro Glu Asp Phe Ile Arg Lys Leu Lys Lys Ala 195 200 205 Gln Lys Ile Ser Gly Pro Ser Phe Ile Gln Leu Phe Ala Pro Cys Pro 210 215 220 Thr Gly Trp Arg Ala Pro Thr Asp Lys Ser Ile Glu Ile Ala Arg Leu 225 230 235 240 Ala Val Gln Thr Ala Tyr Phe Pro Leu Phe Glu Tyr Glu Asn Gly Lys 245 250 255 Tyr Lys Ile Asn Met Pro Asn Pro Lys Lys Glu Pro Lys Pro Ile Glu 260 265 270 Glu Phe Leu Lys Leu Gln Gly Arg Phe Lys Tyr Met Thr Lys Glu Asp 275 280 285 Ile Glu Thr Leu Gln Lys Trp Val Leu Glu Glu Trp Glu Arg Leu Lys 290 295 300 Lys Leu Ala Glu Val Phe Gly 305 310 <210> SEQ ID NO 47 <211> LENGTH: 185 <212> TYPE: PRT <213> ORGANISM: Pyrococcus furiosus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorC, WP_011012108.1 <400> SEQUENCE: 47 Met Ile Glu Val Arg Phe His Gly Arg Gly Gly Gln Gly Ala Val Thr 1 5 10 15 Ala Ala Asn Ile Leu Ala Glu Ala Ala Phe Leu Glu Gly Lys Tyr Val 20 25 30 Gln Ala Phe Pro Phe Phe Gly Val Glu Arg Arg Gly Ala Pro Val Thr 35 40 45 Ala Phe Thr Arg Ile Asp Asn Lys Pro Ile Arg Ile Lys Thr Gln Ile 50 55 60 Tyr Glu Pro Asp Val Val Val Val Leu Asp Pro Ser Leu Leu Asp Ala 65 70 75 80 Val Asp Val Thr Ala Gly Leu Lys Asp Glu Gly Ile Val Ile Val Asn 85 90 95 Thr Glu Lys Ser Lys Glu Glu Val Leu Glu Lys Leu Lys Lys Lys Pro 100 105 110 Lys Lys Leu Ala Ile Val Asp Ala Thr Thr Ile Ala Leu Glu Ile Leu 115 120 125 Gly Leu Pro Ile Thr Asn Thr Ala Ile Leu Gly Ala Val Ala Lys Ala 130 135 140 Thr Gly Leu Val Lys Ile Glu Ser Ile Glu Glu Ala Ile Lys Asp Thr 145 150 155 160 Phe Ser Gly Glu Leu Gly Glu Lys Asn Ala Arg Ala Ala Arg Glu Ala 165 170 175 Tyr Glu Lys Thr Glu Val Phe Glu Leu 180 185 <210> SEQ ID NO 48 <211> LENGTH: 105 <212> TYPE: PRT <213> ORGANISM: Pyrococcus furiosus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorD, WP_011012107.1 <400> SEQUENCE: 48 Met Asn Thr Leu Phe Gly Lys Thr Lys Glu Glu Ala Lys Pro Ile Val 1 5 10 15 Leu Lys Ser Val Asp Glu Tyr Pro Glu Ala Pro Ile Ser Leu Gly Thr 20 25 30 Thr Leu Val Asn Pro Thr Gly Asp Trp Arg Thr Phe Lys Pro Val Val 35 40 45 Asn Glu Glu Lys Cys Val Lys Cys Tyr Ile Cys Trp Lys Tyr Cys Pro 50 55 60 Glu Pro Ala Ile Tyr Ile Lys Pro Asp Gly Tyr Val Ala Ile Asp Tyr 65 70 75 80 Asp Tyr Cys Lys Gly Cys Gly Ile Cys Ala Asn Glu Cys Pro Thr Lys 85 90 95 Ala Ile Thr Met Ile Lys Glu Glu Lys 100 105 <210> SEQ ID NO 49 <211> LENGTH: 386 <212> TYPE: PRT <213> ORGANISM: Streptomyces avermitilis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AcdH, AAD44196.1 or BAB69160.1 <400> SEQUENCE: 49 Met Asp His Arg Leu Thr Pro Glu Leu Glu Glu Leu Arg Arg Thr Val 1 5 10 15 Glu Glu Phe Ala His Asp Val Val Ala Pro Lys Ile Gly Asp Phe Tyr 20 25 30 Glu Arg His Glu Phe Pro Tyr Glu Ile Val Arg Glu Met Gly Arg Met 35 40 45 Gly Leu Phe Gly Leu Pro Phe Pro Glu Glu Tyr Gly Gly Met Gly Gly 50 55 60 Asp Tyr Leu Ala Leu Gly Ile Ala Leu Glu Glu Leu Ala Arg Val Asp 65 70 75 80 Ser Ser Val Ala Ile Thr Leu Glu Ala Gly Val Ser Leu Gly Ala Met 85 90 95 Pro Ile His Leu Phe Gly Thr Asp Ala Gln Lys Ala Glu Trp Leu Pro 100 105 110 Arg Leu Cys Ser Gly Glu Ile Leu Gly Ala Phe Gly Leu Thr Glu Pro 115 120 125 Asp Gly Gly Ser Asp Ala Gly Ala Thr Arg Thr Thr Ala Arg Leu Asp 130 135 140 Glu Ser Thr Asn Glu Trp Val Ile Asn Gly Thr Lys Cys Phe Ile Thr 145 150 155 160 Asn Ser Gly Thr Asp Ile Thr Gly Leu Val Thr Val Thr Ala Val Thr 165 170 175 Gly Arg Lys Pro Asp Gly Lys Pro Leu Ile Ser Ser Ile Ile Val Pro 180 185 190 Ser Gly Thr Pro Gly Phe Thr Val Ala Ala Pro Tyr Ser Lys Val Gly 195 200 205 Trp Asn Ala Ser Asp Thr Arg Glu Leu Ser Phe Ala Asp Val Arg Val 210 215 220 Pro Ala Ala Asn Leu Leu Gly Glu Gln Gly Arg Gly Tyr Ala Gln Phe 225 230 235 240 Leu Arg Ile Leu Asp Glu Gly Arg Ile Ala Ile Ser Ala Leu Ala Thr 245 250 255 Gly Leu Ala Gln Gly Cys Val Asp Glu Ser Val Lys Tyr Ala Gly Glu 260 265 270 Arg His Ala Phe Gly Arg Asn Ile Gly Ala Tyr Gln Ala Ile Gln Phe 275 280 285 Lys Ile Ala Asp Met Glu Met Lys Ala His Met Ala Arg Val Gly Trp 290 295 300 Arg Asp Ala Ala Ser Arg Leu Val Ala Gly Glu Pro Phe Lys Lys Glu 305 310 315 320 Ala Ala Ile Ala Lys Leu Tyr Ser Ser Thr Val Ala Val Asp Asn Ala 325 330 335 Arg Glu Ala Thr Gln Ile His Gly Gly Tyr Gly Phe Met Asn Glu Tyr 340 345 350 Pro Val Ala Arg Met Trp Arg Asp Ser Lys Ile Leu Glu Ile Gly Glu 355 360 365 Gly Thr Ser Glu Val Gln Arg Met Leu Ile Ala Arg Glu Leu Gly Leu 370 375 380 Val Gly 385 <210> SEQ ID NO 50 <211> LENGTH: 386 <212> TYPE: PRT <213> ORGANISM: Streptomyces coelicolor <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AcdH, AAD44195.1 <400> SEQUENCE: 50 Met Asp His Lys Leu Ser Pro Glu Leu Glu Glu Leu Arg Arg Thr Val 1 5 10 15 Glu Gln Phe Ala His Asp Val Val Ala Pro Lys Ile Gly Asp Phe Tyr 20 25 30 Glu Arg His Glu Phe Pro Tyr Glu Ile Val Arg Glu Met Gly Arg Met 35 40 45 Gly Leu Phe Gly Leu Pro Phe Pro Glu Glu Tyr Gly Gly Met Gly Gly 50 55 60 Asp Tyr Phe Ala Leu Gly Val Ala Leu Glu Glu Leu Ala Arg Val Asp 65 70 75 80 Ser Ser Val Ala Ile Thr Leu Glu Ala Gly Val Ser Leu Gly Ala Met 85 90 95 Pro Leu His Leu Phe Gly Thr Glu Glu Gln Lys Arg Glu Trp Leu Pro 100 105 110 Arg Leu Cys Ser Gly Glu Ile Leu Gly Ala Phe Gly Leu Thr Glu Pro 115 120 125 Asp Gly Gly Ser Asp Ala Gly Ala Thr Arg Thr Thr Ala Arg Leu Asp 130 135 140 Glu Ala Thr Asn Glu Trp Val Ile Asn Gly Thr Lys Cys Phe Ile Thr 145 150 155 160 Asn Ser Gly Thr Asp Ile Thr Gly Leu Val Thr Val Thr Ala Val Thr 165 170 175 Gly Arg Lys Pro Asp Gly Arg Pro Leu Ile Ser Ser Ile Ile Val Pro 180 185 190 Ser Gly Thr Pro Gly Phe Thr Val Ala Ala Pro Tyr Ser Lys Val Gly 195 200 205 Trp Asn Ala Ser Asp Thr Arg Glu Leu Ser Phe Ala Asp Val Arg Val 210 215 220 Pro Ala Ala Asn Leu Leu Gly Glu Leu Gly Arg Gly Tyr Ala Gln Phe 225 230 235 240 Leu Arg Ile Leu Asp Glu Gly Arg Val Ala Ile Ala Ala Leu Gly Thr 245 250 255 Gly Leu Ala Gln Gly Cys Val Asp Glu Ser Val Ala Tyr Ala Lys Glu 260 265 270 Arg His Ala Phe Gly Arg Pro Ile Gly Ala Asn Gln Ala Ile Gln Phe 275 280 285 Lys Ile Ala Asp Met Glu Met Lys Ala His Thr Ala Arg Leu Ala Trp 290 295 300 Arg Asp Ala Ala Ser Arg Leu Val Ala Gly Glu Pro Phe Lys Lys Glu 305 310 315 320 Ala Ala Leu Ala Lys Leu Tyr Ser Ser Thr Val Ala Val Asp Asn Ala 325 330 335 Arg Asp Ala Thr Gln Val His Gly Gly Tyr Gly Phe Met Asn Glu Tyr 340 345 350 Pro Val Ala Arg Met Trp Arg Asp Ala Lys Ile Leu Glu Ile Gly Glu 355 360 365 Gly Thr Ser Glu Val Gln Arg Met Leu Ile Ala Arg Glu Leu Gly Leu 370 375 380 Val Gly 385 <210> SEQ ID NO 51 <211> LENGTH: 261 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Crt, ABR34202.1 <400> SEQUENCE: 51 Met Glu Leu Lys Asn Val Ile Leu Glu Lys Glu Gly His Leu Ala Ile 1 5 10 15 Val Thr Ile Asn Arg Pro Lys Ala Leu Asn Ala Leu Asn Ser Glu Thr 20 25 30 Leu Lys Asp Leu Asp Ala Val Leu Glu Asp Leu Glu Lys Asp Ser Asn 35 40 45 Met Tyr Thr Val Ile Val Thr Gly Ala Gly Glu Lys Ser Phe Val Ala 50 55 60 Gly Ala Asp Ile Ser Glu Met Lys Asp Leu Asn Glu Glu Gln Gly Lys 65 70 75 80 Glu Phe Gly Ile Leu Gly Asn Asn Val Phe Arg Arg Leu Glu Arg Leu 85 90 95 Asp Lys Pro Val Ile Ala Ala Ile Ser Gly Phe Ala Leu Gly Gly Gly 100 105 110 Cys Glu Leu Ala Met Ser Cys Asp Ile Arg Ile Ala Ser Val Lys Ala 115 120 125 Lys Phe Gly Gln Pro Glu Ala Gly Leu Gly Ile Thr Pro Gly Phe Gly 130 135 140 Gly Thr Gln Arg Leu Ala Arg Ile Val Gly Pro Gly Lys Ala Lys Glu 145 150 155 160 Leu Ile Tyr Thr Cys Asp Leu Ile Asn Ala Glu Glu Ala Tyr Arg Ile 165 170 175 Gly Leu Val Asn Lys Val Val Glu Leu Glu Lys Leu Met Glu Glu Ala 180 185 190 Lys Ala Met Ala Asn Lys Ile Ala Ala Asn Ala Pro Lys Ala Val Ala 195 200 205 Tyr Cys Lys Asp Ala Ile Asp Arg Gly Met Gln Val Asp Ile Asp Ala 210 215 220 Ala Ile Leu Ile Glu Ala Glu Asp Phe Gly Lys Cys Phe Ala Thr Glu 225 230 235 240 Asp Gln Thr Glu Gly Met Thr Ala Phe Leu Glu Arg Arg Ala Glu Lys 245 250 255 Asn Phe Gln Asn Lys 260 <210> SEQ ID NO 52 <211> LENGTH: 261 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Crt, NP_349318.1 <400> SEQUENCE: 52 Met Glu Leu Asn Asn Val Ile Leu Glu Lys Glu Gly Lys Val Ala Val 1 5 10 15 Val Thr Ile Asn Arg Pro Lys Ala Leu Asn Ala Leu Asn Ser Asp Thr 20 25 30 Leu Lys Glu Met Asp Tyr Val Ile Gly Glu Ile Glu Asn Asp Ser Glu 35 40 45 Val Leu Ala Val Ile Leu Thr Gly Ala Gly Glu Lys Ser Phe Val Ala 50 55 60 Gly Ala Asp Ile Ser Glu Met Lys Glu Met Asn Thr Ile Glu Gly Arg 65 70 75 80 Lys Phe Gly Ile Leu Gly Asn Lys Val Phe Arg Arg Leu Glu Leu Leu 85 90 95 Glu Lys Pro Val Ile Ala Ala Val Asn Gly Phe Ala Leu Gly Gly Gly 100 105 110 Cys Glu Ile Ala Met Ser Cys Asp Ile Arg Ile Ala Ser Ser Asn Ala 115 120 125 Arg Phe Gly Gln Pro Glu Val Gly Leu Gly Ile Thr Pro Gly Phe Gly 130 135 140 Gly Thr Gln Arg Leu Ser Arg Leu Val Gly Met Gly Met Ala Lys Gln 145 150 155 160 Leu Ile Phe Thr Ala Gln Asn Ile Lys Ala Asp Glu Ala Leu Arg Ile 165 170 175 Gly Leu Val Asn Lys Val Val Glu Pro Ser Glu Leu Met Asn Thr Ala 180 185 190 Lys Glu Ile Ala Asn Lys Ile Val Ser Asn Ala Pro Val Ala Val Lys 195 200 205 Leu Ser Lys Gln Ala Ile Asn Arg Gly Met Gln Cys Asp Ile Asp Thr 210 215 220 Ala Leu Ala Phe Glu Ser Glu Ala Phe Gly Glu Cys Phe Ser Thr Glu 225 230 235 240 Asp Gln Lys Asp Ala Met Thr Ala Phe Ile Glu Lys Arg Lys Ile Glu 245 250 255 Gly Phe Lys Asn Arg 260 <210> SEQ ID NO 53 <211> LENGTH: 397 <212> TYPE: PRT <213> ORGANISM: Treponema denticola <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Ccr, NP_971211.1 <400> SEQUENCE: 53 Met Ile Val Lys Pro Met Val Arg Asn Asn Ile Cys Leu Asn Ala His 1 5 10 15 Pro Gln Gly Cys Lys Lys Gly Val Glu Asp Gln Ile Glu Tyr Thr Lys 20 25 30 Lys Arg Ile Thr Ala Glu Val Lys Ala Gly Ala Lys Ala Pro Lys Asn 35 40 45 Val Leu Val Leu Gly Cys Ser Asn Gly Tyr Gly Leu Ala Ser Arg Ile 50 55 60 Thr Ala Ala Phe Gly Tyr Gly Ala Ala Thr Ile Gly Val Ser Phe Glu 65 70 75 80 Lys Ala Gly Ser Glu Thr Lys Tyr Gly Thr Pro Gly Trp Tyr Asn Asn 85 90 95 Leu Ala Phe Asp Glu Ala Ala Lys Arg Glu Gly Leu Tyr Ser Val Thr 100 105 110 Ile Asp Gly Asp Ala Phe Ser Asp Glu Ile Lys Ala Gln Val Ile Glu 115 120 125 Glu Ala Lys Lys Lys Gly Ile Lys Phe Asp Leu Ile Val Tyr Ser Leu 130 135 140 Ala Ser Pro Val Arg Thr Asp Pro Asp Thr Gly Ile Met His Lys Ser 145 150 155 160 Val Leu Lys Pro Phe Gly Lys Thr Phe Thr Gly Lys Thr Val Asp Pro 165 170 175 Phe Thr Gly Glu Leu Lys Glu Ile Ser Ala Glu Pro Ala Asn Asp Glu 180 185 190 Glu Ala Ala Ala Thr Val Lys Val Met Gly Gly Glu Asp Trp Glu Arg 195 200 205 Trp Ile Lys Gln Leu Ser Lys Glu Gly Leu Leu Glu Glu Gly Cys Ile 210 215 220 Thr Leu Ala Tyr Ser Tyr Ile Gly Pro Glu Ala Thr Gln Ala Leu Tyr 225 230 235 240 Arg Lys Gly Thr Ile Gly Lys Ala Lys Glu His Leu Glu Ala Thr Ala 245 250 255 His Arg Leu Asn Lys Glu Asn Pro Ser Ile Arg Ala Phe Val Ser Val 260 265 270 Asn Lys Gly Leu Val Thr Arg Ala Ser Ala Val Ile Pro Val Ile Pro 275 280 285 Leu Tyr Leu Ala Ser Leu Phe Lys Val Met Lys Glu Lys Gly Asn His 290 295 300 Glu Gly Cys Ile Glu Gln Ile Thr Arg Leu Tyr Ala Glu Arg Leu Tyr 305 310 315 320 Arg Lys Asp Gly Thr Ile Pro Val Asp Glu Glu Asn Arg Ile Arg Ile 325 330 335 Asp Asp Trp Glu Leu Glu Glu Asp Val Gln Lys Ala Val Ser Ala Leu 340 345 350 Met Glu Lys Val Thr Gly Glu Asn Ala Glu Ser Leu Thr Asp Leu Ala 355 360 365 Gly Tyr Arg His Asp Phe Leu Ala Ser Asn Gly Phe Asp Val Glu Gly 370 375 380 Ile Asn Tyr Glu Ala Glu Val Glu Arg Phe Asp Arg Ile 385 390 395 <210> SEQ ID NO 54 <211> LENGTH: 539 <212> TYPE: PRT <213> ORGANISM: Euglena gracilis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Ter, AAW66853.1 <400> SEQUENCE: 54 Met Ser Cys Pro Ala Ser Pro Ser Ala Ala Val Val Ser Ala Gly Ala 1 5 10 15 Leu Cys Leu Cys Val Ala Thr Val Leu Leu Ala Thr Gly Ser Asn Pro 20 25 30 Thr Ala Leu Ser Thr Ala Ser Thr Arg Ser Pro Thr Ser Leu Val Arg 35 40 45 Gly Val Asp Arg Gly Leu Met Arg Pro Thr Thr Ala Ala Ala Leu Thr 50 55 60 Thr Met Arg Glu Val Pro Gln Met Ala Glu Gly Phe Ser Gly Glu Ala 65 70 75 80 Thr Ser Ala Trp Ala Ala Ala Gly Pro Gln Trp Ala Ala Pro Leu Val 85 90 95 Ala Ala Ala Ser Ser Ala Leu Ala Leu Trp Trp Trp Ala Ala Arg Arg 100 105 110 Ser Val Arg Arg Pro Leu Ala Ala Leu Ala Glu Leu Pro Thr Ala Val 115 120 125 Thr His Leu Ala Pro Pro Met Ala Met Phe Thr Thr Thr Ala Lys Val 130 135 140 Ile Gln Pro Lys Ile Arg Gly Phe Ile Cys Thr Thr Thr His Pro Ile 145 150 155 160 Gly Cys Glu Lys Arg Val Gln Glu Glu Ile Ala Tyr Ala Arg Ala His 165 170 175 Pro Pro Thr Ser Pro Gly Pro Lys Arg Val Leu Val Ile Gly Cys Ser 180 185 190 Thr Gly Tyr Gly Leu Ser Thr Arg Ile Thr Ala Ala Phe Gly Tyr Gln 195 200 205 Ala Ala Thr Leu Gly Val Phe Leu Ala Gly Pro Pro Thr Lys Gly Arg 210 215 220 Pro Ala Ala Ala Gly Trp Tyr Asn Thr Val Ala Phe Glu Lys Ala Ala 225 230 235 240 Leu Glu Ala Gly Leu Tyr Ala Arg Ser Leu Asn Gly Asp Ala Phe Asp 245 250 255 Ser Thr Thr Lys Ala Arg Thr Val Glu Ala Ile Lys Arg Asp Leu Gly 260 265 270 Thr Val Asp Leu Val Val Tyr Ser Ile Ala Ala Pro Lys Arg Thr Asp 275 280 285 Pro Ala Thr Gly Val Leu His Lys Ala Cys Leu Lys Pro Ile Gly Ala 290 295 300 Thr Tyr Thr Asn Arg Thr Val Asn Thr Asp Lys Ala Glu Val Thr Asp 305 310 315 320 Val Ser Ile Glu Pro Ala Ser Pro Glu Glu Ile Ala Asp Thr Val Lys 325 330 335 Val Met Gly Gly Glu Asp Trp Glu Leu Trp Ile Gln Ala Leu Ser Glu 340 345 350 Ala Gly Val Leu Ala Glu Gly Ala Lys Thr Val Ala Tyr Ser Tyr Ile 355 360 365 Gly Pro Glu Met Thr Trp Pro Val Tyr Trp Ser Gly Thr Ile Gly Glu 370 375 380 Ala Lys Lys Asp Val Glu Lys Ala Ala Lys Arg Ile Thr Gln Gln Tyr 385 390 395 400 Gly Cys Pro Ala Tyr Pro Val Val Ala Lys Ala Leu Val Thr Gln Ala 405 410 415 Ser Ser Ala Ile Pro Val Val Pro Leu Tyr Ile Cys Leu Leu Tyr Arg 420 425 430 Val Met Lys Glu Lys Gly Thr His Glu Gly Cys Ile Glu Gln Met Val 435 440 445 Arg Leu Leu Thr Thr Lys Leu Tyr Pro Glu Asn Gly Ala Pro Ile Val 450 455 460 Asp Glu Ala Gly Arg Val Arg Val Asp Asp Trp Glu Met Ala Glu Asp 465 470 475 480 Val Gln Gln Ala Val Lys Asp Leu Trp Ser Gln Val Ser Thr Ala Asn 485 490 495 Leu Lys Asp Ile Ser Asp Phe Ala Gly Tyr Gln Thr Glu Phe Leu Arg 500 505 510 Leu Phe Gly Phe Gly Ile Asp Gly Val Asp Tyr Asp Gln Pro Val Asp 515 520 525 Val Glu Ala Asp Leu Pro Ser Ala Ala Gln Gln 530 535 <210> SEQ ID NO 55 <211> LENGTH: 282 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Hbd, WP_011967675.1 <400> SEQUENCE: 55 Met Lys Lys Ile Phe Val Leu Gly Ala Gly Thr Met Gly Ala Gly Ile 1 5 10 15 Val Gln Ala Phe Ala Gln Lys Gly Cys Glu Val Ile Val Arg Asp Ile 20 25 30 Lys Glu Glu Phe Val Asp Arg Gly Ile Ala Gly Ile Thr Lys Gly Leu 35 40 45 Glu Lys Gln Val Ala Lys Gly Lys Met Ser Glu Glu Asp Lys Glu Ala 50 55 60 Ile Leu Ser Arg Ile Ser Gly Thr Thr Asp Met Lys Leu Ala Ala Asp 65 70 75 80 Cys Asp Leu Val Val Glu Ala Ala Ile Glu Asn Met Lys Ile Lys Lys 85 90 95 Glu Ile Phe Ala Glu Leu Asp Gly Ile Cys Lys Pro Glu Ala Ile Leu 100 105 110 Ala Ser Asn Thr Ser Ser Leu Ser Ile Thr Glu Val Ala Ser Ala Thr 115 120 125 Lys Arg Pro Asp Lys Val Ile Gly Met His Phe Phe Asn Pro Ala Pro 130 135 140 Val Met Lys Leu Val Glu Ile Ile Lys Gly Ile Ala Thr Ser Gln Glu 145 150 155 160 Thr Phe Asp Ala Val Lys Glu Leu Ser Val Ala Ile Gly Lys Glu Pro 165 170 175 Val Glu Val Ala Glu Ala Pro Gly Phe Val Val Asn Arg Ile Leu Ile 180 185 190 Pro Met Ile Asn Glu Ala Ser Phe Ile Leu Gln Glu Gly Ile Ala Ser 195 200 205 Val Glu Asp Ile Asp Thr Ala Met Lys Tyr Gly Ala Asn His Pro Met 210 215 220 Gly Pro Leu Ala Leu Gly Asp Leu Ile Gly Leu Asp Val Cys Leu Ala 225 230 235 240 Ile Met Asp Val Leu Phe Thr Glu Thr Gly Asp Asn Lys Tyr Arg Ala 245 250 255 Ser Ser Ile Leu Arg Lys Tyr Val Arg Ala Gly Trp Leu Gly Arg Lys 260 265 270 Ser Gly Lys Gly Phe Tyr Asp Tyr Ser Lys 275 280 <210> SEQ ID NO 56 <211> LENGTH: 282 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Hbd, NP_349314.1 <400> SEQUENCE: 56 Met Lys Lys Val Cys Val Ile Gly Ala Gly Thr Met Gly Ser Gly Ile 1 5 10 15 Ala Gln Ala Phe Ala Ala Lys Gly Phe Glu Val Val Leu Arg Asp Ile 20 25 30 Lys Asp Glu Phe Val Asp Arg Gly Leu Asp Phe Ile Asn Lys Asn Leu 35 40 45 Ser Lys Leu Val Lys Lys Gly Lys Ile Glu Glu Ala Thr Lys Val Glu 50 55 60 Ile Leu Thr Arg Ile Ser Gly Thr Val Asp Leu Asn Met Ala Ala Asp 65 70 75 80 Cys Asp Leu Val Ile Glu Ala Ala Val Glu Arg Met Asp Ile Lys Lys 85 90 95 Gln Ile Phe Ala Asp Leu Asp Asn Ile Cys Lys Pro Glu Thr Ile Leu 100 105 110 Ala Ser Asn Thr Ser Ser Leu Ser Ile Thr Glu Val Ala Ser Ala Thr 115 120 125 Lys Arg Pro Asp Lys Val Ile Gly Met His Phe Phe Asn Pro Ala Pro 130 135 140 Val Met Lys Leu Val Glu Val Ile Arg Gly Ile Ala Thr Ser Gln Glu 145 150 155 160 Thr Phe Asp Ala Val Lys Glu Thr Ser Ile Ala Ile Gly Lys Asp Pro 165 170 175 Val Glu Val Ala Glu Ala Pro Gly Phe Val Val Asn Arg Ile Leu Ile 180 185 190 Pro Met Ile Asn Glu Ala Val Gly Ile Leu Ala Glu Gly Ile Ala Ser 195 200 205 Val Glu Asp Ile Asp Lys Ala Met Lys Leu Gly Ala Asn His Pro Met 210 215 220 Gly Pro Leu Glu Leu Gly Asp Phe Ile Gly Leu Asp Ile Cys Leu Ala 225 230 235 240 Ile Met Asp Val Leu Tyr Ser Glu Thr Gly Asp Ser Lys Tyr Arg Pro 245 250 255 His Thr Leu Leu Lys Lys Tyr Val Arg Ala Gly Trp Leu Gly Arg Lys 260 265 270 Ser Gly Lys Gly Phe Tyr Asp Tyr Ser Lys 275 280 <210> SEQ ID NO 57 <211> LENGTH: 282 <212> TYPE: PRT <213> ORGANISM: Clostridium kluyveri <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Hbd1, WP_011989027.1 <400> SEQUENCE: 57 Met Ser Ile Lys Ser Val Ala Val Leu Gly Ser Gly Thr Met Ser Arg 1 5 10 15 Gly Ile Val Gln Ala Phe Ala Glu Ala Gly Ile Asp Val Ile Ile Arg 20 25 30 Gly Arg Thr Glu Gly Ser Ile Gly Lys Gly Leu Ala Ala Val Lys Lys 35 40 45 Ala Tyr Asp Lys Lys Val Ser Lys Gly Lys Ile Ser Gln Glu Asp Ala 50 55 60 Asp Lys Ile Val Gly Arg Val Ser Thr Thr Thr Glu Leu Glu Lys Leu 65 70 75 80 Ala Asp Cys Asp Leu Ile Ile Glu Ala Ala Ser Glu Asp Met Asn Ile 85 90 95 Lys Lys Asp Tyr Phe Gly Lys Leu Glu Glu Ile Cys Lys Pro Glu Thr 100 105 110 Ile Phe Ala Thr Asn Thr Ser Ser Leu Ser Ile Thr Glu Val Ala Thr 115 120 125 Ala Thr Lys Arg Pro Asp Lys Phe Ile Gly Met His Phe Phe Asn Pro 130 135 140 Ala Asn Val Met Lys Leu Val Glu Ile Ile Arg Gly Met Asn Thr Ser 145 150 155 160 Gln Glu Thr Phe Asp Ile Ile Lys Glu Ala Ser Ile Lys Ile Gly Lys 165 170 175 Thr Pro Val Glu Val Ala Glu Ala Pro Gly Phe Val Val Asn Lys Ile 180 185 190 Leu Val Pro Met Ile Asn Glu Ala Val Gly Ile Leu Ala Glu Gly Ile 195 200 205 Ala Ser Ala Glu Asp Ile Asp Thr Ala Met Lys Leu Gly Ala Asn His 210 215 220 Pro Met Gly Pro Leu Ala Leu Gly Asp Leu Ile Gly Leu Asp Val Val 225 230 235 240 Leu Ala Val Met Asp Val Leu Tyr Ser Glu Thr Gly Asp Ser Lys Tyr 245 250 255 Arg Ala His Thr Leu Leu Arg Lys Tyr Val Arg Ala Gly Trp Leu Gly 260 265 270 Arg Lys Ser Gly Lys Gly Phe Phe Ala Tyr 275 280 <210> SEQ ID NO 58 <211> LENGTH: 246 <212> TYPE: PRT <213> ORGANISM: Cupriavidus necator <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: PhaB, WP_010810131.1 <400> SEQUENCE: 58 Met Thr Gln Arg Ile Ala Tyr Val Thr Gly Gly Met Gly Gly Ile Gly 1 5 10 15 Thr Ala Ile Cys Gln Arg Leu Ala Lys Asp Gly Phe Arg Val Val Ala 20 25 30 Gly Cys Gly Pro Asn Ser Pro Arg Arg Glu Lys Trp Leu Glu Gln Gln 35 40 45 Lys Ala Leu Gly Phe Asp Phe Ile Ala Ser Glu Gly Asn Val Ala Asp 50 55 60 Trp Asp Ser Thr Lys Thr Ala Phe Asp Lys Val Lys Ser Glu Val Gly 65 70 75 80 Glu Val Asp Val Leu Ile Asn Asn Ala Gly Ile Thr Arg Asp Val Val 85 90 95 Phe Arg Lys Met Thr Arg Ala Asp Trp Asp Ala Val Ile Asp Thr Asn 100 105 110 Leu Thr Ser Leu Phe Asn Val Thr Lys Gln Val Ile Asp Gly Met Ala 115 120 125 Asp Arg Gly Trp Gly Arg Ile Val Asn Ile Ser Ser Val Asn Gly Gln 130 135 140 Lys Gly Gln Phe Gly Gln Thr Asn Tyr Ser Thr Ala Lys Ala Gly Leu 145 150 155 160 His Gly Phe Thr Met Ala Leu Ala Gln Glu Val Ala Thr Lys Gly Val 165 170 175 Thr Val Asn Thr Val Ser Pro Gly Tyr Ile Ala Thr Asp Met Val Lys 180 185 190 Ala Ile Arg Gln Asp Val Leu Asp Lys Ile Val Ala Thr Ile Pro Val 195 200 205 Lys Arg Leu Gly Leu Pro Glu Glu Ile Ala Ser Ile Cys Ala Trp Leu 210 215 220 Ser Ser Glu Glu Ser Gly Phe Ser Thr Gly Ala Asp Phe Ser Leu Asn 225 230 235 240 Gly Gly Leu His Met Gly 245 <210> SEQ ID NO 59 <211> LENGTH: 134 <212> TYPE: PRT <213> ORGANISM: Aeromonas caviae <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: PhaJ, O32472 <400> SEQUENCE: 59 Met Ser Ala Gln Ser Leu Glu Val Gly Gln Lys Ala Arg Leu Ser Lys 1 5 10 15 Arg Phe Gly Ala Ala Glu Val Ala Ala Phe Ala Ala Leu Ser Glu Asp 20 25 30 Phe Asn Pro Leu His Leu Asp Pro Ala Phe Ala Ala Thr Thr Ala Phe 35 40 45 Glu Arg Pro Ile Val His Gly Met Leu Leu Ala Ser Leu Phe Ser Gly 50 55 60 Leu Leu Gly Gln Gln Leu Pro Gly Lys Gly Ser Ile Tyr Leu Gly Gln 65 70 75 80 Ser Leu Ser Phe Lys Leu Pro Val Phe Val Gly Asp Glu Val Thr Ala 85 90 95 Glu Val Glu Val Thr Ala Leu Arg Glu Asp Lys Pro Ile Ala Thr Leu 100 105 110 Thr Thr Arg Ile Phe Thr Gln Gly Gly Ala Leu Ala Val Thr Gly Glu 115 120 125 Ala Val Val Lys Leu Pro 130 <210> SEQ ID NO 60 <211> LENGTH: 260 <212> TYPE: PRT <213> ORGANISM: Ralstonia pickettii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh1, BAE72684.1 <400> SEQUENCE: 60 Met Gln Leu Lys Gly Lys Ser Ala Ile Val Thr Gly Ala Ala Ser Gly 1 5 10 15 Ile Gly Lys Ala Ile Ala Glu Leu Leu Ala Lys Glu Gly Ala Ala Val 20 25 30 Ala Ile Ala Asp Leu Asn Leu Glu Ala Ala Arg Ala Ala Ala Ala Gly 35 40 45 Ile Glu Ala Ala Gly Gly Lys Ala Ile Ala Val Ala Met Asp Val Thr 50 55 60 Ser Glu Ala Ser Val Asn Gln Ala Thr Asp Glu Val Ala Gln Ala Phe 65 70 75 80 Gly Asn Ile Asp Ile Leu Val Ser Asn Ala Gly Ile Gln Ile Val Asn 85 90 95 Pro Ile Gln Asn Tyr Ala Phe Ser Asp Trp Lys Lys Met Gln Ala Ile 100 105 110 His Val Asp Gly Ala Phe Leu Thr Thr Lys Ala Ala Leu Lys Tyr Met 115 120 125 Tyr Arg Asp Lys Arg Gly Gly Thr Val Ile Tyr Met Gly Ser Val His 130 135 140 Ser His Glu Ala Ser Pro Leu Lys Ser Ala Tyr Val Ala Ala Lys His 145 150 155 160 Ala Leu Leu Gly Leu Ala Arg Val Leu Ala Lys Glu Gly Ala Glu Phe 165 170 175 Asn Val Arg Ser His Val Ile Cys Pro Gly Phe Val Arg Thr Pro Leu 180 185 190 Val Asp Lys Gln Ile Pro Glu Gln Ala Lys Glu Leu Gly Ile Ser Glu 195 200 205 Glu Glu Val Val Arg Arg Val Met Leu Gly Gly Thr Val Asp Gly Val 210 215 220 Phe Thr Thr Val Asp Asp Val Ala Arg Thr Ala Leu Phe Leu Cys Ala 225 230 235 240 Phe Pro Ser Ala Ala Leu Thr Gly Gln Ser Phe Ile Val Ser His Gly 245 250 255 Trp Tyr Met Gln 260 <210> SEQ ID NO 61 <211> LENGTH: 256 <212> TYPE: PRT <213> ORGANISM: Ralstonia pickettii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh2, BAE72685.1 <400> SEQUENCE: 61 Met Leu Gln Gly Lys Thr Ala Leu Val Thr Gly Ser Thr Cys Gly Ile 1 5 10 15 Gly Leu Gly Ile Ala Gln Ala Leu Ala Ala Gln Gly Ala Asn Ile Ile 20 25 30 Val Asn Gly Phe Arg Arg Ala Asp Gly Ala Arg Gln Gln Ile Ala Ala 35 40 45 Ala Gly Gln Val Ile Arg Leu Gly Tyr His Gly Ala Asp Met Ser Lys 50 55 60 Ala Ser Glu Ile Glu Asp Met Met Arg Tyr Ala Glu Ala Glu Phe Ala 65 70 75 80 Ala Asp Ile Leu Val Asn Asn Ala Gly Ile Gln His Val Ala Ser Ile 85 90 95 Glu Asp Phe Pro Pro Glu Arg Trp Asp Ala Ile Ile Ala Ile Asn Leu 100 105 110 Thr Ser Ala Phe His Thr Thr Arg Leu Ala Leu Pro Gly Met Arg Gln 115 120 125 Lys Asn Trp Gly Arg Val Ile Asn Ile Ala Ser Thr His Gly Leu Val 130 135 140 Ala Ser Ala Gln Lys Ser Ala Tyr Val Ala Ala Lys His Gly Ile Val 145 150 155 160 Gly Leu Thr Lys Val Thr Ala Leu Glu Thr Ala Gln Asn Arg Val Thr 165 170 175 Ala Asn Ala Ile Cys Pro Gly Trp Val Leu Thr Pro Leu Val Gln Lys 180 185 190 Gln Val Gln Ala Arg Pro Ala His Gly Ile Ser Val Glu Gln Ala Lys 195 200 205 Arg Glu Leu Val Ile Glu Lys Gln Pro Ser Gly Gln Phe Val Thr Pro 210 215 220 Asp Glu Leu Gly Ala Leu Ala Val Phe Leu Ala Ser Glu Ala Gly Arg 225 230 235 240 Gln Val Arg Gly Ala Ile Trp Asn Met Ala Gly Gly Trp Phe Ala Gln 245 250 255 <210> SEQ ID NO 62 <211> LENGTH: 254 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh, AGY75962 <400> SEQUENCE: 62 Met Arg Leu Glu Asn Lys Val Ala Ile Val Thr Gly Ser Ala Met Gly 1 5 10 15 Ile Gly Lys Ala Ile Val Arg Asp Phe Val Asn Glu Gly Ala Lys Val 20 25 30 Ile Ile Ser Asp Ile Leu Glu Ala Glu Gly Gln Ala Leu Glu Glu Glu 35 40 45 Leu Gln Lys Lys Gly His Ser Val Tyr Phe Phe Lys Thr Asp Val Ser 50 55 60 Ser Glu Lys Asn Ile Lys Glu Leu Val Lys Phe Thr Leu Glu Lys Phe 65 70 75 80 Gly Thr Ile Asn Ile Leu Cys Asn Asn Ala Ala Val Asn Ile Pro Gly 85 90 95 Ser Val Leu Glu Leu Thr Glu Asp Ile Trp Asn Lys Thr Met Asp Val 100 105 110 Asn Val Lys Ser His Phe Leu Val Ser Lys His Val Ile Pro Val Met 115 120 125 Gln Lys Ala Gly Gly Gly Ser Ile Val Asn Thr Ala Ser Ala Asn Ser 130 135 140 Phe Val Ala Glu Pro Arg Leu Ser Ala Tyr Val Ala Ser Lys Gly Ala 145 150 155 160 Ile Leu Met Leu Thr Arg Ala Met Ala Leu Asp Phe Ala Lys Asp Asn 165 170 175 Ile Arg Val Asn Cys Ile Cys Pro Gly Trp Val Asp Thr Thr Phe Asn 180 185 190 Asp Ala His Ala Glu Leu Phe Gly Gly Arg Glu Ala Val Leu Lys Asp 195 200 205 Leu Ala Ser Val Gln Pro Ile Gly Arg Pro Ile Ala Pro Met Glu Ile 210 215 220 Ala Lys Ile Ala Thr Phe Leu Ala Ser Asp Asp Ser Ser Cys Met Thr 225 230 235 240 Gly Ser Pro Val Ile Ala Asp Gly Gly Ile Thr Ala Gly Val 245 250 <210> SEQ ID NO 63 <211> LENGTH: 607 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AOR, WP_013238665.1 <400> SEQUENCE: 63 Met Tyr Gly Tyr Asp Gly Lys Val Leu Arg Ile Asn Leu Lys Glu Arg 1 5 10 15 Thr Cys Lys Ser Glu Asn Leu Asp Leu Asp Lys Ala Lys Lys Phe Ile 20 25 30 Gly Cys Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Ile Asp Pro 35 40 45 Lys Ile Asp Ala Leu Ser Pro Glu Asn Lys Phe Ile Ile Val Thr Gly 50 55 60 Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 65 70 75 80 Thr Lys Ala Pro Leu Thr Gly Thr Ile Gly Ile Ser Asn Ser Gly Gly 85 90 95 Lys Trp Gly Val Asp Leu Lys Lys Ala Gly Trp Asp Met Ile Ile Val 100 105 110 Glu Asp Lys Ala Asp Ser Pro Val Tyr Ile Glu Ile Val Asp Asp Lys 115 120 125 Val Glu Ile Lys Asp Ala Ser Gln Leu Trp Gly Lys Val Thr Ser Glu 130 135 140 Thr Thr Lys Glu Leu Glu Lys Ile Thr Glu Asn Lys Ser Lys Val Leu 145 150 155 160 Cys Ile Gly Pro Ala Gly Glu Arg Leu Ser Leu Met Ala Ala Val Met 165 170 175 Asn Asp Val Asp Arg Thr Ala Ala Arg Gly Gly Val Gly Ala Val Met 180 185 190 Gly Ser Lys Asn Leu Lys Ala Ile Thr Val Lys Gly Thr Gly Lys Ile 195 200 205 Ala Leu Ala Asp Lys Glu Lys Val Lys Lys Val Ser Val Glu Lys Ile 210 215 220 Thr Thr Leu Lys Asn Asp Pro Val Ala Gly Gln Gly Met Pro Thr Tyr 225 230 235 240 Gly Thr Ala Ile Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 245 250 255 Val Lys Asn Phe Gln Glu Ser Tyr Thr Asn Gln Ala Asp Lys Ile Ser 260 265 270 Gly Glu Thr Leu Thr Ala Asn Gln Leu Val Arg Lys Asn Pro Cys Tyr 275 280 285 Ser Cys Pro Ile Gly Cys Gly Arg Trp Val Arg Leu Lys Asp Gly Thr 290 295 300 Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Cys Phe Gly Ser Asp 305 310 315 320 Cys Gly Ser Tyr Asp Leu Asp Ala Ile Asn Glu Ala Asn Met Leu Cys 325 330 335 Asn Glu Tyr Gly Ile Asp Thr Ile Thr Cys Gly Ala Thr Ile Ala Ala 340 345 350 Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 355 360 365 Gly Asp Asn Leu Ser Leu Lys Trp Gly Asp Thr Glu Ser Met Ile Gly 370 375 380 Trp Ile Lys Arg Met Val Tyr Ser Glu Gly Phe Gly Ala Lys Met Thr 385 390 395 400 Asn Gly Ser Tyr Arg Leu Cys Glu Gly Tyr Gly Ala Pro Glu Tyr Ser 405 410 415 Met Thr Val Lys Lys Gln Glu Ile Pro Ala Tyr Asp Pro Arg Gly Ile 420 425 430 Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 435 440 445 Ile Lys Gly Tyr Met Ile Asn Pro Glu Ile Leu Gly Tyr Pro Glu Lys 450 455 460 Leu Asp Arg Phe Ala Leu Asp Gly Lys Ala Ala Tyr Ala Lys Leu Phe 465 470 475 480 His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 485 490 495 Thr Phe Gly Leu Gly Ile Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 500 505 510 Val Gly Glu Ser Thr Tyr Asp Ala Asp Ser Leu Leu Glu Ala Gly Asp 515 520 525 Arg Ile Trp Thr Leu Glu Lys Leu Phe Asn Leu Ala Ala Gly Ile Asp 530 535 540 Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Pro Ile Pro 545 550 555 560 Asp Gly Pro Ser Lys Gly Glu Val His Arg Leu Asp Val Leu Leu Pro 565 570 575 Glu Tyr Tyr Ser Val Arg Gly Trp Ser Lys Glu Gly Ile Pro Thr Glu 580 585 590 Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Ile Gly Lys Phe 595 600 605 <210> SEQ ID NO 64 <211> LENGTH: 607 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AOR, WP_013238675.1 <400> SEQUENCE: 64 Met Tyr Gly Tyr Lys Gly Lys Val Leu Arg Ile Asn Leu Ser Ser Lys 1 5 10 15 Thr Tyr Ile Val Glu Glu Leu Lys Ile Asp Lys Ala Lys Lys Phe Ile 20 25 30 Gly Ala Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Val Asp Pro 35 40 45 Lys Val Asp Pro Leu Ser Pro Asp Asn Lys Phe Ile Ile Ala Ala Gly 50 55 60 Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 65 70 75 80 Thr Lys Ser Pro Leu Thr Gly Thr Ile Ala Ile Ala Asn Ser Gly Gly 85 90 95 Lys Trp Gly Ala Glu Phe Lys Ala Ala Gly Tyr Asp Met Ile Ile Val 100 105 110 Glu Gly Lys Ser Asp Lys Glu Val Tyr Val Asn Ile Val Asp Asp Lys 115 120 125 Val Glu Phe Arg Asp Ala Ser His Val Trp Gly Lys Leu Thr Glu Glu 130 135 140 Thr Thr Lys Met Leu Gln Gln Glu Thr Asp Ser Arg Ala Lys Val Leu 145 150 155 160 Cys Ile Gly Pro Ala Gly Glu Lys Leu Ser Leu Met Ala Ala Val Met 165 170 175 Asn Asp Val Asp Arg Thr Ala Gly Arg Gly Gly Val Gly Ala Val Met 180 185 190 Gly Ser Lys Asn Leu Lys Ala Ile Val Val Lys Gly Ser Gly Lys Val 195 200 205 Lys Leu Phe Asp Glu Gln Lys Val Lys Glu Val Ala Leu Glu Lys Thr 210 215 220 Asn Ile Leu Arg Lys Asp Pro Val Ala Gly Gly Gly Leu Pro Thr Tyr 225 230 235 240 Gly Thr Ala Val Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 245 250 255 Val Lys Asn Phe Gln Lys Ser Tyr Thr Asp Gln Ala Asp Lys Ile Ser 260 265 270 Gly Glu Thr Leu Thr Lys Asp Cys Leu Val Arg Lys Asn Pro Cys Tyr 275 280 285 Arg Cys Pro Ile Ala Cys Gly Arg Trp Val Lys Leu Asp Asp Gly Thr 290 295 300 Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Ser Phe Gly Ser Asp 305 310 315 320 Cys Asp Val Tyr Asp Ile Asn Ala Val Asn Thr Ala Asn Met Leu Cys 325 330 335 Asn Glu Tyr Gly Leu Asp Thr Ile Thr Ala Gly Cys Thr Ile Ala Ala 340 345 350 Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 355 360 365 Ala Asp Gly Leu Ser Leu Asn Trp Gly Asp Ala Lys Ser Met Val Glu 370 375 380 Trp Val Lys Lys Met Gly Leu Arg Glu Gly Phe Gly Asp Lys Met Ala 385 390 395 400 Asp Gly Ser Tyr Arg Leu Cys Asp Ser Tyr Gly Val Pro Glu Tyr Ser 405 410 415 Met Thr Val Lys Lys Gln Glu Leu Pro Ala Tyr Asp Pro Arg Gly Ile 420 425 430 Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 435 440 445 Ile Lys Gly Tyr Met Val Ser Pro Glu Ile Leu Gly Tyr Pro Glu Lys 450 455 460 Leu Asp Arg Leu Ala Val Glu Gly Lys Ala Gly Tyr Ala Arg Val Phe 465 470 475 480 His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 485 490 495 Thr Phe Gly Leu Gly Ala Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 500 505 510 Val Gly Gly Glu Leu His Asp Val Asn Ser Leu Met Leu Ala Gly Asp 515 520 525 Arg Ile Trp Thr Leu Glu Lys Ile Phe Asn Leu Lys Ala Gly Ile Asp 530 535 540 Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Gln Ile Pro 545 550 555 560 Glu Gly Pro Ser Lys Gly Glu Val His Lys Leu Asp Val Leu Leu Pro 565 570 575 Glu Tyr Tyr Ser Val Arg Gly Trp Asp Lys Asn Gly Ile Pro Thr Glu 580 585 590 Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Val Gly Lys Leu 595 600 605 <210> SEQ ID NO 65 <211> LENGTH: 607 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AOR, ADK15073.1 <400> SEQUENCE: 65 Met Tyr Gly Tyr Asp Gly Lys Val Leu Arg Ile Asn Leu Lys Glu Arg 1 5 10 15 Thr Cys Lys Ser Glu Asn Leu Asp Leu Asp Lys Ala Lys Lys Phe Ile 20 25 30 Gly Cys Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Ile Asp Pro 35 40 45 Lys Ile Asp Ala Leu Ser Pro Glu Asn Lys Phe Ile Ile Val Thr Gly 50 55 60 Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 65 70 75 80 Thr Lys Ala Pro Leu Thr Gly Thr Ile Gly Ile Ser Asn Ser Gly Gly 85 90 95 Lys Trp Gly Val Asp Leu Lys Lys Ala Gly Trp Asp Met Ile Ile Val 100 105 110 Glu Asp Lys Ala Asp Ser Pro Val Tyr Ile Glu Ile Val Asp Asp Lys 115 120 125 Val Glu Ile Lys Asp Ala Ser Gln Leu Trp Gly Lys Val Thr Ser Glu 130 135 140 Thr Thr Lys Glu Leu Glu Lys Ile Thr Glu Asn Lys Ser Lys Val Leu 145 150 155 160 Cys Ile Gly Pro Ala Gly Glu Arg Leu Ser Leu Met Ala Ala Val Met 165 170 175 Asn Asp Val Asp Arg Thr Ala Ala Arg Gly Gly Val Gly Ala Val Met 180 185 190 Gly Ser Lys Asn Leu Lys Ala Ile Thr Val Lys Gly Thr Gly Lys Ile 195 200 205 Ala Leu Ala Asp Lys Glu Lys Val Lys Lys Val Ser Val Glu Lys Ile 210 215 220 Thr Thr Leu Lys Asn Asp Pro Val Ala Gly Gln Gly Met Pro Thr Tyr 225 230 235 240 Gly Thr Ala Ile Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 245 250 255 Val Lys Asn Phe Gln Glu Ser Tyr Thr Asn Gln Ala Asp Lys Ile Ser 260 265 270 Gly Glu Thr Leu Thr Ala Asn Gln Leu Val Arg Lys Asn Pro Cys Tyr 275 280 285 Ser Cys Pro Ile Gly Cys Gly Arg Trp Val Arg Leu Lys Asp Gly Thr 290 295 300 Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Cys Phe Gly Ser Asp 305 310 315 320 Cys Gly Ser Tyr Asp Leu Asp Ala Ile Asn Glu Ala Asn Met Leu Cys 325 330 335 Asn Glu Tyr Gly Ile Asp Thr Ile Thr Cys Gly Ala Thr Ile Ala Ala 340 345 350 Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 355 360 365 Gly Asp Asn Leu Ser Leu Lys Trp Gly Asp Thr Glu Ser Met Ile Gly 370 375 380 Trp Ile Lys Arg Met Val Tyr Ser Glu Gly Phe Gly Ala Lys Met Thr 385 390 395 400 Asn Gly Ser Tyr Arg Leu Cys Glu Gly Tyr Gly Ala Pro Glu Tyr Ser 405 410 415 Met Thr Val Lys Lys Gln Glu Ile Pro Ala Tyr Asp Pro Arg Gly Ile 420 425 430 Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 435 440 445 Ile Lys Gly Tyr Met Ile Asn Pro Glu Ile Leu Gly Tyr Pro Glu Lys 450 455 460 Leu Asp Arg Phe Ala Leu Asp Gly Lys Ala Ala Tyr Ala Lys Leu Phe 465 470 475 480 His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 485 490 495 Thr Phe Gly Leu Gly Ile Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 500 505 510 Val Gly Glu Ser Thr Tyr Asp Ala Asp Ser Leu Leu Glu Ala Gly Asp 515 520 525 Arg Ile Trp Thr Leu Glu Lys Leu Phe Asn Leu Ala Ala Gly Ile Asp 530 535 540 Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Pro Ile Pro 545 550 555 560 Asp Gly Pro Ser Lys Gly Glu Val His Arg Leu Asp Val Leu Leu Pro 565 570 575 Glu Tyr Tyr Ser Val Arg Gly Trp Ser Lys Glu Gly Ile Pro Thr Glu 580 585 590 Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Ile Gly Lys Phe 595 600 605 <210> SEQ ID NO 66 <211> LENGTH: 607 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AOR, ADK15083.1 <400> SEQUENCE: 66 Met Tyr Gly Tyr Lys Gly Lys Val Leu Arg Ile Asn Leu Ser Ser Lys 1 5 10 15 Thr Tyr Ile Val Glu Glu Leu Lys Ile Asp Lys Ala Lys Lys Phe Ile 20 25 30 Gly Ala Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Val Asp Pro 35 40 45 Lys Val Asp Pro Leu Ser Pro Asp Asn Lys Phe Ile Ile Ala Ala Gly 50 55 60 Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 65 70 75 80 Thr Lys Ser Pro Leu Thr Gly Thr Ile Ala Ile Ala Asn Ser Gly Gly 85 90 95 Lys Trp Gly Ala Glu Phe Lys Ala Ala Gly Tyr Asp Met Ile Ile Val 100 105 110 Glu Gly Lys Ser Asp Lys Glu Val Tyr Val Asn Ile Val Asp Asp Lys 115 120 125 Val Glu Phe Arg Asp Ala Ser His Val Trp Gly Lys Leu Thr Glu Glu 130 135 140 Thr Thr Lys Met Leu Gln Gln Glu Thr Asp Ser Arg Ala Lys Val Leu 145 150 155 160 Cys Ile Gly Pro Ala Gly Glu Lys Leu Ser Leu Met Ala Ala Val Met 165 170 175 Asn Asp Val Asp Arg Thr Ala Gly Arg Gly Gly Val Gly Ala Val Met 180 185 190 Gly Ser Lys Asn Leu Lys Ala Ile Val Val Lys Gly Ser Gly Lys Val 195 200 205 Lys Leu Phe Asp Glu Gln Lys Val Lys Glu Val Ala Leu Glu Lys Thr 210 215 220 Asn Ile Leu Arg Lys Asp Pro Val Ala Gly Gly Gly Leu Pro Thr Tyr 225 230 235 240 Gly Thr Ala Val Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 245 250 255 Val Lys Asn Phe Gln Lys Ser Tyr Thr Asp Gln Ala Asp Lys Ile Ser 260 265 270 Gly Glu Thr Leu Thr Lys Asp Cys Leu Val Arg Lys Asn Pro Cys Tyr 275 280 285 Arg Cys Pro Ile Ala Cys Gly Arg Trp Val Lys Leu Asp Asp Gly Thr 290 295 300 Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Ser Phe Gly Ser Asp 305 310 315 320 Cys Asp Val Tyr Asp Ile Asn Ala Val Asn Thr Ala Asn Met Leu Cys 325 330 335 Asn Glu Tyr Gly Leu Asp Thr Ile Thr Ala Gly Cys Thr Ile Ala Ala 340 345 350 Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 355 360 365 Ala Asp Gly Leu Ser Leu Asn Trp Gly Asp Ala Lys Ser Met Val Glu 370 375 380 Trp Val Lys Lys Met Gly Leu Arg Glu Gly Phe Gly Asp Lys Met Ala 385 390 395 400 Asp Gly Ser Tyr Arg Leu Cys Asp Ser Tyr Gly Val Pro Glu Tyr Ser 405 410 415 Met Thr Val Lys Lys Gln Glu Leu Pro Ala Tyr Asp Pro Arg Gly Ile 420 425 430 Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 435 440 445 Ile Lys Gly Tyr Met Val Ser Pro Glu Ile Leu Gly Tyr Pro Glu Lys 450 455 460 Leu Asp Arg Leu Ala Val Glu Gly Lys Ala Gly Tyr Ala Arg Val Phe 465 470 475 480 His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 485 490 495 Thr Phe Gly Leu Gly Ala Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 500 505 510 Val Gly Gly Glu Leu His Asp Val Asn Ser Leu Met Leu Ala Gly Asp 515 520 525 Arg Ile Trp Thr Leu Glu Lys Ile Phe Asn Leu Lys Ala Gly Ile Asp 530 535 540 Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Gln Ile Pro 545 550 555 560 Glu Gly Pro Ser Lys Gly Glu Val His Lys Leu Asp Val Leu Leu Pro 565 570 575 Glu Tyr Tyr Ser Val Arg Gly Trp Asp Lys Asn Gly Ile Pro Thr Glu 580 585 590 Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Val Gly Lys Leu 595 600 605 <210> SEQ ID NO 67 <211> LENGTH: 405 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Adh, AGY76060.1 <400> SEQUENCE: 67 Met Lys Tyr Met Gly Ile Lys Ile Tyr Gly Asn Lys Ile Arg Gly Ile 1 5 10 15 Ile Met Glu Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asp 20 25 30 Ala Leu Gly Ala Leu Lys Thr Leu Lys Gly Lys Lys Ala Val Val Val 35 40 45 Val Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Asp Lys Val Glu 50 55 60 Glu Tyr Leu Lys Glu Ala Asn Ile Glu Val Lys Leu Ile Glu Gly Val 65 70 75 80 Glu Pro Asp Pro Ser Val Glu Thr Val Met Lys Gly Ala Lys Ile Met 85 90 95 Thr Glu Phe Gly Pro Asp Trp Ile Val Ala Ile Gly Gly Gly Ser Pro 100 105 110 Ile Asp Ala Ala Lys Ala Met Trp Leu Phe Tyr Glu Tyr Pro Asp Phe 115 120 125 Thr Phe Lys Gln Ala Ile Val Pro Phe Gly Leu Pro Glu Leu Arg Gln 130 135 140 Lys Ala Lys Phe Val Ala Ile Ala Ser Thr Ser Gly Thr Ala Thr Glu 145 150 155 160 Val Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr 165 170 175 Pro Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Val Asp Pro 180 185 190 Ala Leu Ala Gln Thr Met Pro Pro Lys Leu Thr Ala His Thr Gly Met 195 200 205 Asp Ala Leu Thr His Ala Leu Glu Ala Tyr Val Ala Ser Ala Arg Ser 210 215 220 Asp Ile Ser Asp Pro Leu Ala Ile His Ser Ile Ile Met Thr Arg Asp 225 230 235 240 Asn Leu Leu Lys Ser Tyr Lys Gly Asp Lys Asp Ala Arg Asn Lys Met 245 250 255 His Ile Ser Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu 260 265 270 Gly Ile Thr His Ser Leu Ala His Lys Thr Gly Ala Val Trp His Ile 275 280 285 Pro His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Leu Asp Phe 290 295 300 Asn Lys Lys Ala Cys Ser Asp Arg Tyr Ala Asn Ile Ala Lys Ile Leu 305 310 315 320 Gly Leu Lys Gly Thr Thr Glu Asp Glu Leu Val Asp Ser Leu Val Lys 325 330 335 Met Val Gln Asp Met Asp Lys Glu Leu Asn Ile Pro Leu Thr Leu Lys 340 345 350 Asp Tyr Gly Ile Ser Lys Asp Asp Phe Asn Ser Asn Val Asp Phe Ile 355 360 365 Ala Lys Asn Ala Leu Leu Asp Ala Cys Thr Gly Ala Asn Pro Arg Pro 370 375 380 Ile Asp Phe Asp Gln Met Lys Lys Ile Leu Gln Cys Ile Tyr Asp Gly 385 390 395 400 Lys Lys Val Thr Phe 405 <210> SEQ ID NO 68 <211> LENGTH: 388 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Adh, ADK17019.1 <400> SEQUENCE: 68 Met Glu Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asp Ala 1 5 10 15 Leu Gly Ala Leu Lys Thr Leu Lys Gly Lys Lys Ala Val Val Val Val 20 25 30 Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Asp Lys Val Glu Glu 35 40 45 Tyr Leu Lys Glu Ala Asn Ile Glu Val Lys Leu Ile Glu Gly Val Glu 50 55 60 Pro Asp Pro Ser Val Glu Thr Val Met Lys Gly Ala Lys Ile Met Thr 65 70 75 80 Glu Phe Gly Pro Asp Trp Ile Val Ala Ile Gly Gly Gly Ser Pro Ile 85 90 95 Asp Ala Ala Lys Ala Met Trp Leu Phe Tyr Glu Tyr Pro Asp Phe Thr 100 105 110 Phe Lys Gln Ala Ile Val Pro Phe Gly Leu Pro Glu Leu Arg Gln Lys 115 120 125 Ala Lys Phe Val Ala Ile Ala Ser Thr Ser Gly Thr Ala Thr Glu Val 130 135 140 Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr Pro 145 150 155 160 Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Val Asp Pro Ala 165 170 175 Leu Ala Gln Thr Met Pro Pro Lys Leu Thr Ala His Thr Gly Met Asp 180 185 190 Ala Leu Thr His Ala Leu Glu Ala Tyr Val Ala Ser Ala Arg Ser Asp 195 200 205 Ile Ser Asp Pro Leu Ala Ile His Ser Ile Ile Met Thr Arg Asp Asn 210 215 220 Leu Leu Lys Ser Tyr Lys Gly Asp Lys Asp Ala Arg Asn Lys Met His 225 230 235 240 Ile Ser Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu Gly 245 250 255 Ile Thr His Ser Leu Ala His Lys Thr Gly Ala Val Trp His Ile Pro 260 265 270 His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Leu Asp Phe Asn 275 280 285 Lys Lys Ala Cys Ser Asp Arg Tyr Ala Asn Ile Ala Lys Ile Leu Gly 290 295 300 Leu Lys Gly Thr Thr Glu Asp Glu Leu Val Asp Ser Leu Val Lys Met 305 310 315 320 Val Gln Asp Met Asp Lys Glu Leu Asn Ile Pro Leu Thr Leu Lys Asp 325 330 335 Tyr Gly Ile Ser Lys Asp Asp Phe Asn Ser Asn Val Asp Phe Ile Ala 340 345 350 Lys Asn Ala Leu Leu Asp Ala Cys Thr Gly Ala Asn Pro Arg Pro Ile 355 360 365 Asp Phe Asp Gln Met Lys Lys Ile Leu Gln Cys Ile Tyr Asp Gly Lys 370 375 380 Lys Val Thr Phe 385 <210> SEQ ID NO 69 <211> LENGTH: 390 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: BdhB, NP_349891.1 <400> SEQUENCE: 69 Met Val Asp Phe Glu Tyr Ser Ile Pro Thr Arg Ile Phe Phe Gly Lys 1 5 10 15 Asp Lys Ile Asn Val Leu Gly Arg Glu Leu Lys Lys Tyr Gly Ser Lys 20 25 30 Val Leu Ile Val Tyr Gly Gly Gly Ser Ile Lys Arg Asn Gly Ile Tyr 35 40 45 Asp Lys Ala Val Ser Ile Leu Glu Lys Asn Ser Ile Lys Phe Tyr Glu 50 55 60 Leu Ala Gly Val Glu Pro Asn Pro Arg Val Thr Thr Val Glu Lys Gly 65 70 75 80 Val Lys Ile Cys Arg Glu Asn Gly Val Glu Val Val Leu Ala Ile Gly 85 90 95 Gly Gly Ser Ala Ile Asp Cys Ala Lys Val Ile Ala Ala Ala Cys Glu 100 105 110 Tyr Asp Gly Asn Pro Trp Asp Ile Val Leu Asp Gly Ser Lys Ile Lys 115 120 125 Arg Val Leu Pro Ile Ala Ser Ile Leu Thr Ile Ala Ala Thr Gly Ser 130 135 140 Glu Met Asp Thr Trp Ala Val Ile Asn Asn Met Asp Thr Asn Glu Lys 145 150 155 160 Leu Ile Ala Ala His Pro Asp Met Ala Pro Lys Phe Ser Ile Leu Asp 165 170 175 Pro Thr Tyr Thr Tyr Thr Val Pro Thr Asn Gln Thr Ala Ala Gly Thr 180 185 190 Ala Asp Ile Met Ser His Ile Phe Glu Val Tyr Phe Ser Asn Thr Lys 195 200 205 Thr Ala Tyr Leu Gln Asp Arg Met Ala Glu Ala Leu Leu Arg Thr Cys 210 215 220 Ile Lys Tyr Gly Gly Ile Ala Leu Glu Lys Pro Asp Asp Tyr Glu Ala 225 230 235 240 Arg Ala Asn Leu Met Trp Ala Ser Ser Leu Ala Ile Asn Gly Leu Leu 245 250 255 Thr Tyr Gly Lys Asp Thr Asn Trp Ser Val His Leu Met Glu His Glu 260 265 270 Leu Ser Ala Tyr Tyr Asp Ile Thr His Gly Val Gly Leu Ala Ile Leu 275 280 285 Thr Pro Asn Trp Met Glu Tyr Ile Leu Asn Asn Asp Thr Val Tyr Lys 290 295 300 Phe Val Glu Tyr Gly Val Asn Val Trp Gly Ile Asp Lys Glu Lys Asn 305 310 315 320 His Tyr Asp Ile Ala His Gln Ala Ile Gln Lys Thr Arg Asp Tyr Phe 325 330 335 Val Asn Val Leu Gly Leu Pro Ser Arg Leu Arg Asp Val Gly Ile Glu 340 345 350 Glu Glu Lys Leu Asp Ile Met Ala Lys Glu Ser Val Lys Leu Thr Gly 355 360 365 Gly Thr Ile Gly Asn Leu Arg Pro Val Asn Ala Ser Glu Val Leu Gln 370 375 380 Ile Phe Lys Lys Ser Val 385 390 <210> SEQ ID NO 70 <211> LENGTH: 387 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh, WP_041897187.1 <400> SEQUENCE: 70 Met Glu Asn Phe Asn Tyr Ser Ile Pro Thr Lys Val Tyr Phe Gly Lys 1 5 10 15 Gly Gln Ile Lys Asn Leu Ala Ala Ile Ile Lys Glu Tyr Gly Asn Lys 20 25 30 Ile Phe Ile Ala Tyr Gly Gly Gly Ser Ile Lys Lys Ile Gly Leu Tyr 35 40 45 Asp Glu Met Ile Lys Ile Leu Asn Asp Asn Ser Ile Ser Tyr Val Glu 50 55 60 Leu Ser Gly Ile Glu Pro Asn Pro Arg Ile Glu Thr Val Arg Lys Gly 65 70 75 80 Ile Lys Ile Cys Lys Glu Asn Asn Val Glu Val Val Leu Ala Val Gly 85 90 95 Gly Gly Ser Thr Ile Asp Cys Ala Lys Val Ile Ala Ala Gly Val Lys 100 105 110 Tyr Glu Gly Asp Pro Trp Asp Leu Val Thr Ser Pro Gln Lys Ile Asn 115 120 125 Glu Val Leu Pro Ile Val Thr Ile Leu Thr Leu Ser Ala Thr Gly Ser 130 135 140 Glu Met Asp Pro His Ala Val Ile Ser Asp Met Thr Thr Asn Gln Lys 145 150 155 160 Leu Gly Thr Gly His Glu Asn Met Lys Pro Lys Ala Ser Ile Leu Asp 165 170 175 Pro Glu Tyr Thr Tyr Ser Val Pro Lys Asn Gln Thr Ala Ala Gly Thr 180 185 190 Ala Asp Ile Met Ser His Ile Phe Glu Thr Tyr Phe Asn His Thr Lys 195 200 205 Gly Val Asp Ile Gln Asp Ser Thr Ala Glu Gly Leu Leu Arg Ala Cys 210 215 220 Ile Lys Tyr Gly Lys Ile Ala Ile Glu Asn Pro Lys Asp Tyr Asp Ala 225 230 235 240 Arg Ala Asn Leu Met Trp Ala Ser Ser Trp Ala Ile Asn Gly Leu Ile 245 250 255 Ser Tyr Gly Thr Asn Ser Pro Trp Val Val His Pro Met Glu His Glu 260 265 270 Leu Ser Ala Phe Tyr Asp Ile Thr His Gly Val Gly Leu Ala Ile Leu 275 280 285 Thr Pro His Trp Met Lys Tyr Ser Leu Asp Asp Thr Thr Val Phe Lys 290 295 300 Phe Ala Gln Tyr Gly Ile Asn Val Trp Gly Ile Asp Lys Asn Leu Asp 305 310 315 320 Lys Phe Glu Ile Ala Asn Lys Ala Ile Glu Lys Thr Ser Glu Phe Phe 325 330 335 Lys Glu Leu Gly Ile Pro Ser Thr Leu Arg Glu Val Gly Ile Glu Glu 340 345 350 Glu Lys Leu Glu Leu Met Ala Lys Lys Ala Met Asn Pro Tyr Phe Lys 355 360 365 Tyr Ala Phe Lys Pro Leu Asp Glu Asn Asp Ile Leu Lys Ile Phe Lys 370 375 380 Ala Ala Leu 385 <210> SEQ ID NO 71 <211> LENGTH: 388 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh1, YP_003780648.1 <400> SEQUENCE: 71 Met Gly Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asn Ala 1 5 10 15 Leu Glu Asn Leu Lys Asn Leu Asp Gly Asn Lys Ala Val Val Val Val 20 25 30 Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Ala Lys Val Glu Lys 35 40 45 Tyr Leu Lys Glu Thr Gly Met Glu Val Lys Leu Ile Glu Gly Val Glu 50 55 60 Pro Asp Pro Ser Val Asp Thr Val Met Asn Gly Ala Lys Ile Met Arg 65 70 75 80 Asp Phe Asn Pro Asp Trp Ile Val Ser Ile Gly Gly Gly Ser Pro Ile 85 90 95 Asp Ala Ala Lys Ala Met Trp Ile Phe Tyr Glu Tyr Pro Asp Phe Thr 100 105 110 Phe Glu Lys Ala Val Val Pro Phe Gly Ile Pro Lys Leu Arg Gln Lys 115 120 125 Ala Gln Phe Val Ala Ile Pro Ser Thr Ser Gly Thr Ala Thr Glu Val 130 135 140 Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr Pro 145 150 155 160 Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Ile Asp Pro Ser 165 170 175 Leu Ala Glu Thr Met Pro Lys Lys Leu Thr Ala His Thr Gly Met Asp 180 185 190 Ala Leu Thr His Ala Ile Glu Ala Tyr Val Ala Ser Leu His Ser Asp 195 200 205 Phe Ser Asp Pro Leu Ala Met His Ala Ile Thr Met Ile His Lys Tyr 210 215 220 Leu Leu Lys Ser Tyr Glu Glu Asp Lys Glu Ala Arg Gly His Met His 225 230 235 240 Ile Ala Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu Gly 245 250 255 Ile Thr His Ser Ile Ala His Lys Thr Gly Ala Val Phe His Ile Pro 260 265 270 His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Ile Asp Phe Asn 275 280 285 Lys Lys Ala Cys Ser Glu Arg Tyr Ala Lys Ile Ala Lys Lys Leu His 290 295 300 Leu Ser Gly Asn Ser Glu Asp Glu Leu Ile Asp Ser Leu Thr Glu Met 305 310 315 320 Ile Arg Thr Met Asn Lys Lys Met Asp Ile Pro Leu Thr Ile Lys Asp 325 330 335 Tyr Gly Ile Ser Glu Asn Asp Phe Asn Glu Asn Leu Asp Phe Ile Ala 340 345 350 His Asn Ala Met Met Asp Ala Cys Thr Gly Ser Asn Pro Arg Ala Ile 355 360 365 Thr Glu Glu Glu Met Lys Lys Leu Leu Gln Tyr Met Tyr Asn Gly Gln 370 375 380 Lys Val Asn Phe 385 <210> SEQ ID NO 72 <211> LENGTH: 405 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh1, AGY76060.1 <400> SEQUENCE: 72 Met Lys Tyr Met Gly Ile Lys Ile Tyr Gly Asn Lys Ile Arg Gly Ile 1 5 10 15 Ile Met Glu Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asp 20 25 30 Ala Leu Gly Ala Leu Lys Thr Leu Lys Gly Lys Lys Ala Val Val Val 35 40 45 Val Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Asp Lys Val Glu 50 55 60 Glu Tyr Leu Lys Glu Ala Asn Ile Glu Val Lys Leu Ile Glu Gly Val 65 70 75 80 Glu Pro Asp Pro Ser Val Glu Thr Val Met Lys Gly Ala Lys Ile Met 85 90 95 Thr Glu Phe Gly Pro Asp Trp Ile Val Ala Ile Gly Gly Gly Ser Pro 100 105 110 Ile Asp Ala Ala Lys Ala Met Trp Leu Phe Tyr Glu Tyr Pro Asp Phe 115 120 125 Thr Phe Lys Gln Ala Ile Val Pro Phe Gly Leu Pro Glu Leu Arg Gln 130 135 140 Lys Ala Lys Phe Val Ala Ile Ala Ser Thr Ser Gly Thr Ala Thr Glu 145 150 155 160 Val Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr 165 170 175 Pro Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Val Asp Pro 180 185 190 Ala Leu Ala Gln Thr Met Pro Pro Lys Leu Thr Ala His Thr Gly Met 195 200 205 Asp Ala Leu Thr His Ala Leu Glu Ala Tyr Val Ala Ser Ala Arg Ser 210 215 220 Asp Ile Ser Asp Pro Leu Ala Ile His Ser Ile Ile Met Thr Arg Asp 225 230 235 240 Asn Leu Leu Lys Ser Tyr Lys Gly Asp Lys Asp Ala Arg Asn Lys Met 245 250 255 His Ile Ser Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu 260 265 270 Gly Ile Thr His Ser Leu Ala His Lys Thr Gly Ala Val Trp His Ile 275 280 285 Pro His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Leu Asp Phe 290 295 300 Asn Lys Lys Ala Cys Ser Asp Arg Tyr Ala Asn Ile Ala Lys Ile Leu 305 310 315 320 Gly Leu Lys Gly Thr Thr Glu Asp Glu Leu Val Asp Ser Leu Val Lys 325 330 335 Met Val Gln Asp Met Asp Lys Glu Leu Asn Ile Pro Leu Thr Leu Lys 340 345 350 Asp Tyr Gly Ile Ser Lys Asp Asp Phe Asn Ser Asn Val Asp Phe Ile 355 360 365 Ala Lys Asn Ala Leu Leu Asp Ala Cys Thr Gly Ala Asn Pro Arg Pro 370 375 380 Ile Asp Phe Asp Gln Met Lys Lys Ile Leu Gln Cys Ile Tyr Asp Gly 385 390 395 400 Lys Lys Val Thr Phe 405 <210> SEQ ID NO 73 <211> LENGTH: 388 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh2, YP_003782121.1 <400> SEQUENCE: 73 Met Glu Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asp Ala 1 5 10 15 Leu Gly Ala Leu Lys Thr Leu Lys Gly Lys Lys Ala Val Val Val Val 20 25 30 Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Asp Lys Val Glu Glu 35 40 45 Tyr Leu Lys Glu Ala Asn Ile Glu Val Lys Leu Ile Glu Gly Val Glu 50 55 60 Pro Asp Pro Ser Val Glu Thr Val Met Lys Gly Ala Lys Ile Met Thr 65 70 75 80 Glu Phe Gly Pro Asp Trp Ile Val Ala Ile Gly Gly Gly Ser Pro Ile 85 90 95 Asp Ala Ala Lys Ala Met Trp Leu Phe Tyr Glu Tyr Pro Asp Phe Thr 100 105 110 Phe Lys Gln Ala Ile Val Pro Phe Gly Leu Pro Glu Leu Arg Gln Lys 115 120 125 Ala Lys Phe Val Ala Ile Ala Ser Thr Ser Gly Thr Ala Thr Glu Val 130 135 140 Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr Pro 145 150 155 160 Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Val Asp Pro Ala 165 170 175 Leu Ala Gln Thr Met Pro Pro Lys Leu Thr Ala His Thr Gly Met Asp 180 185 190 Ala Leu Thr His Ala Leu Glu Ala Tyr Val Ala Ser Ala Arg Ser Asp 195 200 205 Ile Ser Asp Pro Leu Ala Ile His Ser Ile Ile Met Thr Arg Asp Asn 210 215 220 Leu Leu Lys Ser Tyr Lys Gly Asp Lys Asp Ala Arg Asn Lys Met His 225 230 235 240 Ile Ser Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu Gly 245 250 255 Ile Thr His Ser Leu Ala His Lys Thr Gly Ala Val Trp His Ile Pro 260 265 270 His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Leu Asp Phe Asn 275 280 285 Lys Lys Ala Cys Ser Asp Arg Tyr Ala Asn Ile Ala Lys Ile Leu Gly 290 295 300 Leu Lys Gly Thr Thr Glu Asp Glu Leu Val Asp Ser Leu Val Lys Met 305 310 315 320 Val Gln Asp Met Asp Lys Glu Leu Asn Ile Pro Leu Thr Leu Lys Asp 325 330 335 Tyr Gly Ile Ser Lys Asp Asp Phe Asn Ser Asn Val Asp Phe Ile Ala 340 345 350 Lys Asn Ala Leu Leu Asp Ala Cys Thr Gly Ala Asn Pro Arg Pro Ile 355 360 365 Asp Phe Asp Gln Met Lys Lys Ile Leu Gln Cys Ile Tyr Asp Gly Lys 370 375 380 Lys Val Thr Phe 385 <210> SEQ ID NO 74 <211> LENGTH: 388 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh2, AGY74784.1 <400> SEQUENCE: 74 Met Gly Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asn Ala 1 5 10 15 Leu Glu Asn Leu Lys Asn Leu Asp Gly Asn Lys Ala Val Val Val Val 20 25 30 Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Ala Lys Val Glu Lys 35 40 45 Tyr Leu Lys Glu Thr Gly Met Glu Val Lys Leu Ile Glu Gly Val Glu 50 55 60 Pro Asp Pro Ser Val Asp Thr Val Met Asn Gly Ala Lys Ile Met Arg 65 70 75 80 Asp Phe Asn Pro Asp Trp Ile Val Ser Ile Gly Gly Gly Ser Pro Ile 85 90 95 Asp Ala Ala Lys Ala Met Trp Ile Phe Tyr Glu Tyr Pro Asp Phe Thr 100 105 110 Phe Glu Lys Ala Val Val Pro Phe Gly Ile Pro Lys Leu Arg Gln Lys 115 120 125 Ala Gln Phe Val Ala Ile Pro Ser Thr Ser Gly Thr Ala Thr Glu Val 130 135 140 Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr Pro 145 150 155 160 Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Ile Asp Pro Ser 165 170 175 Leu Ala Glu Thr Met Pro Lys Lys Leu Thr Ala His Thr Gly Met Asp 180 185 190 Ala Leu Thr His Ala Ile Glu Ala Tyr Val Ala Ser Leu His Ser Asp 195 200 205 Phe Ser Asp Pro Leu Ala Met His Ala Ile Thr Met Ile His Lys Tyr 210 215 220 Leu Leu Lys Ser Tyr Glu Glu Asp Lys Glu Ala Arg Gly His Met His 225 230 235 240 Ile Ala Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu Gly 245 250 255 Ile Thr His Ser Ile Ala His Lys Thr Gly Ala Val Phe His Ile Pro 260 265 270 His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Ile Asp Phe Asn 275 280 285 Lys Lys Ala Cys Ser Glu Arg Tyr Ala Lys Ile Ala Lys Lys Leu His 290 295 300 Leu Ser Gly Asn Ser Glu Asp Glu Leu Ile Asp Ser Leu Thr Glu Met 305 310 315 320 Ile Arg Thr Met Asn Lys Lys Met Asp Ile Pro Leu Thr Ile Lys Asp 325 330 335 Tyr Gly Ile Ser Glu Asn Asp Phe Asn Glu Asn Leu Asp Phe Ile Ala 340 345 350 His Asn Ala Met Met Asp Ala Cys Thr Gly Ser Asn Pro Arg Ala Ile 355 360 365 Thr Glu Glu Glu Met Lys Lys Leu Leu Gln Tyr Met Tyr Asn Gly Gln 370 375 380 Lys Val Asn Phe 385 <210> SEQ ID NO 75 <211> LENGTH: 862 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AdhE1, NP_149325.1 <400> SEQUENCE: 75 Met Lys Val Thr Thr Val Lys Glu Leu Asp Glu Lys Leu Lys Val Ile 1 5 10 15 Lys Glu Ala Gln Lys Lys Phe Ser Cys Tyr Ser Gln Glu Met Val Asp 20 25 30 Glu Ile Phe Arg Asn Ala Ala Met Ala Ala Ile Asp Ala Arg Ile Glu 35 40 45 Leu Ala Lys Ala Ala Val Leu Glu Thr Gly Met Gly Leu Val Glu Asp 50 55 60 Lys Val Ile Lys Asn His Phe Ala Gly Glu Tyr Ile Tyr Asn Lys Tyr 65 70 75 80 Lys Asp Glu Lys Thr Cys Gly Ile Ile Glu Arg Asn Glu Pro Tyr Gly 85 90 95 Ile Thr Lys Ile Ala Glu Pro Ile Gly Val Val Ala Ala Ile Ile Pro 100 105 110 Val Thr Asn Pro Thr Ser Thr Thr Ile Phe Lys Ser Leu Ile Ser Leu 115 120 125 Lys Thr Arg Asn Gly Ile Phe Phe Ser Pro His Pro Arg Ala Lys Lys 130 135 140 Ser Thr Ile Leu Ala Ala Lys Thr Ile Leu Asp Ala Ala Val Lys Ser 145 150 155 160 Gly Ala Pro Glu Asn Ile Ile Gly Trp Ile Asp Glu Pro Ser Ile Glu 165 170 175 Leu Thr Gln Tyr Leu Met Gln Lys Ala Asp Ile Thr Leu Ala Thr Gly 180 185 190 Gly Pro Ser Leu Val Lys Ser Ala Tyr Ser Ser Gly Lys Pro Ala Ile 195 200 205 Gly Val Gly Pro Gly Asn Thr Pro Val Ile Ile Asp Glu Ser Ala His 210 215 220 Ile Lys Met Ala Val Ser Ser Ile Ile Leu Ser Lys Thr Tyr Asp Asn 225 230 235 240 Gly Val Ile Cys Ala Ser Glu Gln Ser Val Ile Val Leu Lys Ser Ile 245 250 255 Tyr Asn Lys Val Lys Asp Glu Phe Gln Glu Arg Gly Ala Tyr Ile Ile 260 265 270 Lys Lys Asn Glu Leu Asp Lys Val Arg Glu Val Ile Phe Lys Asp Gly 275 280 285 Ser Val Asn Pro Lys Ile Val Gly Gln Ser Ala Tyr Thr Ile Ala Ala 290 295 300 Met Ala Gly Ile Lys Val Pro Lys Thr Thr Arg Ile Leu Ile Gly Glu 305 310 315 320 Val Thr Ser Leu Gly Glu Glu Glu Pro Phe Ala His Glu Lys Leu Ser 325 330 335 Pro Val Leu Ala Met Tyr Glu Ala Asp Asn Phe Asp Asp Ala Leu Lys 340 345 350 Lys Ala Val Thr Leu Ile Asn Leu Gly Gly Leu Gly His Thr Ser Gly 355 360 365 Ile Tyr Ala Asp Glu Ile Lys Ala Arg Asp Lys Ile Asp Arg Phe Ser 370 375 380 Ser Ala Met Lys Thr Val Arg Thr Phe Val Asn Ile Pro Thr Ser Gln 385 390 395 400 Gly Ala Ser Gly Asp Leu Tyr Asn Phe Arg Ile Pro Pro Ser Phe Thr 405 410 415 Leu Gly Cys Gly Phe Trp Gly Gly Asn Ser Val Ser Glu Asn Val Gly 420 425 430 Pro Lys His Leu Leu Asn Ile Lys Thr Val Ala Glu Arg Arg Glu Asn 435 440 445 Met Leu Trp Phe Arg Val Pro His Lys Val Tyr Phe Lys Phe Gly Cys 450 455 460 Leu Gln Phe Ala Leu Lys Asp Leu Lys Asp Leu Lys Lys Lys Arg Ala 465 470 475 480 Phe Ile Val Thr Asp Ser Asp Pro Tyr Asn Leu Asn Tyr Val Asp Ser 485 490 495 Ile Ile Lys Ile Leu Glu His Leu Asp Ile Asp Phe Lys Val Phe Asn 500 505 510 Lys Val Gly Arg Glu Ala Asp Leu Lys Thr Ile Lys Lys Ala Thr Glu 515 520 525 Glu Met Ser Ser Phe Met Pro Asp Thr Ile Ile Ala Leu Gly Gly Thr 530 535 540 Pro Glu Met Ser Ser Ala Lys Leu Met Trp Val Leu Tyr Glu His Pro 545 550 555 560 Glu Val Lys Phe Glu Asp Leu Ala Ile Lys Phe Met Asp Ile Arg Lys 565 570 575 Arg Ile Tyr Thr Phe Pro Lys Leu Gly Lys Lys Ala Met Leu Val Ala 580 585 590 Ile Thr Thr Ser Ala Gly Ser Gly Ser Glu Val Thr Pro Phe Ala Leu 595 600 605 Val Thr Asp Asn Asn Thr Gly Asn Lys Tyr Met Leu Ala Asp Tyr Glu 610 615 620 Met Thr Pro Asn Met Ala Ile Val Asp Ala Glu Leu Met Met Lys Met 625 630 635 640 Pro Lys Gly Leu Thr Ala Tyr Ser Gly Ile Asp Ala Leu Val Asn Ser 645 650 655 Ile Glu Ala Tyr Thr Ser Val Tyr Ala Ser Glu Tyr Thr Asn Gly Leu 660 665 670 Ala Leu Glu Ala Ile Arg Leu Ile Phe Lys Tyr Leu Pro Glu Ala Tyr 675 680 685 Lys Asn Gly Arg Thr Asn Glu Lys Ala Arg Glu Lys Met Ala His Ala 690 695 700 Ser Thr Met Ala Gly Met Ala Ser Ala Asn Ala Phe Leu Gly Leu Cys 705 710 715 720 His Ser Met Ala Ile Lys Leu Ser Ser Glu His Asn Ile Pro Ser Gly 725 730 735 Ile Ala Asn Ala Leu Leu Ile Glu Glu Val Ile Lys Phe Asn Ala Val 740 745 750 Asp Asn Pro Val Lys Gln Ala Pro Cys Pro Gln Tyr Lys Tyr Pro Asn 755 760 765 Thr Ile Phe Arg Tyr Ala Arg Ile Ala Asp Tyr Ile Lys Leu Gly Gly 770 775 780 Asn Thr Asp Glu Glu Lys Val Asp Leu Leu Ile Asn Lys Ile His Glu 785 790 795 800 Leu Lys Lys Ala Leu Asn Ile Pro Thr Ser Ile Lys Asp Ala Gly Val 805 810 815 Leu Glu Glu Asn Phe Tyr Ser Ser Leu Asp Arg Ile Ser Glu Leu Ala 820 825 830 Leu Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Phe Pro Leu Thr Ser 835 840 845 Glu Ile Lys Glu Met Tyr Ile Asn Cys Phe Lys Lys Gln Pro 850 855 860 <210> SEQ ID NO 76 <211> LENGTH: 858 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AdhE2, NP_149199.1 <400> SEQUENCE: 76 Met Lys Val Thr Asn Gln Lys Glu Leu Lys Gln Lys Leu Asn Glu Leu 1 5 10 15 Arg Glu Ala Gln Lys Lys Phe Ala Thr Tyr Thr Gln Glu Gln Val Asp 20 25 30 Lys Ile Phe Lys Gln Cys Ala Ile Ala Ala Ala Lys Glu Arg Ile Asn 35 40 45 Leu Ala Lys Leu Ala Val Glu Glu Thr Gly Ile Gly Leu Val Glu Asp 50 55 60 Lys Ile Ile Lys Asn His Phe Ala Ala Glu Tyr Ile Tyr Asn Lys Tyr 65 70 75 80 Lys Asn Glu Lys Thr Cys Gly Ile Ile Asp His Asp Asp Ser Leu Gly 85 90 95 Ile Thr Lys Val Ala Glu Pro Ile Gly Ile Val Ala Ala Ile Val Pro 100 105 110 Thr Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu Ile Ser Leu 115 120 125 Lys Thr Arg Asn Ala Ile Phe Phe Ser Pro His Pro Arg Ala Lys Lys 130 135 140 Ser Thr Ile Ala Ala Ala Lys Leu Ile Leu Asp Ala Ala Val Lys Ala 145 150 155 160 Gly Ala Pro Lys Asn Ile Ile Gly Trp Ile Asp Glu Pro Ser Ile Glu 165 170 175 Leu Ser Gln Asp Leu Met Ser Glu Ala Asp Ile Ile Leu Ala Thr Gly 180 185 190 Gly Pro Ser Met Val Lys Ala Ala Tyr Ser Ser Gly Lys Pro Ala Ile 195 200 205 Gly Val Gly Ala Gly Asn Thr Pro Ala Ile Ile Asp Glu Ser Ala Asp 210 215 220 Ile Asp Met Ala Val Ser Ser Ile Ile Leu Ser Lys Thr Tyr Asp Asn 225 230 235 240 Gly Val Ile Cys Ala Ser Glu Gln Ser Ile Leu Val Met Asn Ser Ile 245 250 255 Tyr Glu Lys Val Lys Glu Glu Phe Val Lys Arg Gly Ser Tyr Ile Leu 260 265 270 Asn Gln Asn Glu Ile Ala Lys Ile Lys Glu Thr Met Phe Lys Asn Gly 275 280 285 Ala Ile Asn Ala Asp Ile Val Gly Lys Ser Ala Tyr Ile Ile Ala Lys 290 295 300 Met Ala Gly Ile Glu Val Pro Gln Thr Thr Lys Ile Leu Ile Gly Glu 305 310 315 320 Val Gln Ser Val Glu Lys Ser Glu Leu Phe Ser His Glu Lys Leu Ser 325 330 335 Pro Val Leu Ala Met Tyr Lys Val Lys Asp Phe Asp Glu Ala Leu Lys 340 345 350 Lys Ala Gln Arg Leu Ile Glu Leu Gly Gly Ser Gly His Thr Ser Ser 355 360 365 Leu Tyr Ile Asp Ser Gln Asn Asn Lys Asp Lys Val Lys Glu Phe Gly 370 375 380 Leu Ala Met Lys Thr Ser Arg Thr Phe Ile Asn Met Pro Ser Ser Gln 385 390 395 400 Gly Ala Ser Gly Asp Leu Tyr Asn Phe Ala Ile Ala Pro Ser Phe Thr 405 410 415 Leu Gly Cys Gly Thr Trp Gly Gly Asn Ser Val Ser Gln Asn Val Glu 420 425 430 Pro Lys His Leu Leu Asn Ile Lys Ser Val Ala Glu Arg Arg Glu Asn 435 440 445 Met Leu Trp Phe Lys Val Pro Gln Lys Ile Tyr Phe Lys Tyr Gly Cys 450 455 460 Leu Arg Phe Ala Leu Lys Glu Leu Lys Asp Met Asn Lys Lys Arg Ala 465 470 475 480 Phe Ile Val Thr Asp Lys Asp Leu Phe Lys Leu Gly Tyr Val Asn Lys 485 490 495 Ile Thr Lys Val Leu Asp Glu Ile Asp Ile Lys Tyr Ser Ile Phe Thr 500 505 510 Asp Ile Lys Ser Asp Pro Thr Ile Asp Ser Val Lys Lys Gly Ala Lys 515 520 525 Glu Met Leu Asn Phe Glu Pro Asp Thr Ile Ile Ser Ile Gly Gly Gly 530 535 540 Ser Pro Met Asp Ala Ala Lys Val Met His Leu Leu Tyr Glu Tyr Pro 545 550 555 560 Glu Ala Glu Ile Glu Asn Leu Ala Ile Asn Phe Met Asp Ile Arg Lys 565 570 575 Arg Ile Cys Asn Phe Pro Lys Leu Gly Thr Lys Ala Ile Ser Val Ala 580 585 590 Ile Pro Thr Thr Ala Gly Thr Gly Ser Glu Ala Thr Pro Phe Ala Val 595 600 605 Ile Thr Asn Asp Glu Thr Gly Met Lys Tyr Pro Leu Thr Ser Tyr Glu 610 615 620 Leu Thr Pro Asn Met Ala Ile Ile Asp Thr Glu Leu Met Leu Asn Met 625 630 635 640 Pro Arg Lys Leu Thr Ala Ala Thr Gly Ile Asp Ala Leu Val His Ala 645 650 655 Ile Glu Ala Tyr Val Ser Val Met Ala Thr Asp Tyr Thr Asp Glu Leu 660 665 670 Ala Leu Arg Ala Ile Lys Met Ile Phe Lys Tyr Leu Pro Arg Ala Tyr 675 680 685 Lys Asn Gly Thr Asn Asp Ile Glu Ala Arg Glu Lys Met Ala His Ala 690 695 700 Ser Asn Ile Ala Gly Met Ala Phe Ala Asn Ala Phe Leu Gly Val Cys 705 710 715 720 His Ser Met Ala His Lys Leu Gly Ala Met His His Val Pro His Gly 725 730 735 Ile Ala Cys Ala Val Leu Ile Glu Glu Val Ile Lys Tyr Asn Ala Thr 740 745 750 Asp Cys Pro Thr Lys Gln Thr Ala Phe Pro Gln Tyr Lys Ser Pro Asn 755 760 765 Ala Lys Arg Lys Tyr Ala Glu Ile Ala Glu Tyr Leu Asn Leu Lys Gly 770 775 780 Thr Ser Asp Thr Glu Lys Val Thr Ala Leu Ile Glu Ala Ile Ser Lys 785 790 795 800 Leu Lys Ile Asp Leu Ser Ile Pro Gln Asn Ile Ser Ala Ala Gly Ile 805 810 815 Asn Lys Lys Asp Phe Tyr Asn Thr Leu Asp Lys Met Ser Glu Leu Ala 820 825 830 Phe Asp Asp Gln Cys Thr Thr Ala Asn Pro Arg Tyr Pro Leu Ile Ser 835 840 845 Glu Leu Lys Asp Ile Tyr Ile Lys Ser Phe 850 855 <210> SEQ ID NO 77 <211> LENGTH: 860 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AdhE, WP_041893626.1 <400> SEQUENCE: 77 Met Arg Val Thr Asn Pro Glu Glu Leu Thr Lys Arg Ile Glu Gln Ile 1 5 10 15 Arg Glu Ala Gln Arg Glu Phe Ala Lys Phe Ser Gln Glu Glu Val Asp 20 25 30 Glu Ile Phe Arg Gln Ala Ala Met Ala Ala Asn Asp Ala Arg Ile Thr 35 40 45 Leu Ala Lys Met Ala Val Glu Glu Ser Gly Met Gly Ile Val Glu Asp 50 55 60 Lys Val Ile Lys Asn His Phe Ala Ala Glu Tyr Ile Tyr Asn Gln Tyr 65 70 75 80 Lys Asp Thr Lys Thr Cys Gly Val Ile Glu Arg Asp Glu Met Phe Gly 85 90 95 Ile Thr His Ile Ala Glu Pro Ile Gly Val Ile Ala Ala Ile Val Pro 100 105 110 Thr Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Thr Leu Ile Ala Leu 115 120 125 Lys Thr Arg Asn Gly Ile Ile Ile Ser Pro His Pro Arg Ala Lys Asn 130 135 140 Ser Thr Ile Ala Ala Ala Lys Ile Val Leu Glu Ala Ala Glu Arg Ala 145 150 155 160 Gly Ala Pro Lys Gly Ile Ile Gly Trp Ile Asp Glu Pro Ser Ile Glu 165 170 175 Leu Ser Arg Asn Val Met Ser Glu Ser Asp Ile Ile Leu Ala Thr Gly 180 185 190 Gly Pro Gly Met Val Arg Ala Ala Tyr Ser Ser Gly Lys Pro Ala Ile 195 200 205 Gly Val Gly Ala Gly Asn Thr Pro Ala Ile Ile Asp Asp Thr Ala His 210 215 220 Ile Lys Met Ala Val Asn Ser Ile Leu Leu Ser Lys Thr Phe Asp Asn 225 230 235 240 Gly Val Val Cys Ala Ser Glu Gln Ser Ile Ile Ala Met Glu Ser Val 245 250 255 Tyr Asp Glu Val Arg Lys Glu Leu Asp Glu Arg Gly Ala Tyr Ile Leu 260 265 270 Lys Gly Asp Glu Val Asp Lys Val Arg Ser Ile Ile Leu Asp Pro Lys 275 280 285 Gly Ser Leu Asn Ser Glu Ile Val Gly Gln Ser Ala Tyr Lys Ile Ala 290 295 300 Lys Met Ala Gly Val Glu Val Ser Glu Ala Val Lys Val Leu Ile Gly 305 310 315 320 Glu Val Glu Ser Pro Glu Leu Glu Glu Pro Phe Ser His Glu Lys Leu 325 330 335 Ser Pro Ile Leu Gly Met Tyr Lys Ala Lys Thr Phe Asp Asp Ala Leu 340 345 350 Arg Leu Ala Ser Arg Met Ile Glu Leu Gly Gly Phe Gly His Thr Ser 355 360 365 Ile Leu Tyr Thr Asn Gln Val Glu Ser Val Asp Arg Ile Glu Lys Phe 370 375 380 Gly Val Ala Met Lys Thr Ala Arg Thr Leu Ile Asn Met Pro Ala Ser 385 390 395 400 Gln Gly Ala Ile Gly Asp Ile Tyr Asn Phe Lys Leu Ala Pro Ser Leu 405 410 415 Thr Leu Gly Cys Gly Ser Trp Gly Gly Asn Ser Ile Ser Glu Asn Val 420 425 430 Gly Pro Lys His Leu Ile Asn Val Lys Arg Ile Ala Glu Arg Arg Glu 435 440 445 Asn Met Leu Trp Phe Arg Val Pro Asp Lys Ile Tyr Phe Lys Phe Gly 450 455 460 Cys Leu Pro Ile Ala Leu Glu Glu Leu Asn Ala Met Lys Lys Lys Arg 465 470 475 480 Ala Phe Ile Val Thr Asp Arg Val Leu Phe Asp Leu Gly Tyr Thr His 485 490 495 Lys Ile Thr Asp Ile Leu Ser Glu Asn His Ile Glu Tyr Lys Ile Phe 500 505 510 Ser Asp Val Glu Pro Asp Pro Thr Leu Lys Ala Ala Lys Leu Gly Ala 515 520 525 Asp Ala Met Arg Asp Phe Asn Pro Asp Val Ile Ile Ala Ile Gly Gly 530 535 540 Gly Ser Pro Met Asp Ala Ala Lys Ile Met Trp Val Met Tyr Glu His 545 550 555 560 Pro Asp Val Arg Phe Glu Asp Leu Ala Met Arg Phe Met Asp Ile Arg 565 570 575 Lys Arg Val Tyr Glu Phe Pro Pro Met Gly Glu Arg Ala Ile Leu Val 580 585 590 Ala Ile Pro Thr Ser Ala Gly Thr Gly Ser Glu Val Thr Pro Phe Ala 595 600 605 Val Ile Thr Asp Gln Gln Thr Gly Val Lys Tyr Pro Leu Ala Asp Tyr 610 615 620 Ala Leu Thr Pro Asn Met Ala Ile Ile Asp Ala Glu Leu Met Met Ser 625 630 635 640 Met Pro Lys Gly Leu Thr Ala Ala Ser Gly Ile Asp Ala Leu Val His 645 650 655 Ala Ile Glu Ala Tyr Val Ser Val Leu Ala Ser Glu Tyr Thr Asn Gly 660 665 670 Leu Ala Leu Glu Ala Ile Arg Leu Thr Phe Lys Tyr Leu Pro Asp Ala 675 680 685 Tyr Asn Gly Gly Thr Thr Asn Ile Lys Ala Arg Glu Lys Met Ala His 690 695 700 Ala Ser Ser Val Ala Gly Met Ala Phe Ala Asn Ala Phe Leu Gly Ile 705 710 715 720 Cys His Ser Met Ala His Lys Leu Gly Ala Phe His His Val Pro His 725 730 735 Gly Ile Ala Asn Ala Leu Leu Ile Asp Glu Val Ile Arg Phe Asn Ala 740 745 750 Thr Asp Ala Pro Arg Lys Gln Ala Ala Phe Pro Gln Tyr Lys Tyr Pro 755 760 765 Asn Ala Gly Trp Arg Tyr Ala Arg Ile Ala Asp Tyr Leu Asn Leu Gly 770 775 780 Gly Asn Thr Glu Glu Glu Lys Val Glu Leu Leu Ile Lys Ala Ile Asp 785 790 795 800 Asp Leu Lys Val Lys Val Arg Ile Pro Lys Ser Ile Lys Glu Phe Gly 805 810 815 Val Ser Glu Glu Lys Phe Tyr Asp Ser Met Asp Glu Met Val Glu Gln 820 825 830 Ala Phe Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr Pro Leu Met 835 840 845 Ser Glu Ile Lys Glu Met Tyr Ile Lys Ser Tyr Asn 850 855 860 <210> SEQ ID NO 78 <211> LENGTH: 870 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AdhE1, WP_023163372.1 <400> SEQUENCE: 78 Met Lys Val Thr Asn Val Glu Glu Leu Met Lys Arg Leu Glu Glu Ile 1 5 10 15 Lys Asp Ala Gln Lys Lys Phe Ala Thr Tyr Thr Gln Glu Gln Val Asp 20 25 30 Glu Ile Phe Arg Gln Ala Ala Met Ala Ala Asn Ser Ala Arg Ile Glu 35 40 45 Leu Ala Lys Met Ala Val Glu Glu Ser Gly Met Gly Ile Val Glu Asp 50 55 60 Lys Val Ile Lys Asn His Phe Ala Ser Glu Tyr Ile Tyr Asn Lys Tyr 65 70 75 80 Lys Asp Glu Lys Thr Cys Gly Val Leu Glu Arg Asp Ala Gly Phe Gly 85 90 95 Ile Val Arg Ile Ala Glu Pro Val Gly Val Ile Ala Ala Val Val Pro 100 105 110 Thr Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu Ile Ala Leu 115 120 125 Lys Thr Arg Asn Gly Ile Ile Phe Ser Pro His Pro Arg Ala Lys Lys 130 135 140 Ser Thr Ile Ala Ala Ala Lys Ile Val Leu Asp Ala Ala Val Lys Ala 145 150 155 160 Gly Ala Pro Glu Gly Ile Ile Gly Trp Ile Asp Glu Pro Ser Ile Glu 165 170 175 Leu Ser Gln Val Val Met Gly Glu Ala Asn Leu Ile Leu Ala Thr Gly 180 185 190 Gly Pro Gly Met Val Lys Ala Ala Tyr Ser Ser Gly Lys Pro Ala Val 195 200 205 Gly Val Gly Pro Gly Asn Thr Pro Ala Val Ile Asp Glu Ser Ala Asp 210 215 220 Ile Lys Met Ala Val Asn Ser Ile Leu Leu Ser Lys Thr Phe Asp Asn 225 230 235 240 Gly Met Ile Cys Ala Ser Glu Gln Ser Val Ile Val Leu Asp Ser Ile 245 250 255 Tyr Glu Glu Val Lys Lys Glu Phe Ala Tyr Arg Gly Ala Tyr Ile Leu 260 265 270 Ser Lys Asp Glu Thr Asp Lys Val Gly Lys Ile Ile Leu Lys Asn Gly 275 280 285 Ala Leu Asn Ala Gly Ile Val Gly Gln Pro Ala Phe Lys Ile Ala Gln 290 295 300 Leu Ala Gly Val Asp Val Pro Glu Lys Ala Lys Val Leu Ile Gly Glu 305 310 315 320 Val Glu Ser Val Glu Leu Glu Glu Pro Phe Ser His Glu Lys Leu Ser 325 330 335 Pro Val Leu Ala Met Tyr Arg Ala Arg Asn Phe Glu Asp Ala Ile Ala 340 345 350 Lys Thr Asp Lys Leu Val Arg Ala Gly Gly Phe Gly His Thr Ser Ser 355 360 365 Leu Tyr Ile Asn Pro Met Thr Glu Lys Ala Lys Val Glu Lys Phe Ser 370 375 380 Thr Met Met Lys Thr Ser Arg Thr Ile Ile Asn Thr Pro Ser Ser Gln 385 390 395 400 Gly Gly Ile Gly Asp Ile Tyr Asn Phe Lys Leu Ala Pro Ser Leu Thr 405 410 415 Leu Gly Cys Gly Ser Trp Gly Gly Asn Ser Val Ser Glu Asn Val Gly 420 425 430 Pro Lys His Leu Leu Asn Ile Lys Ser Val Ala Glu Arg Arg Glu Asn 435 440 445 Met Leu Trp Phe Arg Val Pro Glu Lys Val Tyr Phe Lys Tyr Gly Ser 450 455 460 Leu Gly Val Ala Leu Lys Glu Leu Lys Val Met Asn Lys Lys Lys Val 465 470 475 480 Phe Ile Val Thr Asp Lys Val Leu Tyr Gln Leu Gly Tyr Val Asp Lys 485 490 495 Val Thr Lys Val Leu Glu Glu Leu Lys Ile Ser Tyr Lys Val Phe Thr 500 505 510 Asp Val Glu Pro Asp Pro Thr Leu Ala Thr Ala Lys Lys Gly Ala Ala 515 520 525 Glu Leu Leu Ser Tyr Glu Pro Asp Thr Ile Ile Ser Val Gly Gly Gly 530 535 540 Ser Ala Met Asp Ala Ala Lys Ile Met Trp Val Met Tyr Glu His Pro 545 550 555 560 Glu Val Lys Phe Glu Asp Leu Ala Met Arg Phe Met Asp Ile Arg Lys 565 570 575 Arg Val Tyr Val Phe Pro Lys Met Gly Glu Lys Ala Met Met Ile Ser 580 585 590 Val Ala Thr Ser Ala Gly Thr Gly Ser Glu Val Thr Pro Phe Ala Val 595 600 605 Ile Thr Asp Glu Lys Thr Gly Ala Lys Tyr Pro Leu Ala Asp Tyr Glu 610 615 620 Leu Thr Pro Asp Met Ala Ile Val Asp Ala Glu Leu Met Met Gly Met 625 630 635 640 Pro Arg Gly Leu Thr Ala Ala Ser Gly Ile Asp Ala Leu Thr His Ala 645 650 655 Leu Glu Ala Tyr Val Ser Ile Met Ala Thr Glu Phe Thr Asn Gly Leu 660 665 670 Ala Leu Glu Ala Val Lys Leu Ile Phe Glu Tyr Leu Pro Lys Ala Tyr 675 680 685 Thr Glu Gly Thr Thr Asn Val Lys Ala Arg Glu Lys Met Ala His Ala 690 695 700 Ser Cys Ile Ala Gly Met Ala Phe Ala Asn Ala Phe Leu Gly Val Cys 705 710 715 720 His Ser Met Ala His Lys Leu Gly Ala Gln His His Ile Pro His Gly 725 730 735 Ile Ala Asn Ala Leu Met Ile Asp Glu Val Ile Lys Phe Asn Ala Val 740 745 750 Asp Asp Pro Ile Lys Gln Ala Ala Phe Pro Gln Tyr Glu Tyr Pro Asn 755 760 765 Ala Arg Tyr Arg Tyr Ala Gln Ile Ala Asp Cys Leu Asn Leu Gly Gly 770 775 780 Asn Thr Glu Glu Glu Lys Val Gln Leu Leu Ile Asn Ala Ile Asp Asp 785 790 795 800 Leu Lys Ala Lys Leu Asn Ile Pro Glu Thr Ile Lys Glu Ala Gly Val 805 810 815 Ser Glu Asp Lys Phe Tyr Ala Thr Leu Asp Lys Met Ser Glu Leu Ala 820 825 830 Phe Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr Pro Leu Ile Ser 835 840 845 Glu Ile Lys Gln Met Tyr Ile Asn Val Phe Asp Lys Thr Glu Pro Ile 850 855 860 Val Glu Asp Glu Glu Lys 865 870 <210> SEQ ID NO 79 <211> LENGTH: 877 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AdhE2, WP_023163373.1 <400> SEQUENCE: 79 Met Lys Val Thr Lys Val Thr Asn Val Glu Glu Leu Met Lys Lys Leu 1 5 10 15 Asp Glu Val Thr Ala Ala Gln Lys Lys Phe Ser Ser Tyr Thr Gln Glu 20 25 30 Gln Val Asp Glu Ile Phe Arg Gln Ala Ala Met Ala Ala Asn Ser Ala 35 40 45 Arg Ile Asp Leu Ala Lys Met Ala Val Glu Glu Ser Gly Met Gly Ile 50 55 60 Val Glu Asp Lys Val Ile Lys Asn His Phe Val Ala Glu Tyr Ile Tyr 65 70 75 80 Asn Lys Tyr Lys Gly Glu Lys Thr Cys Gly Val Leu Glu Gln Asp Glu 85 90 95 Gly Phe Gly Met Val Arg Ile Ala Glu Pro Val Gly Val Ile Ala Ala 100 105 110 Val Val Pro Thr Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu 115 120 125 Ile Ala Leu Lys Thr Arg Asn Gly Ile Val Phe Ser Pro His Pro Arg 130 135 140 Ala Lys Lys Ser Thr Ile Ala Ala Ala Lys Ile Val Leu Asp Ala Ala 145 150 155 160 Val Lys Ala Gly Ala Pro Glu Gly Ile Ile Gly Trp Ile Asp Glu Pro 165 170 175 Ser Ile Glu Leu Ser Gln Val Val Met Lys Glu Ala Asp Leu Ile Leu 180 185 190 Ala Thr Gly Gly Pro Gly Met Val Lys Ala Ala Tyr Ser Ser Gly Lys 195 200 205 Pro Ala Ile Gly Val Gly Pro Gly Asn Thr Pro Ala Val Ile Asp Glu 210 215 220 Ser Ala Asp Ile Lys Met Ala Val Asn Ser Ile Leu Leu Ser Lys Thr 225 230 235 240 Phe Asp Asn Gly Met Ile Cys Ala Ser Glu Gln Ser Val Ile Val Ala 245 250 255 Ser Ser Ile Tyr Asp Glu Val Lys Lys Glu Phe Ala Asp Arg Gly Ala 260 265 270 Tyr Ile Leu Ser Lys Asp Glu Thr Asp Lys Val Gly Lys Thr Ile Met 275 280 285 Ile Asn Gly Ala Leu Asn Ala Gly Ile Val Gly Gln Ser Ala Phe Lys 290 295 300 Ile Ala Gln Met Ala Gly Val Ser Val Pro Glu Asp Ala Lys Ile Leu 305 310 315 320 Ile Gly Glu Val Lys Ser Val Glu Pro Glu Glu Glu Pro Phe Ala His 325 330 335 Glu Lys Leu Ser Pro Val Leu Ala Met Tyr Lys Ala Lys Asp Phe Asp 340 345 350 Glu Ala Leu Leu Lys Ala Gly Arg Leu Val Glu Arg Gly Gly Ile Gly 355 360 365 His Thr Ser Val Leu Tyr Val Asn Ser Met Thr Glu Lys Val Lys Val 370 375 380 Glu Lys Phe Arg Glu Thr Met Lys Thr Gly Arg Thr Leu Ile Asn Met 385 390 395 400 Pro Ser Ala Gln Gly Ala Ile Gly Asp Ile Tyr Asn Phe Lys Leu Ala 405 410 415 Pro Ser Leu Thr Leu Gly Cys Gly Ser Trp Gly Gly Asn Ser Val Ser 420 425 430 Glu Asn Val Gly Pro Lys His Leu Leu Asn Ile Lys Ser Val Ala Glu 435 440 445 Arg Arg Glu Asn Met Leu Trp Phe Arg Val Pro Glu Lys Val Tyr Phe 450 455 460 Lys Tyr Gly Ser Leu Gly Val Ala Leu Lys Glu Leu Arg Ile Met Glu 465 470 475 480 Lys Lys Lys Ala Phe Ile Val Thr Asp Lys Val Leu Tyr Gln Leu Gly 485 490 495 Tyr Val Asp Lys Ile Thr Lys Asn Leu Asp Glu Leu Arg Val Ser Tyr 500 505 510 Lys Ile Phe Thr Asp Val Glu Pro Asp Pro Thr Leu Ala Thr Ala Lys 515 520 525 Lys Gly Ala Ala Glu Leu Leu Ser Tyr Glu Pro Asp Thr Ile Ile Ala 530 535 540 Val Gly Gly Gly Ser Ala Met Asp Ala Ala Lys Ile Met Trp Val Met 545 550 555 560 Tyr Glu His Pro Glu Val Arg Phe Glu Asp Leu Ala Met Arg Phe Met 565 570 575 Asp Ile Arg Lys Arg Val Tyr Val Phe Pro Lys Met Gly Glu Lys Ala 580 585 590 Met Met Ile Ser Val Ala Thr Ser Ala Gly Thr Gly Ser Glu Val Thr 595 600 605 Pro Phe Ala Val Ile Thr Asp Glu Arg Thr Gly Ala Lys Tyr Pro Leu 610 615 620 Ala Asp Tyr Glu Leu Thr Pro Asn Met Ala Ile Val Asp Ala Glu Leu 625 630 635 640 Met Met Gly Met Pro Lys Gly Leu Thr Ala Ala Ser Gly Ile Asp Ala 645 650 655 Leu Thr His Ala Leu Glu Ala Tyr Val Ser Ile Met Ala Ser Glu Tyr 660 665 670 Thr Asn Gly Leu Ala Leu Glu Ala Thr Arg Leu Val Phe Lys Tyr Leu 675 680 685 Pro Ile Ala Tyr Thr Glu Gly Thr Ile Asn Val Lys Ala Arg Glu Lys 690 695 700 Met Ala His Ala Ser Cys Ile Ala Gly Met Ala Phe Ala Asn Ala Phe 705 710 715 720 Leu Gly Val Cys His Ser Met Ala His Lys Leu Gly Ala Gln His His 725 730 735 Ile Pro His Gly Ile Ala Asn Ala Leu Met Ile Asp Glu Val Ile Lys 740 745 750 Phe Asn Ala Val Glu Ala Pro Arg Lys Gln Ala Ala Phe Pro Gln Tyr 755 760 765 Lys Tyr Pro Asn Val Lys Arg Arg Tyr Ala Arg Ile Ala Asp Tyr Leu 770 775 780 Asn Leu Gly Gly Ser Thr Asp Asp Glu Lys Val Gln Leu Leu Ile Asn 785 790 795 800 Ala Ile Asp Asp Leu Lys Thr Lys Leu Asn Ile Pro Lys Thr Ile Lys 805 810 815 Glu Ala Gly Val Ser Glu Asp Lys Phe Tyr Ala Thr Leu Asp Thr Met 820 825 830 Ser Glu Leu Ala Phe Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr 835 840 845 Pro Leu Ile Gly Glu Ile Lys Gln Met Tyr Ile Asn Ala Phe Asp Thr 850 855 860 Pro Lys Ala Thr Val Glu Lys Lys Thr Arg Lys Lys Lys 865 870 875 <210> SEQ ID NO 80 <211> LENGTH: 468 <212> TYPE: PRT <213> ORGANISM: Clostridium saccharoperbutylacetonicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bld, AAP42563.1 <400> SEQUENCE: 80 Met Ile Lys Asp Thr Leu Val Ser Ile Thr Lys Asp Leu Lys Leu Lys 1 5 10 15 Thr Asn Val Glu Asn Ala Asn Leu Lys Asn Tyr Lys Asp Asp Ser Ser 20 25 30 Cys Phe Gly Val Phe Glu Asn Val Glu Asn Ala Ile Ser Asn Ala Val 35 40 45 His Ala Gln Lys Ile Leu Ser Leu His Tyr Thr Lys Glu Gln Arg Glu 50 55 60 Lys Ile Ile Thr Glu Ile Arg Lys Ala Ala Leu Glu Asn Lys Glu Ile 65 70 75 80 Leu Ala Thr Met Ile Leu Glu Glu Thr His Met Gly Arg Tyr Glu Asp 85 90 95 Lys Ile Leu Lys His Glu Leu Val Ala Lys Tyr Thr Pro Gly Thr Glu 100 105 110 Asp Leu Thr Thr Thr Ala Trp Ser Gly Asp Asn Gly Leu Thr Val Val 115 120 125 Glu Met Ser Pro Tyr Gly Val Ile Gly Ala Ile Thr Pro Ser Thr Asn 130 135 140 Pro Thr Glu Thr Val Ile Cys Asn Ser Ile Gly Met Ile Ala Ala Gly 145 150 155 160 Asn Thr Val Val Phe Asn Gly His Pro Gly Ala Lys Lys Cys Val Ala 165 170 175 Phe Ala Val Glu Met Ile Asn Lys Ala Ile Ile Ser Cys Gly Gly Pro 180 185 190 Glu Asn Leu Val Thr Thr Ile Lys Asn Pro Thr Met Asp Ser Leu Asp 195 200 205 Ala Ile Ile Lys His Pro Ser Ile Lys Leu Leu Cys Gly Thr Gly Gly 210 215 220 Pro Gly Met Val Lys Thr Leu Leu Asn Ser Gly Lys Lys Ala Ile Gly 225 230 235 240 Ala Gly Ala Gly Asn Pro Pro Val Ile Val Asp Asp Thr Ala Asp Ile 245 250 255 Glu Lys Ala Gly Lys Ser Ile Ile Glu Gly Cys Ser Phe Asp Asn Asn 260 265 270 Leu Pro Cys Ile Ala Glu Lys Glu Val Phe Val Phe Glu Asn Val Ala 275 280 285 Asp Asp Leu Ile Ser Asn Met Leu Lys Asn Asn Ala Val Ile Ile Asn 290 295 300 Glu Asp Gln Val Ser Lys Leu Ile Asp Leu Val Leu Gln Lys Asn Asn 305 310 315 320 Glu Thr Gln Glu Tyr Ser Ile Asn Lys Lys Trp Val Gly Lys Asp Ala 325 330 335 Lys Leu Phe Leu Asp Glu Ile Asp Val Glu Ser Pro Ser Ser Val Lys 340 345 350 Cys Ile Ile Cys Glu Val Ser Ala Arg His Pro Phe Val Met Thr Glu 355 360 365 Leu Met Met Pro Ile Leu Pro Ile Val Arg Val Lys Asp Ile Asp Glu 370 375 380 Ala Ile Glu Tyr Ala Lys Ile Ala Glu Gln Asn Arg Lys His Ser Ala 385 390 395 400 Tyr Ile Tyr Ser Lys Asn Ile Asp Asn Leu Asn Arg Phe Glu Arg Glu 405 410 415 Ile Asp Thr Thr Ile Phe Val Lys Asn Ala Lys Ser Phe Ala Gly Val 420 425 430 Gly Tyr Glu Ala Glu Gly Phe Thr Thr Phe Thr Ile Ala Gly Ser Thr 435 440 445 Gly Glu Gly Ile Thr Ser Ala Arg Asn Phe Thr Arg Gln Arg Arg Cys 450 455 460 Val Leu Ala Gly 465 <210> SEQ ID NO 81 <211> LENGTH: 562 <212> TYPE: PRT <213> ORGANISM: Aquincola tertiaricarbonis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: HcmAB, large subunit, AFK77668.1 <400> SEQUENCE: 81 Met Thr Trp Leu Glu Pro Gln Ile Lys Ser Gln Leu Gln Ser Glu Arg 1 5 10 15 Lys Asp Trp Glu Ala Asn Glu Val Gly Ala Phe Leu Lys Lys Ala Pro 20 25 30 Glu Arg Lys Glu Gln Phe His Thr Ile Gly Asp Phe Pro Val Gln Arg 35 40 45 Thr Tyr Thr Ala Ala Asp Ile Ala Asp Thr Pro Leu Glu Asp Ile Gly 50 55 60 Leu Pro Gly Arg Tyr Pro Phe Thr Arg Gly Pro Tyr Pro Thr Met Tyr 65 70 75 80 Arg Ser Arg Thr Trp Thr Met Arg Gln Ile Ala Gly Phe Gly Thr Gly 85 90 95 Glu Asp Thr Asn Lys Arg Phe Lys Tyr Leu Ile Ala Gln Gly Gln Thr 100 105 110 Gly Ile Ser Thr Asp Phe Asp Met Pro Thr Leu Met Gly Tyr Asp Ser 115 120 125 Asp His Pro Met Ser Asp Gly Glu Val Gly Arg Glu Gly Val Ala Ile 130 135 140 Asp Thr Leu Ala Asp Met Glu Ala Leu Leu Ala Asp Ile Asp Leu Glu 145 150 155 160 Lys Ile Ser Val Ser Phe Thr Ile Asn Pro Ser Ala Trp Ile Leu Leu 165 170 175 Ala Met Tyr Val Ala Leu Gly Glu Lys Arg Gly Tyr Asp Leu Asn Lys 180 185 190 Leu Ser Gly Thr Val Gln Ala Asp Ile Leu Lys Glu Tyr Met Ala Gln 195 200 205 Lys Glu Tyr Ile Tyr Pro Ile Ala Pro Ser Val Arg Ile Val Arg Asp 210 215 220 Ile Ile Thr Tyr Ser Ala Lys Asn Leu Lys Arg Tyr Asn Pro Ile Asn 225 230 235 240 Ile Ser Gly Tyr His Ile Ser Glu Ala Gly Ser Ser Pro Leu Gln Glu 245 250 255 Ala Ala Phe Thr Leu Ala Asn Leu Ile Thr Tyr Val Asn Glu Val Thr 260 265 270 Lys Thr Gly Met His Val Asp Glu Phe Ala Pro Arg Leu Ala Phe Phe 275 280 285 Phe Val Ser Gln Gly Asp Phe Phe Glu Glu Val Ala Lys Phe Arg Ala 290 295 300 Leu Arg Arg Cys Tyr Ala Lys Ile Met Lys Glu Arg Phe Gly Ala Arg 305 310 315 320 Asn Pro Glu Ser Met Arg Leu Arg Phe His Cys Gln Thr Ala Ala Ala 325 330 335 Thr Leu Thr Lys Pro Gln Tyr Met Val Asn Val Val Arg Thr Ser Leu 340 345 350 Gln Ala Leu Ser Ala Val Leu Gly Gly Ala Gln Ser Leu His Thr Asn 355 360 365 Gly Tyr Asp Glu Ala Phe Ala Ile Pro Thr Glu Asp Ala Met Lys Met 370 375 380 Ala Leu Arg Thr Gln Gln Ile Ile Ala Glu Glu Ser Gly Val Ala Asp 385 390 395 400 Val Ile Asp Pro Leu Gly Gly Ser Tyr Tyr Val Glu Ala Leu Thr Thr 405 410 415 Glu Tyr Glu Lys Lys Ile Phe Glu Ile Leu Glu Glu Val Glu Lys Arg 420 425 430 Gly Gly Thr Ile Lys Leu Ile Glu Gln Gly Trp Phe Gln Lys Gln Ile 435 440 445 Ala Asp Phe Ala Tyr Glu Thr Ala Leu Arg Lys Gln Ser Gly Gln Lys 450 455 460 Pro Val Ile Gly Val Asn Arg Phe Val Glu Asn Glu Glu Asp Val Lys 465 470 475 480 Ile Glu Ile His Pro Tyr Asp Asn Thr Thr Ala Glu Arg Gln Ile Ser 485 490 495 Arg Thr Arg Arg Val Arg Ala Glu Arg Asp Glu Ala Lys Val Gln Ala 500 505 510 Met Leu Asp Gln Leu Val Ala Val Ala Lys Asp Glu Ser Gln Asn Leu 515 520 525 Met Pro Leu Thr Ile Glu Leu Val Lys Ala Gly Ala Thr Met Gly Asp 530 535 540 Ile Val Glu Lys Leu Lys Gly Ile Trp Gly Thr Tyr Arg Glu Thr Pro 545 550 555 560 Val Phe <210> SEQ ID NO 82 <211> LENGTH: 136 <212> TYPE: PRT <213> ORGANISM: Aquincola tertiaricarbonis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: HcmAB, small subunit, AFK77665.1 <400> SEQUENCE: 82 Met Asp Gln Thr Pro Ile Arg Val Leu Leu Ala Lys Val Gly Leu Asp 1 5 10 15 Gly His Asp Arg Gly Val Lys Val Val Ala Arg Ala Leu Arg Asp Ala 20 25 30 Gly Met Asp Val Ile Tyr Ser Gly Leu His Arg Thr Pro Glu Glu Val 35 40 45 Val Asn Thr Ala Ile Gln Glu Asp Val Asp Val Leu Gly Val Ser Leu 50 55 60 Leu Ser Gly Val Gln Leu Thr Val Phe Pro Lys Ile Phe Lys Leu Leu 65 70 75 80 Asp Glu Arg Gly Ala Gly Asp Leu Ile Val Ile Ala Gly Gly Val Met 85 90 95 Pro Asp Glu Asp Ala Ala Ala Ile Arg Lys Leu Gly Val Arg Glu Val 100 105 110 Leu Leu Gln Asp Thr Pro Pro Gln Ala Ile Ile Asp Ser Ile Arg Ser 115 120 125 Leu Val Ala Ala Arg Gly Ala Arg 130 135 <210> SEQ ID NO 83 <211> LENGTH: 563 <212> TYPE: PRT <213> ORGANISM: Kyrpidia tusciae <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: HcmAB, large subunit, WP_013074530.1 <400> SEQUENCE: 83 Met Ala Asp Gln Glu Lys Leu Phe Asn Gly Asp Glu Ile Arg Arg Ile 1 5 10 15 Arg Gln Glu Lys Glu Arg Trp Tyr Arg Glu Thr Val Lys Gly Asn Asp 20 25 30 Gly Gly Asn Asp Tyr Val Thr Asp Ser Gly Ile Pro Val Asn Leu Ile 35 40 45 Tyr Gly Pro Asp Asp Ile Ala Asp Phe Asp Tyr Leu Lys Glu Ser Gly 50 55 60 Phe Ser Gly Glu Pro Pro Tyr Val Arg Gly Val Tyr Pro Asn Met Tyr 65 70 75 80 Arg Gly Arg Leu Phe Thr Ile Arg Gln Ile Ala Gly Phe Gly Thr Pro 85 90 95 Glu Asp Thr Asn Arg Arg Phe Lys Phe Leu Leu Glu Asn Gly Ala Thr 100 105 110 Gly Thr Ser Val Val Leu Asp Leu Pro Thr Ile Arg Gly Tyr Asp Ser 115 120 125 Asp Asp Pro Lys Ala Glu Gly His Val Gly Ala Ala Gly Val Ala Ile 130 135 140 Asp Ser Leu Glu Asp Met Glu Ala Leu Tyr Asp Gly Ile Pro Ile Asp 145 150 155 160 Gln Val Ser Ser Asn Ile Val Thr His Leu Pro Ser Thr Thr Val Val 165 170 175 Leu Met Ala Met Phe Val Ala Met Ala Glu Lys Arg Gly Leu Pro Leu 180 185 190 Glu Lys Leu Ser Gly Thr Asn Gln Asn Asp Phe Leu Met Glu Thr Thr 195 200 205 Ile Gly Ser Ser Leu Glu Ile Leu Pro Pro Lys Ala Ser Phe Arg Leu 210 215 220 Gln Cys Asp Ser Ile Glu Tyr Ala Ser Lys Arg Leu Pro Arg Trp Asn 225 230 235 240 Pro Val Ser Tyr Asn Gly Tyr Asn Leu Arg Glu Ala Gly Thr Thr Ala 245 250 255 Val Gln Glu Val Gly Cys Ala Ile Ala Asn Ala Ile Ala Thr Thr Glu 260 265 270 Glu Leu Ile Arg Arg Gly Asn Asp Val Asp Asp Phe Ala Lys Arg Leu 275 280 285 Ser Phe Phe Trp Asn Leu Phe Asn Asp Phe Phe Glu Glu Ile Ala Lys 290 295 300 Cys Arg Ala Ser Arg Leu Val Trp Tyr Asp Val Met Lys Asn Arg Phe 305 310 315 320 Gly Ala Lys Asn Pro Arg Ser Tyr Leu Met Arg Phe His Val Gln Thr 325 330 335 Gly Gly Ile Thr Leu Thr Lys Val Glu Pro Leu Asn Asn Ile Ala Arg 340 345 350 Ser Ala Ile Gln Gly Leu Ala Ala Val Leu Gly Gly Ala Gln Ser Leu 355 360 365 His Ile Asp Ser Tyr Asp Glu Ala Tyr Ser Ala Pro Thr Glu Gln Ala 370 375 380 Ala Leu Val Ser Leu Arg Thr Gln Gln Ile Ile Gln Val Glu Thr Gly 385 390 395 400 Val Val Asn Thr Val Asp Pro Leu Ala Gly Ser Tyr Tyr Val Glu Tyr 405 410 415 Leu Thr Arg Glu Met Ala Glu His Ile Arg Ala Tyr Ile Asp Gln Ile 420 425 430 Glu Ser Arg Gly Gly Ile Ile Ala Val Val Glu Ser Gly Trp Leu His 435 440 445 Arg Glu Ile Ala Glu Phe Ala Tyr Arg Thr Gln Gln Asp Ile Glu Thr 450 455 460 Gly Lys Arg Lys Val Val Gly Leu Asn Tyr Phe Pro Ser Lys Glu Ala 465 470 475 480 Glu Thr Lys Val Glu Val Phe Arg Tyr Pro Glu Asp Ala Glu Arg Met 485 490 495 Gln Lys Glu Lys Leu Ala Lys Leu Arg Ala Arg Arg Asp Pro Val Lys 500 505 510 Val Glu Gln Thr Leu Arg Val Leu Arg Glu Lys Cys His Glu Asp Val 515 520 525 Asn Ile Leu Pro Tyr Val Lys Asp Ala Val Glu Ala Tyr Cys Thr Leu 530 535 540 Gly Glu Ile Gln Asn Val Phe Arg Glu Glu Phe Gly Leu Trp Gln Phe 545 550 555 560 Pro Leu Val <210> SEQ ID NO 84 <211> LENGTH: 132 <212> TYPE: PRT <213> ORGANISM: Kyrpidia tusciae <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: HcmAB, small subunit, WP_013074531.1 <400> SEQUENCE: 84 Met Glu Lys Lys Ile Lys Val Ile Met Val Lys Leu Gly Leu Asp Ile 1 5 10 15 His Trp Arg Gly Ala Leu Val Val Ser Lys Met Leu Arg Asp Arg Gly 20 25 30 Met Glu Val Val Tyr Leu Gly Asn Leu Phe Pro Glu Gln Ile Val Gln 35 40 45 Ala Ala Val Gln Glu Gly Ala Asp Val Val Gly Leu Ser Thr Leu Gly 50 55 60 Gly Asn His Leu Thr Leu Gly Pro Lys Val Val Glu Leu Leu Arg Ala 65 70 75 80 Lys Gly Met Glu Glu Val Leu Val Ile Met Gly Gly Val Ile Pro Glu 85 90 95 Glu Asp Val Pro Ala Leu Lys Glu Ala Gly Ile Ala Glu Val Phe Gly 100 105 110 Pro Glu Thr Pro Ile Asp Ala Ile Glu Ser Phe Ile Arg Ser Arg Phe 115 120 125 Pro Asp Arg Asp 130 <210> SEQ ID NO 85 <211> LENGTH: 327 <212> TYPE: PRT <213> ORGANISM: Aquincola tertiaricarbonis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: MeaB, AFK77667.1 <400> SEQUENCE: 85 Met Thr Tyr Val Pro Ser Ser Ala Leu Leu Glu Gln Leu Arg Ala Gly 1 5 10 15 Asn Thr Trp Ala Leu Gly Arg Leu Ile Ser Arg Ala Glu Ala Gly Val 20 25 30 Ala Glu Ala Arg Pro Ala Leu Ala Glu Val Tyr Arg His Ala Gly Ser 35 40 45 Ala His Val Ile Gly Leu Thr Gly Val Pro Gly Ser Gly Lys Ser Thr 50 55 60 Leu Val Ala Lys Leu Thr Ala Ala Leu Arg Lys Arg Gly Glu Lys Val 65 70 75 80 Gly Ile Val Ala Ile Asp Pro Ser Ser Pro Tyr Ser Gly Gly Ala Ile 85 90 95 Leu Gly Asp Arg Ile Arg Met Thr Glu Leu Ala Asn Asp Ser Gly Val 100 105 110 Phe Ile Arg Ser Met Ala Thr Arg Gly Ala Thr Gly Gly Met Ala Arg 115 120 125 Ala Ala Leu Asp Ala Val Asp Leu Leu Asp Val Ala Gly Tyr His Thr 130 135 140 Ile Ile Leu Glu Thr Val Gly Val Gly Gln Asp Glu Val Glu Val Ala 145 150 155 160 His Ala Ser Asp Thr Thr Val Val Val Ser Ala Pro Gly Leu Gly Asp 165 170 175 Glu Ile Gln Ala Ile Lys Ala Gly Val Leu Glu Ile Ala Asp Ile His 180 185 190 Val Val Ser Lys Cys Asp Arg Asp Asp Ala Asn Arg Thr Leu Thr Asp 195 200 205 Leu Lys Gln Met Leu Thr Leu Gly Thr Met Val Gly Pro Lys Arg Ala 210 215 220 Trp Ala Ile Pro Val Val Gly Val Ser Ser Tyr Thr Gly Glu Gly Val 225 230 235 240 Asp Asp Leu Leu Gly Arg Ile Ala Ala His Arg Gln Ala Thr Ala Asp 245 250 255 Thr Glu Leu Gly Arg Glu Arg Arg Arg Arg Val Ala Glu Phe Arg Leu 260 265 270 Gln Lys Thr Ala Glu Thr Leu Leu Leu Glu Arg Phe Thr Thr Gly Ala 275 280 285 Gln Pro Phe Ser Pro Ala Leu Ala Asp Ser Leu Ser Asn Arg Ala Ser 290 295 300 Asp Pro Tyr Ala Ala Ala Arg Glu Leu Ile Ala Arg Thr Ile Arg Lys 305 310 315 320 Glu Tyr Ser Asn Asp Leu Ala 325 <210> SEQ ID NO 86 <211> LENGTH: 312 <212> TYPE: PRT <213> ORGANISM: Kyrpidia tusciae <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: MeaB, WP_013074529.1 <400> SEQUENCE: 86 Met Gln Glu Leu Leu Ser Arg Phe Asp Ala Gly Asp Pro Val Ala Leu 1 5 10 15 Gly Lys Leu Leu Lys Glu Val Glu Asn Gly Thr Ser Ser Gly Lys Glu 20 25 30 Ala Leu Arg Cys Thr Ala Ser Arg Gln Gly Arg Ala His Val Val Gly 35 40 45 Ile Thr Gly Pro Pro Gly Ala Gly Lys Ser Thr Leu Thr Ala Lys Leu 50 55 60 Ser Lys Arg Trp Ala Glu Ala Gly Arg Glu Val Gly Ile Val Cys Val 65 70 75 80 Asp Pro Thr Ser Pro Phe Ser Gly Gly Ala Leu Leu Gly Asp Arg Ile 85 90 95 Arg Met Leu Glu Leu Ser Ser Phe Pro Asn Val Phe Ile Lys Ser Leu 100 105 110 Ala Thr Arg Gly Ser Leu Gly Gly Met Ala Ala Ser Thr Ala Asp Ile 115 120 125 Ile Gln Leu Met Asp Ala Tyr Gly Lys Glu Val Val Val Val Glu Thr 130 135 140 Val Gly Val Gly Gln Val Glu Phe Asp Val Met Asp Leu Ser Asp Thr 145 150 155 160 Val Val Leu Val Asn Val Pro Gly Leu Gly Asp Ser Ile Gln Ala Leu 165 170 175 Lys Ala Gly Ile Leu Glu Ile Ala Asp Ile Phe Val Ile Asn Gln Ala 180 185 190 Asp Arg Pro Gly Ala Glu Asp Ser Val Arg Asp Leu Arg Gln Met Leu 195 200 205 Ala Asp Arg Lys Glu Thr Gly Trp Leu Trp Pro Val Val Lys Thr Val 210 215 220 Ala Thr Arg Gly Glu Gly Ile Asp Arg Leu Ala Glu Ala Ile Glu Ser 225 230 235 240 His Arg Ala Tyr Leu Lys Arg Glu Gln Leu Trp Glu Glu Lys Arg Cys 245 250 255 Arg Arg Asn Arg Gln Arg Leu Met Gln Glu Met Asp Arg Leu Phe Arg 260 265 270 Gln His Val Leu Thr Arg Ile Arg Thr Asp Pro Thr Ala Arg Ala Leu 275 280 285 Phe Glu Glu Val Glu Lys Gly Thr Gln Asp Pro Tyr Ser Ala Ala Arg 290 295 300 His Leu Phe Gln Glu Ile Val Asn 305 310 <210> SEQ ID NO 87 <211> LENGTH: 301 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Ptb, WP_010966357.1 <400> SEQUENCE: 87 Met Ile Lys Ser Phe Asn Glu Ile Ile Met Lys Val Lys Ser Lys Glu 1 5 10 15 Met Lys Lys Val Ala Val Ala Val Ala Gln Asp Glu Pro Val Leu Glu 20 25 30 Ala Val Arg Asp Ala Lys Lys Asn Gly Ile Ala Asp Ala Ile Leu Val 35 40 45 Gly Asp His Asp Glu Ile Val Ser Ile Ala Leu Lys Ile Gly Met Asp 50 55 60 Val Asn Asp Phe Glu Ile Val Asn Glu Pro Asn Val Lys Lys Ala Ala 65 70 75 80 Leu Lys Ala Val Glu Leu Val Ser Thr Gly Lys Ala Asp Met Val Met 85 90 95 Lys Gly Leu Val Asn Thr Ala Thr Phe Leu Arg Ser Val Leu Asn Lys 100 105 110 Glu Val Gly Leu Arg Thr Gly Lys Thr Met Ser His Val Ala Val Phe 115 120 125 Glu Thr Glu Lys Phe Asp Arg Leu Leu Phe Leu Thr Asp Val Ala Phe 130 135 140 Asn Thr Tyr Pro Glu Leu Lys Glu Lys Ile Asp Ile Val Asn Asn Ser 145 150 155 160 Val Lys Val Ala His Ala Ile Gly Ile Glu Asn Pro Lys Val Ala Pro 165 170 175 Ile Cys Ala Val Glu Val Ile Asn Pro Lys Met Pro Ser Thr Leu Asp 180 185 190 Ala Ala Met Leu Ser Lys Met Ser Asp Arg Gly Gln Ile Lys Gly Cys 195 200 205 Val Val Asp Gly Pro Leu Ala Leu Asp Ile Ala Leu Ser Glu Glu Ala 210 215 220 Ala His His Lys Gly Val Thr Gly Glu Val Ala Gly Lys Ala Asp Ile 225 230 235 240 Phe Leu Met Pro Asn Ile Glu Thr Gly Asn Val Met Tyr Lys Thr Leu 245 250 255 Thr Tyr Thr Thr Asp Ser Lys Asn Gly Gly Ile Leu Val Gly Thr Ser 260 265 270 Ala Pro Val Val Leu Thr Ser Arg Ala Asp Ser His Glu Thr Lys Met 275 280 285 Asn Ser Ile Ala Leu Ala Ala Leu Val Ala Gly Asn Lys 290 295 300 <210> SEQ ID NO 88 <211> LENGTH: 302 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Ptb <400> SEQUENCE: 88 Met Ser Lys Asn Phe Asp Glu Leu Leu Ser Arg Leu Lys Glu Val Pro 1 5 10 15 Thr Lys Lys Val Ala Val Ala Val Ala Gln Asp Glu Pro Val Leu Glu 20 25 30 Ala Ile Lys Glu Ala Thr Glu Asn Asn Ile Ala Glu Ala Ile Leu Val 35 40 45 Gly Asp Lys Gln Gln Ile His Glu Ile Ala Lys Lys Ile Asn Leu Asp 50 55 60 Leu Ser Asp Tyr Glu Ile Met Asp Ile Lys Asp Pro Lys Lys Ala Thr 65 70 75 80 Leu Glu Ala Val Lys Leu Val Ser Ser Gly His Ala Asp Met Leu Met 85 90 95 Lys Gly Leu Val Asp Thr Ala Thr Phe Leu Arg Ser Val Leu Asn Lys 100 105 110 Glu Val Gly Leu Arg Thr Gly Lys Leu Met Ser His Val Ala Val Phe 115 120 125 Asp Val Glu Gly Trp Asp Arg Leu Leu Phe Leu Thr Asp Ala Ala Phe 130 135 140 Asn Thr Tyr Pro Glu Phe Lys Asp Lys Val Gly Met Ile Asn Asn Ala 145 150 155 160 Val Val Val Ala His Ala Cys Gly Ile Asp Val Pro Arg Ile Ala Pro 165 170 175 Ile Cys Pro Val Glu Val Val Asn Thr Ser Met Gln Ser Thr Val Asp 180 185 190 Ala Ala Leu Leu Ala Lys Met Ser Asp Arg Gly Gln Ile Lys Gly Cys 195 200 205 Ile Ile Asp Gly Pro Phe Ala Leu Asp Asn Ala Ile Ser Glu Glu Ala 210 215 220 Ala His His Lys Gly Val Thr Gly Ser Val Ala Gly Lys Ala Asp Ile 225 230 235 240 Leu Leu Leu Pro Asn Ile Glu Ala Ala Asn Val Met Tyr Lys Thr Leu 245 250 255 Thr Tyr Phe Ser Lys Ser Arg Asn Gly Gly Leu Leu Val Gly Thr Ser 260 265 270 Ala Pro Val Ile Leu Thr Ser Arg Ala Asp Ser Phe Glu Thr Lys Val 275 280 285 Asn Ser Ile Ala Leu Ala Ala Leu Val Ala Ala Arg Asn Lys 290 295 300 <210> SEQ ID NO 89 <211> LENGTH: 302 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Ptb, WP_041893500.1 <400> SEQUENCE: 89 Met Ser Lys Asn Phe Asp Glu Leu Leu Ser Arg Leu Lys Glu Val Pro 1 5 10 15 Thr Lys Lys Val Ala Val Ala Val Ala Gln Asp Glu Pro Val Leu Glu 20 25 30 Ala Ile Lys Glu Ala Thr Glu Asn Asn Ile Ala Gln Ala Ile Leu Val 35 40 45 Gly Asp Lys Gln Gln Ile His Glu Ile Ala Lys Lys Ile Asn Leu Asp 50 55 60 Leu Ser Asp Tyr Glu Ile Met Asp Ile Lys Asp Pro Lys Lys Ala Thr 65 70 75 80 Leu Glu Ala Val Lys Leu Val Ser Ser Gly His Ala Asp Met Leu Met 85 90 95 Lys Gly Leu Val Asp Thr Ala Thr Phe Leu Arg Ser Val Leu Asn Lys 100 105 110 Glu Val Gly Leu Arg Thr Gly Lys Leu Met Ser His Val Ala Val Phe 115 120 125 Asp Val Glu Gly Trp Asp Arg Leu Leu Phe Leu Thr Asp Ala Ala Phe 130 135 140 Asn Thr Tyr Pro Glu Phe Lys Asp Lys Val Gly Met Ile Asn Asn Ala 145 150 155 160 Val Val Val Ala His Ala Cys Gly Ile Asp Val Pro Arg Ile Ala Pro 165 170 175 Ile Cys Pro Val Glu Val Val Asn Thr Ser Met Gln Ser Thr Val Asp 180 185 190 Ala Ala Leu Leu Ala Lys Met Ser Asp Arg Gly Gln Ile Lys Gly Cys 195 200 205 Val Ile Asp Gly Pro Phe Ala Leu Asp Asn Ala Ile Ser Glu Glu Ala 210 215 220 Ala His His Lys Gly Val Thr Gly Ser Val Ala Gly Lys Ala Asp Ile 225 230 235 240 Leu Leu Leu Pro Asn Ile Glu Ala Ala Asn Val Met Tyr Lys Thr Leu 245 250 255 Thr Tyr Phe Ser Lys Ser Arg Asn Gly Gly Leu Leu Val Gly Thr Ser 260 265 270 Ala Pro Val Ile Leu Thr Ser Arg Ala Asp Ser Phe Glu Thr Lys Val 275 280 285 Asn Ser Ile Ala Leu Ala Ala Leu Val Ala Ala Arg Asn Lys 290 295 300 <210> SEQ ID NO 90 <211> LENGTH: 355 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Buk, WP_010966356.1 <400> SEQUENCE: 90 Met Tyr Arg Leu Leu Ile Ile Asn Pro Gly Ser Thr Ser Thr Lys Ile 1 5 10 15 Gly Ile Tyr Asp Asp Glu Lys Glu Ile Phe Glu Lys Thr Leu Arg His 20 25 30 Ser Ala Glu Glu Ile Glu Lys Tyr Asn Thr Ile Phe Asp Gln Phe Gln 35 40 45 Phe Arg Lys Asn Val Ile Leu Asp Ala Leu Lys Glu Ala Asn Ile Glu 50 55 60 Val Ser Ser Leu Asn Ala Val Val Gly Arg Gly Gly Leu Leu Lys Pro 65 70 75 80 Ile Val Ser Gly Thr Tyr Ala Val Asn Gln Lys Met Leu Glu Asp Leu 85 90 95 Lys Val Gly Val Gln Gly Gln His Ala Ser Asn Leu Gly Gly Ile Ile 100 105 110 Ala Asn Glu Ile Ala Lys Glu Ile Asn Val Pro Ala Tyr Ile Val Asp 115 120 125 Pro Val Val Val Asp Glu Leu Asp Glu Val Ser Arg Ile Ser Gly Met 130 135 140 Ala Asp Ile Pro Arg Lys Ser Ile Phe His Ala Leu Asn Gln Lys Ala 145 150 155 160 Val Ala Arg Arg Tyr Ala Lys Glu Val Gly Lys Lys Tyr Glu Asp Leu 165 170 175 Asn Leu Ile Val Val His Met Gly Gly Gly Thr Ser Val Gly Thr His 180 185 190 Lys Asp Gly Arg Val Ile Glu Val Asn Asn Thr Leu Asp Gly Glu Gly 195 200 205 Pro Phe Ser Pro Glu Arg Ser Gly Gly Val Pro Ile Gly Asp Leu Val 210 215 220 Arg Leu Cys Phe Ser Asn Lys Tyr Thr Tyr Glu Glu Val Met Lys Lys 225 230 235 240 Ile Asn Gly Lys Gly Gly Val Val Ser Tyr Leu Asn Thr Ile Asp Phe 245 250 255 Lys Ala Val Val Asp Lys Ala Leu Glu Gly Asp Lys Lys Cys Ala Leu 260 265 270 Ile Tyr Glu Ala Phe Thr Phe Gln Val Ala Lys Glu Ile Gly Lys Cys 275 280 285 Ser Thr Val Leu Lys Gly Asn Val Asp Ala Ile Ile Leu Thr Gly Gly 290 295 300 Ile Ala Tyr Asn Glu His Val Cys Asn Ala Ile Glu Asp Arg Val Lys 305 310 315 320 Phe Ile Ala Pro Val Val Arg Tyr Gly Gly Glu Asp Glu Leu Leu Ala 325 330 335 Leu Ala Glu Gly Gly Leu Arg Val Leu Arg Gly Glu Glu Lys Ala Lys 340 345 350 Glu Tyr Lys 355 <210> SEQ ID NO 91 <211> LENGTH: 355 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Buk, WP_011967556 <400> SEQUENCE: 91 Met Ser Tyr Lys Leu Leu Ile Ile Asn Pro Gly Ser Thr Ser Thr Lys 1 5 10 15 Ile Gly Val Tyr Glu Gly Glu Lys Glu Leu Phe Glu Glu Thr Leu Arg 20 25 30 His Thr Asn Glu Glu Ile Lys Arg Tyr Asp Thr Ile Tyr Asp Gln Phe 35 40 45 Glu Phe Arg Lys Glu Val Ile Leu Asn Val Leu Lys Glu Lys Asn Phe 50 55 60 Asp Ile Lys Thr Leu Ser Ala Ile Val Gly Arg Gly Gly Met Leu Arg 65 70 75 80 Pro Val Glu Gly Gly Thr Tyr Ala Val Asn Asp Ala Met Val Glu Asp 85 90 95 Leu Lys Val Gly Val Gln Gly Pro His Ala Ser Asn Leu Gly Gly Ile 100 105 110 Ile Ala Lys Ser Ile Gly Asp Glu Leu Asn Ile Pro Ser Phe Ile Val 115 120 125 Asp Pro Val Val Thr Asp Glu Leu Ala Asp Val Ala Arg Leu Ser Gly 130 135 140 Val Pro Glu Leu Pro Arg Lys Ser Lys Phe His Ala Leu Asn Gln Lys 145 150 155 160 Ala Val Ala Lys Arg Tyr Gly Lys Glu Ser Gly Gln Gly Tyr Glu Asn 165 170 175 Leu Asn Leu Val Val Val His Met Gly Gly Gly Val Ser Val Gly Ala 180 185 190 His Asn His Gly Lys Val Val Asp Val Asn Asn Ala Leu Asp Gly Asp 195 200 205 Gly Pro Phe Ser Pro Glu Arg Ala Gly Ser Val Pro Ile Gly Asp Leu 210 215 220 Val Lys Met Cys Phe Ser Gly Lys Tyr Ser Glu Ala Glu Val Tyr Gly 225 230 235 240 Lys Ala Val Gly Lys Gly Gly Phe Val Gly Tyr Leu Asn Thr Asn Asp 245 250 255 Val Lys Gly Val Ile Asp Lys Met Glu Glu Gly Asp Lys Glu Cys Glu 260 265 270 Ser Ile Tyr Lys Ala Phe Val Tyr Gln Ile Ser Lys Ala Ile Gly Glu 275 280 285 Met Ser Val Val Leu Glu Gly Lys Val Asp Gln Ile Ile Phe Thr Gly 290 295 300 Gly Ile Ala Tyr Ser Pro Thr Leu Val Pro Asp Leu Lys Ala Lys Val 305 310 315 320 Glu Trp Ile Ala Pro Val Thr Val Tyr Pro Gly Glu Asp Glu Leu Leu 325 330 335 Ala Leu Ala Gln Gly Ala Ile Arg Val Leu Asp Gly Glu Glu Gln Ala 340 345 350 Lys Val Tyr 355 <210> SEQ ID NO 92 <211> LENGTH: 355 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Buk, WP_017209677 <400> SEQUENCE: 92 Met Ser Tyr Lys Leu Leu Ile Ile Asn Pro Gly Ser Thr Ser Thr Lys 1 5 10 15 Ile Gly Val Tyr Glu Gly Glu Lys Glu Leu Phe Glu Glu Thr Leu Arg 20 25 30 His Thr Asn Glu Glu Ile Lys Arg Tyr Asp Thr Ile Tyr Asp Gln Phe 35 40 45 Glu Phe Arg Lys Glu Val Ile Leu Asn Val Leu Lys Glu Lys Asn Phe 50 55 60 Asp Ile Lys Thr Leu Ser Ala Ile Val Gly Arg Gly Gly Met Leu Arg 65 70 75 80 Pro Val Glu Gly Gly Thr Tyr Ala Val Asn Asp Ala Met Val Glu Asp 85 90 95 Leu Lys Val Gly Val Gln Gly Pro His Ala Ser Asn Leu Gly Gly Ile 100 105 110 Ile Ala Lys Ser Ile Gly Asp Glu Leu Asn Ile Pro Ser Phe Ile Val 115 120 125 Asp Pro Val Val Thr Asp Glu Leu Ala Asp Val Ala Arg Leu Ser Gly 130 135 140 Val Pro Glu Leu Pro Arg Lys Ser Lys Phe His Ala Leu Asn Gln Lys 145 150 155 160 Ala Val Ala Lys Arg Tyr Gly Lys Glu Ser Gly Gln Gly Tyr Glu Asn 165 170 175 Leu Asn Leu Val Val Val His Met Gly Gly Gly Val Ser Val Gly Ala 180 185 190 His Asn His Gly Lys Val Val Asp Val Asn Asn Ala Leu Asp Gly Asp 195 200 205 Gly Pro Phe Ser Pro Glu Arg Ala Gly Ser Val Pro Ile Gly Asp Leu 210 215 220 Val Lys Met Cys Phe Ser Gly Lys Tyr Ser Glu Ala Glu Val Tyr Gly 225 230 235 240 Lys Val Val Gly Lys Gly Gly Phe Val Gly Tyr Leu Asn Thr Asn Asp 245 250 255 Val Lys Gly Val Ile Asp Lys Met Glu Glu Gly Asp Lys Glu Cys Gly 260 265 270 Ser Ile Tyr Lys Ala Phe Val Tyr Gln Ile Ser Lys Ala Ile Gly Glu 275 280 285 Met Ser Val Val Leu Glu Gly Lys Val Asp Gln Ile Ile Phe Thr Gly 290 295 300 Gly Ile Ala Tyr Ser Pro Thr Leu Val Pro Asp Leu Lys Ala Lys Val 305 310 315 320 Glu Trp Ile Ala Pro Val Thr Val Tyr Pro Gly Glu Asp Glu Leu Leu 325 330 335 Ala Leu Ala Gln Gly Ala Ile Arg Val Leu Asp Gly Glu Glu Gln Ala 340 345 350 Lys Val Tyr 355 <210> SEQ ID NO 93 <211> LENGTH: 355 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Buk, WP_026886638 <400> SEQUENCE: 93 Met Ser Tyr Lys Leu Leu Ile Ile Asn Pro Gly Ser Thr Ser Thr Lys 1 5 10 15 Ile Gly Val Tyr Glu Gly Glu Lys Glu Leu Phe Glu Glu Thr Leu Arg 20 25 30 His Thr Asn Glu Glu Ile Lys Arg Tyr Asp Thr Ile Tyr Asp Gln Phe 35 40 45 Glu Phe Arg Lys Glu Val Ile Leu Asn Val Leu Lys Glu Lys Asn Phe 50 55 60 Asp Ile Lys Thr Leu Ser Ala Ile Val Gly Arg Gly Gly Met Leu Arg 65 70 75 80 Pro Val Glu Gly Gly Thr Tyr Ala Val Asn Asp Ala Met Val Glu Asp 85 90 95 Leu Lys Val Gly Val Gln Gly Pro His Ala Ser Asn Leu Gly Gly Ile 100 105 110 Ile Ala Lys Ser Ile Gly Asp Glu Leu Asn Ile Pro Ser Phe Ile Val 115 120 125 Asp Pro Val Val Thr Asp Glu Leu Ala Asp Val Ala Arg Leu Ser Gly 130 135 140 Val Pro Glu Leu Pro Arg Lys Ser Lys Phe His Ala Leu Asn Gln Lys 145 150 155 160 Ala Val Ala Lys Arg Tyr Gly Lys Glu Ser Gly Gln Gly Tyr Glu Asn 165 170 175 Leu Asn Leu Val Val Val His Met Gly Gly Gly Val Ser Val Gly Ala 180 185 190 His Asn His Gly Lys Val Val Asp Val Asn Asn Ala Leu Asp Gly Asp 195 200 205 Gly Pro Phe Ser Pro Glu Arg Ala Gly Ser Val Pro Ile Gly Asp Leu 210 215 220 Val Lys Met Cys Phe Ser Gly Lys Tyr Ser Glu Ala Glu Val Tyr Gly 225 230 235 240 Lys Val Val Gly Lys Gly Gly Phe Val Gly Tyr Leu Asn Thr Asn Asp 245 250 255 Val Lys Gly Val Ile Asp Asn Met Glu Ser Gly Asp Lys Glu Cys Glu 260 265 270 Ser Ile Tyr Lys Ala Phe Val Tyr Gln Ile Ser Lys Ala Ile Gly Glu 275 280 285 Met Ser Val Val Leu Glu Gly Lys Val Asp Gln Ile Ile Phe Thr Gly 290 295 300 Gly Ile Ala Tyr Ser Pro Thr Leu Val Pro Asp Leu Lys Glu Lys Val 305 310 315 320 Glu Trp Ile Ala Pro Val Thr Val Tyr Pro Gly Glu Asp Glu Leu Leu 325 330 335 Ala Leu Ala Gln Gly Ala Ile Arg Val Leu Asp Gly Glu Glu Gln Ala 340 345 350 Lys Val Tyr 355 <210> SEQ ID NO 94 <211> LENGTH: 355 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Buk, WP_041893502 <400> SEQUENCE: 94 Met Ser Tyr Lys Leu Leu Ile Ile Asn Pro Gly Ser Thr Ser Thr Lys 1 5 10 15 Ile Gly Val Tyr Glu Gly Glu Lys Glu Leu Phe Glu Glu Thr Leu Arg 20 25 30 His Thr Asn Glu Glu Ile Lys Arg Tyr Asp Thr Ile Tyr Asp Gln Phe 35 40 45 Glu Phe Arg Lys Glu Val Ile Leu Asn Val Leu Lys Glu Lys Asn Phe 50 55 60 Asp Ile Lys Thr Leu Ser Ala Ile Val Gly Arg Gly Gly Met Leu Arg 65 70 75 80 Pro Val Glu Gly Gly Thr Tyr Ala Val Asn Asp Ala Met Val Glu Asp 85 90 95 Leu Lys Val Gly Val Gln Gly Pro His Ala Ser Asn Leu Gly Gly Ile 100 105 110 Ile Ala Lys Ser Ile Gly Asp Glu Leu Ser Ile Pro Ser Phe Ile Val 115 120 125 Asp Pro Val Val Thr Asp Glu Leu Ala Asp Val Ala Arg Leu Ser Gly 130 135 140 Val Pro Glu Leu Pro Arg Lys Ser Lys Phe His Ala Leu Asn Gln Lys 145 150 155 160 Ala Val Ala Lys Arg Tyr Gly Lys Glu Ser Gly Gln Gly Tyr Glu Asn 165 170 175 Leu Asn Leu Val Val Val His Met Gly Gly Gly Val Ser Val Gly Ala 180 185 190 His Asn His Gly Lys Val Val Asp Val Asn Asn Ala Leu Asp Gly Asp 195 200 205 Gly Pro Phe Ser Pro Glu Arg Ala Gly Ser Val Pro Ile Gly Asp Leu 210 215 220 Val Lys Met Cys Phe Ser Gly Lys Tyr Ser Glu Ala Glu Val Tyr Gly 225 230 235 240 Lys Val Val Gly Lys Gly Gly Phe Val Gly Tyr Leu Asn Thr Asn Asp 245 250 255 Val Lys Gly Val Ile Asp Lys Met Glu Glu Gly Asp Lys Glu Cys Gly 260 265 270 Ser Ile Tyr Lys Ala Phe Val Tyr Gln Ile Ser Lys Ala Ile Gly Glu 275 280 285 Met Ser Val Val Leu Glu Gly Lys Val Asp Gln Ile Ile Phe Thr Gly 290 295 300 Gly Ile Ala Tyr Ser Pro Thr Leu Val Pro Asp Leu Lys Ala Lys Val 305 310 315 320 Glu Trp Ile Ala Pro Val Thr Val Tyr Pro Gly Glu Asp Glu Leu Leu 325 330 335 Ala Leu Ala Gln Gly Ala Ile Arg Val Leu Asp Gly Glu Glu Gln Ala 340 345 350 Lys Val Tyr 355 <210> SEQ ID NO 95 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pACYCDuet-ptb-buk - pACYC-ptb-R1, reverse <400> SEQUENCE: 95 aagtttttac tcatatgtat atctccttct tatacttaac 40 <210> SEQ ID NO 96 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pACYCDuet-ptb-buk - ptb-pACYC-F1, forward <400> SEQUENCE: 96 agaaggagat atacatatga gtaaaaactt tgatgagtta 40 <210> SEQ ID NO 97 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pACYCDuet-ptb-buk - buk-pACYC-R1, reverse <400> SEQUENCE: 97 accagactcg agggtaccta gtaaacctta gcttgttc 38 <210> SEQ ID NO 98 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pACYCDuet-ptb-buk - pACYC-buk-F1, forward <400> SEQUENCE: 98 taaggtttac taggtaccct cgagtctggt aaagaaac 38 <210> SEQ ID NO 99 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - thlA-adc-R1, reverse <400> SEQUENCE: 99 acatatgtat atctccttct tactagcact tttctagcaa tattg 45 <210> SEQ ID NO 100 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - adc-ThlA-F1, forward <400> SEQUENCE: 100 agtaagaagg agatatacat atgttagaaa gtgaagtatc taaac 45 <210> SEQ ID NO 101 <211> LENGTH: 43 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - adc-pCOLA-R1, reverse <400> SEQUENCE: 101 cagactcgag ggtaccttat tttactgaaa gataatcatg tac 43 <210> SEQ ID NO 102 <211> LENGTH: 42 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - pCOLA-adc-F1, forward <400> SEQUENCE: 102 tctttcagta aaataaggta ccctcgagtc tggtaaagaa ac 42 <210> SEQ ID NO 103 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - thlA-pCOLA-F1, forward <400> SEQUENCE: 103 gaaggagata tacatatgaa agaagttgta atagctagtg 40 <210> SEQ ID NO 104 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - pCOLA-thlA-R1, reverse <400> SEQUENCE: 104 acaacttctt tcatatgtat atctccttct tatacttaac 40 <210> SEQ ID NO 105 <211> LENGTH: 5791 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pACYC-ptb-buk, plasmid <400> SEQUENCE: 105 ggggaattgt gagcggataa caattcccct gtagaaataa ttttgtttaa ctttaataag 60 gagatatacc atgggcagca gccatcacca tcatcaccac agccaggatc cgaattcgag 120 ctcggcgcgc ctgcaggtcg acaagcttgc ggccgcataa tgcttaagtc gaacagaaag 180 taatcgtatt gtacacggcc gcataatcga aattaatacg actcactata ggggaattgt 240 gagcggataa caattcccca tcttagtata ttagttaagt ataagaagga gatatacata 300 tgagtaaaaa ctttgatgag ttattatcaa gattaaagga agttccaaca aaaaaagtgg 360 ctgtagccgt agcacaagat gaaccagtat tagaggctat aaaagaagct acagaaaata 420 acatcgcaca agcaatattg gttggtgata aacaacaaat ccatgaaatc gcaaagaaaa 480 taaacttgga cttatctgat tatgaaataa tggatattaa agatccaaag aaagcaacat 540 tagaagcagt aaaattagtt tctagtggtc atgcagatat gttaatgaaa ggtctagttg 600 atactgcaac attcctaaga agcgtattaa acaaagaggt tggtcttaga acaggaaaat 660 taatgtccca tgtagctgtg tttgatgtgg aaggttggga tagactgtta tttttaactg 720 atgcagcatt taatacatat ccagaattta aggataaagt tggaatgata aataatgcag 780 ttgtagttgc tcatgcatgt ggaatagatg ttccaagagt agcacctata tgcccagttg 840 aagttgtaaa tacaagtatg caatcaacag ttgatgcagc attgttagct aaaatgagtg 900 acagggggca aattaaagga tgcgtaattg atggaccttt tgccttagat aatgcaatat 960 cagaagaagc agctcatcat aaaggtgtta caggatcagt agcaggtaaa gctgatatat 1020 tattattacc aaatatagaa gcagcaaatg taatgtataa aacattaaca tatttctcta 1080 aatcaagaaa tggtggactt ttagtaggta catcagcacc agtaatttta acttcaagag 1140 cagattcatt cgaaactaaa gttaattcaa ttgctcttgc agcattagtt gcagcaagaa 1200 ataagtaata aatcaatcca taataattaa tgcataatta atggagagat ttatatggaa 1260 tttgcaatgc actattagat tctataataa tttcttctga aaattatgca ttatgactgt 1320 atagaatgca ttaaatttaa gggggattca gaatgtcata taagctatta ataatcaatc 1380 caggttcaac atcaacaaag attggtgttt acgaaggaga aaaggaacta tttgaagaaa 1440 ctttgagaca cacaaatgaa gaaataaaga gatatgatac aatatatgat caatttgaat 1500 ttagaaaaga agttatatta aatgttctta aagaaaagaa ttttgatata aagactctaa 1560 gtgctattgt tggtagaggt ggaatgctta gaccagttga aggtggaaca tatgcagtaa 1620 atgatgcaat ggttgaagat ttaaaagttg gagttcaagg acctcatgct tctaaccttg 1680 gcggaataat tgccaagtca attggagatg aattaaatat tccatcattt atagtagatc 1740 cagttgttac agatgagtta gcagatgtag caagactatc tggagtacca gaactaccaa 1800 gaaaaagtaa attccatgct ttaaatcaaa aagcggtagc taaaagatat ggaaaagaaa 1860 gtggacaagg atatgaaaac ctaaatcttg tagttgtaca tatgggtgga ggcgtttcag 1920 ttggtgctca caatcatggg aaagttgtcg atgtaaataa tgcattagat ggagatggcc 1980 cattctcacc agaaagagct ggatcagttc caattggtga tttagttaaa atgtgtttta 2040 gtggaaaata tagtgaagca gaagtatatg gcaaggctgt aggaaaaggt ggatttgttg 2100 gttatctaaa cacaaatgat gtaaaaggtg ttattgataa gatggaagaa ggagataaag 2160 aatgtgaatc aatatacaaa gcatttgttt atcaaatttc aaaagcaatc ggagaaatgt 2220 cagttgtatt agaaggtaaa gttgatcaaa ttatttttac cggaggaatt gcatactcac 2280 caacacttgt tccagacctt aaagcaaaag ttgaatggat agccccagtt acagtttatc 2340 ctggagaaga tgaattactt gctctagctc aaggtgctat aagagtactt gatggagaag 2400 aacaagctaa ggtttactag gtaccctcga gtctggtaaa gaaaccgctg ctgcgaaatt 2460 tgaacgccag cacatggact cgtctactag cgcagcttaa ttaacctagg ctgctgccac 2520 cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt 2580 gctgaaacct caggcatttg agaagcacac ggtcacactg cttccggtag tcaataaacc 2640 ggtaaaccag caatagacat aagcggctat ttaacgaccc tgccctgaac cgacgacaag 2700 ctgacgaccg ggtctccgca agtggcactt ttcggggaaa tgtgcgcgga acccctattt 2760 gtttattttt ctaaatacat tcaaatatgt atccgctcat gaattaattc ttagaaaaac 2820 tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gattatcaat accatatttt 2880 tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca taggatggca 2940 agatcctggt atcggtctgc gattccgact cgtccaacat caatacaacc tattaatttc 3000 ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac tgaatccggt 3060 gagaatggca aaagtttatg catttctttc cagacttgtt caacaggcca gccattacgc 3120 tcgtcatcaa aatcactcgc atcaaccaaa ccgttattca ttcgtgattg cgcctgagcg 3180 agacgaaata cgcggtcgct gttaaaagga caattacaaa caggaatcga atgcaaccgg 3240 cgcaggaaca ctgccagcgc atcaacaata ttttcacctg aatcaggata ttcttctaat 3300 acctggaatg ctgttttccc ggggatcgca gtggtgagta accatgcatc atcaggagta 3360 cggataaaat gcttgatggt cggaagaggc ataaattccg tcagccagtt tagtctgacc 3420 atctcatctg taacatcatt ggcaacgcta cctttgccat gtttcagaaa caactctggc 3480 gcatcgggct tcccatacaa tcgatagatt gtcgcacctg attgcccgac attatcgcga 3540 gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg cctagagcaa 3600 gacgtttccc gttgaatatg gctcatactc ttcctttttc aatattattg aagcatttat 3660 cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 3720 ggcatgctag cgcagaaacg tcctagaaga tgccaggagg atacttagca gagagacaat 3780 aaggccggag cgaagccgtt tttccatagg ctccgccccc ctgacgaaca tcacgaaatc 3840 tgacgctcaa atcagtggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 3900 cctgatggct ccctcttgcg ctctcctgtt cccgtcctgc ggcgtccgtg ttgtggtgga 3960 ggctttaccc aaatcaccac gtcccgttcc gtgtagacag ttcgctccaa gctgggctgt 4020 gtgcaagaac cccccgttca gcccgactgc tgcgccttat ccggtaacta tcatcttgag 4080 tccaacccgg aaagacacga caaaacgcca ctggcagcag ccattggtaa ctgagaatta 4140 gtggatttag atatcgagag tcttgaagtg gtggcctaac agaggctaca ctgaaaggac 4200 agtatttggt atctgcgctc cactaaagcc agttaccagg ttaagcagtt ccccaactga 4260 cttaaccttc gatcaaaccg cctccccagg cggttttttc gtttacagag caggagatta 4320 cgacgatcgt aaaaggatct caagaagatc ctttacggat tcccgacacc atcactctag 4380 atttcagtgc aatttatctc ttcaaatgta gcacctgaag tcagccccat acgatataag 4440 ttgtaattct catgttagtc atgccccgcg cccaccggaa ggagctgact gggttgaagg 4500 ctctcaaggg catcggtcga gatcccggtg cctaatgagt gagctaactt acattaattg 4560 cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 4620 tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ccagggtggt ttttcttttc 4680 accagtgaga cgggcaacag ctgattgccc ttcaccgcct ggccctgaga gagttgcagc 4740 aagcggtcca cgctggtttg ccccagcagg cgaaaatcct gtttgatggt ggttaacggc 4800 gggatataac atgagctgtc ttcggtatcg tcgtatccca ctaccgagat gtccgcacca 4860 acgcgcagcc cggactcggt aatggcgcgc attgcgccca gcgccatctg atcgttggca 4920 accagcatcg cagtgggaac gatgccctca ttcagcattt gcatggtttg ttgaaaaccg 4980 gacatggcac tccagtcgcc ttcccgttcc gctatcggct gaatttgatt gcgagtgaga 5040 tatttatgcc agccagccag acgcagacgc gccgagacag aacttaatgg gcccgctaac 5100 agcgcgattt gctggtgacc caatgcgacc agatgctcca cgcccagtcg cgtaccgtct 5160 tcatgggaga aaataatact gttgatgggt gtctggtcag agacatcaag aaataacgcc 5220 ggaacattag tgcaggcagc ttccacagca atggcatcct ggtcatccag cggatagtta 5280 atgatcagcc cactgacgcg ttgcgcgaga agattgtgca ccgccgcttt acaggcttcg 5340 acgccgcttc gttctaccat cgacaccacc acgctggcac ccagttgatc ggcgcgagat 5400 ttaatcgccg cgacaatttg cgacggcgcg tgcagggcca gactggaggt ggcaacgcca 5460 atcagcaacg actgtttgcc cgccagttgt tgtgccacgc ggttgggaat gtaattcagc 5520 tccgccatcg ccgcttccac tttttcccgc gttttcgcag aaacgtggct ggcctggttc 5580 accacgcggg aaacggtctg ataagagaca ccggcatact ctgcgacatc gtataacgtt 5640 actggtttca cattcaccac cctgaattga ctctcttccg ggcgctatca tgccataccg 5700 cgaaaggttt tgcgccattc gatggtgtcc gggatctcga cgctctccct tatgcgactc 5760 ctgcattagg aaattaatac gactcactat a 5791 <210> SEQ ID NO 106 <211> LENGTH: 5609 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLA-thlA-adc, plasmid <400> SEQUENCE: 106 ggggaattgt gagcggataa caattcccct gtagaaataa ttttgtttaa ctttaataag 60 gagatatacc atgggcagca gccatcacca tcatcaccac agccaggatc cgaattcgag 120 ctcggcgcgc ctgcaggtcg acaagcttgc ggccgcataa tgcttaagtc gaacagaaag 180 taatcgtatt gtacacggcc gcataatcga aattaatacg actcactata ggggaattgt 240 gagcggataa caattcccca tcttagtata ttagttaagt ataagaagga gatatacata 300 tgaaagaagt tgtaatagct agtgcagtaa gaacagcgat tggatcttat ggaaagtctc 360 ttaaggatgt accagcagta gatttaggag ctacagctat aaaggaagca gttaaaaaag 420 caggaataaa accagaggat gttaatgaag tcattttagg aaatgttctt caagcaggtt 480 taggacagaa tccagcaaga caggcatctt ttaaagcagg attaccagtt gaaattccag 540 ctatgactat taataaggtt tgtggttcag gacttagaac agttagctta gcagcacaaa 600 ttataaaagc aggagatgct gacgtaataa tagcaggtgg tatggaaaat atgtctagag 660 ctccttactt agcgaataac gctagatggg gatatagaat gggaaacgct aaatttgttg 720 atgaaatgat cactgacgga ttgtgggatg catttaatga ttaccacatg ggaataacag 780 cagaaaacat agctgagaga tggaacattt caagagaaga acaagatgag tttgctcttg 840 catcacaaaa aaaagctgaa gaagctataa aatcaggtca atttaaagat gaaatagttc 900 ctgtagtaat taaaggcaga aagggagaaa ctgtagttga tacagatgag caccctagat 960 ttggatcaac tatagaagga cttgcaaaat taaaacctgc cttcaaaaaa gatggaacag 1020 ttacagctgg taatgcatca ggattaaatg actgtgcagc agtacttgta atcatgagtg 1080 cagaaaaagc taaagagctt ggagtaaaac cacttgctaa gatagtttct tatggttcag 1140 caggagttga cccagcaata atgggatatg gacctttcta tgcaacaaaa gcagctattg 1200 aaaaagcagg ttggacagtt gatgaattag atttaataga atcaaatgaa gcttttgcag 1260 ctcaaagttt agcagtagca aaagatttaa aatttgatat gaataaagta aatgtaaatg 1320 gaggagctat tgcccttggt catccaattg gagcatcagg tgcaagaata ctcgttactc 1380 ttgtacacgc aatgcaaaaa agagatgcaa aaaaaggctt agcaacttta tgtataggtg 1440 gcggacaagg aacagcaata ttgctagaaa agtgctagta agaaggagat atacatatgt 1500 tagaaagtga agtatctaaa caaattacaa ctccacttgc tgctccagcg tttcctagag 1560 gaccatatag gtttcacaat agagaatatc taaacattat ttatcgaact gatttagatg 1620 ctcttcgaaa aatagtacca gagccacttg aattagatag agcatatgtt agatttgaaa 1680 tgatggctat gcctgataca accggactag gctcatatac agaatgtggt caagctattc 1740 cagtaaaata taatggtgtt aagggtgact acttgcatat gatgtatcta gataatgaac 1800 ctgctattgc tgttggaaga gaaagtagcg cttatccaaa aaagcttggc tatccaaagc 1860 tatttgttga ttcagatact ttagttggga cacttaaata tggtacatta ccagtagcta 1920 ctgcaacaat gggatataag cacgagcctc tagatcttaa agaagcctat gctcaaattg 1980 caagacccaa ttttatgcta aaaatcattc aaggttacga tggtaagcca agaatttgtg 2040 aactaatatg tgcagaaaat actgatataa ctattcacgg tgcttggact ggaagtgcac 2100 gtctacaatt atttagccat gcactagctc ctcttgctga tttacctgta ttagagattg 2160 tatcagcatc tcatatcctc acagatttaa ctcttggaac acctaaggtt gtacatgatt 2220 atctttcagt aaaataaggt accctcgagt ctggtaaaga aaccgctgct gcgaaatttg 2280 aacgccagca catggactcg tctactagcg cagcttaatt aacctaggct gctgccaccg 2340 ctgagcaata actagcataa ccccttgggg cctctaaacg ggtcttgagg ggttttttgc 2400 tgaaacctca ggcatttgag aagcacacgg tcacactgct tccggtagtc aataaaccgg 2460 taaaccagca atagacataa gcggctattt aacgaccctg ccctgaaccg acgacaagct 2520 gacgaccggg tctccgcaag tggcactttt cggggaaatg tgcgcggaac ccctatttgt 2580 ttatttttct aaatacattc aaatatgtat ccgctcatga attaattctt agaaaaactc 2640 atcgagcatc aaatgaaact gcaatttatt catatcagga ttatcaatac catatttttg 2700 aaaaagccgt ttctgtaatg aaggagaaaa ctcaccgagg cagttccata ggatggcaag 2760 atcctggtat cggtctgcga ttccgactcg tccaacatca atacaaccta ttaatttccc 2820 ctcgtcaaaa ataaggttat caagtgagaa atcaccatga gtgacgactg aatccggtga 2880 gaatggcaaa agtttatgca tttctttcca gacttgttca acaggccagc cattacgctc 2940 gtcatcaaaa tcactcgcat caaccaaacc gttattcatt cgtgattgcg cctgagcgag 3000 acgaaatacg cggtcgctgt taaaaggaca attacaaaca ggaatcgaat gcaaccggcg 3060 caggaacact gccagcgcat caacaatatt ttcacctgaa tcaggatatt cttctaatac 3120 ctggaatgct gttttcccgg ggatcgcagt ggtgagtaac catgcatcat caggagtacg 3180 gataaaatgc ttgatggtcg gaagaggcat aaattccgtc agccagttta gtctgaccat 3240 ctcatctgta acatcattgg caacgctacc tttgccatgt ttcagaaaca actctggcgc 3300 atcgggcttc ccatacaatc gatagattgt cgcacctgat tgcccgacat tatcgcgagc 3360 ccatttatac ccatataaat cagcatccat gttggaattt aatcgcggcc tagagcaaga 3420 cgtttcccgt tgaatatggc tcatactctt cctttttcaa tattattgaa gcatttatca 3480 gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 3540 catgctagcg cagaaacgtc ctagaagatg ccaggaggat acttagcaga gagacaataa 3600 ggccggagcg aagccgtttt tccataggct ccgcccccct gacgaacatc acgaaatctg 3660 acgctcaaat cagtggtggc gaaacccgac aggactataa agataccagg cgtttccccc 3720 tgatggctcc ctcttgcgct ctcctgttcc cgtcctgcgg cgtccgtgtt gtggtggagg 3780 ctttacccaa atcaccacgt cccgttccgt gtagacagtt cgctccaagc tgggctgtgt 3840 gcaagaaccc cccgttcagc ccgactgctg cgccttatcc ggtaactatc atcttgagtc 3900 caacccggaa agacacgaca aaacgccact ggcagcagcc attggtaact gagaattagt 3960 ggatttagat atcgagagtc ttgaagtggt ggcctaacag aggctacact gaaaggacag 4020 tatttggtat ctgcgctcca ctaaagccag ttaccaggtt aagcagttcc ccaactgact 4080 taaccttcga tcaaaccgcc tccccaggcg gttttttcgt ttacagagca ggagattacg 4140 acgatcgtaa aaggatctca agaagatcct ttacggattc ccgacaccat cactctagat 4200 ttcagtgcaa tttatctctt caaatgtagc acctgaagtc agccccatac gatataagtt 4260 gtaattctca tgttagtcat gccccgcgcc caccggaagg agctgactgg gttgaaggct 4320 ctcaagggca tcggtcgaga tcccggtgcc taatgagtga gctaacttac attaattgcg 4380 ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc 4440 ggccaacgcg cggggagagg cggtttgcgt attgggcgcc agggtggttt ttcttttcac 4500 cagtgagacg ggcaacagct gattgccctt caccgcctgg ccctgagaga gttgcagcaa 4560 gcggtccacg ctggtttgcc ccagcaggcg aaaatcctgt ttgatggtgg ttaacggcgg 4620 gatataacat gagctgtctt cggtatcgtc gtatcccact accgagatgt ccgcaccaac 4680 gcgcagcccg gactcggtaa tggcgcgcat tgcgcccagc gccatctgat cgttggcaac 4740 cagcatcgca gtgggaacga tgccctcatt cagcatttgc atggtttgtt gaaaaccgga 4800 catggcactc cagtcgcctt cccgttccgc tatcggctga atttgattgc gagtgagata 4860 tttatgccag ccagccagac gcagacgcgc cgagacagaa cttaatgggc ccgctaacag 4920 cgcgatttgc tggtgaccca atgcgaccag atgctccacg cccagtcgcg taccgtcttc 4980 atgggagaaa ataatactgt tgatgggtgt ctggtcagag acatcaagaa ataacgccgg 5040 aacattagtg caggcagctt ccacagcaat ggcatcctgg tcatccagcg gatagttaat 5100 gatcagccca ctgacgcgtt gcgcgagaag attgtgcacc gccgctttac aggcttcgac 5160 gccgcttcgt tctaccatcg acaccaccac gctggcaccc agttgatcgg cgcgagattt 5220 aatcgccgcg acaatttgcg acggcgcgtg cagggccaga ctggaggtgg caacgccaat 5280 cagcaacgac tgtttgcccg ccagttgttg tgccacgcgg ttgggaatgt aattcagctc 5340 cgccatcgcc gcttccactt tttcccgcgt tttcgcagaa acgtggctgg cctggttcac 5400 cacgcgggaa acggtctgat aagagacacc ggcatactct gcgacatcgt ataacgttac 5460 tggtttcaca ttcaccaccc tgaattgact ctcttccggg cgctatcatg ccataccgcg 5520 aaaggttttg cgccattcga tggtgtccgg gatctcgacg ctctccctta tgcgactcct 5580 gcattaggaa attaatacga ctcactata 5609 <210> SEQ ID NO 107 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: thlA-ptb-R1, reverse <400> SEQUENCE: 107 atttcctccc tttctagcac ttttctagca atattg 36 <210> SEQ ID NO 108 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: adc-buk-F1, forward <400> SEQUENCE: 108 taaggtttac taaggaggtt gttttatgtt agaaag 36 <210> SEQ ID NO 109 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: thlA-ptb-F1, forward <400> SEQUENCE: 109 gctagaaaag tgctagaaag ggaggaaatg aacatg 36 <210> SEQ ID NO 110 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Buk-adc-R1, reverse <400> SEQUENCE: 110 aaaacaacct ccttagtaaa ccttagcttg ttcttc 36 <210> SEQ ID NO 111 <211> LENGTH: 29 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pDuet-insert2-R1, forward <400> SEQUENCE: 111 catatgtata tctccttctt atacttaac 29 <210> SEQ ID NO 112 <211> LENGTH: 29 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: insert2-pDuet-F1, forward <400> SEQUENCE: 112 gttaagtata agaaggagat atacatatg 29 <210> SEQ ID NO 113 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pDuet-insert2-F1, forward <400> SEQUENCE: 113 cctcgagtct ggtaaagaaa c 21 <210> SEQ ID NO 114 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: insert2-pDuet-R1, forward <400> SEQUENCE: 114 gtttctttac cagactcgag g 21 <210> SEQ ID NO 115 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCDF-phaB - pACYC-phaB-R1, forward <400> SEQUENCE: 115 ctattctttg tgtcatggta tatctcctta ttaaag 36 <210> SEQ ID NO 116 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCDF-phaB - phaB-pACYC-F1, forward <400> SEQUENCE: 116 ataaggagat ataccatgac acaaagaata gcatac 36 <210> SEQ ID NO 117 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pcdf-phab - pacyc-phab-f1, forward <400> SEQUENCE: 117 tggtttacac atgggataag atccgaattc gagctc 36 <210> SEQ ID NO 118 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCDF-phaB - phaB-pACYC-R1, forward <400> SEQUENCE: 118 agctcgaatt cggatcttat cccatgtgta aaccac 36 <210> SEQ ID NO 119 <211> LENGTH: 4486 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCDF-phaB, plasmid <400> SEQUENCE: 119 ggggaattgt gagcggataa caattcccct gtagaaataa ttttgtttaa ctttaataag 60 gagatatacc atgacacaaa gaatagcata cgtaacaggt ggtatgggtg gtataggaac 120 tgcaatatgt caaagattag caaaagatgg atttagagtt gtagctggat gcggaccaaa 180 tagtcctaga agagaaaagt ggttagaaca acaaaaagca cttggatttg atttcatagc 240 ttctgaaggt aacgtagcag attgggactc aactaaaact gcttttgata aagttaaatc 300 tgaagttggt gaagttgatg tattaataaa taatgcaggt attactagag atgtagtatt 360 tagaaagatg acaagagctg actgggatgc agtaatagat actaatctta ctagtctttt 420 caatgtaact aagcaggtaa ttgatggtat ggcagataga ggttggggta gaatagtaaa 480 tattagttca gttaatggac aaaaaggtca gtttggacag acaaattatt ctacagctaa 540 agcaggtctt catggtttta caatggcttt agcacaggaa gttgctacaa aaggtgttac 600 agttaacact gttagtccag gatatattgc tactgacatg gtaaaggcta taagacaaga 660 tgttcttgat aaaattgttg ctacaatacc agtaaagaga ttaggacttc ctgaagagat 720 agcatctatt tgtgcatggt tatcaagtga agaatcagga ttctcaactg gtgctgattt 780 ttcattaaac ggtggtttac acatgggata agatccgaat tcgagctcgg cgcgcctgca 840 ggtcgacaag cttgcggccg cataatgctt aagtcgaaca gaaagtaatc gtattgtaca 900 cggccgcata atcgaaatta atacgactca ctatagggga attgtgagcg gataacaatt 960 ccccatctta gtatattagt taagtataag aaggagatat acatatggca gatctcaatt 1020 ggatatcggc cggccacgcg atcgctgacg tcggtaccct cgagtctggt aaagaaaccg 1080 ctgctgcgaa atttgaacgc cagcacatgg actcgtctac tagcgcagct taattaacct 1140 aggctgctgc caccgctgag caataactag cataacccct tggggcctct aaacgggtct 1200 tgaggggttt tttgctgaaa cctcaggcat ttgagaagca cacggtcaca ctgcttccgg 1260 tagtcaataa accggtaaac cagcaataga cataagcggc tatttaacga ccctgccctg 1320 aaccgacgac cgggtcatcg tggccggatc ttgcggcccc tcggcttgaa cgaattgtta 1380 gacattattt gccgactacc ttggtgatct cgcctttcac gtagtggaca aattcttcca 1440 actgatctgc gcgcgaggcc aagcgatctt cttcttgtcc aagataagcc tgtctagctt 1500 caagtatgac gggctgatac tgggccggca ggcgctccat tgcccagtcg gcagcgacat 1560 ccttcggcgc gattttgccg gttactgcgc tgtaccaaat gcgggacaac gtaagcacta 1620 catttcgctc atcgccagcc cagtcgggcg gcgagttcca tagcgttaag gtttcattta 1680 gcgcctcaaa tagatcctgt tcaggaaccg gatcaaagag ttcctccgcc gctggaccta 1740 ccaaggcaac gctatgttct cttgcttttg tcagcaagat agccagatca atgtcgatcg 1800 tggctggctc gaagatacct gcaagaatgt cattgcgctg ccattctcca aattgcagtt 1860 cgcgcttagc tggataacgc cacggaatga tgtcgtcgtg cacaacaatg gtgacttcta 1920 cagcgcggag aatctcgctc tctccagggg aagccgaagt ttccaaaagg tcgttgatca 1980 aagctcgccg cgttgtttca tcaagcctta cggtcaccgt aaccagcaaa tcaatatcac 2040 tgtgtggctt caggccgcca tccactgcgg agccgtacaa atgtacggcc agcaacgtcg 2100 gttcgagatg gcgctcgatg acgccaacta cctctgatag ttgagtcgat acttcggcga 2160 tcaccgcttc cctcatactc ttcctttttc aatattattg aagcatttat cagggttatt 2220 gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata gctagctcac 2280 tcggtcgcta cgctccgggc gtgagactgc ggcgggcgct gcggacacat acaaagttac 2340 ccacagattc cgtggataag caggggacta acatgtgagg caaaacagca gggccgcgcc 2400 ggtggcgttt ttccataggc tccgccctcc tgccagagtt cacataaaca gacgcttttc 2460 cggtgcatct gtgggagccg tgaggctcaa ccatgaatct gacagtacgg gcgaaacccg 2520 acaggactta aagatcccca ccgtttccgg cgggtcgctc cctcttgcgc tctcctgttc 2580 cgaccctgcc gtttaccgga tacctgttcc gcctttctcc cttacgggaa gtgtggcgct 2640 ttctcatagc tcacacactg gtatctcggc tcggtgtagg tcgttcgctc caagctgggc 2700 tgtaagcaag aactccccgt tcagcccgac tgctgcgcct tatccggtaa ctgttcactt 2760 gagtccaacc cggaaaagca cggtaaaacg ccactggcag cagccattgg taactgggag 2820 ttcgcagagg atttgtttag ctaaacacgc ggttgctctt gaagtgtgcg ccaaagtccg 2880 gctacactgg aaggacagat ttggttgctg tgctctgcga aagccagtta ccacggttaa 2940 gcagttcccc aactgactta accttcgatc aaaccacctc cccaggtggt tttttcgttt 3000 acagggcaaa agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 3060 actgaaccgc tctagatttc agtgcaattt atctcttcaa atgtagcacc tgaagtcagc 3120 cccatacgat ataagttgta attctcatgt tagtcatgcc ccgcgcccac cggaaggagc 3180 tgactgggtt gaaggctctc aagggcatcg gtcgagatcc cggtgcctaa tgagtgagct 3240 aacttacatt aattgcgttg cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc 3300 agctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggcgccagg 3360 gtggtttttc ttttcaccag tgagacgggc aacagctgat tgcccttcac cgcctggccc 3420 tgagagagtt gcagcaagcg gtccacgctg gtttgcccca gcaggcgaaa atcctgtttg 3480 atggtggtta acggcgggat ataacatgag ctgtcttcgg tatcgtcgta tcccactacc 3540 gagatgtccg caccaacgcg cagcccggac tcggtaatgg cgcgcattgc gcccagcgcc 3600 atctgatcgt tggcaaccag catcgcagtg ggaacgatgc cctcattcag catttgcatg 3660 gtttgttgaa aaccggacat ggcactccag tcgccttccc gttccgctat cggctgaatt 3720 tgattgcgag tgagatattt atgccagcca gccagacgca gacgcgccga gacagaactt 3780 aatgggcccg ctaacagcgc gatttgctgg tgacccaatg cgaccagatg ctccacgccc 3840 agtcgcgtac cgtcttcatg ggagaaaata atactgttga tgggtgtctg gtcagagaca 3900 tcaagaaata acgccggaac attagtgcag gcagcttcca cagcaatggc atcctggtca 3960 tccagcggat agttaatgat cagcccactg acgcgttgcg cgagaagatt gtgcaccgcc 4020 gctttacagg cttcgacgcc gcttcgttct accatcgaca ccaccacgct ggcacccagt 4080 tgatcggcgc gagatttaat cgccgcgaca atttgcgacg gcgcgtgcag ggccagactg 4140 gaggtggcaa cgccaatcag caacgactgt ttgcccgcca gttgttgtgc cacgcggttg 4200 ggaatgtaat tcagctccgc catcgccgct tccacttttt cccgcgtttt cgcagaaacg 4260 tggctggcct ggttcaccac gcgggaaacg gtctgataag agacaccggc atactctgcg 4320 acatcgtata acgttactgg tttcacattc accaccctga attgactctc ttccgggcgc 4380 tatcatgcca taccgcgaaa ggttttgcgc cattcgatgg tgtccgggat ctcgacgctc 4440 tcccttatgc gactcctgca ttaggaaatt aatacgactc actata 4486 <210> SEQ ID NO 120 <211> LENGTH: 5221 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCDF-phaB-bdh1, plasmid <400> SEQUENCE: 120 ggggaattgt gagcggataa caattcccct gtagaaataa ttttgtttaa ctttaataag 60 gagatatacc atgacacaaa gaatagcata cgtaacaggt ggtatgggtg gtataggaac 120 tgcaatatgt caaagattag caaaagatgg atttagagtt gtagctggat gcggaccaaa 180 tagtcctaga agagaaaagt ggttagaaca acaaaaagca cttggatttg atttcatagc 240 ttctgaaggt aacgtagcag attgggactc aactaaaact gcttttgata aagttaaatc 300 tgaagttggt gaagttgatg tattaataaa taatgcaggt attactagag atgtagtatt 360 tagaaagatg acaagagctg actgggatgc agtaatagat actaatctta ctagtctttt 420 caatgtaact aagcaggtaa ttgatggtat ggcagataga ggttggggta gaatagtaaa 480 tattagttca gttaatggac aaaaaggtca gtttggacag acaaattatt ctacagctaa 540 agcaggtctt catggtttta caatggcttt agcacaggaa gttgctacaa aaggtgttac 600 agttaacact gttagtccag gatatattgc tactgacatg gtaaaggcta taagacaaga 660 tgttcttgat aaaattgttg ctacaatacc agtaaagaga ttaggacttc ctgaagagat 720 agcatctatt tgtgcatggt tatcaagtga agaatcagga ttctcaactg gtgctgattt 780 ttcattaaac ggtggtttac acatgggata agatccgaat tcgagctcgg cgcgcctgca 840 ggtcgacaag cttgcggccg cataatgctt aagtcgaaca gaaagtaatc gtattgtaca 900 cggccgcata atcgaaatta atacgactca ctatagggga attgtgagcg gataacaatt 960 ccccatctta gtatattagt taagtataag aaggagatat acatatgcaa ttaaaaggta 1020 aaagtgcaat agtaactggt gcagcaagtg gaataggaaa agcaatagca gaattacttg 1080 caaaagaagg tgcagcagta gcaatagctg atttaaattt agaagcagca agagcagcag 1140 cagctggaat agaagcagct ggcggaaaag ctatagctgt agcaatggat gtaactagtg 1200 aagcaagtgt aaatcaagca actgatgaag tagcacaagc atttggaaat atagatatat 1260 tagtaagtaa tgctggaata caaatagtaa atcctataca aaattatgca tttagtgatt 1320 ggaaaaaaat gcaagcaata catgtagatg gtgcattttt aactactaaa gcagcattga 1380 aatatatgta tagagataaa agaggtggaa ctgtaatata tatgggaagt gtacattctc 1440 atgaagcaag tcctttaaaa agtgcttatg tagcagcaaa acatgcatta ttaggattag 1500 caagagtatt agctaaagaa ggtgctgaat tcaacgtaag atctcacgtt atatgtcctg 1560 gatttgtaag aactccttta gtagataaac aaatacctga acaagcaaaa gaattaggaa 1620 taagtgaaga agaagtagtt agaagagtaa tgttaggtgg aacagtagac ggtgtattta 1680 ctactgtaga tgatgtagca agaactgcat tatttttatg tgcatttcct agtgcagcat 1740 taactggaca aagttttata gtaagtcatg gatggtatat gcaataaggt accctcgagt 1800 ctggtaaaga aaccgctgct gcgaaatttg aacgccagca catggactcg tctactagcg 1860 cagcttaatt aacctaggct gctgccaccg ctgagcaata actagcataa ccccttgggg 1920 cctctaaacg ggtcttgagg ggttttttgc tgaaacctca ggcatttgag aagcacacgg 1980 tcacactgct tccggtagtc aataaaccgg taaaccagca atagacataa gcggctattt 2040 aacgaccctg ccctgaaccg acgaccgggt catcgtggcc ggatcttgcg gcccctcggc 2100 ttgaacgaat tgttagacat tatttgccga ctaccttggt gatctcgcct ttcacgtagt 2160 ggacaaattc ttccaactga tctgcgcgcg aggccaagcg atcttcttct tgtccaagat 2220 aagcctgtct agcttcaagt atgacgggct gatactgggc cggcaggcgc tccattgccc 2280 agtcggcagc gacatccttc ggcgcgattt tgccggttac tgcgctgtac caaatgcggg 2340 acaacgtaag cactacattt cgctcatcgc cagcccagtc gggcggcgag ttccatagcg 2400 ttaaggtttc atttagcgcc tcaaatagat cctgttcagg aaccggatca aagagttcct 2460 ccgccgctgg acctaccaag gcaacgctat gttctcttgc ttttgtcagc aagatagcca 2520 gatcaatgtc gatcgtggct ggctcgaaga tacctgcaag aatgtcattg cgctgccatt 2580 ctccaaattg cagttcgcgc ttagctggat aacgccacgg aatgatgtcg tcgtgcacaa 2640 caatggtgac ttctacagcg cggagaatct cgctctctcc aggggaagcc gaagtttcca 2700 aaaggtcgtt gatcaaagct cgccgcgttg tttcatcaag ccttacggtc accgtaacca 2760 gcaaatcaat atcactgtgt ggcttcaggc cgccatccac tgcggagccg tacaaatgta 2820 cggccagcaa cgtcggttcg agatggcgct cgatgacgcc aactacctct gatagttgag 2880 tcgatacttc ggcgatcacc gcttccctca tactcttcct ttttcaatat tattgaagca 2940 tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 3000 aaatagctag ctcactcggt cgctacgctc cgggcgtgag actgcggcgg gcgctgcgga 3060 cacatacaaa gttacccaca gattccgtgg ataagcaggg gactaacatg tgaggcaaaa 3120 cagcagggcc gcgccggtgg cgtttttcca taggctccgc cctcctgcca gagttcacat 3180 aaacagacgc ttttccggtg catctgtggg agccgtgagg ctcaaccatg aatctgacag 3240 tacgggcgaa acccgacagg acttaaagat ccccaccgtt tccggcgggt cgctccctct 3300 tgcgctctcc tgttccgacc ctgccgttta ccggatacct gttccgcctt tctcccttac 3360 gggaagtgtg gcgctttctc atagctcaca cactggtatc tcggctcggt gtaggtcgtt 3420 cgctccaagc tgggctgtaa gcaagaactc cccgttcagc ccgactgctg cgccttatcc 3480 ggtaactgtt cacttgagtc caacccggaa aagcacggta aaacgccact ggcagcagcc 3540 attggtaact gggagttcgc agaggatttg tttagctaaa cacgcggttg ctcttgaagt 3600 gtgcgccaaa gtccggctac actggaagga cagatttggt tgctgtgctc tgcgaaagcc 3660 agttaccacg gttaagcagt tccccaactg acttaacctt cgatcaaacc acctccccag 3720 gtggtttttt cgtttacagg gcaaaagatt acgcgcagaa aaaaaggatc tcaagaagat 3780 cctttgatct tttctactga accgctctag atttcagtgc aatttatctc ttcaaatgta 3840 gcacctgaag tcagccccat acgatataag ttgtaattct catgttagtc atgccccgcg 3900 cccaccggaa ggagctgact gggttgaagg ctctcaaggg catcggtcga gatcccggtg 3960 cctaatgagt gagctaactt acattaattg cgttgcgctc actgcccgct ttccagtcgg 4020 gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 4080 gtattgggcg ccagggtggt ttttcttttc accagtgaga cgggcaacag ctgattgccc 4140 ttcaccgcct ggccctgaga gagttgcagc aagcggtcca cgctggtttg ccccagcagg 4200 cgaaaatcct gtttgatggt ggttaacggc gggatataac atgagctgtc ttcggtatcg 4260 tcgtatccca ctaccgagat gtccgcacca acgcgcagcc cggactcggt aatggcgcgc 4320 attgcgccca gcgccatctg atcgttggca accagcatcg cagtgggaac gatgccctca 4380 ttcagcattt gcatggtttg ttgaaaaccg gacatggcac tccagtcgcc ttcccgttcc 4440 gctatcggct gaatttgatt gcgagtgaga tatttatgcc agccagccag acgcagacgc 4500 gccgagacag aacttaatgg gcccgctaac agcgcgattt gctggtgacc caatgcgacc 4560 agatgctcca cgcccagtcg cgtaccgtct tcatgggaga aaataatact gttgatgggt 4620 gtctggtcag agacatcaag aaataacgcc ggaacattag tgcaggcagc ttccacagca 4680 atggcatcct ggtcatccag cggatagtta atgatcagcc cactgacgcg ttgcgcgaga 4740 agattgtgca ccgccgcttt acaggcttcg acgccgcttc gttctaccat cgacaccacc 4800 acgctggcac ccagttgatc ggcgcgagat ttaatcgccg cgacaatttg cgacggcgcg 4860 tgcagggcca gactggaggt ggcaacgcca atcagcaacg actgtttgcc cgccagttgt 4920 tgtgccacgc ggttgggaat gtaattcagc tccgccatcg ccgcttccac tttttcccgc 4980 gttttcgcag aaacgtggct ggcctggttc accacgcggg aaacggtctg ataagagaca 5040 ccggcatact ctgcgacatc gtataacgtt actggtttca cattcaccac cctgaattga 5100 ctctcttccg ggcgctatca tgccataccg cgaaaggttt tgcgccattc gatggtgtcc 5160 gggatctcga cgctctccct tatgcgactc ctgcattagg aaattaatac gactcactat 5220 a 5221 <210> SEQ ID NO 121 <211> LENGTH: 10922 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL8225-budA::thlA-phaB, plasmid <400> SEQUENCE: 121 aaactccttt ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga 60 gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta 120 atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa 180 gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact 240 gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca 300 tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt 360 accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg 420 ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag 480 cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta 540 agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat 600 ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg 660 tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc 720 ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac 780 cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc 840 gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca gggccccctg cttcggggtc 900 attatagcga ttttttcggt atatccatcc tttttcgcac gatatacagg attttgccaa 960 agggttcgtg tagactttcc ttggtgtatc caacggcgtc agccgggcag gataggtgaa 1020 gtaggcccac ccgcgagcgg gtgttccttc ttcactgtcc cttattcgca cctggcggtg 1080 ctcaacggga atcctgctct gcgaggctgg ccggctaccg ccggcgtaac agatgagggc 1140 aagcggatgg ctgatgaaac caagccaacc aggaagggca gcccacctat caaggtgtac 1200 tgccttccag acgaacgaag agcgattgag gaaaaggcgg cggcggccgg catgagcctg 1260 tcggcctacc tgctggccgt cggccagggc tacaaaatca cgggcgtcgt ggactatgag 1320 cacgtccgcg agctggcccg catcaatggc gacctgggcc gcctgggcgg cctgctgaaa 1380 ctctggctca ccgacgaccc gcgcacggcg cggttcggtg atgccacgat cctcgccctg 1440 ctggcgaaga tcgaagagaa gcaggacgag cttggcaagg tcatgatggg cgtggtccgc 1500 ccgagggcag agccatgact tttttagccg ctaaaacggc cggggggtgc gcgtgattgc 1560 caagcacgtc cccatgcgct ccatcaagaa gagcgacttc gcggagctgg tgaagtacat 1620 caccgacgag caaggcaaga ccgatcgggc cccctgcagg ataaaaaaat tgtagataaa 1680 ttttataaaa tagttttatc tacaattttt ttatcaggaa acagctatga ccgcggccgc 1740 ggcgccaagc ttagaaaaat ataaataaga agtagcttta agagaattaa attattaaga 1800 aaagcaaagg tgtttaaaaa ataaattttt aaacaccttt gcttttctta aattataaat 1860 aagataaaaa agaatcctga ataaaataaa aaggggtgtc tcaaaatttt attttgagac 1920 gacccctttt tattctatat gtcgatgcta tagctgagat cgtggaattc ttgttagcta 1980 ccagattcac atttaagttg tttctctaaa ccacagatta tcaattcaag tccaaaaaga 2040 aatgctggtt ctgcgccttg atgatcaaat aactctattg cttgtcttaa caatggaggc 2100 attgaatctg ttgttggtgt ttctctttcc tcttttgcaa cttgatgttc ttgatcctcc 2160 aatacgcaac ctaaagtaaa atgtcctaca gcacttagtg cgtataaggc attttctaaa 2220 ctaaaaccct gttgacataa gaatgctaat tgattttcta atgtttcata ttgtttttca 2280 gttggtctag ttcctaaatg tactttagcc ccatctctat gtgataatag agcacaacga 2340 aaagatttag cgttattcct aagaaaatct tgccatgatt caccttctaa aggacaaaag 2400 tgagtgtgat gtctatctaa catttcaata gctaaggcgt caagtaaagc tctcttattc 2460 ttcacatgcc aatacaacgt aggttgttct actccaagtt tctgagctaa ctttcttgta 2520 gttagtcctt ctattccaac ttcatttagt aattccaatg cactattgat aactttactt 2580 ttatcaagtc tagacatcat ttaatatcct cctcttcaat atatttaagt cgactgatcg 2640 gatcctgatc ggagctccca tggcggccgg tcgatatcga tgtgtagtag cctgtgaaat 2700 aagtaaggaa aaaaaagaag taagtgttat atatgatgat tattttgtag atgtagatag 2760 gataatagaa tccatagaaa atataggtta tacagttata taaaaattac tttaaaatct 2820 atcattgata gggtaaaata taaatcgtat aaagttgtgt aatttttaag gaggtgtgtt 2880 acagacgtcc gcgagagacc ttaaatatat tgaagaggag gaaatacata tggtttcaag 2940 atatgttcca gatatgggag atttaatatg ggttgatttt gatccaacaa aaggatcaga 3000 acaagcagga catagaccag cagttgtttt atcaccattt atgtataata ataaaacagg 3060 aatgtgttta tgtgttccat gtacaacaca atcaaaagga tatccatttg aagttgtttt 3120 atcaggacaa gaaagagatg gagttgcatt agcagatcaa gttaaatcaa tagcatggag 3180 agcaagagga gcaacaaaaa aaggaacagt tgcaccagaa gaattacaat taataaaagc 3240 aaaaataaat gttttaatag gataatgtta ttaagctagc ataaaaataa gaagcctgca 3300 tttgcaggct tcttattttt atggcgcgcc gttctgaatc cttagctaat ggttcaacag 3360 gtaactatga cgaagatagc accctggata agtctgtaat ggattctaag gcatttaatg 3420 aagacgtgta tataaaatgt gctaatgaaa aagaaaatgc gttaaaagag cctaaaatga 3480 gttcaaatgg ttttgaaatt gattggtagt ttaatttaat atattttttc tattggctat 3540 ctcgatacct atagaatctt ctgttcactt ttgtttttga aatataaaaa ggggcttttt 3600 agcccctttt ttttaaaact ccggaggagt ttcttcattc ttgatactat acgtaactat 3660 tttcgatttg acttcattgt caattaagct agtaaaatca atggttaaaa aacaaaaaac 3720 ttgcattttt ctacctagta atttataatt ttaagtgtcg agtttaaaag tataatttac 3780 caggaaagga gcaagttttt taataaggaa aaatttttcc ttttaaaatt ctatttcgtt 3840 atatgactaa ttataatcaa aaaaatgaaa ataaacaaga ggtaaaaact gctttagaga 3900 aatgtactga taaaaaaaga aaaaatccta gatttacgtc atacatagca cctttaacta 3960 ctaagaaaaa tattgaaagg acttccactt gtggagatta tttgtttatg ttgagtgatg 4020 cagacttaga acattttaaa ttacataaag gtaatttttg cggtaataga ttttgtccaa 4080 tgtgtagttg gcgacttgct tgtaaggata gtttagaaat atctattctt atggagcatt 4140 taagaaaaga agaaaataaa gagtttatat ttttaactct tacaactcca aatgtaaaaa 4200 gttatgatct taattattct attaaacaat ataataaatc ttttaaaaaa ttaatggagc 4260 gtaaggaagt taaggatata actaaaggtt atataagaaa attagaagta acttaccaaa 4320 aggaaaaata cataacaaag gatttatgga aaataaaaaa agattattat caaaaaaaag 4380 gacttgaaat tggtgattta gaacctaatt ttgatactta taatcctcat tttcatgtag 4440 ttattgcagt taataaaagt tattttacag ataaaaatta ttatataaat cgagaaagat 4500 ggttggaatt atggaagttt gctactaagg atgattctat aactcaagtt gatgttagaa 4560 aagcaaaaat taatgattat aaagaggttt acgaacttgc gaaatattca gctaaagaca 4620 ctgattattt aatatcgagg ccagtatttg aaatttttta taaagcatta aaaggcaagc 4680 aggtattagt ttttagtgga ttttttaaag atgcacacaa attgtacaag caaggaaaac 4740 ttgatgttta taaaaagaaa gatgaaatta aatatgtcta tatagtttat tataattggt 4800 gcaaaaaaca atatgaaaaa actagaataa gggaacttac ggaagatgaa aaagaagaat 4860 taaatcaaga tttaatagat gaaatagaaa tagattaaag tgtaactata ctttatatat 4920 atatgattaa aaaaataaaa aacaacagcc tattaggttg ttgtttttta ttttctttat 4980 taattttttt aatttttagt ttttagttct tttttaaaat aagtttcagc ctctttttca 5040 atatttttta aagaaggagt atttgcatga attgcctttt ttctaacaga cttaggaaat 5100 attttaacag tatcttcttg cgccggtgat tttggaactt cataacttac taatttataa 5160 ttattatttt cttttttaat tgtaacagtt gcaaaagaag ctgaacctgt tccttcaact 5220 agtttatcat cttcaatata atattcttga cctatatagt ataaatatat ttttattata 5280 tttttacttt tttctgaatc tattatttta taatcataaa aagttttacc accaaaagaa 5340 ggttgtactc cttctggtcc aacatatttt tttactatat tatctaaata atttttggga 5400 actggtgttg taatttgatt aatcgaacaa ccagttatac ttaaaggaat tataactata 5460 aaaatatata ggattatctt tttaaatttc attattggcc tcctttttat taaatttatg 5520 ttaccataaa aaggacataa cgggaatatg tagaatattt ttaatgtaga caaaatttta 5580 cataaatata aagaaaggaa gtgtttgttt aaattttata gcaaactatc aaaaattagg 5640 gggataaaaa tttatgaaaa aaaggttttc gatgttattt ttatgtttaa ctttaatagt 5700 ttgtggttta tttacaaatt cggccggcct acctcctcgt ataaataaga tgtttttgtt 5760 ttgcttgata ctactttttc ttcacaggaa aatatacttc agtaacaaga tctttaggaa 5820 tggtgacttg gtgggggtca gttacatata cttcatatgg tgggtttgta agtttatatc 5880 cttcattttc tacccattcc ctcaacttag catatacaga gatgttaatt ctgaatatga 5940 gccccttaaa acagacttcg cacaaaggac tccaggcaag tatcttgttc cctttacaat 6000 ctcctttatc ggaatggcaa gttctgtatc attgccagaa ggattgtatt cagcgctgtg 6060 ataaatagtt attggcttac caagaaagtc aattacaaaa atatatataa agaaagcaaa 6120 gctacatata ttaaagcatt taaggtaaaa ctaaaaatat tataaaaatg aaattatttt 6180 ttctcatagc taaagttaca taatacgagg aggatttata atgaaaaaag taataggaat 6240 tataagtatt gtactatttg tactcgtagc acttcaatcc tgtgctgcag gagtaggaaa 6300 tgcattaagt aataacaaag aagctagtgg atctgctgga ttatttttat ctgtatgtat 6360 gcttattgct ggaataatag caataatatc aaaatatagt aaaggtatga ctataacagc 6420 tatagtattt tatttgttag cttttgttgt agggattgct aatgttgggc atttttcaga 6480 tttgcaaatt tggtcaatca ttaacttgat atttgctgga ctattgatat ttcatttgct 6540 taaaaataag caattatata atagcagtgg gaaaaagtag aatcatatat tgtaattatt 6600 tttaattatg ttggcaaaat tgaaattgtc actgaaacac ctctaaatgt tttaaataca 6660 tatgtttaat tattgtgaca gattctaata gtagaaagta gaaatttgct atgttataat 6720 gacatagagg tgaatgtaat atgaaagaag ttgtaatagc tagtgcagta agaacagcga 6780 ttggatctta tggaaagtct cttaaggatg taccagcagt agatttagga gctacagcta 6840 taaaggaagc agttaaaaaa gcaggaataa aaccagagga tgttaatgaa gtcattttag 6900 gaaatgttct tcaagcaggt ttaggacaga atccagcaag acaggcatct tttaaagcag 6960 gattaccagt tgaaattcca gctatgacta ttaataaggt ttgtggttca ggacttagaa 7020 cagttagctt agcagcacaa attataaaag caggagatgc tgacgtaata atagcaggtg 7080 gtatggaaaa tatgtctaga gctccttact tagcgaataa cgctagatgg ggatatagaa 7140 tgggaaacgc taaatttgtt gatgaaatga tcactgacgg attgtgggat gcatttaatg 7200 attaccacat gggaataaca gcagaaaaca tagctgagag atggaacatt tcaagagaag 7260 aacaagatga gtttgctctt gcatcacaaa aaaaagctga agaagctata aaatcaggtc 7320 aatttaaaga tgaaatagtt cctgtagtaa ttaaaggcag aaagggagaa actgtagttg 7380 atacagatga gcaccctaga tttggatcaa ctatagaagg acttgcaaaa ttaaaacctg 7440 ccttcaaaaa agatggaaca gttacagctg gtaatgcatc aggattaaat gactgtgcag 7500 cagtacttgt aatcatgagt gcagaaaaag ctaaagagct tggagtaaaa ccacttgcta 7560 agatagtttc ttatggttca gcaggagttg acccagcaat aatgggatat ggacctttct 7620 atgcaacaaa agcagctatt gaaaaagcag gttggacagt tgatgaatta gatttaatag 7680 aatcaaatga agcttttgca gctcaaagtt tagcagtagc aaaagattta aaatttgata 7740 tgaataaagt aaatgtaaat ggaggagcta ttgcccttgg tcatccaatt ggagcatcag 7800 gtgcaagaat actcgttact cttgtacacg caatgcaaaa aagagatgca aaaaaaggct 7860 tagcaacttt atgtataggt ggcggacaag gaacagcaat attgctagaa aagtgctagg 7920 aattcaggag gtatagcata tgacacaaag aatagcatac gtaacaggtg gtatgggtgg 7980 tataggaact gcaatatgtc aaagattagc aaaagatgga tttagagttg tagctggatg 8040 cggaccaaat agtcctagaa gagaaaagtg gttagaacaa caaaaagcac ttggatttga 8100 tttcatagct tctgaaggta acgtagcaga ttgggactca actaaaactg cttttgataa 8160 agttaaatct gaagttggtg aagttgatgt attaataaat aatgcaggta ttactagaga 8220 tgtagtattt agaaagatga caagagctga ctgggatgca gtaatagata ctaatcttac 8280 tagtcttttc aatgtaacta agcaggtaat tgatggtatg gcagatagag gttggggtag 8340 aatagtaaat attagttcag ttaatggaca aaaaggtcag tttggacaga caaattattc 8400 tacagctaaa gcaggtcttc atggttttac aatggcttta gcacaggaag ttgctacaaa 8460 aggtgttaca gttaacactg ttagtccagg atatattgct actgacatgg taaaggctat 8520 aagacaagat gttcttgata aaattgttgc tacaatacca gtaaagagat taggacttcc 8580 tgaagagata gcatctattt gtgcatggtt atcaagtgaa gaatcaggat tctcaactgg 8640 tgctgatttt tcattaaacg gtggtttaca catgggataa taccgttcgt ataatgtatg 8700 ctatacgaag ttatccttag aagcaaactt aagagtgtgt tgatagtgca gtatcttaaa 8760 attttgtgta taataggaat tgaagttaaa ttagatgcta aaaatttgta attaagaagg 8820 agggattcgt catgttggta ttccaaatgc gtaatgtaga taaaacatct actgttttga 8880 aacagactaa aaacagtgat tacgcagata aataaatacg ttagattaat tcctaccagt 8940 gactaatctt atgacttttt aaacagataa ctaaaattac aaacaaatcg tttaacttct 9000 gtatttattt acagatgtaa tcacttcagg agtaattaca tgaacaaaaa tataaaatat 9060 tctcaaaact ttttaacgag tgaaaaagta ctcaaccaaa taataaaaca attgaattta 9120 aaagaaaccg ataccgttta cgaaattgga acaggtaaag ggcatttaac gacgaaactg 9180 gctaaaataa gtaaacaggt aacgtctatt gaattagaca gtcatctatt caacttatcg 9240 tcagaaaaat taaaactgaa cattcgtgtc actttaattc accaagatat tctacagttt 9300 caattcccta acaaacagag gtataaaatt gttgggagta ttccttacca tttaagcaca 9360 caaattatta aaaaagtggt ttttgaaagc catgcgtctg acatctatct gattgttgaa 9420 gaaggattct acaagcgtac cttggatatt caccgaacac tagggttgct cttgcacact 9480 caagtctcga ttcagcaatt gcttaagctg ccagcggaat gctttcatcc taaaccaaaa 9540 gtaaacagtg tcttaataaa acttacccgc cataccacag atgttccaga taaatattgg 9600 aagctatata cgtactttgt ttcaaaatgg gtcaatcgag aatatcgtca actgtttact 9660 aaaaatcagt ttcatcaagc aatgaaacac gccaaagtaa acaatttaag taccattact 9720 tatgagcaag tattgtctat ttttaatagt tatctattat ttaacgggag gaaataattc 9780 tatgagtcgc ttttttaaat ttggaaagtt acacgttact aaagggaatg gagataaatt 9840 attagatata ctactgacag cttccaagaa gctaaagagg tcataacttc gtataatgta 9900 tgctatacga acggtaagta ttgatagaaa aaaacactag acagtgctaa taacaatgtc 9960 tagtgctttt tatcttgctc aattttttca ttgagttcat ttaagtaagt ccacctgtcc 10020 atcttttcgt ctagctcttt ttccagtgaa ttcttttcgg ataagagatc ttcaagaagt 10080 gcataatcag atgaagcagc ttccatttct attttctttt cagatataga tttttctaga 10140 tgttcaatta cctcatctat tttgtcaaac tccatttgtt ctgcataggt aaattttaga 10200 ggcttttctt tttgcaactt atagttgttt ttagctgtat ttttcttaga gcttattttt 10260 tcctctgata tttttgcagt tttgtgaaaa taggaatagt ttcctgtata ttgagtgatt 10320 ttaccgtttc cttcaaaaga aaatatttta tcaactgttt tgtcaaggaa gtacctgtca 10380 tgagatacag ctataacagc tccttcaaaa tcgttaatat aatcttctag gattgtaagt 10440 gtttctatat ccagatcatt tgttggttcg tccagcaaaa gtacattagg gtaattcatc 10500 agtattttta gaagatataa tcttcttcgt tctcctcctg aaagttttcc aaggggagtc 10560 cattgaactg aaggttcaaa taaaaaattt tcaagtacag cagaagcact tattttttca 10620 cccgatgaag ttgacgcata ttctgatgtc ccacgtatgt attcaattac cctttcgttc 10680 atatccatat cagaaattcc ctgagaatag tatcctatct ttactgtttc acctatatct 10740 atagtgccgc tgtccggcag aattttttga actaaaatat tcataagagt ggatttacca 10800 cttccattag gtccaataat acctattctg tcattattta gtatgttata agtgaaattt 10860 ttaattaatg tcttttcacc aaaacttttg cttatgttat ccaggtttat gactttttgt 10920 tt 10922 <210> SEQ ID NO 122 <211> LENGTH: 43 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN01 <400> SEQUENCE: 122 atttacaaat tcggccggcc tacctcctcg tataaataag atg 43 <210> SEQ ID NO 123 <211> LENGTH: 43 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN02 <400> SEQUENCE: 123 ctagctatta caacttcttt catattacat tcacctctat gtc 43 <210> SEQ ID NO 124 <211> LENGTH: 43 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN03 <400> SEQUENCE: 124 gacatagagg tgaatgtaat atgaaagaag ttgtaatagc tag 43 <210> SEQ ID NO 125 <211> LENGTH: 49 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN04mod <400> SEQUENCE: 125 gtatagcata cattatacga acggtattat cccatgtgta aaccaccgt 49 <210> SEQ ID NO 126 <211> LENGTH: 48 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN05mod <400> SEQUENCE: 126 ttcgtataat gtatgctata cgaagttatc cttagaagca aacttaag 48 <210> SEQ ID NO 127 <211> LENGTH: 46 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN06 <400> SEQUENCE: 127 gtctagtgtt tttttctatc aatactctag ataccgttcg tatagc 46 <210> SEQ ID NO 128 <211> LENGTH: 46 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN07 <400> SEQUENCE: 128 tgtatgctat acgaacggta agtattgata gaaaaaaaca ctagac 46 <210> SEQ ID NO 129 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN08 <400> SEQUENCE: 129 caaaaaggag tttaaacaaa aagtcataaa cctggataac 40 <210> SEQ ID NO 130 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Og31f <400> SEQUENCE: 130 ccgtttctca caacaacaat accag 25 <210> SEQ ID NO 131 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Og32r <400> SEQUENCE: 131 aaaccacctt gacgatgaaa ccata 25 <210> SEQ ID NO 132 <211> LENGTH: 7951 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL8315-Pfdx-thlA-phaB-bld, plasmid <400> SEQUENCE: 132 cctgcaggat aaaaaaattg tagataaatt ttataaaata gttttatcta caattttttt 60 atcaggaaac agctatgacc gcggccgctc actatctgcg gaacctgcct ccttatctga 120 taaaaaatat tcgctgcatc tttgacttgt tattttcttt caaatgccta aaattatctt 180 ttaaaattat aacaaatgtg ataaaataca ggggatgaaa acattatcta aaaattaagg 240 aggtgttaca tatgaaagaa gttgtaatag ctagtgcagt aagaacagcg attggatctt 300 atggaaagtc tcttaaggat gtaccagcag tagatttagg agctacagct ataaaggaag 360 cagttaaaaa agcaggaata aaaccagagg atgttaatga agtcatttta ggaaatgttc 420 ttcaagcagg tttaggacag aatccagcaa gacaggcatc ttttaaagca ggattaccag 480 ttgaaattcc agctatgact attaataagg tttgtggttc aggacttaga acagttagct 540 tagcagcaca aattataaaa gcaggagatg ctgacgtaat aatagcaggt ggtatggaaa 600 atatgtctag agctccttac ttagcgaata acgctagatg gggatataga atgggaaacg 660 ctaaatttgt tgatgaaatg atcactgacg gattgtggga tgcatttaat gattaccaca 720 tgggaataac agcagaaaac atagctgaga gatggaacat ttcaagagaa gaacaagatg 780 agtttgctct tgcatcacaa aaaaaagctg aagaagctat aaaatcaggt caatttaaag 840 atgaaatagt tcctgtagta attaaaggca gaaagggaga aactgtagtt gatacagatg 900 agcaccctag atttggatca actatagaag gacttgcaaa attaaaacct gccttcaaaa 960 aagatggaac agttacagct ggtaatgcat caggattaaa tgactgtgca gcagtacttg 1020 taatcatgag tgcagaaaaa gctaaagagc ttggagtaaa accacttgct aagatagttt 1080 cttatggttc agcaggagtt gacccagcaa taatgggata tggacctttc tatgcaacaa 1140 aagcagctat tgaaaaagca ggttggacag ttgatgaatt agatttaata gaatcaaatg 1200 aagcttttgc agctcaaagt ttagcagtag caaaagattt aaaatttgat atgaataaag 1260 taaatgtaaa tggaggagct attgcccttg gtcatccaat tggagcatca ggtgcaagaa 1320 tactcgttac tcttgtacac gcaatgcaaa aaagagatgc aaaaaaaggc ttagcaactt 1380 tatgtatagg tggcggacaa ggaacagcaa tattgctaga aaagtgctag gaattcagga 1440 ggtatagcat atgacacaaa gaatagcata cgtaacaggt ggtatgggtg gtataggaac 1500 tgcaatatgt caaagattag caaaagatgg atttagagtt gtagctggat gcggaccaaa 1560 tagtcctaga agagaaaagt ggttagaaca acaaaaagca cttggatttg atttcatagc 1620 ttctgaaggt aacgtagcag attgggactc aactaaaact gcttttgata aagttaaatc 1680 tgaagttggt gaagttgatg tattaataaa taatgcaggt attactagag atgtagtatt 1740 tagaaagatg acaagagctg actgggatgc agtaatagat actaatctta ctagtctttt 1800 caatgtaact aagcaggtaa ttgatggtat ggcagataga ggttggggta gaatagtaaa 1860 tattagttca gttaatggac aaaaaggtca gtttggacag acaaattatt ctacagctaa 1920 agcaggtctt catggtttta caatggcttt agcacaggaa gttgctacaa aaggtgttac 1980 agttaacact gttagtccag gatatattgc tactgacatg gtaaaggcta taagacaaga 2040 tgttcttgat aaaattgttg ctacaatacc agtaaagaga ttaggacttc ctgaagagat 2100 agcatctatt tgtgcatggt tatcaagtga agaatcagga ttctcaactg gtgctgattt 2160 ttcattaaac ggtggtttac acatgggata agaaggagat atacatatga taaaagatac 2220 acttgttagt attacaaaag atttaaaact taaaactaat gttgaaaatg caaatcttaa 2280 aaattataaa gatgatagtt cttgttttgg agtatttgaa aatgttgaaa atgcaataag 2340 taatgcagta catgctcaaa aaattttatc tcttcattat acaaaagaac agagagaaaa 2400 aattataact gaaattagaa aagcagcttt agaaaataaa gaaatattag ctacaatgat 2460 tcttgaagaa actcacatgg gaagatatga agataaaata cttaaacatg aacttgtagc 2520 aaaatataca cctggaactg aagatttaac tacaactgct tggtcaggtg ataatggact 2580 tacagtagtt gaaatgagtc cttatggagt tataggagca attacacctt ctactaatcc 2640 aacagaaact gtaatatgta attcaattgg tatgattgca gctggaaata ctgtagtttt 2700 taatggtcat cctggagcta aaaaatgtgt agcatttgct gttgaaatga ttaataaagc 2760 tataattagt tgtggaggtc ctgaaaatct tgttacaact ataaaaaatc caacaatgga 2820 ttctcttgat gcaataatta aacatccttc aattaaactt ctttgtggta caggaggtcc 2880 aggaatggta aaaactcttc ttaattctgg taaaaaagct ataggagcag gtgctggaaa 2940 tcctccagta attgttgatg atacagcaga tatagaaaaa gctggtaaat caattattga 3000 aggatgtagt tttgataata atttaccatg tatagcagaa aaagaagtat ttgtttttga 3060 aaatgttgct gatgatttaa ttagtaatat gcttaaaaat aatgcagtaa taattaatga 3120 agatcaagtt tctaaactta tagatttagt attacagaaa aataatgaaa cacaggaata 3180 ttctattaat aaaaaatggg taggaaaaga tgcaaaatta tttcttgatg aaatagatgt 3240 agaatcacct tcaagtgtta aatgtataat ttgtgaagtt tctgcttcac atccatttgt 3300 aatgactgaa ttaatgatgc ctatacttcc aattgtaaga gttaaagata tagatgaagc 3360 aatagaatat gcaaaaattg ctgaacagaa tagaaaacat agtgcttata tttattctaa 3420 aaatatagat aatttaaata gatttgaaag agaaatagat acaactattt ttgttaaaaa 3480 tgcaaaatca tttgctggtg taggatatga agcagaaggt tttacaactt ttacaatagc 3540 tggaagtact ggtgaaggta ttacaagtgc aagaaatttt acaagacaga gaagatgtgt 3600 tttagcaggt taatctagag tcgacgtcac gcgtccatgg agatctcgag gcctgcagac 3660 atgcaagctt ggcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 3720 cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 3780 cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg cgctagcata 3840 aaaataagaa gcctgcattt gcaggcttct tatttttatg gcgcgccgcc attatttttt 3900 tgaacaattg acaattcatt tcttattttt tattaagtga tagtcaaaag gcataacagt 3960 gctgaataga aagaaattta cagaaaagaa aattatagaa tttagtatga ttaattatac 4020 tcatttatga atgtttaatt gaatacaaaa aaaaatactt gttatgtatt caattacggg 4080 ttaaaatata gacaagttga aaaatttaat aaaaaaataa gtcctcagct cttatatatt 4140 aagctaccaa cttagtatat aagccaaaac ttaaatgtgc taccaacaca tcaagccgtt 4200 agagaactct atctatagca atatttcaaa tgtaccgaca tacaagagaa acattaacta 4260 tatatattca atttatgaga ttatcttaac agatataaat gtaaattgca ataagtaaga 4320 tttagaagtt tatagccttt gtgtattgga agcagtacgc aaaggctttt ttatttgata 4380 aaaattagaa gtatatttat tttttcataa ttaatttatg aaaatgaaag ggggtgagca 4440 aagtgacaga ggaaagcagt atcttatcaa ataacaaggt attagcaata tcattattga 4500 ctttagcagt aaacattatg acttttatag tgcttgtagc taagtagtac gaaaggggga 4560 gctttaaaaa gctccttgga atacatagaa ttcataaatt aatttatgaa aagaagggcg 4620 tatatgaaaa cttgtaaaaa ttgcaaagag tttattaaag atactgaaat atgcaaaata 4680 cattcgttga tgattcatga taaaacagta gcaacctatt gcagtaaata caatgagtca 4740 agatgtttac ataaagggaa agtccaatgt attaattgtt caaagatgaa ccgatatgga 4800 tggtgtgcca taaaaatgag atgttttaca gaggaagaac agaaaaaaga acgtacatgc 4860 attaaatatt atgcaaggag ctttaaaaaa gctcatgtaa agaagagtaa aaagaaaaaa 4920 taatttattt attaatttaa tattgagagt gccgacacag tatgcactaa aaaatatatc 4980 tgtggtgtag tgagccgata caaaaggata gtcactcgca ttttcataat acatcttatg 5040 ttatgattat gtgtcggtgg gacttcacga cgaaaaccca caataaaaaa agagttcggg 5100 gtagggttaa gcatagttga ggcaactaaa caatcaagct aggatatgca gtagcagacc 5160 gtaaggtcgt tgtttaggtg tgttgtaata catacgctat taagatgtaa aaatacggat 5220 accaatgaag ggaaaagtat aatttttgga tgtagtttgt ttgttcatct atgggcaaac 5280 tacgtccaaa gccgtttcca aatctgctaa aaagtatatc ctttctaaaa tcaaagtcaa 5340 gtatgaaatc ataaataaag tttaattttg aagttattat gatattatgt ttttctatta 5400 aaataaatta agtatataga atagtttaat aatagtatat acttaatgtg ataagtgtct 5460 gacagtgtca cagaaaggat gattgttatg gattataagc ggccggccag tgggcaagtt 5520 gaaaaattca caaaaatgtg gtataatatc tttgttcatt agagcgataa acttgaattt 5580 gagagggaac ttagatggta tttgaaaaaa ttgataaaaa tagttggaac agaaaagagt 5640 attttgacca ctactttgca agtgtacctt gtacctacag catgaccgtt aaagtggata 5700 tcacacaaat aaaggaaaag ggaatgaaac tatatcctgc aatgctttat tatattgcaa 5760 tgattgtaaa ccgccattca gagtttagga cggcaatcaa tcaagatggt gaattgggga 5820 tatatgatga gatgatacca agctatacaa tatttcacaa tgatactgaa acattttcca 5880 gcctttggac tgagtgtaag tctgacttta aatcattttt agcagattat gaaagtgata 5940 cgcaacggta tggaaacaat catagaatgg aaggaaagcc aaatgctccg gaaaacattt 6000 ttaatgtatc tatgataccg tggtcaacct tcgatggctt taatctgaat ttgcagaaag 6060 gatatgatta tttgattcct atttttacta tggggaaata ttataaagaa gataacaaaa 6120 ttatacttcc tttggcaatt caagttcatc acgcagtatg tgacggattt cacatttgcc 6180 gttttgtaaa cgaattgcag gaattgataa atagttaact tcaggtttgt ctgtaactaa 6240 aaacaagtat ttaagcaaaa acatcgtaga aatacggtgt tttttgttac cctaagttta 6300 aactcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag 6360 cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 6420 tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 6480 agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 6540 ttcttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 6600 acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 6660 ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 6720 gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 6780 gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 6840 gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc 6900 tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 6960 caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 7020 tttgctggcc ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc 7080 gtattaccgc ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg 7140 agtcagtgag cgaggaagcg gaagagcgcc caatacgcag ggccccctgc ttcggggtca 7200 ttatagcgat tttttcggta tatccatcct ttttcgcacg atatacagga ttttgccaaa 7260 gggttcgtgt agactttcct tggtgtatcc aacggcgtca gccgggcagg ataggtgaag 7320 taggcccacc cgcgagcggg tgttccttct tcactgtccc ttattcgcac ctggcggtgc 7380 tcaacgggaa tcctgctctg cgaggctggc cggctaccgc cggcgtaaca gatgagggca 7440 agcggatggc tgatgaaacc aagccaacca ggaagggcag cccacctatc aaggtgtact 7500 gccttccaga cgaacgaaga gcgattgagg aaaaggcggc ggcggccggc atgagcctgt 7560 cggcctacct gctggccgtc ggccagggct acaaaatcac gggcgtcgtg gactatgagc 7620 acgtccgcga gctggcccgc atcaatggcg acctgggccg cctgggcggc ctgctgaaac 7680 tctggctcac cgacgacccg cgcacggcgc ggttcggtga tgccacgatc ctcgccctgc 7740 tggcgaagat cgaagagaag caggacgagc ttggcaaggt catgatgggc gtggtccgcc 7800 cgagggcaga gccatgactt ttttagccgc taaaacggcc ggggggtgcg cgtgattgcc 7860 aagcacgtcc ccatgcgctc catcaagaag agcgacttcg cggagctggt gaagtacatc 7920 accgacgagc aaggcaagac cgatcgggcc c 7951 <210> SEQ ID NO 133 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: bld-phaB-F1, forward <400> SEQUENCE: 133 acatgggata agaaggagat atacatatga taaaag 36 <210> SEQ ID NO 134 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: bld-pMTL-R1, forward <400> SEQUENCE: 134 cgtcgactct agattaacct gctaaaacac atcttc 36 <210> SEQ ID NO 135 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL-bld-F1, forward <400> SEQUENCE: 135 gtgttttagc aggttaatct agagtcgacg tcacgc 36 <210> SEQ ID NO 136 <211> LENGTH: 1179 <212> TYPE: DNA <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: thlA <400> SEQUENCE: 136 atgaaagaag ttgtaatagc tagtgcagta agaacagcga ttggatctta tggaaagtct 60 cttaaggatg taccagcagt agatttagga gctacagcta taaaggaagc agttaaaaaa 120 gcaggaataa aaccagagga tgttaatgaa gtcattttag gaaatgttct tcaagcaggt 180 ttaggacaga atccagcaag acaggcatct tttaaagcag gattaccagt tgaaattcca 240 gctatgacta ttaataaggt ttgtggttca ggacttagaa cagttagctt agcagcacaa 300 attataaaag caggagatgc tgacgtaata atagcaggtg gtatggaaaa tatgtctaga 360 gctccttact tagcgaataa cgctagatgg ggatatagaa tgggaaacgc taaatttgtt 420 gatgaaatga tcactgacgg attgtgggat gcatttaatg attaccacat gggaataaca 480 gcagaaaaca tagctgagag atggaacatt tcaagagaag aacaagatga gtttgctctt 540 gcatcacaaa aaaaagctga agaagctata aaatcaggtc aatttaaaga tgaaatagtt 600 cctgtagtaa ttaaaggcag aaagggagaa actgtagttg atacagatga gcaccctaga 660 tttggatcaa ctatagaagg acttgcaaaa ttaaaacctg ccttcaaaaa agatggaaca 720 gttacagctg gtaatgcatc aggattaaat gactgtgcag cagtacttgt aatcatgagt 780 gcagaaaaag ctaaagagct tggagtaaaa ccacttgcta agatagtttc ttatggttca 840 gcaggagttg acccagcaat aatgggatat ggacctttct atgcaacaaa agcagctatt 900 gaaaaagcag gttggacagt tgatgaatta gatttaatag aatcaaatga agcttttgca 960 gctcaaagtt tagcagtagc aaaagattta aaatttgata tgaataaagt aaatgtaaat 1020 ggaggagcta ttgcccttgg tcatccaatt ggagcatcag gtgcaagaat actcgttact 1080 cttgtacacg caatgcaaaa aagagatgca aaaaaaggct tagcaacttt atgtataggt 1140 ggcggacaag gaacagcaat attgctagaa aagtgctag 1179 <210> SEQ ID NO 137 <211> LENGTH: 849 <212> TYPE: DNA <213> ORGANISM: Clostridium kluyveri <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: hbd1 <400> SEQUENCE: 137 atgagtatta aaagtgtagc ggttttaggt agtggaacta tgtctcgtgg aattgtgcag 60 gcttttgcag aagcaggtat agatgtaatt atccgtggaa gaactgaagg tagtattgga 120 aaaggtctag cagcagtaaa gaaagcttat gataaaaaag tatcaaaggg gaaaatttcc 180 caggaagatg ctgataaaat agttggaaga gtaagtacaa caactgaact tgaaaaattg 240 gctgattgtg atcttataat agaagcagca tcagaggata tgaatataaa gaaagactat 300 tttggaaaat tagaagaaat atgcaagcct gaaacaattt ttgctactaa tacttcttca 360 ttatctataa ctgaagtagc aacagctaca aagagaccag ataaattcat aggaatgcat 420 ttctttaatc cagcaaatgt tatgaaatta gttgaaatca taagaggtat gaatacttca 480 caagaaactt ttgatattat aaaagaagct tccattaaaa taggaaaaac tcctgtagaa 540 gttgcagaag ctccaggatt tgttgtaaac aagatattag taccaatgat caatgaagca 600 gtaggaattt tggcagaagg aatagcttca gcagaagata tcgatacagc tatgaaatta 660 ggcgctaatc acccaatggg tcctttagca ttaggagatc ttattggact tgatgtagtt 720 cttgcagtta tggatgtact ttatagtgaa actggagatt caaaatatag agctcataca 780 ttacttagaa aatatgtaag agcaggatgg cttggaagaa aatcaggaaa aggattcttc 840 gcttattaa 849 <210> SEQ ID NO 138 <211> LENGTH: 176 <212> TYPE: DNA <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: ferredoxin promoter <400> SEQUENCE: 138 ggccgcgctc actatctgcg gaacctgcct ccttatctga taaaaaatat tcgctgcatc 60 tttgacttgt tattttcttt caaatgccta aaattatctt ttaaaattat aacaaatgtg 120 ataaaataca ggggatgaaa acattatcta aaaattaagg aggtgttaca gaattc 176 <210> SEQ ID NO 139 <211> LENGTH: 474 <212> TYPE: DNA <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pyruvate-ferredoxin oxidoreductase promoter <400> SEQUENCE: 139 aaaatagttg ataataatgc agagttataa acaaaggtga aaagcattac ttgtattctt 60 ttttatatat tattataaat taaaatgaag ctgtattaga aaaaatacac acctgtaata 120 taaaatttta aattaatttt taattttttc aaaatgtatt ttacatgttt agaattttga 180 tgtatattaa aatagtagaa tacataagat acttaattta attaaagata gttaagtact 240 tttcaatgtg cttttttaga tgtttaatac aaatctttaa ttgtaaaaga aatgctgtac 300 tatttactgt actagtgacg ggattaaact gtattaatta taaataaaaa ataagtacag 360 ttgtttaaaa ttatattttg tattaaatct aatagtacga tgtaagttat tttatactat 420 tgctagttta ataaaaagat ttaattatat gcttgaaaag gagaggaatt cata 474 <210> SEQ ID NO 140 <211> LENGTH: 22 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: ribosome binding site rbs2 <400> SEQUENCE: 140 aaatagaaag gaggtgttac at 22 <210> SEQ ID NO 141 <211> LENGTH: 35 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Pfdx-F1, forward <400> SEQUENCE: 141 aaaggtctcc ggccgcgctc actatctgcg gaacc 35 <210> SEQ ID NO 142 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Pfdx-R1, reverse <400> SEQUENCE: 142 tttggtctcg aattctgtaa cacctcctta atttttag 38 <210> SEQ ID NO 143 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ppfor-F1, forward <400> SEQUENCE: 143 aaaggtctcc ggccgcaaaa tagttgataa taatgcagag 40 <210> SEQ ID NO 144 <211> LENGTH: 35 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ppfor-R1, reverse <400> SEQUENCE: 144 tttggtctcg aattcctctc cttttcaagc atata 35 <210> SEQ ID NO 145 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: hbd1-F1, forward <400> SEQUENCE: 145 aaaggtctcg aattcaaaga tctatgtcta ttaaatcagt tgcag 45 <210> SEQ ID NO 146 <211> LENGTH: 47 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: hbd1-R1, reverse <400> SEQUENCE: 146 tttggtctcc ctcctttcta tttctaatat gcgaaaaatc ctttacc 47 <210> SEQ ID NO 147 <211> LENGTH: 49 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: thlA-F1, forward <400> SEQUENCE: 147 aaaggtctca ggaggtgtta catatgaaag aagttgtaat agctagtgc 49 <210> SEQ ID NO 148 <211> LENGTH: 48 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: thlA-R1, reverse <400> SEQUENCE: 148 tttggtctcc tcgagtatgg atccctagca cttttctagc aatattgc 48 <210> SEQ ID NO 149 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ppfor-F2, forward <400> SEQUENCE: 149 aaacagctat gaccgcggcc gcaaaatagt 30 <210> SEQ ID NO 150 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ppfor-R2, reverse <400> SEQUENCE: 150 ttactcattg gattcctctc cttt 24 <210> SEQ ID NO 151 <211> LENGTH: 29 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ptb-Buk-F2, forward <400> SEQUENCE: 151 ggaatccaat gagtaaaaac tttgatgag 29 <210> SEQ ID NO 152 <211> LENGTH: 37 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ptb-Buk-F2, reverse <400> SEQUENCE: 152 caggcctcga gatctcctag taaaccttag cttgttc 37 <210> SEQ ID NO 153 <211> LENGTH: 7884 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL82256-ptb-buk, plasmid <400> SEQUENCE: 153 gagatctcga ggcctgcaga catgcaagct tggcactggc cgtcgtttta caacgtcgtg 60 actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 120 gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 180 atggcgaatg gcgctagcat aaaaataaga agcctgcatt tgcaggcttc ttatttttat 240 ggcgcgccgt tctgaatcct tagctaatgg ttcaacaggt aactatgacg aagatagcac 300 cctggataag tctgtaatgg attctaaggc atttaatgaa gacgtgtata taaaatgtgc 360 taatgaaaaa gaaaatgcgt taaaagagcc taaaatgagt tcaaatggtt ttgaaattga 420 ttggtagttt aatttaatat attttttcta ttggctatct cgatacctat agaatcttct 480 gttcactttt gtttttgaaa tataaaaagg ggctttttag cccctttttt ttaaaactcc 540 ggaggagttt cttcattctt gatactatac gtaactattt tcgatttgac ttcattgtca 600 attaagctag taaaatcaat ggttaaaaaa caaaaaactt gcatttttct acctagtaat 660 ttataatttt aagtgtcgag tttaaaagta taatttacca ggaaaggagc aagtttttta 720 ataaggaaaa atttttcctt ttaaaattct atttcgttat atgactaatt ataatcaaaa 780 aaatgaaaat aaacaagagg taaaaactgc tttagagaaa tgtactgata aaaaaagaaa 840 aaatcctaga tttacgtcat acatagcacc tttaactact aagaaaaata ttgaaaggac 900 ttccacttgt ggagattatt tgtttatgtt gagtgatgca gacttagaac attttaaatt 960 acataaaggt aatttttgcg gtaatagatt ttgtccaatg tgtagttggc gacttgcttg 1020 taaggatagt ttagaaatat ctattcttat ggagcattta agaaaagaag aaaataaaga 1080 gtttatattt ttaactctta caactccaaa tgtaaaaagt tatgatctta attattctat 1140 taaacaatat aataaatctt ttaaaaaatt aatggagcgt aaggaagtta aggatataac 1200 taaaggttat ataagaaaat tagaagtaac ttaccaaaag gaaaaataca taacaaagga 1260 tttatggaaa ataaaaaaag attattatca aaaaaaagga cttgaaattg gtgatttaga 1320 acctaatttt gatacttata atcctcattt tcatgtagtt attgcagtta ataaaagtta 1380 ttttacagat aaaaattatt atataaatcg agaaagatgg ttggaattat ggaagtttgc 1440 tactaaggat gattctataa ctcaagttga tgttagaaaa gcaaaaatta atgattataa 1500 agaggtttac gaacttgcga aatattcagc taaagacact gattatttaa tatcgaggcc 1560 agtatttgaa attttttata aagcattaaa aggcaagcag gtattagttt ttagtggatt 1620 ttttaaagat gcacacaaat tgtacaagca aggaaaactt gatgtttata aaaagaaaga 1680 tgaaattaaa tatgtctata tagtttatta taattggtgc aaaaaacaat atgaaaaaac 1740 tagaataagg gaacttacgg aagatgaaaa agaagaatta aatcaagatt taatagatga 1800 aatagaaata gattaaagtg taactatact ttatatatat atgattaaaa aaataaaaaa 1860 caacagccta ttaggttgtt gttttttatt ttctttatta atttttttaa tttttagttt 1920 ttagttcttt tttaaaataa gtttcagcct ctttttcaat attttttaaa gaaggagtat 1980 ttgcatgaat tgcctttttt ctaacagact taggaaatat tttaacagta tcttcttgcg 2040 ccggtgattt tggaacttca taacttacta atttataatt attattttct tttttaattg 2100 taacagttgc aaaagaagct gaacctgttc cttcaactag tttatcatct tcaatataat 2160 attcttgacc tatatagtat aaatatattt ttattatatt tttacttttt tctgaatcta 2220 ttattttata atcataaaaa gttttaccac caaaagaagg ttgtactcct tctggtccaa 2280 catatttttt tactatatta tctaaataat ttttgggaac tggtgttgta atttgattaa 2340 tcgaacaacc agttatactt aaaggaatta taactataaa aatatatagg attatctttt 2400 taaatttcat tattggcctc ctttttatta aatttatgtt accataaaaa ggacataacg 2460 ggaatatgta gaatattttt aatgtagaca aaattttaca taaatataaa gaaaggaagt 2520 gtttgtttaa attttatagc aaactatcaa aaattagggg gataaaaatt tatgaaaaaa 2580 aggttttcga tgttattttt atgtttaact ttaatagttt gtggtttatt tacaaattcg 2640 gccggccgaa gcaaacttaa gagtgtgttg atagtgcagt atcttaaaat tttgtataat 2700 aggaattgaa gttaaattag atgctaaaaa tttgtaatta agaaggagtg attacatgaa 2760 caaaaatata aaatattctc aaaacttttt aacgagtgaa aaagtactca accaaataat 2820 aaaacaattg aatttaaaag aaaccgatac cgtttacgaa attggaacag gtaaagggca 2880 tttaacgacg aaactggcta aaataagtaa acaggtaacg tctattgaat tagacagtca 2940 tctattcaac ttatcgtcag aaaaattaaa actgaatact cgtgtcactt taattcacca 3000 agatattcta cagtttcaat tccctaacaa acagaggtat aaaattgttg ggagtattcc 3060 ttaccattta agcacacaaa ttattaaaaa agtggttttt gaaagccatg cgtctgacat 3120 ctatctgatt gttgaagaag gattctacaa gcgtaccttg gatattcacc gaacactagg 3180 gttgctcttg cacactcaag tctcgattca gcaattgctt aagctgccag cggaatgctt 3240 tcatcctaaa ccaaaagtaa acagtgtctt aataaaactt acccgccata ccacagatgt 3300 tccagataaa tattggaagc tatatacgta ctttgtttca aaatgggtca atcgagaata 3360 tcgtcaactg tttactaaaa atcagtttca tcaagcaatg aaacacgcca aagtaaacaa 3420 tttaagtacc gttacttatg agcaagtatt gtctattttt aatagttatc tattatttaa 3480 cgggaggaaa taattctatg agtcgctttt gtaaatttgg aaagttacac gttactaaag 3540 ggaatgtgtt taaactcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 3600 cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 3660 ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 3720 tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 3780 taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 3840 caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 3900 agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 3960 gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 4020 gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 4080 ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 4140 acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 4200 tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 4260 ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt 4320 ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga 4380 ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc agggccccct 4440 gcttcggggt cattatagcg attttttcgg tatatccatc ctttttcgca cgatatacag 4500 gattttgcca aagggttcgt gtagactttc cttggtgtat ccaacggcgt cagccgggca 4560 ggataggtga agtaggccca cccgcgagcg ggtgttcctt cttcactgtc ccttattcgc 4620 acctggcggt gctcaacggg aatcctgctc tgcgaggctg gccggctacc gccggcgtaa 4680 cagatgaggg caagcggatg gctgatgaaa ccaagccaac caggaagggc agcccaccta 4740 tcaaggtgta ctgccttcca gacgaacgaa gagcgattga ggaaaaggcg gcggcggccg 4800 gcatgagcct gtcggcctac ctgctggccg tcggccaggg ctacaaaatc acgggcgtcg 4860 tggactatga gcacgtccgc gagctggccc gcatcaatgg cgacctgggc cgcctgggcg 4920 gcctgctgaa actctggctc accgacgacc cgcgcacggc gcggttcggt gatgccacga 4980 tcctcgccct gctggcgaag atcgaagaga agcaggacga gcttggcaag gtcatgatgg 5040 gcgtggtccg cccgagggca gagccatgac ttttttagcc gctaaaacgg ccggggggtg 5100 cgcgtgattg ccaagcacgt ccccatgcgc tccatcaaga agagcgactt cgcggagctg 5160 gtgaagtaca tcaccgacga gcaaggcaag accgatcggg ccccctgcag gataaaaaaa 5220 ttgtagataa attttataaa atagttttat ctacaatttt tttatcagga aacagctatg 5280 accgcggccg caaaatagtt gataataatg cagagttata aacaaaggtg aaaagcatta 5340 cttgtattct tttttatata ttattataaa ttaaaatgaa gctgtattag aaaaaataca 5400 cacctgtaat ataaaatttt aaattaattt ttaatttttt caaaatgtat tttacatgtt 5460 tagaattttg atgtatatta aaatagtaga atacataaga tacttaattt aattaaagat 5520 agttaagtac ttttcaatgt gcttttttag atgtttaata caaatcttta attgtaaaag 5580 aaatgctgta ctatttactg tactagtgac gggattaaac tgtattaatt ataaataaaa 5640 aataagtaca gttgtttaaa attatatttt gtattaaatc taatagtacg atgtaagtta 5700 ttttatacta ttgctagttt aataaaaaga tttaattata tgcttgaaaa ggagaggaat 5760 ccaatgagta aaaactttga tgagttatta tcaagattaa aggaagttcc aacaaaaaaa 5820 gtggctgtag ccgtagcaca agatgaacca gtattagagg ctataaaaga agctacagaa 5880 aataacatcg cacaagcaat attggttggt gataaacaac aaatccatga aatcgcaaag 5940 aaaataaact tggacttatc tgattatgaa ataatggata ttaaagatcc aaagaaagca 6000 acattagaag cagtaaaatt agtttctagt ggtcatgcag atatgttaat gaaaggtcta 6060 gttgatactg caacattcct aagaagcgta ttaaacaaag aggttggtct tagaacagga 6120 aaattaatgt cccatgtagc tgtgtttgat gtggaaggtt gggatagact gttattttta 6180 actgatgcag catttaatac atatccagaa tttaaggata aagttggaat gataaataat 6240 gcagttgtag ttgctcatgc atgtggaata gatgttccaa gagtagcacc tatatgccca 6300 gttgaagttg taaatacaag tatgcaatca acagttgatg cagcattgtt agctaaaatg 6360 agtgacaggg ggcaaattaa aggatgcgta attgatggac cttttgcctt agataatgca 6420 atatcagaag aagcagctca tcataaaggt gttacaggat cagtagcagg taaagctgat 6480 atattattat taccaaatat agaagcagca aatgtaatgt ataaaacatt aacatatttc 6540 tctaaatcaa gaaatggtgg acttttagta ggtacatcag caccagtaat tttaacttca 6600 agagcagatt cattcgaaac taaagttaat tcaattgctc ttgcagcatt agttgcagca 6660 agaaataagt aataaatcaa tccataataa ttaatgcata attaatggag agatttatat 6720 ggaatttgca atgcactatt agattctata ataatttctt ctgaaaatta tgcattatga 6780 ctgtatagaa tgcattaaat ttaaggggga ttcagaatgt catataagct attaataatc 6840 aatccaggtt caacatcaac aaagattggt gtttacgaag gagaaaagga actatttgaa 6900 gaaactttga gacacacaaa tgaagaaata aagagatatg atacaatata tgatcaattt 6960 gaatttagaa aagaagttat attaaatgtt cttaaagaaa agaattttga tataaagact 7020 ctaagtgcta ttgttggtag aggtggaatg cttagaccag ttgaaggtgg aacatatgca 7080 gtaaatgatg caatggttga agatttaaaa gttggagttc aaggacctca tgcttctaac 7140 cttggcggaa taattgccaa gtcaattgga gatgaattaa atattccatc atttatagta 7200 gatccagttg ttacagatga gttagcagat gtagcaagac tatctggagt accagaacta 7260 ccaagaaaaa gtaaattcca tgctttaaat caaaaagcgg tagctaaaag atatggaaaa 7320 gaaagtggac aaggatatga aaacctaaat cttgtagttg tacatatggg tggaggcgtt 7380 tcagttggtg ctcacaatca tgggaaagtt gtcgatgtaa ataatgcatt agatggagat 7440 ggcccattct caccagaaag agctggatca gttccaattg gtgatttagt taaaatgtgt 7500 tttagtggaa aatatagtga agcagaagta tatggcaagg ctgtaggaaa aggtggattt 7560 gttggttatc taaacacaaa tgatgtaaaa ggtgttattg ataagatgga agaaggagat 7620 aaagaatgtg aatcaatata caaagcattt gtttatcaaa tttcaaaagc aatcggagaa 7680 atgtcagttg tattagaagg taaagttgat caaattattt ttaccggagg aattgcatac 7740 tcaccaacac ttgttccaga ccttaaagca aaagttgaat ggatagcccc agttacagtt 7800 tatcctggag aagatgaatt acttgctcta gctcaaggtg ctataagagt acttgatgga 7860 gaagaacaag ctaaggttta ctag 7884 <210> SEQ ID NO 154 <211> LENGTH: 436 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 1 <400> SEQUENCE: 154 Met Asn Asn Asp Asn Cys Thr Ile Lys Ile Thr Pro Glu Val Ser Arg 1 5 10 15 Val Asp Glu Pro Val Asp Ile Lys Ile Asn Gly Leu Pro Lys Asn Glu 20 25 30 Lys Val Ile Ile Arg Ala Val Ser Ser Asp Tyr Tyr Cys Ile Asn Ala 35 40 45 Ser Ile Leu Glu Ile Gly Asp Asn Thr Leu Trp Glu Ser Tyr Ala Val 50 55 60 Phe Glu Thr Asp Glu Cys Gly Asn Ile Asn Phe Glu Asn Ala Val Pro 65 70 75 80 Val Asp Gly Thr Tyr Ser Asn Cys Asp Lys Met Gly Leu Phe Tyr Ser 85 90 95 Met Arg Pro Lys Gln Ile Arg Lys Ser Lys Leu Ile Gln Lys Leu Ser 100 105 110 Ser Ile Asn Glu Asn Arg Lys Tyr Lys Ile Thr Phe Thr Val Glu Lys 115 120 125 Asn Gly Lys Ile Ile Gly Ser Lys Glu His Thr Arg Val Tyr Cys Asp 130 135 140 Asp Thr Ile Lys Ser Ile Asp Val Val Glu Lys Asn Leu Leu Ala Arg 145 150 155 160 Tyr Phe Thr Ser Lys Asp Asn Ile Lys His Pro Ala Ile Ile Val Leu 165 170 175 Ser Gly Ser Asp Gly Arg Ile Glu Lys Ala Gln Ala Ile Ala Glu Leu 180 185 190 Phe Ala Met Arg Gly Tyr Ser Ala Leu Ala Val Cys Tyr Phe Gly Leu 195 200 205 Glu Gly Thr Pro Glu Asp Leu Asn Met Ile Pro Leu Glu Tyr Val Glu 210 215 220 Asn Ala Val Lys Trp Leu Lys Arg Gln Asp Thr Val Asp Glu Asn Lys 225 230 235 240 Ile Ala Ile Tyr Gly Arg Ser Lys Gly Gly Glu Leu Val Leu Leu Ala 245 250 255 Ala Ser Met Phe Lys Asp Ile Ala Cys Val Ile Ala Asn Thr Pro Ser 260 265 270 Cys Tyr Val Tyr Glu Gly Ile Lys Ser Asn Lys Leu Pro Ser His His 275 280 285 Ser Ser Trp Met Tyr Arg Gly Arg Glu Ile Pro Tyr Leu Lys Phe Asn 290 295 300 Phe His Ile Ile Leu Arg Leu Ile Ile Lys Met Met Lys Lys Glu Lys 305 310 315 320 Gly Ala Leu Ala Trp Met Tyr Lys Lys Leu Ile Glu Glu Gly Asp Arg 325 330 335 Asp Lys Ala Thr Ile Ala Leu Asp Lys Ile Asn Gly Ser Val Leu Met 340 345 350 Ile Ser Ser Ala Ala Asp Glu Ile Trp Pro Ser Lys Met His Ser Glu 355 360 365 Thr Val Cys Ser Ile Phe Glu Lys Ser His Phe Lys His Glu Tyr Lys 370 375 380 His Ile Thr Phe Ala Lys Ser Gly His Ile Leu Thr Val Pro Phe Gln 385 390 395 400 Ser Ile Tyr Pro Ser Glu Lys Tyr Pro Tyr Asp Val Glu Ser Trp Ala 405 410 415 Lys Ala Asn Met Asp Ser Trp Asn Glu Thr Ile Lys Phe Leu Glu Lys 420 425 430 Trp Ala Ser Lys 435 <210> SEQ ID NO 155 <211> LENGTH: 60 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 2 <400> SEQUENCE: 155 Met Tyr Ile Asn Glu Thr Lys Val Val Val Arg Tyr Ala Glu Thr Asp 1 5 10 15 Lys Met Gly Ile Val His His Ser Asn Tyr Tyr Ile Tyr Phe Glu Glu 20 25 30 Ala Arg Thr Gln Phe Ile Lys Lys Thr Gly Ile Ser Tyr Ser Gln Met 35 40 45 Glu Lys Asp Gly Ile Met Phe Pro Leu Val Glu Ser 50 55 60 <210> SEQ ID NO 156 <211> LENGTH: 128 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 3 <400> SEQUENCE: 156 Met Asp Phe Ser Lys Leu Phe Lys Val Gly Ser Thr Tyr Val Ser Glu 1 5 10 15 Tyr Ile Val Lys Pro Glu Asp Thr Ala Asn Phe Ile Gly Asn Asn Gly 20 25 30 Val Val Met Leu Ser Thr Pro Ala Met Ile Lys Tyr Met Glu Tyr Thr 35 40 45 Thr Leu His Ile Val Asp Asn Val Ile Pro Lys Asn Tyr Arg Pro Val 50 55 60 Gly Thr Lys Ile Asp Val Glu His Ile Lys Pro Ile Pro Ala Asn Met 65 70 75 80 Lys Val Val Val Lys Val Thr Leu Ile Ser Ile Glu Gly Lys Lys Leu 85 90 95 Arg Tyr Asn Val Glu Ala Phe Asn Glu Lys Asn Cys Lys Val Gly Phe 100 105 110 Gly Ile Tyr Glu Gln Gln Ile Val Asn Leu Glu Gln Phe Leu Asn Arg 115 120 125 <210> SEQ ID NO 157 <211> LENGTH: 436 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 1 <400> SEQUENCE: 157 Met Asn Asn Asp Asn Cys Thr Ile Lys Ile Thr Pro Glu Val Ser Arg 1 5 10 15 Val Asp Glu Pro Val Asp Ile Lys Ile Asn Gly Leu Pro Lys Asn Glu 20 25 30 Lys Val Ile Ile Arg Ala Val Ser Ser Asp Tyr Tyr Cys Ile Asn Ala 35 40 45 Ser Ile Leu Glu Ile Gly Asp Asn Thr Leu Trp Glu Ser Tyr Ala Val 50 55 60 Phe Glu Thr Asp Glu Cys Gly Asn Ile Asn Phe Glu Asn Ala Val Pro 65 70 75 80 Val Asp Gly Thr Tyr Ser Asn Cys Asp Lys Met Gly Leu Phe Tyr Ser 85 90 95 Met Arg Pro Lys Gln Ile Arg Lys Ser Lys Leu Ile Gln Lys Leu Ser 100 105 110 Ser Ile Asn Glu Asn Arg Lys Tyr Lys Ile Thr Phe Thr Val Glu Lys 115 120 125 Asn Gly Lys Ile Ile Gly Ser Lys Glu His Thr Arg Val Tyr Cys Asp 130 135 140 Asp Thr Ile Lys Ser Ile Asp Val Val Glu Lys Asn Leu Leu Ala Arg 145 150 155 160 Tyr Phe Thr Ser Lys Asp Asn Ile Lys His Pro Ala Ile Ile Val Leu 165 170 175 Ser Gly Ser Asp Gly Arg Ile Glu Lys Ala Gln Ala Ile Ala Glu Leu 180 185 190 Phe Ala Met Arg Gly Tyr Ser Ala Leu Ala Val Cys Tyr Phe Gly Leu 195 200 205 Glu Gly Thr Pro Glu Asp Leu Asn Met Ile Pro Leu Glu Tyr Val Glu 210 215 220 Asn Ala Val Lys Trp Leu Lys Arg Gln Asp Thr Val Asp Glu Asn Lys 225 230 235 240 Ile Ala Ile Tyr Gly Arg Ser Lys Gly Gly Glu Leu Val Leu Leu Ala 245 250 255 Ala Ser Met Phe Lys Asp Ile Ala Cys Val Ile Ala Asn Thr Pro Ser 260 265 270 Cys Tyr Val Tyr Glu Gly Ile Lys Ser Asn Lys Leu Pro Ser His His 275 280 285 Ser Ser Trp Met Tyr Arg Gly Arg Glu Ile Pro Tyr Leu Lys Phe Asn 290 295 300 Phe His Ile Ile Leu Arg Leu Ile Ile Lys Met Met Lys Lys Glu Lys 305 310 315 320 Gly Ala Leu Ala Trp Met Tyr Lys Lys Leu Ile Glu Glu Gly Asp Arg 325 330 335 Asp Lys Ala Thr Ile Ala Leu Asp Lys Ile Asn Gly Ser Val Leu Met 340 345 350 Ile Ser Ser Ala Ala Asp Glu Ile Trp Pro Ser Lys Met His Ser Glu 355 360 365 Thr Val Cys Ser Ile Phe Glu Lys Ser His Phe Lys His Glu Tyr Lys 370 375 380 His Ile Thr Phe Ala Lys Ser Gly His Ile Leu Thr Val Pro Phe Gln 385 390 395 400 Ser Ile Tyr Pro Ser Glu Lys Tyr Pro Tyr Asp Val Glu Ser Trp Ala 405 410 415 Lys Ala Asn Met Asp Ser Trp Asn Glu Thr Ile Lys Phe Leu Glu Lys 420 425 430 Trp Ala Ser Lys 435 <210> SEQ ID NO 158 <211> LENGTH: 137 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 2 <400> SEQUENCE: 158 Met Tyr Ile Asn Glu Thr Lys Val Val Val Arg Tyr Ala Glu Thr Asp 1 5 10 15 Lys Met Gly Ile Val His His Ser Asn Tyr Tyr Ile Tyr Phe Glu Glu 20 25 30 Ala Arg Thr Gln Phe Ile Lys Lys Thr Gly Ile Ser Tyr Ser Gln Met 35 40 45 Glu Lys Asp Gly Ile Met Phe Pro Leu Val Glu Ser Asn Cys Arg Tyr 50 55 60 Leu Gln Gly Ala Lys Tyr Glu Asp Glu Leu Leu Ile Lys Thr Trp Ile 65 70 75 80 Lys Glu Leu Thr Pro Val Lys Ala Glu Phe Asn Tyr Ser Val Ile Arg 85 90 95 Glu Asn Asp Gln Lys Glu Ile Ala Lys Gly Ser Thr Leu His Ala Phe 100 105 110 Val Asn Asn Asn Phe Lys Ile Ile Asn Leu Lys Lys Asn His Thr Glu 115 120 125 Leu Phe Lys Lys Leu Gln Ser Leu Ile 130 135 <210> SEQ ID NO 159 <211> LENGTH: 128 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 3 <400> SEQUENCE: 159 Met Asp Phe Ser Lys Leu Phe Lys Val Gly Ser Thr Tyr Val Ser Glu 1 5 10 15 Tyr Ile Val Lys Pro Glu Asp Thr Ala Asn Phe Ile Gly Asn Asn Gly 20 25 30 Val Val Met Leu Ser Thr Pro Ala Met Ile Lys Tyr Met Glu Tyr Thr 35 40 45 Thr Leu His Ile Val Asp Asn Val Ile Pro Lys Asn Tyr Arg Pro Val 50 55 60 Gly Thr Lys Ile Asp Val Glu His Ile Lys Pro Ile Pro Ala Asn Met 65 70 75 80 Lys Val Val Val Lys Val Thr Leu Ile Ser Ile Glu Gly Lys Lys Leu 85 90 95 Arg Tyr Asn Val Glu Ala Phe Asn Glu Lys Asn Cys Lys Val Gly Phe 100 105 110 Gly Ile Tyr Glu Gln Gln Ile Val Asn Leu Glu Gln Phe Leu Asn Arg 115 120 125 <210> SEQ ID NO 160 <211> LENGTH: 11184 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL8225-pta-ack::ptb-buk, plasmid <400> SEQUENCE: 160 aaactccttt ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga 60 gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta 120 atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa 180 gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact 240 gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca 300 tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt 360 accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg 420 ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag 480 cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta 540 agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat 600 ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg 660 tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc 720 ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac 780 cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc 840 gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca gggccccctg cttcggggtc 900 attatagcga ttttttcggt atatccatcc tttttcgcac gatatacagg attttgccaa 960 agggttcgtg tagactttcc ttggtgtatc caacggcgtc agccgggcag gataggtgaa 1020 gtaggcccac ccgcgagcgg gtgttccttc ttcactgtcc cttattcgca cctggcggtg 1080 ctcaacggga atcctgctct gcgaggctgg ccggctaccg ccggcgtaac agatgagggc 1140 aagcggatgg ctgatgaaac caagccaacc aggaagggca gcccacctat caaggtgtac 1200 tgccttccag acgaacgaag agcgattgag gaaaaggcgg cggcggccgg catgagcctg 1260 tcggcctacc tgctggccgt cggccagggc tacaaaatca cgggcgtcgt ggactatgag 1320 cacgtccgcg agctggcccg catcaatggc gacctgggcc gcctgggcgg cctgctgaaa 1380 ctctggctca ccgacgaccc gcgcacggcg cggttcggtg atgccacgat cctcgccctg 1440 ctggcgaaga tcgaagagaa gcaggacgag cttggcaagg tcatgatggg cgtggtccgc 1500 ccgagggcag agccatgact tttttagccg ctaaaacggc cggggggtgc gcgtgattgc 1560 caagcacgtc cccatgcgct ccatcaagaa gagcgacttc gcggagctgg tgaagtacat 1620 caccgacgag caaggcaaga ccgatcgggc cccctgcagg ataaaaaaat tgtagataaa 1680 ttttataaaa tagttttatc tacaattttt ttatcaggaa acagctatga ccgcggccgc 1740 ggcgccaagc ttagaaaaat ataaataaga agtagcttta agagaattaa attattaaga 1800 aaagcaaagg tgtttaaaaa ataaattttt aaacaccttt gcttttctta aattataaat 1860 aagataaaaa agaatcctga ataaaataaa aaggggtgtc tcaaaatttt attttgagac 1920 gacccctttt tattctatat gtcgatgcta tagctgagat cgtggaattc ttgttagcta 1980 ccagattcac atttaagttg tttctctaaa ccacagatta tcaattcaag tccaaaaaga 2040 aatgctggtt ctgcgccttg atgatcaaat aactctattg cttgtcttaa caatggaggc 2100 attgaatctg ttgttggtgt ttctctttcc tcttttgcaa cttgatgttc ttgatcctcc 2160 aatacgcaac ctaaagtaaa atgtcctaca gcacttagtg cgtataaggc attttctaaa 2220 ctaaaaccct gttgacataa gaatgctaat tgattttcta atgtttcata ttgtttttca 2280 gttggtctag ttcctaaatg tactttagcc ccatctctat gtgataatag agcacaacga 2340 aaagatttag cgttattcct aagaaaatct tgccatgatt caccttctaa aggacaaaag 2400 tgagtgtgat gtctatctaa catttcaata gctaaggcgt caagtaaagc tctcttattc 2460 ttcacatgcc aatacaacgt aggttgttct actccaagtt tctgagctaa ctttcttgta 2520 gttagtcctt ctattccaac ttcatttagt aattccaatg cactattgat aactttactt 2580 ttatcaagtc tagacatcat ttaatatcct cctcttcaat atatttaagt cgactgatcg 2640 gatcctgatc ggagctccca tggcggccgg tcgatatcga tgtgtagtag cctgtgaaat 2700 aagtaaggaa aaaaaagaag taagtgttat atatgatgat tattttgtag atgtagatag 2760 gataatagaa tccatagaaa atataggtta tacagttata taaaaattac tttaaaatct 2820 atcattgata gggtaaaata taaatcgtat aaagttgtgt aatttttaag gaggtgtgtt 2880 acagacgtcc gcgagagacc ttaaatatat tgaagaggag gaaatacata tggtttcaag 2940 atatgttcca gatatgggag atttaatatg ggttgatttt gatccaacaa aaggatcaga 3000 acaagcagga catagaccag cagttgtttt atcaccattt atgtataata ataaaacagg 3060 aatgtgttta tgtgttccat gtacaacaca atcaaaagga tatccatttg aagttgtttt 3120 atcaggacaa gaaagagatg gagttgcatt agcagatcaa gttaaatcaa tagcatggag 3180 agcaagagga gcaacaaaaa aaggaacagt tgcaccagaa gaattacaat taataaaagc 3240 aaaaataaat gttttaatag gataatgtta ttaagctagc ataaaaataa gaagcctgca 3300 tttgcaggct tcttattttt atggcgcgcc gttctgaatc cttagctaat ggttcaacag 3360 gtaactatga cgaagatagc accctggata agtctgtaat ggattctaag gcatttaatg 3420 aagacgtgta tataaaatgt gctaatgaaa aagaaaatgc gttaaaagag cctaaaatga 3480 gttcaaatgg ttttgaaatt gattggtagt ttaatttaat atattttttc tattggctat 3540 ctcgatacct atagaatctt ctgttcactt ttgtttttga aatataaaaa ggggcttttt 3600 agcccctttt ttttaaaact ccggaggagt ttcttcattc ttgatactat acgtaactat 3660 tttcgatttg acttcattgt caattaagct agtaaaatca atggttaaaa aacaaaaaac 3720 ttgcattttt ctacctagta atttataatt ttaagtgtcg agtttaaaag tataatttac 3780 caggaaagga gcaagttttt taataaggaa aaatttttcc ttttaaaatt ctatttcgtt 3840 atatgactaa ttataatcaa aaaaatgaaa ataaacaaga ggtaaaaact gctttagaga 3900 aatgtactga taaaaaaaga aaaaatccta gatttacgtc atacatagca cctttaacta 3960 ctaagaaaaa tattgaaagg acttccactt gtggagatta tttgtttatg ttgagtgatg 4020 cagacttaga acattttaaa ttacataaag gtaatttttg cggtaataga ttttgtccaa 4080 tgtgtagttg gcgacttgct tgtaaggata gtttagaaat atctattctt atggagcatt 4140 taagaaaaga agaaaataaa gagtttatat ttttaactct tacaactcca aatgtaaaaa 4200 gttatgatct taattattct attaaacaat ataataaatc ttttaaaaaa ttaatggagc 4260 gtaaggaagt taaggatata actaaaggtt atataagaaa attagaagta acttaccaaa 4320 aggaaaaata cataacaaag gatttatgga aaataaaaaa agattattat caaaaaaaag 4380 gacttgaaat tggtgattta gaacctaatt ttgatactta taatcctcat tttcatgtag 4440 ttattgcagt taataaaagt tattttacag ataaaaatta ttatataaat cgagaaagat 4500 ggttggaatt atggaagttt gctactaagg atgattctat aactcaagtt gatgttagaa 4560 aagcaaaaat taatgattat aaagaggttt acgaacttgc gaaatattca gctaaagaca 4620 ctgattattt aatatcgagg ccagtatttg aaatttttta taaagcatta aaaggcaagc 4680 aggtattagt ttttagtgga ttttttaaag atgcacacaa attgtacaag caaggaaaac 4740 ttgatgttta taaaaagaaa gatgaaatta aatatgtcta tatagtttat tataattggt 4800 gcaaaaaaca atatgaaaaa actagaataa gggaacttac ggaagatgaa aaagaagaat 4860 taaatcaaga tttaatagat gaaatagaaa tagattaaag tgtaactata ctttatatat 4920 atatgattaa aaaaataaaa aacaacagcc tattaggttg ttgtttttta ttttctttat 4980 taattttttt aatttttagt ttttagttct tttttaaaat aagtttcagc ctctttttca 5040 atatttttta aagaaggagt atttgcatga attgcctttt ttctaacaga cttaggaaat 5100 attttaacag tatcttcttg cgccggtgat tttggaactt cataacttac taatttataa 5160 ttattatttt cttttttaat tgtaacagtt gcaaaagaag ctgaacctgt tccttcaact 5220 agtttatcat cttcaatata atattcttga cctatatagt ataaatatat ttttattata 5280 tttttacttt tttctgaatc tattatttta taatcataaa aagttttacc accaaaagaa 5340 ggttgtactc cttctggtcc aacatatttt tttactatat tatctaaata atttttggga 5400 actggtgttg taatttgatt aatcgaacaa ccagttatac ttaaaggaat tataactata 5460 aaaatatata ggattatctt tttaaatttc attattggcc tcctttttat taaatttatg 5520 ttaccataaa aaggacataa cgggaatatg tagaatattt ttaatgtaga caaaatttta 5580 cataaatata aagaaaggaa gtgtttgttt aaattttata gcaaactatc aaaaattagg 5640 gggataaaaa tttatgaaaa aaaggttttc gatgttattt ttatgtttaa ctttaatagt 5700 ttgtggttta tttacaaatt cggccggcca aagattgctc tatgtttaag ctattatatg 5760 aacttccaat tctttttatt gatatgggag taatattgct ttttattctt attaggtttt 5820 ttaaatattc tatacctaaa atattgtttg gagattgaag tatttcatct atattgtact 5880 ttgtaagaga acttttagta tttaatagaa aattatttaa agcactattt cgtgcagaag 5940 gataggacat accctgtgac attttttcct ttaaaaataa tttaaattgg gtaggctctt 6000 ctgcaagaat ttttgcaata gatttcagca agtttatatt actatattcg cttccaaaac 6060 aaagattttt tactacaccc aagttttcta agagacttac agcaccatag gcaaaaaatt 6120 cagcagaaga tagactgtag ataacaggaa gttcaaatac caggtctact ccatttagaa 6180 gtgccatttt ggttttagtc catttgtcaa ctatagatgg tgaacctctt tgcacgaagt 6240 taccactcat aactgctatt acagcatcac attttgtagc agaacgagca ctttcaatat 6300 gatatttatg tccattgtga aagggattat attcaactat tattccagtt acgttcatag 6360 aaattttcct ttctaaaata ttttattcca tgtcaagaac tctgtttatt tcattaaaga 6420 actataagta caaagtataa ggcatttgaa aaaataggct agtatattga ttgattattt 6480 attttaaaat gcctaagtga aatatataca tattataaca ataaaataag tattagtgta 6540 ggatttttaa atagagtatc tattttcaga ttaaattttt gattatttga tttacattat 6600 ataatattga gtaaagtatt gactagcaaa attttttgat actttaattt gtgaaatttc 6660 ttatcaaaag ttatattttt gaataatttt tattgaaaaa tacaactaaa aaggattata 6720 gtataagtgt gtgtaatttt gtgttaaatt taaagggagg aaatgaacat gaaattgatg 6780 agtaaaaact ttgatgagtt attatcaaga ttaaaggaag ttccaacaaa aaaagtggct 6840 gtagccgtag cacaagatga accagtatta gaggctataa aagaagctac agaaaataac 6900 atcgcacaag caatattggt tggtgataaa caacaaatcc atgaaatcgc aaagaaaata 6960 aacttggact tatctgatta tgaaataatg gatattaaag atccaaagaa agcaacatta 7020 gaagcagtaa aattagtttc tagtggtcat gcagatatgt taatgaaagg tctagttgat 7080 actgcaacat tcctaagaag cgtattaaac aaagaggttg gtcttagaac aggaaaatta 7140 atgtcccatg tagctgtgtt tgatgtggaa ggttgggata gactgttatt tttaactgat 7200 gcagcattta atacatatcc agaatttaag gataaagttg gaatgataaa taatgcagtt 7260 gtagttgctc atgcatgtgg aatagatgtt ccaagagtag cacctatatg cccagttgaa 7320 gttgtaaata caagtatgca atcaacagtt gatgcagcat tgttagctaa aatgagtgac 7380 agggggcaaa ttaaaggatg cgtaattgat ggaccttttg ccttagataa tgcaatatca 7440 gaagaagcag ctcatcataa aggtgttaca ggatcagtag caggtaaagc tgatatatta 7500 ttattaccaa atatagaagc agcaaatgta atgtataaaa cattaacata tttctctaaa 7560 tcaagaaatg gtggactttt agtaggtaca tcagcaccag taattttaac ttcaagagca 7620 gattcattcg aaactaaagt taattcaatt gctcttgcag cattagttgc agcaagaaat 7680 aagtaataaa tcaatccata ataattaatg cataattaat ggagagattt atatggaatt 7740 tgcaatgcac tattagattc tataataatt tcttctgaaa attatgcatt atgactgtat 7800 agaatgcatt aaatttaagg gggattcaga atgtcatata agctattaat aatcaatcca 7860 ggttcaacat caacaaagat tggtgtttac gaaggagaaa aggaactatt tgaagaaact 7920 ttgagacaca caaatgaaga aataaagaga tatgatacaa tatatgatca atttgaattt 7980 agaaaagaag ttatattaaa tgttcttaaa gaaaagaatt ttgatataaa gactctaagt 8040 gctattgttg gtagaggtgg aatgcttaga ccagttgaag gtggaacata tgcagtaaat 8100 gatgcaatgg ttgaagattt aaaagttgga gttcaaggac ctcatgcttc taaccttggc 8160 ggaataattg ccaagtcaat tggagatgaa ttaaatattc catcatttat agtagatcca 8220 gttgttacag atgagttagc agatgtagca agactatctg gagtaccaga actaccaaga 8280 aaaagtaaat tccatgcttt aaatcaaaaa gcggtagcta aaagatatgg aaaagaaagt 8340 ggacaaggat atgaaaacct aaatcttgta gttgtacata tgggtggagg cgtttcagtt 8400 ggtgctcaca atcatgggaa agttgtcgat gtaaataatg cattagatgg agatggccca 8460 ttctcaccag aaagagctgg atcagttcca attggtgatt tagttaaaat gtgttttagt 8520 ggaaaatata gtgaagcaga agtatatggc aaggctgtag gaaaaggtgg atttgttggt 8580 tatctaaaca caaatgatgt aaaaggtgtt attgataaga tggaagaagg agataaagaa 8640 tgtgaatcaa tatacaaagc atttgtttat caaatttcaa aagcaatcgg agaaatgtca 8700 gttgtattag aaggtaaagt tgatcaaatt atttttaccg gaggaattgc atactcacca 8760 acacttgttc cagaccttaa agcaaaagtt gaatggatag ccccagttac agtttatcct 8820 ggagaagatg aattacttgc tctagctcaa ggtgctataa gagtacttga tggagaagaa 8880 caagctaagg tttactagta ccgttcgtat aatgtatgct atacgaagtt atccttagaa 8940 gcaaacttaa gagtgtgttg atagtgcagt atcttaaaat tttgtgtata ataggaattg 9000 aagttaaatt agatgctaaa aatttgtaat taagaaggag ggattcgtca tgttggtatt 9060 ccaaatgcgt aatgtagata aaacatctac tgttttgaaa cagactaaaa acagtgatta 9120 cgcagataaa taaatacgtt agattaattc ctaccagtga ctaatcttat gactttttaa 9180 acagataact aaaattacaa acaaatcgtt taacttctgt atttatttac agatgtaatc 9240 acttcaggag taattacatg aacaaaaata taaaatattc tcaaaacttt ttaacgagtg 9300 aaaaagtact caaccaaata ataaaacaat tgaatttaaa agaaaccgat accgtttacg 9360 aaattggaac aggtaaaggg catttaacga cgaaactggc taaaataagt aaacaggtaa 9420 cgtctattga attagacagt catctattca acttatcgtc agaaaaatta aaactgaaca 9480 ttcgtgtcac tttaattcac caagatattc tacagtttca attccctaac aaacagaggt 9540 ataaaattgt tgggagtatt ccttaccatt taagcacaca aattattaaa aaagtggttt 9600 ttgaaagcca tgcgtctgac atctatctga ttgttgaaga aggattctac aagcgtacct 9660 tggatattca ccgaacacta gggttgctct tgcacactca agtctcgatt cagcaattgc 9720 ttaagctgcc agcggaatgc tttcatccta aaccaaaagt aaacagtgtc ttaataaaac 9780 ttacccgcca taccacagat gttccagata aatattggaa gctatatacg tactttgttt 9840 caaaatgggt caatcgagaa tatcgtcaac tgtttactaa aaatcagttt catcaagcaa 9900 tgaaacacgc caaagtaaac aatttaagta ccattactta tgagcaagta ttgtctattt 9960 ttaatagtta tctattattt aacgggagga aataattcta tgagtcgctt ttttaaattt 10020 ggaaagttac acgttactaa agggaatgga gataaattat tagatatact actgacagct 10080 tccaagaagc taaagaggtc ataacttcgt ataatgtatg ctatacgaac ggtagacttg 10140 acttttaatg ctcatctcta tataataggt tgtggctaat atatagaggt gagtgatatg 10200 aaattaaatg tatcagattt actaagtgaa gaagttgtta caaaggacat aaatgttaca 10260 gtagaagaaa agggattcta tgatggaagt gaatacataa agttattaga gcctctaaag 10320 tttagcggaa ctttaagtaa agaaggagat attcttctgt tggaaggaag aattaatact 10380 ttactagagc tcacttgttc acgatgtcta ggtaaattct cttatgctgt gaatgttgct 10440 attactgaaa aatttacaaa taataacaag gaaaataagg atgatgaagc catctttata 10500 gatagtaata tcattgatat tacggaaata atagaaaata acattatatt aattttacca 10560 attaagaggc tttgcagcga gaattgtaag gggttatgcc aacagtgcgg cactaactta 10620 aataattcta aatgtcagtg caaaagcgat gatattgatc cgagattggc aaagctaaaa 10680 gatatgtttt tcactgatta aggaggtgtt tactgtggga aatccagcca gcagaatatc 10740 aaaagcaaaa agagactcaa gaagagcaca gacttttaaa ttaggtttac caggtttagt 10800 tgagtgtcct cagtgccatg aaatgaaact tgcacataga gtttgtaaga attgtggata 10860 ttataagggt aaggaaatca tttcaactga aaataaataa aagaaagtca tttgactttc 10920 tttttttgtt catggggtct ataaaagtta gatcatatta agtaacaaaa ttaggtaaca 10980 aaggtccaga ttataggata ggatgtgaaa atatgataat tgctgtggat ggtatgggag 11040 gagattttgc accttgtgct gtagtggaag gtgtggtaga agcagttaaa aagcaaaacg 11100 taaatataat aataaccggc caaaaagagc aaattgaaaa tgaattagct aaatataatt 11160 atcctaagga caaaatagat attt 11184 <210> SEQ ID NO 161 <211> LENGTH: 42 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN22f <400> SEQUENCE: 161 tttacaaatt cggccggcca aagattgctc tatgtttaag ct 42 <210> SEQ ID NO 162 <211> LENGTH: 43 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN23r <400> SEQUENCE: 162 catcaaagtt tttactcatc aatttcatgt tcatttcctc cct 43 <210> SEQ ID NO 163 <211> LENGTH: 46 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN24f <400> SEQUENCE: 163 agggaggaaa tgaacatgaa attgatgagt aaaaactttg atgagt 46 <210> SEQ ID NO 164 <211> LENGTH: 50 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN25r <400> SEQUENCE: 164 gtatagcata cattatacga acggtactag taaaccttag cttgttcttc 50 <210> SEQ ID NO 165 <211> LENGTH: 50 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN26f <400> SEQUENCE: 165 gaagaacaag ctaaggttta ctagtaccgt tcgtataatg tatgctatac 50 <210> SEQ ID NO 166 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN27r <400> SEQUENCE: 166 agagatgagc attaaaagtc aagtctaccg ttcgtatagc ataca 45 <210> SEQ ID NO 167 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN28f <400> SEQUENCE: 167 tgtatgctat acgaacggta gacttgactt ttaatgctca tctct 45 <210> SEQ ID NO 168 <211> LENGTH: 47 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN29r <400> SEQUENCE: 168 catgagatta tcaaaaagga gtttaaatat ctattttgtc cttagga 47 <210> SEQ ID NO 169 <211> LENGTH: 47 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN30f <400> SEQUENCE: 169 tcctaaggac aaaatagata tttaaactcc tttttgataa tctcatg 47 <210> SEQ ID NO 170 <211> LENGTH: 42 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN31r <400> SEQUENCE: 170 agcttaaaca tagagcaatc tttggccggc cgaatttgta aa 42 <210> SEQ ID NO 171 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Og29f <400> SEQUENCE: 171 agccacatcc agtagattga acttt 25 <210> SEQ ID NO 172 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Og30r <400> SEQUENCE: 172 aattcgccct acgattaaag tggaa 25 <210> SEQ ID NO 173 <211> LENGTH: 35 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Pfdx-F1, forward <400> SEQUENCE: 173 aaaggtctcc ggccgcgctc actatctgcg gaacc 35 <210> SEQ ID NO 174 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Pfdx-R1, reverse <400> SEQUENCE: 174 tttggtctcg aattctgtaa cacctcctta atttttag 38 <210> SEQ ID NO 175 <211> LENGTH: 52 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: aor1-F1, forward <400> SEQUENCE: 175 aaaggtctcg aattcaaaga tctatgtatg gttatgatgg taaagtatta ag 52 <210> SEQ ID NO 176 <211> LENGTH: 54 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: aor1-R1, reverse <400> SEQUENCE: 176 tttggtctcc tcgagtatgg atccctagaa cttacctata tattcatcta atcc 54 <210> SEQ ID NO 177 <211> LENGTH: 37 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-pta-ack - ack-DuetI2-R1 <400> SEQUENCE: 177 gggtacctta tttattttca actatttctt ttgtatc 37 <210> SEQ ID NO 178 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-pta-ack - DuetI2-ack-F1 <400> SEQUENCE: 178 ttgaaaataa ataaggtacc ctcgagtctg gtaaag 36 <210> SEQ ID NO 179 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-pta-ack - DuetI2-pta-R1 <400> SEQUENCE: 179 ttttttccat atgtatatct ccttcttata cttaac 36 <210> SEQ ID NO 180 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-pta-ack - pta-DuetI2-F1 <400> SEQUENCE: 180 aggagatata catatggaaa aaatttggag taaggc 36 <210> SEQ ID NO 181 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-tesB - DuetI2-tesB-F1 <400> SEQUENCE: 181 gaaatcataa ttaaggtacc ctcgagtctg gtaaag 36 <210> SEQ ID NO 182 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-tesB - DuetI2-tesB-R1 <400> SEQUENCE: 182 cctgactcat atgtatatct ccttcttata cttaac 36 <210> SEQ ID NO 183 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-tesB - tesB-DuetI2-F1 <400> SEQUENCE: 183 aagaaggaga tatacatatg agtcaggcac ttaaaa 36 <210> SEQ ID NO 184 <211> LENGTH: 35 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-tesB - testB-DuetI2-R1 <400> SEQUENCE: 184 agggtacctt aattatgatt tctcataaca ccttc 35 <210> SEQ ID NO 185 <211> LENGTH: 7606 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pDUET-pta-ack, plasmid <400> SEQUENCE: 185 ggggaattgt gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag 60 gagatatacc atgggcagca gccatcacca tcatcaccac agccaggatc cgaattcgag 120 ctcggcgcgc ctgcaggtcg acaagcttgc ggccgcataa tgcttaagtc gaacagaaag 180 taatcgtatt gtacacggcc gcataatcga aattaatacg actcactata ggggaattgt 240 gagcggataa caattcccca tcttagtata ttagttaagt ataagaagga gatatacata 300 tggaaaaaat ttggagtaag gcaaaggaag acaaaaaaaa gattgtctta gctgaaggag 360 aagaagaaag aactcttcaa gcttgtgaaa aaataattaa agagggtatt gcaaatttaa 420 tccttgtagg gaatgaaaag gtaataaaag aaaaagcgtc aaaattaggt gtaagtttaa 480 atggagcaga aatagtagat ccagagattt cagataaact aaaggcatat gcagatgctt 540 tttatgaatt gagaaagaag aagggaataa cgccagaaaa agcggataaa atagtaagag 600 atccaatata ctttgctaca atgatggtta aacttggaga tgcagatgga ttggtttcag 660 gtgcggttca tactacaggc gatcttttga gaccaggact tcaaatagta aagacagctc 720 caggtacatc agtagtttcc agtacattta taatggaagt accaaattgt gagtatggtg 780 acaatggtgt acttctattt gctgattgtg ctgtaaatcc atgcccagat agtgatcaat 840 tggcttcaat tgcaataagt acagcagaaa ctgcaaagaa cttatgtgga atggatccaa 900 aagtagcaat gctttcattt tctactaagg gaagtgcaaa acacgaatta gtagacaaag 960 ttagaaatgc tgtagagatt gcaaaaaaag ctaaaccaga tttaagttta gacggagaat 1020 tacaattaga tgcctctatc gtagaaaagg ttgcaagttt aaaggctcct ggaagtgaag 1080 tagcaggaaa agcaaatgta cttgtatttc cagatctcca agcaggaaat ataggctata 1140 aactcgttca aagatttgca aaagcagatg ctataggacc tgtatgccaa ggatttgcaa 1200 aacctataaa tgatttgtca agaggatgta attctgatga tatagtaaat gtagtagctg 1260 taacagcagt tcaagcacaa gctcaaaagt aataacaaaa agcataaatg attcattttt 1320 aggaggaata ttaaacatga aaatattagt agtaaactgt ggaagttcat ctttaaaata 1380 tcaacttatt gatatgcaag atgaaagtgt tgtagcaaag ggtcttgtag aaagaatagg 1440 aatggacggt tcaattttaa cacacaaagt taatggagaa aagtttgtta cagagcaacc 1500 aatggaagac cacaaagttg ctatacaatt agtattaaat gctcttgtag ataaaaaaca 1560 tggtgtaata aaagacatgt cagaaatatc cgctgtagga catagagttt tgcacggtgg 1620 aaagaaatat gcagcatcca ttcttattga cgaaaatgta atgaaagcaa tagaagaatg 1680 tatcccacta ggaccactac ataatccagc taatataatg ggaatagatg cttgtaaaaa 1740 attaatgcca aatactccaa tggtagcagt atttgataca gcatttcatc agacaatgcc 1800 agattatgct tatacttatg caatacctta tgatatatct gaaaagtatg atatcagaaa 1860 atatggtttt catggaactt ctcatagatt cgtttcaatt gaagcagcta aattattaaa 1920 gaaagatcca aaagatctta agttaataac ttgtcattta ggaaatggag ctagcatatg 1980 tgcagtaaac caaggaaaag cagtagatac aactatggga cttactcctc ttgcaggact 2040 tgtaatggga actagatgcg gtgatataga tccagctata gtaccatttg taatgaaaag 2100 aacaggcatg tctgtagatg aagtggatac cttaatgaat aaaaagtcag gaatacttgg 2160 agtatcagga gtaagcagtg attttagaga tgtagaagaa gctgcaaatt caggaaatga 2220 tagagcaaaa cttgcattaa atatgtatta tcacaaagtt aaatctttca taggagctta 2280 tgttgcagtt ttaaatggag cagatgctat aatatttacg gcaggacttg gagaaaattc 2340 agcaactagc agatctgcta tatgtaatgg attaagctat tttggaatta aaatagatga 2400 agaaaagaat aagaaaaggg gagaggcact agaaataagc acacctgatt caaagataaa 2460 agtattagta attcctacaa atgaagaact tatgatagct agggatacaa aagaaatagt 2520 tgaaaataaa taaggtaccc tcgagtctgg taaagaaacc gctgctgcga aatttgaacg 2580 ccagcacatg gactcgtcta ctagcgcagc ttaattaacc taggctgctg ccaccgctga 2640 gcaataacta gcataacccc ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa 2700 aggaggaact atatccggat tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg 2760 gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct 2820 cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta 2880 aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa 2940 cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct 3000 ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc 3060 aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg 3120 ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt 3180 acaatttctg gcggcacgat ggcatgagat tatcaaaaag gatcttcacc tagatccttt 3240 taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca 3300 gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca 3360 tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc 3420 ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta tcagcaataa 3480 accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 3540 agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca 3600 acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat 3660 tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag 3720 cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac 3780 tcatggttat ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt 3840 ctgtgactgg tgagtactca accaagtcat tctgagaata gtgtatgcgg cgaccgagtt 3900 gctcttgccc ggcgtcaata cgggataata ccgcgccaca tagcagaact ttaaaagtgc 3960 tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat 4020 ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca 4080 gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 4140 cacggaaatg ttgaatactc atactcttcc tttttcaatc atgattgaag catttatcag 4200 ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggt 4260 catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 4320 gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 4380 aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 4440 gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 4500 gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 4560 gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 4620 atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 4680 cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 4740 cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 4800 agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 4860 tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 4920 gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 4980 catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 5040 agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 5100 ggaagagcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat 5160 atatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gtatacactc 5220 cgctatcgct acgtgactgg gtcatggctg cgccccgaca cccgccaaca cccgctgacg 5280 cgccctgacg ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg 5340 ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa acgcgcgagg cagctgcggt 5400 aaagctcatc agcgtggtcg tgaagcgatt cacagatgtc tgcctgttca tccgcgtcca 5460 gctcgttgag tttctccaga agcgttaatg tctggcttct gataaagcgg gccatgttaa 5520 gggcggtttt ttcctgtttg gtcactgatg cctccgtgta agggggattt ctgttcatgg 5580 gggtaatgat accgatgaaa cgagagagga tgctcacgat acgggttact gatgatgaac 5640 atgcccggtt actggaacgt tgtgagggta aacaactggc ggtatggatg cggcgggacc 5700 agagaaaaat cactcagggt caatgccagc gcttcgttaa tacagatgta ggtgttccac 5760 agggtagcca gcagcatcct gcgatgcaga tccggaacat aatggtgcag ggcgctgact 5820 tccgcgtttc cagactttac gaaacacgga aaccgaagac cattcatgtt gttgctcagg 5880 tcgcagacgt tttgcagcag cagtcgcttc acgttcgctc gcgtatcggt gattcattct 5940 gctaaccagt aaggcaaccc cgccagccta gccgggtcct caacgacagg agcacgatca 6000 tgctagtcat gccccgcgcc caccggaagg agctgactgg gttgaaggct ctcaagggca 6060 tcggtcgaga tcccggtgcc taatgagtga gctaacttac attaattgcg ttgcgctcac 6120 tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 6180 cggggagagg cggtttgcgt attgggcgcc agggtggttt ttcttttcac cagtgagacg 6240 ggcaacagct gattgccctt caccgcctgg ccctgagaga gttgcagcaa gcggtccacg 6300 ctggtttgcc ccagcaggcg aaaatcctgt ttgatggtgg ttaacggcgg gatataacat 6360 gagctgtctt cggtatcgtc gtatcccact accgagatgt ccgcaccaac gcgcagcccg 6420 gactcggtaa tggcgcgcat tgcgcccagc gccatctgat cgttggcaac cagcatcgca 6480 gtgggaacga tgccctcatt cagcatttgc atggtttgtt gaaaaccgga catggcactc 6540 cagtcgcctt cccgttccgc tatcggctga atttgattgc gagtgagata tttatgccag 6600 ccagccagac gcagacgcgc cgagacagaa cttaatgggc ccgctaacag cgcgatttgc 6660 tggtgaccca atgcgaccag atgctccacg cccagtcgcg taccgtcttc atgggagaaa 6720 ataatactgt tgatgggtgt ctggtcagag acatcaagaa ataacgccgg aacattagtg 6780 caggcagctt ccacagcaat ggcatcctgg tcatccagcg gatagttaat gatcagccca 6840 ctgacgcgtt gcgcgagaag attgtgcacc gccgctttac aggcttcgac gccgcttcgt 6900 tctaccatcg acaccaccac gctggcaccc agttgatcgg cgcgagattt aatcgccgcg 6960 acaatttgcg acggcgcgtg cagggccaga ctggaggtgg caacgccaat cagcaacgac 7020 tgtttgcccg ccagttgttg tgccacgcgg ttgggaatgt aattcagctc cgccatcgcc 7080 gcttccactt tttcccgcgt tttcgcagaa acgtggctgg cctggttcac cacgcgggaa 7140 acggtctgat aagagacacc ggcatactct gcgacatcgt ataacgttac tggtttcaca 7200 ttcaccaccc tgaattgact ctcttccggg cgctatcatg ccataccgcg aaaggttttg 7260 cgccattcga tggtgtccgg gatctcgacg ctctccctta tgcgactcct gcattaggaa 7320 gcagcccagt agtaggttga ggccgttgag caccgccgcc gcaaggaatg gtgcatgcaa 7380 ggagatggcg cccaacagtc ccccggccac ggggcctgcc accataccca cgccgaaaca 7440 agcgctcatg agcccgaagt ggcgagcccg atcttcccca tcggtgatgt cggcgatata 7500 ggcgccagca accgcacctg tggcgccggt gatgccggcc acgatgcgtc cggcgtagag 7560 gatcgagatc gatctcgatc ccgcgaaatt aatacgactc actata 7606 <210> SEQ ID NO 186 <211> LENGTH: 7492 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pDUET-ptb-buk, plasmid <400> SEQUENCE: 186 ggggaattgt gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag 60 gagatatacc atgggcagca gccatcacca tcatcaccac agccaggatc cgaattcgag 120 ctcggcgcgc ctgcaggtcg acaagcttgc ggccgcataa tgcttaagtc gaacagaaag 180 taatcgtatt gtacacggcc gcataatcga aattaatacg actcactata ggggaattgt 240 gagcggataa caattcccca tcttagtata ttagttaagt ataagaagga gatatacata 300 tgagtaaaaa ctttgatgag ttattatcaa gattaaagga agttccaaca aaaaaagtgg 360 ctgtagccgt agcacaagat gaaccagtat tagaggctat aaaagaagct acagaaaata 420 acatcgcaca agcaatattg gttggtgata aacaacaaat ccatgaaatc gcaaagaaaa 480 taaacttgga cttatctgat tatgaaataa tggatattaa agatccaaag aaagcaacat 540 tagaagcagt aaaattagtt tctagtggtc atgcagatat gttaatgaaa ggtctagttg 600 atactgcaac attcctaaga agcgtattaa acaaagaggt tggtcttaga acaggaaaat 660 taatgtccca tgtagctgtg tttgatgtgg aaggttggga tagactgtta tttttaactg 720 atgcagcatt taatacatat ccagaattta aggataaagt tggaatgata aataatgcag 780 ttgtagttgc tcatgcatgt ggaatagatg ttccaagagt agcacctata tgcccagttg 840 aagttgtaaa tacaagtatg caatcaacag ttgatgcagc attgttagct aaaatgagtg 900 acagggggca aattaaagga tgcgtaattg atggaccttt tgccttagat aatgcaatat 960 cagaagaagc agctcatcat aaaggtgtta caggatcagt agcaggtaaa gctgatatat 1020 tattattacc aaatatagaa gcagcaaatg taatgtataa aacattaaca tatttctcta 1080 aatcaagaaa tggtggactt ttagtaggta catcagcacc agtaatttta acttcaagag 1140 cagattcatt cgaaactaaa gttaattcaa ttgctcttgc agcattagtt gcagcaagaa 1200 ataagtaata aatcaatcca taataattaa tgcataatta atggagagat ttatatggaa 1260 tttgcaatgc actattagat tctataataa tttcttctga aaattatgca ttatgactgt 1320 atagaatgca ttaaatttaa gggggattca gaatgtcata taagctatta ataatcaatc 1380 caggttcaac atcaacaaag attggtgttt acgaaggaga aaaggaacta tttgaagaaa 1440 ctttgagaca cacaaatgaa gaaataaaga gatatgatac aatatatgat caatttgaat 1500 ttagaaaaga agttatatta aatgttctta aagaaaagaa ttttgatata aagactctaa 1560 gtgctattgt tggtagaggt ggaatgctta gaccagttga aggtggaaca tatgcagtaa 1620 atgatgcaat ggttgaagat ttaaaagttg gagttcaagg acctcatgct tctaaccttg 1680 gcggaataat tgccaagtca attggagatg aattaaatat tccatcattt atagtagatc 1740 cagttgttac agatgagtta gcagatgtag caagactatc tggagtacca gaactaccaa 1800 gaaaaagtaa attccatgct ttaaatcaaa aagcggtagc taaaagatat ggaaaagaaa 1860 gtggacaagg atatgaaaac ctaaatcttg tagttgtaca tatgggtgga ggcgtttcag 1920 ttggtgctca caatcatggg aaagttgtcg atgtaaataa tgcattagat ggagatggcc 1980 cattctcacc agaaagagct ggatcagttc caattggtga tttagttaaa atgtgtttta 2040 gtggaaaata tagtgaagca gaagtatatg gcaaggctgt aggaaaaggt ggatttgttg 2100 gttatctaaa cacaaatgat gtaaaaggtg ttattgataa gatggaagaa ggagataaag 2160 aatgtgaatc aatatacaaa gcatttgttt atcaaatttc aaaagcaatc ggagaaatgt 2220 cagttgtatt agaaggtaaa gttgatcaaa ttatttttac cggaggaatt gcatactcac 2280 caacacttgt tccagacctt aaagcaaaag ttgaatggat agccccagtt acagtttatc 2340 ctggagaaga tgaattactt gctctagctc aaggtgctat aagagtactt gatggagaag 2400 aacaagctaa ggtttactag gtaccctcga gtctggtaaa gaaaccgctg ctgcgaaatt 2460 tgaacgccag cacatggact cgtctactag cgcagcttaa ttaacctagg ctgctgccac 2520 cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt 2580 gctgaaagga ggaactatat ccggattggc gaatgggacg cgccctgtag cggcgcatta 2640 agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag cgccctagcg 2700 cccgctcctt tcgctttctt cccttccttt ctcgccacgt tcgccggctt tccccgtcaa 2760 gctctaaatc gggggctccc tttagggttc cgatttagtg ctttacggca cctcgacccc 2820 aaaaaacttg attagggtga tggttcacgt agtgggccat cgccctgata gacggttttt 2880 cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca aactggaaca 2940 acactcaacc ctatctcggt ctattctttt gatttataag ggattttgcc gatttcggcc 3000 tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattttaa caaaatatta 3060 acgtttacaa tttctggcgg cacgatggca tgagattatc aaaaaggatc ttcacctaga 3120 tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 3180 ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 3240 catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat 3300 ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag 3360 caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct 3420 ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt 3480 tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg 3540 cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca 3600 aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt 3660 tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat 3720 gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac 3780 cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa 3840 aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt 3900 tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt 3960 tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa 4020 gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatcatga ttgaagcatt 4080 tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 4140 ataggtcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 4200 agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 4260 aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 4320 ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 4380 gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 4440 aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 4500 aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 4560 gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 4620 aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 4680 aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 4740 cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 4800 cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 4860 tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 4920 tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 4980 ggaagcggaa gagcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca 5040 ccgcatatat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagtat 5100 acactccgct atcgctacgt gactgggtca tggctgcgcc ccgacacccg ccaacacccg 5160 ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg 5220 tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgaggcagc 5280 tgcggtaaag ctcatcagcg tggtcgtgaa gcgattcaca gatgtctgcc tgttcatccg 5340 cgtccagctc gttgagtttc tccagaagcg ttaatgtctg gcttctgata aagcgggcca 5400 tgttaagggc ggttttttcc tgtttggtca ctgatgcctc cgtgtaaggg ggatttctgt 5460 tcatgggggt aatgataccg atgaaacgag agaggatgct cacgatacgg gttactgatg 5520 atgaacatgc ccggttactg gaacgttgtg agggtaaaca actggcggta tggatgcggc 5580 gggaccagag aaaaatcact cagggtcaat gccagcgctt cgttaataca gatgtaggtg 5640 ttccacaggg tagccagcag catcctgcga tgcagatccg gaacataatg gtgcagggcg 5700 ctgacttccg cgtttccaga ctttacgaaa cacggaaacc gaagaccatt catgttgttg 5760 ctcaggtcgc agacgttttg cagcagcagt cgcttcacgt tcgctcgcgt atcggtgatt 5820 cattctgcta accagtaagg caaccccgcc agcctagccg ggtcctcaac gacaggagca 5880 cgatcatgct agtcatgccc cgcgcccacc ggaaggagct gactgggttg aaggctctca 5940 agggcatcgg tcgagatccc ggtgcctaat gagtgagcta acttacatta attgcgttgc 6000 gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc 6060 aacgcgcggg gagaggcggt ttgcgtattg ggcgccaggg tggtttttct tttcaccagt 6120 gagacgggca acagctgatt gcccttcacc gcctggccct gagagagttg cagcaagcgg 6180 tccacgctgg tttgccccag caggcgaaaa tcctgtttga tggtggttaa cggcgggata 6240 taacatgagc tgtcttcggt atcgtcgtat cccactaccg agatgtccgc accaacgcgc 6300 agcccggact cggtaatggc gcgcattgcg cccagcgcca tctgatcgtt ggcaaccagc 6360 atcgcagtgg gaacgatgcc ctcattcagc atttgcatgg tttgttgaaa accggacatg 6420 gcactccagt cgccttcccg ttccgctatc ggctgaattt gattgcgagt gagatattta 6480 tgccagccag ccagacgcag acgcgccgag acagaactta atgggcccgc taacagcgcg 6540 atttgctggt gacccaatgc gaccagatgc tccacgccca gtcgcgtacc gtcttcatgg 6600 gagaaaataa tactgttgat gggtgtctgg tcagagacat caagaaataa cgccggaaca 6660 ttagtgcagg cagcttccac agcaatggca tcctggtcat ccagcggata gttaatgatc 6720 agcccactga cgcgttgcgc gagaagattg tgcaccgccg ctttacaggc ttcgacgccg 6780 cttcgttcta ccatcgacac caccacgctg gcacccagtt gatcggcgcg agatttaatc 6840 gccgcgacaa tttgcgacgg cgcgtgcagg gccagactgg aggtggcaac gccaatcagc 6900 aacgactgtt tgcccgccag ttgttgtgcc acgcggttgg gaatgtaatt cagctccgcc 6960 atcgccgctt ccactttttc ccgcgttttc gcagaaacgt ggctggcctg gttcaccacg 7020 cgggaaacgg tctgataaga gacaccggca tactctgcga catcgtataa cgttactggt 7080 ttcacattca ccaccctgaa ttgactctct tccgggcgct atcatgccat accgcgaaag 7140 gttttgcgcc attcgatggt gtccgggatc tcgacgctct cccttatgcg actcctgcat 7200 taggaagcag cccagtagta ggttgaggcc gttgagcacc gccgccgcaa ggaatggtgc 7260 atgcaaggag atggcgccca acagtccccc ggccacgggg cctgccacca tacccacgcc 7320 gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct tccccatcgg tgatgtcggc 7380 gatataggcg ccagcaaccg cacctgtggc gccggtgatg ccggccacga tgcgtccggc 7440 gtagaggatc gagatcgatc tcgatcccgc gaaattaata cgactcacta ta 7492 <210> SEQ ID NO 187 <211> LENGTH: 6233 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pDUET-tesB, plasmid <400> SEQUENCE: 187 ggggaattgt gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag 60 gagatatacc atgggcagca gccatcacca tcatcaccac agccaggatc cgaattcgag 120 ctcggcgcgc ctgcaggtcg acaagcttgc ggccgcataa tgcttaagtc gaacagaaag 180 taatcgtatt gtacacggcc gcataatcga aattaatacg actcactata ggggaattgt 240 gagcggataa caattcccca tcttagtata ttagttaagt ataagaagga gatatacata 300 tgagtcaggc acttaaaaat ttacttactt tacttaatct tgaaaaaata gaagaaggtt 360 tatttagagg acagtcagaa gatttaggat taagacaagt atttggaggt caagtagttg 420 gtcaggcact ttatgcagct aaagaaactg tacctgaaga aagacttgtt catagttttc 480 attcttattt tcttagacct ggagattcta aaaaaccaat tatatatgat gtagaaactc 540 ttagagatgg aaattcattt agtgcaagaa gagttgcagc tattcaaaat ggtaaaccta 600 tattttacat gacagcttct tttcaagcac cagaagctgg atttgaacat cagaaaacta 660 tgccttcagc acctgctcca gatggattac catcagaaac acaaatagca cagagtttag 720 ctcatttact tcctccagta cttaaagata aatttatttg tgatagacct ttagaagtta 780 gaccagttga atttcataat cctcttaaag gacatgtagc agaaccacat agacaagttt 840 ggataagagc taatggaagt gtaccagatg atcttagagt tcatcagtat cttcttggtt 900 atgcatctga tttaaatttt cttcctgtag ctttacaacc acatggaata ggttttcttg 960 aacctggaat acagatagca actatagatc attcaatgtg gtttcataga ccatttaatc 1020 ttaatgaatg gcttctttat agtgtagaat ctacatcagc aagttctgct agaggatttg 1080 ttaggggtga attttatact caagatggag tacttgttgc tagtacagta caggaaggtg 1140 ttatgagaaa tcataattaa ggtaccctcg agtctggtaa agaaaccgct gctgcgaaat 1200 ttgaacgcca gcacatggac tcgtctacta gcgcagctta attaacctag gctgctgcca 1260 ccgctgagca ataactagca taaccccttg gggcctctaa acgggtcttg aggggttttt 1320 tgctgaaagg aggaactata tccggattgg cgaatgggac gcgccctgta gcggcgcatt 1380 aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc 1440 gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca 1500 agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc 1560 caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat agacggtttt 1620 tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac 1680 aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc cgatttcggc 1740 ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt 1800 aacgtttaca atttctggcg gcacgatggc atgagattat caaaaaggat cttcacctag 1860 atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 1920 tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt 1980 tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca 2040 tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca 2100 gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc 2160 tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt 2220 ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg 2280 gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc 2340 aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg 2400 ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga 2460 tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga 2520 ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta 2580 aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg 2640 ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact 2700 ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata 2760 agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatcatg attgaagcat 2820 ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca 2880 aataggtcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 2940 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 3000 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 3060 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 3120 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 3180 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 3240 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 3300 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 3360 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 3420 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 3480 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 3540 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 3600 ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 3660 ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 3720 aggaagcgga agagcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac 3780 accgcatata tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagta 3840 tacactccgc tatcgctacg tgactgggtc atggctgcgc cccgacaccc gccaacaccc 3900 gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca agctgtgacc 3960 gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcgaggcag 4020 ctgcggtaaa gctcatcagc gtggtcgtga agcgattcac agatgtctgc ctgttcatcc 4080 gcgtccagct cgttgagttt ctccagaagc gttaatgtct ggcttctgat aaagcgggcc 4140 atgttaaggg cggttttttc ctgtttggtc actgatgcct ccgtgtaagg gggatttctg 4200 ttcatggggg taatgatacc gatgaaacga gagaggatgc tcacgatacg ggttactgat 4260 gatgaacatg cccggttact ggaacgttgt gagggtaaac aactggcggt atggatgcgg 4320 cgggaccaga gaaaaatcac tcagggtcaa tgccagcgct tcgttaatac agatgtaggt 4380 gttccacagg gtagccagca gcatcctgcg atgcagatcc ggaacataat ggtgcagggc 4440 gctgacttcc gcgtttccag actttacgaa acacggaaac cgaagaccat tcatgttgtt 4500 gctcaggtcg cagacgtttt gcagcagcag tcgcttcacg ttcgctcgcg tatcggtgat 4560 tcattctgct aaccagtaag gcaaccccgc cagcctagcc gggtcctcaa cgacaggagc 4620 acgatcatgc tagtcatgcc ccgcgcccac cggaaggagc tgactgggtt gaaggctctc 4680 aagggcatcg gtcgagatcc cggtgcctaa tgagtgagct aacttacatt aattgcgttg 4740 cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc 4800 caacgcgcgg ggagaggcgg tttgcgtatt gggcgccagg gtggtttttc ttttcaccag 4860 tgagacgggc aacagctgat tgcccttcac cgcctggccc tgagagagtt gcagcaagcg 4920 gtccacgctg gtttgcccca gcaggcgaaa atcctgtttg atggtggtta acggcgggat 4980 ataacatgag ctgtcttcgg tatcgtcgta tcccactacc gagatgtccg caccaacgcg 5040 cagcccggac tcggtaatgg cgcgcattgc gcccagcgcc atctgatcgt tggcaaccag 5100 catcgcagtg ggaacgatgc cctcattcag catttgcatg gtttgttgaa aaccggacat 5160 ggcactccag tcgccttccc gttccgctat cggctgaatt tgattgcgag tgagatattt 5220 atgccagcca gccagacgca gacgcgccga gacagaactt aatgggcccg ctaacagcgc 5280 gatttgctgg tgacccaatg cgaccagatg ctccacgccc agtcgcgtac cgtcttcatg 5340 ggagaaaata atactgttga tgggtgtctg gtcagagaca tcaagaaata acgccggaac 5400 attagtgcag gcagcttcca cagcaatggc atcctggtca tccagcggat agttaatgat 5460 cagcccactg acgcgttgcg cgagaagatt gtgcaccgcc gctttacagg cttcgacgcc 5520 gcttcgttct accatcgaca ccaccacgct ggcacccagt tgatcggcgc gagatttaat 5580 cgccgcgaca atttgcgacg gcgcgtgcag ggccagactg gaggtggcaa cgccaatcag 5640 caacgactgt ttgcccgcca gttgttgtgc cacgcggttg ggaatgtaat tcagctccgc 5700 catcgccgct tccacttttt cccgcgtttt cgcagaaacg tggctggcct ggttcaccac 5760 gcgggaaacg gtctgataag agacaccggc atactctgcg acatcgtata acgttactgg 5820 tttcacattc accaccctga attgactctc ttccgggcgc tatcatgcca taccgcgaaa 5880 ggttttgcgc cattcgatgg tgtccgggat ctcgacgctc tcccttatgc gactcctgca 5940 ttaggaagca gcccagtagt aggttgaggc cgttgagcac cgccgccgca aggaatggtg 6000 catgcaagga gatggcgccc aacagtcccc cggccacggg gcctgccacc atacccacgc 6060 cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg gtgatgtcgg 6120 cgatataggc gccagcaacc gcacctgtgg cgccggtgat gccggccacg atgcgtccgg 6180 cgtagaggat cgagatcgat ctcgatcccg cgaaattaat acgactcact ata 6233 <210> SEQ ID NO 188 <211> LENGTH: 3120 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: codon optimized gene cassette containing the Wood-Ljungdahl promoter in front of the genes meaB, hcmA and hcmB <400> SEQUENCE: 188 atgacttatg taccatcatc agcactttta gaacaactta gagcaggaaa tacttgggct 60 ttaggaagac ttatatcaag agcagaagct ggagttgcag aagctagacc tgcacttgct 120 gaagtatata gacatgcagg ttcagctcat gttataggtt taacaggagt accaggatct 180 ggtaaatcaa ctcttgtagc aaaacttaca gcagctctta gaaaaagagg agaaaaagtt 240 ggtatagtag ctattgatcc tagttctcca tatagtggag gagcaatact tggagataga 300 attagaatga ctgaattagc aaatgattca ggagtattta taagaagtat ggcaactaga 360 ggtgctactg gaggaatggc tagagcagct cttgatgcag ttgatttact tgatgtagct 420 ggatatcata ctattatttt agaaacagtt ggagtaggtc aagatgaagt tgaagtagca 480 catgcttctg atactacagt agttgtatca gcacctggac ttggtgatga aatacaggca 540 attaaagctg gagttttaga aattgctgat attcatgttg taagtaaatg tgatagagat 600 gatgcaaata gaactcttac agatcttaaa caaatgctta ctttaggaac aatggtagga 660 cctaaaagag catgggctat accagttgta ggagtttcaa gttatacagg agaaggtgta 720 gatgatttac ttggtagaat tgcagctcat agacaagcaa ctgctgatac agaacttgga 780 agagaaagaa gaagaagagt agctgaattt agacttcaaa aaactgcaga aacattactt 840 ttagaaagat ttactacagg agcacagcct ttttcaccag cattagctga tagtctttct 900 aatagagcta gtgatcctta tgcagctgca agagaattaa tagctagaac tataagaaaa 960 gaatattcta atgatcttgc atgtgctaaa cttactataa catggttaga accacaaatt 1020 aaaagtcaac ttcagtctga aagaaaagat tgggaagcaa atgaagttgg agcatttctt 1080 aaaaaagcac ctgaaagaaa agaacaattt catacaattg gagattttcc agtacagaga 1140 acttatacag ctgcagatat agcagatact cctcttgaag atattggttt acctggaaga 1200 tatccattta ctagaggacc ttatccaaca atgtatagaa gtagaacttg gacaatgaga 1260 caaatagctg gatttggtac tggagaagat acaaataaaa gatttaaata tcttatagca 1320 cagggtcaga ctggaatatc aacagatttt gatatgccta cattaatggg atatgattca 1380 gatcatccaa tgagtgatgg tgaagttgga agagaaggtg tagctataga tacacttgca 1440 gatatggaag cacttcttgc tgatattgat ttagaaaaaa tttcagttag ttttactata 1500 aatccaagtg catggattct tttagcaatg tatgtagctt taggtgaaaa aagaggttat 1560 gatcttaata aactttctgg aacagtacaa gctgatatac ttaaagaata tatggcacag 1620 aaagaatata tttatcctat agctccaagt gttagaattg taagagatat aattacttat 1680 tctgcaaaaa atcttaaaag atataatcct attaatattt ctggatatca tatatcagaa 1740 gctggttctt caccattaca agaagctgca tttactcttg caaatcttat tacttatgta 1800 aatgaagtaa ctaaaacagg aatgcatgta gatgaatttg cacctagatt agcatttttc 1860 tttgttagtc aaggagattt ctttgaagaa gtagcaaaat ttagagcttt aagaagatgt 1920 tatgctaaaa taatgaaaga aagatttgga gcaagaaatc ctgaatctat gagacttaga 1980 tttcattgtc aaactgctgc agctactctt acaaaaccac agtatatggt taatgttgta 2040 agaacaagtc ttcaagcatt atctgctgta ttgggaggag cacaaagtct tcatactaat 2100 ggatatgatg aagcatttgc tatacctact gaagatgcaa tgaaaatggc tcttagaaca 2160 caacagatta tagctgaaga atctggagtt gcagatgtaa tagatcctct tggaggaagt 2220 tattatgttg aagcattaac tacagaatat gaaaagaaaa tatttgaaat tcttgaagaa 2280 gtagaaaaaa gaggtggaac tattaaactt attgaacaag gatggtttca aaaacagata 2340 gcagattttg cttatgaaac tgcacttaga aaacaatcag gacagaaacc tgttataggt 2400 gtaaatagat ttgttgaaaa tgaagaagat gtaaaaattg aaatacatcc ttatgataat 2460 actacagctg aaagacaaat atcaagaact agaagagtta gagcagaaag agatgaagca 2520 aaagtacaag ctatgcttga tcagttagtt gcagtagcta aagatgaaag tcagaatctt 2580 atgcctctta ctattgaatt agtaaaagca ggagctacaa tgggtgatat tgtagaaaaa 2640 cttaaaggta tttggggaac ttatagagaa acaccagtat tttaagcact agttggagag 2700 cttcccacga tggatcagat tcctattaga gtattattag caaaagtagg tttagatgga 2760 catgatagag gtgtaaaagt tgtagcaaga gcattaagag atgctggaat ggatgtaata 2820 tatagtggtc ttcatagaac tcctgaagaa gtagttaata cagcaattca agaagatgta 2880 gatgttttag gagttagttt actttctggt gtacagctta ctgtttttcc taaaattttt 2940 aaattacttg atgaaagagg agctggtgat ttaatagtaa ttgctggagg agtaatgcca 3000 gatgaagatg cagctgcaat aagaaaactt ggagtaagag aagttttact tcaagataca 3060 ccaccacagg caataataga ttcaataaga agtttagtag cagcaagagg agcaagataa 3120 <210> SEQ ID NO 189 <211> LENGTH: 894 <212> TYPE: PRT <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polypeptide <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: hcmA and meaB fusion <400> SEQUENCE: 189 Met Thr Tyr Val Pro Ser Ser Ala Leu Leu Glu Gln Leu Arg Ala Gly 1 5 10 15 Asn Thr Trp Ala Leu Gly Arg Leu Ile Ser Arg Ala Glu Ala Gly Val 20 25 30 Ala Glu Ala Arg Pro Ala Leu Ala Glu Val Tyr Arg His Ala Gly Ser 35 40 45 Ala His Val Ile Gly Leu Thr Gly Val Pro Gly Ser Gly Lys Ser Thr 50 55 60 Leu Val Ala Lys Leu Thr Ala Ala Leu Arg Lys Arg Gly Glu Lys Val 65 70 75 80 Gly Ile Val Ala Ile Asp Pro Ser Ser Pro Tyr Ser Gly Gly Ala Ile 85 90 95 Leu Gly Asp Arg Ile Arg Met Thr Glu Leu Ala Asn Asp Ser Gly Val 100 105 110 Phe Ile Arg Ser Met Ala Thr Arg Gly Ala Thr Gly Gly Met Ala Arg 115 120 125 Ala Ala Leu Asp Ala Val Asp Leu Leu Asp Val Ala Gly Tyr His Thr 130 135 140 Ile Ile Leu Glu Thr Val Gly Val Gly Gln Asp Glu Val Glu Val Ala 145 150 155 160 His Ala Ser Asp Thr Thr Val Val Val Ser Ala Pro Gly Leu Gly Asp 165 170 175 Glu Ile Gln Ala Ile Lys Ala Gly Val Leu Glu Ile Ala Asp Ile His 180 185 190 Val Val Ser Lys Cys Asp Arg Asp Asp Ala Asn Arg Thr Leu Thr Asp 195 200 205 Leu Lys Gln Met Leu Thr Leu Gly Thr Met Val Gly Pro Lys Arg Ala 210 215 220 Trp Ala Ile Pro Val Val Gly Val Ser Ser Tyr Thr Gly Glu Gly Val 225 230 235 240 Asp Asp Leu Leu Gly Arg Ile Ala Ala His Arg Gln Ala Thr Ala Asp 245 250 255 Thr Glu Leu Gly Arg Glu Arg Arg Arg Arg Val Ala Glu Phe Arg Leu 260 265 270 Gln Lys Thr Ala Glu Thr Leu Leu Leu Glu Arg Phe Thr Thr Gly Ala 275 280 285 Gln Pro Phe Ser Pro Ala Leu Ala Asp Ser Leu Ser Asn Arg Ala Ser 290 295 300 Asp Pro Tyr Ala Ala Ala Arg Glu Leu Ile Ala Arg Thr Ile Arg Lys 305 310 315 320 Glu Tyr Ser Asn Asp Leu Ala Cys Ala Lys Leu Thr Ile Thr Trp Leu 325 330 335 Glu Pro Gln Ile Lys Ser Gln Leu Gln Ser Glu Arg Lys Asp Trp Glu 340 345 350 Ala Asn Glu Val Gly Ala Phe Leu Lys Lys Ala Pro Glu Arg Lys Glu 355 360 365 Gln Phe His Thr Ile Gly Asp Phe Pro Val Gln Arg Thr Tyr Thr Ala 370 375 380 Ala Asp Ile Ala Asp Thr Pro Leu Glu Asp Ile Gly Leu Pro Gly Arg 385 390 395 400 Tyr Pro Phe Thr Arg Gly Pro Tyr Pro Thr Met Tyr Arg Ser Arg Thr 405 410 415 Trp Thr Met Arg Gln Ile Ala Gly Phe Gly Thr Gly Glu Asp Thr Asn 420 425 430 Lys Arg Phe Lys Tyr Leu Ile Ala Gln Gly Gln Thr Gly Ile Ser Thr 435 440 445 Asp Phe Asp Met Pro Thr Leu Met Gly Tyr Asp Ser Asp His Pro Met 450 455 460 Ser Asp Gly Glu Val Gly Arg Glu Gly Val Ala Ile Asp Thr Leu Ala 465 470 475 480 Asp Met Glu Ala Leu Leu Ala Asp Ile Asp Leu Glu Lys Ile Ser Val 485 490 495 Ser Phe Thr Ile Asn Pro Ser Ala Trp Ile Leu Leu Ala Met Tyr Val 500 505 510 Ala Leu Gly Glu Lys Arg Gly Tyr Asp Leu Asn Lys Leu Ser Gly Thr 515 520 525 Val Gln Ala Asp Ile Leu Lys Glu Tyr Met Ala Gln Lys Glu Tyr Ile 530 535 540 Tyr Pro Ile Ala Pro Ser Val Arg Ile Val Arg Asp Ile Ile Thr Tyr 545 550 555 560 Ser Ala Lys Asn Leu Lys Arg Tyr Asn Pro Ile Asn Ile Ser Gly Tyr 565 570 575 His Ile Ser Glu Ala Gly Ser Ser Pro Leu Gln Glu Ala Ala Phe Thr 580 585 590 Leu Ala Asn Leu Ile Thr Tyr Val Asn Glu Val Thr Lys Thr Gly Met 595 600 605 His Val Asp Glu Phe Ala Pro Arg Leu Ala Phe Phe Phe Val Ser Gln 610 615 620 Gly Asp Phe Phe Glu Glu Val Ala Lys Phe Arg Ala Leu Arg Arg Cys 625 630 635 640 Tyr Ala Lys Ile Met Lys Glu Arg Phe Gly Ala Arg Asn Pro Glu Ser 645 650 655 Met Arg Leu Arg Phe His Cys Gln Thr Ala Ala Ala Thr Leu Thr Lys 660 665 670 Pro Gln Tyr Met Val Asn Val Val Arg Thr Ser Leu Gln Ala Leu Ser 675 680 685 Ala Val Leu Gly Gly Ala Gln Ser Leu His Thr Asn Gly Tyr Asp Glu 690 695 700 Ala Phe Ala Ile Pro Thr Glu Asp Ala Met Lys Met Ala Leu Arg Thr 705 710 715 720 Gln Gln Ile Ile Ala Glu Glu Ser Gly Val Ala Asp Val Ile Asp Pro 725 730 735 Leu Gly Gly Ser Tyr Tyr Val Glu Ala Leu Thr Thr Glu Tyr Glu Lys 740 745 750 Lys Ile Phe Glu Ile Leu Glu Glu Val Glu Lys Arg Gly Gly Thr Ile 755 760 765 Lys Leu Ile Glu Gln Gly Trp Phe Gln Lys Gln Ile Ala Asp Phe Ala 770 775 780 Tyr Glu Thr Ala Leu Arg Lys Gln Ser Gly Gln Lys Pro Val Ile Gly 785 790 795 800 Val Asn Arg Phe Val Glu Asn Glu Glu Asp Val Lys Ile Glu Ile His 805 810 815 Pro Tyr Asp Asn Thr Thr Ala Glu Arg Gln Ile Ser Arg Thr Arg Arg 820 825 830 Val Arg Ala Glu Arg Asp Glu Ala Lys Val Gln Ala Met Leu Asp Gln 835 840 845 Leu Val Ala Val Ala Lys Asp Glu Ser Gln Asn Leu Met Pro Leu Thr 850 855 860 Ile Glu Leu Val Lys Ala Gly Ala Thr Met Gly Asp Ile Val Glu Lys 865 870 875 880 Leu Lys Gly Ile Trp Gly Thr Tyr Arg Glu Thr Pro Val Phe 885 890 <210> SEQ ID NO 190 <211> LENGTH: 849 <212> TYPE: DNA <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: hbd <400> SEQUENCE: 190 atgagtatta aaagtgtagc ggttttaggt agtggaacta tgtctcgtgg aattgtgcag 60 gcttttgcag aagcaggtat agatgtaatt atccgtggaa gaactgaagg tagtattgga 120 aaaggtctag cagcagtaaa gaaagcttat gataaaaaag tatcaaaggg gaaaatttcc 180 caggaagatg ctgataaaat agttggaaga gtaagtacaa caactgaact tgaaaaattg 240 gctgattgtg atcttataat agaagcagca tcagaggata tgaatataaa gaaagactat 300 tttggaaaat tagaagaaat atgcaagcct gaaacaattt ttgctactaa tacttcttca 360 ttatctataa ctgaagtagc aacagctaca aagagaccag ataaattcat aggaatgcat 420 ttctttaatc cagcaaatgt tatgaaatta gttgaaatca taagaggtat gaatacttca 480 caagaaactt ttgatattat aaaagaagct tccattaaaa taggaaaaac tcctgtagaa 540 gttgcagaag ctccaggatt tgttgtaaac aagatattag taccaatgat caatgaagca 600 gtaggaattt tggcagaagg aatagcttca gcagaagata tcgatacagc tatgaaatta 660 ggcgctaatc acccaatggg tcctttagca ttaggagatc ttattggact tgatgtagtt 720 cttgcagtta tggatgtact ttatagtgaa actggagatt caaaatatag agctcataca 780 ttacttagaa aatatgtaag agcaggatgg cttggaagaa aatcaggaaa aggattcttc 840 gcttattaa 849 <210> SEQ ID NO 191 <211> LENGTH: 10647 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB <400> SEQUENCE: 191 cctgcaggat aaaaaaattg tagataaatt ttataaaata gttttatcta caattttttt 60 atcaggaaac agctatgacc gcggccgcaa tatgatattt atgtccattg tgaaagggat 120 tatattcaac tattattcca gttacgttca tagaaatttt cctttctaaa atattttatt 180 ccatgtcaag aactctgttt atttcattaa agaactataa gtacaaagta taaggcattt 240 gaaaaaatag gctagtatat tgattgatta tttattttaa aatgcctaag tgaaatatat 300 acatattata acaataaaat aagtattagt gtaggatttt taaatagagt atctattttc 360 agattaaatt tttgattatt tgatttacat tatataatat tgagtaaagt attgactagc 420 aaaatttttt gatactttaa tttgtgaaat ttcttatcaa aagttatatt tttgaataat 480 ttttattgaa aaatacaact aaaaaggatt atagtataag tgtgtgtaat tttgtgttaa 540 atttaaaggg aggaaatgaa catgaaacat atgaaagaag ttgtaatagc tagtgcagta 600 agaacagcga ttggatctta tggaaagtct cttaaggatg taccagcagt agatttagga 660 gctacagcta taaaggaagc agttaaaaaa gcaggaataa aaccagagga tgttaatgaa 720 gtcattttag gaaatgttct tcaagcaggt ttaggacaga atccagcaag acaggcatct 780 tttaaagcag gattaccagt tgaaattcca gctatgacta ttaataaggt ttgtggttca 840 ggacttagaa cagttagctt agcagcacaa attataaaag caggagatgc tgacgtaata 900 atagcaggtg gtatggaaaa tatgtctaga gctccttact tagcgaataa cgctagatgg 960 ggatatagaa tgggaaacgc taaatttgtt gatgaaatga tcactgacgg attgtgggat 1020 gcatttaatg attaccacat gggaataaca gcagaaaaca tagctgagag atggaacatt 1080 tcaagagaag aacaagatga gtttgctctt gcatcacaaa aaaaagctga agaagctata 1140 aaatcaggtc aatttaaaga tgaaatagtt cctgtagtaa ttaaaggcag aaagggagaa 1200 actgtagttg atacagatga gcaccctaga tttggatcaa ctatagaagg acttgcaaaa 1260 ttaaaacctg ccttcaaaaa agatggaaca gttacagctg gtaatgcatc aggattaaat 1320 gactgtgcag cagtacttgt aatcatgagt gcagaaaaag ctaaagagct tggagtaaaa 1380 ccacttgcta agatagtttc ttatggttca gcaggagttg acccagcaat aatgggatat 1440 ggacctttct atgcaacaaa agcagctatt gaaaaagcag gttggacagt tgatgaatta 1500 gatttaatag aatcaaatga agcttttgca gctcaaagtt tagcagtagc aaaagattta 1560 aaatttgata tgaataaagt aaatgtaaat ggaggagcta ttgcccttgg tcatccaatt 1620 ggagcatcag gtgcaagaat actcgttact cttgtacacg caatgcaaaa aagagatgca 1680 aaaaaaggct tagcaacttt atgtataggt ggcggacaag gaacagcaat attgctagaa 1740 aagtgctagg aattctcaaa aattcggtta aataaaataa ttaggaggtt caatcatgtc 1800 tattaaatca gttgcagttt taggttcagg tacaatgtca agaggtattg ttcaagcatt 1860 tgctgaagca ggtatagatg taataattag aggtagaaca gaaggatcaa taggaaaagg 1920 acttgctgct gttaagaaag catacgataa aaaggtaagt aaaggaaaga tatcacaaga 1980 agatgctgat aaaatagttg gtagagtatc tactactaca gaattagaaa aattagcaga 2040 ttgcgacctt ataattgagg ctgcatcaga agatatgaac ataaagaaag attattttgg 2100 aaaacttgaa gaaatatgta aaccagaaac tatttttgct actaatacat caagtttaag 2160 tattacagaa gtagcaacag caactaaaag accagataag ttcataggaa tgcacttctt 2220 taatcctgct aatgtaatga agcttgtaga gattataaga ggtatgaata cttctcagga 2280 aacatttgat ataattaagg aagcaagtat taaaatagga aaaactcctg tagaagtagc 2340 agaagcacca ggatttgttg ttaataagat acttgttcct atgataaatg aggctgtagg 2400 tatacttgct gaaggtattg ctagtgctga agacatagac actgctatga agttaggtgc 2460 aaaccatcct atgggaccat tagcattagg tgatcttatt ggattagatg ttgttttagc 2520 agtaatggat gtactttatt ctgagacagg tgattctaaa tatagagctc atacacttct 2580 tagaaagtat gtaagagctg gttggttagg tagaaagtct ggtaaaggat ttttcgcata 2640 ttaaggtacc gcagatagtc ataatagttc cagaatagtt caatttagaa attagactaa 2700 acttcaaaat gtttgttaaa tatataccaa actagtatag atatttttta aatactggac 2760 ttaaacagta gtaatttgcc taaaaaattt tttcaatttt ttttaaaaaa tccttttcaa 2820 gttgtacatt gttatggtaa tatgtaattg aagaagttat gtagtaatat tgtaaacgtt 2880 tcttgatttt tttacatcca tgtagtgctt aaaaaaccaa aatatgtcac atgcaattgt 2940 atatttcaaa taacaatatt tattttctcg ttaaattcac aaataattta ttaataatat 3000 caataaccaa gattatactt aaatggatgt ttatttttta acacttttat agtaaatata 3060 tttattttat gtagtaaaaa ggttataatt ataattgtat ttattacaat taattaaaat 3120 aaaaaatagg gttttaggta aaattaagtt attttaagaa gtaattacaa taaaaattga 3180 agttatttct ttaaggaggg aattattcat atgacttatg taccatcatc agcactttta 3240 gaacaactta gagcaggaaa tacttgggct ttaggaagac ttatatcaag agcagaagct 3300 ggagttgcag aagctagacc tgcacttgct gaagtatata gacatgcagg ttcagctcat 3360 gttataggtt taacaggagt accaggatct ggtaaatcaa ctcttgtagc aaaacttaca 3420 gcagctctta gaaaaagagg agaaaaagtt ggtatagtag ctattgatcc tagttctcca 3480 tatagtggag gagcaatact tggagataga attagaatga ctgaattagc aaatgattca 3540 ggagtattta taagaagtat ggcaactaga ggtgctactg gaggaatggc tagagcagct 3600 cttgatgcag ttgatttact tgatgtagct ggatatcata ctattatttt agaaacagtt 3660 ggagtaggtc aagatgaagt tgaagtagca catgcttctg atactacagt agttgtatca 3720 gcacctggac ttggtgatga aatacaggca attaaagctg gagttttaga aattgctgat 3780 attcatgttg taagtaaatg tgatagagat gatgcaaata gaactcttac agatcttaaa 3840 caaatgctta ctttaggaac aatggtagga cctaaaagag catgggctat accagttgta 3900 ggagtttcaa gttatacagg agaaggtgta gatgatttac ttggtagaat tgcagctcat 3960 agacaagcaa ctgctgatac agaacttgga agagaaagaa gaagaagagt agctgaattt 4020 agacttcaaa aaactgcaga aacattactt ttagaaagat ttactacagg agcacagcct 4080 ttttcaccag cattagctga tagtctttct aatagagcta gtgatcctta tgcagctgca 4140 agagaattaa tagctagaac tataagaaaa gaatattcta atgatcttgc atgtgctaaa 4200 cttactataa catggttaga accacaaatt aaaagtcaac ttcagtctga aagaaaagat 4260 tgggaagcaa atgaagttgg agcatttctt aaaaaagcac ctgaaagaaa agaacaattt 4320 catacaattg gagattttcc agtacagaga acttatacag ctgcagatat agcagatact 4380 cctcttgaag atattggttt acctggaaga tatccattta ctagaggacc ttatccaaca 4440 atgtatagaa gtagaacttg gacaatgaga caaatagctg gatttggtac tggagaagat 4500 acaaataaaa gatttaaata tcttatagca cagggtcaga ctggaatatc aacagatttt 4560 gatatgccta cattaatggg atatgattca gatcatccaa tgagtgatgg tgaagttgga 4620 agagaaggtg tagctataga tacacttgca gatatggaag cacttcttgc tgatattgat 4680 ttagaaaaaa tttcagttag ttttactata aatccaagtg catggattct tttagcaatg 4740 tatgtagctt taggtgaaaa aagaggttat gatcttaata aactttctgg aacagtacaa 4800 gctgatatac ttaaagaata tatggcacag aaagaatata tttatcctat agctccaagt 4860 gttagaattg taagagatat aattacttat tctgcaaaaa atcttaaaag atataatcct 4920 attaatattt ctggatatca tatatcagaa gctggttctt caccattaca agaagctgca 4980 tttactcttg caaatcttat tacttatgta aatgaagtaa ctaaaacagg aatgcatgta 5040 gatgaatttg cacctagatt agcatttttc tttgttagtc aaggagattt ctttgaagaa 5100 gtagcaaaat ttagagcttt aagaagatgt tatgctaaaa taatgaaaga aagatttgga 5160 gcaagaaatc ctgaatctat gagacttaga tttcattgtc aaactgctgc agctactctt 5220 acaaaaccac agtatatggt taatgttgta agaacaagtc ttcaagcatt atctgctgta 5280 ttgggaggag cacaaagtct tcatactaat ggatatgatg aagcatttgc tatacctact 5340 gaagatgcaa tgaaaatggc tcttagaaca caacagatta tagctgaaga atctggagtt 5400 gcagatgtaa tagatcctct tggaggaagt tattatgttg aagcattaac tacagaatat 5460 gaaaagaaaa tatttgaaat tcttgaagaa gtagaaaaaa gaggtggaac tattaaactt 5520 attgaacaag gatggtttca aaaacagata gcagattttg cttatgaaac tgcacttaga 5580 aaacaatcag gacagaaacc tgttataggt gtaaatagat ttgttgaaaa tgaagaagat 5640 gtaaaaattg aaatacatcc ttatgataat actacagctg aaagacaaat atcaagaact 5700 agaagagtta gagcagaaag agatgaagca aaagtacaag ctatgcttga tcagttagtt 5760 gcagtagcta aagatgaaag tcagaatctt atgcctctta ctattgaatt agtaaaagca 5820 ggagctacaa tgggtgatat tgtagaaaaa cttaaaggta tttggggaac ttatagagaa 5880 acaccagtat tttaagcact agttggagag cttcccacga tggatcagat tcctattaga 5940 gtattattag caaaagtagg tttagatgga catgatagag gtgtaaaagt tgtagcaaga 6000 gcattaagag atgctggaat ggatgtaata tatagtggtc ttcatagaac tcctgaagaa 6060 gtagttaata cagcaattca agaagatgta gatgttttag gagttagttt actttctggt 6120 gtacagctta ctgtttttcc taaaattttt aaattacttg atgaaagagg agctggtgat 6180 ttaatagtaa ttgctggagg agtaatgcca gatgaagatg cagctgcaat aagaaaactt 6240 ggagtaagag aagttttact tcaagataca ccaccacagg caataataga ttcaataaga 6300 agtttagtag cagcaagagg agcaagataa ccatggagat ctcgaggcct gcagacatgc 6360 aagcttggca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca 6420 acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg aagaggcccg 6480 caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatggcgct agcataaaaa 6540 taagaagcct gcatttgcag gcttcttatt tttatggcgc gccgccatta tttttttgaa 6600 caattgacaa ttcatttctt attttttatt aagtgatagt caaaaggcat aacagtgctg 6660 aatagaaaga aatttacaga aaagaaaatt atagaattta gtatgattaa ttatactcat 6720 ttatgaatgt ttaattgaat acaaaaaaaa atacttgtta tgtattcaat tacgggttaa 6780 aatatagaca agttgaaaaa tttaataaaa aaataagtcc tcagctctta tatattaagc 6840 taccaactta gtatataagc caaaacttaa atgtgctacc aacacatcaa gccgttagag 6900 aactctatct atagcaatat ttcaaatgta ccgacataca agagaaacat taactatata 6960 tattcaattt atgagattat cttaacagat ataaatgtaa attgcaataa gtaagattta 7020 gaagtttata gcctttgtgt attggaagca gtacgcaaag gcttttttat ttgataaaaa 7080 ttagaagtat atttattttt tcataattaa tttatgaaaa tgaaaggggg tgagcaaagt 7140 gacagaggaa agcagtatct tatcaaataa caaggtatta gcaatatcat tattgacttt 7200 agcagtaaac attatgactt ttatagtgct tgtagctaag tagtacgaaa gggggagctt 7260 taaaaagctc cttggaatac atagaattca taaattaatt tatgaaaaga agggcgtata 7320 tgaaaacttg taaaaattgc aaagagttta ttaaagatac tgaaatatgc aaaatacatt 7380 cgttgatgat tcatgataaa acagtagcaa cctattgcag taaatacaat gagtcaagat 7440 gtttacataa agggaaagtc caatgtatta attgttcaaa gatgaaccga tatggatggt 7500 gtgccataaa aatgagatgt tttacagagg aagaacagaa aaaagaacgt acatgcatta 7560 aatattatgc aaggagcttt aaaaaagctc atgtaaagaa gagtaaaaag aaaaaataat 7620 ttatttatta atttaatatt gagagtgccg acacagtatg cactaaaaaa tatatctgtg 7680 gtgtagtgag ccgatacaaa aggatagtca ctcgcatttt cataatacat cttatgttat 7740 gattatgtgt cggtgggact tcacgacgaa aacccacaat aaaaaaagag ttcggggtag 7800 ggttaagcat agttgaggca actaaacaat caagctagga tatgcagtag cagaccgtaa 7860 ggtcgttgtt taggtgtgtt gtaatacata cgctattaag atgtaaaaat acggatacca 7920 atgaagggaa aagtataatt tttggatgta gtttgtttgt tcatctatgg gcaaactacg 7980 tccaaagccg tttccaaatc tgctaaaaag tatatccttt ctaaaatcaa agtcaagtat 8040 gaaatcataa ataaagttta attttgaagt tattatgata ttatgttttt ctattaaaat 8100 aaattaagta tatagaatag tttaataata gtatatactt aatgtgataa gtgtctgaca 8160 gtgtcacaga aaggatgatt gttatggatt ataagcggcc ggccagtggg caagttgaaa 8220 aattcacaaa aatgtggtat aatatctttg ttcattagag cgataaactt gaatttgaga 8280 gggaacttag atggtatttg aaaaaattga taaaaatagt tggaacagaa aagagtattt 8340 tgaccactac tttgcaagtg taccttgtac ctacagcatg accgttaaag tggatatcac 8400 acaaataaag gaaaagggaa tgaaactata tcctgcaatg ctttattata ttgcaatgat 8460 tgtaaaccgc cattcagagt ttaggacggc aatcaatcaa gatggtgaat tggggatata 8520 tgatgagatg ataccaagct atacaatatt tcacaatgat actgaaacat tttccagcct 8580 ttggactgag tgtaagtctg actttaaatc atttttagca gattatgaaa gtgatacgca 8640 acggtatgga aacaatcata gaatggaagg aaagccaaat gctccggaaa acatttttaa 8700 tgtatctatg ataccgtggt caaccttcga tggctttaat ctgaatttgc agaaaggata 8760 tgattatttg attcctattt ttactatggg gaaatattat aaagaagata acaaaattat 8820 acttcctttg gcaattcaag ttcatcacgc agtatgtgac ggatttcaca tttgccgttt 8880 tgtaaacgaa ttgcaggaat tgataaatag ttaacttcag gtttgtctgt aactaaaaac 8940 aagtatttaa gcaaaaacat cgtagaaata cggtgttttt tgttacccta agtttaaact 9000 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 9060 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 9120 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 9180 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct 9240 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 9300 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 9360 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 9420 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 9480 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 9540 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 9600 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 9660 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 9720 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 9780 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 9840 agtgagcgag gaagcggaag agcgcccaat acgcagggcc ccctgcttcg gggtcattat 9900 agcgattttt tcggtatatc catccttttt cgcacgatat acaggatttt gccaaagggt 9960 tcgtgtagac tttccttggt gtatccaacg gcgtcagccg ggcaggatag gtgaagtagg 10020 cccacccgcg agcgggtgtt ccttcttcac tgtcccttat tcgcacctgg cggtgctcaa 10080 cgggaatcct gctctgcgag gctggccggc taccgccggc gtaacagatg agggcaagcg 10140 gatggctgat gaaaccaagc caaccaggaa gggcagccca cctatcaagg tgtactgcct 10200 tccagacgaa cgaagagcga ttgaggaaaa ggcggcggcg gccggcatga gcctgtcggc 10260 ctacctgctg gccgtcggcc agggctacaa aatcacgggc gtcgtggact atgagcacgt 10320 ccgcgagctg gcccgcatca atggcgacct gggccgcctg ggcggcctgc tgaaactctg 10380 gctcaccgac gacccgcgca cggcgcggtt cggtgatgcc acgatcctcg ccctgctggc 10440 gaagatcgaa gagaagcagg acgagcttgg caaggtcatg atgggcgtgg tccgcccgag 10500 ggcagagcca tgactttttt agccgctaaa acggccgggg ggtgcgcgtg attgccaagc 10560 acgtccccat gcgctccatc aagaagagcg acttcgcgga gctggtgaag tacatcaccg 10620 acgagcaagg caagaccgat cgggccc 10647 <210> SEQ ID NO 192 <211> LENGTH: 10539 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL83155-thlA-phaB-Pwl-meaBhcmA-hcmB <400> SEQUENCE: 192 cctgcaggat aaaaaaattg tagataaatt ttataaaata gttttatcta caattttttt 60 atcaggaaac agctatgacc gcggccgcaa tatgatattt atgtccattg tgaaagggat 120 tatattcaac tattattcca gttacgttca tagaaatttt cctttctaaa atattttatt 180 ccatgtcaag aactctgttt atttcattaa agaactataa gtacaaagta taaggcattt 240 gaaaaaatag gctagtatat tgattgatta tttattttaa aatgcctaag tgaaatatat 300 acatattata acaataaaat aagtattagt gtaggatttt taaatagagt atctattttc 360 agattaaatt tttgattatt tgatttacat tatataatat tgagtaaagt attgactagc 420 aaaatttttt gatactttaa tttgtgaaat ttcttatcaa aagttatatt tttgaataat 480 ttttattgaa aaatacaact aaaaaggatt atagtataag tgtgtgtaat tttgtgttaa 540 atttaaaggg aggaaatgaa catgaaacat atgaaagaag ttgtaatagc tagtgcagta 600 agaacagcga ttggatctta tggaaagtct cttaaggatg taccagcagt agatttagga 660 gctacagcta taaaggaagc agttaaaaaa gcaggaataa aaccagagga tgttaatgaa 720 gtcattttag gaaatgttct tcaagcaggt ttaggacaga atccagcaag acaggcatct 780 tttaaagcag gattaccagt tgaaattcca gctatgacta ttaataaggt ttgtggttca 840 ggacttagaa cagttagctt agcagcacaa attataaaag caggagatgc tgacgtaata 900 atagcaggtg gtatggaaaa tatgtctaga gctccttact tagcgaataa cgctagatgg 960 ggatatagaa tgggaaacgc taaatttgtt gatgaaatga tcactgacgg attgtgggat 1020 gcatttaatg attaccacat gggaataaca gcagaaaaca tagctgagag atggaacatt 1080 tcaagagaag aacaagatga gtttgctctt gcatcacaaa aaaaagctga agaagctata 1140 aaatcaggtc aatttaaaga tgaaatagtt cctgtagtaa ttaaaggcag aaagggagaa 1200 actgtagttg atacagatga gcaccctaga tttggatcaa ctatagaagg acttgcaaaa 1260 ttaaaacctg ccttcaaaaa agatggaaca gttacagctg gtaatgcatc aggattaaat 1320 gactgtgcag cagtacttgt aatcatgagt gcagaaaaag ctaaagagct tggagtaaaa 1380 ccacttgcta agatagtttc ttatggttca gcaggagttg acccagcaat aatgggatat 1440 ggacctttct atgcaacaaa agcagctatt gaaaaagcag gttggacagt tgatgaatta 1500 gatttaatag aatcaaatga agcttttgca gctcaaagtt tagcagtagc aaaagattta 1560 aaatttgata tgaataaagt aaatgtaaat ggaggagcta ttgcccttgg tcatccaatt 1620 ggagcatcag gtgcaagaat actcgttact cttgtacacg caatgcaaaa aagagatgca 1680 aaaaaaggct tagcaacttt atgtataggt ggcggacaag gaacagcaat attgctagaa 1740 aagtgctagg aattctcaaa aattcggtta aataaaataa ttaggaggtt caatcatgac 1800 tcagcgcatt gcgtatgtga ccggcggcat gggtggtatc ggaaccgcca tttgccagcg 1860 gctggccaag gatggctttc gtgtggtggc cggttgcggc cccaactcgc cgcgccgcga 1920 aaagtggctg gagcagcaga aggccctggg cttcgatttc attgcctcgg aaggcaatgt 1980 ggctgactgg gactcgacca agaccgcatt cgacaaggtc aagtccgagg tcggcgaggt 2040 tgatgtgctg atcaacaacg ccggtatcac ccgcgacgtg gtgttccgca agatgacccg 2100 cgccgactgg gatgcggtga tcgacaccaa cctgacctcg ctgttcaacg tcaccaagca 2160 ggtgatcgac ggcatggccg accgtggctg gggccgcatc gtcaacatct cgtcggtgaa 2220 cgggcagaag ggccagttcg gccagaccaa ctactccacc gccaaggccg gcctgcatgg 2280 cttcaccatg gcactggcgc aggaagtggc gaccaagggc gtgaccgtca acacggtctc 2340 tccgggctat atcgccaccg acatggtcaa ggcgatccgc caggacgtgc tcgacaagat 2400 cgtcgcgacg atcccggtca agcgcctggg cctgccggaa gagatcgcct cgatctgcgc 2460 ctggttgtcg tcggaggagt ccggtttctc gaccggcgcc gacttctcgc tcaacggcgg 2520 cctgcatatg ggctgaggta ccgcagatag tcataatagt tccagaatag ttcaatttag 2580 aaattagact aaacttcaaa atgtttgtta aatatatacc aaactagtat agatattttt 2640 taaatactgg acttaaacag tagtaatttg cctaaaaaat tttttcaatt ttttttaaaa 2700 aatccttttc aagttgtaca ttgttatggt aatatgtaat tgaagaagtt atgtagtaat 2760 attgtaaacg tttcttgatt tttttacatc catgtagtgc ttaaaaaacc aaaatatgtc 2820 acatgcaatt gtatatttca aataacaata tttattttct cgttaaattc acaaataatt 2880 tattaataat atcaataacc aagattatac ttaaatggat gtttattttt taacactttt 2940 atagtaaata tatttatttt atgtagtaaa aaggttataa ttataattgt atttattaca 3000 attaattaaa ataaaaaata gggttttagg taaaattaag ttattttaag aagtaattac 3060 aataaaaatt gaagttattt ctttaaggag ggaattattc atatgactta tgtaccatca 3120 tcagcacttt tagaacaact tagagcagga aatacttggg ctttaggaag acttatatca 3180 agagcagaag ctggagttgc agaagctaga cctgcacttg ctgaagtata tagacatgca 3240 ggttcagctc atgttatagg tttaacagga gtaccaggat ctggtaaatc aactcttgta 3300 gcaaaactta cagcagctct tagaaaaaga ggagaaaaag ttggtatagt agctattgat 3360 cctagttctc catatagtgg aggagcaata cttggagata gaattagaat gactgaatta 3420 gcaaatgatt caggagtatt tataagaagt atggcaacta gaggtgctac tggaggaatg 3480 gctagagcag ctcttgatgc agttgattta cttgatgtag ctggatatca tactattatt 3540 ttagaaacag ttggagtagg tcaagatgaa gttgaagtag cacatgcttc tgatactaca 3600 gtagttgtat cagcacctgg acttggtgat gaaatacagg caattaaagc tggagtttta 3660 gaaattgctg atattcatgt tgtaagtaaa tgtgatagag atgatgcaaa tagaactctt 3720 acagatctta aacaaatgct tactttagga acaatggtag gacctaaaag agcatgggct 3780 ataccagttg taggagtttc aagttataca ggagaaggtg tagatgattt acttggtaga 3840 attgcagctc atagacaagc aactgctgat acagaacttg gaagagaaag aagaagaaga 3900 gtagctgaat ttagacttca aaaaactgca gaaacattac ttttagaaag atttactaca 3960 ggagcacagc ctttttcacc agcattagct gatagtcttt ctaatagagc tagtgatcct 4020 tatgcagctg caagagaatt aatagctaga actataagaa aagaatattc taatgatctt 4080 gcatgtgcta aacttactat aacatggtta gaaccacaaa ttaaaagtca acttcagtct 4140 gaaagaaaag attgggaagc aaatgaagtt ggagcatttc ttaaaaaagc acctgaaaga 4200 aaagaacaat ttcatacaat tggagatttt ccagtacaga gaacttatac agctgcagat 4260 atagcagata ctcctcttga agatattggt ttacctggaa gatatccatt tactagagga 4320 ccttatccaa caatgtatag aagtagaact tggacaatga gacaaatagc tggatttggt 4380 actggagaag atacaaataa aagatttaaa tatcttatag cacagggtca gactggaata 4440 tcaacagatt ttgatatgcc tacattaatg ggatatgatt cagatcatcc aatgagtgat 4500 ggtgaagttg gaagagaagg tgtagctata gatacacttg cagatatgga agcacttctt 4560 gctgatattg atttagaaaa aatttcagtt agttttacta taaatccaag tgcatggatt 4620 cttttagcaa tgtatgtagc tttaggtgaa aaaagaggtt atgatcttaa taaactttct 4680 ggaacagtac aagctgatat acttaaagaa tatatggcac agaaagaata tatttatcct 4740 atagctccaa gtgttagaat tgtaagagat ataattactt attctgcaaa aaatcttaaa 4800 agatataatc ctattaatat ttctggatat catatatcag aagctggttc ttcaccatta 4860 caagaagctg catttactct tgcaaatctt attacttatg taaatgaagt aactaaaaca 4920 ggaatgcatg tagatgaatt tgcacctaga ttagcatttt tctttgttag tcaaggagat 4980 ttctttgaag aagtagcaaa atttagagct ttaagaagat gttatgctaa aataatgaaa 5040 gaaagatttg gagcaagaaa tcctgaatct atgagactta gatttcattg tcaaactgct 5100 gcagctactc ttacaaaacc acagtatatg gttaatgttg taagaacaag tcttcaagca 5160 ttatctgctg tattgggagg agcacaaagt cttcatacta atggatatga tgaagcattt 5220 gctataccta ctgaagatgc aatgaaaatg gctcttagaa cacaacagat tatagctgaa 5280 gaatctggag ttgcagatgt aatagatcct cttggaggaa gttattatgt tgaagcatta 5340 actacagaat atgaaaagaa aatatttgaa attcttgaag aagtagaaaa aagaggtgga 5400 actattaaac ttattgaaca aggatggttt caaaaacaga tagcagattt tgcttatgaa 5460 actgcactta gaaaacaatc aggacagaaa cctgttatag gtgtaaatag atttgttgaa 5520 aatgaagaag atgtaaaaat tgaaatacat ccttatgata atactacagc tgaaagacaa 5580 atatcaagaa ctagaagagt tagagcagaa agagatgaag caaaagtaca agctatgctt 5640 gatcagttag ttgcagtagc taaagatgaa agtcagaatc ttatgcctct tactattgaa 5700 ttagtaaaag caggagctac aatgggtgat attgtagaaa aacttaaagg tatttgggga 5760 acttatagag aaacaccagt attttaagca ctagttggag agcttcccac gatggatcag 5820 attcctatta gagtattatt agcaaaagta ggtttagatg gacatgatag aggtgtaaaa 5880 gttgtagcaa gagcattaag agatgctgga atggatgtaa tatatagtgg tcttcataga 5940 actcctgaag aagtagttaa tacagcaatt caagaagatg tagatgtttt aggagttagt 6000 ttactttctg gtgtacagct tactgttttt cctaaaattt ttaaattact tgatgaaaga 6060 ggagctggtg atttaatagt aattgctgga ggagtaatgc cagatgaaga tgcagctgca 6120 ataagaaaac ttggagtaag agaagtttta cttcaagata caccaccaca ggcaataata 6180 gattcaataa gaagtttagt agcagcaaga ggagcaagat aaccatggag atctcgaggc 6240 ctgcagacat gcaagcttgg cactggccgt cgttttacaa cgtcgtgact gggaaaaccc 6300 tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag 6360 cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg 6420 ctagcataaa aataagaagc ctgcatttgc aggcttctta tttttatggc gcgccgccat 6480 tatttttttg aacaattgac aattcatttc ttatttttta ttaagtgata gtcaaaaggc 6540 ataacagtgc tgaatagaaa gaaatttaca gaaaagaaaa ttatagaatt tagtatgatt 6600 aattatactc atttatgaat gtttaattga atacaaaaaa aaatacttgt tatgtattca 6660 attacgggtt aaaatataga caagttgaaa aatttaataa aaaaataagt cctcagctct 6720 tatatattaa gctaccaact tagtatataa gccaaaactt aaatgtgcta ccaacacatc 6780 aagccgttag agaactctat ctatagcaat atttcaaatg taccgacata caagagaaac 6840 attaactata tatattcaat ttatgagatt atcttaacag atataaatgt aaattgcaat 6900 aagtaagatt tagaagttta tagcctttgt gtattggaag cagtacgcaa aggctttttt 6960 atttgataaa aattagaagt atatttattt tttcataatt aatttatgaa aatgaaaggg 7020 ggtgagcaaa gtgacagagg aaagcagtat cttatcaaat aacaaggtat tagcaatatc 7080 attattgact ttagcagtaa acattatgac ttttatagtg cttgtagcta agtagtacga 7140 aagggggagc tttaaaaagc tccttggaat acatagaatt cataaattaa tttatgaaaa 7200 gaagggcgta tatgaaaact tgtaaaaatt gcaaagagtt tattaaagat actgaaatat 7260 gcaaaataca ttcgttgatg attcatgata aaacagtagc aacctattgc agtaaataca 7320 atgagtcaag atgtttacat aaagggaaag tccaatgtat taattgttca aagatgaacc 7380 gatatggatg gtgtgccata aaaatgagat gttttacaga ggaagaacag aaaaaagaac 7440 gtacatgcat taaatattat gcaaggagct ttaaaaaagc tcatgtaaag aagagtaaaa 7500 agaaaaaata atttatttat taatttaata ttgagagtgc cgacacagta tgcactaaaa 7560 aatatatctg tggtgtagtg agccgataca aaaggatagt cactcgcatt ttcataatac 7620 atcttatgtt atgattatgt gtcggtggga cttcacgacg aaaacccaca ataaaaaaag 7680 agttcggggt agggttaagc atagttgagg caactaaaca atcaagctag gatatgcagt 7740 agcagaccgt aaggtcgttg tttaggtgtg ttgtaataca tacgctatta agatgtaaaa 7800 atacggatac caatgaaggg aaaagtataa tttttggatg tagtttgttt gttcatctat 7860 gggcaaacta cgtccaaagc cgtttccaaa tctgctaaaa agtatatcct ttctaaaatc 7920 aaagtcaagt atgaaatcat aaataaagtt taattttgaa gttattatga tattatgttt 7980 ttctattaaa ataaattaag tatatagaat agtttaataa tagtatatac ttaatgtgat 8040 aagtgtctga cagtgtcaca gaaaggatga ttgttatgga ttataagcgg ccggccagtg 8100 ggcaagttga aaaattcaca aaaatgtggt ataatatctt tgttcattag agcgataaac 8160 ttgaatttga gagggaactt agatggtatt tgaaaaaatt gataaaaata gttggaacag 8220 aaaagagtat tttgaccact actttgcaag tgtaccttgt acctacagca tgaccgttaa 8280 agtggatatc acacaaataa aggaaaaggg aatgaaacta tatcctgcaa tgctttatta 8340 tattgcaatg attgtaaacc gccattcaga gtttaggacg gcaatcaatc aagatggtga 8400 attggggata tatgatgaga tgataccaag ctatacaata tttcacaatg atactgaaac 8460 attttccagc ctttggactg agtgtaagtc tgactttaaa tcatttttag cagattatga 8520 aagtgatacg caacggtatg gaaacaatca tagaatggaa ggaaagccaa atgctccgga 8580 aaacattttt aatgtatcta tgataccgtg gtcaaccttc gatggcttta atctgaattt 8640 gcagaaagga tatgattatt tgattcctat ttttactatg gggaaatatt ataaagaaga 8700 taacaaaatt atacttcctt tggcaattca agttcatcac gcagtatgtg acggatttca 8760 catttgccgt tttgtaaacg aattgcagga attgataaat agttaacttc aggtttgtct 8820 gtaactaaaa acaagtattt aagcaaaaac atcgtagaaa tacggtgttt tttgttaccc 8880 taagtttaaa ctcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 8940 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 9000 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 9060 ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 9120 aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 9180 gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 9240 gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 9300 aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 9360 cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 9420 tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 9480 ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 9540 atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 9600 cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 9660 ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 9720 gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaggg ccccctgctt 9780 cggggtcatt atagcgattt tttcggtata tccatccttt ttcgcacgat atacaggatt 9840 ttgccaaagg gttcgtgtag actttccttg gtgtatccaa cggcgtcagc cgggcaggat 9900 aggtgaagta ggcccacccg cgagcgggtg ttccttcttc actgtccctt attcgcacct 9960 ggcggtgctc aacgggaatc ctgctctgcg aggctggccg gctaccgccg gcgtaacaga 10020 tgagggcaag cggatggctg atgaaaccaa gccaaccagg aagggcagcc cacctatcaa 10080 ggtgtactgc cttccagacg aacgaagagc gattgaggaa aaggcggcgg cggccggcat 10140 gagcctgtcg gcctacctgc tggccgtcgg ccagggctac aaaatcacgg gcgtcgtgga 10200 ctatgagcac gtccgcgagc tggcccgcat caatggcgac ctgggccgcc tgggcggcct 10260 gctgaaactc tggctcaccg acgacccgcg cacggcgcgg ttcggtgatg ccacgatcct 10320 cgccctgctg gcgaagatcg aagagaagca ggacgagctt ggcaaggtca tgatgggcgt 10380 ggtccgcccg agggcagagc catgactttt ttagccgcta aaacggccgg ggggtgcgcg 10440 tgattgccaa gcacgtcccc atgcgctcca tcaagaagag cgacttcgcg gagctggtga 10500 agtacatcac cgacgagcaa ggcaagaccg atcgggccc 10539 <210> SEQ ID NO 193 <211> LENGTH: 487 <212> TYPE: DNA <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: promoter region of phosphate acetyltransferase <400> SEQUENCE: 193 ggccgcaata tgatatttat gtccattgtg aaagggatta tattcaacta ttattccagt 60 tacgttcata gaaattttcc tttctaaaat attttattcc atgtcaagaa ctctgtttat 120 ttcattaaag aactataagt acaaagtata aggcatttga aaaaataggc tagtatattg 180 attgattatt tattttaaaa tgcctaagtg aaatatatac atattataac aataaaataa 240 gtattagtgt aggattttta aatagagtat ctattttcag attaaatttt tgattatttg 300 atttacatta tataatattg agtaaagtat tgactagcaa aattttttga tactttaatt 360 tgtgaaattt cttatcaaaa gttatatttt tgaataattt ttattgaaaa atacaactaa 420 aaaggattat agtataagtg tgtgtaattt tgtgttaaat ttaaagggag gaaatgaaca 480 tgaaaca 487 <210> SEQ ID NO 194 <211> LENGTH: 7884 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL82256-ptb-buk <400> SEQUENCE: 194 gagatctcga ggcctgcaga catgcaagct tggcactggc cgtcgtttta caacgtcgtg 60 actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 120 gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 180 atggcgaatg gcgctagcat aaaaataaga agcctgcatt tgcaggcttc ttatttttat 240 ggcgcgccgt tctgaatcct tagctaatgg ttcaacaggt aactatgacg aagatagcac 300 cctggataag tctgtaatgg attctaaggc atttaatgaa gacgtgtata taaaatgtgc 360 taatgaaaaa gaaaatgcgt taaaagagcc taaaatgagt tcaaatggtt ttgaaattga 420 ttggtagttt aatttaatat attttttcta ttggctatct cgatacctat agaatcttct 480 gttcactttt gtttttgaaa tataaaaagg ggctttttag cccctttttt ttaaaactcc 540 ggaggagttt cttcattctt gatactatac gtaactattt tcgatttgac ttcattgtca 600 attaagctag taaaatcaat ggttaaaaaa caaaaaactt gcatttttct acctagtaat 660 ttataatttt aagtgtcgag tttaaaagta taatttacca ggaaaggagc aagtttttta 720 ataaggaaaa atttttcctt ttaaaattct atttcgttat atgactaatt ataatcaaaa 780 aaatgaaaat aaacaagagg taaaaactgc tttagagaaa tgtactgata aaaaaagaaa 840 aaatcctaga tttacgtcat acatagcacc tttaactact aagaaaaata ttgaaaggac 900 ttccacttgt ggagattatt tgtttatgtt gagtgatgca gacttagaac attttaaatt 960 acataaaggt aatttttgcg gtaatagatt ttgtccaatg tgtagttggc gacttgcttg 1020 taaggatagt ttagaaatat ctattcttat ggagcattta agaaaagaag aaaataaaga 1080 gtttatattt ttaactctta caactccaaa tgtaaaaagt tatgatctta attattctat 1140 taaacaatat aataaatctt ttaaaaaatt aatggagcgt aaggaagtta aggatataac 1200 taaaggttat ataagaaaat tagaagtaac ttaccaaaag gaaaaataca taacaaagga 1260 tttatggaaa ataaaaaaag attattatca aaaaaaagga cttgaaattg gtgatttaga 1320 acctaatttt gatacttata atcctcattt tcatgtagtt attgcagtta ataaaagtta 1380 ttttacagat aaaaattatt atataaatcg agaaagatgg ttggaattat ggaagtttgc 1440 tactaaggat gattctataa ctcaagttga tgttagaaaa gcaaaaatta atgattataa 1500 agaggtttac gaacttgcga aatattcagc taaagacact gattatttaa tatcgaggcc 1560 agtatttgaa attttttata aagcattaaa aggcaagcag gtattagttt ttagtggatt 1620 ttttaaagat gcacacaaat tgtacaagca aggaaaactt gatgtttata aaaagaaaga 1680 tgaaattaaa tatgtctata tagtttatta taattggtgc aaaaaacaat atgaaaaaac 1740 tagaataagg gaacttacgg aagatgaaaa agaagaatta aatcaagatt taatagatga 1800 aatagaaata gattaaagtg taactatact ttatatatat atgattaaaa aaataaaaaa 1860 caacagccta ttaggttgtt gttttttatt ttctttatta atttttttaa tttttagttt 1920 ttagttcttt tttaaaataa gtttcagcct ctttttcaat attttttaaa gaaggagtat 1980 ttgcatgaat tgcctttttt ctaacagact taggaaatat tttaacagta tcttcttgcg 2040 ccggtgattt tggaacttca taacttacta atttataatt attattttct tttttaattg 2100 taacagttgc aaaagaagct gaacctgttc cttcaactag tttatcatct tcaatataat 2160 attcttgacc tatatagtat aaatatattt ttattatatt tttacttttt tctgaatcta 2220 ttattttata atcataaaaa gttttaccac caaaagaagg ttgtactcct tctggtccaa 2280 catatttttt tactatatta tctaaataat ttttgggaac tggtgttgta atttgattaa 2340 tcgaacaacc agttatactt aaaggaatta taactataaa aatatatagg attatctttt 2400 taaatttcat tattggcctc ctttttatta aatttatgtt accataaaaa ggacataacg 2460 ggaatatgta gaatattttt aatgtagaca aaattttaca taaatataaa gaaaggaagt 2520 gtttgtttaa attttatagc aaactatcaa aaattagggg gataaaaatt tatgaaaaaa 2580 aggttttcga tgttattttt atgtttaact ttaatagttt gtggtttatt tacaaattcg 2640 gccggccgaa gcaaacttaa gagtgtgttg atagtgcagt atcttaaaat tttgtataat 2700 aggaattgaa gttaaattag atgctaaaaa tttgtaatta agaaggagtg attacatgaa 2760 caaaaatata aaatattctc aaaacttttt aacgagtgaa aaagtactca accaaataat 2820 aaaacaattg aatttaaaag aaaccgatac cgtttacgaa attggaacag gtaaagggca 2880 tttaacgacg aaactggcta aaataagtaa acaggtaacg tctattgaat tagacagtca 2940 tctattcaac ttatcgtcag aaaaattaaa actgaatact cgtgtcactt taattcacca 3000 agatattcta cagtttcaat tccctaacaa acagaggtat aaaattgttg ggagtattcc 3060 ttaccattta agcacacaaa ttattaaaaa agtggttttt gaaagccatg cgtctgacat 3120 ctatctgatt gttgaagaag gattctacaa gcgtaccttg gatattcacc gaacactagg 3180 gttgctcttg cacactcaag tctcgattca gcaattgctt aagctgccag cggaatgctt 3240 tcatcctaaa ccaaaagtaa acagtgtctt aataaaactt acccgccata ccacagatgt 3300 tccagataaa tattggaagc tatatacgta ctttgtttca aaatgggtca atcgagaata 3360 tcgtcaactg tttactaaaa atcagtttca tcaagcaatg aaacacgcca aagtaaacaa 3420 tttaagtacc gttacttatg agcaagtatt gtctattttt aatagttatc tattatttaa 3480 cgggaggaaa taattctatg agtcgctttt gtaaatttgg aaagttacac gttactaaag 3540 ggaatgtgtt taaactcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 3600 cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 3660 ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 3720 tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 3780 taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 3840 caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 3900 agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 3960 gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 4020 gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 4080 ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 4140 acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 4200 tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 4260 ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt 4320 ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga 4380 ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc agggccccct 4440 gcttcggggt cattatagcg attttttcgg tatatccatc ctttttcgca cgatatacag 4500 gattttgcca aagggttcgt gtagactttc cttggtgtat ccaacggcgt cagccgggca 4560 ggataggtga agtaggccca cccgcgagcg ggtgttcctt cttcactgtc ccttattcgc 4620 acctggcggt gctcaacggg aatcctgctc tgcgaggctg gccggctacc gccggcgtaa 4680 cagatgaggg caagcggatg gctgatgaaa ccaagccaac caggaagggc agcccaccta 4740 tcaaggtgta ctgccttcca gacgaacgaa gagcgattga ggaaaaggcg gcggcggccg 4800 gcatgagcct gtcggcctac ctgctggccg tcggccaggg ctacaaaatc acgggcgtcg 4860 tggactatga gcacgtccgc gagctggccc gcatcaatgg cgacctgggc cgcctgggcg 4920 gcctgctgaa actctggctc accgacgacc cgcgcacggc gcggttcggt gatgccacga 4980 tcctcgccct gctggcgaag atcgaagaga agcaggacga gcttggcaag gtcatgatgg 5040 gcgtggtccg cccgagggca gagccatgac ttttttagcc gctaaaacgg ccggggggtg 5100 cgcgtgattg ccaagcacgt ccccatgcgc tccatcaaga agagcgactt cgcggagctg 5160 gtgaagtaca tcaccgacga gcaaggcaag accgatcggg ccccctgcag gataaaaaaa 5220 ttgtagataa attttataaa atagttttat ctacaatttt tttatcagga aacagctatg 5280 accgcggccg caaaatagtt gataataatg cagagttata aacaaaggtg aaaagcatta 5340 cttgtattct tttttatata ttattataaa ttaaaatgaa gctgtattag aaaaaataca 5400 cacctgtaat ataaaatttt aaattaattt ttaatttttt caaaatgtat tttacatgtt 5460 tagaattttg atgtatatta aaatagtaga atacataaga tacttaattt aattaaagat 5520 agttaagtac ttttcaatgt gcttttttag atgtttaata caaatcttta attgtaaaag 5580 aaatgctgta ctatttactg tactagtgac gggattaaac tgtattaatt ataaataaaa 5640 aataagtaca gttgtttaaa attatatttt gtattaaatc taatagtacg atgtaagtta 5700 ttttatacta ttgctagttt aataaaaaga tttaattata tgcttgaaaa ggagaggaat 5760 ccaatgagta aaaactttga tgagttatta tcaagattaa aggaagttcc aacaaaaaaa 5820 gtggctgtag ccgtagcaca agatgaacca gtattagagg ctataaaaga agctacagaa 5880 aataacatcg cacaagcaat attggttggt gataaacaac aaatccatga aatcgcaaag 5940 aaaataaact tggacttatc tgattatgaa ataatggata ttaaagatcc aaagaaagca 6000 acattagaag cagtaaaatt agtttctagt ggtcatgcag atatgttaat gaaaggtcta 6060 gttgatactg caacattcct aagaagcgta ttaaacaaag aggttggtct tagaacagga 6120 aaattaatgt cccatgtagc tgtgtttgat gtggaaggtt gggatagact gttattttta 6180 actgatgcag catttaatac atatccagaa tttaaggata aagttggaat gataaataat 6240 gcagttgtag ttgctcatgc atgtggaata gatgttccaa gagtagcacc tatatgccca 6300 gttgaagttg taaatacaag tatgcaatca acagttgatg cagcattgtt agctaaaatg 6360 agtgacaggg ggcaaattaa aggatgcgta attgatggac cttttgcctt agataatgca 6420 atatcagaag aagcagctca tcataaaggt gttacaggat cagtagcagg taaagctgat 6480 atattattat taccaaatat agaagcagca aatgtaatgt ataaaacatt aacatatttc 6540 tctaaatcaa gaaatggtgg acttttagta ggtacatcag caccagtaat tttaacttca 6600 agagcagatt cattcgaaac taaagttaat tcaattgctc ttgcagcatt agttgcagca 6660 agaaataagt aataaatcaa tccataataa ttaatgcata attaatggag agatttatat 6720 ggaatttgca atgcactatt agattctata ataatttctt ctgaaaatta tgcattatga 6780 ctgtatagaa tgcattaaat ttaaggggga ttcagaatgt catataagct attaataatc 6840 aatccaggtt caacatcaac aaagattggt gtttacgaag gagaaaagga actatttgaa 6900 gaaactttga gacacacaaa tgaagaaata aagagatatg atacaatata tgatcaattt 6960 gaatttagaa aagaagttat attaaatgtt cttaaagaaa agaattttga tataaagact 7020 ctaagtgcta ttgttggtag aggtggaatg cttagaccag ttgaaggtgg aacatatgca 7080 gtaaatgatg caatggttga agatttaaaa gttggagttc aaggacctca tgcttctaac 7140 cttggcggaa taattgccaa gtcaattgga gatgaattaa atattccatc atttatagta 7200 gatccagttg ttacagatga gttagcagat gtagcaagac tatctggagt accagaacta 7260 ccaagaaaaa gtaaattcca tgctttaaat caaaaagcgg tagctaaaag atatggaaaa 7320 gaaagtggac aaggatatga aaacctaaat cttgtagttg tacatatggg tggaggcgtt 7380 tcagttggtg ctcacaatca tgggaaagtt gtcgatgtaa ataatgcatt agatggagat 7440 ggcccattct caccagaaag agctggatca gttccaattg gtgatttagt taaaatgtgt 7500 tttagtggaa aatatagtga agcagaagta tatggcaagg ctgtaggaaa aggtggattt 7560 gttggttatc taaacacaaa tgatgtaaaa ggtgttattg ataagatgga agaaggagat 7620 aaagaatgtg aatcaatata caaagcattt gtttatcaaa tttcaaaagc aatcggagaa 7680 atgtcagttg tattagaagg taaagttgat caaattattt ttaccggagg aattgcatac 7740 tcaccaacac ttgttccaga ccttaaagca aaagttgaat ggatagcccc agttacagtt 7800 tatcctggag aagatgaatt acttgctcta gctcaaggtg ctataagagt acttgatgga 7860 gaagaacaag ctaaggttta ctag 7884 <210> SEQ ID NO 195 <211> LENGTH: 6624 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL82256-tesB <400> SEQUENCE: 195 gagatctcga ggcctgcaga catgcaagct tggcactggc cgtcgtttta caacgtcgtg 60 actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 120 gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 180 atggcgaatg gcgctagcat aaaaataaga agcctgcatt tgcaggcttc ttatttttat 240 ggcgcgccgt tctgaatcct tagctaatgg ttcaacaggt aactatgacg aagatagcac 300 cctggataag tctgtaatgg attctaaggc atttaatgaa gacgtgtata taaaatgtgc 360 taatgaaaaa gaaaatgcgt taaaagagcc taaaatgagt tcaaatggtt ttgaaattga 420 ttggtagttt aatttaatat attttttcta ttggctatct cgatacctat agaatcttct 480 gttcactttt gtttttgaaa tataaaaagg ggctttttag cccctttttt ttaaaactcc 540 ggaggagttt cttcattctt gatactatac gtaactattt tcgatttgac ttcattgtca 600 attaagctag taaaatcaat ggttaaaaaa caaaaaactt gcatttttct acctagtaat 660 ttataatttt aagtgtcgag tttaaaagta taatttacca ggaaaggagc aagtttttta 720 ataaggaaaa atttttcctt ttaaaattct atttcgttat atgactaatt ataatcaaaa 780 aaatgaaaat aaacaagagg taaaaactgc tttagagaaa tgtactgata aaaaaagaaa 840 aaatcctaga tttacgtcat acatagcacc tttaactact aagaaaaata ttgaaaggac 900 ttccacttgt ggagattatt tgtttatgtt gagtgatgca gacttagaac attttaaatt 960 acataaaggt aatttttgcg gtaatagatt ttgtccaatg tgtagttggc gacttgcttg 1020 taaggatagt ttagaaatat ctattcttat ggagcattta agaaaagaag aaaataaaga 1080 gtttatattt ttaactctta caactccaaa tgtaaaaagt tatgatctta attattctat 1140 taaacaatat aataaatctt ttaaaaaatt aatggagcgt aaggaagtta aggatataac 1200 taaaggttat ataagaaaat tagaagtaac ttaccaaaag gaaaaataca taacaaagga 1260 tttatggaaa ataaaaaaag attattatca aaaaaaagga cttgaaattg gtgatttaga 1320 acctaatttt gatacttata atcctcattt tcatgtagtt attgcagtta ataaaagtta 1380 ttttacagat aaaaattatt atataaatcg agaaagatgg ttggaattat ggaagtttgc 1440 tactaaggat gattctataa ctcaagttga tgttagaaaa gcaaaaatta atgattataa 1500 agaggtttac gaacttgcga aatattcagc taaagacact gattatttaa tatcgaggcc 1560 agtatttgaa attttttata aagcattaaa aggcaagcag gtattagttt ttagtggatt 1620 ttttaaagat gcacacaaat tgtacaagca aggaaaactt gatgtttata aaaagaaaga 1680 tgaaattaaa tatgtctata tagtttatta taattggtgc aaaaaacaat atgaaaaaac 1740 tagaataagg gaacttacgg aagatgaaaa agaagaatta aatcaagatt taatagatga 1800 aatagaaata gattaaagtg taactatact ttatatatat atgattaaaa aaataaaaaa 1860 caacagccta ttaggttgtt gttttttatt ttctttatta atttttttaa tttttagttt 1920 ttagttcttt tttaaaataa gtttcagcct ctttttcaat attttttaaa gaaggagtat 1980 ttgcatgaat tgcctttttt ctaacagact taggaaatat tttaacagta tcttcttgcg 2040 ccggtgattt tggaacttca taacttacta atttataatt attattttct tttttaattg 2100 taacagttgc aaaagaagct gaacctgttc cttcaactag tttatcatct tcaatataat 2160 attcttgacc tatatagtat aaatatattt ttattatatt tttacttttt tctgaatcta 2220 ttattttata atcataaaaa gttttaccac caaaagaagg ttgtactcct tctggtccaa 2280 catatttttt tactatatta tctaaataat ttttgggaac tggtgttgta atttgattaa 2340 tcgaacaacc agttatactt aaaggaatta taactataaa aatatatagg attatctttt 2400 taaatttcat tattggcctc ctttttatta aatttatgtt accataaaaa ggacataacg 2460 ggaatatgta gaatattttt aatgtagaca aaattttaca taaatataaa gaaaggaagt 2520 gtttgtttaa attttatagc aaactatcaa aaattagggg gataaaaatt tatgaaaaaa 2580 aggttttcga tgttattttt atgtttaact ttaatagttt gtggtttatt tacaaattcg 2640 gccggccgaa gcaaacttaa gagtgtgttg atagtgcagt atcttaaaat tttgtataat 2700 aggaattgaa gttaaattag atgctaaaaa tttgtaatta agaaggagtg attacatgaa 2760 caaaaatata aaatattctc aaaacttttt aacgagtgaa aaagtactca accaaataat 2820 aaaacaattg aatttaaaag aaaccgatac cgtttacgaa attggaacag gtaaagggca 2880 tttaacgacg aaactggcta aaataagtaa acaggtaacg tctattgaat tagacagtca 2940 tctattcaac ttatcgtcag aaaaattaaa actgaatact cgtgtcactt taattcacca 3000 agatattcta cagtttcaat tccctaacaa acagaggtat aaaattgttg ggagtattcc 3060 ttaccattta agcacacaaa ttattaaaaa agtggttttt gaaagccatg cgtctgacat 3120 ctatctgatt gttgaagaag gattctacaa gcgtaccttg gatattcacc gaacactagg 3180 gttgctcttg cacactcaag tctcgattca gcaattgctt aagctgccag cggaatgctt 3240 tcatcctaaa ccaaaagtaa acagtgtctt aataaaactt acccgccata ccacagatgt 3300 tccagataaa tattggaagc tatatacgta ctttgtttca aaatgggtca atcgagaata 3360 tcgtcaactg tttactaaaa atcagtttca tcaagcaatg aaacacgcca aagtaaacaa 3420 tttaagtacc gttacttatg agcaagtatt gtctattttt aatagttatc tattatttaa 3480 cgggaggaaa taattctatg agtcgctttt gtaaatttgg aaagttacac gttactaaag 3540 ggaatgtgtt taaactcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 3600 cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 3660 ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 3720 tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 3780 taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 3840 caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 3900 agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 3960 gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 4020 gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 4080 ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 4140 acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 4200 tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 4260 ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt 4320 ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga 4380 ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc agggccccct 4440 gcttcggggt cattatagcg attttttcgg tatatccatc ctttttcgca cgatatacag 4500 gattttgcca aagggttcgt gtagactttc cttggtgtat ccaacggcgt cagccgggca 4560 ggataggtga agtaggccca cccgcgagcg ggtgttcctt cttcactgtc ccttattcgc 4620 acctggcggt gctcaacggg aatcctgctc tgcgaggctg gccggctacc gccggcgtaa 4680 cagatgaggg caagcggatg gctgatgaaa ccaagccaac caggaagggc agcccaccta 4740 tcaaggtgta ctgccttcca gacgaacgaa gagcgattga ggaaaaggcg gcggcggccg 4800 gcatgagcct gtcggcctac ctgctggccg tcggccaggg ctacaaaatc acgggcgtcg 4860 tggactatga gcacgtccgc gagctggccc gcatcaatgg cgacctgggc cgcctgggcg 4920 gcctgctgaa actctggctc accgacgacc cgcgcacggc gcggttcggt gatgccacga 4980 tcctcgccct gctggcgaag atcgaagaga agcaggacga gcttggcaag gtcatgatgg 5040 gcgtggtccg cccgagggca gagccatgac ttttttagcc gctaaaacgg ccggggggtg 5100 cgcgtgattg ccaagcacgt ccccatgcgc tccatcaaga agagcgactt cgcggagctg 5160 gtgaagtaca tcaccgacga gcaaggcaag accgatcggg ccccctgcag gataaaaaaa 5220 ttgtagataa attttataaa atagttttat ctacaatttt tttatcagga aacagctatg 5280 accgcggccg caaaatagtt gataataatg cagagttata aacaaaggtg aaaagcatta 5340 cttgtattct tttttatata ttattataaa ttaaaatgaa gctgtattag aaaaaataca 5400 cacctgtaat ataaaatttt aaattaattt ttaatttttt caaaatgtat tttacatgtt 5460 tagaattttg atgtatatta aaatagtaga atacataaga tacttaattt aattaaagat 5520 agttaagtac ttttcaatgt gcttttttag atgtttaata caaatcttta attgtaaaag 5580 aaatgctgta ctatttactg tactagtgac gggattaaac tgtattaatt ataaataaaa 5640 aataagtaca gttgtttaaa attatatttt gtattaaatc taatagtacg atgtaagtta 5700 ttttatacta ttgctagttt aataaaaaga tttaattata tgcttgaaaa ggagaggaat 5760 ccaatgagtc aggcacttaa aaatttactt actttactta atcttgaaaa aatagaagaa 5820 ggtttattta gaggacagtc agaagattta ggattaagac aagtatttgg aggtcaagta 5880 gttggtcagg cactttatgc agctaaagaa actgtacctg aagaaagact tgttcatagt 5940 tttcattctt attttcttag acctggagat tctaaaaaac caattatata tgatgtagaa 6000 actcttagag atggaaattc atttagtgca agaagagttg cagctattca aaatggtaaa 6060 cctatatttt acatgacagc ttcttttcaa gcaccagaag ctggatttga acatcagaaa 6120 actatgcctt cagcacctgc tccagatgga ttaccatcag aaacacaaat agcacagagt 6180 ttagctcatt tacttcctcc agtacttaaa gataaattta tttgtgatag acctttagaa 6240 gttagaccag ttgaatttca taatcctctt aaaggacatg tagcagaacc acatagacaa 6300 gtttggataa gagctaatgg aagtgtacca gatgatctta gagttcatca gtatcttctt 6360 ggttatgcat ctgatttaaa ttttcttcct gtagctttac aaccacatgg aataggtttt 6420 cttgaacctg gaatacagat agcaactata gatcattcaa tgtggtttca tagaccattt 6480 aatcttaatg aatggcttct ttatagtgta gaatctacat cagcaagttc tgctagagga 6540 tttgttaggg gtgaatttta tactcaagat ggagtacttg ttgctagtac agtacaggaa 6600 ggtgttatga gaaatcataa ttaa 6624

1 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 195 <210> SEQ ID NO 1 <211> LENGTH: 392 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: ThlA, WP_010966157.1 <400> SEQUENCE: 1 Met Lys Glu Val Val Ile Ala Ser Ala Val Arg Thr Ala Ile Gly Ser 1 5 10 15 Tyr Gly Lys Ser Leu Lys Asp Val Pro Ala Val Asp Leu Gly Ala Thr 20 25 30 Ala Ile Lys Glu Ala Val Lys Lys Ala Gly Ile Lys Pro Glu Asp Val 35 40 45 Asn Glu Val Ile Leu Gly Asn Val Leu Gln Ala Gly Leu Gly Gln Asn 50 55 60 Pro Ala Arg Gln Ala Ser Phe Lys Ala Gly Leu Pro Val Glu Ile Pro 65 70 75 80 Ala Met Thr Ile Asn Lys Val Cys Gly Ser Gly Leu Arg Thr Val Ser 85 90 95 Leu Ala Ala Gln Ile Ile Lys Ala Gly Asp Ala Asp Val Ile Ile Ala 100 105 110 Gly Gly Met Glu Asn Met Ser Arg Ala Pro Tyr Leu Ala Asn Asn Ala 115 120 125 Arg Trp Gly Tyr Arg Met Gly Asn Ala Lys Phe Val Asp Glu Met Ile 130 135 140 Thr Asp Gly Leu Trp Asp Ala Phe Asn Asp Tyr His Met Gly Ile Thr 145 150 155 160 Ala Glu Asn Ile Ala Glu Arg Trp Asn Ile Ser Arg Glu Glu Gln Asp 165 170 175 Glu Phe Ala Leu Ala Ser Gln Lys Lys Ala Glu Glu Ala Ile Lys Ser 180 185 190 Gly Gln Phe Lys Asp Glu Ile Val Pro Val Val Ile Lys Gly Arg Lys 195 200 205 Gly Glu Thr Val Val Asp Thr Asp Glu His Pro Arg Phe Gly Ser Thr 210 215 220 Ile Glu Gly Leu Ala Lys Leu Lys Pro Ala Phe Lys Lys Asp Gly Thr 225 230 235 240 Val Thr Ala Gly Asn Ala Ser Gly Leu Asn Asp Cys Ala Ala Val Leu 245 250 255 Val Ile Met Ser Ala Glu Lys Ala Lys Glu Leu Gly Val Lys Pro Leu 260 265 270 Ala Lys Ile Val Ser Tyr Gly Ser Ala Gly Val Asp Pro Ala Ile Met 275 280 285 Gly Tyr Gly Pro Phe Tyr Ala Thr Lys Ala Ala Ile Glu Lys Ala Gly 290 295 300 Trp Thr Val Asp Glu Leu Asp Leu Ile Glu Ser Asn Glu Ala Phe Ala 305 310 315 320 Ala Gln Ser Leu Ala Val Ala Lys Asp Leu Lys Phe Asp Met Asn Lys 325 330 335 Val Asn Val Asn Gly Gly Ala Ile Ala Leu Gly His Pro Ile Gly Ala 340 345 350 Ser Gly Ala Arg Ile Leu Val Thr Leu Val His Ala Met Gln Lys Arg 355 360 365 Asp Ala Lys Lys Gly Leu Ala Thr Leu Cys Ile Gly Gly Gly Gln Gly 370 375 380 Thr Ala Ile Leu Leu Glu Lys Cys 385 390 <210> SEQ ID NO 2 <211> LENGTH: 393 <212> TYPE: PRT <213> ORGANISM: Cupriavidus necator <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: PhaA, WP_013956452.1 <400> SEQUENCE: 2 Met Thr Asp Val Val Ile Val Ser Ala Ala Arg Thr Ala Val Gly Lys 1 5 10 15 Phe Gly Gly Ser Leu Ala Lys Ile Pro Ala Pro Glu Leu Gly Ala Val 20 25 30 Val Ile Lys Ala Ala Leu Glu Arg Ala Gly Val Lys Pro Glu Gln Val 35 40 45 Ser Glu Val Ile Met Gly Gln Val Leu Thr Ala Gly Ser Gly Gln Asn 50 55 60 Pro Ala Arg Gln Ala Ala Ile Lys Ala Gly Leu Pro Ala Met Val Pro 65 70 75 80 Ala Met Thr Ile Asn Lys Val Cys Gly Ser Gly Leu Lys Ala Val Met 85 90 95 Leu Ala Ala Asn Ala Ile Met Ala Gly Asp Ala Glu Ile Val Val Ala 100 105 110 Gly Gly Gln Glu Asn Met Ser Ala Ala Pro His Val Leu Pro Gly Ser 115 120 125 Arg Asp Gly Phe Arg Met Gly Asp Ala Lys Leu Val Asp Thr Met Ile 130 135 140 Val Asp Gly Leu Trp Asp Val Tyr Asn Gln Tyr His Met Gly Ile Thr 145 150 155 160 Ala Glu Asn Val Ala Lys Glu Tyr Gly Ile Thr Arg Glu Ala Gln Asp 165 170 175 Glu Leu Ala Val Gly Ser Gln Asn Lys Ala Glu Ala Ala Gln Lys Ala 180 185 190 Gly Lys Phe Asp Glu Glu Ile Val Pro Val Leu Ile Pro Gln Arg Lys 195 200 205 Gly Asp Pro Val Ala Phe Lys Thr Asp Glu Phe Val Arg Gln Gly Ala 210 215 220 Thr Leu Asp Ser Met Ser Gly Leu Lys Pro Ala Phe Asp Lys Ala Gly 225 230 235 240 Thr Val Thr Ala Ala Asn Ala Ser Gly Leu Asn Asp Gly Ala Ala Ala 245 250 255 Val Val Val Met Ser Ala Ala Lys Ala Lys Glu Leu Gly Leu Thr Pro 260 265 270 Leu Ala Thr Ile Lys Ser Tyr Ala Asn Ala Gly Val Asp Pro Lys Val 275 280 285 Met Gly Met Gly Pro Val Pro Ala Ser Lys Arg Ala Leu Ser Arg Ala 290 295 300 Glu Trp Thr Pro Gln Asp Leu Asp Leu Met Glu Ile Asn Glu Ala Phe 305 310 315 320 Ala Ala Gln Ala Leu Ala Val His Gln Gln Met Gly Trp Asp Thr Ser 325 330 335 Lys Val Asn Val Asn Gly Gly Ala Ile Ala Ile Gly His Pro Ile Gly 340 345 350 Ala Ser Gly Cys Arg Ile Leu Val Thr Leu Leu His Glu Met Lys Arg 355 360 365 Arg Asp Ala Lys Lys Gly Leu Ala Ser Leu Cys Ile Gly Gly Gly Met 370 375 380 Gly Val Ala Leu Ala Val Glu Arg Lys 385 390 <210> SEQ ID NO 3 <211> LENGTH: 394 <212> TYPE: PRT <213> ORGANISM: Cupriavidus necator <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: BktB, WP_011615089.1 <400> SEQUENCE: 3 Met Thr Arg Glu Val Val Val Val Ser Gly Val Arg Thr Ala Ile Gly 1 5 10 15 Thr Phe Gly Gly Ser Leu Lys Asp Val Ala Pro Ala Glu Leu Gly Ala 20 25 30 Leu Val Val Arg Glu Ala Leu Ala Arg Ala Gln Val Ser Gly Asp Asp 35 40 45 Val Gly His Val Val Phe Gly Asn Val Ile Gln Thr Glu Pro Arg Asp 50 55 60 Met Tyr Leu Gly Arg Val Ala Ala Val Asn Gly Gly Val Thr Ile Asn 65 70 75 80 Ala Pro Ala Leu Thr Val Asn Arg Leu Cys Gly Ser Gly Leu Gln Ala 85 90 95 Ile Val Ser Ala Ala Gln Thr Ile Leu Leu Gly Asp Thr Asp Val Ala 100 105 110 Ile Gly Gly Gly Ala Glu Ser Met Ser Arg Ala Pro Tyr Leu Ala Pro 115 120 125 Ala Ala Arg Trp Gly Ala Arg Met Gly Asp Ala Gly Leu Val Asp Met 130 135 140 Met Leu Gly Ala Leu His Asp Pro Phe His Arg Ile His Met Gly Val 145 150 155 160 Thr Ala Glu Asn Val Ala Lys Glu Tyr Asp Ile Ser Arg Ala Gln Gln 165 170 175 Asp Glu Ala Ala Leu Glu Ser His Arg Arg Ala Ser Ala Ala Ile Lys 180 185 190 Ala Gly Tyr Phe Lys Asp Gln Ile Val Pro Val Val Ser Lys Gly Arg 195 200 205 Lys Gly Asp Val Thr Phe Asp Thr Asp Glu His Val Arg His Asp Ala 210 215 220 Thr Ile Asp Asp Met Thr Lys Leu Arg Pro Val Phe Val Lys Glu Asn 225 230 235 240 Gly Thr Val Thr Ala Gly Asn Ala Ser Gly Leu Asn Asp Ala Ala Ala 245 250 255 Ala Val Val Met Met Glu Arg Ala Glu Ala Glu Arg Arg Gly Leu Lys 260 265 270 Pro Leu Ala Arg Leu Val Ser Tyr Gly His Ala Gly Val Asp Pro Lys 275 280 285 Ala Met Gly Ile Gly Pro Val Pro Ala Thr Lys Ile Ala Leu Glu Arg 290 295 300 Ala Gly Leu Gln Val Ser Asp Leu Asp Val Ile Glu Ala Asn Glu Ala 305 310 315 320 Phe Ala Ala Gln Ala Cys Ala Val Thr Lys Ala Leu Gly Leu Asp Pro 325 330 335

Ala Lys Val Asn Pro Asn Gly Ser Gly Ile Ser Leu Gly His Pro Ile 340 345 350 Gly Ala Thr Gly Ala Leu Ile Thr Val Lys Ala Leu His Glu Leu Asn 355 360 365 Arg Val Gln Gly Arg Tyr Ala Leu Val Thr Met Cys Ile Gly Gly Gly 370 375 380 Gln Gly Ile Ala Ala Ile Phe Glu Arg Ile 385 390 <210> SEQ ID NO 4 <211> LENGTH: 394 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AtoB, NP_416728.1 <400> SEQUENCE: 4 Met Lys Asn Cys Val Ile Val Ser Ala Val Arg Thr Ala Ile Gly Ser 1 5 10 15 Phe Asn Gly Ser Leu Ala Ser Thr Ser Ala Ile Asp Leu Gly Ala Thr 20 25 30 Val Ile Lys Ala Ala Ile Glu Arg Ala Lys Ile Asp Ser Gln His Val 35 40 45 Asp Glu Val Ile Met Gly Asn Val Leu Gln Ala Gly Leu Gly Gln Asn 50 55 60 Pro Ala Arg Gln Ala Leu Leu Lys Ser Gly Leu Ala Glu Thr Val Cys 65 70 75 80 Gly Phe Thr Val Asn Lys Val Cys Gly Ser Gly Leu Lys Ser Val Ala 85 90 95 Leu Ala Ala Gln Ala Ile Gln Ala Gly Gln Ala Gln Ser Ile Val Ala 100 105 110 Gly Gly Met Glu Asn Met Ser Leu Ala Pro Tyr Leu Leu Asp Ala Lys 115 120 125 Ala Arg Ser Gly Tyr Arg Leu Gly Asp Gly Gln Val Tyr Asp Val Ile 130 135 140 Leu Arg Asp Gly Leu Met Cys Ala Thr His Gly Tyr His Met Gly Ile 145 150 155 160 Thr Ala Glu Asn Val Ala Lys Glu Tyr Gly Ile Thr Arg Glu Met Gln 165 170 175 Asp Glu Leu Ala Leu His Ser Gln Arg Lys Ala Ala Ala Ala Ile Glu 180 185 190 Ser Gly Ala Phe Thr Ala Glu Ile Val Pro Val Asn Val Val Thr Arg 195 200 205 Lys Lys Thr Phe Val Phe Ser Gln Asp Glu Phe Pro Lys Ala Asn Ser 210 215 220 Thr Ala Glu Ala Leu Gly Ala Leu Arg Pro Ala Phe Asp Lys Ala Gly 225 230 235 240 Thr Val Thr Ala Gly Asn Ala Ser Gly Ile Asn Asp Gly Ala Ala Ala 245 250 255 Leu Val Ile Met Glu Glu Ser Ala Ala Leu Ala Ala Gly Leu Thr Pro 260 265 270 Leu Ala Arg Ile Lys Ser Tyr Ala Ser Gly Gly Val Pro Pro Ala Leu 275 280 285 Met Gly Met Gly Pro Val Pro Ala Thr Gln Lys Ala Leu Gln Leu Ala 290 295 300 Gly Leu Gln Leu Ala Asp Ile Asp Leu Ile Glu Ala Asn Glu Ala Phe 305 310 315 320 Ala Ala Gln Phe Leu Ala Val Gly Lys Asn Leu Gly Phe Asp Ser Glu 325 330 335 Lys Val Asn Val Asn Gly Gly Ala Ile Ala Leu Gly His Pro Ile Gly 340 345 350 Ala Ser Gly Ala Arg Ile Leu Val Thr Leu Leu His Ala Met Gln Ala 355 360 365 Arg Asp Lys Thr Leu Gly Leu Ala Thr Leu Cys Ile Gly Gly Gly Gln 370 375 380 Gly Ile Ala Met Val Ile Glu Arg Leu Asn 385 390 <210> SEQ ID NO 5 <211> LENGTH: 217 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: CtfA, WP_012059996.1 <400> SEQUENCE: 5 Met Asn Lys Leu Val Lys Leu Thr Asp Leu Lys Arg Ile Phe Lys Asp 1 5 10 15 Gly Met Thr Ile Met Val Gly Gly Phe Leu Asp Cys Gly Thr Pro Glu 20 25 30 Asn Ile Ile Asp Met Leu Val Asp Leu Asn Ile Lys Asn Leu Thr Ile 35 40 45 Ile Ser Asn Asp Thr Ala Phe Pro Asn Lys Gly Ile Gly Lys Leu Ile 50 55 60 Val Asn Gly Gln Val Ser Lys Val Ile Ala Ser His Ile Gly Thr Asn 65 70 75 80 Pro Glu Thr Gly Lys Lys Met Ser Ser Gly Glu Leu Lys Val Glu Leu 85 90 95 Ser Pro Gln Gly Thr Leu Ile Glu Arg Ile Arg Ala Ala Gly Ser Gly 100 105 110 Leu Gly Gly Val Leu Thr Pro Thr Gly Leu Gly Thr Ile Val Glu Glu 115 120 125 Gly Lys Lys Lys Val Thr Ile Asp Gly Lys Glu Tyr Leu Leu Glu Leu 130 135 140 Pro Leu Ser Ala Asp Val Ser Leu Ile Lys Gly Ser Ile Val Asp Glu 145 150 155 160 Phe Gly Asn Thr Phe Tyr Arg Ala Ala Thr Lys Asn Phe Asn Pro Tyr 165 170 175 Met Ala Met Ala Ala Lys Thr Val Ile Val Glu Ala Glu Asn Leu Val 180 185 190 Lys Cys Glu Asp Leu Lys Arg Asp Ala Ile Met Thr Pro Gly Val Leu 195 200 205 Val Asp Tyr Ile Val Lys Glu Ala Ala 210 215 <210> SEQ ID NO 6 <211> LENGTH: 221 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: CtfB, WP_012059997.1 <400> SEQUENCE: 6 Met Ile Val Asp Lys Val Leu Ala Lys Glu Ile Ile Ala Lys Arg Val 1 5 10 15 Ala Lys Glu Leu Lys Lys Asp Gln Leu Val Asn Leu Gly Ile Gly Leu 20 25 30 Pro Thr Leu Val Ala Asn Tyr Val Pro Lys Glu Met Asn Ile Thr Phe 35 40 45 Glu Ser Glu Asn Gly Met Val Gly Met Ala Gln Met Ala Ser Ser Gly 50 55 60 Glu Asn Asp Pro Asp Ile Ile Asn Ala Gly Gly Glu Tyr Val Thr Leu 65 70 75 80 Leu Pro Gln Gly Ser Phe Phe Asp Ser Ser Met Ser Phe Ala Leu Ile 85 90 95 Arg Gly Gly His Val Asp Val Ala Val Leu Gly Ala Leu Glu Val Asp 100 105 110 Glu Lys Gly Asn Leu Ala Asn Trp Ile Val Pro Asn Lys Ile Val Pro 115 120 125 Gly Met Gly Gly Ala Met Asp Leu Ala Ile Gly Ala Lys Lys Ile Ile 130 135 140 Val Ala Met Gln His Thr Gly Lys Ser Lys Pro Lys Ile Val Lys Lys 145 150 155 160 Cys Thr Leu Pro Leu Thr Ala Lys Ala Gln Val Asp Leu Ile Val Thr 165 170 175 Glu Leu Cys Val Ile Asp Val Thr Asn Asp Gly Leu Leu Leu Lys Glu 180 185 190 Ile His Lys Asp Thr Thr Ile Asp Glu Ile Lys Phe Leu Thr Asp Ala 195 200 205 Asp Leu Ile Ile Pro Asp Asn Leu Lys Ile Met Asp Ile 210 215 220 <210> SEQ ID NO 7 <211> LENGTH: 286 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: TesB, NP_414986.1 <400> SEQUENCE: 7 Met Ser Gln Ala Leu Lys Asn Leu Leu Thr Leu Leu Asn Leu Glu Lys 1 5 10 15 Ile Glu Glu Gly Leu Phe Arg Gly Gln Ser Glu Asp Leu Gly Leu Arg 20 25 30 Gln Val Phe Gly Gly Gln Val Val Gly Gln Ala Leu Tyr Ala Ala Lys 35 40 45 Glu Thr Val Pro Glu Glu Arg Leu Val His Ser Phe His Ser Tyr Phe 50 55 60 Leu Arg Pro Gly Asp Ser Lys Lys Pro Ile Ile Tyr Asp Val Glu Thr 65 70 75 80 Leu Arg Asp Gly Asn Ser Phe Ser Ala Arg Arg Val Ala Ala Ile Gln 85 90 95 Asn Gly Lys Pro Ile Phe Tyr Met Thr Ala Ser Phe Gln Ala Pro Glu 100 105 110 Ala Gly Phe Glu His Gln Lys Thr Met Pro Ser Ala Pro Ala Pro Asp 115 120 125 Gly Leu Pro Ser Glu Thr Gln Ile Ala Gln Ser Leu Ala His Leu Leu 130 135 140 Pro Pro Val Leu Lys Asp Lys Phe Ile Cys Asp Arg Pro Leu Glu Val 145 150 155 160 Arg Pro Val Glu Phe His Asn Pro Leu Lys Gly His Val Ala Glu Pro 165 170 175 His Arg Gln Val Trp Ile Arg Ala Asn Gly Ser Val Pro Asp Asp Leu 180 185 190 Arg Val His Gln Tyr Leu Leu Gly Tyr Ala Ser Asp Leu Asn Phe Leu

195 200 205 Pro Val Ala Leu Gln Pro His Gly Ile Gly Phe Leu Glu Pro Gly Ile 210 215 220 Gln Ile Ala Thr Ile Asp His Ser Met Trp Phe His Arg Pro Phe Asn 225 230 235 240 Leu Asn Glu Trp Leu Leu Tyr Ser Val Glu Ser Thr Ser Ala Ser Ser 245 250 255 Ala Arg Gly Phe Val Arg Gly Glu Phe Tyr Thr Gln Asp Gly Val Leu 260 265 270 Val Ala Ser Thr Val Gln Glu Gly Val Met Arg Asn His Asn 275 280 285 <210> SEQ ID NO 8 <211> LENGTH: 436 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 1, AGY74947.1 <400> SEQUENCE: 8 Met Asn Asn Asp Asn Cys Thr Ile Lys Ile Thr Pro Glu Val Ser Arg 1 5 10 15 Val Asp Glu Pro Val Asp Ile Lys Ile Asn Gly Leu Pro Lys Asn Glu 20 25 30 Lys Val Ile Ile Arg Ala Val Ser Ser Asp Tyr Tyr Cys Ile Asn Ala 35 40 45 Ser Ile Leu Glu Ile Gly Asp Asn Thr Leu Trp Glu Ser Tyr Ala Val 50 55 60 Phe Glu Thr Asp Glu Cys Gly Asn Ile Asn Phe Glu Asn Ala Val Pro 65 70 75 80 Val Asp Gly Thr Tyr Ser Asn Cys Asp Lys Met Gly Leu Phe Tyr Ser 85 90 95 Met Arg Pro Lys Gln Ile Arg Lys Ser Lys Leu Ile Gln Lys Leu Ser 100 105 110 Ser Ile Asn Glu Asn Arg Lys Tyr Lys Ile Thr Phe Thr Val Glu Lys 115 120 125 Asn Gly Lys Ile Ile Gly Ser Lys Glu His Thr Arg Val Tyr Cys Asp 130 135 140 Asp Thr Ile Lys Ser Ile Asp Val Val Glu Lys Asn Leu Leu Ala Arg 145 150 155 160 Tyr Phe Thr Ser Lys Asp Asn Ile Lys His Pro Ala Ile Ile Val Leu 165 170 175 Ser Gly Ser Asp Gly Arg Ile Glu Lys Ala Gln Ala Ile Ala Glu Leu 180 185 190 Phe Ala Met Arg Gly Tyr Ser Ala Leu Ala Val Cys Tyr Phe Gly Leu 195 200 205 Glu Gly Thr Pro Glu Asp Leu Asn Met Ile Pro Leu Glu Tyr Val Glu 210 215 220 Asn Ala Val Lys Trp Leu Lys Arg Gln Asp Thr Val Asp Glu Asn Lys 225 230 235 240 Ile Ala Ile Tyr Gly Arg Ser Lys Gly Gly Glu Leu Val Leu Leu Ala 245 250 255 Ala Ser Met Phe Lys Asp Ile Ala Cys Val Ile Ala Asn Thr Pro Ser 260 265 270 Cys Tyr Val Tyr Glu Gly Ile Lys Ser Asn Lys Leu Pro Ser His His 275 280 285 Ser Ser Trp Met Tyr Arg Gly Arg Glu Ile Pro Tyr Leu Lys Phe Asn 290 295 300 Phe His Ile Ile Leu Arg Leu Ile Ile Lys Met Met Lys Lys Glu Lys 305 310 315 320 Gly Ala Leu Ala Trp Met Tyr Lys Lys Leu Ile Glu Glu Gly Asp Arg 325 330 335 Asp Lys Ala Thr Ile Ala Leu Asp Lys Ile Asn Gly Ser Val Leu Met 340 345 350 Ile Ser Ser Ala Ala Asp Glu Ile Trp Pro Ser Lys Met His Ser Glu 355 360 365 Thr Val Cys Ser Ile Phe Glu Lys Ser His Phe Lys His Glu Tyr Lys 370 375 380 His Ile Thr Phe Ala Lys Ser Gly His Ile Leu Thr Val Pro Phe Gln 385 390 395 400 Ser Ile Tyr Pro Ser Glu Lys Tyr Pro Tyr Asp Val Glu Ser Trp Ala 405 410 415 Lys Ala Asn Met Asp Ser Trp Asn Glu Thr Ile Lys Phe Leu Glu Lys 420 425 430 Trp Ala Ser Lys 435 <210> SEQ ID NO 9 <211> LENGTH: 137 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 2, AGY75747.1 <400> SEQUENCE: 9 Met Tyr Ile Asn Glu Thr Lys Val Val Val Arg Tyr Ala Glu Thr Asp 1 5 10 15 Lys Met Gly Ile Val His His Ser Asn Tyr Tyr Ile Tyr Phe Glu Glu 20 25 30 Ala Arg Thr Gln Phe Ile Lys Lys Thr Gly Ile Ser Tyr Ser Gln Met 35 40 45 Glu Lys Asp Gly Ile Met Phe Pro Leu Val Glu Ser Asn Cys Arg Tyr 50 55 60 Leu Gln Gly Ala Lys Tyr Glu Asp Glu Leu Leu Ile Lys Thr Trp Ile 65 70 75 80 Lys Glu Leu Thr Pro Val Lys Ala Glu Phe Asn Tyr Ser Val Ile Arg 85 90 95 Glu Asn Asp Gln Lys Glu Ile Ala Lys Gly Ser Thr Leu His Ala Phe 100 105 110 Val Asn Asn Asn Phe Lys Ile Ile Asn Leu Lys Lys Asn His Thr Glu 115 120 125 Leu Phe Lys Lys Leu Gln Ser Leu Ile 130 135 <210> SEQ ID NO 10 <211> LENGTH: 128 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 3, AGY75999.1 <400> SEQUENCE: 10 Met Asp Phe Ser Lys Leu Phe Lys Val Gly Ser Thr Tyr Val Ser Glu 1 5 10 15 Tyr Ile Val Lys Pro Glu Asp Thr Ala Asn Phe Ile Gly Asn Asn Gly 20 25 30 Val Val Met Leu Ser Thr Pro Ala Met Ile Lys Tyr Met Glu Tyr Thr 35 40 45 Thr Leu His Ile Val Asp Asn Val Ile Pro Lys Asn Tyr Arg Pro Val 50 55 60 Gly Thr Lys Ile Asp Val Glu His Ile Lys Pro Ile Pro Ala Asn Met 65 70 75 80 Lys Val Val Val Lys Val Thr Leu Ile Ser Ile Glu Gly Lys Lys Leu 85 90 95 Arg Tyr Asn Val Glu Ala Phe Asn Glu Lys Asn Cys Lys Val Gly Phe 100 105 110 Gly Ile Tyr Glu Gln Gln Ile Val Asn Leu Glu Gln Phe Leu Asn Arg 115 120 125 <210> SEQ ID NO 11 <211> LENGTH: 436 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 1, ADK15695.1 <400> SEQUENCE: 11 Met Asn Asn Asp Asn Cys Thr Ile Lys Ile Thr Pro Glu Val Ser Arg 1 5 10 15 Val Asp Glu Pro Val Asp Ile Lys Ile Asn Gly Leu Pro Lys Asn Glu 20 25 30 Lys Val Ile Ile Arg Ala Val Ser Ser Asp Tyr Tyr Cys Ile Asn Ala 35 40 45 Ser Ile Leu Glu Ile Gly Asp Asn Thr Leu Trp Glu Ser Tyr Ala Val 50 55 60 Phe Glu Thr Asp Glu Cys Gly Asn Ile Asn Phe Glu Asn Ala Val Pro 65 70 75 80 Val Asp Gly Thr Tyr Ser Asn Cys Asp Lys Met Gly Leu Phe Tyr Ser 85 90 95 Met Arg Pro Lys Gln Ile Arg Lys Ser Lys Leu Ile Gln Lys Leu Ser 100 105 110 Ser Ile Asn Glu Asn Arg Lys Tyr Lys Ile Thr Phe Thr Val Glu Lys 115 120 125 Asn Gly Lys Ile Ile Gly Ser Lys Glu His Thr Arg Val Tyr Cys Asp 130 135 140 Asp Thr Ile Lys Ser Ile Asp Val Val Glu Lys Asn Leu Leu Ala Arg 145 150 155 160 Tyr Phe Thr Ser Lys Asp Asn Ile Lys His Pro Ala Ile Ile Val Leu 165 170 175 Ser Gly Ser Asp Gly Arg Ile Glu Lys Ala Gln Ala Ile Ala Glu Leu 180 185 190 Phe Ala Met Arg Gly Tyr Ser Ala Leu Ala Val Cys Tyr Phe Gly Leu 195 200 205 Glu Gly Thr Pro Glu Asp Leu Asn Met Ile Pro Leu Glu Tyr Val Glu 210 215 220 Asn Ala Val Lys Trp Leu Lys Arg Gln Asp Thr Val Asp Glu Asn Lys 225 230 235 240 Ile Ala Ile Tyr Gly Arg Ser Lys Gly Gly Glu Leu Val Leu Leu Ala 245 250 255 Ala Ser Met Phe Lys Asp Ile Ala Cys Val Ile Ala Asn Thr Pro Ser 260 265 270 Cys Tyr Val Tyr Glu Gly Ile Lys Ser Asn Lys Leu Pro Ser His His 275 280 285 Ser Ser Trp Met Tyr Arg Gly Arg Glu Ile Pro Tyr Leu Lys Phe Asn 290 295 300

Phe His Ile Ile Leu Arg Leu Ile Ile Lys Met Met Lys Lys Glu Lys 305 310 315 320 Gly Ala Leu Ala Trp Met Tyr Lys Lys Leu Ile Glu Glu Gly Asp Arg 325 330 335 Asp Lys Ala Thr Ile Ala Leu Asp Lys Ile Asn Gly Ser Val Leu Met 340 345 350 Ile Ser Ser Ala Ala Asp Glu Ile Trp Pro Ser Lys Met His Ser Glu 355 360 365 Thr Val Cys Ser Ile Phe Glu Lys Ser His Phe Lys His Glu Tyr Lys 370 375 380 His Ile Thr Phe Ala Lys Ser Gly His Ile Leu Thr Val Pro Phe Gln 385 390 395 400 Ser Ile Tyr Pro Ser Glu Lys Tyr Pro Tyr Asp Val Glu Ser Trp Ala 405 410 415 Lys Ala Asn Met Asp Ser Trp Asn Glu Thr Ile Lys Phe Leu Glu Lys 420 425 430 Trp Ala Ser Lys 435 <210> SEQ ID NO 12 <211> LENGTH: 137 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 2, ADK16655.1 <400> SEQUENCE: 12 Met Tyr Ile Asn Glu Thr Lys Val Val Val Arg Tyr Ala Glu Thr Asp 1 5 10 15 Lys Met Gly Ile Val His His Ser Asn Tyr Tyr Ile Tyr Phe Glu Glu 20 25 30 Ala Arg Thr Gln Phe Ile Lys Lys Thr Gly Ile Ser Tyr Ser Gln Met 35 40 45 Glu Lys Asp Gly Ile Met Phe Pro Leu Val Glu Ser Asn Cys Arg Tyr 50 55 60 Leu Gln Gly Ala Lys Tyr Glu Asp Glu Leu Leu Ile Lys Thr Trp Ile 65 70 75 80 Lys Glu Leu Thr Pro Val Lys Ala Glu Phe Asn Tyr Ser Val Ile Arg 85 90 95 Glu Asn Asp Gln Lys Glu Ile Ala Lys Gly Ser Thr Leu His Ala Phe 100 105 110 Val Asn Asn Asn Phe Lys Ile Ile Asn Leu Lys Lys Asn His Thr Glu 115 120 125 Leu Phe Lys Lys Leu Gln Ser Leu Ile 130 135 <210> SEQ ID NO 13 <211> LENGTH: 128 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: putative thioesterase 3, ADK16959.1 <400> SEQUENCE: 13 Met Asp Phe Ser Lys Leu Phe Lys Val Gly Ser Thr Tyr Val Ser Glu 1 5 10 15 Tyr Ile Val Lys Pro Glu Asp Thr Ala Asn Phe Ile Gly Asn Asn Gly 20 25 30 Val Val Met Leu Ser Thr Pro Ala Met Ile Lys Tyr Met Glu Tyr Thr 35 40 45 Thr Leu His Ile Val Asp Asn Val Ile Pro Lys Asn Tyr Arg Pro Val 50 55 60 Gly Thr Lys Ile Asp Val Glu His Ile Lys Pro Ile Pro Ala Asn Met 65 70 75 80 Lys Val Val Val Lys Val Thr Leu Ile Ser Ile Glu Gly Lys Lys Leu 85 90 95 Arg Tyr Asn Val Glu Ala Phe Asn Glu Lys Asn Cys Lys Val Gly Phe 100 105 110 Gly Ile Tyr Glu Gln Gln Ile Val Asn Leu Glu Gln Phe Leu Asn Arg 115 120 125 <210> SEQ ID NO 14 <211> LENGTH: 246 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Adc, WP_012059998.1 <400> SEQUENCE: 14 Met Leu Glu Ser Glu Val Ser Lys Gln Ile Thr Thr Pro Leu Ala Ala 1 5 10 15 Pro Ala Phe Pro Arg Gly Pro Tyr Arg Phe His Asn Arg Glu Tyr Leu 20 25 30 Asn Ile Ile Tyr Arg Thr Asp Leu Asp Ala Leu Arg Lys Ile Val Pro 35 40 45 Glu Pro Leu Glu Leu Asp Arg Ala Tyr Val Arg Phe Glu Met Met Ala 50 55 60 Met Pro Asp Thr Thr Gly Leu Gly Ser Tyr Thr Glu Cys Gly Gln Ala 65 70 75 80 Ile Pro Val Lys Tyr Asn Gly Val Lys Gly Asp Tyr Leu His Met Met 85 90 95 Tyr Leu Asp Asn Glu Pro Ala Ile Ala Val Gly Arg Glu Ser Ser Ala 100 105 110 Tyr Pro Lys Lys Leu Gly Tyr Pro Lys Leu Phe Val Asp Ser Asp Thr 115 120 125 Leu Val Gly Thr Leu Lys Tyr Gly Thr Leu Pro Val Ala Thr Ala Thr 130 135 140 Met Gly Tyr Lys His Glu Pro Leu Asp Leu Lys Glu Ala Tyr Ala Gln 145 150 155 160 Ile Ala Arg Pro Asn Phe Met Leu Lys Ile Ile Gln Gly Tyr Asp Gly 165 170 175 Lys Pro Arg Ile Cys Glu Leu Ile Cys Ala Glu Asn Thr Asp Ile Thr 180 185 190 Ile His Gly Ala Trp Thr Gly Ser Ala Arg Leu Gln Leu Phe Ser His 195 200 205 Ala Leu Ala Pro Leu Ala Asp Leu Pro Val Leu Glu Ile Val Ser Ala 210 215 220 Ser His Ile Leu Thr Asp Leu Thr Leu Gly Thr Pro Lys Val Val His 225 230 235 240 Asp Tyr Leu Ser Val Lys 245 <210> SEQ ID NO 15 <211> LENGTH: 548 <212> TYPE: PRT <213> ORGANISM: Lactococcus lactis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: KivD <400> SEQUENCE: 15 Met Tyr Thr Val Gly Asp Tyr Leu Leu Asp Arg Leu His Glu Leu Gly 1 5 10 15 Ile Glu Glu Ile Phe Gly Val Pro Gly Asp Tyr Asn Leu Gln Phe Leu 20 25 30 Asp Gln Ile Ile Ser His Lys Asp Met Lys Trp Val Gly Asn Ala Asn 35 40 45 Glu Leu Asn Ala Ser Tyr Met Ala Asp Gly Tyr Ala Arg Thr Lys Lys 50 55 60 Ala Ala Ala Phe Leu Thr Thr Phe Gly Val Gly Glu Leu Ser Ala Val 65 70 75 80 Asn Gly Leu Ala Gly Ser Tyr Ala Glu Asn Leu Pro Val Val Glu Ile 85 90 95 Val Gly Ser Pro Thr Ser Lys Val Gln Asn Glu Gly Lys Phe Val His 100 105 110 His Thr Leu Ala Asp Gly Asp Phe Lys His Phe Met Lys Met His Glu 115 120 125 Pro Val Thr Ala Ala Arg Thr Leu Leu Thr Ala Glu Asn Ala Thr Val 130 135 140 Glu Ile Asp Arg Val Leu Ser Ala Leu Leu Lys Glu Arg Lys Pro Val 145 150 155 160 Tyr Ile Asn Leu Pro Val Asp Val Ala Ala Ala Lys Ala Glu Lys Pro 165 170 175 Ser Leu Pro Leu Lys Lys Glu Asn Ser Thr Ser Asn Thr Ser Asp Gln 180 185 190 Glu Ile Leu Asn Lys Ile Gln Glu Ser Leu Lys Asn Ala Lys Lys Pro 195 200 205 Ile Val Ile Thr Gly His Glu Ile Ile Ser Phe Gly Leu Glu Lys Thr 210 215 220 Val Thr Gln Phe Ile Ser Lys Thr Lys Leu Pro Ile Thr Thr Leu Asn 225 230 235 240 Phe Gly Lys Ser Ser Val Asp Glu Ala Leu Pro Ser Phe Leu Gly Ile 245 250 255 Tyr Asn Gly Thr Leu Ser Glu Pro Asn Leu Lys Glu Phe Val Glu Ser 260 265 270 Ala Asp Phe Ile Leu Met Leu Gly Val Lys Leu Thr Asp Ser Ser Thr 275 280 285 Gly Ala Phe Thr His His Leu Asn Glu Asn Lys Met Ile Ser Leu Asn 290 295 300 Ile Asp Glu Gly Lys Ile Phe Asn Glu Arg Ile Gln Asn Phe Asp Phe 305 310 315 320 Glu Ser Leu Ile Ser Ser Leu Leu Asp Leu Ser Glu Ile Glu Tyr Lys 325 330 335 Gly Lys Tyr Ile Asp Lys Lys Gln Glu Asp Phe Val Pro Ser Asn Ala 340 345 350 Leu Leu Ser Gln Asp Arg Leu Trp Gln Ala Val Glu Asn Leu Thr Gln 355 360 365 Ser Asn Glu Thr Ile Val Ala Glu Gln Gly Thr Ser Phe Phe Gly Ala 370 375 380 Ser Ser Ile Phe Leu Lys Ser Lys Ser His Phe Ile Gly Gln Pro Leu 385 390 395 400 Trp Gly Ser Ile Gly Tyr Thr Phe Pro Ala Ala Leu Gly Ser Gln Ile 405 410 415 Ala Asp Lys Glu Ser Arg His Leu Leu Phe Ile Gly Asp Gly Ser Leu 420 425 430

Gln Leu Thr Val Gln Glu Leu Gly Leu Ala Ile Arg Glu Lys Ile Asn 435 440 445 Pro Ile Cys Phe Ile Ile Asn Asn Asp Gly Tyr Thr Val Glu Arg Glu 450 455 460 Ile His Gly Pro Asn Gln Ser Tyr Asn Asp Ile Pro Met Trp Asn Tyr 465 470 475 480 Ser Lys Leu Pro Glu Ser Phe Gly Ala Thr Glu Asp Arg Val Val Ser 485 490 495 Lys Ile Val Arg Thr Glu Asn Glu Phe Val Ser Val Met Lys Glu Ala 500 505 510 Gln Ala Asp Pro Asn Arg Met Tyr Trp Ile Glu Leu Ile Leu Ala Lys 515 520 525 Glu Gly Ala Pro Lys Val Leu Lys Lys Met Gly Lys Leu Phe Ala Glu 530 535 540 Gln Asn Lys Ser 545 <210> SEQ ID NO 16 <211> LENGTH: 351 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: SecAdh, AGY74782.1 <400> SEQUENCE: 16 Met Lys Gly Phe Ala Met Leu Gly Ile Asn Lys Leu Gly Trp Ile Glu 1 5 10 15 Lys Lys Asn Pro Val Pro Gly Pro Tyr Asp Ala Ile Val His Pro Leu 20 25 30 Ala Val Ser Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala 35 40 45 Leu Gly Asn Arg Glu Asn Met Ile Leu Gly His Glu Ala Val Gly Glu 50 55 60 Ile Ala Glu Val Gly Ser Glu Val Lys Asp Phe Lys Val Gly Asp Arg 65 70 75 80 Val Ile Val Pro Cys Thr Thr Pro Asp Trp Arg Ser Leu Glu Val Gln 85 90 95 Ala Gly Phe Gln Gln His Ser Asn Gly Met Leu Ala Gly Trp Lys Phe 100 105 110 Ser Asn Phe Lys Asp Gly Val Phe Ala Asp Tyr Phe His Val Asn Asp 115 120 125 Ala Asp Met Asn Leu Ala Ile Leu Pro Asp Glu Ile Pro Leu Glu Ser 130 135 140 Ala Val Met Met Thr Asp Met Met Thr Thr Gly Phe His Gly Ala Glu 145 150 155 160 Leu Ala Asp Ile Lys Met Gly Ser Ser Val Val Val Ile Gly Ile Gly 165 170 175 Ala Val Gly Leu Met Gly Ile Ala Gly Ser Lys Leu Arg Gly Ala Gly 180 185 190 Arg Ile Ile Gly Val Gly Ser Arg Pro Val Cys Val Glu Thr Ala Lys 195 200 205 Phe Tyr Gly Ala Thr Asp Ile Val Asn Tyr Lys Asn Gly Asp Ile Val 210 215 220 Glu Gln Ile Met Asp Leu Thr His Gly Lys Gly Val Asp Arg Val Ile 225 230 235 240 Met Ala Gly Gly Gly Ala Glu Thr Leu Ala Gln Ala Val Thr Met Val 245 250 255 Lys Pro Gly Gly Val Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp 260 265 270 Thr Leu Pro Ile Pro Arg Val Gln Trp Gly Cys Gly Met Ala His Lys 275 280 285 Thr Ile Arg Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Met 290 295 300 Leu Arg Asp Leu Val Leu Tyr Lys Arg Val Asp Leu Ser Lys Leu Val 305 310 315 320 Thr His Val Phe Asp Gly Ala Glu Asn Ile Glu Lys Ala Leu Leu Leu 325 330 335 Met Lys Asn Lys Pro Lys Asp Leu Ile Lys Ser Val Val Thr Phe 340 345 350 <210> SEQ ID NO 17 <211> LENGTH: 351 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: SecAdh, ADK15544.1 <400> SEQUENCE: 17 Met Lys Gly Phe Ala Met Leu Gly Ile Asn Lys Leu Gly Trp Ile Glu 1 5 10 15 Lys Lys Asn Pro Val Pro Gly Pro Tyr Asp Ala Ile Val His Pro Leu 20 25 30 Ala Val Ser Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala 35 40 45 Leu Gly Asn Arg Glu Asn Met Ile Leu Gly His Glu Ala Val Gly Glu 50 55 60 Ile Ala Glu Val Gly Ser Glu Val Lys Asp Phe Lys Val Gly Asp Arg 65 70 75 80 Val Ile Val Pro Cys Thr Thr Pro Asp Trp Arg Ser Leu Glu Val Gln 85 90 95 Ala Gly Phe Gln Gln His Ser Asn Gly Met Leu Ala Gly Trp Lys Phe 100 105 110 Ser Asn Phe Lys Asp Gly Val Phe Ala Asp Tyr Phe His Val Asn Asp 115 120 125 Ala Asp Met Asn Leu Ala Ile Leu Pro Asp Glu Ile Pro Leu Glu Ser 130 135 140 Ala Val Met Met Thr Asp Met Met Thr Thr Gly Phe His Gly Ala Glu 145 150 155 160 Leu Ala Asp Ile Lys Met Gly Ser Ser Val Val Val Ile Gly Ile Gly 165 170 175 Ala Val Gly Leu Met Gly Ile Ala Gly Ser Lys Leu Arg Gly Ala Gly 180 185 190 Arg Ile Ile Gly Val Gly Ser Arg Pro Val Cys Val Glu Thr Ala Lys 195 200 205 Phe Tyr Gly Ala Thr Asp Ile Val Asn Tyr Lys Asn Gly Asp Ile Val 210 215 220 Glu Gln Ile Met Asp Leu Thr His Gly Lys Gly Val Asp Arg Val Ile 225 230 235 240 Met Ala Gly Gly Gly Ala Glu Thr Leu Ala Gln Ala Val Thr Met Val 245 250 255 Lys Pro Gly Gly Val Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp 260 265 270 Thr Leu Pro Ile Pro Arg Val Gln Trp Gly Cys Gly Met Ala His Lys 275 280 285 Thr Ile Arg Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Met 290 295 300 Leu Arg Asp Leu Val Leu Tyr Lys Arg Val Asp Leu Ser Lys Leu Val 305 310 315 320 Thr His Val Phe Asp Gly Ala Glu Asn Ile Glu Lys Ala Leu Leu Leu 325 330 335 Met Lys Asn Lys Pro Lys Asp Leu Ile Lys Ser Val Val Thr Phe 340 345 350 <210> SEQ ID NO 18 <211> LENGTH: 351 <212> TYPE: PRT <213> ORGANISM: Clostridium ragsdalei <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: SecAdh, WP_013239134.1 <400> SEQUENCE: 18 Met Lys Gly Phe Ala Met Leu Gly Ile Asn Lys Leu Gly Trp Ile Glu 1 5 10 15 Lys Lys Asn Pro Val Pro Gly Pro Tyr Asp Ala Ile Val His Pro Leu 20 25 30 Ala Val Ser Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala 35 40 45 Leu Gly Asn Arg Glu Asn Met Ile Leu Gly His Glu Ala Val Gly Glu 50 55 60 Ile Ala Glu Val Gly Ser Glu Val Lys Asp Phe Lys Val Gly Asp Arg 65 70 75 80 Val Ile Val Pro Cys Thr Thr Pro Asp Trp Arg Ser Leu Glu Val Gln 85 90 95 Ala Gly Phe Gln Gln His Ser Asn Gly Met Leu Ala Gly Trp Lys Phe 100 105 110 Ser Asn Phe Lys Asp Gly Val Phe Ala Asp Tyr Phe His Val Asn Asp 115 120 125 Ala Asp Met Asn Leu Ala Ile Leu Pro Asp Glu Ile Pro Leu Glu Ser 130 135 140 Ala Val Met Met Thr Asp Met Met Thr Thr Gly Phe His Gly Ala Glu 145 150 155 160 Leu Ala Asp Ile Lys Met Gly Ser Ser Val Val Val Ile Gly Ile Gly 165 170 175 Ala Val Gly Leu Met Gly Ile Ala Gly Ser Lys Leu Arg Gly Ala Gly 180 185 190 Arg Ile Ile Gly Val Gly Ser Arg Pro Val Cys Val Glu Thr Ala Lys 195 200 205 Phe Tyr Gly Ala Thr Asp Ile Val Asn Tyr Lys Asn Gly Asp Ile Val 210 215 220 Glu Gln Ile Met Asp Leu Thr His Gly Lys Gly Val Asp Arg Val Ile 225 230 235 240 Met Ala Gly Gly Gly Ala Glu Thr Leu Ala Gln Ala Val Thr Met Val 245 250 255 Lys Pro Gly Gly Val Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp 260 265 270 Thr Leu Pro Ile Pro Arg Val Gln Trp Gly Cys Gly Met Ala His Lys 275 280 285 Thr Ile Arg Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Met 290 295 300 Leu Arg Asp Leu Val Leu Tyr Lys Arg Val Asp Leu Ser Lys Leu Val 305 310 315 320 Thr His Val Phe Asp Gly Ala Glu Asn Ile Glu Lys Ala Leu Leu Leu 325 330 335

Met Lys Asn Lys Pro Lys Asp Leu Ile Lys Ser Val Val Thr Phe 340 345 350 <210> SEQ ID NO 19 <211> LENGTH: 351 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: SecAdh, WP_026889046.1 <400> SEQUENCE: 19 Met Lys Gly Phe Ala Met Leu Gly Ile Asn Lys Leu Gly Trp Ile Glu 1 5 10 15 Lys Glu Arg Pro Val Ala Gly Ser Tyr Asp Ala Ile Val Arg Pro Leu 20 25 30 Ala Val Ser Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala 35 40 45 Leu Gly Asp Arg Lys Asn Met Ile Leu Gly His Glu Ala Val Gly Glu 50 55 60 Val Val Glu Val Gly Ser Glu Val Lys Asp Phe Lys Pro Gly Asp Arg 65 70 75 80 Val Ile Val Pro Cys Thr Thr Pro Asp Trp Arg Ser Leu Glu Val Gln 85 90 95 Ala Gly Phe Gln Gln His Ser Asn Gly Met Leu Ala Gly Trp Lys Phe 100 105 110 Ser Asn Phe Lys Asp Gly Val Phe Gly Glu Tyr Phe His Val Asn Asp 115 120 125 Ala Asp Met Asn Leu Ala Ile Leu Pro Lys Asp Met Pro Leu Glu Asn 130 135 140 Ala Val Met Ile Thr Asp Met Met Thr Thr Gly Phe His Gly Ala Glu 145 150 155 160 Leu Ala Asp Ile Gln Met Gly Ser Ser Val Val Val Ile Gly Ile Gly 165 170 175 Ala Val Gly Leu Met Gly Ile Ala Gly Ala Lys Leu Arg Gly Ala Gly 180 185 190 Arg Ile Ile Gly Val Gly Ser Arg Pro Val Cys Val Glu Thr Ala Lys 195 200 205 Phe Tyr Gly Ala Thr Asp Ile Leu Asn Tyr Lys Asn Gly His Ile Val 210 215 220 Asp Gln Val Met Lys Leu Thr Asn Gly Lys Gly Val Asp Arg Val Ile 225 230 235 240 Met Ala Gly Gly Gly Ser Glu Thr Leu Ser Gln Ala Val Ser Met Val 245 250 255 Lys Pro Gly Gly Ile Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp 260 265 270 Ala Leu Leu Ile Pro Arg Val Glu Trp Gly Cys Gly Met Ala His Lys 275 280 285 Thr Ile Lys Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Ala Glu Met 290 295 300 Leu Arg Asp Met Val Val Tyr Asn Arg Val Asp Leu Ser Lys Leu Val 305 310 315 320 Thr His Val Tyr His Gly Phe Asp His Ile Glu Glu Ala Leu Leu Leu 325 330 335 Met Lys Asp Lys Pro Lys Asp Leu Ile Lys Ala Val Val Ile Leu 340 345 350 <210> SEQ ID NO 20 <211> LENGTH: 352 <212> TYPE: PRT <213> ORGANISM: Thermoanaerobacter brokii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: SecAdh, 3FSR_A <400> SEQUENCE: 20 Met Lys Gly Phe Ala Met Leu Ser Ile Gly Lys Val Gly Trp Ile Glu 1 5 10 15 Lys Glu Lys Pro Ala Pro Gly Pro Phe Asp Ala Ile Val Arg Pro Leu 20 25 30 Ala Val Ala Pro Cys Thr Ser Asp Ile His Thr Val Phe Glu Gly Ala 35 40 45 Ile Gly Glu Arg His Asn Met Ile Leu Gly His Glu Ala Val Gly Glu 50 55 60 Val Val Glu Val Gly Ser Glu Val Lys Asp Phe Lys Pro Gly Asp Arg 65 70 75 80 Val Val Val Pro Ala Ile Thr Pro Asp Trp Arg Thr Ser Glu Val Gln 85 90 95 Arg Gly Tyr His Gln His Ser Gly Gly Met Leu Ala Gly Trp Lys Phe 100 105 110 Ser Asn Val Lys Asp Gly Val Phe Gly Glu Phe Phe His Val Asn Asp 115 120 125 Ala Asp Met Asn Leu Ala His Leu Pro Lys Glu Ile Pro Leu Glu Ala 130 135 140 Ala Val Met Ile Pro Asp Met Met Thr Thr Gly Phe His Gly Ala Glu 145 150 155 160 Leu Ala Asp Ile Gln Met Gly Ser Ser Val Val Val Ile Gly Ile Gly 165 170 175 Ala Val Gly Leu Met Gly Ile Ala Gly Ala Lys Leu Arg Gly Ala Gly 180 185 190 Arg Ile Ile Gly Val Gly Ser Arg Pro Ile Cys Val Glu Ala Ala Lys 195 200 205 Phe Tyr Gly Ala Thr Asp Ile Leu Asn Tyr Lys Asn Gly His Ile Val 210 215 220 Asp Gln Val Met Lys Leu Thr Asn Gly Lys Gly Val Asp Arg Val Ile 225 230 235 240 Met Ala Gly Gly Gly Ser Glu Thr Leu Ser Gln Ala Val Ser Met Val 245 250 255 Lys Pro Gly Gly Ile Ile Ser Asn Ile Asn Tyr His Gly Ser Gly Asp 260 265 270 Ala Leu Leu Ile Pro Arg Val Glu Trp Gly Cys Gly Met Ala His Lys 275 280 285 Thr Ile Lys Gly Gly Leu Cys Pro Gly Gly Arg Leu Arg Met Glu Arg 290 295 300 Leu Ile Asp Leu Val Phe Tyr Lys Arg Val Asp Pro Ser Lys Leu Val 305 310 315 320 Thr His Val Phe Arg Gly Phe Asp Asn Ile Glu Lys Ala Phe Met Leu 325 330 335 Met Lys Asp Lys Pro Lys Asp Leu Ile Lys Pro Val Val Ile Leu Ala 340 345 350 <210> SEQ ID NO 21 <211> LENGTH: 520 <212> TYPE: PRT <213> ORGANISM: Mus musculus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: HMG-CoA synthase <400> SEQUENCE: 21 Met Pro Gly Ser Leu Pro Leu Asn Ala Glu Ala Cys Trp Pro Lys Asp 1 5 10 15 Val Gly Ile Val Ala Leu Glu Ile Tyr Phe Pro Ser Gln Tyr Val Asp 20 25 30 Gln Ala Glu Leu Glu Lys Tyr Asp Gly Val Asp Ala Gly Lys Tyr Thr 35 40 45 Ile Gly Leu Gly Gln Ala Arg Met Gly Phe Cys Thr Asp Arg Glu Asp 50 55 60 Ile Asn Ser Leu Cys Leu Thr Val Val Gln Lys Leu Met Glu Arg His 65 70 75 80 Ser Leu Ser Tyr Asp Cys Ile Gly Arg Leu Glu Val Gly Thr Glu Thr 85 90 95 Ile Ile Asp Lys Ser Lys Ser Val Lys Ser Lys Leu Met Gln Leu Phe 100 105 110 Glu Glu Ser Gly Asn Thr Asp Ile Glu Gly Ile Asp Thr Thr Asn Ala 115 120 125 Cys Tyr Gly Gly Thr Ala Ala Val Phe Asn Ala Val Asn Trp Val Glu 130 135 140 Ser Ser Ser Trp Asp Gly Arg Tyr Ala Leu Val Val Ala Gly Asp Ile 145 150 155 160 Ala Ile Tyr Ala Thr Gly Asn Ala Arg Pro Thr Gly Gly Val Gly Ala 165 170 175 Val Ala Leu Leu Ile Gly Pro Asn Ala Pro Leu Ile Phe Asp Arg Gly 180 185 190 Leu Arg Gly Thr His Met Gln His Ala Tyr Asp Phe Tyr Lys Pro Asp 195 200 205 Met Leu Ser Glu Tyr Pro Val Val Asp Gly Lys Leu Ser Ile Gln Cys 210 215 220 Tyr Leu Ser Ala Leu Asp Arg Cys Tyr Ser Val Tyr Arg Lys Lys Ile 225 230 235 240 Arg Ala Gln Trp Gln Lys Glu Gly Lys Asp Lys Asp Phe Thr Leu Asn 245 250 255 Asp Phe Gly Phe Met Ile Phe His Ser Pro Tyr Cys Lys Leu Val Gln 260 265 270 Lys Ser Leu Ala Arg Met Phe Leu Asn Asp Phe Leu Asn Asp Gln Asn 275 280 285 Arg Asp Lys Asn Ser Ile Tyr Ser Gly Leu Glu Ala Phe Gly Asp Val 290 295 300 Lys Leu Glu Asp Thr Tyr Phe Asp Arg Asp Val Glu Lys Ala Phe Met 305 310 315 320 Lys Ala Ser Ser Glu Leu Phe Asn Gln Lys Thr Lys Ala Ser Leu Leu 325 330 335 Val Ser Asn Gln Asn Gly Asn Met Tyr Thr Ser Ser Val Tyr Gly Ser 340 345 350 Leu Ala Ser Val Leu Ala Gln Tyr Ser Pro Gln Gln Leu Ala Gly Lys 355 360 365 Arg Val Gly Val Phe Ser Tyr Gly Ser Gly Leu Ala Ala Thr Leu Tyr 370 375 380 Ser Leu Lys Val Thr Gln Asp Ala Thr Pro Gly Ser Ala Leu Asp Lys 385 390 395 400 Ile Thr Ala Ser Leu Cys Asp Leu Lys Ser Arg Leu Asp Ser Arg Thr 405 410 415 Cys Val Ala Pro Asp Val Phe Ala Glu Asn Met Lys Leu Arg Glu Asp 420 425 430 Thr His His Leu Ala Asn Tyr Ile Pro Gln Cys Ser Ile Asp Ser Leu

435 440 445 Phe Glu Gly Thr Trp Tyr Leu Val Arg Val Asp Glu Lys His Arg Arg 450 455 460 Thr Tyr Ala Arg Arg Pro Phe Thr Asn Asp His Ser Leu Asp Glu Gly 465 470 475 480 Met Gly Leu Val His Ser Asn Thr Ala Thr Glu His Ile Pro Ser Pro 485 490 495 Ala Lys Lys Val Pro Arg Leu Pro Ala Thr Ser Ala Glu Ser Glu Ser 500 505 510 Ala Val Ile Ser Asn Gly Glu His 515 520 <210> SEQ ID NO 22 <211> LENGTH: 396 <212> TYPE: PRT <213> ORGANISM: Saccharomyces cerevisiae <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Mdd, CAA96324.1 <400> SEQUENCE: 22 Met Thr Val Tyr Thr Ala Ser Val Thr Ala Pro Val Asn Ile Ala Thr 1 5 10 15 Leu Lys Tyr Trp Gly Lys Arg Asp Thr Lys Leu Asn Leu Pro Thr Asn 20 25 30 Ser Ser Ile Ser Val Thr Leu Ser Gln Asp Asp Leu Arg Thr Leu Thr 35 40 45 Ser Ala Ala Thr Ala Pro Glu Phe Glu Arg Asp Thr Leu Trp Leu Asn 50 55 60 Gly Glu Pro His Ser Ile Asp Asn Glu Arg Thr Gln Asn Cys Leu Arg 65 70 75 80 Asp Leu Arg Gln Leu Arg Lys Glu Met Glu Ser Lys Asp Ala Ser Leu 85 90 95 Pro Thr Leu Ser Gln Trp Lys Leu His Ile Val Ser Glu Asn Asn Phe 100 105 110 Pro Thr Ala Ala Gly Leu Ala Ser Ser Ala Ala Gly Phe Ala Ala Leu 115 120 125 Val Ser Ala Ile Ala Lys Leu Tyr Gln Leu Pro Gln Ser Thr Ser Glu 130 135 140 Ile Ser Arg Ile Ala Arg Lys Gly Ser Gly Ser Ala Cys Arg Ser Leu 145 150 155 160 Phe Gly Gly Tyr Val Ala Trp Glu Met Gly Lys Ala Glu Asp Gly His 165 170 175 Asp Ser Met Ala Val Gln Ile Ala Asp Ser Ser Asp Trp Pro Gln Met 180 185 190 Lys Ala Cys Val Leu Val Val Ser Asp Ile Lys Lys Asp Val Ser Ser 195 200 205 Thr Gln Gly Met Gln Leu Thr Val Ala Thr Ser Glu Leu Phe Lys Glu 210 215 220 Arg Ile Glu His Val Val Pro Lys Arg Phe Glu Val Met Arg Lys Ala 225 230 235 240 Ile Val Glu Lys Asp Phe Ala Thr Phe Ala Lys Glu Thr Met Met Asp 245 250 255 Ser Asn Ser Phe His Ala Thr Cys Leu Asp Ser Phe Pro Pro Ile Phe 260 265 270 Tyr Met Asn Asp Thr Ser Lys Arg Ile Ile Ser Trp Cys His Thr Ile 275 280 285 Asn Gln Phe Tyr Gly Glu Thr Ile Val Ala Tyr Thr Phe Asp Ala Gly 290 295 300 Pro Asn Ala Val Leu Tyr Tyr Leu Ala Glu Asn Glu Ser Lys Leu Phe 305 310 315 320 Ala Phe Ile Tyr Lys Leu Phe Gly Ser Val Pro Gly Trp Asp Lys Lys 325 330 335 Phe Thr Thr Glu Gln Leu Glu Ala Phe Asn His Gln Phe Glu Ser Ser 340 345 350 Asn Phe Thr Ala Arg Glu Leu Asp Leu Glu Leu Gln Lys Asp Val Ala 355 360 365 Arg Val Ile Leu Thr Gln Val Gly Ser Gly Pro Gln Glu Thr Asn Glu 370 375 380 Ser Leu Ile Asp Ala Lys Thr Gly Leu Pro Lys Glu 385 390 395 <210> SEQ ID NO 23 <211> LENGTH: 324 <212> TYPE: PRT <213> ORGANISM: Picrophilus torridus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Mdd, WP_011178157.1 <400> SEQUENCE: 23 Met Glu Asn Tyr Asn Val Lys Thr Arg Ala Phe Pro Thr Ile Gly Ile 1 5 10 15 Ile Leu Leu Gly Gly Ile Ser Asp Lys Lys Asn Arg Ile Pro Leu His 20 25 30 Thr Thr Ala Gly Ile Ala Tyr Thr Gly Ile Asn Asn Asp Val Tyr Thr 35 40 45 Glu Thr Lys Leu Tyr Val Ser Lys Asp Glu Lys Cys Tyr Ile Asp Gly 50 55 60 Lys Glu Ile Asp Leu Asn Ser Asp Arg Ser Pro Ser Lys Val Ile Asp 65 70 75 80 Lys Phe Lys His Glu Ile Leu Met Arg Val Asn Leu Asp Asp Glu Asn 85 90 95 Asn Leu Ser Ile Asp Ser Arg Asn Phe Asn Ile Leu Ser Gly Ser Ser 100 105 110 Asp Ser Gly Ala Ala Ala Leu Gly Glu Cys Ile Glu Ser Ile Phe Glu 115 120 125 Tyr Asn Ile Asn Ile Phe Thr Phe Glu Asn Asp Leu Gln Arg Ile Ser 130 135 140 Glu Ser Val Gly Arg Ser Leu Tyr Gly Gly Leu Thr Val Asn Tyr Ala 145 150 155 160 Asn Gly Arg Glu Ser Leu Thr Glu Pro Leu Leu Glu Pro Glu Ala Phe 165 170 175 Asn Asn Phe Thr Ile Ile Gly Ala His Phe Asn Ile Asp Arg Lys Pro 180 185 190 Ser Asn Glu Ile His Glu Asn Ile Ile Lys His Glu Asn Tyr Arg Glu 195 200 205 Arg Ile Lys Ser Ala Glu Arg Lys Ala Lys Lys Leu Glu Glu Leu Ser 210 215 220 Arg Asn Ala Asn Ile Lys Gly Ile Phe Glu Leu Ala Glu Ser Asp Thr 225 230 235 240 Val Glu Tyr His Lys Met Leu His Asp Val Gly Val Asp Ile Ile Asn 245 250 255 Asp Arg Met Glu Asn Leu Ile Glu Arg Val Lys Glu Met Lys Asn Asn 260 265 270 Phe Trp Asn Ser Tyr Ile Val Thr Gly Gly Pro Asn Val Phe Val Ile 275 280 285 Thr Glu Lys Lys Asp Val Asp Lys Ala Met Glu Gly Leu Asn Asp Leu 290 295 300 Cys Asp Asp Ile Arg Leu Leu Lys Val Ala Gly Lys Pro Gln Val Ile 305 310 315 320 Ser Lys Asn Phe <210> SEQ ID NO 24 <211> LENGTH: 460 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: CimA, AGY76958.1 <400> SEQUENCE: 24 Met Lys Lys Ser Ser Tyr Glu Tyr Lys Leu Asn Asn Val Asp Ser Pro 1 5 10 15 Asn Phe Tyr Lys Asn Ile Phe Pro Tyr Asp Glu Ile Pro Lys Ile Asn 20 25 30 Phe Asn Gly Val Gln Ile Pro Lys Asp Leu Pro Glu Asn Ile Tyr Ile 35 40 45 Thr Asp Thr Thr Phe Arg Asp Gly Gln Gln Ser Met Pro Pro Tyr Thr 50 55 60 Thr Glu Gln Ile Ile Arg Ile Phe Asp Tyr Leu His Asn Leu Asp Asn 65 70 75 80 Asn Ser Gly Ile Ile Lys Gln Thr Glu Phe Phe Leu Tyr Thr Glu Lys 85 90 95 Asp Arg Lys Ala Ala Gln Val Cys Met Glu Arg Gly Tyr Glu Phe Pro 100 105 110 Glu Val Thr Ser Trp Ile Arg Ala Asn Lys Glu Asp Phe Lys Leu Val 115 120 125 Lys Gln Met Gly Ile Lys Glu Thr Gly Met Leu Met Ser Cys Ser Asp 130 135 140 Tyr His Ile Phe Lys Lys Leu Arg Lys Thr Arg Lys Glu Thr Met Asp 145 150 155 160 Met Tyr Leu Gly Ile Val Lys Glu Ala Leu Asp Asn Gly Ile Arg Pro 165 170 175 Arg Cys His Leu Glu Asp Ile Thr Arg Ala Asp Phe Tyr Gly Phe Val 180 185 190 Val Pro Leu Val Asn Lys Leu Met Glu Leu Ser Lys Gln Ser Gly Ile 195 200 205 Pro Ile Lys Ile Arg Ala Cys Asp Thr Leu Gly Leu Gly Val Ser Tyr 210 215 220 Ser Gly Val Glu Leu Pro Arg Ser Val Gln Ala Ile Met Tyr Gly Leu 225 230 235 240 Arg Asn Asn Cys Gly Val Pro Ser Glu Cys Ile Glu Trp His Gly His 245 250 255 Asn Asp Phe Tyr Ala Val Val Asn Asn Ser Thr Thr Ala Trp Leu Tyr 260 265 270 Gly Ala Ser Ala Val Asn Thr Ser Phe Leu Gly Ile Gly Glu Arg Thr 275 280 285 Gly Asn Cys Pro Leu Glu Ala Met Ile Phe Glu Tyr Gly Gln Ile Lys 290 295 300 Gly Asn Thr Lys Asn Met Lys Leu Glu Val Ile Thr Glu Leu Ser Glu 305 310 315 320 Tyr Phe Lys Lys Glu Met Glu Tyr Ala Val Pro Pro Arg Thr Pro Phe 325 330 335 Val Gly Lys Glu Phe Asn Val Thr Arg Ala Gly Ile His Ala Asp Gly 340 345 350

Ile Leu Lys Asp Glu Glu Ile Tyr Asn Ile Phe Asp Thr Asp Lys Ile 355 360 365 Leu Gly Arg Pro Val Val Val Ala Val Asn Gln Tyr Ser Gly His Ala 370 375 380 Gly Ile Ala Ala Trp Ile Asn Thr Tyr Tyr Arg Leu Lys Asp Glu Glu 385 390 395 400 Lys Ile Asp Lys Trp Asp Thr Arg Ile Ala Lys Ile Lys Glu Trp Val 405 410 415 Asp Glu Gln Tyr Lys Ala Gly Arg Thr Ser Ile Ile Gly Asn Asp Glu 420 425 430 Leu Glu Leu Leu Val Asp Lys Met Leu Pro Asp Ile Ser Gln Lys Lys 435 440 445 Lys Lys Glu Leu Ala Arg Val Asp Thr Arg Phe Ile 450 455 460 <210> SEQ ID NO 25 <211> LENGTH: 491 <212> TYPE: PRT <213> ORGANISM: Methanocaldococcus jannaschii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: CimA, NP_248395.1 <400> SEQUENCE: 25 Met Met Val Arg Ile Phe Asp Thr Thr Leu Arg Asp Gly Glu Gln Thr 1 5 10 15 Pro Gly Val Ser Leu Thr Pro Asn Asp Lys Leu Glu Ile Ala Lys Lys 20 25 30 Leu Asp Glu Leu Gly Val Asp Val Ile Glu Ala Gly Ser Ala Ile Thr 35 40 45 Ser Lys Gly Glu Arg Glu Gly Ile Lys Leu Ile Thr Lys Glu Gly Leu 50 55 60 Asn Ala Glu Ile Cys Ser Phe Val Arg Ala Leu Pro Val Asp Ile Asp 65 70 75 80 Ala Ala Leu Glu Cys Asp Val Asp Ser Val His Leu Val Val Pro Thr 85 90 95 Ser Pro Ile His Met Lys Tyr Lys Leu Arg Lys Thr Glu Asp Glu Val 100 105 110 Leu Glu Thr Ala Leu Lys Ala Val Glu Tyr Ala Lys Glu His Gly Leu 115 120 125 Ile Val Glu Leu Ser Ala Glu Asp Ala Thr Arg Ser Asp Val Asn Phe 130 135 140 Leu Ile Lys Leu Phe Asn Glu Gly Glu Lys Val Gly Ala Asp Arg Val 145 150 155 160 Cys Val Cys Asp Thr Val Gly Val Leu Thr Pro Gln Lys Ser Gln Glu 165 170 175 Leu Phe Lys Lys Ile Thr Glu Asn Val Asn Leu Pro Val Ser Val His 180 185 190 Cys His Asn Asp Phe Gly Met Ala Thr Ala Asn Thr Cys Ser Ala Val 195 200 205 Leu Gly Gly Ala Val Gln Cys His Val Thr Val Asn Gly Ile Gly Glu 210 215 220 Arg Ala Gly Asn Ala Ser Leu Glu Glu Val Val Ala Ala Leu Lys Ile 225 230 235 240 Leu Tyr Gly Tyr Asp Thr Lys Ile Lys Met Glu Lys Leu Tyr Glu Val 245 250 255 Ser Arg Ile Val Ser Arg Leu Met Lys Leu Pro Val Pro Pro Asn Lys 260 265 270 Ala Ile Val Gly Asp Asn Ala Phe Ala His Glu Ala Gly Ile His Val 275 280 285 Asp Gly Leu Ile Lys Asn Thr Glu Thr Tyr Glu Pro Ile Lys Pro Glu 290 295 300 Met Val Gly Asn Arg Arg Arg Ile Ile Leu Gly Lys His Ser Gly Arg 305 310 315 320 Lys Ala Leu Lys Tyr Lys Leu Asp Leu Met Gly Ile Asn Val Ser Asp 325 330 335 Glu Gln Leu Asn Lys Ile Tyr Glu Arg Val Lys Glu Phe Gly Asp Leu 340 345 350 Gly Lys Tyr Ile Ser Asp Ala Asp Leu Leu Ala Ile Val Arg Glu Val 355 360 365 Thr Gly Lys Leu Val Glu Glu Lys Ile Lys Leu Asp Glu Leu Thr Val 370 375 380 Val Ser Gly Asn Lys Ile Thr Pro Ile Ala Ser Val Lys Leu His Tyr 385 390 395 400 Lys Gly Glu Asp Ile Thr Leu Ile Glu Thr Ala Tyr Gly Val Gly Pro 405 410 415 Val Asp Ala Ala Ile Asn Ala Val Arg Lys Ala Ile Ser Gly Val Ala 420 425 430 Asp Ile Lys Leu Val Glu Tyr Arg Val Glu Ala Ile Gly Gly Gly Thr 435 440 445 Asp Ala Leu Ile Glu Val Val Val Lys Leu Arg Lys Gly Thr Glu Ile 450 455 460 Val Glu Val Arg Lys Ser Asp Ala Asp Ile Ile Arg Ala Ser Val Asp 465 470 475 480 Ala Val Met Glu Gly Ile Asn Met Leu Leu Asn 485 490 <210> SEQ ID NO 26 <211> LENGTH: 421 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuC, WP_023162955.1 <400> SEQUENCE: 26 Met Gly Met Thr Met Thr Gln Lys Ile Leu Ala His His Ala Lys Met 1 5 10 15 Asp Glu Val Lys Ala Gly Gln Leu Ile Lys Val Lys Leu Asp Leu Val 20 25 30 Leu Gly Asn Asp Ile Thr Thr Pro Val Ala Ile Asn Glu Phe Asn Lys 35 40 45 Ile Gly Leu Asn Asn Val Phe Asp Lys Asn Lys Ile Ala Ile Val Pro 50 55 60 Asp His Phe Thr Pro Asn Lys Asp Ile Lys Ser Ala Glu Gln Cys Lys 65 70 75 80 Tyr Val Arg Glu Phe Val Lys Lys Met Glu Ile Lys Asn Tyr Phe Glu 85 90 95 Val Gly Arg Met Gly Ile Glu His Ala Leu Ile Pro Glu Lys Gly Leu 100 105 110 Ala Val Cys Gly Asp Val Val Ile Gly Ala Asp Ser His Thr Cys Thr 115 120 125 Tyr Gly Ala Leu Gly Ala Phe Ser Thr Gly Ile Gly Ser Thr Asp Met 130 135 140 Ala Ala Gly Met Ala Thr Gly Glu Ala Trp Phe Lys Val Pro Glu Ala 145 150 155 160 Ile Lys Phe Val Leu Lys Gly Lys Leu Thr Lys Trp Val Ser Gly Lys 165 170 175 Asp Val Ile Leu His Ile Ile Gly Met Ile Gly Val Asp Gly Ala Leu 180 185 190 Tyr Lys Ser Met Glu Phe Thr Gly Glu Gly Val Ser Ser Leu Thr Met 195 200 205 Asp Asp Arg Phe Thr Ile Cys Asn Met Ala Ile Glu Ala Gly Ala Lys 210 215 220 Asn Gly Ile Phe Pro Val Asp Glu Asn Thr Ile Asn Tyr Val Lys Glu 225 230 235 240 His Ser Lys Lys Asn Tyr Thr Val Tyr Glu Ala Asp Ser Asp Ala Glu 245 250 255 Tyr Ser Gln Val Ile Glu Ile Asp Leu Ser Lys Ile Arg Pro Thr Val 260 265 270 Ala Phe Pro His Ile Pro Glu Asn Thr Lys Thr Ile Asp Glu Val Gly 275 280 285 Asp Ile Arg Ile Asp Gln Val Val Ile Gly Ser Cys Thr Asn Gly Arg 290 295 300 Ile Gly Asp Leu Arg Ala Ala Ala Ser Ile Leu Lys Gly Arg Lys Val 305 310 315 320 Asn Glu Asn Val Arg Ala Ile Ile Phe Pro Ala Thr Gln Ala Ile Tyr 325 330 335 Leu Gln Ala Met Lys Glu Gly Leu Ile Glu Ile Phe Ile Glu Ala Gly 340 345 350 Ala Val Val Ser Thr Pro Thr Cys Gly Pro Cys Leu Gly Gly His Met 355 360 365 Gly Ile Leu Ala Glu Gly Glu Arg Ala Val Ser Thr Thr Asn Arg Asn 370 375 380 Phe Val Gly Arg Met Gly His Val Lys Ser Glu Val Tyr Leu Ala Ser 385 390 395 400 Pro Glu Val Ala Ala Ala Ser Ala Val Thr Gly Lys Ile Ser Ser Pro 405 410 415 Glu Glu Val Val Lys 420 <210> SEQ ID NO 27 <211> LENGTH: 164 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuD, AGY77204.1 <400> SEQUENCE: 27 Met Ile Lys Gly Lys Ala Ile Lys Tyr Gly Asp Asn Val Asp Thr Asp 1 5 10 15 Val Ile Ile Pro Ala Arg Tyr Leu Asn Thr Ser Asp His Lys Glu Leu 20 25 30 Ala Ser His Cys Met Glu Asp Ile Asp Lys Asp Phe Ser Lys Lys Ile 35 40 45 Ser Lys Gly Asp Ile Met Ile Ala Gly Lys Asn Phe Gly Cys Gly Ser 50 55 60 Ser Arg Glu His Ala Pro Ile Ala Ile Lys Ala Ser Gly Ile Ser Cys 65 70 75 80 Ile Ile Ala Glu Thr Phe Ala Arg Ile Phe Phe Arg Asn Ser Ile Asn 85 90 95 Ile Gly Leu Pro Ile Met Glu Cys Glu Glu Ala Ala Lys Asp Ile Asp 100 105 110 Glu Lys Asp Glu Val Ser Val Asp Thr Val Ser Gly Val Ile Thr Asn 115 120 125

Ile Thr Lys Asn Lys Thr Tyr Lys Ala Val Pro Phe Pro Glu Phe Met 130 135 140 His Lys Ile Ile Lys Ser Glu Gly Leu Ile Asn Tyr Ile Lys Glu Glu 145 150 155 160 Val Glu Asn Lys <210> SEQ ID NO 28 <211> LENGTH: 466 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuC, NP_414614.1 <400> SEQUENCE: 28 Met Ala Lys Thr Leu Tyr Glu Lys Leu Phe Asp Ala His Val Val Tyr 1 5 10 15 Glu Ala Glu Asn Glu Thr Pro Leu Leu Tyr Ile Asp Arg His Leu Val 20 25 30 His Glu Val Thr Ser Pro Gln Ala Phe Asp Gly Leu Arg Ala His Gly 35 40 45 Arg Pro Val Arg Gln Pro Gly Lys Thr Phe Ala Thr Met Asp His Asn 50 55 60 Val Ser Thr Gln Thr Lys Asp Ile Asn Ala Cys Gly Glu Met Ala Arg 65 70 75 80 Ile Gln Met Gln Glu Leu Ile Lys Asn Cys Lys Glu Phe Gly Val Glu 85 90 95 Leu Tyr Asp Leu Asn His Pro Tyr Gln Gly Ile Val His Val Met Gly 100 105 110 Pro Glu Gln Gly Val Thr Leu Pro Gly Met Thr Ile Val Cys Gly Asp 115 120 125 Ser His Thr Ala Thr His Gly Ala Phe Gly Ala Leu Ala Phe Gly Ile 130 135 140 Gly Thr Ser Glu Val Glu His Val Leu Ala Thr Gln Thr Leu Lys Gln 145 150 155 160 Gly Arg Ala Lys Thr Met Lys Ile Glu Val Gln Gly Lys Ala Ala Pro 165 170 175 Gly Ile Thr Ala Lys Asp Ile Val Leu Ala Ile Ile Gly Lys Thr Gly 180 185 190 Ser Ala Gly Gly Thr Gly His Val Val Glu Phe Cys Gly Glu Ala Ile 195 200 205 Arg Asp Leu Ser Met Glu Gly Arg Met Thr Leu Cys Asn Met Ala Ile 210 215 220 Glu Met Gly Ala Lys Ala Gly Leu Val Ala Pro Asp Glu Thr Thr Phe 225 230 235 240 Asn Tyr Val Lys Gly Arg Leu His Ala Pro Lys Gly Lys Asp Phe Asp 245 250 255 Asp Ala Val Ala Tyr Trp Lys Thr Leu Gln Thr Asp Glu Gly Ala Thr 260 265 270 Phe Asp Thr Val Val Thr Leu Gln Ala Glu Glu Ile Ser Pro Gln Val 275 280 285 Thr Trp Gly Thr Asn Pro Gly Gln Val Ile Ser Val Asn Asp Asn Ile 290 295 300 Pro Asp Pro Ala Ser Phe Ala Asp Pro Val Glu Arg Ala Ser Ala Glu 305 310 315 320 Lys Ala Leu Ala Tyr Met Gly Leu Lys Pro Gly Ile Pro Leu Thr Glu 325 330 335 Val Ala Ile Asp Lys Val Phe Ile Gly Ser Cys Thr Asn Ser Arg Ile 340 345 350 Glu Asp Leu Arg Ala Ala Ala Glu Ile Ala Lys Gly Arg Lys Val Ala 355 360 365 Pro Gly Val Gln Ala Leu Val Val Pro Gly Ser Gly Pro Val Lys Ala 370 375 380 Gln Ala Glu Ala Glu Gly Leu Asp Lys Ile Phe Ile Glu Ala Gly Phe 385 390 395 400 Glu Trp Arg Leu Pro Gly Cys Ser Met Cys Leu Ala Met Asn Asn Asp 405 410 415 Arg Leu Asn Pro Gly Glu Arg Cys Ala Ser Thr Ser Asn Arg Asn Phe 420 425 430 Glu Gly Arg Gln Gly Arg Gly Gly Arg Thr His Leu Val Ser Pro Ala 435 440 445 Met Ala Ala Ala Ala Ala Val Thr Gly His Phe Ala Asp Ile Arg Asn 450 455 460 Ile Lys 465 <210> SEQ ID NO 29 <211> LENGTH: 201 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuD, NP_414613.1 <400> SEQUENCE: 29 Met Ala Glu Lys Phe Ile Lys His Thr Gly Leu Val Val Pro Leu Asp 1 5 10 15 Ala Ala Asn Val Asp Thr Asp Ala Ile Ile Pro Lys Gln Phe Leu Gln 20 25 30 Lys Val Thr Arg Thr Gly Phe Gly Ala His Leu Phe Asn Asp Trp Arg 35 40 45 Phe Leu Asp Glu Lys Gly Gln Gln Pro Asn Pro Asp Phe Val Leu Asn 50 55 60 Phe Pro Gln Tyr Gln Gly Ala Ser Ile Leu Leu Ala Arg Glu Asn Phe 65 70 75 80 Gly Cys Gly Ser Ser Arg Glu His Ala Pro Trp Ala Leu Thr Asp Tyr 85 90 95 Gly Phe Lys Val Val Ile Ala Pro Ser Phe Ala Asp Ile Phe Tyr Gly 100 105 110 Asn Ser Phe Asn Asn Gln Leu Leu Pro Val Lys Leu Ser Asp Ala Glu 115 120 125 Val Asp Glu Leu Phe Ala Leu Val Lys Ala Asn Pro Gly Ile His Phe 130 135 140 Asp Val Asp Leu Glu Ala Gln Glu Val Lys Ala Gly Glu Lys Thr Tyr 145 150 155 160 Arg Phe Thr Ile Asp Ala Phe Arg Arg His Cys Met Met Asn Gly Leu 165 170 175 Asp Ser Ile Gly Leu Thr Leu Gln His Asp Asp Ala Ile Ala Ala Tyr 180 185 190 Glu Ala Lys Gln Pro Ala Phe Met Asn 195 200 <210> SEQ ID NO 30 <211> LENGTH: 354 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuB, WP_023162957.1 <400> SEQUENCE: 30 Met Lys Ile Ala Ile Ile Pro Gly Asp Gly Ile Gly Lys Glu Ile Ile 1 5 10 15 Glu Gln Ala Lys Lys Val Leu Lys Ala Ala Ser Ala Lys Tyr Asn Phe 20 25 30 Asp Phe Glu Cys Glu Glu Val Leu Leu Gly Gly Ala Ala Val Asp Ala 35 40 45 Thr Gly Val Pro Leu Pro Asp Lys Thr Val Glu Val Cys Lys Lys Ser 50 55 60 Asp Ala Val Leu Leu Gly Ala Val Gly Gly Pro Lys Trp Asp Ser Leu 65 70 75 80 Pro Ser Lys Leu Arg Pro Glu Ala Gly Leu Leu Gly Ile Arg Lys Ala 85 90 95 Leu Gly Val Phe Ala Asn Leu Arg Pro Ala Ile Leu Phe Pro Glu Leu 100 105 110 Ile Ala Ala Ser Asn Leu Lys Pro Glu Val Leu Gly Gly Gly Leu Asp 115 120 125 Ile Met Ile Val Arg Glu Leu Ile Gly Gly Ala Tyr Phe Gly Glu Lys 130 135 140 Asn Arg Ile Asp Ile Glu Gly Gly Lys Lys Ala Trp Asp Thr Ile Ser 145 150 155 160 Tyr Thr Ser Phe Glu Ile Asp Arg Ile Thr Arg Lys Ala Phe Glu Ile 165 170 175 Ala Arg Lys Arg Ser Asn Arg Leu Thr Leu Val Asp Lys Ala Asn Val 180 185 190 Leu Glu Ser Ser Lys Leu Trp Arg Glu Val Val Gly Asn Ile Ala Lys 195 200 205 Glu Tyr Glu Asp Val Glu Ile Asn Tyr Met Tyr Val Asp Asn Ala Ser 210 215 220 Met Gln Leu Ile Arg Asp Pro Lys Gln Phe Asp Val Ile Leu Thr Glu 225 230 235 240 Asn Met Phe Gly Asp Ile Leu Ser Asp Glu Ala Ser Met Leu Thr Gly 245 250 255 Ser Leu Gly Met Leu Pro Ser Ala Ser Val Arg Gly Asp Ser Phe Gly 260 265 270 Leu Tyr Glu Pro Val His Gly Ser Ala Pro Asp Ile Ala Gly Gln Asn 275 280 285 Lys Ala Asn Pro Ile Gly Thr Ile Met Ser Val Ala Met Met Leu Lys 290 295 300 Tyr Ser Phe Asp Met Glu Gln Ala Tyr Val Asp Ile Lys Asn Ala Ile 305 310 315 320 Ser Lys Val Leu Lys Glu Gly Tyr Arg Thr Gly Asp Ile Ala Lys Glu 325 330 335 Asp Ser Lys Leu Val Gly Thr Glu Glu Met Gly Asp Leu Ile Val Lys 340 345 350 Asn Leu <210> SEQ ID NO 31 <211> LENGTH: 363 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: LeuB, NP_414615.4 <400> SEQUENCE: 31 Met Ser Lys Asn Tyr His Ile Ala Val Leu Pro Gly Asp Gly Ile Gly

1 5 10 15 Pro Glu Val Met Thr Gln Ala Leu Lys Val Leu Asp Ala Val Arg Asn 20 25 30 Arg Phe Ala Met Arg Ile Thr Thr Ser His Tyr Asp Val Gly Gly Ala 35 40 45 Ala Ile Asp Asn His Gly Gln Pro Leu Pro Pro Ala Thr Val Glu Gly 50 55 60 Cys Glu Gln Ala Asp Ala Val Leu Phe Gly Ser Val Gly Gly Pro Lys 65 70 75 80 Trp Glu His Leu Pro Pro Asp Gln Gln Pro Glu Arg Gly Ala Leu Leu 85 90 95 Pro Leu Arg Lys His Phe Lys Leu Phe Ser Asn Leu Arg Pro Ala Lys 100 105 110 Leu Tyr Gln Gly Leu Glu Ala Phe Cys Pro Leu Arg Ala Asp Ile Ala 115 120 125 Ala Asn Gly Phe Asp Ile Leu Cys Val Arg Glu Leu Thr Gly Gly Ile 130 135 140 Tyr Phe Gly Gln Pro Lys Gly Arg Glu Gly Ser Gly Gln Tyr Glu Lys 145 150 155 160 Ala Phe Asp Thr Glu Val Tyr His Arg Phe Glu Ile Glu Arg Ile Ala 165 170 175 Arg Ile Ala Phe Glu Ser Ala Arg Lys Arg Arg His Lys Val Thr Ser 180 185 190 Ile Asp Lys Ala Asn Val Leu Gln Ser Ser Ile Leu Trp Arg Glu Ile 195 200 205 Val Asn Glu Ile Ala Thr Glu Tyr Pro Asp Val Glu Leu Ala His Met 210 215 220 Tyr Ile Asp Asn Ala Thr Met Gln Leu Ile Lys Asp Pro Ser Gln Phe 225 230 235 240 Asp Val Leu Leu Cys Ser Asn Leu Phe Gly Asp Ile Leu Ser Asp Glu 245 250 255 Cys Ala Met Ile Thr Gly Ser Met Gly Met Leu Pro Ser Ala Ser Leu 260 265 270 Asn Glu Gln Gly Phe Gly Leu Tyr Glu Pro Ala Gly Gly Ser Ala Pro 275 280 285 Asp Ile Ala Gly Lys Asn Ile Ala Asn Pro Ile Ala Gln Ile Leu Ser 290 295 300 Leu Ala Leu Leu Leu Arg Tyr Ser Leu Asp Ala Asp Asp Ala Ala Cys 305 310 315 320 Ala Ile Glu Arg Ala Ile Asn Arg Ala Leu Glu Glu Gly Ile Arg Thr 325 330 335 Gly Asp Leu Ala Arg Gly Ala Ala Ala Val Ser Thr Asp Glu Met Gly 340 345 350 Asp Ile Ile Ala Arg Tyr Val Ala Glu Gly Val 355 360 <210> SEQ ID NO 32 <211> LENGTH: 536 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvB, AGY74359.1 <400> SEQUENCE: 32 Met Lys Ala Ala Glu Ala Val Ile Gln Cys Leu Lys Lys Glu Asn Val 1 5 10 15 Asn Met Val Phe Gly Tyr Pro Gly Ala Ala Val Val Pro Ile Tyr Glu 20 25 30 Ala Leu Arg Lys Ser Asp Val Lys His Ile Leu Val Arg Gln Glu Gln 35 40 45 Ala Ala Gly His Ser Ala Ser Gly Tyr Ala Arg Ser Thr Gly Glu Val 50 55 60 Gly Val Cys Ile Val Thr Ser Gly Pro Gly Ala Thr Asn Leu Ile Thr 65 70 75 80 Ala Ile Ala Ala Ala Tyr Met Asp Ser Ile Pro Leu Val Val Ile Thr 85 90 95 Gly Gln Val Lys Ser Thr Leu Ile Gly Arg Asp Val Phe Gln Glu Leu 100 105 110 Asp Ile Thr Gly Ala Thr Glu Ser Phe Thr Lys Tyr Asn Phe Leu Val 115 120 125 Arg Asp Ala Lys Ser Ile Pro Lys Thr Ile Lys Glu Ala Phe Tyr Ile 130 135 140 Ala Glu Thr Gly Arg Lys Gly Pro Val Leu Val Asp Ile Pro Met Asp 145 150 155 160 Ile Met Glu Glu Asp Ile Asp Phe Glu Tyr Pro Glu Ser Val Asn Ile 165 170 175 Arg Gly Tyr Lys Pro Thr Val Lys Gly His Ser Gly Gln Ile Lys Lys 180 185 190 Ile Ile Asp Arg Ile Lys Val Ser Lys Arg Pro Leu Ile Cys Ala Gly 195 200 205 Gly Gly Val Ile Leu Ala Asn Ala Gln Lys Glu Leu Glu Gln Phe Val 210 215 220 Lys Lys Ser His Ile Pro Val Val His Thr Leu Met Gly Lys Gly Cys 225 230 235 240 Ile Asn Glu Asn Ser Asp Tyr Tyr Val Gly Leu Ile Gly Thr His Gly 245 250 255 Phe Ala Tyr Ala Asn Lys Val Val Gln Asn Ala Asp Val Leu Ile Leu 260 265 270 Ile Gly Ala Arg Ala Ser Asp Arg Thr Val Ser Gly Val Lys Ser Phe 275 280 285 Ala Lys Asp Ala Asp Ile Ile His Ile Asp Ile Asp Pro Ala Glu Ile 290 295 300 Gly Lys Ile Leu Asn Thr Tyr Ile Pro Val Val Gly Asp Cys Gly Ser 305 310 315 320 Val Leu Ser Asp Leu Asn Lys Glu Ile Val Ala Pro Gln Thr Glu Lys 325 330 335 Trp Met Glu Glu Ile Lys Asn Trp Lys Lys Asp Leu Tyr Ile Glu Arg 340 345 350 Lys Pro Thr Asp Lys Val Asn Pro Lys Tyr Val Leu Lys Thr Val Ser 355 360 365 Asp Thr Leu Gly Glu Glu Val Ile Leu Thr Ala Asp Val Gly Gln Asn 370 375 380 Gln Leu Trp Cys Ala Arg Asn Phe Arg Met Thr Gly Asn Arg Lys Phe 385 390 395 400 Leu Thr Ser Gly Gly Leu Gly Thr Met Gly Tyr Ser Leu Pro Ala Ala 405 410 415 Ile Gly Ala Lys Ile Ala Cys Pro Asp Lys Gln Val Ile Ala Phe Ala 420 425 430 Gly Asp Gly Gly Phe Gln Met Ser Leu Phe Glu Leu Gly Thr Ile Ala 435 440 445 Glu Asn Asn Leu Asn Ile Ile Ile Val Leu Phe Asn Asn Ser Gly Leu 450 455 460 Gly Met Val Arg Glu Ile Gln Asp Asn Lys Tyr Ser Gly Glu Phe Gly 465 470 475 480 Val Asn Phe Arg Thr Asn Pro Asp Phe Val Lys Leu Ala Glu Ala Tyr 485 490 495 Gly Leu Lys Ala Lys Arg Val Glu Asn Asp Ser Glu Phe Asn Gly Val 500 505 510 Phe Arg Glu Ala Leu Asp Ser Ser Lys Ala Phe Leu Ile Glu Cys Ile 515 520 525 Val Asp Pro His Glu Arg Thr Phe 530 535 <210> SEQ ID NO 33 <211> LENGTH: 558 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvB, AGY74635.1 <400> SEQUENCE: 33 Met Lys Ile Lys Gly Ala Glu Val Leu Leu Lys Cys Met Met Glu Gln 1 5 10 15 Gly Val Asp Thr Val Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro Ile 20 25 30 Tyr Asp Ala Leu Tyr Ala Ala Lys Gly Lys Ile Thr His Ile Ser Thr 35 40 45 Ser His Glu Gln Gly Ala Ala His Ala Ala Asp Gly Tyr Ala Arg Ser 50 55 60 Thr Gly Lys Val Gly Val Val Ile Ala Thr Ser Gly Pro Gly Ala Thr 65 70 75 80 Asn Thr Val Thr Ala Ile Ala Thr Ala Tyr Met Asp Ser Val Pro Ile 85 90 95 Val Val Phe Thr Gly Gln Val Ala Arg Ser Leu Leu Gly Lys Asp Ser 100 105 110 Phe Gln Glu Val Asn Ile Lys Asp Ile Thr Ala Ser Ile Thr Lys Lys 115 120 125 Ser Cys Ile Val Glu Lys Val Glu Asp Leu Ala Asp Thr Val Arg Glu 130 135 140 Ala Phe Gln Ile Ala Val Ser Gly Arg Pro Gly Pro Val Val Val Asp 145 150 155 160 Ile Pro Lys Asp Val Gln Ser Ala Glu Val Glu Tyr Glu Pro Phe Arg 165 170 175 Ser Lys Leu Ser Glu Ile Lys Glu Lys Lys Tyr Phe Asn Leu Asn Glu 180 185 190 Tyr Gly Asp Ser Leu Asn Lys Ala Ile Asp Met Ile Asn Arg Ser Glu 195 200 205 Arg Pro Val Ile Tyr Ser Gly Gly Gly Thr Val Thr Ser Gly Ala Gln 210 215 220 Asn Glu Leu Met Glu Leu Val Glu Lys Ile Asp Ser Pro Ile Thr Cys 225 230 235 240 Ser Leu Met Gly Ile Gly Ala Phe Pro Gly Asn Asn Glu Tyr Tyr Met 245 250 255 Gly Met Val Gly Met His Gly Ser Arg Cys Ser Asn Tyr Ala Val Ser 260 265 270 Asn Cys Asp Leu Leu Ile Ala Ile Gly Ala Arg Phe Ser Asp Arg Val 275 280 285 Ile Ser Lys Val Ser Ala Phe Ala Pro Lys Ala Arg Ile Ile His Ile 290 295 300 Asp Ile Asp Pro Lys Glu Phe Gly Lys Asn Val Asp Ile Asp Val Ala 305 310 315 320

Ile Lys Gly Asp Val Lys Glu Val Leu Gln Lys Ile Asn Cys Lys Leu 325 330 335 Glu Lys Ala Asp His Arg Asp Trp Met Glu Lys Ile Lys Gln Trp Lys 340 345 350 Ser Glu Gln Cys Glu Pro Phe Lys Glu Cys Lys Leu Ser Pro Lys Phe 355 360 365 Ile Met Asp Thr Leu Tyr Asn Leu Thr Gly Gly Glu Cys Ile Ile Thr 370 375 380 Thr Glu Val Gly Gln Asn Gln Ile Trp Thr Ala Gln Tyr Phe Lys Phe 385 390 395 400 Leu Lys Pro Arg Thr Phe Val Ser Ser Gly Gly Leu Gly Thr Met Gly 405 410 415 Phe Gly Leu Gly Ala Ser Ile Gly Ala Ser Met Gly Asn Pro Gly Lys 420 425 430 Lys Val Ile Asn Val Ala Gly Asp Gly Ser Phe Lys Met Asn Ser Thr 435 440 445 Glu Leu Ala Thr Val Ala Lys Tyr Lys Leu Pro Ile Val Gln Leu Leu 450 455 460 Leu Asn Asn Arg Ala Leu Gly Met Val Tyr Gln Trp Gln Asp Met Phe 465 470 475 480 Tyr Gly Lys Arg Phe Ser Asn Thr Glu Leu Gly Pro Asp Val Asp Phe 485 490 495 Met Lys Leu Gly Glu Ala Tyr Gly Ile Lys Thr Phe Lys Ile Glu Asp 500 505 510 Asn Ser Gln Val Glu Lys Cys Leu Lys Glu Ala Leu Asp Leu Asn Glu 515 520 525 Pro Val Ile Ile Glu Cys Asp Ile Asp Arg Lys Glu Lys Val Phe Pro 530 535 540 Ile Val Pro Pro Gly Ala Ala Ile Ser Asp Leu Val Glu Glu 545 550 555 <210> SEQ ID NO 34 <211> LENGTH: 158 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvN, AGY74360.1 <400> SEQUENCE: 34 Met Ser Val Leu Val Glu Asn His Ser Gly Val Leu Ser Lys Val Ala 1 5 10 15 Gly Leu Phe Ser Arg Arg Gly Tyr Asn Ile His Ser Leu Thr Val Gly 20 25 30 Val Thr Gly Asp Pro Glu Ile Ser Arg Met Thr Ile Val Ser Ile Gly 35 40 45 Asp Asp Tyr Met Phe Glu Gln Ile Ser Lys Gln Leu Asn Lys Leu Ile 50 55 60 Glu Val Ile Lys Val Ile Glu Leu Asn Pro Asp Ala Ser Val Tyr Arg 65 70 75 80 Glu Leu Ser Leu Ile Lys Val Ser Ala Glu Ser Asn Asn Lys Leu Leu 85 90 95 Ile Met Glu Ser Val Asn Thr Phe Arg Gly Lys Ile Val Asp Met Asn 100 105 110 Glu Lys Ser Met Ile Ile Glu Ile Thr Gly Asn Glu Lys Lys Ile Ser 115 120 125 Ala Phe Ile Glu Leu Met Lys Pro Tyr Gly Ile Lys Glu Ile Ile Arg 130 135 140 Thr Gly Leu Thr Ala Leu Gln Arg Gly Ser Lys Leu Glu Asp 145 150 155 <210> SEQ ID NO 35 <211> LENGTH: 562 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvB, NP_418127.1 <400> SEQUENCE: 35 Met Ala Ser Ser Gly Thr Thr Ser Thr Arg Lys Arg Phe Thr Gly Ala 1 5 10 15 Glu Phe Ile Val His Phe Leu Glu Gln Gln Gly Ile Lys Ile Val Thr 20 25 30 Gly Ile Pro Gly Gly Ser Ile Leu Pro Val Tyr Asp Ala Leu Ser Gln 35 40 45 Ser Thr Gln Ile Arg His Ile Leu Ala Arg His Glu Gln Gly Ala Gly 50 55 60 Phe Ile Ala Gln Gly Met Ala Arg Thr Asp Gly Lys Pro Ala Val Cys 65 70 75 80 Met Ala Cys Ser Gly Pro Gly Ala Thr Asn Leu Val Thr Ala Ile Ala 85 90 95 Asp Ala Arg Leu Asp Ser Ile Pro Leu Ile Cys Ile Thr Gly Gln Val 100 105 110 Pro Ala Ser Met Ile Gly Thr Asp Ala Phe Gln Glu Val Asp Thr Tyr 115 120 125 Gly Ile Ser Ile Pro Ile Thr Lys His Asn Tyr Leu Val Arg His Ile 130 135 140 Glu Glu Leu Pro Gln Val Met Ser Asp Ala Phe Arg Ile Ala Gln Ser 145 150 155 160 Gly Arg Pro Gly Pro Val Trp Ile Asp Ile Pro Lys Asp Val Gln Thr 165 170 175 Ala Val Phe Glu Ile Glu Thr Gln Pro Ala Met Ala Glu Lys Ala Ala 180 185 190 Ala Pro Ala Phe Ser Glu Glu Ser Ile Arg Asp Ala Ala Ala Met Ile 195 200 205 Asn Ala Ala Lys Arg Pro Val Leu Tyr Leu Gly Gly Gly Val Ile Asn 210 215 220 Ala Pro Ala Arg Val Arg Glu Leu Ala Glu Lys Ala Gln Leu Pro Thr 225 230 235 240 Thr Met Thr Leu Met Ala Leu Gly Met Leu Pro Lys Ala His Pro Leu 245 250 255 Ser Leu Gly Met Leu Gly Met His Gly Val Arg Ser Thr Asn Tyr Ile 260 265 270 Leu Gln Glu Ala Asp Leu Leu Ile Val Leu Gly Ala Arg Phe Asp Asp 275 280 285 Arg Ala Ile Gly Lys Thr Glu Gln Phe Cys Pro Asn Ala Lys Ile Ile 290 295 300 His Val Asp Ile Asp Arg Ala Glu Leu Gly Lys Ile Lys Gln Pro His 305 310 315 320 Val Ala Ile Gln Ala Asp Val Asp Asp Val Leu Ala Gln Leu Ile Pro 325 330 335 Leu Val Glu Ala Gln Pro Arg Ala Glu Trp His Gln Leu Val Ala Asp 340 345 350 Leu Gln Arg Glu Phe Pro Cys Pro Ile Pro Lys Ala Cys Asp Pro Leu 355 360 365 Ser His Tyr Gly Leu Ile Asn Ala Val Ala Ala Cys Val Asp Asp Asn 370 375 380 Ala Ile Ile Thr Thr Asp Val Gly Gln His Gln Met Trp Thr Ala Gln 385 390 395 400 Ala Tyr Pro Leu Asn Arg Pro Arg Gln Trp Leu Thr Ser Gly Gly Leu 405 410 415 Gly Thr Met Gly Phe Gly Leu Pro Ala Ala Ile Gly Ala Ala Leu Ala 420 425 430 Asn Pro Asp Arg Lys Val Leu Cys Phe Ser Gly Asp Gly Ser Leu Met 435 440 445 Met Asn Ile Gln Glu Met Ala Thr Ala Ser Glu Asn Gln Leu Asp Val 450 455 460 Lys Ile Ile Leu Met Asn Asn Glu Ala Leu Gly Leu Val His Gln Gln 465 470 475 480 Gln Ser Leu Phe Tyr Glu Gln Gly Val Phe Ala Ala Thr Tyr Pro Gly 485 490 495 Lys Ile Asn Phe Met Gln Ile Ala Ala Gly Phe Gly Leu Glu Thr Cys 500 505 510 Asp Leu Asn Asn Glu Ala Asp Pro Gln Ala Ser Leu Gln Glu Ile Ile 515 520 525 Asn Arg Pro Gly Pro Ala Leu Ile His Val Arg Ile Asp Ala Glu Glu 530 535 540 Lys Val Tyr Pro Met Val Pro Pro Gly Ala Ala Asn Thr Glu Met Val 545 550 555 560 Gly Glu <210> SEQ ID NO 36 <211> LENGTH: 96 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvN, NP_418126.1 <400> SEQUENCE: 36 Met Gln Asn Thr Thr His Asp Asn Val Ile Leu Glu Leu Thr Val Arg 1 5 10 15 Asn His Pro Gly Val Met Thr His Val Cys Gly Leu Phe Ala Arg Arg 20 25 30 Ala Phe Asn Val Glu Gly Ile Leu Cys Leu Pro Ile Gln Asp Ser Asp 35 40 45 Lys Ser His Ile Trp Leu Leu Val Asn Asp Asp Gln Arg Leu Glu Gln 50 55 60 Met Ile Ser Gln Ile Asp Lys Leu Glu Asp Val Val Lys Val Gln Arg 65 70 75 80 Asn Gln Ser Asp Pro Thr Met Phe Asn Lys Ile Ala Val Phe Phe Gln 85 90 95 <210> SEQ ID NO 37 <211> LENGTH: 337 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvC, WP_013238693.1 <400> SEQUENCE: 37 Met Glu Lys Leu Lys Val Tyr Tyr Asp Glu Asp Ala Asp Leu Asn Leu 1 5 10 15 Leu Lys Gly Lys Lys Ile Ala Ile Leu Gly Phe Gly Ser Gln Gly His 20 25 30

Ala His Ala Leu Asn Leu Lys Glu Ser Gly Leu Asp Val Ile Val Gly 35 40 45 Leu Tyr Lys Gly Ser Lys Ser Trp Lys Lys Ala Glu Asp Tyr Gly Phe 50 55 60 Lys Val Tyr Glu Ile Ala Glu Ala Val Lys Gln Ala Asp Ile Ile Thr 65 70 75 80 Val Leu Leu Pro Asp Glu Lys Gln Lys Gln Ile Tyr Asp Glu Ser Ile 85 90 95 Lys Asp Asn Leu Ser Glu Gly Asn Ala Leu Phe Phe Ala His Gly Phe 100 105 110 Asn Ile His Phe Asn Gln Ile Val Pro Pro Lys Phe Val Asp Val Leu 115 120 125 Met Ile Ala Pro Lys Gly Pro Gly His Ile Val Arg Arg Glu Tyr Thr 130 135 140 Leu Gly Asn Gly Val Pro Cys Leu Tyr Ala Val Tyr Gln Asp Tyr Ser 145 150 155 160 Gly Lys Gly Lys Glu Ile Ala Leu Ala Tyr Gly Lys Gly Ile Gly Gly 165 170 175 Thr Arg Ala Gly Val Met Thr Thr Thr Phe Lys Val Glu Thr Glu Thr 180 185 190 Asp Leu Phe Gly Glu Gln Val Val Leu Cys Gly Gly Val Ala Glu Leu 195 200 205 Ile Lys Ala Gly Phe Asp Thr Leu Val Glu Ala Gly Tyr Ala Pro Glu 210 215 220 Asn Ala Tyr Phe Glu Cys Leu His Glu Met Lys Leu Ile Val Asp Leu 225 230 235 240 Ile Tyr Glu Gly Gly Leu Ala Arg Met Arg Tyr Ser Val Ser Asp Thr 245 250 255 Ala Glu Tyr Gly Asp Tyr Lys Ile Gly Lys Arg Ile Ile Asn Asp Asn 260 265 270 Thr Arg Ala Glu Met Lys Lys Val Leu Thr Glu Ile Gln Asp Gly Thr 275 280 285 Phe Ala Arg Glu Trp Leu Leu Glu Asn Gln Thr Gly Arg Pro Gly Phe 290 295 300 Thr Ala Arg Arg Arg Met Glu Lys Asp Ala Pro Ile Glu Lys Val Gly 305 310 315 320 Lys Glu Leu Arg Ser Met Met Ser Trp Ile Asn Glu Asn Pro Asp Asn 325 330 335 Glu <210> SEQ ID NO 38 <211> LENGTH: 491 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvC, NP_418222.1 <400> SEQUENCE: 38 Met Ala Asn Tyr Phe Asn Thr Leu Asn Leu Arg Gln Gln Leu Ala Gln 1 5 10 15 Leu Gly Lys Cys Arg Phe Met Gly Arg Asp Glu Phe Ala Asp Gly Ala 20 25 30 Ser Tyr Leu Gln Gly Lys Lys Val Val Ile Val Gly Cys Gly Ala Gln 35 40 45 Gly Leu Asn Gln Gly Leu Asn Met Arg Asp Ser Gly Leu Asp Ile Ser 50 55 60 Tyr Ala Leu Arg Lys Glu Ala Ile Ala Glu Lys Arg Ala Ser Trp Arg 65 70 75 80 Lys Ala Thr Glu Asn Gly Phe Lys Val Gly Thr Tyr Glu Glu Leu Ile 85 90 95 Pro Gln Ala Asp Leu Val Ile Asn Leu Thr Pro Asp Lys Gln His Ser 100 105 110 Asp Val Val Arg Thr Val Gln Pro Leu Met Lys Asp Gly Ala Ala Leu 115 120 125 Gly Tyr Ser His Gly Phe Asn Ile Val Glu Val Gly Glu Gln Ile Arg 130 135 140 Lys Asp Ile Thr Val Val Met Val Ala Pro Lys Cys Pro Gly Thr Glu 145 150 155 160 Val Arg Glu Glu Tyr Lys Arg Gly Phe Gly Val Pro Thr Leu Ile Ala 165 170 175 Val His Pro Glu Asn Asp Pro Lys Gly Glu Gly Met Ala Ile Ala Lys 180 185 190 Ala Trp Ala Ala Ala Thr Gly Gly His Arg Ala Gly Val Leu Glu Ser 195 200 205 Ser Phe Val Ala Glu Val Lys Ser Asp Leu Met Gly Glu Gln Thr Ile 210 215 220 Leu Cys Gly Met Leu Gln Ala Gly Ser Leu Leu Cys Phe Asp Lys Leu 225 230 235 240 Val Glu Glu Gly Thr Asp Pro Ala Tyr Ala Glu Lys Leu Ile Gln Phe 245 250 255 Gly Trp Glu Thr Ile Thr Glu Ala Leu Lys Gln Gly Gly Ile Thr Leu 260 265 270 Met Met Asp Arg Leu Ser Asn Pro Ala Lys Leu Arg Ala Tyr Ala Leu 275 280 285 Ser Glu Gln Leu Lys Glu Ile Met Ala Pro Leu Phe Gln Lys His Met 290 295 300 Asp Asp Ile Ile Ser Gly Glu Phe Ser Ser Gly Met Met Ala Asp Trp 305 310 315 320 Ala Asn Asp Asp Lys Lys Leu Leu Thr Trp Arg Glu Glu Thr Gly Lys 325 330 335 Thr Ala Phe Glu Thr Ala Pro Gln Tyr Glu Gly Lys Ile Gly Glu Gln 340 345 350 Glu Tyr Phe Asp Lys Gly Val Leu Met Ile Ala Met Val Lys Ala Gly 355 360 365 Val Glu Leu Ala Phe Glu Thr Met Val Asp Ser Gly Ile Ile Glu Glu 370 375 380 Ser Ala Tyr Tyr Glu Ser Leu His Glu Leu Pro Leu Ile Ala Asn Thr 385 390 395 400 Ile Ala Arg Lys Arg Leu Tyr Glu Met Asn Val Val Ile Ser Asp Thr 405 410 415 Ala Glu Tyr Gly Asn Tyr Leu Phe Ser Tyr Ala Cys Val Pro Leu Leu 420 425 430 Lys Pro Phe Met Ala Glu Leu Gln Pro Gly Asp Leu Gly Lys Ala Ile 435 440 445 Pro Glu Gly Ala Val Asp Asn Gly Gln Leu Arg Asp Val Asn Glu Ala 450 455 460 Ile Arg Ser His Ala Ile Glu Gln Val Gly Lys Lys Leu Arg Gly Tyr 465 470 475 480 Met Thr Asp Met Lys Arg Ile Ala Val Ala Gly 485 490 <210> SEQ ID NO 39 <211> LENGTH: 552 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvD, WP_013238694.1 <400> SEQUENCE: 39 Met Lys Ser Asp Ser Val Lys Lys Gly Ile Lys Ala Ala Pro Ala Arg 1 5 10 15 Ala Leu Met Tyr Gly Met Gly Tyr Thr Lys Glu Glu Ile Glu Arg Pro 20 25 30 Leu Ile Gly Ile Val Asn Ser Gln Asn Glu Ile Val Ala Gly His Met 35 40 45 His Leu Asp Glu Ile Ala Lys Ala Ala Lys Leu Gly Val Ala Met Ser 50 55 60 Gly Gly Thr Pro Ile Glu Phe Pro Ala Ile Ala Val Cys Asp Gly Ile 65 70 75 80 Ala Met Gly His Val Gly Met Lys Tyr Ser Leu Ala Ser Arg Glu Leu 85 90 95 Ile Ala Asp Ser Ile Glu Ala Met Ala Thr Ala His Gly Phe Asp Gly 100 105 110 Leu Val Leu Ile Pro Asn Cys Asp Lys Ile Val Pro Gly Met Leu Met 115 120 125 Ala Ala Ala Arg Leu Asn Ile Pro Ala Val Val Val Ser Gly Gly Pro 130 135 140 Met Arg Ala Gly Lys Leu Asn Asn Lys Ala Leu Asp Phe Ser Thr Cys 145 150 155 160 Ile Glu Lys Val Ala Ala Cys Ser Asp Gly Lys Val Thr Glu Glu Glu 165 170 175 Leu Glu Glu Glu Ala Lys Arg Ala Cys Pro Gly Cys Gly Ser Cys Ser 180 185 190 Gly Leu Phe Thr Ala Asn Ser Met Asn Ser Leu Thr Glu Val Leu Gly 195 200 205 Met Gly Leu Pro Leu Asn Gly Ser Ala Leu Ala Gln Thr Gly Glu Arg 210 215 220 Asn Gln Leu Ala Lys Tyr Ala Gly Met Tyr Val Met Asp Cys Val Lys 225 230 235 240 Asn Asp Arg Arg Pro Arg Asp Ile Leu Thr Leu Asp Ala Phe Lys Asn 245 250 255 Ala Ile Thr Val Asp Met Ala Met Ala Gly Ser Thr Asn Thr Val Leu 260 265 270 His Leu Pro Ala Ile Ala His Glu Ala Gly Ile Glu Leu Asn Leu Asp 275 280 285 Leu Phe His Glu Ile Ser Lys His Thr Pro Cys Leu Thr Lys Leu Ser 290 295 300 Pro Ser Gly Lys His His Met Glu Asp Leu His Leu Ala Gly Gly Ile 305 310 315 320 Pro Ala Leu Met Asn Glu Leu Ser Lys Lys Gly Leu Ile Asn Glu Asp 325 330 335 Ala Leu Thr Val Thr Gly Lys Thr Val Gly Glu Thr Ile Lys Asp Phe 340 345 350 Lys Val Leu Asp Tyr Glu Val Ile Arg Ser Val Asp Asn Ala Tyr Ser 355 360 365 Ser Glu Gly Gly Ile Ala Ile Leu Arg Gly Asn Leu Ala Pro Asp Gly 370 375 380 Ala Val Val Lys Glu Ser Ala Val Ser Lys Glu Met Met Val His Glu 385 390 395 400 Gly Pro Ala Arg Val Tyr Asn Ser Glu Glu Ala Ala Val Lys Ala Ile 405 410 415

Phe Gly Asn Glu Ile Asn Lys Gly Asp Val Ile Val Ile Arg Tyr Glu 420 425 430 Gly Pro Lys Gly Gly Pro Gly Met Arg Glu Met Leu Ser Pro Thr Ser 435 440 445 Ala Ile Ala Gly Met Gly Leu Asp Lys Asp Val Ala Leu Leu Thr Asp 450 455 460 Gly Arg Phe Ser Gly Ala Thr Arg Gly Ala Ser Ile Gly His Val Ser 465 470 475 480 Pro Glu Ala Met Glu Gly Gly Leu Ile Gly Leu Val Glu Glu Gly Asp 485 490 495 Thr Ile Phe Val Asp Ile Thr Asn Lys Lys Leu Glu Leu Lys Val Ser 500 505 510 Glu Glu Glu Leu Glu Lys Arg Arg Lys Asn Tyr Val Lys Pro Glu Pro 515 520 525 Lys Ile Lys Thr Gly Tyr Leu Ser Arg Tyr Ala Lys Leu Val Thr Ser 530 535 540 Ala Asn Thr Gly Ala Val Leu Lys 545 550 <210> SEQ ID NO 40 <211> LENGTH: 616 <212> TYPE: PRT <213> ORGANISM: Escherichia coli <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: IlvD, YP_026248.1 <400> SEQUENCE: 40 Met Pro Lys Tyr Arg Ser Ala Thr Thr Thr His Gly Arg Asn Met Ala 1 5 10 15 Gly Ala Arg Ala Leu Trp Arg Ala Thr Gly Met Thr Asp Ala Asp Phe 20 25 30 Gly Lys Pro Ile Ile Ala Val Val Asn Ser Phe Thr Gln Phe Val Pro 35 40 45 Gly His Val His Leu Arg Asp Leu Gly Lys Leu Val Ala Glu Gln Ile 50 55 60 Glu Ala Ala Gly Gly Val Ala Lys Glu Phe Asn Thr Ile Ala Val Asp 65 70 75 80 Asp Gly Ile Ala Met Gly His Gly Gly Met Leu Tyr Ser Leu Pro Ser 85 90 95 Arg Glu Leu Ile Ala Asp Ser Val Glu Tyr Met Val Asn Ala His Cys 100 105 110 Ala Asp Ala Met Val Cys Ile Ser Asn Cys Asp Lys Ile Thr Pro Gly 115 120 125 Met Leu Met Ala Ser Leu Arg Leu Asn Ile Pro Val Ile Phe Val Ser 130 135 140 Gly Gly Pro Met Glu Ala Gly Lys Thr Lys Leu Ser Asp Gln Ile Ile 145 150 155 160 Lys Leu Asp Leu Val Asp Ala Met Ile Gln Gly Ala Asp Pro Lys Val 165 170 175 Ser Asp Ser Gln Ser Asp Gln Val Glu Arg Ser Ala Cys Pro Thr Cys 180 185 190 Gly Ser Cys Ser Gly Met Phe Thr Ala Asn Ser Met Asn Cys Leu Thr 195 200 205 Glu Ala Leu Gly Leu Ser Gln Pro Gly Asn Gly Ser Leu Leu Ala Thr 210 215 220 His Ala Asp Arg Lys Gln Leu Phe Leu Asn Ala Gly Lys Arg Ile Val 225 230 235 240 Glu Leu Thr Lys Arg Tyr Tyr Glu Gln Asn Asp Glu Ser Ala Leu Pro 245 250 255 Arg Asn Ile Ala Ser Lys Ala Ala Phe Glu Asn Ala Met Thr Leu Asp 260 265 270 Ile Ala Met Gly Gly Ser Thr Asn Thr Val Leu His Leu Leu Ala Ala 275 280 285 Ala Gln Glu Ala Glu Ile Asp Phe Thr Met Ser Asp Ile Asp Lys Leu 290 295 300 Ser Arg Lys Val Pro Gln Leu Cys Lys Val Ala Pro Ser Thr Gln Lys 305 310 315 320 Tyr His Met Glu Asp Val His Arg Ala Gly Gly Val Ile Gly Ile Leu 325 330 335 Gly Glu Leu Asp Arg Ala Gly Leu Leu Asn Arg Asp Val Lys Asn Val 340 345 350 Leu Gly Leu Thr Leu Pro Gln Thr Leu Glu Gln Tyr Asp Val Met Leu 355 360 365 Thr Gln Asp Asp Ala Val Lys Asn Met Phe Arg Ala Gly Pro Ala Gly 370 375 380 Ile Arg Thr Thr Gln Ala Phe Ser Gln Asp Cys Arg Trp Asp Thr Leu 385 390 395 400 Asp Asp Asp Arg Ala Asn Gly Cys Ile Arg Ser Leu Glu His Ala Tyr 405 410 415 Ser Lys Asp Gly Gly Leu Ala Val Leu Tyr Gly Asn Phe Ala Glu Asn 420 425 430 Gly Cys Ile Val Lys Thr Ala Gly Val Asp Asp Ser Ile Leu Lys Phe 435 440 445 Thr Gly Pro Ala Lys Val Tyr Glu Ser Gln Asp Asp Ala Val Glu Ala 450 455 460 Ile Leu Gly Gly Lys Val Val Ala Gly Asp Val Val Val Ile Arg Tyr 465 470 475 480 Glu Gly Pro Lys Gly Gly Pro Gly Met Gln Glu Met Leu Tyr Pro Thr 485 490 495 Ser Phe Leu Lys Ser Met Gly Leu Gly Lys Ala Cys Ala Leu Ile Thr 500 505 510 Asp Gly Arg Phe Ser Gly Gly Thr Ser Gly Leu Ser Ile Gly His Val 515 520 525 Ser Pro Glu Ala Ala Ser Gly Gly Ser Ile Gly Leu Ile Glu Asp Gly 530 535 540 Asp Leu Ile Ala Ile Asp Ile Pro Asn Arg Gly Ile Gln Leu Gln Val 545 550 555 560 Ser Asp Ala Glu Leu Ala Ala Arg Arg Glu Ala Gln Asp Ala Arg Gly 565 570 575 Asp Lys Ala Trp Thr Pro Lys Asn Arg Glu Arg Gln Val Ser Phe Ala 580 585 590 Leu Arg Ala Tyr Ala Ser Leu Ala Thr Ser Ala Asp Lys Gly Ala Val 595 600 605 Arg Asp Lys Ser Lys Leu Gly Gly 610 615 <210> SEQ ID NO 41 <211> LENGTH: 477 <212> TYPE: PRT <213> ORGANISM: Methanothermobacter thermautotrophicus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorA, WP_010876344.1 <400> SEQUENCE: 41 Met Thr Lys Lys Val Ile Arg Lys Pro Asp Ser Leu His Asp Val Phe 1 5 10 15 Glu Arg Lys Gly Gly Ser Ala Pro Thr Ala Thr His Tyr Cys Ala Gly 20 25 30 Cys Gly His Gly Ile Leu His Lys Leu Ile Gly Glu Ala Met Asp Glu 35 40 45 Leu Gly Ile Gln Glu Arg Ala Val Met Ile Ser Pro Val Gly Cys Ala 50 55 60 Val Phe Ala Tyr Tyr Tyr Phe Asp Cys Gly Asn Val Gln Val Ala His 65 70 75 80 Gly Arg Ala Pro Ala Val Gly Thr Gly Ile Ser Arg Ala Glu Asp Asp 85 90 95 Ala Val Val Ile Leu Tyr Gln Gly Asp Gly Asp Leu Ala Ser Ile Gly 100 105 110 Leu Asn Glu Thr Ile Gln Ala Ala Asn Arg Gly Glu Lys Leu Ala Val 115 120 125 Phe Phe Val Asn Asn Thr Val Tyr Gly Met Thr Gly Gly Gln Met Ala 130 135 140 Pro Thr Thr Leu Val Gly Glu Val Thr Val Thr Cys Pro Thr Gly Arg 145 150 155 160 Asp Pro Arg Tyr Ala Gly Tyr Pro Leu His Met Cys Glu Leu Leu Asp 165 170 175 Asn Leu Gln Ala Pro Val Phe Ile Glu Arg Val Ser Leu Ala Asp Pro 180 185 190 Lys Arg Ile Arg Arg Ala Arg Arg Ala Ile Lys Arg Ala Leu Glu Ile 195 200 205 Gln Arg Asp Gly Lys Gly Tyr Ala Phe Val Glu Val Leu Ser Pro Cys 210 215 220 Pro Thr Asn Leu Arg Gln Asp Ala Glu Gly Ala Glu Arg Phe Leu Lys 225 230 235 240 Glu Glu Met Glu Lys Glu Phe Pro Val Lys Asn Phe Arg Asp Arg Ser 245 250 255 Ala Glu Thr Glu Pro Leu Ile Arg Ser Glu Ser Asp Phe Ser Arg Glu 260 265 270 Ser Leu Asp Arg Ile Phe Gln Ile Arg Glu Asp Ser Val Pro Asp Pro 275 280 285 Val Asp Asp Pro Glu Phe Pro Glu Val Arg Val Lys Ile Ala Gly Phe 290 295 300 Gly Gly Gln Gly Val Leu Ser Met Gly Leu Thr Leu Ala Gln Ala Ala 305 310 315 320 Cys Ser Glu Gly Arg His Thr Ser Trp Tyr Pro Ala Tyr Gly Pro Glu 325 330 335 Gln Arg Gly Gly Thr Ser Ser Cys Gly Val Val Ile Ser Gly Glu Arg 340 345 350 Val Gly Ser Pro Ala Val Asp Thr Pro Asp Val Leu Val Ala Leu Asn 355 360 365 Gln Pro Ser Leu Asp Glu Phe Ala Asp Asp Val Ala Asp Gly Gly Ile 370 375 380 Ile Leu Tyr Asp Ser Thr Thr Ala Ser Phe Ser Gly Gly Ala Val Arg 385 390 395 400 Ala Met Gly Val Pro Ala Leu Glu Ile Ala Arg Lys His Gly Thr Ala 405 410 415 Arg Ala Ala Asn Thr Val Met Leu Gly Val Met Met Ala Leu Gly Leu 420 425 430 Thr Gly Leu Asp Glu Glu Ser Phe Arg Glu Ala Ile Lys Phe Thr Phe 435 440 445 Ala Gly Lys Glu Lys Ile Ile Asp Met Asn Leu Arg Ile Leu Glu Ala

450 455 460 Gly Ala Glu Trp Ala Arg Glu Asn Ile Glu Gly Glu Leu 465 470 475 <210> SEQ ID NO 42 <211> LENGTH: 352 <212> TYPE: PRT <213> ORGANISM: Methanothermobacter thermautotrophicus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorB, WP_010876343.1 <400> SEQUENCE: 42 Met Ala Thr Gln Met Val Lys Gly Asn Thr Ala Val Ile Ile Gly Ala 1 5 10 15 Met Tyr Ala Gly Cys Asp Cys Tyr Phe Gly Tyr Pro Ile Thr Pro Ala 20 25 30 Ser Glu Ile Leu His Glu Ala Ser Arg Tyr Phe Pro Met Val Gly Arg 35 40 45 Lys Phe Val Gln Ala Glu Ser Glu Glu Ala Ala Ile Asn Met Val Tyr 50 55 60 Gly Ala Ala Ala Ala Gly His Arg Val Met Thr Ala Ser Ser Gly Pro 65 70 75 80 Gly Ile Ser Leu Lys Gln Glu Gly Ile Ser Phe Leu Ala Gly Ala Glu 85 90 95 Leu Pro Ala Val Ile Val Asp Val Met Arg Ala Gly Pro Gly Leu Gly 100 105 110 Asn Ile Gly Pro Glu Gln Gly Asp Tyr Asn Gln Ile Val Lys Gly Gly 115 120 125 Gly His Gly Asn Tyr Arg Asn Met Val Leu Ala Pro Ser Ser Val Gln 130 135 140 Glu Met Cys Asp Leu Thr Met Glu Ala Phe Glu Leu Ala Asp Lys Tyr 145 150 155 160 Arg Asn Pro Val Val Val Leu Thr Asp Ala Val Leu Gly Gln Met Ala 165 170 175 Glu Pro Leu Arg Phe Pro Glu Glu Ala Val Glu His Arg Pro Asp Thr 180 185 190 Ser Trp Ala Val Cys Gly Asn Arg Glu Thr Met Lys Asn Leu Val Thr 195 200 205 Ser Ile Phe Leu Asp Phe Asp Glu Leu Glu Glu Phe Asn Phe Tyr Leu 210 215 220 Gln Glu Lys Tyr Ala Arg Ile Glu Glu Asn Glu Val Arg Tyr Glu Glu 225 230 235 240 Tyr Leu Val Asp Asp Ala Glu Ile Val Met Val Ala Tyr Gly Ile Ser 245 250 255 Ser Arg Val Ala Arg Ser Ala Val Glu Thr Ala Arg Ala Glu Gly Ile 260 265 270 Asn Val Gly Leu Leu Arg Pro Ile Thr Leu Phe Pro Phe Pro Ser Asp 275 280 285 Arg Ile Arg Glu Leu Ala Asp Gly Gly Cys Arg Phe Ile Ser Val Glu 290 295 300 Met Ser Ser Gly Gln Met Arg Glu Asp Ile Arg Met Ala Ser Gly Cys 305 310 315 320 Arg Asp Val Glu Leu Val Asn Arg Met Gly Gly Asn Leu Ile Glu Leu 325 330 335 Arg Asp Val Leu Glu Lys Ile Arg Glu Val Ala Gly Asp Ser Ser Asp 340 345 350 <210> SEQ ID NO 43 <211> LENGTH: 79 <212> TYPE: PRT <213> ORGANISM: Methanothermobacter thermautotrophicus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorC, WP_010876342.1 <400> SEQUENCE: 43 Met Lys Lys Ala Tyr Pro Val Ile Asn Ser Val Glu Cys Lys Ala Cys 1 5 10 15 Glu Arg Cys Ile Ile Ala Cys Pro Arg Lys Val Leu Gln Met Ser Ser 20 25 30 Lys Ile Asn Glu Arg Gly Tyr His Tyr Val Glu Tyr Arg Gly Glu Gly 35 40 45 Cys Asn Gly Cys Gly Asn Cys Tyr Tyr Thr Cys Pro Glu Ile Asn Ala 50 55 60 Ile Glu Val His Ile Glu Arg Cys Glu Asp Gly Asn Thr Asp Gly 65 70 75 <210> SEQ ID NO 44 <211> LENGTH: 124 <212> TYPE: PRT <213> ORGANISM: Methanothermobacter thermautotrophicus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorD, WP_010876341.1 <400> SEQUENCE: 44 Met Asp Glu Asp Gly Tyr Met Trp Phe Val Gly Arg Thr Asp Asp Ile 1 5 10 15 Ile Lys Ser Ser Gly Tyr Arg Ile Gly Pro Phe Glu Val Glu Ser Ala 20 25 30 Ile Ile Ser His Pro Ser Val Leu Glu Cys Ala Val Thr Gly Tyr Pro 35 40 45 Asp Pro Ile Arg Gly Gln Val Val Lys Ala Thr Ile Val Leu Ala Arg 50 55 60 Gly Tyr Glu Pro Ser Glu Glu Leu Lys Lys Glu Ile Gln Asp His Val 65 70 75 80 Lys Arg Val Thr Ala Pro Tyr Lys Tyr Pro Arg Ile Val Glu Phe Val 85 90 95 Asp Glu Leu Pro Lys Thr Ile Ser Gly Lys Ile Arg Arg Val Glu Ile 100 105 110 Arg Glu His Asp Leu Glu Gly Asp Gly Glu Asn Pro 115 120 <210> SEQ ID NO 45 <211> LENGTH: 394 <212> TYPE: PRT <213> ORGANISM: Pyrococcus furiosus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorA, WP_011012106.1 <400> SEQUENCE: 45 Met Glu Tyr Lys Pro Ile Arg Lys Val Val Ser Gly Asn Tyr Ala Ala 1 5 10 15 Ala Tyr Ala Ala Leu His Ala Arg Val Gln Val Val Ala Ala Tyr Pro 20 25 30 Ile Thr Pro Gln Thr Ser Ile Ile Glu Lys Ile Ala Glu Phe Ile Ala 35 40 45 Asn Gly Glu Ala Asp Ile Gln Tyr Ile Pro Val Glu Ser Glu His Ser 50 55 60 Ala Met Ala Ala Cys Ile Gly Ala Ser Ala Thr Gly Ala Arg Thr Phe 65 70 75 80 Thr Ala Thr Ser Ala Gln Gly Leu Ala Leu Met His Glu Met Leu His 85 90 95 Trp Ala Ala Gly Ala Arg Leu Pro Ile Val Met Val Asp Val Asn Arg 100 105 110 Ala Met Ala Pro Pro Trp Ser Val Trp Asp Asp Gln Thr Asp Ser Leu 115 120 125 Ser Gln Arg Asp Thr Gly Trp Met Gln Phe Tyr Ala Glu Asn Asn Gln 130 135 140 Glu Val Tyr Asp Gly Val Leu Met Ala Tyr Lys Val Ala Glu Thr Val 145 150 155 160 Asn Val Pro Ala Met Val Val Glu Ser Ala Phe Ile Leu Ser His Thr 165 170 175 Tyr Asp Val Val Glu Met Ile Pro Gln Glu Leu Val Asp Glu Phe Leu 180 185 190 Pro Pro Arg Lys Pro Leu Tyr Ser Leu Ala Asn Phe Asp Glu Pro Ile 195 200 205 Ala Val Gly Ala Leu Ala Thr Pro Asn Asp Tyr Tyr Glu Phe Arg Tyr 210 215 220 Lys Leu Ala Lys Ala His Glu Glu Ala Lys Lys Val Ile Lys Glu Val 225 230 235 240 Gly Lys Glu Phe Gly Glu Arg Phe Gly Arg Asp Tyr Ser Gln Met Ile 245 250 255 Glu Thr Gly Tyr Ile Asp Asp Ala Asp Phe Val Phe Met Gly Met Gly 260 265 270 Ser Leu Met Gly Thr Val Lys Glu Ala Val Asp Leu Leu Arg Lys Glu 275 280 285 Gly Tyr Lys Val Gly Tyr Ala Lys Val Arg Trp Phe Arg Pro Phe Pro 290 295 300 Lys Glu Glu Leu Val Glu Ile Ala Glu Ser Val Lys Gly Ile Ala Val 305 310 315 320 Leu Asp Arg Asn Phe Ser Phe Gly Gln Glu Gly Ile Leu Phe Thr Glu 325 330 335 Ser Lys Gly Ala Leu Tyr Asn Ser Ser Ala His Pro Leu Met Lys Asn 340 345 350 Tyr Ile Val Gly Leu Gly Gly Arg Asp Val Thr Val Lys Asp Ile Lys 355 360 365 Ala Ile Ala Asp Asp Met Lys Lys Val Ile Glu Ser Gly Lys Val Asp 370 375 380 Lys Glu Val Val Trp Tyr His Leu Lys Arg 385 390 <210> SEQ ID NO 46 <211> LENGTH: 311 <212> TYPE: PRT <213> ORGANISM: Pyrococcus furiosus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorB, WP_011012105.1 <400> SEQUENCE: 46 Met Glu Val Pro Glu Asn Ile Lys Lys Arg Val Thr Ile Pro Phe Glu 1 5 10 15 Glu His Phe Tyr Ala Gly His Thr Ala Cys Gln Gly Cys Gly Ala Ser 20 25 30 Leu Gly Leu Arg Tyr Val Leu Lys Ala Tyr Gly Lys Lys Thr Ile Leu 35 40 45 Val Ile Pro Ala Cys Cys Ser Thr Ile Ile Ala Gly Pro Trp Pro Tyr 50 55 60

Ser Ala Ile Asp Ala Asn Leu Phe His Thr Ala Phe Glu Thr Thr Gly 65 70 75 80 Ala Val Ile Ser Gly Ile Glu Ala Ala Leu Lys Ala Met Gly Tyr Lys 85 90 95 Val Lys Gly Glu Asp Gly Ile Met Val Val Gly Trp Ala Gly Asp Gly 100 105 110 Gly Thr Ala Asp Ile Gly Leu Gln Ala Leu Ser Gly Phe Leu Glu Arg 115 120 125 Gly His Asp Ala Val Tyr Ile Met Tyr Asp Asn Glu Ala Tyr Met Asn 130 135 140 Thr Gly Ile Gln Arg Ser Ser Ser Thr Pro Tyr Gly Ala Trp Thr Thr 145 150 155 160 Asn Thr Pro Gly Gly Arg Arg His Phe Leu Glu Lys Arg His Lys Lys 165 170 175 Lys Val Ile Asp Ile Val Ile Ala His Arg Ile Pro Tyr Ala Ala Thr 180 185 190 Ala Ser Ile Ala Tyr Pro Glu Asp Phe Ile Arg Lys Leu Lys Lys Ala 195 200 205 Gln Lys Ile Ser Gly Pro Ser Phe Ile Gln Leu Phe Ala Pro Cys Pro 210 215 220 Thr Gly Trp Arg Ala Pro Thr Asp Lys Ser Ile Glu Ile Ala Arg Leu 225 230 235 240 Ala Val Gln Thr Ala Tyr Phe Pro Leu Phe Glu Tyr Glu Asn Gly Lys 245 250 255 Tyr Lys Ile Asn Met Pro Asn Pro Lys Lys Glu Pro Lys Pro Ile Glu 260 265 270 Glu Phe Leu Lys Leu Gln Gly Arg Phe Lys Tyr Met Thr Lys Glu Asp 275 280 285 Ile Glu Thr Leu Gln Lys Trp Val Leu Glu Glu Trp Glu Arg Leu Lys 290 295 300 Lys Leu Ala Glu Val Phe Gly 305 310 <210> SEQ ID NO 47 <211> LENGTH: 185 <212> TYPE: PRT <213> ORGANISM: Pyrococcus furiosus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorC, WP_011012108.1 <400> SEQUENCE: 47 Met Ile Glu Val Arg Phe His Gly Arg Gly Gly Gln Gly Ala Val Thr 1 5 10 15 Ala Ala Asn Ile Leu Ala Glu Ala Ala Phe Leu Glu Gly Lys Tyr Val 20 25 30 Gln Ala Phe Pro Phe Phe Gly Val Glu Arg Arg Gly Ala Pro Val Thr 35 40 45 Ala Phe Thr Arg Ile Asp Asn Lys Pro Ile Arg Ile Lys Thr Gln Ile 50 55 60 Tyr Glu Pro Asp Val Val Val Val Leu Asp Pro Ser Leu Leu Asp Ala 65 70 75 80 Val Asp Val Thr Ala Gly Leu Lys Asp Glu Gly Ile Val Ile Val Asn 85 90 95 Thr Glu Lys Ser Lys Glu Glu Val Leu Glu Lys Leu Lys Lys Lys Pro 100 105 110 Lys Lys Leu Ala Ile Val Asp Ala Thr Thr Ile Ala Leu Glu Ile Leu 115 120 125 Gly Leu Pro Ile Thr Asn Thr Ala Ile Leu Gly Ala Val Ala Lys Ala 130 135 140 Thr Gly Leu Val Lys Ile Glu Ser Ile Glu Glu Ala Ile Lys Asp Thr 145 150 155 160 Phe Ser Gly Glu Leu Gly Glu Lys Asn Ala Arg Ala Ala Arg Glu Ala 165 170 175 Tyr Glu Lys Thr Glu Val Phe Glu Leu 180 185 <210> SEQ ID NO 48 <211> LENGTH: 105 <212> TYPE: PRT <213> ORGANISM: Pyrococcus furiosus <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: VorD, WP_011012107.1 <400> SEQUENCE: 48 Met Asn Thr Leu Phe Gly Lys Thr Lys Glu Glu Ala Lys Pro Ile Val 1 5 10 15 Leu Lys Ser Val Asp Glu Tyr Pro Glu Ala Pro Ile Ser Leu Gly Thr 20 25 30 Thr Leu Val Asn Pro Thr Gly Asp Trp Arg Thr Phe Lys Pro Val Val 35 40 45 Asn Glu Glu Lys Cys Val Lys Cys Tyr Ile Cys Trp Lys Tyr Cys Pro 50 55 60 Glu Pro Ala Ile Tyr Ile Lys Pro Asp Gly Tyr Val Ala Ile Asp Tyr 65 70 75 80 Asp Tyr Cys Lys Gly Cys Gly Ile Cys Ala Asn Glu Cys Pro Thr Lys 85 90 95 Ala Ile Thr Met Ile Lys Glu Glu Lys 100 105 <210> SEQ ID NO 49 <211> LENGTH: 386 <212> TYPE: PRT <213> ORGANISM: Streptomyces avermitilis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AcdH, AAD44196.1 or BAB69160.1 <400> SEQUENCE: 49 Met Asp His Arg Leu Thr Pro Glu Leu Glu Glu Leu Arg Arg Thr Val 1 5 10 15 Glu Glu Phe Ala His Asp Val Val Ala Pro Lys Ile Gly Asp Phe Tyr 20 25 30 Glu Arg His Glu Phe Pro Tyr Glu Ile Val Arg Glu Met Gly Arg Met 35 40 45 Gly Leu Phe Gly Leu Pro Phe Pro Glu Glu Tyr Gly Gly Met Gly Gly 50 55 60 Asp Tyr Leu Ala Leu Gly Ile Ala Leu Glu Glu Leu Ala Arg Val Asp 65 70 75 80 Ser Ser Val Ala Ile Thr Leu Glu Ala Gly Val Ser Leu Gly Ala Met 85 90 95 Pro Ile His Leu Phe Gly Thr Asp Ala Gln Lys Ala Glu Trp Leu Pro 100 105 110 Arg Leu Cys Ser Gly Glu Ile Leu Gly Ala Phe Gly Leu Thr Glu Pro 115 120 125 Asp Gly Gly Ser Asp Ala Gly Ala Thr Arg Thr Thr Ala Arg Leu Asp 130 135 140 Glu Ser Thr Asn Glu Trp Val Ile Asn Gly Thr Lys Cys Phe Ile Thr 145 150 155 160 Asn Ser Gly Thr Asp Ile Thr Gly Leu Val Thr Val Thr Ala Val Thr 165 170 175 Gly Arg Lys Pro Asp Gly Lys Pro Leu Ile Ser Ser Ile Ile Val Pro 180 185 190 Ser Gly Thr Pro Gly Phe Thr Val Ala Ala Pro Tyr Ser Lys Val Gly 195 200 205 Trp Asn Ala Ser Asp Thr Arg Glu Leu Ser Phe Ala Asp Val Arg Val 210 215 220 Pro Ala Ala Asn Leu Leu Gly Glu Gln Gly Arg Gly Tyr Ala Gln Phe 225 230 235 240 Leu Arg Ile Leu Asp Glu Gly Arg Ile Ala Ile Ser Ala Leu Ala Thr 245 250 255 Gly Leu Ala Gln Gly Cys Val Asp Glu Ser Val Lys Tyr Ala Gly Glu 260 265 270 Arg His Ala Phe Gly Arg Asn Ile Gly Ala Tyr Gln Ala Ile Gln Phe 275 280 285 Lys Ile Ala Asp Met Glu Met Lys Ala His Met Ala Arg Val Gly Trp 290 295 300 Arg Asp Ala Ala Ser Arg Leu Val Ala Gly Glu Pro Phe Lys Lys Glu 305 310 315 320 Ala Ala Ile Ala Lys Leu Tyr Ser Ser Thr Val Ala Val Asp Asn Ala 325 330 335 Arg Glu Ala Thr Gln Ile His Gly Gly Tyr Gly Phe Met Asn Glu Tyr 340 345 350 Pro Val Ala Arg Met Trp Arg Asp Ser Lys Ile Leu Glu Ile Gly Glu 355 360 365 Gly Thr Ser Glu Val Gln Arg Met Leu Ile Ala Arg Glu Leu Gly Leu 370 375 380 Val Gly 385 <210> SEQ ID NO 50 <211> LENGTH: 386 <212> TYPE: PRT <213> ORGANISM: Streptomyces coelicolor <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AcdH, AAD44195.1 <400> SEQUENCE: 50 Met Asp His Lys Leu Ser Pro Glu Leu Glu Glu Leu Arg Arg Thr Val 1 5 10 15 Glu Gln Phe Ala His Asp Val Val Ala Pro Lys Ile Gly Asp Phe Tyr 20 25 30 Glu Arg His Glu Phe Pro Tyr Glu Ile Val Arg Glu Met Gly Arg Met 35 40 45 Gly Leu Phe Gly Leu Pro Phe Pro Glu Glu Tyr Gly Gly Met Gly Gly 50 55 60 Asp Tyr Phe Ala Leu Gly Val Ala Leu Glu Glu Leu Ala Arg Val Asp 65 70 75 80 Ser Ser Val Ala Ile Thr Leu Glu Ala Gly Val Ser Leu Gly Ala Met 85 90 95 Pro Leu His Leu Phe Gly Thr Glu Glu Gln Lys Arg Glu Trp Leu Pro 100 105 110 Arg Leu Cys Ser Gly Glu Ile Leu Gly Ala Phe Gly Leu Thr Glu Pro 115 120 125 Asp Gly Gly Ser Asp Ala Gly Ala Thr Arg Thr Thr Ala Arg Leu Asp 130 135 140

Glu Ala Thr Asn Glu Trp Val Ile Asn Gly Thr Lys Cys Phe Ile Thr 145 150 155 160 Asn Ser Gly Thr Asp Ile Thr Gly Leu Val Thr Val Thr Ala Val Thr 165 170 175 Gly Arg Lys Pro Asp Gly Arg Pro Leu Ile Ser Ser Ile Ile Val Pro 180 185 190 Ser Gly Thr Pro Gly Phe Thr Val Ala Ala Pro Tyr Ser Lys Val Gly 195 200 205 Trp Asn Ala Ser Asp Thr Arg Glu Leu Ser Phe Ala Asp Val Arg Val 210 215 220 Pro Ala Ala Asn Leu Leu Gly Glu Leu Gly Arg Gly Tyr Ala Gln Phe 225 230 235 240 Leu Arg Ile Leu Asp Glu Gly Arg Val Ala Ile Ala Ala Leu Gly Thr 245 250 255 Gly Leu Ala Gln Gly Cys Val Asp Glu Ser Val Ala Tyr Ala Lys Glu 260 265 270 Arg His Ala Phe Gly Arg Pro Ile Gly Ala Asn Gln Ala Ile Gln Phe 275 280 285 Lys Ile Ala Asp Met Glu Met Lys Ala His Thr Ala Arg Leu Ala Trp 290 295 300 Arg Asp Ala Ala Ser Arg Leu Val Ala Gly Glu Pro Phe Lys Lys Glu 305 310 315 320 Ala Ala Leu Ala Lys Leu Tyr Ser Ser Thr Val Ala Val Asp Asn Ala 325 330 335 Arg Asp Ala Thr Gln Val His Gly Gly Tyr Gly Phe Met Asn Glu Tyr 340 345 350 Pro Val Ala Arg Met Trp Arg Asp Ala Lys Ile Leu Glu Ile Gly Glu 355 360 365 Gly Thr Ser Glu Val Gln Arg Met Leu Ile Ala Arg Glu Leu Gly Leu 370 375 380 Val Gly 385 <210> SEQ ID NO 51 <211> LENGTH: 261 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Crt, ABR34202.1 <400> SEQUENCE: 51 Met Glu Leu Lys Asn Val Ile Leu Glu Lys Glu Gly His Leu Ala Ile 1 5 10 15 Val Thr Ile Asn Arg Pro Lys Ala Leu Asn Ala Leu Asn Ser Glu Thr 20 25 30 Leu Lys Asp Leu Asp Ala Val Leu Glu Asp Leu Glu Lys Asp Ser Asn 35 40 45 Met Tyr Thr Val Ile Val Thr Gly Ala Gly Glu Lys Ser Phe Val Ala 50 55 60 Gly Ala Asp Ile Ser Glu Met Lys Asp Leu Asn Glu Glu Gln Gly Lys 65 70 75 80 Glu Phe Gly Ile Leu Gly Asn Asn Val Phe Arg Arg Leu Glu Arg Leu 85 90 95 Asp Lys Pro Val Ile Ala Ala Ile Ser Gly Phe Ala Leu Gly Gly Gly 100 105 110 Cys Glu Leu Ala Met Ser Cys Asp Ile Arg Ile Ala Ser Val Lys Ala 115 120 125 Lys Phe Gly Gln Pro Glu Ala Gly Leu Gly Ile Thr Pro Gly Phe Gly 130 135 140 Gly Thr Gln Arg Leu Ala Arg Ile Val Gly Pro Gly Lys Ala Lys Glu 145 150 155 160 Leu Ile Tyr Thr Cys Asp Leu Ile Asn Ala Glu Glu Ala Tyr Arg Ile 165 170 175 Gly Leu Val Asn Lys Val Val Glu Leu Glu Lys Leu Met Glu Glu Ala 180 185 190 Lys Ala Met Ala Asn Lys Ile Ala Ala Asn Ala Pro Lys Ala Val Ala 195 200 205 Tyr Cys Lys Asp Ala Ile Asp Arg Gly Met Gln Val Asp Ile Asp Ala 210 215 220 Ala Ile Leu Ile Glu Ala Glu Asp Phe Gly Lys Cys Phe Ala Thr Glu 225 230 235 240 Asp Gln Thr Glu Gly Met Thr Ala Phe Leu Glu Arg Arg Ala Glu Lys 245 250 255 Asn Phe Gln Asn Lys 260 <210> SEQ ID NO 52 <211> LENGTH: 261 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Crt, NP_349318.1 <400> SEQUENCE: 52 Met Glu Leu Asn Asn Val Ile Leu Glu Lys Glu Gly Lys Val Ala Val 1 5 10 15 Val Thr Ile Asn Arg Pro Lys Ala Leu Asn Ala Leu Asn Ser Asp Thr 20 25 30 Leu Lys Glu Met Asp Tyr Val Ile Gly Glu Ile Glu Asn Asp Ser Glu 35 40 45 Val Leu Ala Val Ile Leu Thr Gly Ala Gly Glu Lys Ser Phe Val Ala 50 55 60 Gly Ala Asp Ile Ser Glu Met Lys Glu Met Asn Thr Ile Glu Gly Arg 65 70 75 80 Lys Phe Gly Ile Leu Gly Asn Lys Val Phe Arg Arg Leu Glu Leu Leu 85 90 95 Glu Lys Pro Val Ile Ala Ala Val Asn Gly Phe Ala Leu Gly Gly Gly 100 105 110 Cys Glu Ile Ala Met Ser Cys Asp Ile Arg Ile Ala Ser Ser Asn Ala 115 120 125 Arg Phe Gly Gln Pro Glu Val Gly Leu Gly Ile Thr Pro Gly Phe Gly 130 135 140 Gly Thr Gln Arg Leu Ser Arg Leu Val Gly Met Gly Met Ala Lys Gln 145 150 155 160 Leu Ile Phe Thr Ala Gln Asn Ile Lys Ala Asp Glu Ala Leu Arg Ile 165 170 175 Gly Leu Val Asn Lys Val Val Glu Pro Ser Glu Leu Met Asn Thr Ala 180 185 190 Lys Glu Ile Ala Asn Lys Ile Val Ser Asn Ala Pro Val Ala Val Lys 195 200 205 Leu Ser Lys Gln Ala Ile Asn Arg Gly Met Gln Cys Asp Ile Asp Thr 210 215 220 Ala Leu Ala Phe Glu Ser Glu Ala Phe Gly Glu Cys Phe Ser Thr Glu 225 230 235 240 Asp Gln Lys Asp Ala Met Thr Ala Phe Ile Glu Lys Arg Lys Ile Glu 245 250 255 Gly Phe Lys Asn Arg 260 <210> SEQ ID NO 53 <211> LENGTH: 397 <212> TYPE: PRT <213> ORGANISM: Treponema denticola <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Ccr, NP_971211.1 <400> SEQUENCE: 53 Met Ile Val Lys Pro Met Val Arg Asn Asn Ile Cys Leu Asn Ala His 1 5 10 15 Pro Gln Gly Cys Lys Lys Gly Val Glu Asp Gln Ile Glu Tyr Thr Lys 20 25 30 Lys Arg Ile Thr Ala Glu Val Lys Ala Gly Ala Lys Ala Pro Lys Asn 35 40 45 Val Leu Val Leu Gly Cys Ser Asn Gly Tyr Gly Leu Ala Ser Arg Ile 50 55 60 Thr Ala Ala Phe Gly Tyr Gly Ala Ala Thr Ile Gly Val Ser Phe Glu 65 70 75 80 Lys Ala Gly Ser Glu Thr Lys Tyr Gly Thr Pro Gly Trp Tyr Asn Asn 85 90 95 Leu Ala Phe Asp Glu Ala Ala Lys Arg Glu Gly Leu Tyr Ser Val Thr 100 105 110 Ile Asp Gly Asp Ala Phe Ser Asp Glu Ile Lys Ala Gln Val Ile Glu 115 120 125 Glu Ala Lys Lys Lys Gly Ile Lys Phe Asp Leu Ile Val Tyr Ser Leu 130 135 140 Ala Ser Pro Val Arg Thr Asp Pro Asp Thr Gly Ile Met His Lys Ser 145 150 155 160 Val Leu Lys Pro Phe Gly Lys Thr Phe Thr Gly Lys Thr Val Asp Pro 165 170 175 Phe Thr Gly Glu Leu Lys Glu Ile Ser Ala Glu Pro Ala Asn Asp Glu 180 185 190 Glu Ala Ala Ala Thr Val Lys Val Met Gly Gly Glu Asp Trp Glu Arg 195 200 205 Trp Ile Lys Gln Leu Ser Lys Glu Gly Leu Leu Glu Glu Gly Cys Ile 210 215 220 Thr Leu Ala Tyr Ser Tyr Ile Gly Pro Glu Ala Thr Gln Ala Leu Tyr 225 230 235 240 Arg Lys Gly Thr Ile Gly Lys Ala Lys Glu His Leu Glu Ala Thr Ala 245 250 255 His Arg Leu Asn Lys Glu Asn Pro Ser Ile Arg Ala Phe Val Ser Val 260 265 270 Asn Lys Gly Leu Val Thr Arg Ala Ser Ala Val Ile Pro Val Ile Pro 275 280 285 Leu Tyr Leu Ala Ser Leu Phe Lys Val Met Lys Glu Lys Gly Asn His 290 295 300 Glu Gly Cys Ile Glu Gln Ile Thr Arg Leu Tyr Ala Glu Arg Leu Tyr 305 310 315 320 Arg Lys Asp Gly Thr Ile Pro Val Asp Glu Glu Asn Arg Ile Arg Ile 325 330 335 Asp Asp Trp Glu Leu Glu Glu Asp Val Gln Lys Ala Val Ser Ala Leu 340 345 350 Met Glu Lys Val Thr Gly Glu Asn Ala Glu Ser Leu Thr Asp Leu Ala 355 360 365

Gly Tyr Arg His Asp Phe Leu Ala Ser Asn Gly Phe Asp Val Glu Gly 370 375 380 Ile Asn Tyr Glu Ala Glu Val Glu Arg Phe Asp Arg Ile 385 390 395 <210> SEQ ID NO 54 <211> LENGTH: 539 <212> TYPE: PRT <213> ORGANISM: Euglena gracilis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Ter, AAW66853.1 <400> SEQUENCE: 54 Met Ser Cys Pro Ala Ser Pro Ser Ala Ala Val Val Ser Ala Gly Ala 1 5 10 15 Leu Cys Leu Cys Val Ala Thr Val Leu Leu Ala Thr Gly Ser Asn Pro 20 25 30 Thr Ala Leu Ser Thr Ala Ser Thr Arg Ser Pro Thr Ser Leu Val Arg 35 40 45 Gly Val Asp Arg Gly Leu Met Arg Pro Thr Thr Ala Ala Ala Leu Thr 50 55 60 Thr Met Arg Glu Val Pro Gln Met Ala Glu Gly Phe Ser Gly Glu Ala 65 70 75 80 Thr Ser Ala Trp Ala Ala Ala Gly Pro Gln Trp Ala Ala Pro Leu Val 85 90 95 Ala Ala Ala Ser Ser Ala Leu Ala Leu Trp Trp Trp Ala Ala Arg Arg 100 105 110 Ser Val Arg Arg Pro Leu Ala Ala Leu Ala Glu Leu Pro Thr Ala Val 115 120 125 Thr His Leu Ala Pro Pro Met Ala Met Phe Thr Thr Thr Ala Lys Val 130 135 140 Ile Gln Pro Lys Ile Arg Gly Phe Ile Cys Thr Thr Thr His Pro Ile 145 150 155 160 Gly Cys Glu Lys Arg Val Gln Glu Glu Ile Ala Tyr Ala Arg Ala His 165 170 175 Pro Pro Thr Ser Pro Gly Pro Lys Arg Val Leu Val Ile Gly Cys Ser 180 185 190 Thr Gly Tyr Gly Leu Ser Thr Arg Ile Thr Ala Ala Phe Gly Tyr Gln 195 200 205 Ala Ala Thr Leu Gly Val Phe Leu Ala Gly Pro Pro Thr Lys Gly Arg 210 215 220 Pro Ala Ala Ala Gly Trp Tyr Asn Thr Val Ala Phe Glu Lys Ala Ala 225 230 235 240 Leu Glu Ala Gly Leu Tyr Ala Arg Ser Leu Asn Gly Asp Ala Phe Asp 245 250 255 Ser Thr Thr Lys Ala Arg Thr Val Glu Ala Ile Lys Arg Asp Leu Gly 260 265 270 Thr Val Asp Leu Val Val Tyr Ser Ile Ala Ala Pro Lys Arg Thr Asp 275 280 285 Pro Ala Thr Gly Val Leu His Lys Ala Cys Leu Lys Pro Ile Gly Ala 290 295 300 Thr Tyr Thr Asn Arg Thr Val Asn Thr Asp Lys Ala Glu Val Thr Asp 305 310 315 320 Val Ser Ile Glu Pro Ala Ser Pro Glu Glu Ile Ala Asp Thr Val Lys 325 330 335 Val Met Gly Gly Glu Asp Trp Glu Leu Trp Ile Gln Ala Leu Ser Glu 340 345 350 Ala Gly Val Leu Ala Glu Gly Ala Lys Thr Val Ala Tyr Ser Tyr Ile 355 360 365 Gly Pro Glu Met Thr Trp Pro Val Tyr Trp Ser Gly Thr Ile Gly Glu 370 375 380 Ala Lys Lys Asp Val Glu Lys Ala Ala Lys Arg Ile Thr Gln Gln Tyr 385 390 395 400 Gly Cys Pro Ala Tyr Pro Val Val Ala Lys Ala Leu Val Thr Gln Ala 405 410 415 Ser Ser Ala Ile Pro Val Val Pro Leu Tyr Ile Cys Leu Leu Tyr Arg 420 425 430 Val Met Lys Glu Lys Gly Thr His Glu Gly Cys Ile Glu Gln Met Val 435 440 445 Arg Leu Leu Thr Thr Lys Leu Tyr Pro Glu Asn Gly Ala Pro Ile Val 450 455 460 Asp Glu Ala Gly Arg Val Arg Val Asp Asp Trp Glu Met Ala Glu Asp 465 470 475 480 Val Gln Gln Ala Val Lys Asp Leu Trp Ser Gln Val Ser Thr Ala Asn 485 490 495 Leu Lys Asp Ile Ser Asp Phe Ala Gly Tyr Gln Thr Glu Phe Leu Arg 500 505 510 Leu Phe Gly Phe Gly Ile Asp Gly Val Asp Tyr Asp Gln Pro Val Asp 515 520 525 Val Glu Ala Asp Leu Pro Ser Ala Ala Gln Gln 530 535 <210> SEQ ID NO 55 <211> LENGTH: 282 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Hbd, WP_011967675.1 <400> SEQUENCE: 55 Met Lys Lys Ile Phe Val Leu Gly Ala Gly Thr Met Gly Ala Gly Ile 1 5 10 15 Val Gln Ala Phe Ala Gln Lys Gly Cys Glu Val Ile Val Arg Asp Ile 20 25 30 Lys Glu Glu Phe Val Asp Arg Gly Ile Ala Gly Ile Thr Lys Gly Leu 35 40 45 Glu Lys Gln Val Ala Lys Gly Lys Met Ser Glu Glu Asp Lys Glu Ala 50 55 60 Ile Leu Ser Arg Ile Ser Gly Thr Thr Asp Met Lys Leu Ala Ala Asp 65 70 75 80 Cys Asp Leu Val Val Glu Ala Ala Ile Glu Asn Met Lys Ile Lys Lys 85 90 95 Glu Ile Phe Ala Glu Leu Asp Gly Ile Cys Lys Pro Glu Ala Ile Leu 100 105 110 Ala Ser Asn Thr Ser Ser Leu Ser Ile Thr Glu Val Ala Ser Ala Thr 115 120 125 Lys Arg Pro Asp Lys Val Ile Gly Met His Phe Phe Asn Pro Ala Pro 130 135 140 Val Met Lys Leu Val Glu Ile Ile Lys Gly Ile Ala Thr Ser Gln Glu 145 150 155 160 Thr Phe Asp Ala Val Lys Glu Leu Ser Val Ala Ile Gly Lys Glu Pro 165 170 175 Val Glu Val Ala Glu Ala Pro Gly Phe Val Val Asn Arg Ile Leu Ile 180 185 190 Pro Met Ile Asn Glu Ala Ser Phe Ile Leu Gln Glu Gly Ile Ala Ser 195 200 205 Val Glu Asp Ile Asp Thr Ala Met Lys Tyr Gly Ala Asn His Pro Met 210 215 220 Gly Pro Leu Ala Leu Gly Asp Leu Ile Gly Leu Asp Val Cys Leu Ala 225 230 235 240 Ile Met Asp Val Leu Phe Thr Glu Thr Gly Asp Asn Lys Tyr Arg Ala 245 250 255 Ser Ser Ile Leu Arg Lys Tyr Val Arg Ala Gly Trp Leu Gly Arg Lys 260 265 270 Ser Gly Lys Gly Phe Tyr Asp Tyr Ser Lys 275 280 <210> SEQ ID NO 56 <211> LENGTH: 282 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Hbd, NP_349314.1 <400> SEQUENCE: 56 Met Lys Lys Val Cys Val Ile Gly Ala Gly Thr Met Gly Ser Gly Ile 1 5 10 15 Ala Gln Ala Phe Ala Ala Lys Gly Phe Glu Val Val Leu Arg Asp Ile 20 25 30 Lys Asp Glu Phe Val Asp Arg Gly Leu Asp Phe Ile Asn Lys Asn Leu 35 40 45 Ser Lys Leu Val Lys Lys Gly Lys Ile Glu Glu Ala Thr Lys Val Glu 50 55 60 Ile Leu Thr Arg Ile Ser Gly Thr Val Asp Leu Asn Met Ala Ala Asp 65 70 75 80 Cys Asp Leu Val Ile Glu Ala Ala Val Glu Arg Met Asp Ile Lys Lys 85 90 95 Gln Ile Phe Ala Asp Leu Asp Asn Ile Cys Lys Pro Glu Thr Ile Leu 100 105 110 Ala Ser Asn Thr Ser Ser Leu Ser Ile Thr Glu Val Ala Ser Ala Thr 115 120 125 Lys Arg Pro Asp Lys Val Ile Gly Met His Phe Phe Asn Pro Ala Pro 130 135 140 Val Met Lys Leu Val Glu Val Ile Arg Gly Ile Ala Thr Ser Gln Glu 145 150 155 160 Thr Phe Asp Ala Val Lys Glu Thr Ser Ile Ala Ile Gly Lys Asp Pro 165 170 175 Val Glu Val Ala Glu Ala Pro Gly Phe Val Val Asn Arg Ile Leu Ile 180 185 190 Pro Met Ile Asn Glu Ala Val Gly Ile Leu Ala Glu Gly Ile Ala Ser 195 200 205 Val Glu Asp Ile Asp Lys Ala Met Lys Leu Gly Ala Asn His Pro Met 210 215 220 Gly Pro Leu Glu Leu Gly Asp Phe Ile Gly Leu Asp Ile Cys Leu Ala 225 230 235 240 Ile Met Asp Val Leu Tyr Ser Glu Thr Gly Asp Ser Lys Tyr Arg Pro 245 250 255 His Thr Leu Leu Lys Lys Tyr Val Arg Ala Gly Trp Leu Gly Arg Lys 260 265 270 Ser Gly Lys Gly Phe Tyr Asp Tyr Ser Lys 275 280

<210> SEQ ID NO 57 <211> LENGTH: 282 <212> TYPE: PRT <213> ORGANISM: Clostridium kluyveri <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Hbd1, WP_011989027.1 <400> SEQUENCE: 57 Met Ser Ile Lys Ser Val Ala Val Leu Gly Ser Gly Thr Met Ser Arg 1 5 10 15 Gly Ile Val Gln Ala Phe Ala Glu Ala Gly Ile Asp Val Ile Ile Arg 20 25 30 Gly Arg Thr Glu Gly Ser Ile Gly Lys Gly Leu Ala Ala Val Lys Lys 35 40 45 Ala Tyr Asp Lys Lys Val Ser Lys Gly Lys Ile Ser Gln Glu Asp Ala 50 55 60 Asp Lys Ile Val Gly Arg Val Ser Thr Thr Thr Glu Leu Glu Lys Leu 65 70 75 80 Ala Asp Cys Asp Leu Ile Ile Glu Ala Ala Ser Glu Asp Met Asn Ile 85 90 95 Lys Lys Asp Tyr Phe Gly Lys Leu Glu Glu Ile Cys Lys Pro Glu Thr 100 105 110 Ile Phe Ala Thr Asn Thr Ser Ser Leu Ser Ile Thr Glu Val Ala Thr 115 120 125 Ala Thr Lys Arg Pro Asp Lys Phe Ile Gly Met His Phe Phe Asn Pro 130 135 140 Ala Asn Val Met Lys Leu Val Glu Ile Ile Arg Gly Met Asn Thr Ser 145 150 155 160 Gln Glu Thr Phe Asp Ile Ile Lys Glu Ala Ser Ile Lys Ile Gly Lys 165 170 175 Thr Pro Val Glu Val Ala Glu Ala Pro Gly Phe Val Val Asn Lys Ile 180 185 190 Leu Val Pro Met Ile Asn Glu Ala Val Gly Ile Leu Ala Glu Gly Ile 195 200 205 Ala Ser Ala Glu Asp Ile Asp Thr Ala Met Lys Leu Gly Ala Asn His 210 215 220 Pro Met Gly Pro Leu Ala Leu Gly Asp Leu Ile Gly Leu Asp Val Val 225 230 235 240 Leu Ala Val Met Asp Val Leu Tyr Ser Glu Thr Gly Asp Ser Lys Tyr 245 250 255 Arg Ala His Thr Leu Leu Arg Lys Tyr Val Arg Ala Gly Trp Leu Gly 260 265 270 Arg Lys Ser Gly Lys Gly Phe Phe Ala Tyr 275 280 <210> SEQ ID NO 58 <211> LENGTH: 246 <212> TYPE: PRT <213> ORGANISM: Cupriavidus necator <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: PhaB, WP_010810131.1 <400> SEQUENCE: 58 Met Thr Gln Arg Ile Ala Tyr Val Thr Gly Gly Met Gly Gly Ile Gly 1 5 10 15 Thr Ala Ile Cys Gln Arg Leu Ala Lys Asp Gly Phe Arg Val Val Ala 20 25 30 Gly Cys Gly Pro Asn Ser Pro Arg Arg Glu Lys Trp Leu Glu Gln Gln 35 40 45 Lys Ala Leu Gly Phe Asp Phe Ile Ala Ser Glu Gly Asn Val Ala Asp 50 55 60 Trp Asp Ser Thr Lys Thr Ala Phe Asp Lys Val Lys Ser Glu Val Gly 65 70 75 80 Glu Val Asp Val Leu Ile Asn Asn Ala Gly Ile Thr Arg Asp Val Val 85 90 95 Phe Arg Lys Met Thr Arg Ala Asp Trp Asp Ala Val Ile Asp Thr Asn 100 105 110 Leu Thr Ser Leu Phe Asn Val Thr Lys Gln Val Ile Asp Gly Met Ala 115 120 125 Asp Arg Gly Trp Gly Arg Ile Val Asn Ile Ser Ser Val Asn Gly Gln 130 135 140 Lys Gly Gln Phe Gly Gln Thr Asn Tyr Ser Thr Ala Lys Ala Gly Leu 145 150 155 160 His Gly Phe Thr Met Ala Leu Ala Gln Glu Val Ala Thr Lys Gly Val 165 170 175 Thr Val Asn Thr Val Ser Pro Gly Tyr Ile Ala Thr Asp Met Val Lys 180 185 190 Ala Ile Arg Gln Asp Val Leu Asp Lys Ile Val Ala Thr Ile Pro Val 195 200 205 Lys Arg Leu Gly Leu Pro Glu Glu Ile Ala Ser Ile Cys Ala Trp Leu 210 215 220 Ser Ser Glu Glu Ser Gly Phe Ser Thr Gly Ala Asp Phe Ser Leu Asn 225 230 235 240 Gly Gly Leu His Met Gly 245 <210> SEQ ID NO 59 <211> LENGTH: 134 <212> TYPE: PRT <213> ORGANISM: Aeromonas caviae <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: PhaJ, O32472 <400> SEQUENCE: 59 Met Ser Ala Gln Ser Leu Glu Val Gly Gln Lys Ala Arg Leu Ser Lys 1 5 10 15 Arg Phe Gly Ala Ala Glu Val Ala Ala Phe Ala Ala Leu Ser Glu Asp 20 25 30 Phe Asn Pro Leu His Leu Asp Pro Ala Phe Ala Ala Thr Thr Ala Phe 35 40 45 Glu Arg Pro Ile Val His Gly Met Leu Leu Ala Ser Leu Phe Ser Gly 50 55 60 Leu Leu Gly Gln Gln Leu Pro Gly Lys Gly Ser Ile Tyr Leu Gly Gln 65 70 75 80 Ser Leu Ser Phe Lys Leu Pro Val Phe Val Gly Asp Glu Val Thr Ala 85 90 95 Glu Val Glu Val Thr Ala Leu Arg Glu Asp Lys Pro Ile Ala Thr Leu 100 105 110 Thr Thr Arg Ile Phe Thr Gln Gly Gly Ala Leu Ala Val Thr Gly Glu 115 120 125 Ala Val Val Lys Leu Pro 130 <210> SEQ ID NO 60 <211> LENGTH: 260 <212> TYPE: PRT <213> ORGANISM: Ralstonia pickettii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh1, BAE72684.1 <400> SEQUENCE: 60 Met Gln Leu Lys Gly Lys Ser Ala Ile Val Thr Gly Ala Ala Ser Gly 1 5 10 15 Ile Gly Lys Ala Ile Ala Glu Leu Leu Ala Lys Glu Gly Ala Ala Val 20 25 30 Ala Ile Ala Asp Leu Asn Leu Glu Ala Ala Arg Ala Ala Ala Ala Gly 35 40 45 Ile Glu Ala Ala Gly Gly Lys Ala Ile Ala Val Ala Met Asp Val Thr 50 55 60 Ser Glu Ala Ser Val Asn Gln Ala Thr Asp Glu Val Ala Gln Ala Phe 65 70 75 80 Gly Asn Ile Asp Ile Leu Val Ser Asn Ala Gly Ile Gln Ile Val Asn 85 90 95 Pro Ile Gln Asn Tyr Ala Phe Ser Asp Trp Lys Lys Met Gln Ala Ile 100 105 110 His Val Asp Gly Ala Phe Leu Thr Thr Lys Ala Ala Leu Lys Tyr Met 115 120 125 Tyr Arg Asp Lys Arg Gly Gly Thr Val Ile Tyr Met Gly Ser Val His 130 135 140 Ser His Glu Ala Ser Pro Leu Lys Ser Ala Tyr Val Ala Ala Lys His 145 150 155 160 Ala Leu Leu Gly Leu Ala Arg Val Leu Ala Lys Glu Gly Ala Glu Phe 165 170 175 Asn Val Arg Ser His Val Ile Cys Pro Gly Phe Val Arg Thr Pro Leu 180 185 190 Val Asp Lys Gln Ile Pro Glu Gln Ala Lys Glu Leu Gly Ile Ser Glu 195 200 205 Glu Glu Val Val Arg Arg Val Met Leu Gly Gly Thr Val Asp Gly Val 210 215 220 Phe Thr Thr Val Asp Asp Val Ala Arg Thr Ala Leu Phe Leu Cys Ala 225 230 235 240 Phe Pro Ser Ala Ala Leu Thr Gly Gln Ser Phe Ile Val Ser His Gly 245 250 255 Trp Tyr Met Gln 260 <210> SEQ ID NO 61 <211> LENGTH: 256 <212> TYPE: PRT <213> ORGANISM: Ralstonia pickettii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh2, BAE72685.1 <400> SEQUENCE: 61 Met Leu Gln Gly Lys Thr Ala Leu Val Thr Gly Ser Thr Cys Gly Ile 1 5 10 15 Gly Leu Gly Ile Ala Gln Ala Leu Ala Ala Gln Gly Ala Asn Ile Ile 20 25 30 Val Asn Gly Phe Arg Arg Ala Asp Gly Ala Arg Gln Gln Ile Ala Ala 35 40 45 Ala Gly Gln Val Ile Arg Leu Gly Tyr His Gly Ala Asp Met Ser Lys 50 55 60 Ala Ser Glu Ile Glu Asp Met Met Arg Tyr Ala Glu Ala Glu Phe Ala 65 70 75 80 Ala Asp Ile Leu Val Asn Asn Ala Gly Ile Gln His Val Ala Ser Ile 85 90 95

Glu Asp Phe Pro Pro Glu Arg Trp Asp Ala Ile Ile Ala Ile Asn Leu 100 105 110 Thr Ser Ala Phe His Thr Thr Arg Leu Ala Leu Pro Gly Met Arg Gln 115 120 125 Lys Asn Trp Gly Arg Val Ile Asn Ile Ala Ser Thr His Gly Leu Val 130 135 140 Ala Ser Ala Gln Lys Ser Ala Tyr Val Ala Ala Lys His Gly Ile Val 145 150 155 160 Gly Leu Thr Lys Val Thr Ala Leu Glu Thr Ala Gln Asn Arg Val Thr 165 170 175 Ala Asn Ala Ile Cys Pro Gly Trp Val Leu Thr Pro Leu Val Gln Lys 180 185 190 Gln Val Gln Ala Arg Pro Ala His Gly Ile Ser Val Glu Gln Ala Lys 195 200 205 Arg Glu Leu Val Ile Glu Lys Gln Pro Ser Gly Gln Phe Val Thr Pro 210 215 220 Asp Glu Leu Gly Ala Leu Ala Val Phe Leu Ala Ser Glu Ala Gly Arg 225 230 235 240 Gln Val Arg Gly Ala Ile Trp Asn Met Ala Gly Gly Trp Phe Ala Gln 245 250 255 <210> SEQ ID NO 62 <211> LENGTH: 254 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh, AGY75962 <400> SEQUENCE: 62 Met Arg Leu Glu Asn Lys Val Ala Ile Val Thr Gly Ser Ala Met Gly 1 5 10 15 Ile Gly Lys Ala Ile Val Arg Asp Phe Val Asn Glu Gly Ala Lys Val 20 25 30 Ile Ile Ser Asp Ile Leu Glu Ala Glu Gly Gln Ala Leu Glu Glu Glu 35 40 45 Leu Gln Lys Lys Gly His Ser Val Tyr Phe Phe Lys Thr Asp Val Ser 50 55 60 Ser Glu Lys Asn Ile Lys Glu Leu Val Lys Phe Thr Leu Glu Lys Phe 65 70 75 80 Gly Thr Ile Asn Ile Leu Cys Asn Asn Ala Ala Val Asn Ile Pro Gly 85 90 95 Ser Val Leu Glu Leu Thr Glu Asp Ile Trp Asn Lys Thr Met Asp Val 100 105 110 Asn Val Lys Ser His Phe Leu Val Ser Lys His Val Ile Pro Val Met 115 120 125 Gln Lys Ala Gly Gly Gly Ser Ile Val Asn Thr Ala Ser Ala Asn Ser 130 135 140 Phe Val Ala Glu Pro Arg Leu Ser Ala Tyr Val Ala Ser Lys Gly Ala 145 150 155 160 Ile Leu Met Leu Thr Arg Ala Met Ala Leu Asp Phe Ala Lys Asp Asn 165 170 175 Ile Arg Val Asn Cys Ile Cys Pro Gly Trp Val Asp Thr Thr Phe Asn 180 185 190 Asp Ala His Ala Glu Leu Phe Gly Gly Arg Glu Ala Val Leu Lys Asp 195 200 205 Leu Ala Ser Val Gln Pro Ile Gly Arg Pro Ile Ala Pro Met Glu Ile 210 215 220 Ala Lys Ile Ala Thr Phe Leu Ala Ser Asp Asp Ser Ser Cys Met Thr 225 230 235 240 Gly Ser Pro Val Ile Ala Asp Gly Gly Ile Thr Ala Gly Val 245 250 <210> SEQ ID NO 63 <211> LENGTH: 607 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AOR, WP_013238665.1 <400> SEQUENCE: 63 Met Tyr Gly Tyr Asp Gly Lys Val Leu Arg Ile Asn Leu Lys Glu Arg 1 5 10 15 Thr Cys Lys Ser Glu Asn Leu Asp Leu Asp Lys Ala Lys Lys Phe Ile 20 25 30 Gly Cys Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Ile Asp Pro 35 40 45 Lys Ile Asp Ala Leu Ser Pro Glu Asn Lys Phe Ile Ile Val Thr Gly 50 55 60 Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 65 70 75 80 Thr Lys Ala Pro Leu Thr Gly Thr Ile Gly Ile Ser Asn Ser Gly Gly 85 90 95 Lys Trp Gly Val Asp Leu Lys Lys Ala Gly Trp Asp Met Ile Ile Val 100 105 110 Glu Asp Lys Ala Asp Ser Pro Val Tyr Ile Glu Ile Val Asp Asp Lys 115 120 125 Val Glu Ile Lys Asp Ala Ser Gln Leu Trp Gly Lys Val Thr Ser Glu 130 135 140 Thr Thr Lys Glu Leu Glu Lys Ile Thr Glu Asn Lys Ser Lys Val Leu 145 150 155 160 Cys Ile Gly Pro Ala Gly Glu Arg Leu Ser Leu Met Ala Ala Val Met 165 170 175 Asn Asp Val Asp Arg Thr Ala Ala Arg Gly Gly Val Gly Ala Val Met 180 185 190 Gly Ser Lys Asn Leu Lys Ala Ile Thr Val Lys Gly Thr Gly Lys Ile 195 200 205 Ala Leu Ala Asp Lys Glu Lys Val Lys Lys Val Ser Val Glu Lys Ile 210 215 220 Thr Thr Leu Lys Asn Asp Pro Val Ala Gly Gln Gly Met Pro Thr Tyr 225 230 235 240 Gly Thr Ala Ile Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 245 250 255 Val Lys Asn Phe Gln Glu Ser Tyr Thr Asn Gln Ala Asp Lys Ile Ser 260 265 270 Gly Glu Thr Leu Thr Ala Asn Gln Leu Val Arg Lys Asn Pro Cys Tyr 275 280 285 Ser Cys Pro Ile Gly Cys Gly Arg Trp Val Arg Leu Lys Asp Gly Thr 290 295 300 Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Cys Phe Gly Ser Asp 305 310 315 320 Cys Gly Ser Tyr Asp Leu Asp Ala Ile Asn Glu Ala Asn Met Leu Cys 325 330 335 Asn Glu Tyr Gly Ile Asp Thr Ile Thr Cys Gly Ala Thr Ile Ala Ala 340 345 350 Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 355 360 365 Gly Asp Asn Leu Ser Leu Lys Trp Gly Asp Thr Glu Ser Met Ile Gly 370 375 380 Trp Ile Lys Arg Met Val Tyr Ser Glu Gly Phe Gly Ala Lys Met Thr 385 390 395 400 Asn Gly Ser Tyr Arg Leu Cys Glu Gly Tyr Gly Ala Pro Glu Tyr Ser 405 410 415 Met Thr Val Lys Lys Gln Glu Ile Pro Ala Tyr Asp Pro Arg Gly Ile 420 425 430 Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 435 440 445 Ile Lys Gly Tyr Met Ile Asn Pro Glu Ile Leu Gly Tyr Pro Glu Lys 450 455 460 Leu Asp Arg Phe Ala Leu Asp Gly Lys Ala Ala Tyr Ala Lys Leu Phe 465 470 475 480 His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 485 490 495 Thr Phe Gly Leu Gly Ile Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 500 505 510 Val Gly Glu Ser Thr Tyr Asp Ala Asp Ser Leu Leu Glu Ala Gly Asp 515 520 525 Arg Ile Trp Thr Leu Glu Lys Leu Phe Asn Leu Ala Ala Gly Ile Asp 530 535 540 Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Pro Ile Pro 545 550 555 560 Asp Gly Pro Ser Lys Gly Glu Val His Arg Leu Asp Val Leu Leu Pro 565 570 575 Glu Tyr Tyr Ser Val Arg Gly Trp Ser Lys Glu Gly Ile Pro Thr Glu 580 585 590 Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Ile Gly Lys Phe 595 600 605 <210> SEQ ID NO 64 <211> LENGTH: 607 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AOR, WP_013238675.1 <400> SEQUENCE: 64 Met Tyr Gly Tyr Lys Gly Lys Val Leu Arg Ile Asn Leu Ser Ser Lys 1 5 10 15 Thr Tyr Ile Val Glu Glu Leu Lys Ile Asp Lys Ala Lys Lys Phe Ile 20 25 30 Gly Ala Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Val Asp Pro 35 40 45 Lys Val Asp Pro Leu Ser Pro Asp Asn Lys Phe Ile Ile Ala Ala Gly 50 55 60 Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 65 70 75 80 Thr Lys Ser Pro Leu Thr Gly Thr Ile Ala Ile Ala Asn Ser Gly Gly 85 90 95 Lys Trp Gly Ala Glu Phe Lys Ala Ala Gly Tyr Asp Met Ile Ile Val 100 105 110 Glu Gly Lys Ser Asp Lys Glu Val Tyr Val Asn Ile Val Asp Asp Lys 115 120 125 Val Glu Phe Arg Asp Ala Ser His Val Trp Gly Lys Leu Thr Glu Glu

130 135 140 Thr Thr Lys Met Leu Gln Gln Glu Thr Asp Ser Arg Ala Lys Val Leu 145 150 155 160 Cys Ile Gly Pro Ala Gly Glu Lys Leu Ser Leu Met Ala Ala Val Met 165 170 175 Asn Asp Val Asp Arg Thr Ala Gly Arg Gly Gly Val Gly Ala Val Met 180 185 190 Gly Ser Lys Asn Leu Lys Ala Ile Val Val Lys Gly Ser Gly Lys Val 195 200 205 Lys Leu Phe Asp Glu Gln Lys Val Lys Glu Val Ala Leu Glu Lys Thr 210 215 220 Asn Ile Leu Arg Lys Asp Pro Val Ala Gly Gly Gly Leu Pro Thr Tyr 225 230 235 240 Gly Thr Ala Val Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 245 250 255 Val Lys Asn Phe Gln Lys Ser Tyr Thr Asp Gln Ala Asp Lys Ile Ser 260 265 270 Gly Glu Thr Leu Thr Lys Asp Cys Leu Val Arg Lys Asn Pro Cys Tyr 275 280 285 Arg Cys Pro Ile Ala Cys Gly Arg Trp Val Lys Leu Asp Asp Gly Thr 290 295 300 Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Ser Phe Gly Ser Asp 305 310 315 320 Cys Asp Val Tyr Asp Ile Asn Ala Val Asn Thr Ala Asn Met Leu Cys 325 330 335 Asn Glu Tyr Gly Leu Asp Thr Ile Thr Ala Gly Cys Thr Ile Ala Ala 340 345 350 Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 355 360 365 Ala Asp Gly Leu Ser Leu Asn Trp Gly Asp Ala Lys Ser Met Val Glu 370 375 380 Trp Val Lys Lys Met Gly Leu Arg Glu Gly Phe Gly Asp Lys Met Ala 385 390 395 400 Asp Gly Ser Tyr Arg Leu Cys Asp Ser Tyr Gly Val Pro Glu Tyr Ser 405 410 415 Met Thr Val Lys Lys Gln Glu Leu Pro Ala Tyr Asp Pro Arg Gly Ile 420 425 430 Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 435 440 445 Ile Lys Gly Tyr Met Val Ser Pro Glu Ile Leu Gly Tyr Pro Glu Lys 450 455 460 Leu Asp Arg Leu Ala Val Glu Gly Lys Ala Gly Tyr Ala Arg Val Phe 465 470 475 480 His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 485 490 495 Thr Phe Gly Leu Gly Ala Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 500 505 510 Val Gly Gly Glu Leu His Asp Val Asn Ser Leu Met Leu Ala Gly Asp 515 520 525 Arg Ile Trp Thr Leu Glu Lys Ile Phe Asn Leu Lys Ala Gly Ile Asp 530 535 540 Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Gln Ile Pro 545 550 555 560 Glu Gly Pro Ser Lys Gly Glu Val His Lys Leu Asp Val Leu Leu Pro 565 570 575 Glu Tyr Tyr Ser Val Arg Gly Trp Asp Lys Asn Gly Ile Pro Thr Glu 580 585 590 Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Val Gly Lys Leu 595 600 605 <210> SEQ ID NO 65 <211> LENGTH: 607 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AOR, ADK15073.1 <400> SEQUENCE: 65 Met Tyr Gly Tyr Asp Gly Lys Val Leu Arg Ile Asn Leu Lys Glu Arg 1 5 10 15 Thr Cys Lys Ser Glu Asn Leu Asp Leu Asp Lys Ala Lys Lys Phe Ile 20 25 30 Gly Cys Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Ile Asp Pro 35 40 45 Lys Ile Asp Ala Leu Ser Pro Glu Asn Lys Phe Ile Ile Val Thr Gly 50 55 60 Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 65 70 75 80 Thr Lys Ala Pro Leu Thr Gly Thr Ile Gly Ile Ser Asn Ser Gly Gly 85 90 95 Lys Trp Gly Val Asp Leu Lys Lys Ala Gly Trp Asp Met Ile Ile Val 100 105 110 Glu Asp Lys Ala Asp Ser Pro Val Tyr Ile Glu Ile Val Asp Asp Lys 115 120 125 Val Glu Ile Lys Asp Ala Ser Gln Leu Trp Gly Lys Val Thr Ser Glu 130 135 140 Thr Thr Lys Glu Leu Glu Lys Ile Thr Glu Asn Lys Ser Lys Val Leu 145 150 155 160 Cys Ile Gly Pro Ala Gly Glu Arg Leu Ser Leu Met Ala Ala Val Met 165 170 175 Asn Asp Val Asp Arg Thr Ala Ala Arg Gly Gly Val Gly Ala Val Met 180 185 190 Gly Ser Lys Asn Leu Lys Ala Ile Thr Val Lys Gly Thr Gly Lys Ile 195 200 205 Ala Leu Ala Asp Lys Glu Lys Val Lys Lys Val Ser Val Glu Lys Ile 210 215 220 Thr Thr Leu Lys Asn Asp Pro Val Ala Gly Gln Gly Met Pro Thr Tyr 225 230 235 240 Gly Thr Ala Ile Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 245 250 255 Val Lys Asn Phe Gln Glu Ser Tyr Thr Asn Gln Ala Asp Lys Ile Ser 260 265 270 Gly Glu Thr Leu Thr Ala Asn Gln Leu Val Arg Lys Asn Pro Cys Tyr 275 280 285 Ser Cys Pro Ile Gly Cys Gly Arg Trp Val Arg Leu Lys Asp Gly Thr 290 295 300 Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Cys Phe Gly Ser Asp 305 310 315 320 Cys Gly Ser Tyr Asp Leu Asp Ala Ile Asn Glu Ala Asn Met Leu Cys 325 330 335 Asn Glu Tyr Gly Ile Asp Thr Ile Thr Cys Gly Ala Thr Ile Ala Ala 340 345 350 Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 355 360 365 Gly Asp Asn Leu Ser Leu Lys Trp Gly Asp Thr Glu Ser Met Ile Gly 370 375 380 Trp Ile Lys Arg Met Val Tyr Ser Glu Gly Phe Gly Ala Lys Met Thr 385 390 395 400 Asn Gly Ser Tyr Arg Leu Cys Glu Gly Tyr Gly Ala Pro Glu Tyr Ser 405 410 415 Met Thr Val Lys Lys Gln Glu Ile Pro Ala Tyr Asp Pro Arg Gly Ile 420 425 430 Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 435 440 445 Ile Lys Gly Tyr Met Ile Asn Pro Glu Ile Leu Gly Tyr Pro Glu Lys 450 455 460 Leu Asp Arg Phe Ala Leu Asp Gly Lys Ala Ala Tyr Ala Lys Leu Phe 465 470 475 480 His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 485 490 495 Thr Phe Gly Leu Gly Ile Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 500 505 510 Val Gly Glu Ser Thr Tyr Asp Ala Asp Ser Leu Leu Glu Ala Gly Asp 515 520 525 Arg Ile Trp Thr Leu Glu Lys Leu Phe Asn Leu Ala Ala Gly Ile Asp 530 535 540 Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Pro Ile Pro 545 550 555 560 Asp Gly Pro Ser Lys Gly Glu Val His Arg Leu Asp Val Leu Leu Pro 565 570 575 Glu Tyr Tyr Ser Val Arg Gly Trp Ser Lys Glu Gly Ile Pro Thr Glu 580 585 590 Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Ile Gly Lys Phe 595 600 605 <210> SEQ ID NO 66 <211> LENGTH: 607 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AOR, ADK15083.1 <400> SEQUENCE: 66 Met Tyr Gly Tyr Lys Gly Lys Val Leu Arg Ile Asn Leu Ser Ser Lys 1 5 10 15 Thr Tyr Ile Val Glu Glu Leu Lys Ile Asp Lys Ala Lys Lys Phe Ile 20 25 30 Gly Ala Arg Gly Leu Gly Val Lys Thr Leu Phe Asp Glu Val Asp Pro 35 40 45 Lys Val Asp Pro Leu Ser Pro Asp Asn Lys Phe Ile Ile Ala Ala Gly 50 55 60 Pro Leu Thr Gly Ala Pro Val Pro Thr Ser Gly Arg Phe Met Val Val 65 70 75 80 Thr Lys Ser Pro Leu Thr Gly Thr Ile Ala Ile Ala Asn Ser Gly Gly 85 90 95 Lys Trp Gly Ala Glu Phe Lys Ala Ala Gly Tyr Asp Met Ile Ile Val 100 105 110 Glu Gly Lys Ser Asp Lys Glu Val Tyr Val Asn Ile Val Asp Asp Lys 115 120 125 Val Glu Phe Arg Asp Ala Ser His Val Trp Gly Lys Leu Thr Glu Glu 130 135 140

Thr Thr Lys Met Leu Gln Gln Glu Thr Asp Ser Arg Ala Lys Val Leu 145 150 155 160 Cys Ile Gly Pro Ala Gly Glu Lys Leu Ser Leu Met Ala Ala Val Met 165 170 175 Asn Asp Val Asp Arg Thr Ala Gly Arg Gly Gly Val Gly Ala Val Met 180 185 190 Gly Ser Lys Asn Leu Lys Ala Ile Val Val Lys Gly Ser Gly Lys Val 195 200 205 Lys Leu Phe Asp Glu Gln Lys Val Lys Glu Val Ala Leu Glu Lys Thr 210 215 220 Asn Ile Leu Arg Lys Asp Pro Val Ala Gly Gly Gly Leu Pro Thr Tyr 225 230 235 240 Gly Thr Ala Val Leu Val Asn Ile Ile Asn Glu Asn Gly Val His Pro 245 250 255 Val Lys Asn Phe Gln Lys Ser Tyr Thr Asp Gln Ala Asp Lys Ile Ser 260 265 270 Gly Glu Thr Leu Thr Lys Asp Cys Leu Val Arg Lys Asn Pro Cys Tyr 275 280 285 Arg Cys Pro Ile Ala Cys Gly Arg Trp Val Lys Leu Asp Asp Gly Thr 290 295 300 Glu Cys Gly Gly Pro Glu Tyr Glu Thr Leu Trp Ser Phe Gly Ser Asp 305 310 315 320 Cys Asp Val Tyr Asp Ile Asn Ala Val Asn Thr Ala Asn Met Leu Cys 325 330 335 Asn Glu Tyr Gly Leu Asp Thr Ile Thr Ala Gly Cys Thr Ile Ala Ala 340 345 350 Ala Met Glu Leu Tyr Gln Arg Gly Tyr Ile Lys Asp Glu Glu Ile Ala 355 360 365 Ala Asp Gly Leu Ser Leu Asn Trp Gly Asp Ala Lys Ser Met Val Glu 370 375 380 Trp Val Lys Lys Met Gly Leu Arg Glu Gly Phe Gly Asp Lys Met Ala 385 390 395 400 Asp Gly Ser Tyr Arg Leu Cys Asp Ser Tyr Gly Val Pro Glu Tyr Ser 405 410 415 Met Thr Val Lys Lys Gln Glu Leu Pro Ala Tyr Asp Pro Arg Gly Ile 420 425 430 Gln Gly His Gly Ile Thr Tyr Ala Val Asn Asn Arg Gly Gly Cys His 435 440 445 Ile Lys Gly Tyr Met Val Ser Pro Glu Ile Leu Gly Tyr Pro Glu Lys 450 455 460 Leu Asp Arg Leu Ala Val Glu Gly Lys Ala Gly Tyr Ala Arg Val Phe 465 470 475 480 His Asp Leu Thr Ala Val Ile Asp Ser Leu Gly Leu Cys Ile Phe Thr 485 490 495 Thr Phe Gly Leu Gly Ala Gln Asp Tyr Val Asp Met Tyr Asn Ala Val 500 505 510 Val Gly Gly Glu Leu His Asp Val Asn Ser Leu Met Leu Ala Gly Asp 515 520 525 Arg Ile Trp Thr Leu Glu Lys Ile Phe Asn Leu Lys Ala Gly Ile Asp 530 535 540 Ser Ser Gln Asp Thr Leu Pro Lys Arg Leu Leu Glu Glu Gln Ile Pro 545 550 555 560 Glu Gly Pro Ser Lys Gly Glu Val His Lys Leu Asp Val Leu Leu Pro 565 570 575 Glu Tyr Tyr Ser Val Arg Gly Trp Asp Lys Asn Gly Ile Pro Thr Glu 580 585 590 Glu Thr Leu Lys Lys Leu Gly Leu Asp Glu Tyr Val Gly Lys Leu 595 600 605 <210> SEQ ID NO 67 <211> LENGTH: 405 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Adh, AGY76060.1 <400> SEQUENCE: 67 Met Lys Tyr Met Gly Ile Lys Ile Tyr Gly Asn Lys Ile Arg Gly Ile 1 5 10 15 Ile Met Glu Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asp 20 25 30 Ala Leu Gly Ala Leu Lys Thr Leu Lys Gly Lys Lys Ala Val Val Val 35 40 45 Val Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Asp Lys Val Glu 50 55 60 Glu Tyr Leu Lys Glu Ala Asn Ile Glu Val Lys Leu Ile Glu Gly Val 65 70 75 80 Glu Pro Asp Pro Ser Val Glu Thr Val Met Lys Gly Ala Lys Ile Met 85 90 95 Thr Glu Phe Gly Pro Asp Trp Ile Val Ala Ile Gly Gly Gly Ser Pro 100 105 110 Ile Asp Ala Ala Lys Ala Met Trp Leu Phe Tyr Glu Tyr Pro Asp Phe 115 120 125 Thr Phe Lys Gln Ala Ile Val Pro Phe Gly Leu Pro Glu Leu Arg Gln 130 135 140 Lys Ala Lys Phe Val Ala Ile Ala Ser Thr Ser Gly Thr Ala Thr Glu 145 150 155 160 Val Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr 165 170 175 Pro Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Val Asp Pro 180 185 190 Ala Leu Ala Gln Thr Met Pro Pro Lys Leu Thr Ala His Thr Gly Met 195 200 205 Asp Ala Leu Thr His Ala Leu Glu Ala Tyr Val Ala Ser Ala Arg Ser 210 215 220 Asp Ile Ser Asp Pro Leu Ala Ile His Ser Ile Ile Met Thr Arg Asp 225 230 235 240 Asn Leu Leu Lys Ser Tyr Lys Gly Asp Lys Asp Ala Arg Asn Lys Met 245 250 255 His Ile Ser Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu 260 265 270 Gly Ile Thr His Ser Leu Ala His Lys Thr Gly Ala Val Trp His Ile 275 280 285 Pro His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Leu Asp Phe 290 295 300 Asn Lys Lys Ala Cys Ser Asp Arg Tyr Ala Asn Ile Ala Lys Ile Leu 305 310 315 320 Gly Leu Lys Gly Thr Thr Glu Asp Glu Leu Val Asp Ser Leu Val Lys 325 330 335 Met Val Gln Asp Met Asp Lys Glu Leu Asn Ile Pro Leu Thr Leu Lys 340 345 350 Asp Tyr Gly Ile Ser Lys Asp Asp Phe Asn Ser Asn Val Asp Phe Ile 355 360 365 Ala Lys Asn Ala Leu Leu Asp Ala Cys Thr Gly Ala Asn Pro Arg Pro 370 375 380 Ile Asp Phe Asp Gln Met Lys Lys Ile Leu Gln Cys Ile Tyr Asp Gly 385 390 395 400 Lys Lys Val Thr Phe 405 <210> SEQ ID NO 68 <211> LENGTH: 388 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Adh, ADK17019.1 <400> SEQUENCE: 68 Met Glu Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asp Ala 1 5 10 15 Leu Gly Ala Leu Lys Thr Leu Lys Gly Lys Lys Ala Val Val Val Val 20 25 30 Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Asp Lys Val Glu Glu 35 40 45 Tyr Leu Lys Glu Ala Asn Ile Glu Val Lys Leu Ile Glu Gly Val Glu 50 55 60 Pro Asp Pro Ser Val Glu Thr Val Met Lys Gly Ala Lys Ile Met Thr 65 70 75 80 Glu Phe Gly Pro Asp Trp Ile Val Ala Ile Gly Gly Gly Ser Pro Ile 85 90 95 Asp Ala Ala Lys Ala Met Trp Leu Phe Tyr Glu Tyr Pro Asp Phe Thr 100 105 110 Phe Lys Gln Ala Ile Val Pro Phe Gly Leu Pro Glu Leu Arg Gln Lys 115 120 125 Ala Lys Phe Val Ala Ile Ala Ser Thr Ser Gly Thr Ala Thr Glu Val 130 135 140 Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr Pro 145 150 155 160 Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Val Asp Pro Ala 165 170 175 Leu Ala Gln Thr Met Pro Pro Lys Leu Thr Ala His Thr Gly Met Asp 180 185 190 Ala Leu Thr His Ala Leu Glu Ala Tyr Val Ala Ser Ala Arg Ser Asp 195 200 205 Ile Ser Asp Pro Leu Ala Ile His Ser Ile Ile Met Thr Arg Asp Asn 210 215 220 Leu Leu Lys Ser Tyr Lys Gly Asp Lys Asp Ala Arg Asn Lys Met His 225 230 235 240 Ile Ser Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu Gly 245 250 255 Ile Thr His Ser Leu Ala His Lys Thr Gly Ala Val Trp His Ile Pro 260 265 270 His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Leu Asp Phe Asn 275 280 285 Lys Lys Ala Cys Ser Asp Arg Tyr Ala Asn Ile Ala Lys Ile Leu Gly 290 295 300 Leu Lys Gly Thr Thr Glu Asp Glu Leu Val Asp Ser Leu Val Lys Met 305 310 315 320 Val Gln Asp Met Asp Lys Glu Leu Asn Ile Pro Leu Thr Leu Lys Asp 325 330 335

Tyr Gly Ile Ser Lys Asp Asp Phe Asn Ser Asn Val Asp Phe Ile Ala 340 345 350 Lys Asn Ala Leu Leu Asp Ala Cys Thr Gly Ala Asn Pro Arg Pro Ile 355 360 365 Asp Phe Asp Gln Met Lys Lys Ile Leu Gln Cys Ile Tyr Asp Gly Lys 370 375 380 Lys Val Thr Phe 385 <210> SEQ ID NO 69 <211> LENGTH: 390 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: BdhB, NP_349891.1 <400> SEQUENCE: 69 Met Val Asp Phe Glu Tyr Ser Ile Pro Thr Arg Ile Phe Phe Gly Lys 1 5 10 15 Asp Lys Ile Asn Val Leu Gly Arg Glu Leu Lys Lys Tyr Gly Ser Lys 20 25 30 Val Leu Ile Val Tyr Gly Gly Gly Ser Ile Lys Arg Asn Gly Ile Tyr 35 40 45 Asp Lys Ala Val Ser Ile Leu Glu Lys Asn Ser Ile Lys Phe Tyr Glu 50 55 60 Leu Ala Gly Val Glu Pro Asn Pro Arg Val Thr Thr Val Glu Lys Gly 65 70 75 80 Val Lys Ile Cys Arg Glu Asn Gly Val Glu Val Val Leu Ala Ile Gly 85 90 95 Gly Gly Ser Ala Ile Asp Cys Ala Lys Val Ile Ala Ala Ala Cys Glu 100 105 110 Tyr Asp Gly Asn Pro Trp Asp Ile Val Leu Asp Gly Ser Lys Ile Lys 115 120 125 Arg Val Leu Pro Ile Ala Ser Ile Leu Thr Ile Ala Ala Thr Gly Ser 130 135 140 Glu Met Asp Thr Trp Ala Val Ile Asn Asn Met Asp Thr Asn Glu Lys 145 150 155 160 Leu Ile Ala Ala His Pro Asp Met Ala Pro Lys Phe Ser Ile Leu Asp 165 170 175 Pro Thr Tyr Thr Tyr Thr Val Pro Thr Asn Gln Thr Ala Ala Gly Thr 180 185 190 Ala Asp Ile Met Ser His Ile Phe Glu Val Tyr Phe Ser Asn Thr Lys 195 200 205 Thr Ala Tyr Leu Gln Asp Arg Met Ala Glu Ala Leu Leu Arg Thr Cys 210 215 220 Ile Lys Tyr Gly Gly Ile Ala Leu Glu Lys Pro Asp Asp Tyr Glu Ala 225 230 235 240 Arg Ala Asn Leu Met Trp Ala Ser Ser Leu Ala Ile Asn Gly Leu Leu 245 250 255 Thr Tyr Gly Lys Asp Thr Asn Trp Ser Val His Leu Met Glu His Glu 260 265 270 Leu Ser Ala Tyr Tyr Asp Ile Thr His Gly Val Gly Leu Ala Ile Leu 275 280 285 Thr Pro Asn Trp Met Glu Tyr Ile Leu Asn Asn Asp Thr Val Tyr Lys 290 295 300 Phe Val Glu Tyr Gly Val Asn Val Trp Gly Ile Asp Lys Glu Lys Asn 305 310 315 320 His Tyr Asp Ile Ala His Gln Ala Ile Gln Lys Thr Arg Asp Tyr Phe 325 330 335 Val Asn Val Leu Gly Leu Pro Ser Arg Leu Arg Asp Val Gly Ile Glu 340 345 350 Glu Glu Lys Leu Asp Ile Met Ala Lys Glu Ser Val Lys Leu Thr Gly 355 360 365 Gly Thr Ile Gly Asn Leu Arg Pro Val Asn Ala Ser Glu Val Leu Gln 370 375 380 Ile Phe Lys Lys Ser Val 385 390 <210> SEQ ID NO 70 <211> LENGTH: 387 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh, WP_041897187.1 <400> SEQUENCE: 70 Met Glu Asn Phe Asn Tyr Ser Ile Pro Thr Lys Val Tyr Phe Gly Lys 1 5 10 15 Gly Gln Ile Lys Asn Leu Ala Ala Ile Ile Lys Glu Tyr Gly Asn Lys 20 25 30 Ile Phe Ile Ala Tyr Gly Gly Gly Ser Ile Lys Lys Ile Gly Leu Tyr 35 40 45 Asp Glu Met Ile Lys Ile Leu Asn Asp Asn Ser Ile Ser Tyr Val Glu 50 55 60 Leu Ser Gly Ile Glu Pro Asn Pro Arg Ile Glu Thr Val Arg Lys Gly 65 70 75 80 Ile Lys Ile Cys Lys Glu Asn Asn Val Glu Val Val Leu Ala Val Gly 85 90 95 Gly Gly Ser Thr Ile Asp Cys Ala Lys Val Ile Ala Ala Gly Val Lys 100 105 110 Tyr Glu Gly Asp Pro Trp Asp Leu Val Thr Ser Pro Gln Lys Ile Asn 115 120 125 Glu Val Leu Pro Ile Val Thr Ile Leu Thr Leu Ser Ala Thr Gly Ser 130 135 140 Glu Met Asp Pro His Ala Val Ile Ser Asp Met Thr Thr Asn Gln Lys 145 150 155 160 Leu Gly Thr Gly His Glu Asn Met Lys Pro Lys Ala Ser Ile Leu Asp 165 170 175 Pro Glu Tyr Thr Tyr Ser Val Pro Lys Asn Gln Thr Ala Ala Gly Thr 180 185 190 Ala Asp Ile Met Ser His Ile Phe Glu Thr Tyr Phe Asn His Thr Lys 195 200 205 Gly Val Asp Ile Gln Asp Ser Thr Ala Glu Gly Leu Leu Arg Ala Cys 210 215 220 Ile Lys Tyr Gly Lys Ile Ala Ile Glu Asn Pro Lys Asp Tyr Asp Ala 225 230 235 240 Arg Ala Asn Leu Met Trp Ala Ser Ser Trp Ala Ile Asn Gly Leu Ile 245 250 255 Ser Tyr Gly Thr Asn Ser Pro Trp Val Val His Pro Met Glu His Glu 260 265 270 Leu Ser Ala Phe Tyr Asp Ile Thr His Gly Val Gly Leu Ala Ile Leu 275 280 285 Thr Pro His Trp Met Lys Tyr Ser Leu Asp Asp Thr Thr Val Phe Lys 290 295 300 Phe Ala Gln Tyr Gly Ile Asn Val Trp Gly Ile Asp Lys Asn Leu Asp 305 310 315 320 Lys Phe Glu Ile Ala Asn Lys Ala Ile Glu Lys Thr Ser Glu Phe Phe 325 330 335 Lys Glu Leu Gly Ile Pro Ser Thr Leu Arg Glu Val Gly Ile Glu Glu 340 345 350 Glu Lys Leu Glu Leu Met Ala Lys Lys Ala Met Asn Pro Tyr Phe Lys 355 360 365 Tyr Ala Phe Lys Pro Leu Asp Glu Asn Asp Ile Leu Lys Ile Phe Lys 370 375 380 Ala Ala Leu 385 <210> SEQ ID NO 71 <211> LENGTH: 388 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh1, YP_003780648.1 <400> SEQUENCE: 71 Met Gly Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asn Ala 1 5 10 15 Leu Glu Asn Leu Lys Asn Leu Asp Gly Asn Lys Ala Val Val Val Val 20 25 30 Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Ala Lys Val Glu Lys 35 40 45 Tyr Leu Lys Glu Thr Gly Met Glu Val Lys Leu Ile Glu Gly Val Glu 50 55 60 Pro Asp Pro Ser Val Asp Thr Val Met Asn Gly Ala Lys Ile Met Arg 65 70 75 80 Asp Phe Asn Pro Asp Trp Ile Val Ser Ile Gly Gly Gly Ser Pro Ile 85 90 95 Asp Ala Ala Lys Ala Met Trp Ile Phe Tyr Glu Tyr Pro Asp Phe Thr 100 105 110 Phe Glu Lys Ala Val Val Pro Phe Gly Ile Pro Lys Leu Arg Gln Lys 115 120 125 Ala Gln Phe Val Ala Ile Pro Ser Thr Ser Gly Thr Ala Thr Glu Val 130 135 140 Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr Pro 145 150 155 160 Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Ile Asp Pro Ser 165 170 175 Leu Ala Glu Thr Met Pro Lys Lys Leu Thr Ala His Thr Gly Met Asp 180 185 190 Ala Leu Thr His Ala Ile Glu Ala Tyr Val Ala Ser Leu His Ser Asp 195 200 205 Phe Ser Asp Pro Leu Ala Met His Ala Ile Thr Met Ile His Lys Tyr 210 215 220 Leu Leu Lys Ser Tyr Glu Glu Asp Lys Glu Ala Arg Gly His Met His 225 230 235 240 Ile Ala Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu Gly 245 250 255 Ile Thr His Ser Ile Ala His Lys Thr Gly Ala Val Phe His Ile Pro 260 265 270 His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Ile Asp Phe Asn 275 280 285 Lys Lys Ala Cys Ser Glu Arg Tyr Ala Lys Ile Ala Lys Lys Leu His 290 295 300

Leu Ser Gly Asn Ser Glu Asp Glu Leu Ile Asp Ser Leu Thr Glu Met 305 310 315 320 Ile Arg Thr Met Asn Lys Lys Met Asp Ile Pro Leu Thr Ile Lys Asp 325 330 335 Tyr Gly Ile Ser Glu Asn Asp Phe Asn Glu Asn Leu Asp Phe Ile Ala 340 345 350 His Asn Ala Met Met Asp Ala Cys Thr Gly Ser Asn Pro Arg Ala Ile 355 360 365 Thr Glu Glu Glu Met Lys Lys Leu Leu Gln Tyr Met Tyr Asn Gly Gln 370 375 380 Lys Val Asn Phe 385 <210> SEQ ID NO 72 <211> LENGTH: 405 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh1, AGY76060.1 <400> SEQUENCE: 72 Met Lys Tyr Met Gly Ile Lys Ile Tyr Gly Asn Lys Ile Arg Gly Ile 1 5 10 15 Ile Met Glu Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asp 20 25 30 Ala Leu Gly Ala Leu Lys Thr Leu Lys Gly Lys Lys Ala Val Val Val 35 40 45 Val Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Asp Lys Val Glu 50 55 60 Glu Tyr Leu Lys Glu Ala Asn Ile Glu Val Lys Leu Ile Glu Gly Val 65 70 75 80 Glu Pro Asp Pro Ser Val Glu Thr Val Met Lys Gly Ala Lys Ile Met 85 90 95 Thr Glu Phe Gly Pro Asp Trp Ile Val Ala Ile Gly Gly Gly Ser Pro 100 105 110 Ile Asp Ala Ala Lys Ala Met Trp Leu Phe Tyr Glu Tyr Pro Asp Phe 115 120 125 Thr Phe Lys Gln Ala Ile Val Pro Phe Gly Leu Pro Glu Leu Arg Gln 130 135 140 Lys Ala Lys Phe Val Ala Ile Ala Ser Thr Ser Gly Thr Ala Thr Glu 145 150 155 160 Val Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr 165 170 175 Pro Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Val Asp Pro 180 185 190 Ala Leu Ala Gln Thr Met Pro Pro Lys Leu Thr Ala His Thr Gly Met 195 200 205 Asp Ala Leu Thr His Ala Leu Glu Ala Tyr Val Ala Ser Ala Arg Ser 210 215 220 Asp Ile Ser Asp Pro Leu Ala Ile His Ser Ile Ile Met Thr Arg Asp 225 230 235 240 Asn Leu Leu Lys Ser Tyr Lys Gly Asp Lys Asp Ala Arg Asn Lys Met 245 250 255 His Ile Ser Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu 260 265 270 Gly Ile Thr His Ser Leu Ala His Lys Thr Gly Ala Val Trp His Ile 275 280 285 Pro His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Leu Asp Phe 290 295 300 Asn Lys Lys Ala Cys Ser Asp Arg Tyr Ala Asn Ile Ala Lys Ile Leu 305 310 315 320 Gly Leu Lys Gly Thr Thr Glu Asp Glu Leu Val Asp Ser Leu Val Lys 325 330 335 Met Val Gln Asp Met Asp Lys Glu Leu Asn Ile Pro Leu Thr Leu Lys 340 345 350 Asp Tyr Gly Ile Ser Lys Asp Asp Phe Asn Ser Asn Val Asp Phe Ile 355 360 365 Ala Lys Asn Ala Leu Leu Asp Ala Cys Thr Gly Ala Asn Pro Arg Pro 370 375 380 Ile Asp Phe Asp Gln Met Lys Lys Ile Leu Gln Cys Ile Tyr Asp Gly 385 390 395 400 Lys Lys Val Thr Phe 405 <210> SEQ ID NO 73 <211> LENGTH: 388 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh2, YP_003782121.1 <400> SEQUENCE: 73 Met Glu Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asp Ala 1 5 10 15 Leu Gly Ala Leu Lys Thr Leu Lys Gly Lys Lys Ala Val Val Val Val 20 25 30 Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Asp Lys Val Glu Glu 35 40 45 Tyr Leu Lys Glu Ala Asn Ile Glu Val Lys Leu Ile Glu Gly Val Glu 50 55 60 Pro Asp Pro Ser Val Glu Thr Val Met Lys Gly Ala Lys Ile Met Thr 65 70 75 80 Glu Phe Gly Pro Asp Trp Ile Val Ala Ile Gly Gly Gly Ser Pro Ile 85 90 95 Asp Ala Ala Lys Ala Met Trp Leu Phe Tyr Glu Tyr Pro Asp Phe Thr 100 105 110 Phe Lys Gln Ala Ile Val Pro Phe Gly Leu Pro Glu Leu Arg Gln Lys 115 120 125 Ala Lys Phe Val Ala Ile Ala Ser Thr Ser Gly Thr Ala Thr Glu Val 130 135 140 Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr Pro 145 150 155 160 Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Val Asp Pro Ala 165 170 175 Leu Ala Gln Thr Met Pro Pro Lys Leu Thr Ala His Thr Gly Met Asp 180 185 190 Ala Leu Thr His Ala Leu Glu Ala Tyr Val Ala Ser Ala Arg Ser Asp 195 200 205 Ile Ser Asp Pro Leu Ala Ile His Ser Ile Ile Met Thr Arg Asp Asn 210 215 220 Leu Leu Lys Ser Tyr Lys Gly Asp Lys Asp Ala Arg Asn Lys Met His 225 230 235 240 Ile Ser Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu Gly 245 250 255 Ile Thr His Ser Leu Ala His Lys Thr Gly Ala Val Trp His Ile Pro 260 265 270 His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Leu Asp Phe Asn 275 280 285 Lys Lys Ala Cys Ser Asp Arg Tyr Ala Asn Ile Ala Lys Ile Leu Gly 290 295 300 Leu Lys Gly Thr Thr Glu Asp Glu Leu Val Asp Ser Leu Val Lys Met 305 310 315 320 Val Gln Asp Met Asp Lys Glu Leu Asn Ile Pro Leu Thr Leu Lys Asp 325 330 335 Tyr Gly Ile Ser Lys Asp Asp Phe Asn Ser Asn Val Asp Phe Ile Ala 340 345 350 Lys Asn Ala Leu Leu Asp Ala Cys Thr Gly Ala Asn Pro Arg Pro Ile 355 360 365 Asp Phe Asp Gln Met Lys Lys Ile Leu Gln Cys Ile Tyr Asp Gly Lys 370 375 380 Lys Val Thr Phe 385 <210> SEQ ID NO 74 <211> LENGTH: 388 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bdh2, AGY74784.1 <400> SEQUENCE: 74 Met Gly Arg Phe Thr Leu Pro Arg Asp Ile Tyr Phe Gly Glu Asn Ala 1 5 10 15 Leu Glu Asn Leu Lys Asn Leu Asp Gly Asn Lys Ala Val Val Val Val 20 25 30 Gly Gly Gly Ser Met Lys Arg Phe Gly Phe Leu Ala Lys Val Glu Lys 35 40 45 Tyr Leu Lys Glu Thr Gly Met Glu Val Lys Leu Ile Glu Gly Val Glu 50 55 60 Pro Asp Pro Ser Val Asp Thr Val Met Asn Gly Ala Lys Ile Met Arg 65 70 75 80 Asp Phe Asn Pro Asp Trp Ile Val Ser Ile Gly Gly Gly Ser Pro Ile 85 90 95 Asp Ala Ala Lys Ala Met Trp Ile Phe Tyr Glu Tyr Pro Asp Phe Thr 100 105 110 Phe Glu Lys Ala Val Val Pro Phe Gly Ile Pro Lys Leu Arg Gln Lys 115 120 125 Ala Gln Phe Val Ala Ile Pro Ser Thr Ser Gly Thr Ala Thr Glu Val 130 135 140 Thr Ser Phe Ser Val Ile Thr Asp Tyr Lys Ala Lys Ile Lys Tyr Pro 145 150 155 160 Leu Ala Asp Phe Asn Leu Thr Pro Asp Ile Ala Ile Ile Asp Pro Ser 165 170 175 Leu Ala Glu Thr Met Pro Lys Lys Leu Thr Ala His Thr Gly Met Asp 180 185 190 Ala Leu Thr His Ala Ile Glu Ala Tyr Val Ala Ser Leu His Ser Asp 195 200 205 Phe Ser Asp Pro Leu Ala Met His Ala Ile Thr Met Ile His Lys Tyr 210 215 220 Leu Leu Lys Ser Tyr Glu Glu Asp Lys Glu Ala Arg Gly His Met His 225 230 235 240 Ile Ala Gln Cys Leu Ala Gly Met Ala Phe Ser Asn Ala Leu Leu Gly

245 250 255 Ile Thr His Ser Ile Ala His Lys Thr Gly Ala Val Phe His Ile Pro 260 265 270 His Gly Cys Ala Asn Ala Ile Tyr Leu Pro Tyr Val Ile Asp Phe Asn 275 280 285 Lys Lys Ala Cys Ser Glu Arg Tyr Ala Lys Ile Ala Lys Lys Leu His 290 295 300 Leu Ser Gly Asn Ser Glu Asp Glu Leu Ile Asp Ser Leu Thr Glu Met 305 310 315 320 Ile Arg Thr Met Asn Lys Lys Met Asp Ile Pro Leu Thr Ile Lys Asp 325 330 335 Tyr Gly Ile Ser Glu Asn Asp Phe Asn Glu Asn Leu Asp Phe Ile Ala 340 345 350 His Asn Ala Met Met Asp Ala Cys Thr Gly Ser Asn Pro Arg Ala Ile 355 360 365 Thr Glu Glu Glu Met Lys Lys Leu Leu Gln Tyr Met Tyr Asn Gly Gln 370 375 380 Lys Val Asn Phe 385 <210> SEQ ID NO 75 <211> LENGTH: 862 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AdhE1, NP_149325.1 <400> SEQUENCE: 75 Met Lys Val Thr Thr Val Lys Glu Leu Asp Glu Lys Leu Lys Val Ile 1 5 10 15 Lys Glu Ala Gln Lys Lys Phe Ser Cys Tyr Ser Gln Glu Met Val Asp 20 25 30 Glu Ile Phe Arg Asn Ala Ala Met Ala Ala Ile Asp Ala Arg Ile Glu 35 40 45 Leu Ala Lys Ala Ala Val Leu Glu Thr Gly Met Gly Leu Val Glu Asp 50 55 60 Lys Val Ile Lys Asn His Phe Ala Gly Glu Tyr Ile Tyr Asn Lys Tyr 65 70 75 80 Lys Asp Glu Lys Thr Cys Gly Ile Ile Glu Arg Asn Glu Pro Tyr Gly 85 90 95 Ile Thr Lys Ile Ala Glu Pro Ile Gly Val Val Ala Ala Ile Ile Pro 100 105 110 Val Thr Asn Pro Thr Ser Thr Thr Ile Phe Lys Ser Leu Ile Ser Leu 115 120 125 Lys Thr Arg Asn Gly Ile Phe Phe Ser Pro His Pro Arg Ala Lys Lys 130 135 140 Ser Thr Ile Leu Ala Ala Lys Thr Ile Leu Asp Ala Ala Val Lys Ser 145 150 155 160 Gly Ala Pro Glu Asn Ile Ile Gly Trp Ile Asp Glu Pro Ser Ile Glu 165 170 175 Leu Thr Gln Tyr Leu Met Gln Lys Ala Asp Ile Thr Leu Ala Thr Gly 180 185 190 Gly Pro Ser Leu Val Lys Ser Ala Tyr Ser Ser Gly Lys Pro Ala Ile 195 200 205 Gly Val Gly Pro Gly Asn Thr Pro Val Ile Ile Asp Glu Ser Ala His 210 215 220 Ile Lys Met Ala Val Ser Ser Ile Ile Leu Ser Lys Thr Tyr Asp Asn 225 230 235 240 Gly Val Ile Cys Ala Ser Glu Gln Ser Val Ile Val Leu Lys Ser Ile 245 250 255 Tyr Asn Lys Val Lys Asp Glu Phe Gln Glu Arg Gly Ala Tyr Ile Ile 260 265 270 Lys Lys Asn Glu Leu Asp Lys Val Arg Glu Val Ile Phe Lys Asp Gly 275 280 285 Ser Val Asn Pro Lys Ile Val Gly Gln Ser Ala Tyr Thr Ile Ala Ala 290 295 300 Met Ala Gly Ile Lys Val Pro Lys Thr Thr Arg Ile Leu Ile Gly Glu 305 310 315 320 Val Thr Ser Leu Gly Glu Glu Glu Pro Phe Ala His Glu Lys Leu Ser 325 330 335 Pro Val Leu Ala Met Tyr Glu Ala Asp Asn Phe Asp Asp Ala Leu Lys 340 345 350 Lys Ala Val Thr Leu Ile Asn Leu Gly Gly Leu Gly His Thr Ser Gly 355 360 365 Ile Tyr Ala Asp Glu Ile Lys Ala Arg Asp Lys Ile Asp Arg Phe Ser 370 375 380 Ser Ala Met Lys Thr Val Arg Thr Phe Val Asn Ile Pro Thr Ser Gln 385 390 395 400 Gly Ala Ser Gly Asp Leu Tyr Asn Phe Arg Ile Pro Pro Ser Phe Thr 405 410 415 Leu Gly Cys Gly Phe Trp Gly Gly Asn Ser Val Ser Glu Asn Val Gly 420 425 430 Pro Lys His Leu Leu Asn Ile Lys Thr Val Ala Glu Arg Arg Glu Asn 435 440 445 Met Leu Trp Phe Arg Val Pro His Lys Val Tyr Phe Lys Phe Gly Cys 450 455 460 Leu Gln Phe Ala Leu Lys Asp Leu Lys Asp Leu Lys Lys Lys Arg Ala 465 470 475 480 Phe Ile Val Thr Asp Ser Asp Pro Tyr Asn Leu Asn Tyr Val Asp Ser 485 490 495 Ile Ile Lys Ile Leu Glu His Leu Asp Ile Asp Phe Lys Val Phe Asn 500 505 510 Lys Val Gly Arg Glu Ala Asp Leu Lys Thr Ile Lys Lys Ala Thr Glu 515 520 525 Glu Met Ser Ser Phe Met Pro Asp Thr Ile Ile Ala Leu Gly Gly Thr 530 535 540 Pro Glu Met Ser Ser Ala Lys Leu Met Trp Val Leu Tyr Glu His Pro 545 550 555 560 Glu Val Lys Phe Glu Asp Leu Ala Ile Lys Phe Met Asp Ile Arg Lys 565 570 575 Arg Ile Tyr Thr Phe Pro Lys Leu Gly Lys Lys Ala Met Leu Val Ala 580 585 590 Ile Thr Thr Ser Ala Gly Ser Gly Ser Glu Val Thr Pro Phe Ala Leu 595 600 605 Val Thr Asp Asn Asn Thr Gly Asn Lys Tyr Met Leu Ala Asp Tyr Glu 610 615 620 Met Thr Pro Asn Met Ala Ile Val Asp Ala Glu Leu Met Met Lys Met 625 630 635 640 Pro Lys Gly Leu Thr Ala Tyr Ser Gly Ile Asp Ala Leu Val Asn Ser 645 650 655 Ile Glu Ala Tyr Thr Ser Val Tyr Ala Ser Glu Tyr Thr Asn Gly Leu 660 665 670 Ala Leu Glu Ala Ile Arg Leu Ile Phe Lys Tyr Leu Pro Glu Ala Tyr 675 680 685 Lys Asn Gly Arg Thr Asn Glu Lys Ala Arg Glu Lys Met Ala His Ala 690 695 700 Ser Thr Met Ala Gly Met Ala Ser Ala Asn Ala Phe Leu Gly Leu Cys 705 710 715 720 His Ser Met Ala Ile Lys Leu Ser Ser Glu His Asn Ile Pro Ser Gly 725 730 735 Ile Ala Asn Ala Leu Leu Ile Glu Glu Val Ile Lys Phe Asn Ala Val 740 745 750 Asp Asn Pro Val Lys Gln Ala Pro Cys Pro Gln Tyr Lys Tyr Pro Asn 755 760 765 Thr Ile Phe Arg Tyr Ala Arg Ile Ala Asp Tyr Ile Lys Leu Gly Gly 770 775 780 Asn Thr Asp Glu Glu Lys Val Asp Leu Leu Ile Asn Lys Ile His Glu 785 790 795 800 Leu Lys Lys Ala Leu Asn Ile Pro Thr Ser Ile Lys Asp Ala Gly Val 805 810 815 Leu Glu Glu Asn Phe Tyr Ser Ser Leu Asp Arg Ile Ser Glu Leu Ala 820 825 830 Leu Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Phe Pro Leu Thr Ser 835 840 845 Glu Ile Lys Glu Met Tyr Ile Asn Cys Phe Lys Lys Gln Pro 850 855 860 <210> SEQ ID NO 76 <211> LENGTH: 858 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AdhE2, NP_149199.1 <400> SEQUENCE: 76 Met Lys Val Thr Asn Gln Lys Glu Leu Lys Gln Lys Leu Asn Glu Leu 1 5 10 15 Arg Glu Ala Gln Lys Lys Phe Ala Thr Tyr Thr Gln Glu Gln Val Asp 20 25 30 Lys Ile Phe Lys Gln Cys Ala Ile Ala Ala Ala Lys Glu Arg Ile Asn 35 40 45 Leu Ala Lys Leu Ala Val Glu Glu Thr Gly Ile Gly Leu Val Glu Asp 50 55 60 Lys Ile Ile Lys Asn His Phe Ala Ala Glu Tyr Ile Tyr Asn Lys Tyr 65 70 75 80 Lys Asn Glu Lys Thr Cys Gly Ile Ile Asp His Asp Asp Ser Leu Gly 85 90 95 Ile Thr Lys Val Ala Glu Pro Ile Gly Ile Val Ala Ala Ile Val Pro 100 105 110 Thr Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu Ile Ser Leu 115 120 125 Lys Thr Arg Asn Ala Ile Phe Phe Ser Pro His Pro Arg Ala Lys Lys 130 135 140 Ser Thr Ile Ala Ala Ala Lys Leu Ile Leu Asp Ala Ala Val Lys Ala 145 150 155 160 Gly Ala Pro Lys Asn Ile Ile Gly Trp Ile Asp Glu Pro Ser Ile Glu 165 170 175 Leu Ser Gln Asp Leu Met Ser Glu Ala Asp Ile Ile Leu Ala Thr Gly 180 185 190 Gly Pro Ser Met Val Lys Ala Ala Tyr Ser Ser Gly Lys Pro Ala Ile 195 200 205

Gly Val Gly Ala Gly Asn Thr Pro Ala Ile Ile Asp Glu Ser Ala Asp 210 215 220 Ile Asp Met Ala Val Ser Ser Ile Ile Leu Ser Lys Thr Tyr Asp Asn 225 230 235 240 Gly Val Ile Cys Ala Ser Glu Gln Ser Ile Leu Val Met Asn Ser Ile 245 250 255 Tyr Glu Lys Val Lys Glu Glu Phe Val Lys Arg Gly Ser Tyr Ile Leu 260 265 270 Asn Gln Asn Glu Ile Ala Lys Ile Lys Glu Thr Met Phe Lys Asn Gly 275 280 285 Ala Ile Asn Ala Asp Ile Val Gly Lys Ser Ala Tyr Ile Ile Ala Lys 290 295 300 Met Ala Gly Ile Glu Val Pro Gln Thr Thr Lys Ile Leu Ile Gly Glu 305 310 315 320 Val Gln Ser Val Glu Lys Ser Glu Leu Phe Ser His Glu Lys Leu Ser 325 330 335 Pro Val Leu Ala Met Tyr Lys Val Lys Asp Phe Asp Glu Ala Leu Lys 340 345 350 Lys Ala Gln Arg Leu Ile Glu Leu Gly Gly Ser Gly His Thr Ser Ser 355 360 365 Leu Tyr Ile Asp Ser Gln Asn Asn Lys Asp Lys Val Lys Glu Phe Gly 370 375 380 Leu Ala Met Lys Thr Ser Arg Thr Phe Ile Asn Met Pro Ser Ser Gln 385 390 395 400 Gly Ala Ser Gly Asp Leu Tyr Asn Phe Ala Ile Ala Pro Ser Phe Thr 405 410 415 Leu Gly Cys Gly Thr Trp Gly Gly Asn Ser Val Ser Gln Asn Val Glu 420 425 430 Pro Lys His Leu Leu Asn Ile Lys Ser Val Ala Glu Arg Arg Glu Asn 435 440 445 Met Leu Trp Phe Lys Val Pro Gln Lys Ile Tyr Phe Lys Tyr Gly Cys 450 455 460 Leu Arg Phe Ala Leu Lys Glu Leu Lys Asp Met Asn Lys Lys Arg Ala 465 470 475 480 Phe Ile Val Thr Asp Lys Asp Leu Phe Lys Leu Gly Tyr Val Asn Lys 485 490 495 Ile Thr Lys Val Leu Asp Glu Ile Asp Ile Lys Tyr Ser Ile Phe Thr 500 505 510 Asp Ile Lys Ser Asp Pro Thr Ile Asp Ser Val Lys Lys Gly Ala Lys 515 520 525 Glu Met Leu Asn Phe Glu Pro Asp Thr Ile Ile Ser Ile Gly Gly Gly 530 535 540 Ser Pro Met Asp Ala Ala Lys Val Met His Leu Leu Tyr Glu Tyr Pro 545 550 555 560 Glu Ala Glu Ile Glu Asn Leu Ala Ile Asn Phe Met Asp Ile Arg Lys 565 570 575 Arg Ile Cys Asn Phe Pro Lys Leu Gly Thr Lys Ala Ile Ser Val Ala 580 585 590 Ile Pro Thr Thr Ala Gly Thr Gly Ser Glu Ala Thr Pro Phe Ala Val 595 600 605 Ile Thr Asn Asp Glu Thr Gly Met Lys Tyr Pro Leu Thr Ser Tyr Glu 610 615 620 Leu Thr Pro Asn Met Ala Ile Ile Asp Thr Glu Leu Met Leu Asn Met 625 630 635 640 Pro Arg Lys Leu Thr Ala Ala Thr Gly Ile Asp Ala Leu Val His Ala 645 650 655 Ile Glu Ala Tyr Val Ser Val Met Ala Thr Asp Tyr Thr Asp Glu Leu 660 665 670 Ala Leu Arg Ala Ile Lys Met Ile Phe Lys Tyr Leu Pro Arg Ala Tyr 675 680 685 Lys Asn Gly Thr Asn Asp Ile Glu Ala Arg Glu Lys Met Ala His Ala 690 695 700 Ser Asn Ile Ala Gly Met Ala Phe Ala Asn Ala Phe Leu Gly Val Cys 705 710 715 720 His Ser Met Ala His Lys Leu Gly Ala Met His His Val Pro His Gly 725 730 735 Ile Ala Cys Ala Val Leu Ile Glu Glu Val Ile Lys Tyr Asn Ala Thr 740 745 750 Asp Cys Pro Thr Lys Gln Thr Ala Phe Pro Gln Tyr Lys Ser Pro Asn 755 760 765 Ala Lys Arg Lys Tyr Ala Glu Ile Ala Glu Tyr Leu Asn Leu Lys Gly 770 775 780 Thr Ser Asp Thr Glu Lys Val Thr Ala Leu Ile Glu Ala Ile Ser Lys 785 790 795 800 Leu Lys Ile Asp Leu Ser Ile Pro Gln Asn Ile Ser Ala Ala Gly Ile 805 810 815 Asn Lys Lys Asp Phe Tyr Asn Thr Leu Asp Lys Met Ser Glu Leu Ala 820 825 830 Phe Asp Asp Gln Cys Thr Thr Ala Asn Pro Arg Tyr Pro Leu Ile Ser 835 840 845 Glu Leu Lys Asp Ile Tyr Ile Lys Ser Phe 850 855 <210> SEQ ID NO 77 <211> LENGTH: 860 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AdhE, WP_041893626.1 <400> SEQUENCE: 77 Met Arg Val Thr Asn Pro Glu Glu Leu Thr Lys Arg Ile Glu Gln Ile 1 5 10 15 Arg Glu Ala Gln Arg Glu Phe Ala Lys Phe Ser Gln Glu Glu Val Asp 20 25 30 Glu Ile Phe Arg Gln Ala Ala Met Ala Ala Asn Asp Ala Arg Ile Thr 35 40 45 Leu Ala Lys Met Ala Val Glu Glu Ser Gly Met Gly Ile Val Glu Asp 50 55 60 Lys Val Ile Lys Asn His Phe Ala Ala Glu Tyr Ile Tyr Asn Gln Tyr 65 70 75 80 Lys Asp Thr Lys Thr Cys Gly Val Ile Glu Arg Asp Glu Met Phe Gly 85 90 95 Ile Thr His Ile Ala Glu Pro Ile Gly Val Ile Ala Ala Ile Val Pro 100 105 110 Thr Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Thr Leu Ile Ala Leu 115 120 125 Lys Thr Arg Asn Gly Ile Ile Ile Ser Pro His Pro Arg Ala Lys Asn 130 135 140 Ser Thr Ile Ala Ala Ala Lys Ile Val Leu Glu Ala Ala Glu Arg Ala 145 150 155 160 Gly Ala Pro Lys Gly Ile Ile Gly Trp Ile Asp Glu Pro Ser Ile Glu 165 170 175 Leu Ser Arg Asn Val Met Ser Glu Ser Asp Ile Ile Leu Ala Thr Gly 180 185 190 Gly Pro Gly Met Val Arg Ala Ala Tyr Ser Ser Gly Lys Pro Ala Ile 195 200 205 Gly Val Gly Ala Gly Asn Thr Pro Ala Ile Ile Asp Asp Thr Ala His 210 215 220 Ile Lys Met Ala Val Asn Ser Ile Leu Leu Ser Lys Thr Phe Asp Asn 225 230 235 240 Gly Val Val Cys Ala Ser Glu Gln Ser Ile Ile Ala Met Glu Ser Val 245 250 255 Tyr Asp Glu Val Arg Lys Glu Leu Asp Glu Arg Gly Ala Tyr Ile Leu 260 265 270 Lys Gly Asp Glu Val Asp Lys Val Arg Ser Ile Ile Leu Asp Pro Lys 275 280 285 Gly Ser Leu Asn Ser Glu Ile Val Gly Gln Ser Ala Tyr Lys Ile Ala 290 295 300 Lys Met Ala Gly Val Glu Val Ser Glu Ala Val Lys Val Leu Ile Gly 305 310 315 320 Glu Val Glu Ser Pro Glu Leu Glu Glu Pro Phe Ser His Glu Lys Leu 325 330 335 Ser Pro Ile Leu Gly Met Tyr Lys Ala Lys Thr Phe Asp Asp Ala Leu 340 345 350 Arg Leu Ala Ser Arg Met Ile Glu Leu Gly Gly Phe Gly His Thr Ser 355 360 365 Ile Leu Tyr Thr Asn Gln Val Glu Ser Val Asp Arg Ile Glu Lys Phe 370 375 380 Gly Val Ala Met Lys Thr Ala Arg Thr Leu Ile Asn Met Pro Ala Ser 385 390 395 400 Gln Gly Ala Ile Gly Asp Ile Tyr Asn Phe Lys Leu Ala Pro Ser Leu 405 410 415 Thr Leu Gly Cys Gly Ser Trp Gly Gly Asn Ser Ile Ser Glu Asn Val 420 425 430 Gly Pro Lys His Leu Ile Asn Val Lys Arg Ile Ala Glu Arg Arg Glu 435 440 445 Asn Met Leu Trp Phe Arg Val Pro Asp Lys Ile Tyr Phe Lys Phe Gly 450 455 460 Cys Leu Pro Ile Ala Leu Glu Glu Leu Asn Ala Met Lys Lys Lys Arg 465 470 475 480 Ala Phe Ile Val Thr Asp Arg Val Leu Phe Asp Leu Gly Tyr Thr His 485 490 495 Lys Ile Thr Asp Ile Leu Ser Glu Asn His Ile Glu Tyr Lys Ile Phe 500 505 510 Ser Asp Val Glu Pro Asp Pro Thr Leu Lys Ala Ala Lys Leu Gly Ala 515 520 525 Asp Ala Met Arg Asp Phe Asn Pro Asp Val Ile Ile Ala Ile Gly Gly 530 535 540 Gly Ser Pro Met Asp Ala Ala Lys Ile Met Trp Val Met Tyr Glu His 545 550 555 560 Pro Asp Val Arg Phe Glu Asp Leu Ala Met Arg Phe Met Asp Ile Arg 565 570 575 Lys Arg Val Tyr Glu Phe Pro Pro Met Gly Glu Arg Ala Ile Leu Val 580 585 590 Ala Ile Pro Thr Ser Ala Gly Thr Gly Ser Glu Val Thr Pro Phe Ala 595 600 605 Val Ile Thr Asp Gln Gln Thr Gly Val Lys Tyr Pro Leu Ala Asp Tyr 610 615 620

Ala Leu Thr Pro Asn Met Ala Ile Ile Asp Ala Glu Leu Met Met Ser 625 630 635 640 Met Pro Lys Gly Leu Thr Ala Ala Ser Gly Ile Asp Ala Leu Val His 645 650 655 Ala Ile Glu Ala Tyr Val Ser Val Leu Ala Ser Glu Tyr Thr Asn Gly 660 665 670 Leu Ala Leu Glu Ala Ile Arg Leu Thr Phe Lys Tyr Leu Pro Asp Ala 675 680 685 Tyr Asn Gly Gly Thr Thr Asn Ile Lys Ala Arg Glu Lys Met Ala His 690 695 700 Ala Ser Ser Val Ala Gly Met Ala Phe Ala Asn Ala Phe Leu Gly Ile 705 710 715 720 Cys His Ser Met Ala His Lys Leu Gly Ala Phe His His Val Pro His 725 730 735 Gly Ile Ala Asn Ala Leu Leu Ile Asp Glu Val Ile Arg Phe Asn Ala 740 745 750 Thr Asp Ala Pro Arg Lys Gln Ala Ala Phe Pro Gln Tyr Lys Tyr Pro 755 760 765 Asn Ala Gly Trp Arg Tyr Ala Arg Ile Ala Asp Tyr Leu Asn Leu Gly 770 775 780 Gly Asn Thr Glu Glu Glu Lys Val Glu Leu Leu Ile Lys Ala Ile Asp 785 790 795 800 Asp Leu Lys Val Lys Val Arg Ile Pro Lys Ser Ile Lys Glu Phe Gly 805 810 815 Val Ser Glu Glu Lys Phe Tyr Asp Ser Met Asp Glu Met Val Glu Gln 820 825 830 Ala Phe Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr Pro Leu Met 835 840 845 Ser Glu Ile Lys Glu Met Tyr Ile Lys Ser Tyr Asn 850 855 860 <210> SEQ ID NO 78 <211> LENGTH: 870 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AdhE1, WP_023163372.1 <400> SEQUENCE: 78 Met Lys Val Thr Asn Val Glu Glu Leu Met Lys Arg Leu Glu Glu Ile 1 5 10 15 Lys Asp Ala Gln Lys Lys Phe Ala Thr Tyr Thr Gln Glu Gln Val Asp 20 25 30 Glu Ile Phe Arg Gln Ala Ala Met Ala Ala Asn Ser Ala Arg Ile Glu 35 40 45 Leu Ala Lys Met Ala Val Glu Glu Ser Gly Met Gly Ile Val Glu Asp 50 55 60 Lys Val Ile Lys Asn His Phe Ala Ser Glu Tyr Ile Tyr Asn Lys Tyr 65 70 75 80 Lys Asp Glu Lys Thr Cys Gly Val Leu Glu Arg Asp Ala Gly Phe Gly 85 90 95 Ile Val Arg Ile Ala Glu Pro Val Gly Val Ile Ala Ala Val Val Pro 100 105 110 Thr Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu Ile Ala Leu 115 120 125 Lys Thr Arg Asn Gly Ile Ile Phe Ser Pro His Pro Arg Ala Lys Lys 130 135 140 Ser Thr Ile Ala Ala Ala Lys Ile Val Leu Asp Ala Ala Val Lys Ala 145 150 155 160 Gly Ala Pro Glu Gly Ile Ile Gly Trp Ile Asp Glu Pro Ser Ile Glu 165 170 175 Leu Ser Gln Val Val Met Gly Glu Ala Asn Leu Ile Leu Ala Thr Gly 180 185 190 Gly Pro Gly Met Val Lys Ala Ala Tyr Ser Ser Gly Lys Pro Ala Val 195 200 205 Gly Val Gly Pro Gly Asn Thr Pro Ala Val Ile Asp Glu Ser Ala Asp 210 215 220 Ile Lys Met Ala Val Asn Ser Ile Leu Leu Ser Lys Thr Phe Asp Asn 225 230 235 240 Gly Met Ile Cys Ala Ser Glu Gln Ser Val Ile Val Leu Asp Ser Ile 245 250 255 Tyr Glu Glu Val Lys Lys Glu Phe Ala Tyr Arg Gly Ala Tyr Ile Leu 260 265 270 Ser Lys Asp Glu Thr Asp Lys Val Gly Lys Ile Ile Leu Lys Asn Gly 275 280 285 Ala Leu Asn Ala Gly Ile Val Gly Gln Pro Ala Phe Lys Ile Ala Gln 290 295 300 Leu Ala Gly Val Asp Val Pro Glu Lys Ala Lys Val Leu Ile Gly Glu 305 310 315 320 Val Glu Ser Val Glu Leu Glu Glu Pro Phe Ser His Glu Lys Leu Ser 325 330 335 Pro Val Leu Ala Met Tyr Arg Ala Arg Asn Phe Glu Asp Ala Ile Ala 340 345 350 Lys Thr Asp Lys Leu Val Arg Ala Gly Gly Phe Gly His Thr Ser Ser 355 360 365 Leu Tyr Ile Asn Pro Met Thr Glu Lys Ala Lys Val Glu Lys Phe Ser 370 375 380 Thr Met Met Lys Thr Ser Arg Thr Ile Ile Asn Thr Pro Ser Ser Gln 385 390 395 400 Gly Gly Ile Gly Asp Ile Tyr Asn Phe Lys Leu Ala Pro Ser Leu Thr 405 410 415 Leu Gly Cys Gly Ser Trp Gly Gly Asn Ser Val Ser Glu Asn Val Gly 420 425 430 Pro Lys His Leu Leu Asn Ile Lys Ser Val Ala Glu Arg Arg Glu Asn 435 440 445 Met Leu Trp Phe Arg Val Pro Glu Lys Val Tyr Phe Lys Tyr Gly Ser 450 455 460 Leu Gly Val Ala Leu Lys Glu Leu Lys Val Met Asn Lys Lys Lys Val 465 470 475 480 Phe Ile Val Thr Asp Lys Val Leu Tyr Gln Leu Gly Tyr Val Asp Lys 485 490 495 Val Thr Lys Val Leu Glu Glu Leu Lys Ile Ser Tyr Lys Val Phe Thr 500 505 510 Asp Val Glu Pro Asp Pro Thr Leu Ala Thr Ala Lys Lys Gly Ala Ala 515 520 525 Glu Leu Leu Ser Tyr Glu Pro Asp Thr Ile Ile Ser Val Gly Gly Gly 530 535 540 Ser Ala Met Asp Ala Ala Lys Ile Met Trp Val Met Tyr Glu His Pro 545 550 555 560 Glu Val Lys Phe Glu Asp Leu Ala Met Arg Phe Met Asp Ile Arg Lys 565 570 575 Arg Val Tyr Val Phe Pro Lys Met Gly Glu Lys Ala Met Met Ile Ser 580 585 590 Val Ala Thr Ser Ala Gly Thr Gly Ser Glu Val Thr Pro Phe Ala Val 595 600 605 Ile Thr Asp Glu Lys Thr Gly Ala Lys Tyr Pro Leu Ala Asp Tyr Glu 610 615 620 Leu Thr Pro Asp Met Ala Ile Val Asp Ala Glu Leu Met Met Gly Met 625 630 635 640 Pro Arg Gly Leu Thr Ala Ala Ser Gly Ile Asp Ala Leu Thr His Ala 645 650 655 Leu Glu Ala Tyr Val Ser Ile Met Ala Thr Glu Phe Thr Asn Gly Leu 660 665 670 Ala Leu Glu Ala Val Lys Leu Ile Phe Glu Tyr Leu Pro Lys Ala Tyr 675 680 685 Thr Glu Gly Thr Thr Asn Val Lys Ala Arg Glu Lys Met Ala His Ala 690 695 700 Ser Cys Ile Ala Gly Met Ala Phe Ala Asn Ala Phe Leu Gly Val Cys 705 710 715 720 His Ser Met Ala His Lys Leu Gly Ala Gln His His Ile Pro His Gly 725 730 735 Ile Ala Asn Ala Leu Met Ile Asp Glu Val Ile Lys Phe Asn Ala Val 740 745 750 Asp Asp Pro Ile Lys Gln Ala Ala Phe Pro Gln Tyr Glu Tyr Pro Asn 755 760 765 Ala Arg Tyr Arg Tyr Ala Gln Ile Ala Asp Cys Leu Asn Leu Gly Gly 770 775 780 Asn Thr Glu Glu Glu Lys Val Gln Leu Leu Ile Asn Ala Ile Asp Asp 785 790 795 800 Leu Lys Ala Lys Leu Asn Ile Pro Glu Thr Ile Lys Glu Ala Gly Val 805 810 815 Ser Glu Asp Lys Phe Tyr Ala Thr Leu Asp Lys Met Ser Glu Leu Ala 820 825 830 Phe Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr Pro Leu Ile Ser 835 840 845 Glu Ile Lys Gln Met Tyr Ile Asn Val Phe Asp Lys Thr Glu Pro Ile 850 855 860 Val Glu Asp Glu Glu Lys 865 870 <210> SEQ ID NO 79 <211> LENGTH: 877 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: AdhE2, WP_023163373.1 <400> SEQUENCE: 79 Met Lys Val Thr Lys Val Thr Asn Val Glu Glu Leu Met Lys Lys Leu 1 5 10 15 Asp Glu Val Thr Ala Ala Gln Lys Lys Phe Ser Ser Tyr Thr Gln Glu 20 25 30 Gln Val Asp Glu Ile Phe Arg Gln Ala Ala Met Ala Ala Asn Ser Ala 35 40 45 Arg Ile Asp Leu Ala Lys Met Ala Val Glu Glu Ser Gly Met Gly Ile 50 55 60 Val Glu Asp Lys Val Ile Lys Asn His Phe Val Ala Glu Tyr Ile Tyr 65 70 75 80 Asn Lys Tyr Lys Gly Glu Lys Thr Cys Gly Val Leu Glu Gln Asp Glu 85 90 95

Gly Phe Gly Met Val Arg Ile Ala Glu Pro Val Gly Val Ile Ala Ala 100 105 110 Val Val Pro Thr Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu 115 120 125 Ile Ala Leu Lys Thr Arg Asn Gly Ile Val Phe Ser Pro His Pro Arg 130 135 140 Ala Lys Lys Ser Thr Ile Ala Ala Ala Lys Ile Val Leu Asp Ala Ala 145 150 155 160 Val Lys Ala Gly Ala Pro Glu Gly Ile Ile Gly Trp Ile Asp Glu Pro 165 170 175 Ser Ile Glu Leu Ser Gln Val Val Met Lys Glu Ala Asp Leu Ile Leu 180 185 190 Ala Thr Gly Gly Pro Gly Met Val Lys Ala Ala Tyr Ser Ser Gly Lys 195 200 205 Pro Ala Ile Gly Val Gly Pro Gly Asn Thr Pro Ala Val Ile Asp Glu 210 215 220 Ser Ala Asp Ile Lys Met Ala Val Asn Ser Ile Leu Leu Ser Lys Thr 225 230 235 240 Phe Asp Asn Gly Met Ile Cys Ala Ser Glu Gln Ser Val Ile Val Ala 245 250 255 Ser Ser Ile Tyr Asp Glu Val Lys Lys Glu Phe Ala Asp Arg Gly Ala 260 265 270 Tyr Ile Leu Ser Lys Asp Glu Thr Asp Lys Val Gly Lys Thr Ile Met 275 280 285 Ile Asn Gly Ala Leu Asn Ala Gly Ile Val Gly Gln Ser Ala Phe Lys 290 295 300 Ile Ala Gln Met Ala Gly Val Ser Val Pro Glu Asp Ala Lys Ile Leu 305 310 315 320 Ile Gly Glu Val Lys Ser Val Glu Pro Glu Glu Glu Pro Phe Ala His 325 330 335 Glu Lys Leu Ser Pro Val Leu Ala Met Tyr Lys Ala Lys Asp Phe Asp 340 345 350 Glu Ala Leu Leu Lys Ala Gly Arg Leu Val Glu Arg Gly Gly Ile Gly 355 360 365 His Thr Ser Val Leu Tyr Val Asn Ser Met Thr Glu Lys Val Lys Val 370 375 380 Glu Lys Phe Arg Glu Thr Met Lys Thr Gly Arg Thr Leu Ile Asn Met 385 390 395 400 Pro Ser Ala Gln Gly Ala Ile Gly Asp Ile Tyr Asn Phe Lys Leu Ala 405 410 415 Pro Ser Leu Thr Leu Gly Cys Gly Ser Trp Gly Gly Asn Ser Val Ser 420 425 430 Glu Asn Val Gly Pro Lys His Leu Leu Asn Ile Lys Ser Val Ala Glu 435 440 445 Arg Arg Glu Asn Met Leu Trp Phe Arg Val Pro Glu Lys Val Tyr Phe 450 455 460 Lys Tyr Gly Ser Leu Gly Val Ala Leu Lys Glu Leu Arg Ile Met Glu 465 470 475 480 Lys Lys Lys Ala Phe Ile Val Thr Asp Lys Val Leu Tyr Gln Leu Gly 485 490 495 Tyr Val Asp Lys Ile Thr Lys Asn Leu Asp Glu Leu Arg Val Ser Tyr 500 505 510 Lys Ile Phe Thr Asp Val Glu Pro Asp Pro Thr Leu Ala Thr Ala Lys 515 520 525 Lys Gly Ala Ala Glu Leu Leu Ser Tyr Glu Pro Asp Thr Ile Ile Ala 530 535 540 Val Gly Gly Gly Ser Ala Met Asp Ala Ala Lys Ile Met Trp Val Met 545 550 555 560 Tyr Glu His Pro Glu Val Arg Phe Glu Asp Leu Ala Met Arg Phe Met 565 570 575 Asp Ile Arg Lys Arg Val Tyr Val Phe Pro Lys Met Gly Glu Lys Ala 580 585 590 Met Met Ile Ser Val Ala Thr Ser Ala Gly Thr Gly Ser Glu Val Thr 595 600 605 Pro Phe Ala Val Ile Thr Asp Glu Arg Thr Gly Ala Lys Tyr Pro Leu 610 615 620 Ala Asp Tyr Glu Leu Thr Pro Asn Met Ala Ile Val Asp Ala Glu Leu 625 630 635 640 Met Met Gly Met Pro Lys Gly Leu Thr Ala Ala Ser Gly Ile Asp Ala 645 650 655 Leu Thr His Ala Leu Glu Ala Tyr Val Ser Ile Met Ala Ser Glu Tyr 660 665 670 Thr Asn Gly Leu Ala Leu Glu Ala Thr Arg Leu Val Phe Lys Tyr Leu 675 680 685 Pro Ile Ala Tyr Thr Glu Gly Thr Ile Asn Val Lys Ala Arg Glu Lys 690 695 700 Met Ala His Ala Ser Cys Ile Ala Gly Met Ala Phe Ala Asn Ala Phe 705 710 715 720 Leu Gly Val Cys His Ser Met Ala His Lys Leu Gly Ala Gln His His 725 730 735 Ile Pro His Gly Ile Ala Asn Ala Leu Met Ile Asp Glu Val Ile Lys 740 745 750 Phe Asn Ala Val Glu Ala Pro Arg Lys Gln Ala Ala Phe Pro Gln Tyr 755 760 765 Lys Tyr Pro Asn Val Lys Arg Arg Tyr Ala Arg Ile Ala Asp Tyr Leu 770 775 780 Asn Leu Gly Gly Ser Thr Asp Asp Glu Lys Val Gln Leu Leu Ile Asn 785 790 795 800 Ala Ile Asp Asp Leu Lys Thr Lys Leu Asn Ile Pro Lys Thr Ile Lys 805 810 815 Glu Ala Gly Val Ser Glu Asp Lys Phe Tyr Ala Thr Leu Asp Thr Met 820 825 830 Ser Glu Leu Ala Phe Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr 835 840 845 Pro Leu Ile Gly Glu Ile Lys Gln Met Tyr Ile Asn Ala Phe Asp Thr 850 855 860 Pro Lys Ala Thr Val Glu Lys Lys Thr Arg Lys Lys Lys 865 870 875 <210> SEQ ID NO 80 <211> LENGTH: 468 <212> TYPE: PRT <213> ORGANISM: Clostridium saccharoperbutylacetonicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Bld, AAP42563.1 <400> SEQUENCE: 80 Met Ile Lys Asp Thr Leu Val Ser Ile Thr Lys Asp Leu Lys Leu Lys 1 5 10 15 Thr Asn Val Glu Asn Ala Asn Leu Lys Asn Tyr Lys Asp Asp Ser Ser 20 25 30 Cys Phe Gly Val Phe Glu Asn Val Glu Asn Ala Ile Ser Asn Ala Val 35 40 45 His Ala Gln Lys Ile Leu Ser Leu His Tyr Thr Lys Glu Gln Arg Glu 50 55 60 Lys Ile Ile Thr Glu Ile Arg Lys Ala Ala Leu Glu Asn Lys Glu Ile 65 70 75 80 Leu Ala Thr Met Ile Leu Glu Glu Thr His Met Gly Arg Tyr Glu Asp 85 90 95 Lys Ile Leu Lys His Glu Leu Val Ala Lys Tyr Thr Pro Gly Thr Glu 100 105 110 Asp Leu Thr Thr Thr Ala Trp Ser Gly Asp Asn Gly Leu Thr Val Val 115 120 125 Glu Met Ser Pro Tyr Gly Val Ile Gly Ala Ile Thr Pro Ser Thr Asn 130 135 140 Pro Thr Glu Thr Val Ile Cys Asn Ser Ile Gly Met Ile Ala Ala Gly 145 150 155 160 Asn Thr Val Val Phe Asn Gly His Pro Gly Ala Lys Lys Cys Val Ala 165 170 175 Phe Ala Val Glu Met Ile Asn Lys Ala Ile Ile Ser Cys Gly Gly Pro 180 185 190 Glu Asn Leu Val Thr Thr Ile Lys Asn Pro Thr Met Asp Ser Leu Asp 195 200 205 Ala Ile Ile Lys His Pro Ser Ile Lys Leu Leu Cys Gly Thr Gly Gly 210 215 220 Pro Gly Met Val Lys Thr Leu Leu Asn Ser Gly Lys Lys Ala Ile Gly 225 230 235 240 Ala Gly Ala Gly Asn Pro Pro Val Ile Val Asp Asp Thr Ala Asp Ile 245 250 255 Glu Lys Ala Gly Lys Ser Ile Ile Glu Gly Cys Ser Phe Asp Asn Asn 260 265 270 Leu Pro Cys Ile Ala Glu Lys Glu Val Phe Val Phe Glu Asn Val Ala 275 280 285 Asp Asp Leu Ile Ser Asn Met Leu Lys Asn Asn Ala Val Ile Ile Asn 290 295 300 Glu Asp Gln Val Ser Lys Leu Ile Asp Leu Val Leu Gln Lys Asn Asn 305 310 315 320 Glu Thr Gln Glu Tyr Ser Ile Asn Lys Lys Trp Val Gly Lys Asp Ala 325 330 335 Lys Leu Phe Leu Asp Glu Ile Asp Val Glu Ser Pro Ser Ser Val Lys 340 345 350 Cys Ile Ile Cys Glu Val Ser Ala Arg His Pro Phe Val Met Thr Glu 355 360 365 Leu Met Met Pro Ile Leu Pro Ile Val Arg Val Lys Asp Ile Asp Glu 370 375 380 Ala Ile Glu Tyr Ala Lys Ile Ala Glu Gln Asn Arg Lys His Ser Ala 385 390 395 400 Tyr Ile Tyr Ser Lys Asn Ile Asp Asn Leu Asn Arg Phe Glu Arg Glu 405 410 415 Ile Asp Thr Thr Ile Phe Val Lys Asn Ala Lys Ser Phe Ala Gly Val 420 425 430 Gly Tyr Glu Ala Glu Gly Phe Thr Thr Phe Thr Ile Ala Gly Ser Thr 435 440 445 Gly Glu Gly Ile Thr Ser Ala Arg Asn Phe Thr Arg Gln Arg Arg Cys 450 455 460 Val Leu Ala Gly 465 <210> SEQ ID NO 81 <211> LENGTH: 562

<212> TYPE: PRT <213> ORGANISM: Aquincola tertiaricarbonis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: HcmAB, large subunit, AFK77668.1 <400> SEQUENCE: 81 Met Thr Trp Leu Glu Pro Gln Ile Lys Ser Gln Leu Gln Ser Glu Arg 1 5 10 15 Lys Asp Trp Glu Ala Asn Glu Val Gly Ala Phe Leu Lys Lys Ala Pro 20 25 30 Glu Arg Lys Glu Gln Phe His Thr Ile Gly Asp Phe Pro Val Gln Arg 35 40 45 Thr Tyr Thr Ala Ala Asp Ile Ala Asp Thr Pro Leu Glu Asp Ile Gly 50 55 60 Leu Pro Gly Arg Tyr Pro Phe Thr Arg Gly Pro Tyr Pro Thr Met Tyr 65 70 75 80 Arg Ser Arg Thr Trp Thr Met Arg Gln Ile Ala Gly Phe Gly Thr Gly 85 90 95 Glu Asp Thr Asn Lys Arg Phe Lys Tyr Leu Ile Ala Gln Gly Gln Thr 100 105 110 Gly Ile Ser Thr Asp Phe Asp Met Pro Thr Leu Met Gly Tyr Asp Ser 115 120 125 Asp His Pro Met Ser Asp Gly Glu Val Gly Arg Glu Gly Val Ala Ile 130 135 140 Asp Thr Leu Ala Asp Met Glu Ala Leu Leu Ala Asp Ile Asp Leu Glu 145 150 155 160 Lys Ile Ser Val Ser Phe Thr Ile Asn Pro Ser Ala Trp Ile Leu Leu 165 170 175 Ala Met Tyr Val Ala Leu Gly Glu Lys Arg Gly Tyr Asp Leu Asn Lys 180 185 190 Leu Ser Gly Thr Val Gln Ala Asp Ile Leu Lys Glu Tyr Met Ala Gln 195 200 205 Lys Glu Tyr Ile Tyr Pro Ile Ala Pro Ser Val Arg Ile Val Arg Asp 210 215 220 Ile Ile Thr Tyr Ser Ala Lys Asn Leu Lys Arg Tyr Asn Pro Ile Asn 225 230 235 240 Ile Ser Gly Tyr His Ile Ser Glu Ala Gly Ser Ser Pro Leu Gln Glu 245 250 255 Ala Ala Phe Thr Leu Ala Asn Leu Ile Thr Tyr Val Asn Glu Val Thr 260 265 270 Lys Thr Gly Met His Val Asp Glu Phe Ala Pro Arg Leu Ala Phe Phe 275 280 285 Phe Val Ser Gln Gly Asp Phe Phe Glu Glu Val Ala Lys Phe Arg Ala 290 295 300 Leu Arg Arg Cys Tyr Ala Lys Ile Met Lys Glu Arg Phe Gly Ala Arg 305 310 315 320 Asn Pro Glu Ser Met Arg Leu Arg Phe His Cys Gln Thr Ala Ala Ala 325 330 335 Thr Leu Thr Lys Pro Gln Tyr Met Val Asn Val Val Arg Thr Ser Leu 340 345 350 Gln Ala Leu Ser Ala Val Leu Gly Gly Ala Gln Ser Leu His Thr Asn 355 360 365 Gly Tyr Asp Glu Ala Phe Ala Ile Pro Thr Glu Asp Ala Met Lys Met 370 375 380 Ala Leu Arg Thr Gln Gln Ile Ile Ala Glu Glu Ser Gly Val Ala Asp 385 390 395 400 Val Ile Asp Pro Leu Gly Gly Ser Tyr Tyr Val Glu Ala Leu Thr Thr 405 410 415 Glu Tyr Glu Lys Lys Ile Phe Glu Ile Leu Glu Glu Val Glu Lys Arg 420 425 430 Gly Gly Thr Ile Lys Leu Ile Glu Gln Gly Trp Phe Gln Lys Gln Ile 435 440 445 Ala Asp Phe Ala Tyr Glu Thr Ala Leu Arg Lys Gln Ser Gly Gln Lys 450 455 460 Pro Val Ile Gly Val Asn Arg Phe Val Glu Asn Glu Glu Asp Val Lys 465 470 475 480 Ile Glu Ile His Pro Tyr Asp Asn Thr Thr Ala Glu Arg Gln Ile Ser 485 490 495 Arg Thr Arg Arg Val Arg Ala Glu Arg Asp Glu Ala Lys Val Gln Ala 500 505 510 Met Leu Asp Gln Leu Val Ala Val Ala Lys Asp Glu Ser Gln Asn Leu 515 520 525 Met Pro Leu Thr Ile Glu Leu Val Lys Ala Gly Ala Thr Met Gly Asp 530 535 540 Ile Val Glu Lys Leu Lys Gly Ile Trp Gly Thr Tyr Arg Glu Thr Pro 545 550 555 560 Val Phe <210> SEQ ID NO 82 <211> LENGTH: 136 <212> TYPE: PRT <213> ORGANISM: Aquincola tertiaricarbonis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: HcmAB, small subunit, AFK77665.1 <400> SEQUENCE: 82 Met Asp Gln Thr Pro Ile Arg Val Leu Leu Ala Lys Val Gly Leu Asp 1 5 10 15 Gly His Asp Arg Gly Val Lys Val Val Ala Arg Ala Leu Arg Asp Ala 20 25 30 Gly Met Asp Val Ile Tyr Ser Gly Leu His Arg Thr Pro Glu Glu Val 35 40 45 Val Asn Thr Ala Ile Gln Glu Asp Val Asp Val Leu Gly Val Ser Leu 50 55 60 Leu Ser Gly Val Gln Leu Thr Val Phe Pro Lys Ile Phe Lys Leu Leu 65 70 75 80 Asp Glu Arg Gly Ala Gly Asp Leu Ile Val Ile Ala Gly Gly Val Met 85 90 95 Pro Asp Glu Asp Ala Ala Ala Ile Arg Lys Leu Gly Val Arg Glu Val 100 105 110 Leu Leu Gln Asp Thr Pro Pro Gln Ala Ile Ile Asp Ser Ile Arg Ser 115 120 125 Leu Val Ala Ala Arg Gly Ala Arg 130 135 <210> SEQ ID NO 83 <211> LENGTH: 563 <212> TYPE: PRT <213> ORGANISM: Kyrpidia tusciae <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: HcmAB, large subunit, WP_013074530.1 <400> SEQUENCE: 83 Met Ala Asp Gln Glu Lys Leu Phe Asn Gly Asp Glu Ile Arg Arg Ile 1 5 10 15 Arg Gln Glu Lys Glu Arg Trp Tyr Arg Glu Thr Val Lys Gly Asn Asp 20 25 30 Gly Gly Asn Asp Tyr Val Thr Asp Ser Gly Ile Pro Val Asn Leu Ile 35 40 45 Tyr Gly Pro Asp Asp Ile Ala Asp Phe Asp Tyr Leu Lys Glu Ser Gly 50 55 60 Phe Ser Gly Glu Pro Pro Tyr Val Arg Gly Val Tyr Pro Asn Met Tyr 65 70 75 80 Arg Gly Arg Leu Phe Thr Ile Arg Gln Ile Ala Gly Phe Gly Thr Pro 85 90 95 Glu Asp Thr Asn Arg Arg Phe Lys Phe Leu Leu Glu Asn Gly Ala Thr 100 105 110 Gly Thr Ser Val Val Leu Asp Leu Pro Thr Ile Arg Gly Tyr Asp Ser 115 120 125 Asp Asp Pro Lys Ala Glu Gly His Val Gly Ala Ala Gly Val Ala Ile 130 135 140 Asp Ser Leu Glu Asp Met Glu Ala Leu Tyr Asp Gly Ile Pro Ile Asp 145 150 155 160 Gln Val Ser Ser Asn Ile Val Thr His Leu Pro Ser Thr Thr Val Val 165 170 175 Leu Met Ala Met Phe Val Ala Met Ala Glu Lys Arg Gly Leu Pro Leu 180 185 190 Glu Lys Leu Ser Gly Thr Asn Gln Asn Asp Phe Leu Met Glu Thr Thr 195 200 205 Ile Gly Ser Ser Leu Glu Ile Leu Pro Pro Lys Ala Ser Phe Arg Leu 210 215 220 Gln Cys Asp Ser Ile Glu Tyr Ala Ser Lys Arg Leu Pro Arg Trp Asn 225 230 235 240 Pro Val Ser Tyr Asn Gly Tyr Asn Leu Arg Glu Ala Gly Thr Thr Ala 245 250 255 Val Gln Glu Val Gly Cys Ala Ile Ala Asn Ala Ile Ala Thr Thr Glu 260 265 270 Glu Leu Ile Arg Arg Gly Asn Asp Val Asp Asp Phe Ala Lys Arg Leu 275 280 285 Ser Phe Phe Trp Asn Leu Phe Asn Asp Phe Phe Glu Glu Ile Ala Lys 290 295 300 Cys Arg Ala Ser Arg Leu Val Trp Tyr Asp Val Met Lys Asn Arg Phe 305 310 315 320 Gly Ala Lys Asn Pro Arg Ser Tyr Leu Met Arg Phe His Val Gln Thr 325 330 335 Gly Gly Ile Thr Leu Thr Lys Val Glu Pro Leu Asn Asn Ile Ala Arg 340 345 350 Ser Ala Ile Gln Gly Leu Ala Ala Val Leu Gly Gly Ala Gln Ser Leu 355 360 365 His Ile Asp Ser Tyr Asp Glu Ala Tyr Ser Ala Pro Thr Glu Gln Ala 370 375 380 Ala Leu Val Ser Leu Arg Thr Gln Gln Ile Ile Gln Val Glu Thr Gly 385 390 395 400 Val Val Asn Thr Val Asp Pro Leu Ala Gly Ser Tyr Tyr Val Glu Tyr 405 410 415 Leu Thr Arg Glu Met Ala Glu His Ile Arg Ala Tyr Ile Asp Gln Ile 420 425 430 Glu Ser Arg Gly Gly Ile Ile Ala Val Val Glu Ser Gly Trp Leu His 435 440 445 Arg Glu Ile Ala Glu Phe Ala Tyr Arg Thr Gln Gln Asp Ile Glu Thr 450 455 460

Gly Lys Arg Lys Val Val Gly Leu Asn Tyr Phe Pro Ser Lys Glu Ala 465 470 475 480 Glu Thr Lys Val Glu Val Phe Arg Tyr Pro Glu Asp Ala Glu Arg Met 485 490 495 Gln Lys Glu Lys Leu Ala Lys Leu Arg Ala Arg Arg Asp Pro Val Lys 500 505 510 Val Glu Gln Thr Leu Arg Val Leu Arg Glu Lys Cys His Glu Asp Val 515 520 525 Asn Ile Leu Pro Tyr Val Lys Asp Ala Val Glu Ala Tyr Cys Thr Leu 530 535 540 Gly Glu Ile Gln Asn Val Phe Arg Glu Glu Phe Gly Leu Trp Gln Phe 545 550 555 560 Pro Leu Val <210> SEQ ID NO 84 <211> LENGTH: 132 <212> TYPE: PRT <213> ORGANISM: Kyrpidia tusciae <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: HcmAB, small subunit, WP_013074531.1 <400> SEQUENCE: 84 Met Glu Lys Lys Ile Lys Val Ile Met Val Lys Leu Gly Leu Asp Ile 1 5 10 15 His Trp Arg Gly Ala Leu Val Val Ser Lys Met Leu Arg Asp Arg Gly 20 25 30 Met Glu Val Val Tyr Leu Gly Asn Leu Phe Pro Glu Gln Ile Val Gln 35 40 45 Ala Ala Val Gln Glu Gly Ala Asp Val Val Gly Leu Ser Thr Leu Gly 50 55 60 Gly Asn His Leu Thr Leu Gly Pro Lys Val Val Glu Leu Leu Arg Ala 65 70 75 80 Lys Gly Met Glu Glu Val Leu Val Ile Met Gly Gly Val Ile Pro Glu 85 90 95 Glu Asp Val Pro Ala Leu Lys Glu Ala Gly Ile Ala Glu Val Phe Gly 100 105 110 Pro Glu Thr Pro Ile Asp Ala Ile Glu Ser Phe Ile Arg Ser Arg Phe 115 120 125 Pro Asp Arg Asp 130 <210> SEQ ID NO 85 <211> LENGTH: 327 <212> TYPE: PRT <213> ORGANISM: Aquincola tertiaricarbonis <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: MeaB, AFK77667.1 <400> SEQUENCE: 85 Met Thr Tyr Val Pro Ser Ser Ala Leu Leu Glu Gln Leu Arg Ala Gly 1 5 10 15 Asn Thr Trp Ala Leu Gly Arg Leu Ile Ser Arg Ala Glu Ala Gly Val 20 25 30 Ala Glu Ala Arg Pro Ala Leu Ala Glu Val Tyr Arg His Ala Gly Ser 35 40 45 Ala His Val Ile Gly Leu Thr Gly Val Pro Gly Ser Gly Lys Ser Thr 50 55 60 Leu Val Ala Lys Leu Thr Ala Ala Leu Arg Lys Arg Gly Glu Lys Val 65 70 75 80 Gly Ile Val Ala Ile Asp Pro Ser Ser Pro Tyr Ser Gly Gly Ala Ile 85 90 95 Leu Gly Asp Arg Ile Arg Met Thr Glu Leu Ala Asn Asp Ser Gly Val 100 105 110 Phe Ile Arg Ser Met Ala Thr Arg Gly Ala Thr Gly Gly Met Ala Arg 115 120 125 Ala Ala Leu Asp Ala Val Asp Leu Leu Asp Val Ala Gly Tyr His Thr 130 135 140 Ile Ile Leu Glu Thr Val Gly Val Gly Gln Asp Glu Val Glu Val Ala 145 150 155 160 His Ala Ser Asp Thr Thr Val Val Val Ser Ala Pro Gly Leu Gly Asp 165 170 175 Glu Ile Gln Ala Ile Lys Ala Gly Val Leu Glu Ile Ala Asp Ile His 180 185 190 Val Val Ser Lys Cys Asp Arg Asp Asp Ala Asn Arg Thr Leu Thr Asp 195 200 205 Leu Lys Gln Met Leu Thr Leu Gly Thr Met Val Gly Pro Lys Arg Ala 210 215 220 Trp Ala Ile Pro Val Val Gly Val Ser Ser Tyr Thr Gly Glu Gly Val 225 230 235 240 Asp Asp Leu Leu Gly Arg Ile Ala Ala His Arg Gln Ala Thr Ala Asp 245 250 255 Thr Glu Leu Gly Arg Glu Arg Arg Arg Arg Val Ala Glu Phe Arg Leu 260 265 270 Gln Lys Thr Ala Glu Thr Leu Leu Leu Glu Arg Phe Thr Thr Gly Ala 275 280 285 Gln Pro Phe Ser Pro Ala Leu Ala Asp Ser Leu Ser Asn Arg Ala Ser 290 295 300 Asp Pro Tyr Ala Ala Ala Arg Glu Leu Ile Ala Arg Thr Ile Arg Lys 305 310 315 320 Glu Tyr Ser Asn Asp Leu Ala 325 <210> SEQ ID NO 86 <211> LENGTH: 312 <212> TYPE: PRT <213> ORGANISM: Kyrpidia tusciae <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: MeaB, WP_013074529.1 <400> SEQUENCE: 86 Met Gln Glu Leu Leu Ser Arg Phe Asp Ala Gly Asp Pro Val Ala Leu 1 5 10 15 Gly Lys Leu Leu Lys Glu Val Glu Asn Gly Thr Ser Ser Gly Lys Glu 20 25 30 Ala Leu Arg Cys Thr Ala Ser Arg Gln Gly Arg Ala His Val Val Gly 35 40 45 Ile Thr Gly Pro Pro Gly Ala Gly Lys Ser Thr Leu Thr Ala Lys Leu 50 55 60 Ser Lys Arg Trp Ala Glu Ala Gly Arg Glu Val Gly Ile Val Cys Val 65 70 75 80 Asp Pro Thr Ser Pro Phe Ser Gly Gly Ala Leu Leu Gly Asp Arg Ile 85 90 95 Arg Met Leu Glu Leu Ser Ser Phe Pro Asn Val Phe Ile Lys Ser Leu 100 105 110 Ala Thr Arg Gly Ser Leu Gly Gly Met Ala Ala Ser Thr Ala Asp Ile 115 120 125 Ile Gln Leu Met Asp Ala Tyr Gly Lys Glu Val Val Val Val Glu Thr 130 135 140 Val Gly Val Gly Gln Val Glu Phe Asp Val Met Asp Leu Ser Asp Thr 145 150 155 160 Val Val Leu Val Asn Val Pro Gly Leu Gly Asp Ser Ile Gln Ala Leu 165 170 175 Lys Ala Gly Ile Leu Glu Ile Ala Asp Ile Phe Val Ile Asn Gln Ala 180 185 190 Asp Arg Pro Gly Ala Glu Asp Ser Val Arg Asp Leu Arg Gln Met Leu 195 200 205 Ala Asp Arg Lys Glu Thr Gly Trp Leu Trp Pro Val Val Lys Thr Val 210 215 220 Ala Thr Arg Gly Glu Gly Ile Asp Arg Leu Ala Glu Ala Ile Glu Ser 225 230 235 240 His Arg Ala Tyr Leu Lys Arg Glu Gln Leu Trp Glu Glu Lys Arg Cys 245 250 255 Arg Arg Asn Arg Gln Arg Leu Met Gln Glu Met Asp Arg Leu Phe Arg 260 265 270 Gln His Val Leu Thr Arg Ile Arg Thr Asp Pro Thr Ala Arg Ala Leu 275 280 285 Phe Glu Glu Val Glu Lys Gly Thr Gln Asp Pro Tyr Ser Ala Ala Arg 290 295 300 His Leu Phe Gln Glu Ile Val Asn 305 310 <210> SEQ ID NO 87 <211> LENGTH: 301 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Ptb, WP_010966357.1 <400> SEQUENCE: 87 Met Ile Lys Ser Phe Asn Glu Ile Ile Met Lys Val Lys Ser Lys Glu 1 5 10 15 Met Lys Lys Val Ala Val Ala Val Ala Gln Asp Glu Pro Val Leu Glu 20 25 30 Ala Val Arg Asp Ala Lys Lys Asn Gly Ile Ala Asp Ala Ile Leu Val 35 40 45 Gly Asp His Asp Glu Ile Val Ser Ile Ala Leu Lys Ile Gly Met Asp 50 55 60 Val Asn Asp Phe Glu Ile Val Asn Glu Pro Asn Val Lys Lys Ala Ala 65 70 75 80 Leu Lys Ala Val Glu Leu Val Ser Thr Gly Lys Ala Asp Met Val Met 85 90 95 Lys Gly Leu Val Asn Thr Ala Thr Phe Leu Arg Ser Val Leu Asn Lys 100 105 110 Glu Val Gly Leu Arg Thr Gly Lys Thr Met Ser His Val Ala Val Phe 115 120 125 Glu Thr Glu Lys Phe Asp Arg Leu Leu Phe Leu Thr Asp Val Ala Phe 130 135 140 Asn Thr Tyr Pro Glu Leu Lys Glu Lys Ile Asp Ile Val Asn Asn Ser 145 150 155 160 Val Lys Val Ala His Ala Ile Gly Ile Glu Asn Pro Lys Val Ala Pro 165 170 175 Ile Cys Ala Val Glu Val Ile Asn Pro Lys Met Pro Ser Thr Leu Asp 180 185 190 Ala Ala Met Leu Ser Lys Met Ser Asp Arg Gly Gln Ile Lys Gly Cys 195 200 205

Val Val Asp Gly Pro Leu Ala Leu Asp Ile Ala Leu Ser Glu Glu Ala 210 215 220 Ala His His Lys Gly Val Thr Gly Glu Val Ala Gly Lys Ala Asp Ile 225 230 235 240 Phe Leu Met Pro Asn Ile Glu Thr Gly Asn Val Met Tyr Lys Thr Leu 245 250 255 Thr Tyr Thr Thr Asp Ser Lys Asn Gly Gly Ile Leu Val Gly Thr Ser 260 265 270 Ala Pro Val Val Leu Thr Ser Arg Ala Asp Ser His Glu Thr Lys Met 275 280 285 Asn Ser Ile Ala Leu Ala Ala Leu Val Ala Gly Asn Lys 290 295 300 <210> SEQ ID NO 88 <211> LENGTH: 302 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Ptb <400> SEQUENCE: 88 Met Ser Lys Asn Phe Asp Glu Leu Leu Ser Arg Leu Lys Glu Val Pro 1 5 10 15 Thr Lys Lys Val Ala Val Ala Val Ala Gln Asp Glu Pro Val Leu Glu 20 25 30 Ala Ile Lys Glu Ala Thr Glu Asn Asn Ile Ala Glu Ala Ile Leu Val 35 40 45 Gly Asp Lys Gln Gln Ile His Glu Ile Ala Lys Lys Ile Asn Leu Asp 50 55 60 Leu Ser Asp Tyr Glu Ile Met Asp Ile Lys Asp Pro Lys Lys Ala Thr 65 70 75 80 Leu Glu Ala Val Lys Leu Val Ser Ser Gly His Ala Asp Met Leu Met 85 90 95 Lys Gly Leu Val Asp Thr Ala Thr Phe Leu Arg Ser Val Leu Asn Lys 100 105 110 Glu Val Gly Leu Arg Thr Gly Lys Leu Met Ser His Val Ala Val Phe 115 120 125 Asp Val Glu Gly Trp Asp Arg Leu Leu Phe Leu Thr Asp Ala Ala Phe 130 135 140 Asn Thr Tyr Pro Glu Phe Lys Asp Lys Val Gly Met Ile Asn Asn Ala 145 150 155 160 Val Val Val Ala His Ala Cys Gly Ile Asp Val Pro Arg Ile Ala Pro 165 170 175 Ile Cys Pro Val Glu Val Val Asn Thr Ser Met Gln Ser Thr Val Asp 180 185 190 Ala Ala Leu Leu Ala Lys Met Ser Asp Arg Gly Gln Ile Lys Gly Cys 195 200 205 Ile Ile Asp Gly Pro Phe Ala Leu Asp Asn Ala Ile Ser Glu Glu Ala 210 215 220 Ala His His Lys Gly Val Thr Gly Ser Val Ala Gly Lys Ala Asp Ile 225 230 235 240 Leu Leu Leu Pro Asn Ile Glu Ala Ala Asn Val Met Tyr Lys Thr Leu 245 250 255 Thr Tyr Phe Ser Lys Ser Arg Asn Gly Gly Leu Leu Val Gly Thr Ser 260 265 270 Ala Pro Val Ile Leu Thr Ser Arg Ala Asp Ser Phe Glu Thr Lys Val 275 280 285 Asn Ser Ile Ala Leu Ala Ala Leu Val Ala Ala Arg Asn Lys 290 295 300 <210> SEQ ID NO 89 <211> LENGTH: 302 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Ptb, WP_041893500.1 <400> SEQUENCE: 89 Met Ser Lys Asn Phe Asp Glu Leu Leu Ser Arg Leu Lys Glu Val Pro 1 5 10 15 Thr Lys Lys Val Ala Val Ala Val Ala Gln Asp Glu Pro Val Leu Glu 20 25 30 Ala Ile Lys Glu Ala Thr Glu Asn Asn Ile Ala Gln Ala Ile Leu Val 35 40 45 Gly Asp Lys Gln Gln Ile His Glu Ile Ala Lys Lys Ile Asn Leu Asp 50 55 60 Leu Ser Asp Tyr Glu Ile Met Asp Ile Lys Asp Pro Lys Lys Ala Thr 65 70 75 80 Leu Glu Ala Val Lys Leu Val Ser Ser Gly His Ala Asp Met Leu Met 85 90 95 Lys Gly Leu Val Asp Thr Ala Thr Phe Leu Arg Ser Val Leu Asn Lys 100 105 110 Glu Val Gly Leu Arg Thr Gly Lys Leu Met Ser His Val Ala Val Phe 115 120 125 Asp Val Glu Gly Trp Asp Arg Leu Leu Phe Leu Thr Asp Ala Ala Phe 130 135 140 Asn Thr Tyr Pro Glu Phe Lys Asp Lys Val Gly Met Ile Asn Asn Ala 145 150 155 160 Val Val Val Ala His Ala Cys Gly Ile Asp Val Pro Arg Ile Ala Pro 165 170 175 Ile Cys Pro Val Glu Val Val Asn Thr Ser Met Gln Ser Thr Val Asp 180 185 190 Ala Ala Leu Leu Ala Lys Met Ser Asp Arg Gly Gln Ile Lys Gly Cys 195 200 205 Val Ile Asp Gly Pro Phe Ala Leu Asp Asn Ala Ile Ser Glu Glu Ala 210 215 220 Ala His His Lys Gly Val Thr Gly Ser Val Ala Gly Lys Ala Asp Ile 225 230 235 240 Leu Leu Leu Pro Asn Ile Glu Ala Ala Asn Val Met Tyr Lys Thr Leu 245 250 255 Thr Tyr Phe Ser Lys Ser Arg Asn Gly Gly Leu Leu Val Gly Thr Ser 260 265 270 Ala Pro Val Ile Leu Thr Ser Arg Ala Asp Ser Phe Glu Thr Lys Val 275 280 285 Asn Ser Ile Ala Leu Ala Ala Leu Val Ala Ala Arg Asn Lys 290 295 300 <210> SEQ ID NO 90 <211> LENGTH: 355 <212> TYPE: PRT <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Buk, WP_010966356.1 <400> SEQUENCE: 90 Met Tyr Arg Leu Leu Ile Ile Asn Pro Gly Ser Thr Ser Thr Lys Ile 1 5 10 15 Gly Ile Tyr Asp Asp Glu Lys Glu Ile Phe Glu Lys Thr Leu Arg His 20 25 30 Ser Ala Glu Glu Ile Glu Lys Tyr Asn Thr Ile Phe Asp Gln Phe Gln 35 40 45 Phe Arg Lys Asn Val Ile Leu Asp Ala Leu Lys Glu Ala Asn Ile Glu 50 55 60 Val Ser Ser Leu Asn Ala Val Val Gly Arg Gly Gly Leu Leu Lys Pro 65 70 75 80 Ile Val Ser Gly Thr Tyr Ala Val Asn Gln Lys Met Leu Glu Asp Leu 85 90 95 Lys Val Gly Val Gln Gly Gln His Ala Ser Asn Leu Gly Gly Ile Ile 100 105 110 Ala Asn Glu Ile Ala Lys Glu Ile Asn Val Pro Ala Tyr Ile Val Asp 115 120 125 Pro Val Val Val Asp Glu Leu Asp Glu Val Ser Arg Ile Ser Gly Met 130 135 140 Ala Asp Ile Pro Arg Lys Ser Ile Phe His Ala Leu Asn Gln Lys Ala 145 150 155 160 Val Ala Arg Arg Tyr Ala Lys Glu Val Gly Lys Lys Tyr Glu Asp Leu 165 170 175 Asn Leu Ile Val Val His Met Gly Gly Gly Thr Ser Val Gly Thr His 180 185 190 Lys Asp Gly Arg Val Ile Glu Val Asn Asn Thr Leu Asp Gly Glu Gly 195 200 205 Pro Phe Ser Pro Glu Arg Ser Gly Gly Val Pro Ile Gly Asp Leu Val 210 215 220 Arg Leu Cys Phe Ser Asn Lys Tyr Thr Tyr Glu Glu Val Met Lys Lys 225 230 235 240 Ile Asn Gly Lys Gly Gly Val Val Ser Tyr Leu Asn Thr Ile Asp Phe 245 250 255 Lys Ala Val Val Asp Lys Ala Leu Glu Gly Asp Lys Lys Cys Ala Leu 260 265 270 Ile Tyr Glu Ala Phe Thr Phe Gln Val Ala Lys Glu Ile Gly Lys Cys 275 280 285 Ser Thr Val Leu Lys Gly Asn Val Asp Ala Ile Ile Leu Thr Gly Gly 290 295 300 Ile Ala Tyr Asn Glu His Val Cys Asn Ala Ile Glu Asp Arg Val Lys 305 310 315 320 Phe Ile Ala Pro Val Val Arg Tyr Gly Gly Glu Asp Glu Leu Leu Ala 325 330 335 Leu Ala Glu Gly Gly Leu Arg Val Leu Arg Gly Glu Glu Lys Ala Lys 340 345 350 Glu Tyr Lys 355 <210> SEQ ID NO 91 <211> LENGTH: 355 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Buk, WP_011967556 <400> SEQUENCE: 91 Met Ser Tyr Lys Leu Leu Ile Ile Asn Pro Gly Ser Thr Ser Thr Lys 1 5 10 15 Ile Gly Val Tyr Glu Gly Glu Lys Glu Leu Phe Glu Glu Thr Leu Arg 20 25 30

His Thr Asn Glu Glu Ile Lys Arg Tyr Asp Thr Ile Tyr Asp Gln Phe 35 40 45 Glu Phe Arg Lys Glu Val Ile Leu Asn Val Leu Lys Glu Lys Asn Phe 50 55 60 Asp Ile Lys Thr Leu Ser Ala Ile Val Gly Arg Gly Gly Met Leu Arg 65 70 75 80 Pro Val Glu Gly Gly Thr Tyr Ala Val Asn Asp Ala Met Val Glu Asp 85 90 95 Leu Lys Val Gly Val Gln Gly Pro His Ala Ser Asn Leu Gly Gly Ile 100 105 110 Ile Ala Lys Ser Ile Gly Asp Glu Leu Asn Ile Pro Ser Phe Ile Val 115 120 125 Asp Pro Val Val Thr Asp Glu Leu Ala Asp Val Ala Arg Leu Ser Gly 130 135 140 Val Pro Glu Leu Pro Arg Lys Ser Lys Phe His Ala Leu Asn Gln Lys 145 150 155 160 Ala Val Ala Lys Arg Tyr Gly Lys Glu Ser Gly Gln Gly Tyr Glu Asn 165 170 175 Leu Asn Leu Val Val Val His Met Gly Gly Gly Val Ser Val Gly Ala 180 185 190 His Asn His Gly Lys Val Val Asp Val Asn Asn Ala Leu Asp Gly Asp 195 200 205 Gly Pro Phe Ser Pro Glu Arg Ala Gly Ser Val Pro Ile Gly Asp Leu 210 215 220 Val Lys Met Cys Phe Ser Gly Lys Tyr Ser Glu Ala Glu Val Tyr Gly 225 230 235 240 Lys Ala Val Gly Lys Gly Gly Phe Val Gly Tyr Leu Asn Thr Asn Asp 245 250 255 Val Lys Gly Val Ile Asp Lys Met Glu Glu Gly Asp Lys Glu Cys Glu 260 265 270 Ser Ile Tyr Lys Ala Phe Val Tyr Gln Ile Ser Lys Ala Ile Gly Glu 275 280 285 Met Ser Val Val Leu Glu Gly Lys Val Asp Gln Ile Ile Phe Thr Gly 290 295 300 Gly Ile Ala Tyr Ser Pro Thr Leu Val Pro Asp Leu Lys Ala Lys Val 305 310 315 320 Glu Trp Ile Ala Pro Val Thr Val Tyr Pro Gly Glu Asp Glu Leu Leu 325 330 335 Ala Leu Ala Gln Gly Ala Ile Arg Val Leu Asp Gly Glu Glu Gln Ala 340 345 350 Lys Val Tyr 355 <210> SEQ ID NO 92 <211> LENGTH: 355 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Buk, WP_017209677 <400> SEQUENCE: 92 Met Ser Tyr Lys Leu Leu Ile Ile Asn Pro Gly Ser Thr Ser Thr Lys 1 5 10 15 Ile Gly Val Tyr Glu Gly Glu Lys Glu Leu Phe Glu Glu Thr Leu Arg 20 25 30 His Thr Asn Glu Glu Ile Lys Arg Tyr Asp Thr Ile Tyr Asp Gln Phe 35 40 45 Glu Phe Arg Lys Glu Val Ile Leu Asn Val Leu Lys Glu Lys Asn Phe 50 55 60 Asp Ile Lys Thr Leu Ser Ala Ile Val Gly Arg Gly Gly Met Leu Arg 65 70 75 80 Pro Val Glu Gly Gly Thr Tyr Ala Val Asn Asp Ala Met Val Glu Asp 85 90 95 Leu Lys Val Gly Val Gln Gly Pro His Ala Ser Asn Leu Gly Gly Ile 100 105 110 Ile Ala Lys Ser Ile Gly Asp Glu Leu Asn Ile Pro Ser Phe Ile Val 115 120 125 Asp Pro Val Val Thr Asp Glu Leu Ala Asp Val Ala Arg Leu Ser Gly 130 135 140 Val Pro Glu Leu Pro Arg Lys Ser Lys Phe His Ala Leu Asn Gln Lys 145 150 155 160 Ala Val Ala Lys Arg Tyr Gly Lys Glu Ser Gly Gln Gly Tyr Glu Asn 165 170 175 Leu Asn Leu Val Val Val His Met Gly Gly Gly Val Ser Val Gly Ala 180 185 190 His Asn His Gly Lys Val Val Asp Val Asn Asn Ala Leu Asp Gly Asp 195 200 205 Gly Pro Phe Ser Pro Glu Arg Ala Gly Ser Val Pro Ile Gly Asp Leu 210 215 220 Val Lys Met Cys Phe Ser Gly Lys Tyr Ser Glu Ala Glu Val Tyr Gly 225 230 235 240 Lys Val Val Gly Lys Gly Gly Phe Val Gly Tyr Leu Asn Thr Asn Asp 245 250 255 Val Lys Gly Val Ile Asp Lys Met Glu Glu Gly Asp Lys Glu Cys Gly 260 265 270 Ser Ile Tyr Lys Ala Phe Val Tyr Gln Ile Ser Lys Ala Ile Gly Glu 275 280 285 Met Ser Val Val Leu Glu Gly Lys Val Asp Gln Ile Ile Phe Thr Gly 290 295 300 Gly Ile Ala Tyr Ser Pro Thr Leu Val Pro Asp Leu Lys Ala Lys Val 305 310 315 320 Glu Trp Ile Ala Pro Val Thr Val Tyr Pro Gly Glu Asp Glu Leu Leu 325 330 335 Ala Leu Ala Gln Gly Ala Ile Arg Val Leu Asp Gly Glu Glu Gln Ala 340 345 350 Lys Val Tyr 355 <210> SEQ ID NO 93 <211> LENGTH: 355 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Buk, WP_026886638 <400> SEQUENCE: 93 Met Ser Tyr Lys Leu Leu Ile Ile Asn Pro Gly Ser Thr Ser Thr Lys 1 5 10 15 Ile Gly Val Tyr Glu Gly Glu Lys Glu Leu Phe Glu Glu Thr Leu Arg 20 25 30 His Thr Asn Glu Glu Ile Lys Arg Tyr Asp Thr Ile Tyr Asp Gln Phe 35 40 45 Glu Phe Arg Lys Glu Val Ile Leu Asn Val Leu Lys Glu Lys Asn Phe 50 55 60 Asp Ile Lys Thr Leu Ser Ala Ile Val Gly Arg Gly Gly Met Leu Arg 65 70 75 80 Pro Val Glu Gly Gly Thr Tyr Ala Val Asn Asp Ala Met Val Glu Asp 85 90 95 Leu Lys Val Gly Val Gln Gly Pro His Ala Ser Asn Leu Gly Gly Ile 100 105 110 Ile Ala Lys Ser Ile Gly Asp Glu Leu Asn Ile Pro Ser Phe Ile Val 115 120 125 Asp Pro Val Val Thr Asp Glu Leu Ala Asp Val Ala Arg Leu Ser Gly 130 135 140 Val Pro Glu Leu Pro Arg Lys Ser Lys Phe His Ala Leu Asn Gln Lys 145 150 155 160 Ala Val Ala Lys Arg Tyr Gly Lys Glu Ser Gly Gln Gly Tyr Glu Asn 165 170 175 Leu Asn Leu Val Val Val His Met Gly Gly Gly Val Ser Val Gly Ala 180 185 190 His Asn His Gly Lys Val Val Asp Val Asn Asn Ala Leu Asp Gly Asp 195 200 205 Gly Pro Phe Ser Pro Glu Arg Ala Gly Ser Val Pro Ile Gly Asp Leu 210 215 220 Val Lys Met Cys Phe Ser Gly Lys Tyr Ser Glu Ala Glu Val Tyr Gly 225 230 235 240 Lys Val Val Gly Lys Gly Gly Phe Val Gly Tyr Leu Asn Thr Asn Asp 245 250 255 Val Lys Gly Val Ile Asp Asn Met Glu Ser Gly Asp Lys Glu Cys Glu 260 265 270 Ser Ile Tyr Lys Ala Phe Val Tyr Gln Ile Ser Lys Ala Ile Gly Glu 275 280 285 Met Ser Val Val Leu Glu Gly Lys Val Asp Gln Ile Ile Phe Thr Gly 290 295 300 Gly Ile Ala Tyr Ser Pro Thr Leu Val Pro Asp Leu Lys Glu Lys Val 305 310 315 320 Glu Trp Ile Ala Pro Val Thr Val Tyr Pro Gly Glu Asp Glu Leu Leu 325 330 335 Ala Leu Ala Gln Gly Ala Ile Arg Val Leu Asp Gly Glu Glu Gln Ala 340 345 350 Lys Val Tyr 355 <210> SEQ ID NO 94 <211> LENGTH: 355 <212> TYPE: PRT <213> ORGANISM: Clostridium beijerinckii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: Buk, WP_041893502 <400> SEQUENCE: 94 Met Ser Tyr Lys Leu Leu Ile Ile Asn Pro Gly Ser Thr Ser Thr Lys 1 5 10 15 Ile Gly Val Tyr Glu Gly Glu Lys Glu Leu Phe Glu Glu Thr Leu Arg 20 25 30 His Thr Asn Glu Glu Ile Lys Arg Tyr Asp Thr Ile Tyr Asp Gln Phe 35 40 45 Glu Phe Arg Lys Glu Val Ile Leu Asn Val Leu Lys Glu Lys Asn Phe 50 55 60 Asp Ile Lys Thr Leu Ser Ala Ile Val Gly Arg Gly Gly Met Leu Arg 65 70 75 80 Pro Val Glu Gly Gly Thr Tyr Ala Val Asn Asp Ala Met Val Glu Asp

85 90 95 Leu Lys Val Gly Val Gln Gly Pro His Ala Ser Asn Leu Gly Gly Ile 100 105 110 Ile Ala Lys Ser Ile Gly Asp Glu Leu Ser Ile Pro Ser Phe Ile Val 115 120 125 Asp Pro Val Val Thr Asp Glu Leu Ala Asp Val Ala Arg Leu Ser Gly 130 135 140 Val Pro Glu Leu Pro Arg Lys Ser Lys Phe His Ala Leu Asn Gln Lys 145 150 155 160 Ala Val Ala Lys Arg Tyr Gly Lys Glu Ser Gly Gln Gly Tyr Glu Asn 165 170 175 Leu Asn Leu Val Val Val His Met Gly Gly Gly Val Ser Val Gly Ala 180 185 190 His Asn His Gly Lys Val Val Asp Val Asn Asn Ala Leu Asp Gly Asp 195 200 205 Gly Pro Phe Ser Pro Glu Arg Ala Gly Ser Val Pro Ile Gly Asp Leu 210 215 220 Val Lys Met Cys Phe Ser Gly Lys Tyr Ser Glu Ala Glu Val Tyr Gly 225 230 235 240 Lys Val Val Gly Lys Gly Gly Phe Val Gly Tyr Leu Asn Thr Asn Asp 245 250 255 Val Lys Gly Val Ile Asp Lys Met Glu Glu Gly Asp Lys Glu Cys Gly 260 265 270 Ser Ile Tyr Lys Ala Phe Val Tyr Gln Ile Ser Lys Ala Ile Gly Glu 275 280 285 Met Ser Val Val Leu Glu Gly Lys Val Asp Gln Ile Ile Phe Thr Gly 290 295 300 Gly Ile Ala Tyr Ser Pro Thr Leu Val Pro Asp Leu Lys Ala Lys Val 305 310 315 320 Glu Trp Ile Ala Pro Val Thr Val Tyr Pro Gly Glu Asp Glu Leu Leu 325 330 335 Ala Leu Ala Gln Gly Ala Ile Arg Val Leu Asp Gly Glu Glu Gln Ala 340 345 350 Lys Val Tyr 355 <210> SEQ ID NO 95 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pACYCDuet-ptb-buk - pACYC-ptb-R1, reverse <400> SEQUENCE: 95 aagtttttac tcatatgtat atctccttct tatacttaac 40 <210> SEQ ID NO 96 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pACYCDuet-ptb-buk - ptb-pACYC-F1, forward <400> SEQUENCE: 96 agaaggagat atacatatga gtaaaaactt tgatgagtta 40 <210> SEQ ID NO 97 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pACYCDuet-ptb-buk - buk-pACYC-R1, reverse <400> SEQUENCE: 97 accagactcg agggtaccta gtaaacctta gcttgttc 38 <210> SEQ ID NO 98 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pACYCDuet-ptb-buk - pACYC-buk-F1, forward <400> SEQUENCE: 98 taaggtttac taggtaccct cgagtctggt aaagaaac 38 <210> SEQ ID NO 99 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - thlA-adc-R1, reverse <400> SEQUENCE: 99 acatatgtat atctccttct tactagcact tttctagcaa tattg 45 <210> SEQ ID NO 100 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - adc-ThlA-F1, forward <400> SEQUENCE: 100 agtaagaagg agatatacat atgttagaaa gtgaagtatc taaac 45 <210> SEQ ID NO 101 <211> LENGTH: 43 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - adc-pCOLA-R1, reverse <400> SEQUENCE: 101 cagactcgag ggtaccttat tttactgaaa gataatcatg tac 43 <210> SEQ ID NO 102 <211> LENGTH: 42 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - pCOLA-adc-F1, forward <400> SEQUENCE: 102 tctttcagta aaataaggta ccctcgagtc tggtaaagaa ac 42 <210> SEQ ID NO 103 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - thlA-pCOLA-F1, forward <400> SEQUENCE: 103 gaaggagata tacatatgaa agaagttgta atagctagtg 40 <210> SEQ ID NO 104 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLADuet-thlA-adc - pCOLA-thlA-R1, reverse <400> SEQUENCE: 104 acaacttctt tcatatgtat atctccttct tatacttaac 40 <210> SEQ ID NO 105 <211> LENGTH: 5791 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pACYC-ptb-buk, plasmid <400> SEQUENCE: 105 ggggaattgt gagcggataa caattcccct gtagaaataa ttttgtttaa ctttaataag 60 gagatatacc atgggcagca gccatcacca tcatcaccac agccaggatc cgaattcgag 120 ctcggcgcgc ctgcaggtcg acaagcttgc ggccgcataa tgcttaagtc gaacagaaag 180 taatcgtatt gtacacggcc gcataatcga aattaatacg actcactata ggggaattgt 240 gagcggataa caattcccca tcttagtata ttagttaagt ataagaagga gatatacata 300 tgagtaaaaa ctttgatgag ttattatcaa gattaaagga agttccaaca aaaaaagtgg 360 ctgtagccgt agcacaagat gaaccagtat tagaggctat aaaagaagct acagaaaata 420 acatcgcaca agcaatattg gttggtgata aacaacaaat ccatgaaatc gcaaagaaaa 480 taaacttgga cttatctgat tatgaaataa tggatattaa agatccaaag aaagcaacat 540 tagaagcagt aaaattagtt tctagtggtc atgcagatat gttaatgaaa ggtctagttg 600 atactgcaac attcctaaga agcgtattaa acaaagaggt tggtcttaga acaggaaaat 660 taatgtccca tgtagctgtg tttgatgtgg aaggttggga tagactgtta tttttaactg 720 atgcagcatt taatacatat ccagaattta aggataaagt tggaatgata aataatgcag 780 ttgtagttgc tcatgcatgt ggaatagatg ttccaagagt agcacctata tgcccagttg 840 aagttgtaaa tacaagtatg caatcaacag ttgatgcagc attgttagct aaaatgagtg 900 acagggggca aattaaagga tgcgtaattg atggaccttt tgccttagat aatgcaatat 960 cagaagaagc agctcatcat aaaggtgtta caggatcagt agcaggtaaa gctgatatat 1020

tattattacc aaatatagaa gcagcaaatg taatgtataa aacattaaca tatttctcta 1080 aatcaagaaa tggtggactt ttagtaggta catcagcacc agtaatttta acttcaagag 1140 cagattcatt cgaaactaaa gttaattcaa ttgctcttgc agcattagtt gcagcaagaa 1200 ataagtaata aatcaatcca taataattaa tgcataatta atggagagat ttatatggaa 1260 tttgcaatgc actattagat tctataataa tttcttctga aaattatgca ttatgactgt 1320 atagaatgca ttaaatttaa gggggattca gaatgtcata taagctatta ataatcaatc 1380 caggttcaac atcaacaaag attggtgttt acgaaggaga aaaggaacta tttgaagaaa 1440 ctttgagaca cacaaatgaa gaaataaaga gatatgatac aatatatgat caatttgaat 1500 ttagaaaaga agttatatta aatgttctta aagaaaagaa ttttgatata aagactctaa 1560 gtgctattgt tggtagaggt ggaatgctta gaccagttga aggtggaaca tatgcagtaa 1620 atgatgcaat ggttgaagat ttaaaagttg gagttcaagg acctcatgct tctaaccttg 1680 gcggaataat tgccaagtca attggagatg aattaaatat tccatcattt atagtagatc 1740 cagttgttac agatgagtta gcagatgtag caagactatc tggagtacca gaactaccaa 1800 gaaaaagtaa attccatgct ttaaatcaaa aagcggtagc taaaagatat ggaaaagaaa 1860 gtggacaagg atatgaaaac ctaaatcttg tagttgtaca tatgggtgga ggcgtttcag 1920 ttggtgctca caatcatggg aaagttgtcg atgtaaataa tgcattagat ggagatggcc 1980 cattctcacc agaaagagct ggatcagttc caattggtga tttagttaaa atgtgtttta 2040 gtggaaaata tagtgaagca gaagtatatg gcaaggctgt aggaaaaggt ggatttgttg 2100 gttatctaaa cacaaatgat gtaaaaggtg ttattgataa gatggaagaa ggagataaag 2160 aatgtgaatc aatatacaaa gcatttgttt atcaaatttc aaaagcaatc ggagaaatgt 2220 cagttgtatt agaaggtaaa gttgatcaaa ttatttttac cggaggaatt gcatactcac 2280 caacacttgt tccagacctt aaagcaaaag ttgaatggat agccccagtt acagtttatc 2340 ctggagaaga tgaattactt gctctagctc aaggtgctat aagagtactt gatggagaag 2400 aacaagctaa ggtttactag gtaccctcga gtctggtaaa gaaaccgctg ctgcgaaatt 2460 tgaacgccag cacatggact cgtctactag cgcagcttaa ttaacctagg ctgctgccac 2520 cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt 2580 gctgaaacct caggcatttg agaagcacac ggtcacactg cttccggtag tcaataaacc 2640 ggtaaaccag caatagacat aagcggctat ttaacgaccc tgccctgaac cgacgacaag 2700 ctgacgaccg ggtctccgca agtggcactt ttcggggaaa tgtgcgcgga acccctattt 2760 gtttattttt ctaaatacat tcaaatatgt atccgctcat gaattaattc ttagaaaaac 2820 tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gattatcaat accatatttt 2880 tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca taggatggca 2940 agatcctggt atcggtctgc gattccgact cgtccaacat caatacaacc tattaatttc 3000 ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac tgaatccggt 3060 gagaatggca aaagtttatg catttctttc cagacttgtt caacaggcca gccattacgc 3120 tcgtcatcaa aatcactcgc atcaaccaaa ccgttattca ttcgtgattg cgcctgagcg 3180 agacgaaata cgcggtcgct gttaaaagga caattacaaa caggaatcga atgcaaccgg 3240 cgcaggaaca ctgccagcgc atcaacaata ttttcacctg aatcaggata ttcttctaat 3300 acctggaatg ctgttttccc ggggatcgca gtggtgagta accatgcatc atcaggagta 3360 cggataaaat gcttgatggt cggaagaggc ataaattccg tcagccagtt tagtctgacc 3420 atctcatctg taacatcatt ggcaacgcta cctttgccat gtttcagaaa caactctggc 3480 gcatcgggct tcccatacaa tcgatagatt gtcgcacctg attgcccgac attatcgcga 3540 gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg cctagagcaa 3600 gacgtttccc gttgaatatg gctcatactc ttcctttttc aatattattg aagcatttat 3660 cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata 3720 ggcatgctag cgcagaaacg tcctagaaga tgccaggagg atacttagca gagagacaat 3780 aaggccggag cgaagccgtt tttccatagg ctccgccccc ctgacgaaca tcacgaaatc 3840 tgacgctcaa atcagtggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc 3900 cctgatggct ccctcttgcg ctctcctgtt cccgtcctgc ggcgtccgtg ttgtggtgga 3960 ggctttaccc aaatcaccac gtcccgttcc gtgtagacag ttcgctccaa gctgggctgt 4020 gtgcaagaac cccccgttca gcccgactgc tgcgccttat ccggtaacta tcatcttgag 4080 tccaacccgg aaagacacga caaaacgcca ctggcagcag ccattggtaa ctgagaatta 4140 gtggatttag atatcgagag tcttgaagtg gtggcctaac agaggctaca ctgaaaggac 4200 agtatttggt atctgcgctc cactaaagcc agttaccagg ttaagcagtt ccccaactga 4260 cttaaccttc gatcaaaccg cctccccagg cggttttttc gtttacagag caggagatta 4320 cgacgatcgt aaaaggatct caagaagatc ctttacggat tcccgacacc atcactctag 4380 atttcagtgc aatttatctc ttcaaatgta gcacctgaag tcagccccat acgatataag 4440 ttgtaattct catgttagtc atgccccgcg cccaccggaa ggagctgact gggttgaagg 4500 ctctcaaggg catcggtcga gatcccggtg cctaatgagt gagctaactt acattaattg 4560 cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 4620 tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ccagggtggt ttttcttttc 4680 accagtgaga cgggcaacag ctgattgccc ttcaccgcct ggccctgaga gagttgcagc 4740 aagcggtcca cgctggtttg ccccagcagg cgaaaatcct gtttgatggt ggttaacggc 4800 gggatataac atgagctgtc ttcggtatcg tcgtatccca ctaccgagat gtccgcacca 4860 acgcgcagcc cggactcggt aatggcgcgc attgcgccca gcgccatctg atcgttggca 4920 accagcatcg cagtgggaac gatgccctca ttcagcattt gcatggtttg ttgaaaaccg 4980 gacatggcac tccagtcgcc ttcccgttcc gctatcggct gaatttgatt gcgagtgaga 5040 tatttatgcc agccagccag acgcagacgc gccgagacag aacttaatgg gcccgctaac 5100 agcgcgattt gctggtgacc caatgcgacc agatgctcca cgcccagtcg cgtaccgtct 5160 tcatgggaga aaataatact gttgatgggt gtctggtcag agacatcaag aaataacgcc 5220 ggaacattag tgcaggcagc ttccacagca atggcatcct ggtcatccag cggatagtta 5280 atgatcagcc cactgacgcg ttgcgcgaga agattgtgca ccgccgcttt acaggcttcg 5340 acgccgcttc gttctaccat cgacaccacc acgctggcac ccagttgatc ggcgcgagat 5400 ttaatcgccg cgacaatttg cgacggcgcg tgcagggcca gactggaggt ggcaacgcca 5460 atcagcaacg actgtttgcc cgccagttgt tgtgccacgc ggttgggaat gtaattcagc 5520 tccgccatcg ccgcttccac tttttcccgc gttttcgcag aaacgtggct ggcctggttc 5580 accacgcggg aaacggtctg ataagagaca ccggcatact ctgcgacatc gtataacgtt 5640 actggtttca cattcaccac cctgaattga ctctcttccg ggcgctatca tgccataccg 5700 cgaaaggttt tgcgccattc gatggtgtcc gggatctcga cgctctccct tatgcgactc 5760 ctgcattagg aaattaatac gactcactat a 5791 <210> SEQ ID NO 106 <211> LENGTH: 5609 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCOLA-thlA-adc, plasmid <400> SEQUENCE: 106 ggggaattgt gagcggataa caattcccct gtagaaataa ttttgtttaa ctttaataag 60 gagatatacc atgggcagca gccatcacca tcatcaccac agccaggatc cgaattcgag 120 ctcggcgcgc ctgcaggtcg acaagcttgc ggccgcataa tgcttaagtc gaacagaaag 180 taatcgtatt gtacacggcc gcataatcga aattaatacg actcactata ggggaattgt 240 gagcggataa caattcccca tcttagtata ttagttaagt ataagaagga gatatacata 300 tgaaagaagt tgtaatagct agtgcagtaa gaacagcgat tggatcttat ggaaagtctc 360 ttaaggatgt accagcagta gatttaggag ctacagctat aaaggaagca gttaaaaaag 420 caggaataaa accagaggat gttaatgaag tcattttagg aaatgttctt caagcaggtt 480 taggacagaa tccagcaaga caggcatctt ttaaagcagg attaccagtt gaaattccag 540 ctatgactat taataaggtt tgtggttcag gacttagaac agttagctta gcagcacaaa 600 ttataaaagc aggagatgct gacgtaataa tagcaggtgg tatggaaaat atgtctagag 660 ctccttactt agcgaataac gctagatggg gatatagaat gggaaacgct aaatttgttg 720 atgaaatgat cactgacgga ttgtgggatg catttaatga ttaccacatg ggaataacag 780 cagaaaacat agctgagaga tggaacattt caagagaaga acaagatgag tttgctcttg 840 catcacaaaa aaaagctgaa gaagctataa aatcaggtca atttaaagat gaaatagttc 900 ctgtagtaat taaaggcaga aagggagaaa ctgtagttga tacagatgag caccctagat 960 ttggatcaac tatagaagga cttgcaaaat taaaacctgc cttcaaaaaa gatggaacag 1020 ttacagctgg taatgcatca ggattaaatg actgtgcagc agtacttgta atcatgagtg 1080 cagaaaaagc taaagagctt ggagtaaaac cacttgctaa gatagtttct tatggttcag 1140 caggagttga cccagcaata atgggatatg gacctttcta tgcaacaaaa gcagctattg 1200 aaaaagcagg ttggacagtt gatgaattag atttaataga atcaaatgaa gcttttgcag 1260 ctcaaagttt agcagtagca aaagatttaa aatttgatat gaataaagta aatgtaaatg 1320 gaggagctat tgcccttggt catccaattg gagcatcagg tgcaagaata ctcgttactc 1380 ttgtacacgc aatgcaaaaa agagatgcaa aaaaaggctt agcaacttta tgtataggtg 1440 gcggacaagg aacagcaata ttgctagaaa agtgctagta agaaggagat atacatatgt 1500 tagaaagtga agtatctaaa caaattacaa ctccacttgc tgctccagcg tttcctagag 1560 gaccatatag gtttcacaat agagaatatc taaacattat ttatcgaact gatttagatg 1620 ctcttcgaaa aatagtacca gagccacttg aattagatag agcatatgtt agatttgaaa 1680 tgatggctat gcctgataca accggactag gctcatatac agaatgtggt caagctattc 1740 cagtaaaata taatggtgtt aagggtgact acttgcatat gatgtatcta gataatgaac 1800 ctgctattgc tgttggaaga gaaagtagcg cttatccaaa aaagcttggc tatccaaagc 1860 tatttgttga ttcagatact ttagttggga cacttaaata tggtacatta ccagtagcta 1920 ctgcaacaat gggatataag cacgagcctc tagatcttaa agaagcctat gctcaaattg 1980 caagacccaa ttttatgcta aaaatcattc aaggttacga tggtaagcca agaatttgtg 2040 aactaatatg tgcagaaaat actgatataa ctattcacgg tgcttggact ggaagtgcac 2100 gtctacaatt atttagccat gcactagctc ctcttgctga tttacctgta ttagagattg 2160 tatcagcatc tcatatcctc acagatttaa ctcttggaac acctaaggtt gtacatgatt 2220 atctttcagt aaaataaggt accctcgagt ctggtaaaga aaccgctgct gcgaaatttg 2280 aacgccagca catggactcg tctactagcg cagcttaatt aacctaggct gctgccaccg 2340

ctgagcaata actagcataa ccccttgggg cctctaaacg ggtcttgagg ggttttttgc 2400 tgaaacctca ggcatttgag aagcacacgg tcacactgct tccggtagtc aataaaccgg 2460 taaaccagca atagacataa gcggctattt aacgaccctg ccctgaaccg acgacaagct 2520 gacgaccggg tctccgcaag tggcactttt cggggaaatg tgcgcggaac ccctatttgt 2580 ttatttttct aaatacattc aaatatgtat ccgctcatga attaattctt agaaaaactc 2640 atcgagcatc aaatgaaact gcaatttatt catatcagga ttatcaatac catatttttg 2700 aaaaagccgt ttctgtaatg aaggagaaaa ctcaccgagg cagttccata ggatggcaag 2760 atcctggtat cggtctgcga ttccgactcg tccaacatca atacaaccta ttaatttccc 2820 ctcgtcaaaa ataaggttat caagtgagaa atcaccatga gtgacgactg aatccggtga 2880 gaatggcaaa agtttatgca tttctttcca gacttgttca acaggccagc cattacgctc 2940 gtcatcaaaa tcactcgcat caaccaaacc gttattcatt cgtgattgcg cctgagcgag 3000 acgaaatacg cggtcgctgt taaaaggaca attacaaaca ggaatcgaat gcaaccggcg 3060 caggaacact gccagcgcat caacaatatt ttcacctgaa tcaggatatt cttctaatac 3120 ctggaatgct gttttcccgg ggatcgcagt ggtgagtaac catgcatcat caggagtacg 3180 gataaaatgc ttgatggtcg gaagaggcat aaattccgtc agccagttta gtctgaccat 3240 ctcatctgta acatcattgg caacgctacc tttgccatgt ttcagaaaca actctggcgc 3300 atcgggcttc ccatacaatc gatagattgt cgcacctgat tgcccgacat tatcgcgagc 3360 ccatttatac ccatataaat cagcatccat gttggaattt aatcgcggcc tagagcaaga 3420 cgtttcccgt tgaatatggc tcatactctt cctttttcaa tattattgaa gcatttatca 3480 gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 3540 catgctagcg cagaaacgtc ctagaagatg ccaggaggat acttagcaga gagacaataa 3600 ggccggagcg aagccgtttt tccataggct ccgcccccct gacgaacatc acgaaatctg 3660 acgctcaaat cagtggtggc gaaacccgac aggactataa agataccagg cgtttccccc 3720 tgatggctcc ctcttgcgct ctcctgttcc cgtcctgcgg cgtccgtgtt gtggtggagg 3780 ctttacccaa atcaccacgt cccgttccgt gtagacagtt cgctccaagc tgggctgtgt 3840 gcaagaaccc cccgttcagc ccgactgctg cgccttatcc ggtaactatc atcttgagtc 3900 caacccggaa agacacgaca aaacgccact ggcagcagcc attggtaact gagaattagt 3960 ggatttagat atcgagagtc ttgaagtggt ggcctaacag aggctacact gaaaggacag 4020 tatttggtat ctgcgctcca ctaaagccag ttaccaggtt aagcagttcc ccaactgact 4080 taaccttcga tcaaaccgcc tccccaggcg gttttttcgt ttacagagca ggagattacg 4140 acgatcgtaa aaggatctca agaagatcct ttacggattc ccgacaccat cactctagat 4200 ttcagtgcaa tttatctctt caaatgtagc acctgaagtc agccccatac gatataagtt 4260 gtaattctca tgttagtcat gccccgcgcc caccggaagg agctgactgg gttgaaggct 4320 ctcaagggca tcggtcgaga tcccggtgcc taatgagtga gctaacttac attaattgcg 4380 ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc 4440 ggccaacgcg cggggagagg cggtttgcgt attgggcgcc agggtggttt ttcttttcac 4500 cagtgagacg ggcaacagct gattgccctt caccgcctgg ccctgagaga gttgcagcaa 4560 gcggtccacg ctggtttgcc ccagcaggcg aaaatcctgt ttgatggtgg ttaacggcgg 4620 gatataacat gagctgtctt cggtatcgtc gtatcccact accgagatgt ccgcaccaac 4680 gcgcagcccg gactcggtaa tggcgcgcat tgcgcccagc gccatctgat cgttggcaac 4740 cagcatcgca gtgggaacga tgccctcatt cagcatttgc atggtttgtt gaaaaccgga 4800 catggcactc cagtcgcctt cccgttccgc tatcggctga atttgattgc gagtgagata 4860 tttatgccag ccagccagac gcagacgcgc cgagacagaa cttaatgggc ccgctaacag 4920 cgcgatttgc tggtgaccca atgcgaccag atgctccacg cccagtcgcg taccgtcttc 4980 atgggagaaa ataatactgt tgatgggtgt ctggtcagag acatcaagaa ataacgccgg 5040 aacattagtg caggcagctt ccacagcaat ggcatcctgg tcatccagcg gatagttaat 5100 gatcagccca ctgacgcgtt gcgcgagaag attgtgcacc gccgctttac aggcttcgac 5160 gccgcttcgt tctaccatcg acaccaccac gctggcaccc agttgatcgg cgcgagattt 5220 aatcgccgcg acaatttgcg acggcgcgtg cagggccaga ctggaggtgg caacgccaat 5280 cagcaacgac tgtttgcccg ccagttgttg tgccacgcgg ttgggaatgt aattcagctc 5340 cgccatcgcc gcttccactt tttcccgcgt tttcgcagaa acgtggctgg cctggttcac 5400 cacgcgggaa acggtctgat aagagacacc ggcatactct gcgacatcgt ataacgttac 5460 tggtttcaca ttcaccaccc tgaattgact ctcttccggg cgctatcatg ccataccgcg 5520 aaaggttttg cgccattcga tggtgtccgg gatctcgacg ctctccctta tgcgactcct 5580 gcattaggaa attaatacga ctcactata 5609 <210> SEQ ID NO 107 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: thlA-ptb-R1, reverse <400> SEQUENCE: 107 atttcctccc tttctagcac ttttctagca atattg 36 <210> SEQ ID NO 108 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: adc-buk-F1, forward <400> SEQUENCE: 108 taaggtttac taaggaggtt gttttatgtt agaaag 36 <210> SEQ ID NO 109 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: thlA-ptb-F1, forward <400> SEQUENCE: 109 gctagaaaag tgctagaaag ggaggaaatg aacatg 36 <210> SEQ ID NO 110 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Buk-adc-R1, reverse <400> SEQUENCE: 110 aaaacaacct ccttagtaaa ccttagcttg ttcttc 36 <210> SEQ ID NO 111 <211> LENGTH: 29 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pDuet-insert2-R1, forward <400> SEQUENCE: 111 catatgtata tctccttctt atacttaac 29 <210> SEQ ID NO 112 <211> LENGTH: 29 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: insert2-pDuet-F1, forward <400> SEQUENCE: 112 gttaagtata agaaggagat atacatatg 29 <210> SEQ ID NO 113 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pDuet-insert2-F1, forward <400> SEQUENCE: 113 cctcgagtct ggtaaagaaa c 21 <210> SEQ ID NO 114 <211> LENGTH: 21 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: insert2-pDuet-R1, forward <400> SEQUENCE: 114 gtttctttac cagactcgag g 21 <210> SEQ ID NO 115 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCDF-phaB - pACYC-phaB-R1, forward <400> SEQUENCE: 115 ctattctttg tgtcatggta tatctcctta ttaaag 36 <210> SEQ ID NO 116 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence

<220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCDF-phaB - phaB-pACYC-F1, forward <400> SEQUENCE: 116 ataaggagat ataccatgac acaaagaata gcatac 36 <210> SEQ ID NO 117 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pcdf-phab - pacyc-phab-f1, forward <400> SEQUENCE: 117 tggtttacac atgggataag atccgaattc gagctc 36 <210> SEQ ID NO 118 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCDF-phaB - phaB-pACYC-R1, forward <400> SEQUENCE: 118 agctcgaatt cggatcttat cccatgtgta aaccac 36 <210> SEQ ID NO 119 <211> LENGTH: 4486 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCDF-phaB, plasmid <400> SEQUENCE: 119 ggggaattgt gagcggataa caattcccct gtagaaataa ttttgtttaa ctttaataag 60 gagatatacc atgacacaaa gaatagcata cgtaacaggt ggtatgggtg gtataggaac 120 tgcaatatgt caaagattag caaaagatgg atttagagtt gtagctggat gcggaccaaa 180 tagtcctaga agagaaaagt ggttagaaca acaaaaagca cttggatttg atttcatagc 240 ttctgaaggt aacgtagcag attgggactc aactaaaact gcttttgata aagttaaatc 300 tgaagttggt gaagttgatg tattaataaa taatgcaggt attactagag atgtagtatt 360 tagaaagatg acaagagctg actgggatgc agtaatagat actaatctta ctagtctttt 420 caatgtaact aagcaggtaa ttgatggtat ggcagataga ggttggggta gaatagtaaa 480 tattagttca gttaatggac aaaaaggtca gtttggacag acaaattatt ctacagctaa 540 agcaggtctt catggtttta caatggcttt agcacaggaa gttgctacaa aaggtgttac 600 agttaacact gttagtccag gatatattgc tactgacatg gtaaaggcta taagacaaga 660 tgttcttgat aaaattgttg ctacaatacc agtaaagaga ttaggacttc ctgaagagat 720 agcatctatt tgtgcatggt tatcaagtga agaatcagga ttctcaactg gtgctgattt 780 ttcattaaac ggtggtttac acatgggata agatccgaat tcgagctcgg cgcgcctgca 840 ggtcgacaag cttgcggccg cataatgctt aagtcgaaca gaaagtaatc gtattgtaca 900 cggccgcata atcgaaatta atacgactca ctatagggga attgtgagcg gataacaatt 960 ccccatctta gtatattagt taagtataag aaggagatat acatatggca gatctcaatt 1020 ggatatcggc cggccacgcg atcgctgacg tcggtaccct cgagtctggt aaagaaaccg 1080 ctgctgcgaa atttgaacgc cagcacatgg actcgtctac tagcgcagct taattaacct 1140 aggctgctgc caccgctgag caataactag cataacccct tggggcctct aaacgggtct 1200 tgaggggttt tttgctgaaa cctcaggcat ttgagaagca cacggtcaca ctgcttccgg 1260 tagtcaataa accggtaaac cagcaataga cataagcggc tatttaacga ccctgccctg 1320 aaccgacgac cgggtcatcg tggccggatc ttgcggcccc tcggcttgaa cgaattgtta 1380 gacattattt gccgactacc ttggtgatct cgcctttcac gtagtggaca aattcttcca 1440 actgatctgc gcgcgaggcc aagcgatctt cttcttgtcc aagataagcc tgtctagctt 1500 caagtatgac gggctgatac tgggccggca ggcgctccat tgcccagtcg gcagcgacat 1560 ccttcggcgc gattttgccg gttactgcgc tgtaccaaat gcgggacaac gtaagcacta 1620 catttcgctc atcgccagcc cagtcgggcg gcgagttcca tagcgttaag gtttcattta 1680 gcgcctcaaa tagatcctgt tcaggaaccg gatcaaagag ttcctccgcc gctggaccta 1740 ccaaggcaac gctatgttct cttgcttttg tcagcaagat agccagatca atgtcgatcg 1800 tggctggctc gaagatacct gcaagaatgt cattgcgctg ccattctcca aattgcagtt 1860 cgcgcttagc tggataacgc cacggaatga tgtcgtcgtg cacaacaatg gtgacttcta 1920 cagcgcggag aatctcgctc tctccagggg aagccgaagt ttccaaaagg tcgttgatca 1980 aagctcgccg cgttgtttca tcaagcctta cggtcaccgt aaccagcaaa tcaatatcac 2040 tgtgtggctt caggccgcca tccactgcgg agccgtacaa atgtacggcc agcaacgtcg 2100 gttcgagatg gcgctcgatg acgccaacta cctctgatag ttgagtcgat acttcggcga 2160 tcaccgcttc cctcatactc ttcctttttc aatattattg aagcatttat cagggttatt 2220 gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata gctagctcac 2280 tcggtcgcta cgctccgggc gtgagactgc ggcgggcgct gcggacacat acaaagttac 2340 ccacagattc cgtggataag caggggacta acatgtgagg caaaacagca gggccgcgcc 2400 ggtggcgttt ttccataggc tccgccctcc tgccagagtt cacataaaca gacgcttttc 2460 cggtgcatct gtgggagccg tgaggctcaa ccatgaatct gacagtacgg gcgaaacccg 2520 acaggactta aagatcccca ccgtttccgg cgggtcgctc cctcttgcgc tctcctgttc 2580 cgaccctgcc gtttaccgga tacctgttcc gcctttctcc cttacgggaa gtgtggcgct 2640 ttctcatagc tcacacactg gtatctcggc tcggtgtagg tcgttcgctc caagctgggc 2700 tgtaagcaag aactccccgt tcagcccgac tgctgcgcct tatccggtaa ctgttcactt 2760 gagtccaacc cggaaaagca cggtaaaacg ccactggcag cagccattgg taactgggag 2820 ttcgcagagg atttgtttag ctaaacacgc ggttgctctt gaagtgtgcg ccaaagtccg 2880 gctacactgg aaggacagat ttggttgctg tgctctgcga aagccagtta ccacggttaa 2940 gcagttcccc aactgactta accttcgatc aaaccacctc cccaggtggt tttttcgttt 3000 acagggcaaa agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct 3060 actgaaccgc tctagatttc agtgcaattt atctcttcaa atgtagcacc tgaagtcagc 3120 cccatacgat ataagttgta attctcatgt tagtcatgcc ccgcgcccac cggaaggagc 3180 tgactgggtt gaaggctctc aagggcatcg gtcgagatcc cggtgcctaa tgagtgagct 3240 aacttacatt aattgcgttg cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc 3300 agctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggcgccagg 3360 gtggtttttc ttttcaccag tgagacgggc aacagctgat tgcccttcac cgcctggccc 3420 tgagagagtt gcagcaagcg gtccacgctg gtttgcccca gcaggcgaaa atcctgtttg 3480 atggtggtta acggcgggat ataacatgag ctgtcttcgg tatcgtcgta tcccactacc 3540 gagatgtccg caccaacgcg cagcccggac tcggtaatgg cgcgcattgc gcccagcgcc 3600 atctgatcgt tggcaaccag catcgcagtg ggaacgatgc cctcattcag catttgcatg 3660 gtttgttgaa aaccggacat ggcactccag tcgccttccc gttccgctat cggctgaatt 3720 tgattgcgag tgagatattt atgccagcca gccagacgca gacgcgccga gacagaactt 3780 aatgggcccg ctaacagcgc gatttgctgg tgacccaatg cgaccagatg ctccacgccc 3840 agtcgcgtac cgtcttcatg ggagaaaata atactgttga tgggtgtctg gtcagagaca 3900 tcaagaaata acgccggaac attagtgcag gcagcttcca cagcaatggc atcctggtca 3960 tccagcggat agttaatgat cagcccactg acgcgttgcg cgagaagatt gtgcaccgcc 4020 gctttacagg cttcgacgcc gcttcgttct accatcgaca ccaccacgct ggcacccagt 4080 tgatcggcgc gagatttaat cgccgcgaca atttgcgacg gcgcgtgcag ggccagactg 4140 gaggtggcaa cgccaatcag caacgactgt ttgcccgcca gttgttgtgc cacgcggttg 4200 ggaatgtaat tcagctccgc catcgccgct tccacttttt cccgcgtttt cgcagaaacg 4260 tggctggcct ggttcaccac gcgggaaacg gtctgataag agacaccggc atactctgcg 4320 acatcgtata acgttactgg tttcacattc accaccctga attgactctc ttccgggcgc 4380 tatcatgcca taccgcgaaa ggttttgcgc cattcgatgg tgtccgggat ctcgacgctc 4440 tcccttatgc gactcctgca ttaggaaatt aatacgactc actata 4486 <210> SEQ ID NO 120 <211> LENGTH: 5221 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pCDF-phaB-bdh1, plasmid <400> SEQUENCE: 120 ggggaattgt gagcggataa caattcccct gtagaaataa ttttgtttaa ctttaataag 60 gagatatacc atgacacaaa gaatagcata cgtaacaggt ggtatgggtg gtataggaac 120 tgcaatatgt caaagattag caaaagatgg atttagagtt gtagctggat gcggaccaaa 180 tagtcctaga agagaaaagt ggttagaaca acaaaaagca cttggatttg atttcatagc 240 ttctgaaggt aacgtagcag attgggactc aactaaaact gcttttgata aagttaaatc 300 tgaagttggt gaagttgatg tattaataaa taatgcaggt attactagag atgtagtatt 360 tagaaagatg acaagagctg actgggatgc agtaatagat actaatctta ctagtctttt 420 caatgtaact aagcaggtaa ttgatggtat ggcagataga ggttggggta gaatagtaaa 480 tattagttca gttaatggac aaaaaggtca gtttggacag acaaattatt ctacagctaa 540 agcaggtctt catggtttta caatggcttt agcacaggaa gttgctacaa aaggtgttac 600 agttaacact gttagtccag gatatattgc tactgacatg gtaaaggcta taagacaaga 660 tgttcttgat aaaattgttg ctacaatacc agtaaagaga ttaggacttc ctgaagagat 720 agcatctatt tgtgcatggt tatcaagtga agaatcagga ttctcaactg gtgctgattt 780 ttcattaaac ggtggtttac acatgggata agatccgaat tcgagctcgg cgcgcctgca 840 ggtcgacaag cttgcggccg cataatgctt aagtcgaaca gaaagtaatc gtattgtaca 900 cggccgcata atcgaaatta atacgactca ctatagggga attgtgagcg gataacaatt 960 ccccatctta gtatattagt taagtataag aaggagatat acatatgcaa ttaaaaggta 1020 aaagtgcaat agtaactggt gcagcaagtg gaataggaaa agcaatagca gaattacttg 1080

caaaagaagg tgcagcagta gcaatagctg atttaaattt agaagcagca agagcagcag 1140 cagctggaat agaagcagct ggcggaaaag ctatagctgt agcaatggat gtaactagtg 1200 aagcaagtgt aaatcaagca actgatgaag tagcacaagc atttggaaat atagatatat 1260 tagtaagtaa tgctggaata caaatagtaa atcctataca aaattatgca tttagtgatt 1320 ggaaaaaaat gcaagcaata catgtagatg gtgcattttt aactactaaa gcagcattga 1380 aatatatgta tagagataaa agaggtggaa ctgtaatata tatgggaagt gtacattctc 1440 atgaagcaag tcctttaaaa agtgcttatg tagcagcaaa acatgcatta ttaggattag 1500 caagagtatt agctaaagaa ggtgctgaat tcaacgtaag atctcacgtt atatgtcctg 1560 gatttgtaag aactccttta gtagataaac aaatacctga acaagcaaaa gaattaggaa 1620 taagtgaaga agaagtagtt agaagagtaa tgttaggtgg aacagtagac ggtgtattta 1680 ctactgtaga tgatgtagca agaactgcat tatttttatg tgcatttcct agtgcagcat 1740 taactggaca aagttttata gtaagtcatg gatggtatat gcaataaggt accctcgagt 1800 ctggtaaaga aaccgctgct gcgaaatttg aacgccagca catggactcg tctactagcg 1860 cagcttaatt aacctaggct gctgccaccg ctgagcaata actagcataa ccccttgggg 1920 cctctaaacg ggtcttgagg ggttttttgc tgaaacctca ggcatttgag aagcacacgg 1980 tcacactgct tccggtagtc aataaaccgg taaaccagca atagacataa gcggctattt 2040 aacgaccctg ccctgaaccg acgaccgggt catcgtggcc ggatcttgcg gcccctcggc 2100 ttgaacgaat tgttagacat tatttgccga ctaccttggt gatctcgcct ttcacgtagt 2160 ggacaaattc ttccaactga tctgcgcgcg aggccaagcg atcttcttct tgtccaagat 2220 aagcctgtct agcttcaagt atgacgggct gatactgggc cggcaggcgc tccattgccc 2280 agtcggcagc gacatccttc ggcgcgattt tgccggttac tgcgctgtac caaatgcggg 2340 acaacgtaag cactacattt cgctcatcgc cagcccagtc gggcggcgag ttccatagcg 2400 ttaaggtttc atttagcgcc tcaaatagat cctgttcagg aaccggatca aagagttcct 2460 ccgccgctgg acctaccaag gcaacgctat gttctcttgc ttttgtcagc aagatagcca 2520 gatcaatgtc gatcgtggct ggctcgaaga tacctgcaag aatgtcattg cgctgccatt 2580 ctccaaattg cagttcgcgc ttagctggat aacgccacgg aatgatgtcg tcgtgcacaa 2640 caatggtgac ttctacagcg cggagaatct cgctctctcc aggggaagcc gaagtttcca 2700 aaaggtcgtt gatcaaagct cgccgcgttg tttcatcaag ccttacggtc accgtaacca 2760 gcaaatcaat atcactgtgt ggcttcaggc cgccatccac tgcggagccg tacaaatgta 2820 cggccagcaa cgtcggttcg agatggcgct cgatgacgcc aactacctct gatagttgag 2880 tcgatacttc ggcgatcacc gcttccctca tactcttcct ttttcaatat tattgaagca 2940 tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 3000 aaatagctag ctcactcggt cgctacgctc cgggcgtgag actgcggcgg gcgctgcgga 3060 cacatacaaa gttacccaca gattccgtgg ataagcaggg gactaacatg tgaggcaaaa 3120 cagcagggcc gcgccggtgg cgtttttcca taggctccgc cctcctgcca gagttcacat 3180 aaacagacgc ttttccggtg catctgtggg agccgtgagg ctcaaccatg aatctgacag 3240 tacgggcgaa acccgacagg acttaaagat ccccaccgtt tccggcgggt cgctccctct 3300 tgcgctctcc tgttccgacc ctgccgttta ccggatacct gttccgcctt tctcccttac 3360 gggaagtgtg gcgctttctc atagctcaca cactggtatc tcggctcggt gtaggtcgtt 3420 cgctccaagc tgggctgtaa gcaagaactc cccgttcagc ccgactgctg cgccttatcc 3480 ggtaactgtt cacttgagtc caacccggaa aagcacggta aaacgccact ggcagcagcc 3540 attggtaact gggagttcgc agaggatttg tttagctaaa cacgcggttg ctcttgaagt 3600 gtgcgccaaa gtccggctac actggaagga cagatttggt tgctgtgctc tgcgaaagcc 3660 agttaccacg gttaagcagt tccccaactg acttaacctt cgatcaaacc acctccccag 3720 gtggtttttt cgtttacagg gcaaaagatt acgcgcagaa aaaaaggatc tcaagaagat 3780 cctttgatct tttctactga accgctctag atttcagtgc aatttatctc ttcaaatgta 3840 gcacctgaag tcagccccat acgatataag ttgtaattct catgttagtc atgccccgcg 3900 cccaccggaa ggagctgact gggttgaagg ctctcaaggg catcggtcga gatcccggtg 3960 cctaatgagt gagctaactt acattaattg cgttgcgctc actgcccgct ttccagtcgg 4020 gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc 4080 gtattgggcg ccagggtggt ttttcttttc accagtgaga cgggcaacag ctgattgccc 4140 ttcaccgcct ggccctgaga gagttgcagc aagcggtcca cgctggtttg ccccagcagg 4200 cgaaaatcct gtttgatggt ggttaacggc gggatataac atgagctgtc ttcggtatcg 4260 tcgtatccca ctaccgagat gtccgcacca acgcgcagcc cggactcggt aatggcgcgc 4320 attgcgccca gcgccatctg atcgttggca accagcatcg cagtgggaac gatgccctca 4380 ttcagcattt gcatggtttg ttgaaaaccg gacatggcac tccagtcgcc ttcccgttcc 4440 gctatcggct gaatttgatt gcgagtgaga tatttatgcc agccagccag acgcagacgc 4500 gccgagacag aacttaatgg gcccgctaac agcgcgattt gctggtgacc caatgcgacc 4560 agatgctcca cgcccagtcg cgtaccgtct tcatgggaga aaataatact gttgatgggt 4620 gtctggtcag agacatcaag aaataacgcc ggaacattag tgcaggcagc ttccacagca 4680 atggcatcct ggtcatccag cggatagtta atgatcagcc cactgacgcg ttgcgcgaga 4740 agattgtgca ccgccgcttt acaggcttcg acgccgcttc gttctaccat cgacaccacc 4800 acgctggcac ccagttgatc ggcgcgagat ttaatcgccg cgacaatttg cgacggcgcg 4860 tgcagggcca gactggaggt ggcaacgcca atcagcaacg actgtttgcc cgccagttgt 4920 tgtgccacgc ggttgggaat gtaattcagc tccgccatcg ccgcttccac tttttcccgc 4980 gttttcgcag aaacgtggct ggcctggttc accacgcggg aaacggtctg ataagagaca 5040 ccggcatact ctgcgacatc gtataacgtt actggtttca cattcaccac cctgaattga 5100 ctctcttccg ggcgctatca tgccataccg cgaaaggttt tgcgccattc gatggtgtcc 5160 gggatctcga cgctctccct tatgcgactc ctgcattagg aaattaatac gactcactat 5220 a 5221 <210> SEQ ID NO 121 <211> LENGTH: 10922 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL8225-budA::thlA-phaB, plasmid <400> SEQUENCE: 121 aaactccttt ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga 60 gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta 120 atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa 180 gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact 240 gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca 300 tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt 360 accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg 420 ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag 480 cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta 540 agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat 600 ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg 660 tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc 720 ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac 780 cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc 840 gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca gggccccctg cttcggggtc 900 attatagcga ttttttcggt atatccatcc tttttcgcac gatatacagg attttgccaa 960 agggttcgtg tagactttcc ttggtgtatc caacggcgtc agccgggcag gataggtgaa 1020 gtaggcccac ccgcgagcgg gtgttccttc ttcactgtcc cttattcgca cctggcggtg 1080 ctcaacggga atcctgctct gcgaggctgg ccggctaccg ccggcgtaac agatgagggc 1140 aagcggatgg ctgatgaaac caagccaacc aggaagggca gcccacctat caaggtgtac 1200 tgccttccag acgaacgaag agcgattgag gaaaaggcgg cggcggccgg catgagcctg 1260 tcggcctacc tgctggccgt cggccagggc tacaaaatca cgggcgtcgt ggactatgag 1320 cacgtccgcg agctggcccg catcaatggc gacctgggcc gcctgggcgg cctgctgaaa 1380 ctctggctca ccgacgaccc gcgcacggcg cggttcggtg atgccacgat cctcgccctg 1440 ctggcgaaga tcgaagagaa gcaggacgag cttggcaagg tcatgatggg cgtggtccgc 1500 ccgagggcag agccatgact tttttagccg ctaaaacggc cggggggtgc gcgtgattgc 1560 caagcacgtc cccatgcgct ccatcaagaa gagcgacttc gcggagctgg tgaagtacat 1620 caccgacgag caaggcaaga ccgatcgggc cccctgcagg ataaaaaaat tgtagataaa 1680 ttttataaaa tagttttatc tacaattttt ttatcaggaa acagctatga ccgcggccgc 1740 ggcgccaagc ttagaaaaat ataaataaga agtagcttta agagaattaa attattaaga 1800 aaagcaaagg tgtttaaaaa ataaattttt aaacaccttt gcttttctta aattataaat 1860 aagataaaaa agaatcctga ataaaataaa aaggggtgtc tcaaaatttt attttgagac 1920 gacccctttt tattctatat gtcgatgcta tagctgagat cgtggaattc ttgttagcta 1980 ccagattcac atttaagttg tttctctaaa ccacagatta tcaattcaag tccaaaaaga 2040 aatgctggtt ctgcgccttg atgatcaaat aactctattg cttgtcttaa caatggaggc 2100 attgaatctg ttgttggtgt ttctctttcc tcttttgcaa cttgatgttc ttgatcctcc 2160 aatacgcaac ctaaagtaaa atgtcctaca gcacttagtg cgtataaggc attttctaaa 2220 ctaaaaccct gttgacataa gaatgctaat tgattttcta atgtttcata ttgtttttca 2280 gttggtctag ttcctaaatg tactttagcc ccatctctat gtgataatag agcacaacga 2340 aaagatttag cgttattcct aagaaaatct tgccatgatt caccttctaa aggacaaaag 2400 tgagtgtgat gtctatctaa catttcaata gctaaggcgt caagtaaagc tctcttattc 2460 ttcacatgcc aatacaacgt aggttgttct actccaagtt tctgagctaa ctttcttgta 2520 gttagtcctt ctattccaac ttcatttagt aattccaatg cactattgat aactttactt 2580 ttatcaagtc tagacatcat ttaatatcct cctcttcaat atatttaagt cgactgatcg 2640 gatcctgatc ggagctccca tggcggccgg tcgatatcga tgtgtagtag cctgtgaaat 2700 aagtaaggaa aaaaaagaag taagtgttat atatgatgat tattttgtag atgtagatag 2760 gataatagaa tccatagaaa atataggtta tacagttata taaaaattac tttaaaatct 2820 atcattgata gggtaaaata taaatcgtat aaagttgtgt aatttttaag gaggtgtgtt 2880 acagacgtcc gcgagagacc ttaaatatat tgaagaggag gaaatacata tggtttcaag 2940

atatgttcca gatatgggag atttaatatg ggttgatttt gatccaacaa aaggatcaga 3000 acaagcagga catagaccag cagttgtttt atcaccattt atgtataata ataaaacagg 3060 aatgtgttta tgtgttccat gtacaacaca atcaaaagga tatccatttg aagttgtttt 3120 atcaggacaa gaaagagatg gagttgcatt agcagatcaa gttaaatcaa tagcatggag 3180 agcaagagga gcaacaaaaa aaggaacagt tgcaccagaa gaattacaat taataaaagc 3240 aaaaataaat gttttaatag gataatgtta ttaagctagc ataaaaataa gaagcctgca 3300 tttgcaggct tcttattttt atggcgcgcc gttctgaatc cttagctaat ggttcaacag 3360 gtaactatga cgaagatagc accctggata agtctgtaat ggattctaag gcatttaatg 3420 aagacgtgta tataaaatgt gctaatgaaa aagaaaatgc gttaaaagag cctaaaatga 3480 gttcaaatgg ttttgaaatt gattggtagt ttaatttaat atattttttc tattggctat 3540 ctcgatacct atagaatctt ctgttcactt ttgtttttga aatataaaaa ggggcttttt 3600 agcccctttt ttttaaaact ccggaggagt ttcttcattc ttgatactat acgtaactat 3660 tttcgatttg acttcattgt caattaagct agtaaaatca atggttaaaa aacaaaaaac 3720 ttgcattttt ctacctagta atttataatt ttaagtgtcg agtttaaaag tataatttac 3780 caggaaagga gcaagttttt taataaggaa aaatttttcc ttttaaaatt ctatttcgtt 3840 atatgactaa ttataatcaa aaaaatgaaa ataaacaaga ggtaaaaact gctttagaga 3900 aatgtactga taaaaaaaga aaaaatccta gatttacgtc atacatagca cctttaacta 3960 ctaagaaaaa tattgaaagg acttccactt gtggagatta tttgtttatg ttgagtgatg 4020 cagacttaga acattttaaa ttacataaag gtaatttttg cggtaataga ttttgtccaa 4080 tgtgtagttg gcgacttgct tgtaaggata gtttagaaat atctattctt atggagcatt 4140 taagaaaaga agaaaataaa gagtttatat ttttaactct tacaactcca aatgtaaaaa 4200 gttatgatct taattattct attaaacaat ataataaatc ttttaaaaaa ttaatggagc 4260 gtaaggaagt taaggatata actaaaggtt atataagaaa attagaagta acttaccaaa 4320 aggaaaaata cataacaaag gatttatgga aaataaaaaa agattattat caaaaaaaag 4380 gacttgaaat tggtgattta gaacctaatt ttgatactta taatcctcat tttcatgtag 4440 ttattgcagt taataaaagt tattttacag ataaaaatta ttatataaat cgagaaagat 4500 ggttggaatt atggaagttt gctactaagg atgattctat aactcaagtt gatgttagaa 4560 aagcaaaaat taatgattat aaagaggttt acgaacttgc gaaatattca gctaaagaca 4620 ctgattattt aatatcgagg ccagtatttg aaatttttta taaagcatta aaaggcaagc 4680 aggtattagt ttttagtgga ttttttaaag atgcacacaa attgtacaag caaggaaaac 4740 ttgatgttta taaaaagaaa gatgaaatta aatatgtcta tatagtttat tataattggt 4800 gcaaaaaaca atatgaaaaa actagaataa gggaacttac ggaagatgaa aaagaagaat 4860 taaatcaaga tttaatagat gaaatagaaa tagattaaag tgtaactata ctttatatat 4920 atatgattaa aaaaataaaa aacaacagcc tattaggttg ttgtttttta ttttctttat 4980 taattttttt aatttttagt ttttagttct tttttaaaat aagtttcagc ctctttttca 5040 atatttttta aagaaggagt atttgcatga attgcctttt ttctaacaga cttaggaaat 5100 attttaacag tatcttcttg cgccggtgat tttggaactt cataacttac taatttataa 5160 ttattatttt cttttttaat tgtaacagtt gcaaaagaag ctgaacctgt tccttcaact 5220 agtttatcat cttcaatata atattcttga cctatatagt ataaatatat ttttattata 5280 tttttacttt tttctgaatc tattatttta taatcataaa aagttttacc accaaaagaa 5340 ggttgtactc cttctggtcc aacatatttt tttactatat tatctaaata atttttggga 5400 actggtgttg taatttgatt aatcgaacaa ccagttatac ttaaaggaat tataactata 5460 aaaatatata ggattatctt tttaaatttc attattggcc tcctttttat taaatttatg 5520 ttaccataaa aaggacataa cgggaatatg tagaatattt ttaatgtaga caaaatttta 5580 cataaatata aagaaaggaa gtgtttgttt aaattttata gcaaactatc aaaaattagg 5640 gggataaaaa tttatgaaaa aaaggttttc gatgttattt ttatgtttaa ctttaatagt 5700 ttgtggttta tttacaaatt cggccggcct acctcctcgt ataaataaga tgtttttgtt 5760 ttgcttgata ctactttttc ttcacaggaa aatatacttc agtaacaaga tctttaggaa 5820 tggtgacttg gtgggggtca gttacatata cttcatatgg tgggtttgta agtttatatc 5880 cttcattttc tacccattcc ctcaacttag catatacaga gatgttaatt ctgaatatga 5940 gccccttaaa acagacttcg cacaaaggac tccaggcaag tatcttgttc cctttacaat 6000 ctcctttatc ggaatggcaa gttctgtatc attgccagaa ggattgtatt cagcgctgtg 6060 ataaatagtt attggcttac caagaaagtc aattacaaaa atatatataa agaaagcaaa 6120 gctacatata ttaaagcatt taaggtaaaa ctaaaaatat tataaaaatg aaattatttt 6180 ttctcatagc taaagttaca taatacgagg aggatttata atgaaaaaag taataggaat 6240 tataagtatt gtactatttg tactcgtagc acttcaatcc tgtgctgcag gagtaggaaa 6300 tgcattaagt aataacaaag aagctagtgg atctgctgga ttatttttat ctgtatgtat 6360 gcttattgct ggaataatag caataatatc aaaatatagt aaaggtatga ctataacagc 6420 tatagtattt tatttgttag cttttgttgt agggattgct aatgttgggc atttttcaga 6480 tttgcaaatt tggtcaatca ttaacttgat atttgctgga ctattgatat ttcatttgct 6540 taaaaataag caattatata atagcagtgg gaaaaagtag aatcatatat tgtaattatt 6600 tttaattatg ttggcaaaat tgaaattgtc actgaaacac ctctaaatgt tttaaataca 6660 tatgtttaat tattgtgaca gattctaata gtagaaagta gaaatttgct atgttataat 6720 gacatagagg tgaatgtaat atgaaagaag ttgtaatagc tagtgcagta agaacagcga 6780 ttggatctta tggaaagtct cttaaggatg taccagcagt agatttagga gctacagcta 6840 taaaggaagc agttaaaaaa gcaggaataa aaccagagga tgttaatgaa gtcattttag 6900 gaaatgttct tcaagcaggt ttaggacaga atccagcaag acaggcatct tttaaagcag 6960 gattaccagt tgaaattcca gctatgacta ttaataaggt ttgtggttca ggacttagaa 7020 cagttagctt agcagcacaa attataaaag caggagatgc tgacgtaata atagcaggtg 7080 gtatggaaaa tatgtctaga gctccttact tagcgaataa cgctagatgg ggatatagaa 7140 tgggaaacgc taaatttgtt gatgaaatga tcactgacgg attgtgggat gcatttaatg 7200 attaccacat gggaataaca gcagaaaaca tagctgagag atggaacatt tcaagagaag 7260 aacaagatga gtttgctctt gcatcacaaa aaaaagctga agaagctata aaatcaggtc 7320 aatttaaaga tgaaatagtt cctgtagtaa ttaaaggcag aaagggagaa actgtagttg 7380 atacagatga gcaccctaga tttggatcaa ctatagaagg acttgcaaaa ttaaaacctg 7440 ccttcaaaaa agatggaaca gttacagctg gtaatgcatc aggattaaat gactgtgcag 7500 cagtacttgt aatcatgagt gcagaaaaag ctaaagagct tggagtaaaa ccacttgcta 7560 agatagtttc ttatggttca gcaggagttg acccagcaat aatgggatat ggacctttct 7620 atgcaacaaa agcagctatt gaaaaagcag gttggacagt tgatgaatta gatttaatag 7680 aatcaaatga agcttttgca gctcaaagtt tagcagtagc aaaagattta aaatttgata 7740 tgaataaagt aaatgtaaat ggaggagcta ttgcccttgg tcatccaatt ggagcatcag 7800 gtgcaagaat actcgttact cttgtacacg caatgcaaaa aagagatgca aaaaaaggct 7860 tagcaacttt atgtataggt ggcggacaag gaacagcaat attgctagaa aagtgctagg 7920 aattcaggag gtatagcata tgacacaaag aatagcatac gtaacaggtg gtatgggtgg 7980 tataggaact gcaatatgtc aaagattagc aaaagatgga tttagagttg tagctggatg 8040 cggaccaaat agtcctagaa gagaaaagtg gttagaacaa caaaaagcac ttggatttga 8100 tttcatagct tctgaaggta acgtagcaga ttgggactca actaaaactg cttttgataa 8160 agttaaatct gaagttggtg aagttgatgt attaataaat aatgcaggta ttactagaga 8220 tgtagtattt agaaagatga caagagctga ctgggatgca gtaatagata ctaatcttac 8280 tagtcttttc aatgtaacta agcaggtaat tgatggtatg gcagatagag gttggggtag 8340 aatagtaaat attagttcag ttaatggaca aaaaggtcag tttggacaga caaattattc 8400 tacagctaaa gcaggtcttc atggttttac aatggcttta gcacaggaag ttgctacaaa 8460 aggtgttaca gttaacactg ttagtccagg atatattgct actgacatgg taaaggctat 8520 aagacaagat gttcttgata aaattgttgc tacaatacca gtaaagagat taggacttcc 8580 tgaagagata gcatctattt gtgcatggtt atcaagtgaa gaatcaggat tctcaactgg 8640 tgctgatttt tcattaaacg gtggtttaca catgggataa taccgttcgt ataatgtatg 8700 ctatacgaag ttatccttag aagcaaactt aagagtgtgt tgatagtgca gtatcttaaa 8760 attttgtgta taataggaat tgaagttaaa ttagatgcta aaaatttgta attaagaagg 8820 agggattcgt catgttggta ttccaaatgc gtaatgtaga taaaacatct actgttttga 8880 aacagactaa aaacagtgat tacgcagata aataaatacg ttagattaat tcctaccagt 8940 gactaatctt atgacttttt aaacagataa ctaaaattac aaacaaatcg tttaacttct 9000 gtatttattt acagatgtaa tcacttcagg agtaattaca tgaacaaaaa tataaaatat 9060 tctcaaaact ttttaacgag tgaaaaagta ctcaaccaaa taataaaaca attgaattta 9120 aaagaaaccg ataccgttta cgaaattgga acaggtaaag ggcatttaac gacgaaactg 9180 gctaaaataa gtaaacaggt aacgtctatt gaattagaca gtcatctatt caacttatcg 9240 tcagaaaaat taaaactgaa cattcgtgtc actttaattc accaagatat tctacagttt 9300 caattcccta acaaacagag gtataaaatt gttgggagta ttccttacca tttaagcaca 9360 caaattatta aaaaagtggt ttttgaaagc catgcgtctg acatctatct gattgttgaa 9420 gaaggattct acaagcgtac cttggatatt caccgaacac tagggttgct cttgcacact 9480 caagtctcga ttcagcaatt gcttaagctg ccagcggaat gctttcatcc taaaccaaaa 9540 gtaaacagtg tcttaataaa acttacccgc cataccacag atgttccaga taaatattgg 9600 aagctatata cgtactttgt ttcaaaatgg gtcaatcgag aatatcgtca actgtttact 9660 aaaaatcagt ttcatcaagc aatgaaacac gccaaagtaa acaatttaag taccattact 9720 tatgagcaag tattgtctat ttttaatagt tatctattat ttaacgggag gaaataattc 9780 tatgagtcgc ttttttaaat ttggaaagtt acacgttact aaagggaatg gagataaatt 9840 attagatata ctactgacag cttccaagaa gctaaagagg tcataacttc gtataatgta 9900 tgctatacga acggtaagta ttgatagaaa aaaacactag acagtgctaa taacaatgtc 9960 tagtgctttt tatcttgctc aattttttca ttgagttcat ttaagtaagt ccacctgtcc 10020 atcttttcgt ctagctcttt ttccagtgaa ttcttttcgg ataagagatc ttcaagaagt 10080 gcataatcag atgaagcagc ttccatttct attttctttt cagatataga tttttctaga 10140 tgttcaatta cctcatctat tttgtcaaac tccatttgtt ctgcataggt aaattttaga 10200 ggcttttctt tttgcaactt atagttgttt ttagctgtat ttttcttaga gcttattttt 10260 tcctctgata tttttgcagt tttgtgaaaa taggaatagt ttcctgtata ttgagtgatt 10320 ttaccgtttc cttcaaaaga aaatatttta tcaactgttt tgtcaaggaa gtacctgtca 10380 tgagatacag ctataacagc tccttcaaaa tcgttaatat aatcttctag gattgtaagt 10440

gtttctatat ccagatcatt tgttggttcg tccagcaaaa gtacattagg gtaattcatc 10500 agtattttta gaagatataa tcttcttcgt tctcctcctg aaagttttcc aaggggagtc 10560 cattgaactg aaggttcaaa taaaaaattt tcaagtacag cagaagcact tattttttca 10620 cccgatgaag ttgacgcata ttctgatgtc ccacgtatgt attcaattac cctttcgttc 10680 atatccatat cagaaattcc ctgagaatag tatcctatct ttactgtttc acctatatct 10740 atagtgccgc tgtccggcag aattttttga actaaaatat tcataagagt ggatttacca 10800 cttccattag gtccaataat acctattctg tcattattta gtatgttata agtgaaattt 10860 ttaattaatg tcttttcacc aaaacttttg cttatgttat ccaggtttat gactttttgt 10920 tt 10922 <210> SEQ ID NO 122 <211> LENGTH: 43 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN01 <400> SEQUENCE: 122 atttacaaat tcggccggcc tacctcctcg tataaataag atg 43 <210> SEQ ID NO 123 <211> LENGTH: 43 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN02 <400> SEQUENCE: 123 ctagctatta caacttcttt catattacat tcacctctat gtc 43 <210> SEQ ID NO 124 <211> LENGTH: 43 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN03 <400> SEQUENCE: 124 gacatagagg tgaatgtaat atgaaagaag ttgtaatagc tag 43 <210> SEQ ID NO 125 <211> LENGTH: 49 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN04mod <400> SEQUENCE: 125 gtatagcata cattatacga acggtattat cccatgtgta aaccaccgt 49 <210> SEQ ID NO 126 <211> LENGTH: 48 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN05mod <400> SEQUENCE: 126 ttcgtataat gtatgctata cgaagttatc cttagaagca aacttaag 48 <210> SEQ ID NO 127 <211> LENGTH: 46 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN06 <400> SEQUENCE: 127 gtctagtgtt tttttctatc aatactctag ataccgttcg tatagc 46 <210> SEQ ID NO 128 <211> LENGTH: 46 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN07 <400> SEQUENCE: 128 tgtatgctat acgaacggta agtattgata gaaaaaaaca ctagac 46 <210> SEQ ID NO 129 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN08 <400> SEQUENCE: 129 caaaaaggag tttaaacaaa aagtcataaa cctggataac 40 <210> SEQ ID NO 130 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Og31f <400> SEQUENCE: 130 ccgtttctca caacaacaat accag 25 <210> SEQ ID NO 131 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Og32r <400> SEQUENCE: 131 aaaccacctt gacgatgaaa ccata 25 <210> SEQ ID NO 132 <211> LENGTH: 7951 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL8315-Pfdx-thlA-phaB-bld, plasmid <400> SEQUENCE: 132 cctgcaggat aaaaaaattg tagataaatt ttataaaata gttttatcta caattttttt 60 atcaggaaac agctatgacc gcggccgctc actatctgcg gaacctgcct ccttatctga 120 taaaaaatat tcgctgcatc tttgacttgt tattttcttt caaatgccta aaattatctt 180 ttaaaattat aacaaatgtg ataaaataca ggggatgaaa acattatcta aaaattaagg 240 aggtgttaca tatgaaagaa gttgtaatag ctagtgcagt aagaacagcg attggatctt 300 atggaaagtc tcttaaggat gtaccagcag tagatttagg agctacagct ataaaggaag 360 cagttaaaaa agcaggaata aaaccagagg atgttaatga agtcatttta ggaaatgttc 420 ttcaagcagg tttaggacag aatccagcaa gacaggcatc ttttaaagca ggattaccag 480 ttgaaattcc agctatgact attaataagg tttgtggttc aggacttaga acagttagct 540 tagcagcaca aattataaaa gcaggagatg ctgacgtaat aatagcaggt ggtatggaaa 600 atatgtctag agctccttac ttagcgaata acgctagatg gggatataga atgggaaacg 660 ctaaatttgt tgatgaaatg atcactgacg gattgtggga tgcatttaat gattaccaca 720 tgggaataac agcagaaaac atagctgaga gatggaacat ttcaagagaa gaacaagatg 780 agtttgctct tgcatcacaa aaaaaagctg aagaagctat aaaatcaggt caatttaaag 840 atgaaatagt tcctgtagta attaaaggca gaaagggaga aactgtagtt gatacagatg 900 agcaccctag atttggatca actatagaag gacttgcaaa attaaaacct gccttcaaaa 960 aagatggaac agttacagct ggtaatgcat caggattaaa tgactgtgca gcagtacttg 1020 taatcatgag tgcagaaaaa gctaaagagc ttggagtaaa accacttgct aagatagttt 1080 cttatggttc agcaggagtt gacccagcaa taatgggata tggacctttc tatgcaacaa 1140 aagcagctat tgaaaaagca ggttggacag ttgatgaatt agatttaata gaatcaaatg 1200 aagcttttgc agctcaaagt ttagcagtag caaaagattt aaaatttgat atgaataaag 1260 taaatgtaaa tggaggagct attgcccttg gtcatccaat tggagcatca ggtgcaagaa 1320 tactcgttac tcttgtacac gcaatgcaaa aaagagatgc aaaaaaaggc ttagcaactt 1380 tatgtatagg tggcggacaa ggaacagcaa tattgctaga aaagtgctag gaattcagga 1440 ggtatagcat atgacacaaa gaatagcata cgtaacaggt ggtatgggtg gtataggaac 1500 tgcaatatgt caaagattag caaaagatgg atttagagtt gtagctggat gcggaccaaa 1560 tagtcctaga agagaaaagt ggttagaaca acaaaaagca cttggatttg atttcatagc 1620 ttctgaaggt aacgtagcag attgggactc aactaaaact gcttttgata aagttaaatc 1680 tgaagttggt gaagttgatg tattaataaa taatgcaggt attactagag atgtagtatt 1740 tagaaagatg acaagagctg actgggatgc agtaatagat actaatctta ctagtctttt 1800 caatgtaact aagcaggtaa ttgatggtat ggcagataga ggttggggta gaatagtaaa 1860 tattagttca gttaatggac aaaaaggtca gtttggacag acaaattatt ctacagctaa 1920 agcaggtctt catggtttta caatggcttt agcacaggaa gttgctacaa aaggtgttac 1980 agttaacact gttagtccag gatatattgc tactgacatg gtaaaggcta taagacaaga 2040 tgttcttgat aaaattgttg ctacaatacc agtaaagaga ttaggacttc ctgaagagat 2100

agcatctatt tgtgcatggt tatcaagtga agaatcagga ttctcaactg gtgctgattt 2160 ttcattaaac ggtggtttac acatgggata agaaggagat atacatatga taaaagatac 2220 acttgttagt attacaaaag atttaaaact taaaactaat gttgaaaatg caaatcttaa 2280 aaattataaa gatgatagtt cttgttttgg agtatttgaa aatgttgaaa atgcaataag 2340 taatgcagta catgctcaaa aaattttatc tcttcattat acaaaagaac agagagaaaa 2400 aattataact gaaattagaa aagcagcttt agaaaataaa gaaatattag ctacaatgat 2460 tcttgaagaa actcacatgg gaagatatga agataaaata cttaaacatg aacttgtagc 2520 aaaatataca cctggaactg aagatttaac tacaactgct tggtcaggtg ataatggact 2580 tacagtagtt gaaatgagtc cttatggagt tataggagca attacacctt ctactaatcc 2640 aacagaaact gtaatatgta attcaattgg tatgattgca gctggaaata ctgtagtttt 2700 taatggtcat cctggagcta aaaaatgtgt agcatttgct gttgaaatga ttaataaagc 2760 tataattagt tgtggaggtc ctgaaaatct tgttacaact ataaaaaatc caacaatgga 2820 ttctcttgat gcaataatta aacatccttc aattaaactt ctttgtggta caggaggtcc 2880 aggaatggta aaaactcttc ttaattctgg taaaaaagct ataggagcag gtgctggaaa 2940 tcctccagta attgttgatg atacagcaga tatagaaaaa gctggtaaat caattattga 3000 aggatgtagt tttgataata atttaccatg tatagcagaa aaagaagtat ttgtttttga 3060 aaatgttgct gatgatttaa ttagtaatat gcttaaaaat aatgcagtaa taattaatga 3120 agatcaagtt tctaaactta tagatttagt attacagaaa aataatgaaa cacaggaata 3180 ttctattaat aaaaaatggg taggaaaaga tgcaaaatta tttcttgatg aaatagatgt 3240 agaatcacct tcaagtgtta aatgtataat ttgtgaagtt tctgcttcac atccatttgt 3300 aatgactgaa ttaatgatgc ctatacttcc aattgtaaga gttaaagata tagatgaagc 3360 aatagaatat gcaaaaattg ctgaacagaa tagaaaacat agtgcttata tttattctaa 3420 aaatatagat aatttaaata gatttgaaag agaaatagat acaactattt ttgttaaaaa 3480 tgcaaaatca tttgctggtg taggatatga agcagaaggt tttacaactt ttacaatagc 3540 tggaagtact ggtgaaggta ttacaagtgc aagaaatttt acaagacaga gaagatgtgt 3600 tttagcaggt taatctagag tcgacgtcac gcgtccatgg agatctcgag gcctgcagac 3660 atgcaagctt ggcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 3720 cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 3780 cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg cgctagcata 3840 aaaataagaa gcctgcattt gcaggcttct tatttttatg gcgcgccgcc attatttttt 3900 tgaacaattg acaattcatt tcttattttt tattaagtga tagtcaaaag gcataacagt 3960 gctgaataga aagaaattta cagaaaagaa aattatagaa tttagtatga ttaattatac 4020 tcatttatga atgtttaatt gaatacaaaa aaaaatactt gttatgtatt caattacggg 4080 ttaaaatata gacaagttga aaaatttaat aaaaaaataa gtcctcagct cttatatatt 4140 aagctaccaa cttagtatat aagccaaaac ttaaatgtgc taccaacaca tcaagccgtt 4200 agagaactct atctatagca atatttcaaa tgtaccgaca tacaagagaa acattaacta 4260 tatatattca atttatgaga ttatcttaac agatataaat gtaaattgca ataagtaaga 4320 tttagaagtt tatagccttt gtgtattgga agcagtacgc aaaggctttt ttatttgata 4380 aaaattagaa gtatatttat tttttcataa ttaatttatg aaaatgaaag ggggtgagca 4440 aagtgacaga ggaaagcagt atcttatcaa ataacaaggt attagcaata tcattattga 4500 ctttagcagt aaacattatg acttttatag tgcttgtagc taagtagtac gaaaggggga 4560 gctttaaaaa gctccttgga atacatagaa ttcataaatt aatttatgaa aagaagggcg 4620 tatatgaaaa cttgtaaaaa ttgcaaagag tttattaaag atactgaaat atgcaaaata 4680 cattcgttga tgattcatga taaaacagta gcaacctatt gcagtaaata caatgagtca 4740 agatgtttac ataaagggaa agtccaatgt attaattgtt caaagatgaa ccgatatgga 4800 tggtgtgcca taaaaatgag atgttttaca gaggaagaac agaaaaaaga acgtacatgc 4860 attaaatatt atgcaaggag ctttaaaaaa gctcatgtaa agaagagtaa aaagaaaaaa 4920 taatttattt attaatttaa tattgagagt gccgacacag tatgcactaa aaaatatatc 4980 tgtggtgtag tgagccgata caaaaggata gtcactcgca ttttcataat acatcttatg 5040 ttatgattat gtgtcggtgg gacttcacga cgaaaaccca caataaaaaa agagttcggg 5100 gtagggttaa gcatagttga ggcaactaaa caatcaagct aggatatgca gtagcagacc 5160 gtaaggtcgt tgtttaggtg tgttgtaata catacgctat taagatgtaa aaatacggat 5220 accaatgaag ggaaaagtat aatttttgga tgtagtttgt ttgttcatct atgggcaaac 5280 tacgtccaaa gccgtttcca aatctgctaa aaagtatatc ctttctaaaa tcaaagtcaa 5340 gtatgaaatc ataaataaag tttaattttg aagttattat gatattatgt ttttctatta 5400 aaataaatta agtatataga atagtttaat aatagtatat acttaatgtg ataagtgtct 5460 gacagtgtca cagaaaggat gattgttatg gattataagc ggccggccag tgggcaagtt 5520 gaaaaattca caaaaatgtg gtataatatc tttgttcatt agagcgataa acttgaattt 5580 gagagggaac ttagatggta tttgaaaaaa ttgataaaaa tagttggaac agaaaagagt 5640 attttgacca ctactttgca agtgtacctt gtacctacag catgaccgtt aaagtggata 5700 tcacacaaat aaaggaaaag ggaatgaaac tatatcctgc aatgctttat tatattgcaa 5760 tgattgtaaa ccgccattca gagtttagga cggcaatcaa tcaagatggt gaattgggga 5820 tatatgatga gatgatacca agctatacaa tatttcacaa tgatactgaa acattttcca 5880 gcctttggac tgagtgtaag tctgacttta aatcattttt agcagattat gaaagtgata 5940 cgcaacggta tggaaacaat catagaatgg aaggaaagcc aaatgctccg gaaaacattt 6000 ttaatgtatc tatgataccg tggtcaacct tcgatggctt taatctgaat ttgcagaaag 6060 gatatgatta tttgattcct atttttacta tggggaaata ttataaagaa gataacaaaa 6120 ttatacttcc tttggcaatt caagttcatc acgcagtatg tgacggattt cacatttgcc 6180 gttttgtaaa cgaattgcag gaattgataa atagttaact tcaggtttgt ctgtaactaa 6240 aaacaagtat ttaagcaaaa acatcgtaga aatacggtgt tttttgttac cctaagttta 6300 aactcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag 6360 cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa 6420 tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag 6480 agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 6540 ttcttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat 6600 acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta 6660 ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg 6720 gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc 6780 gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 6840 gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc 6900 tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt 6960 caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct 7020 tttgctggcc ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc 7080 gtattaccgc ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg 7140 agtcagtgag cgaggaagcg gaagagcgcc caatacgcag ggccccctgc ttcggggtca 7200 ttatagcgat tttttcggta tatccatcct ttttcgcacg atatacagga ttttgccaaa 7260 gggttcgtgt agactttcct tggtgtatcc aacggcgtca gccgggcagg ataggtgaag 7320 taggcccacc cgcgagcggg tgttccttct tcactgtccc ttattcgcac ctggcggtgc 7380 tcaacgggaa tcctgctctg cgaggctggc cggctaccgc cggcgtaaca gatgagggca 7440 agcggatggc tgatgaaacc aagccaacca ggaagggcag cccacctatc aaggtgtact 7500 gccttccaga cgaacgaaga gcgattgagg aaaaggcggc ggcggccggc atgagcctgt 7560 cggcctacct gctggccgtc ggccagggct acaaaatcac gggcgtcgtg gactatgagc 7620 acgtccgcga gctggcccgc atcaatggcg acctgggccg cctgggcggc ctgctgaaac 7680 tctggctcac cgacgacccg cgcacggcgc ggttcggtga tgccacgatc ctcgccctgc 7740 tggcgaagat cgaagagaag caggacgagc ttggcaaggt catgatgggc gtggtccgcc 7800 cgagggcaga gccatgactt ttttagccgc taaaacggcc ggggggtgcg cgtgattgcc 7860 aagcacgtcc ccatgcgctc catcaagaag agcgacttcg cggagctggt gaagtacatc 7920 accgacgagc aaggcaagac cgatcgggcc c 7951 <210> SEQ ID NO 133 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: bld-phaB-F1, forward <400> SEQUENCE: 133 acatgggata agaaggagat atacatatga taaaag 36 <210> SEQ ID NO 134 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: bld-pMTL-R1, forward <400> SEQUENCE: 134 cgtcgactct agattaacct gctaaaacac atcttc 36 <210> SEQ ID NO 135 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL-bld-F1, forward <400> SEQUENCE: 135 gtgttttagc aggttaatct agagtcgacg tcacgc 36 <210> SEQ ID NO 136 <211> LENGTH: 1179 <212> TYPE: DNA <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: thlA <400> SEQUENCE: 136

atgaaagaag ttgtaatagc tagtgcagta agaacagcga ttggatctta tggaaagtct 60 cttaaggatg taccagcagt agatttagga gctacagcta taaaggaagc agttaaaaaa 120 gcaggaataa aaccagagga tgttaatgaa gtcattttag gaaatgttct tcaagcaggt 180 ttaggacaga atccagcaag acaggcatct tttaaagcag gattaccagt tgaaattcca 240 gctatgacta ttaataaggt ttgtggttca ggacttagaa cagttagctt agcagcacaa 300 attataaaag caggagatgc tgacgtaata atagcaggtg gtatggaaaa tatgtctaga 360 gctccttact tagcgaataa cgctagatgg ggatatagaa tgggaaacgc taaatttgtt 420 gatgaaatga tcactgacgg attgtgggat gcatttaatg attaccacat gggaataaca 480 gcagaaaaca tagctgagag atggaacatt tcaagagaag aacaagatga gtttgctctt 540 gcatcacaaa aaaaagctga agaagctata aaatcaggtc aatttaaaga tgaaatagtt 600 cctgtagtaa ttaaaggcag aaagggagaa actgtagttg atacagatga gcaccctaga 660 tttggatcaa ctatagaagg acttgcaaaa ttaaaacctg ccttcaaaaa agatggaaca 720 gttacagctg gtaatgcatc aggattaaat gactgtgcag cagtacttgt aatcatgagt 780 gcagaaaaag ctaaagagct tggagtaaaa ccacttgcta agatagtttc ttatggttca 840 gcaggagttg acccagcaat aatgggatat ggacctttct atgcaacaaa agcagctatt 900 gaaaaagcag gttggacagt tgatgaatta gatttaatag aatcaaatga agcttttgca 960 gctcaaagtt tagcagtagc aaaagattta aaatttgata tgaataaagt aaatgtaaat 1020 ggaggagcta ttgcccttgg tcatccaatt ggagcatcag gtgcaagaat actcgttact 1080 cttgtacacg caatgcaaaa aagagatgca aaaaaaggct tagcaacttt atgtataggt 1140 ggcggacaag gaacagcaat attgctagaa aagtgctag 1179 <210> SEQ ID NO 137 <211> LENGTH: 849 <212> TYPE: DNA <213> ORGANISM: Clostridium kluyveri <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: hbd1 <400> SEQUENCE: 137 atgagtatta aaagtgtagc ggttttaggt agtggaacta tgtctcgtgg aattgtgcag 60 gcttttgcag aagcaggtat agatgtaatt atccgtggaa gaactgaagg tagtattgga 120 aaaggtctag cagcagtaaa gaaagcttat gataaaaaag tatcaaaggg gaaaatttcc 180 caggaagatg ctgataaaat agttggaaga gtaagtacaa caactgaact tgaaaaattg 240 gctgattgtg atcttataat agaagcagca tcagaggata tgaatataaa gaaagactat 300 tttggaaaat tagaagaaat atgcaagcct gaaacaattt ttgctactaa tacttcttca 360 ttatctataa ctgaagtagc aacagctaca aagagaccag ataaattcat aggaatgcat 420 ttctttaatc cagcaaatgt tatgaaatta gttgaaatca taagaggtat gaatacttca 480 caagaaactt ttgatattat aaaagaagct tccattaaaa taggaaaaac tcctgtagaa 540 gttgcagaag ctccaggatt tgttgtaaac aagatattag taccaatgat caatgaagca 600 gtaggaattt tggcagaagg aatagcttca gcagaagata tcgatacagc tatgaaatta 660 ggcgctaatc acccaatggg tcctttagca ttaggagatc ttattggact tgatgtagtt 720 cttgcagtta tggatgtact ttatagtgaa actggagatt caaaatatag agctcataca 780 ttacttagaa aatatgtaag agcaggatgg cttggaagaa aatcaggaaa aggattcttc 840 gcttattaa 849 <210> SEQ ID NO 138 <211> LENGTH: 176 <212> TYPE: DNA <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: ferredoxin promoter <400> SEQUENCE: 138 ggccgcgctc actatctgcg gaacctgcct ccttatctga taaaaaatat tcgctgcatc 60 tttgacttgt tattttcttt caaatgccta aaattatctt ttaaaattat aacaaatgtg 120 ataaaataca ggggatgaaa acattatcta aaaattaagg aggtgttaca gaattc 176 <210> SEQ ID NO 139 <211> LENGTH: 474 <212> TYPE: DNA <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pyruvate-ferredoxin oxidoreductase promoter <400> SEQUENCE: 139 aaaatagttg ataataatgc agagttataa acaaaggtga aaagcattac ttgtattctt 60 ttttatatat tattataaat taaaatgaag ctgtattaga aaaaatacac acctgtaata 120 taaaatttta aattaatttt taattttttc aaaatgtatt ttacatgttt agaattttga 180 tgtatattaa aatagtagaa tacataagat acttaattta attaaagata gttaagtact 240 tttcaatgtg cttttttaga tgtttaatac aaatctttaa ttgtaaaaga aatgctgtac 300 tatttactgt actagtgacg ggattaaact gtattaatta taaataaaaa ataagtacag 360 ttgtttaaaa ttatattttg tattaaatct aatagtacga tgtaagttat tttatactat 420 tgctagttta ataaaaagat ttaattatat gcttgaaaag gagaggaatt cata 474 <210> SEQ ID NO 140 <211> LENGTH: 22 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: ribosome binding site rbs2 <400> SEQUENCE: 140 aaatagaaag gaggtgttac at 22 <210> SEQ ID NO 141 <211> LENGTH: 35 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Pfdx-F1, forward <400> SEQUENCE: 141 aaaggtctcc ggccgcgctc actatctgcg gaacc 35 <210> SEQ ID NO 142 <211> LENGTH: 38 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Pfdx-R1, reverse <400> SEQUENCE: 142 tttggtctcg aattctgtaa cacctcctta atttttag 38 <210> SEQ ID NO 143 <211> LENGTH: 40 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ppfor-F1, forward <400> SEQUENCE: 143 aaaggtctcc ggccgcaaaa tagttgataa taatgcagag 40 <210> SEQ ID NO 144 <211> LENGTH: 35 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ppfor-R1, reverse <400> SEQUENCE: 144 tttggtctcg aattcctctc cttttcaagc atata 35 <210> SEQ ID NO 145 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: hbd1-F1, forward <400> SEQUENCE: 145 aaaggtctcg aattcaaaga tctatgtcta ttaaatcagt tgcag 45 <210> SEQ ID NO 146 <211> LENGTH: 47 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: hbd1-R1, reverse <400> SEQUENCE: 146 tttggtctcc ctcctttcta tttctaatat gcgaaaaatc ctttacc 47 <210> SEQ ID NO 147 <211> LENGTH: 49 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: thlA-F1, forward <400> SEQUENCE: 147 aaaggtctca ggaggtgtta catatgaaag aagttgtaat agctagtgc 49 <210> SEQ ID NO 148 <211> LENGTH: 48 <212> TYPE: DNA

<213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: thlA-R1, reverse <400> SEQUENCE: 148 tttggtctcc tcgagtatgg atccctagca cttttctagc aatattgc 48 <210> SEQ ID NO 149 <211> LENGTH: 30 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ppfor-F2, forward <400> SEQUENCE: 149 aaacagctat gaccgcggcc gcaaaatagt 30 <210> SEQ ID NO 150 <211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ppfor-R2, reverse <400> SEQUENCE: 150 ttactcattg gattcctctc cttt 24 <210> SEQ ID NO 151 <211> LENGTH: 29 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ptb-Buk-F2, forward <400> SEQUENCE: 151 ggaatccaat gagtaaaaac tttgatgag 29 <210> SEQ ID NO 152 <211> LENGTH: 37 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Ptb-Buk-F2, reverse <400> SEQUENCE: 152 caggcctcga gatctcctag taaaccttag cttgttc 37 <210> SEQ ID NO 153 <211> LENGTH: 7884 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL82256-ptb-buk, plasmid <400> SEQUENCE: 153 gagatctcga ggcctgcaga catgcaagct tggcactggc cgtcgtttta caacgtcgtg 60 actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 120 gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 180 atggcgaatg gcgctagcat aaaaataaga agcctgcatt tgcaggcttc ttatttttat 240 ggcgcgccgt tctgaatcct tagctaatgg ttcaacaggt aactatgacg aagatagcac 300 cctggataag tctgtaatgg attctaaggc atttaatgaa gacgtgtata taaaatgtgc 360 taatgaaaaa gaaaatgcgt taaaagagcc taaaatgagt tcaaatggtt ttgaaattga 420 ttggtagttt aatttaatat attttttcta ttggctatct cgatacctat agaatcttct 480 gttcactttt gtttttgaaa tataaaaagg ggctttttag cccctttttt ttaaaactcc 540 ggaggagttt cttcattctt gatactatac gtaactattt tcgatttgac ttcattgtca 600 attaagctag taaaatcaat ggttaaaaaa caaaaaactt gcatttttct acctagtaat 660 ttataatttt aagtgtcgag tttaaaagta taatttacca ggaaaggagc aagtttttta 720 ataaggaaaa atttttcctt ttaaaattct atttcgttat atgactaatt ataatcaaaa 780 aaatgaaaat aaacaagagg taaaaactgc tttagagaaa tgtactgata aaaaaagaaa 840 aaatcctaga tttacgtcat acatagcacc tttaactact aagaaaaata ttgaaaggac 900 ttccacttgt ggagattatt tgtttatgtt gagtgatgca gacttagaac attttaaatt 960 acataaaggt aatttttgcg gtaatagatt ttgtccaatg tgtagttggc gacttgcttg 1020 taaggatagt ttagaaatat ctattcttat ggagcattta agaaaagaag aaaataaaga 1080 gtttatattt ttaactctta caactccaaa tgtaaaaagt tatgatctta attattctat 1140 taaacaatat aataaatctt ttaaaaaatt aatggagcgt aaggaagtta aggatataac 1200 taaaggttat ataagaaaat tagaagtaac ttaccaaaag gaaaaataca taacaaagga 1260 tttatggaaa ataaaaaaag attattatca aaaaaaagga cttgaaattg gtgatttaga 1320 acctaatttt gatacttata atcctcattt tcatgtagtt attgcagtta ataaaagtta 1380 ttttacagat aaaaattatt atataaatcg agaaagatgg ttggaattat ggaagtttgc 1440 tactaaggat gattctataa ctcaagttga tgttagaaaa gcaaaaatta atgattataa 1500 agaggtttac gaacttgcga aatattcagc taaagacact gattatttaa tatcgaggcc 1560 agtatttgaa attttttata aagcattaaa aggcaagcag gtattagttt ttagtggatt 1620 ttttaaagat gcacacaaat tgtacaagca aggaaaactt gatgtttata aaaagaaaga 1680 tgaaattaaa tatgtctata tagtttatta taattggtgc aaaaaacaat atgaaaaaac 1740 tagaataagg gaacttacgg aagatgaaaa agaagaatta aatcaagatt taatagatga 1800 aatagaaata gattaaagtg taactatact ttatatatat atgattaaaa aaataaaaaa 1860 caacagccta ttaggttgtt gttttttatt ttctttatta atttttttaa tttttagttt 1920 ttagttcttt tttaaaataa gtttcagcct ctttttcaat attttttaaa gaaggagtat 1980 ttgcatgaat tgcctttttt ctaacagact taggaaatat tttaacagta tcttcttgcg 2040 ccggtgattt tggaacttca taacttacta atttataatt attattttct tttttaattg 2100 taacagttgc aaaagaagct gaacctgttc cttcaactag tttatcatct tcaatataat 2160 attcttgacc tatatagtat aaatatattt ttattatatt tttacttttt tctgaatcta 2220 ttattttata atcataaaaa gttttaccac caaaagaagg ttgtactcct tctggtccaa 2280 catatttttt tactatatta tctaaataat ttttgggaac tggtgttgta atttgattaa 2340 tcgaacaacc agttatactt aaaggaatta taactataaa aatatatagg attatctttt 2400 taaatttcat tattggcctc ctttttatta aatttatgtt accataaaaa ggacataacg 2460 ggaatatgta gaatattttt aatgtagaca aaattttaca taaatataaa gaaaggaagt 2520 gtttgtttaa attttatagc aaactatcaa aaattagggg gataaaaatt tatgaaaaaa 2580 aggttttcga tgttattttt atgtttaact ttaatagttt gtggtttatt tacaaattcg 2640 gccggccgaa gcaaacttaa gagtgtgttg atagtgcagt atcttaaaat tttgtataat 2700 aggaattgaa gttaaattag atgctaaaaa tttgtaatta agaaggagtg attacatgaa 2760 caaaaatata aaatattctc aaaacttttt aacgagtgaa aaagtactca accaaataat 2820 aaaacaattg aatttaaaag aaaccgatac cgtttacgaa attggaacag gtaaagggca 2880 tttaacgacg aaactggcta aaataagtaa acaggtaacg tctattgaat tagacagtca 2940 tctattcaac ttatcgtcag aaaaattaaa actgaatact cgtgtcactt taattcacca 3000 agatattcta cagtttcaat tccctaacaa acagaggtat aaaattgttg ggagtattcc 3060 ttaccattta agcacacaaa ttattaaaaa agtggttttt gaaagccatg cgtctgacat 3120 ctatctgatt gttgaagaag gattctacaa gcgtaccttg gatattcacc gaacactagg 3180 gttgctcttg cacactcaag tctcgattca gcaattgctt aagctgccag cggaatgctt 3240 tcatcctaaa ccaaaagtaa acagtgtctt aataaaactt acccgccata ccacagatgt 3300 tccagataaa tattggaagc tatatacgta ctttgtttca aaatgggtca atcgagaata 3360 tcgtcaactg tttactaaaa atcagtttca tcaagcaatg aaacacgcca aagtaaacaa 3420 tttaagtacc gttacttatg agcaagtatt gtctattttt aatagttatc tattatttaa 3480 cgggaggaaa taattctatg agtcgctttt gtaaatttgg aaagttacac gttactaaag 3540 ggaatgtgtt taaactcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 3600 cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 3660 ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 3720 tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 3780 taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 3840 caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 3900 agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 3960 gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 4020 gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 4080 ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 4140 acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 4200 tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 4260 ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt 4320 ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga 4380 ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc agggccccct 4440 gcttcggggt cattatagcg attttttcgg tatatccatc ctttttcgca cgatatacag 4500 gattttgcca aagggttcgt gtagactttc cttggtgtat ccaacggcgt cagccgggca 4560 ggataggtga agtaggccca cccgcgagcg ggtgttcctt cttcactgtc ccttattcgc 4620 acctggcggt gctcaacggg aatcctgctc tgcgaggctg gccggctacc gccggcgtaa 4680 cagatgaggg caagcggatg gctgatgaaa ccaagccaac caggaagggc agcccaccta 4740 tcaaggtgta ctgccttcca gacgaacgaa gagcgattga ggaaaaggcg gcggcggccg 4800 gcatgagcct gtcggcctac ctgctggccg tcggccaggg ctacaaaatc acgggcgtcg 4860 tggactatga gcacgtccgc gagctggccc gcatcaatgg cgacctgggc cgcctgggcg 4920 gcctgctgaa actctggctc accgacgacc cgcgcacggc gcggttcggt gatgccacga 4980 tcctcgccct gctggcgaag atcgaagaga agcaggacga gcttggcaag gtcatgatgg 5040

gcgtggtccg cccgagggca gagccatgac ttttttagcc gctaaaacgg ccggggggtg 5100 cgcgtgattg ccaagcacgt ccccatgcgc tccatcaaga agagcgactt cgcggagctg 5160 gtgaagtaca tcaccgacga gcaaggcaag accgatcggg ccccctgcag gataaaaaaa 5220 ttgtagataa attttataaa atagttttat ctacaatttt tttatcagga aacagctatg 5280 accgcggccg caaaatagtt gataataatg cagagttata aacaaaggtg aaaagcatta 5340 cttgtattct tttttatata ttattataaa ttaaaatgaa gctgtattag aaaaaataca 5400 cacctgtaat ataaaatttt aaattaattt ttaatttttt caaaatgtat tttacatgtt 5460 tagaattttg atgtatatta aaatagtaga atacataaga tacttaattt aattaaagat 5520 agttaagtac ttttcaatgt gcttttttag atgtttaata caaatcttta attgtaaaag 5580 aaatgctgta ctatttactg tactagtgac gggattaaac tgtattaatt ataaataaaa 5640 aataagtaca gttgtttaaa attatatttt gtattaaatc taatagtacg atgtaagtta 5700 ttttatacta ttgctagttt aataaaaaga tttaattata tgcttgaaaa ggagaggaat 5760 ccaatgagta aaaactttga tgagttatta tcaagattaa aggaagttcc aacaaaaaaa 5820 gtggctgtag ccgtagcaca agatgaacca gtattagagg ctataaaaga agctacagaa 5880 aataacatcg cacaagcaat attggttggt gataaacaac aaatccatga aatcgcaaag 5940 aaaataaact tggacttatc tgattatgaa ataatggata ttaaagatcc aaagaaagca 6000 acattagaag cagtaaaatt agtttctagt ggtcatgcag atatgttaat gaaaggtcta 6060 gttgatactg caacattcct aagaagcgta ttaaacaaag aggttggtct tagaacagga 6120 aaattaatgt cccatgtagc tgtgtttgat gtggaaggtt gggatagact gttattttta 6180 actgatgcag catttaatac atatccagaa tttaaggata aagttggaat gataaataat 6240 gcagttgtag ttgctcatgc atgtggaata gatgttccaa gagtagcacc tatatgccca 6300 gttgaagttg taaatacaag tatgcaatca acagttgatg cagcattgtt agctaaaatg 6360 agtgacaggg ggcaaattaa aggatgcgta attgatggac cttttgcctt agataatgca 6420 atatcagaag aagcagctca tcataaaggt gttacaggat cagtagcagg taaagctgat 6480 atattattat taccaaatat agaagcagca aatgtaatgt ataaaacatt aacatatttc 6540 tctaaatcaa gaaatggtgg acttttagta ggtacatcag caccagtaat tttaacttca 6600 agagcagatt cattcgaaac taaagttaat tcaattgctc ttgcagcatt agttgcagca 6660 agaaataagt aataaatcaa tccataataa ttaatgcata attaatggag agatttatat 6720 ggaatttgca atgcactatt agattctata ataatttctt ctgaaaatta tgcattatga 6780 ctgtatagaa tgcattaaat ttaaggggga ttcagaatgt catataagct attaataatc 6840 aatccaggtt caacatcaac aaagattggt gtttacgaag gagaaaagga actatttgaa 6900 gaaactttga gacacacaaa tgaagaaata aagagatatg atacaatata tgatcaattt 6960 gaatttagaa aagaagttat attaaatgtt cttaaagaaa agaattttga tataaagact 7020 ctaagtgcta ttgttggtag aggtggaatg cttagaccag ttgaaggtgg aacatatgca 7080 gtaaatgatg caatggttga agatttaaaa gttggagttc aaggacctca tgcttctaac 7140 cttggcggaa taattgccaa gtcaattgga gatgaattaa atattccatc atttatagta 7200 gatccagttg ttacagatga gttagcagat gtagcaagac tatctggagt accagaacta 7260 ccaagaaaaa gtaaattcca tgctttaaat caaaaagcgg tagctaaaag atatggaaaa 7320 gaaagtggac aaggatatga aaacctaaat cttgtagttg tacatatggg tggaggcgtt 7380 tcagttggtg ctcacaatca tgggaaagtt gtcgatgtaa ataatgcatt agatggagat 7440 ggcccattct caccagaaag agctggatca gttccaattg gtgatttagt taaaatgtgt 7500 tttagtggaa aatatagtga agcagaagta tatggcaagg ctgtaggaaa aggtggattt 7560 gttggttatc taaacacaaa tgatgtaaaa ggtgttattg ataagatgga agaaggagat 7620 aaagaatgtg aatcaatata caaagcattt gtttatcaaa tttcaaaagc aatcggagaa 7680 atgtcagttg tattagaagg taaagttgat caaattattt ttaccggagg aattgcatac 7740 tcaccaacac ttgttccaga ccttaaagca aaagttgaat ggatagcccc agttacagtt 7800 tatcctggag aagatgaatt acttgctcta gctcaaggtg ctataagagt acttgatgga 7860 gaagaacaag ctaaggttta ctag 7884 <210> SEQ ID NO 154 <211> LENGTH: 436 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 1 <400> SEQUENCE: 154 Met Asn Asn Asp Asn Cys Thr Ile Lys Ile Thr Pro Glu Val Ser Arg 1 5 10 15 Val Asp Glu Pro Val Asp Ile Lys Ile Asn Gly Leu Pro Lys Asn Glu 20 25 30 Lys Val Ile Ile Arg Ala Val Ser Ser Asp Tyr Tyr Cys Ile Asn Ala 35 40 45 Ser Ile Leu Glu Ile Gly Asp Asn Thr Leu Trp Glu Ser Tyr Ala Val 50 55 60 Phe Glu Thr Asp Glu Cys Gly Asn Ile Asn Phe Glu Asn Ala Val Pro 65 70 75 80 Val Asp Gly Thr Tyr Ser Asn Cys Asp Lys Met Gly Leu Phe Tyr Ser 85 90 95 Met Arg Pro Lys Gln Ile Arg Lys Ser Lys Leu Ile Gln Lys Leu Ser 100 105 110 Ser Ile Asn Glu Asn Arg Lys Tyr Lys Ile Thr Phe Thr Val Glu Lys 115 120 125 Asn Gly Lys Ile Ile Gly Ser Lys Glu His Thr Arg Val Tyr Cys Asp 130 135 140 Asp Thr Ile Lys Ser Ile Asp Val Val Glu Lys Asn Leu Leu Ala Arg 145 150 155 160 Tyr Phe Thr Ser Lys Asp Asn Ile Lys His Pro Ala Ile Ile Val Leu 165 170 175 Ser Gly Ser Asp Gly Arg Ile Glu Lys Ala Gln Ala Ile Ala Glu Leu 180 185 190 Phe Ala Met Arg Gly Tyr Ser Ala Leu Ala Val Cys Tyr Phe Gly Leu 195 200 205 Glu Gly Thr Pro Glu Asp Leu Asn Met Ile Pro Leu Glu Tyr Val Glu 210 215 220 Asn Ala Val Lys Trp Leu Lys Arg Gln Asp Thr Val Asp Glu Asn Lys 225 230 235 240 Ile Ala Ile Tyr Gly Arg Ser Lys Gly Gly Glu Leu Val Leu Leu Ala 245 250 255 Ala Ser Met Phe Lys Asp Ile Ala Cys Val Ile Ala Asn Thr Pro Ser 260 265 270 Cys Tyr Val Tyr Glu Gly Ile Lys Ser Asn Lys Leu Pro Ser His His 275 280 285 Ser Ser Trp Met Tyr Arg Gly Arg Glu Ile Pro Tyr Leu Lys Phe Asn 290 295 300 Phe His Ile Ile Leu Arg Leu Ile Ile Lys Met Met Lys Lys Glu Lys 305 310 315 320 Gly Ala Leu Ala Trp Met Tyr Lys Lys Leu Ile Glu Glu Gly Asp Arg 325 330 335 Asp Lys Ala Thr Ile Ala Leu Asp Lys Ile Asn Gly Ser Val Leu Met 340 345 350 Ile Ser Ser Ala Ala Asp Glu Ile Trp Pro Ser Lys Met His Ser Glu 355 360 365 Thr Val Cys Ser Ile Phe Glu Lys Ser His Phe Lys His Glu Tyr Lys 370 375 380 His Ile Thr Phe Ala Lys Ser Gly His Ile Leu Thr Val Pro Phe Gln 385 390 395 400 Ser Ile Tyr Pro Ser Glu Lys Tyr Pro Tyr Asp Val Glu Ser Trp Ala 405 410 415 Lys Ala Asn Met Asp Ser Trp Asn Glu Thr Ile Lys Phe Leu Glu Lys 420 425 430 Trp Ala Ser Lys 435 <210> SEQ ID NO 155 <211> LENGTH: 60 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 2 <400> SEQUENCE: 155 Met Tyr Ile Asn Glu Thr Lys Val Val Val Arg Tyr Ala Glu Thr Asp 1 5 10 15 Lys Met Gly Ile Val His His Ser Asn Tyr Tyr Ile Tyr Phe Glu Glu 20 25 30 Ala Arg Thr Gln Phe Ile Lys Lys Thr Gly Ile Ser Tyr Ser Gln Met 35 40 45 Glu Lys Asp Gly Ile Met Phe Pro Leu Val Glu Ser 50 55 60 <210> SEQ ID NO 156 <211> LENGTH: 128 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 3 <400> SEQUENCE: 156 Met Asp Phe Ser Lys Leu Phe Lys Val Gly Ser Thr Tyr Val Ser Glu 1 5 10 15 Tyr Ile Val Lys Pro Glu Asp Thr Ala Asn Phe Ile Gly Asn Asn Gly 20 25 30 Val Val Met Leu Ser Thr Pro Ala Met Ile Lys Tyr Met Glu Tyr Thr 35 40 45 Thr Leu His Ile Val Asp Asn Val Ile Pro Lys Asn Tyr Arg Pro Val 50 55 60 Gly Thr Lys Ile Asp Val Glu His Ile Lys Pro Ile Pro Ala Asn Met 65 70 75 80 Lys Val Val Val Lys Val Thr Leu Ile Ser Ile Glu Gly Lys Lys Leu 85 90 95 Arg Tyr Asn Val Glu Ala Phe Asn Glu Lys Asn Cys Lys Val Gly Phe 100 105 110 Gly Ile Tyr Glu Gln Gln Ile Val Asn Leu Glu Gln Phe Leu Asn Arg 115 120 125

<210> SEQ ID NO 157 <211> LENGTH: 436 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 1 <400> SEQUENCE: 157 Met Asn Asn Asp Asn Cys Thr Ile Lys Ile Thr Pro Glu Val Ser Arg 1 5 10 15 Val Asp Glu Pro Val Asp Ile Lys Ile Asn Gly Leu Pro Lys Asn Glu 20 25 30 Lys Val Ile Ile Arg Ala Val Ser Ser Asp Tyr Tyr Cys Ile Asn Ala 35 40 45 Ser Ile Leu Glu Ile Gly Asp Asn Thr Leu Trp Glu Ser Tyr Ala Val 50 55 60 Phe Glu Thr Asp Glu Cys Gly Asn Ile Asn Phe Glu Asn Ala Val Pro 65 70 75 80 Val Asp Gly Thr Tyr Ser Asn Cys Asp Lys Met Gly Leu Phe Tyr Ser 85 90 95 Met Arg Pro Lys Gln Ile Arg Lys Ser Lys Leu Ile Gln Lys Leu Ser 100 105 110 Ser Ile Asn Glu Asn Arg Lys Tyr Lys Ile Thr Phe Thr Val Glu Lys 115 120 125 Asn Gly Lys Ile Ile Gly Ser Lys Glu His Thr Arg Val Tyr Cys Asp 130 135 140 Asp Thr Ile Lys Ser Ile Asp Val Val Glu Lys Asn Leu Leu Ala Arg 145 150 155 160 Tyr Phe Thr Ser Lys Asp Asn Ile Lys His Pro Ala Ile Ile Val Leu 165 170 175 Ser Gly Ser Asp Gly Arg Ile Glu Lys Ala Gln Ala Ile Ala Glu Leu 180 185 190 Phe Ala Met Arg Gly Tyr Ser Ala Leu Ala Val Cys Tyr Phe Gly Leu 195 200 205 Glu Gly Thr Pro Glu Asp Leu Asn Met Ile Pro Leu Glu Tyr Val Glu 210 215 220 Asn Ala Val Lys Trp Leu Lys Arg Gln Asp Thr Val Asp Glu Asn Lys 225 230 235 240 Ile Ala Ile Tyr Gly Arg Ser Lys Gly Gly Glu Leu Val Leu Leu Ala 245 250 255 Ala Ser Met Phe Lys Asp Ile Ala Cys Val Ile Ala Asn Thr Pro Ser 260 265 270 Cys Tyr Val Tyr Glu Gly Ile Lys Ser Asn Lys Leu Pro Ser His His 275 280 285 Ser Ser Trp Met Tyr Arg Gly Arg Glu Ile Pro Tyr Leu Lys Phe Asn 290 295 300 Phe His Ile Ile Leu Arg Leu Ile Ile Lys Met Met Lys Lys Glu Lys 305 310 315 320 Gly Ala Leu Ala Trp Met Tyr Lys Lys Leu Ile Glu Glu Gly Asp Arg 325 330 335 Asp Lys Ala Thr Ile Ala Leu Asp Lys Ile Asn Gly Ser Val Leu Met 340 345 350 Ile Ser Ser Ala Ala Asp Glu Ile Trp Pro Ser Lys Met His Ser Glu 355 360 365 Thr Val Cys Ser Ile Phe Glu Lys Ser His Phe Lys His Glu Tyr Lys 370 375 380 His Ile Thr Phe Ala Lys Ser Gly His Ile Leu Thr Val Pro Phe Gln 385 390 395 400 Ser Ile Tyr Pro Ser Glu Lys Tyr Pro Tyr Asp Val Glu Ser Trp Ala 405 410 415 Lys Ala Asn Met Asp Ser Trp Asn Glu Thr Ile Lys Phe Leu Glu Lys 420 425 430 Trp Ala Ser Lys 435 <210> SEQ ID NO 158 <211> LENGTH: 137 <212> TYPE: PRT <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 2 <400> SEQUENCE: 158 Met Tyr Ile Asn Glu Thr Lys Val Val Val Arg Tyr Ala Glu Thr Asp 1 5 10 15 Lys Met Gly Ile Val His His Ser Asn Tyr Tyr Ile Tyr Phe Glu Glu 20 25 30 Ala Arg Thr Gln Phe Ile Lys Lys Thr Gly Ile Ser Tyr Ser Gln Met 35 40 45 Glu Lys Asp Gly Ile Met Phe Pro Leu Val Glu Ser Asn Cys Arg Tyr 50 55 60 Leu Gln Gly Ala Lys Tyr Glu Asp Glu Leu Leu Ile Lys Thr Trp Ile 65 70 75 80 Lys Glu Leu Thr Pro Val Lys Ala Glu Phe Asn Tyr Ser Val Ile Arg 85 90 95 Glu Asn Asp Gln Lys Glu Ile Ala Lys Gly Ser Thr Leu His Ala Phe 100 105 110 Val Asn Asn Asn Phe Lys Ile Ile Asn Leu Lys Lys Asn His Thr Glu 115 120 125 Leu Phe Lys Lys Leu Gln Ser Leu Ile 130 135 <210> SEQ ID NO 159 <211> LENGTH: 128 <212> TYPE: PRT <213> ORGANISM: Clostridium ljungdahlii <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: thioesterase 3 <400> SEQUENCE: 159 Met Asp Phe Ser Lys Leu Phe Lys Val Gly Ser Thr Tyr Val Ser Glu 1 5 10 15 Tyr Ile Val Lys Pro Glu Asp Thr Ala Asn Phe Ile Gly Asn Asn Gly 20 25 30 Val Val Met Leu Ser Thr Pro Ala Met Ile Lys Tyr Met Glu Tyr Thr 35 40 45 Thr Leu His Ile Val Asp Asn Val Ile Pro Lys Asn Tyr Arg Pro Val 50 55 60 Gly Thr Lys Ile Asp Val Glu His Ile Lys Pro Ile Pro Ala Asn Met 65 70 75 80 Lys Val Val Val Lys Val Thr Leu Ile Ser Ile Glu Gly Lys Lys Leu 85 90 95 Arg Tyr Asn Val Glu Ala Phe Asn Glu Lys Asn Cys Lys Val Gly Phe 100 105 110 Gly Ile Tyr Glu Gln Gln Ile Val Asn Leu Glu Gln Phe Leu Asn Arg 115 120 125 <210> SEQ ID NO 160 <211> LENGTH: 11184 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL8225-pta-ack::ptb-buk, plasmid <400> SEQUENCE: 160 aaactccttt ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga 60 gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta 120 atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa 180 gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact 240 gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca 300 tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt 360 accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg 420 ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag 480 cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta 540 agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat 600 ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg 660 tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc 720 ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac 780 cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac cgagcgcagc 840 gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca gggccccctg cttcggggtc 900 attatagcga ttttttcggt atatccatcc tttttcgcac gatatacagg attttgccaa 960 agggttcgtg tagactttcc ttggtgtatc caacggcgtc agccgggcag gataggtgaa 1020 gtaggcccac ccgcgagcgg gtgttccttc ttcactgtcc cttattcgca cctggcggtg 1080 ctcaacggga atcctgctct gcgaggctgg ccggctaccg ccggcgtaac agatgagggc 1140 aagcggatgg ctgatgaaac caagccaacc aggaagggca gcccacctat caaggtgtac 1200 tgccttccag acgaacgaag agcgattgag gaaaaggcgg cggcggccgg catgagcctg 1260 tcggcctacc tgctggccgt cggccagggc tacaaaatca cgggcgtcgt ggactatgag 1320 cacgtccgcg agctggcccg catcaatggc gacctgggcc gcctgggcgg cctgctgaaa 1380 ctctggctca ccgacgaccc gcgcacggcg cggttcggtg atgccacgat cctcgccctg 1440 ctggcgaaga tcgaagagaa gcaggacgag cttggcaagg tcatgatggg cgtggtccgc 1500 ccgagggcag agccatgact tttttagccg ctaaaacggc cggggggtgc gcgtgattgc 1560 caagcacgtc cccatgcgct ccatcaagaa gagcgacttc gcggagctgg tgaagtacat 1620 caccgacgag caaggcaaga ccgatcgggc cccctgcagg ataaaaaaat tgtagataaa 1680 ttttataaaa tagttttatc tacaattttt ttatcaggaa acagctatga ccgcggccgc 1740 ggcgccaagc ttagaaaaat ataaataaga agtagcttta agagaattaa attattaaga 1800 aaagcaaagg tgtttaaaaa ataaattttt aaacaccttt gcttttctta aattataaat 1860 aagataaaaa agaatcctga ataaaataaa aaggggtgtc tcaaaatttt attttgagac 1920 gacccctttt tattctatat gtcgatgcta tagctgagat cgtggaattc ttgttagcta 1980 ccagattcac atttaagttg tttctctaaa ccacagatta tcaattcaag tccaaaaaga 2040 aatgctggtt ctgcgccttg atgatcaaat aactctattg cttgtcttaa caatggaggc 2100 attgaatctg ttgttggtgt ttctctttcc tcttttgcaa cttgatgttc ttgatcctcc 2160

aatacgcaac ctaaagtaaa atgtcctaca gcacttagtg cgtataaggc attttctaaa 2220 ctaaaaccct gttgacataa gaatgctaat tgattttcta atgtttcata ttgtttttca 2280 gttggtctag ttcctaaatg tactttagcc ccatctctat gtgataatag agcacaacga 2340 aaagatttag cgttattcct aagaaaatct tgccatgatt caccttctaa aggacaaaag 2400 tgagtgtgat gtctatctaa catttcaata gctaaggcgt caagtaaagc tctcttattc 2460 ttcacatgcc aatacaacgt aggttgttct actccaagtt tctgagctaa ctttcttgta 2520 gttagtcctt ctattccaac ttcatttagt aattccaatg cactattgat aactttactt 2580 ttatcaagtc tagacatcat ttaatatcct cctcttcaat atatttaagt cgactgatcg 2640 gatcctgatc ggagctccca tggcggccgg tcgatatcga tgtgtagtag cctgtgaaat 2700 aagtaaggaa aaaaaagaag taagtgttat atatgatgat tattttgtag atgtagatag 2760 gataatagaa tccatagaaa atataggtta tacagttata taaaaattac tttaaaatct 2820 atcattgata gggtaaaata taaatcgtat aaagttgtgt aatttttaag gaggtgtgtt 2880 acagacgtcc gcgagagacc ttaaatatat tgaagaggag gaaatacata tggtttcaag 2940 atatgttcca gatatgggag atttaatatg ggttgatttt gatccaacaa aaggatcaga 3000 acaagcagga catagaccag cagttgtttt atcaccattt atgtataata ataaaacagg 3060 aatgtgttta tgtgttccat gtacaacaca atcaaaagga tatccatttg aagttgtttt 3120 atcaggacaa gaaagagatg gagttgcatt agcagatcaa gttaaatcaa tagcatggag 3180 agcaagagga gcaacaaaaa aaggaacagt tgcaccagaa gaattacaat taataaaagc 3240 aaaaataaat gttttaatag gataatgtta ttaagctagc ataaaaataa gaagcctgca 3300 tttgcaggct tcttattttt atggcgcgcc gttctgaatc cttagctaat ggttcaacag 3360 gtaactatga cgaagatagc accctggata agtctgtaat ggattctaag gcatttaatg 3420 aagacgtgta tataaaatgt gctaatgaaa aagaaaatgc gttaaaagag cctaaaatga 3480 gttcaaatgg ttttgaaatt gattggtagt ttaatttaat atattttttc tattggctat 3540 ctcgatacct atagaatctt ctgttcactt ttgtttttga aatataaaaa ggggcttttt 3600 agcccctttt ttttaaaact ccggaggagt ttcttcattc ttgatactat acgtaactat 3660 tttcgatttg acttcattgt caattaagct agtaaaatca atggttaaaa aacaaaaaac 3720 ttgcattttt ctacctagta atttataatt ttaagtgtcg agtttaaaag tataatttac 3780 caggaaagga gcaagttttt taataaggaa aaatttttcc ttttaaaatt ctatttcgtt 3840 atatgactaa ttataatcaa aaaaatgaaa ataaacaaga ggtaaaaact gctttagaga 3900 aatgtactga taaaaaaaga aaaaatccta gatttacgtc atacatagca cctttaacta 3960 ctaagaaaaa tattgaaagg acttccactt gtggagatta tttgtttatg ttgagtgatg 4020 cagacttaga acattttaaa ttacataaag gtaatttttg cggtaataga ttttgtccaa 4080 tgtgtagttg gcgacttgct tgtaaggata gtttagaaat atctattctt atggagcatt 4140 taagaaaaga agaaaataaa gagtttatat ttttaactct tacaactcca aatgtaaaaa 4200 gttatgatct taattattct attaaacaat ataataaatc ttttaaaaaa ttaatggagc 4260 gtaaggaagt taaggatata actaaaggtt atataagaaa attagaagta acttaccaaa 4320 aggaaaaata cataacaaag gatttatgga aaataaaaaa agattattat caaaaaaaag 4380 gacttgaaat tggtgattta gaacctaatt ttgatactta taatcctcat tttcatgtag 4440 ttattgcagt taataaaagt tattttacag ataaaaatta ttatataaat cgagaaagat 4500 ggttggaatt atggaagttt gctactaagg atgattctat aactcaagtt gatgttagaa 4560 aagcaaaaat taatgattat aaagaggttt acgaacttgc gaaatattca gctaaagaca 4620 ctgattattt aatatcgagg ccagtatttg aaatttttta taaagcatta aaaggcaagc 4680 aggtattagt ttttagtgga ttttttaaag atgcacacaa attgtacaag caaggaaaac 4740 ttgatgttta taaaaagaaa gatgaaatta aatatgtcta tatagtttat tataattggt 4800 gcaaaaaaca atatgaaaaa actagaataa gggaacttac ggaagatgaa aaagaagaat 4860 taaatcaaga tttaatagat gaaatagaaa tagattaaag tgtaactata ctttatatat 4920 atatgattaa aaaaataaaa aacaacagcc tattaggttg ttgtttttta ttttctttat 4980 taattttttt aatttttagt ttttagttct tttttaaaat aagtttcagc ctctttttca 5040 atatttttta aagaaggagt atttgcatga attgcctttt ttctaacaga cttaggaaat 5100 attttaacag tatcttcttg cgccggtgat tttggaactt cataacttac taatttataa 5160 ttattatttt cttttttaat tgtaacagtt gcaaaagaag ctgaacctgt tccttcaact 5220 agtttatcat cttcaatata atattcttga cctatatagt ataaatatat ttttattata 5280 tttttacttt tttctgaatc tattatttta taatcataaa aagttttacc accaaaagaa 5340 ggttgtactc cttctggtcc aacatatttt tttactatat tatctaaata atttttggga 5400 actggtgttg taatttgatt aatcgaacaa ccagttatac ttaaaggaat tataactata 5460 aaaatatata ggattatctt tttaaatttc attattggcc tcctttttat taaatttatg 5520 ttaccataaa aaggacataa cgggaatatg tagaatattt ttaatgtaga caaaatttta 5580 cataaatata aagaaaggaa gtgtttgttt aaattttata gcaaactatc aaaaattagg 5640 gggataaaaa tttatgaaaa aaaggttttc gatgttattt ttatgtttaa ctttaatagt 5700 ttgtggttta tttacaaatt cggccggcca aagattgctc tatgtttaag ctattatatg 5760 aacttccaat tctttttatt gatatgggag taatattgct ttttattctt attaggtttt 5820 ttaaatattc tatacctaaa atattgtttg gagattgaag tatttcatct atattgtact 5880 ttgtaagaga acttttagta tttaatagaa aattatttaa agcactattt cgtgcagaag 5940 gataggacat accctgtgac attttttcct ttaaaaataa tttaaattgg gtaggctctt 6000 ctgcaagaat ttttgcaata gatttcagca agtttatatt actatattcg cttccaaaac 6060 aaagattttt tactacaccc aagttttcta agagacttac agcaccatag gcaaaaaatt 6120 cagcagaaga tagactgtag ataacaggaa gttcaaatac caggtctact ccatttagaa 6180 gtgccatttt ggttttagtc catttgtcaa ctatagatgg tgaacctctt tgcacgaagt 6240 taccactcat aactgctatt acagcatcac attttgtagc agaacgagca ctttcaatat 6300 gatatttatg tccattgtga aagggattat attcaactat tattccagtt acgttcatag 6360 aaattttcct ttctaaaata ttttattcca tgtcaagaac tctgtttatt tcattaaaga 6420 actataagta caaagtataa ggcatttgaa aaaataggct agtatattga ttgattattt 6480 attttaaaat gcctaagtga aatatataca tattataaca ataaaataag tattagtgta 6540 ggatttttaa atagagtatc tattttcaga ttaaattttt gattatttga tttacattat 6600 ataatattga gtaaagtatt gactagcaaa attttttgat actttaattt gtgaaatttc 6660 ttatcaaaag ttatattttt gaataatttt tattgaaaaa tacaactaaa aaggattata 6720 gtataagtgt gtgtaatttt gtgttaaatt taaagggagg aaatgaacat gaaattgatg 6780 agtaaaaact ttgatgagtt attatcaaga ttaaaggaag ttccaacaaa aaaagtggct 6840 gtagccgtag cacaagatga accagtatta gaggctataa aagaagctac agaaaataac 6900 atcgcacaag caatattggt tggtgataaa caacaaatcc atgaaatcgc aaagaaaata 6960 aacttggact tatctgatta tgaaataatg gatattaaag atccaaagaa agcaacatta 7020 gaagcagtaa aattagtttc tagtggtcat gcagatatgt taatgaaagg tctagttgat 7080 actgcaacat tcctaagaag cgtattaaac aaagaggttg gtcttagaac aggaaaatta 7140 atgtcccatg tagctgtgtt tgatgtggaa ggttgggata gactgttatt tttaactgat 7200 gcagcattta atacatatcc agaatttaag gataaagttg gaatgataaa taatgcagtt 7260 gtagttgctc atgcatgtgg aatagatgtt ccaagagtag cacctatatg cccagttgaa 7320 gttgtaaata caagtatgca atcaacagtt gatgcagcat tgttagctaa aatgagtgac 7380 agggggcaaa ttaaaggatg cgtaattgat ggaccttttg ccttagataa tgcaatatca 7440 gaagaagcag ctcatcataa aggtgttaca ggatcagtag caggtaaagc tgatatatta 7500 ttattaccaa atatagaagc agcaaatgta atgtataaaa cattaacata tttctctaaa 7560 tcaagaaatg gtggactttt agtaggtaca tcagcaccag taattttaac ttcaagagca 7620 gattcattcg aaactaaagt taattcaatt gctcttgcag cattagttgc agcaagaaat 7680 aagtaataaa tcaatccata ataattaatg cataattaat ggagagattt atatggaatt 7740 tgcaatgcac tattagattc tataataatt tcttctgaaa attatgcatt atgactgtat 7800 agaatgcatt aaatttaagg gggattcaga atgtcatata agctattaat aatcaatcca 7860 ggttcaacat caacaaagat tggtgtttac gaaggagaaa aggaactatt tgaagaaact 7920 ttgagacaca caaatgaaga aataaagaga tatgatacaa tatatgatca atttgaattt 7980 agaaaagaag ttatattaaa tgttcttaaa gaaaagaatt ttgatataaa gactctaagt 8040 gctattgttg gtagaggtgg aatgcttaga ccagttgaag gtggaacata tgcagtaaat 8100 gatgcaatgg ttgaagattt aaaagttgga gttcaaggac ctcatgcttc taaccttggc 8160 ggaataattg ccaagtcaat tggagatgaa ttaaatattc catcatttat agtagatcca 8220 gttgttacag atgagttagc agatgtagca agactatctg gagtaccaga actaccaaga 8280 aaaagtaaat tccatgcttt aaatcaaaaa gcggtagcta aaagatatgg aaaagaaagt 8340 ggacaaggat atgaaaacct aaatcttgta gttgtacata tgggtggagg cgtttcagtt 8400 ggtgctcaca atcatgggaa agttgtcgat gtaaataatg cattagatgg agatggccca 8460 ttctcaccag aaagagctgg atcagttcca attggtgatt tagttaaaat gtgttttagt 8520 ggaaaatata gtgaagcaga agtatatggc aaggctgtag gaaaaggtgg atttgttggt 8580 tatctaaaca caaatgatgt aaaaggtgtt attgataaga tggaagaagg agataaagaa 8640 tgtgaatcaa tatacaaagc atttgtttat caaatttcaa aagcaatcgg agaaatgtca 8700 gttgtattag aaggtaaagt tgatcaaatt atttttaccg gaggaattgc atactcacca 8760 acacttgttc cagaccttaa agcaaaagtt gaatggatag ccccagttac agtttatcct 8820 ggagaagatg aattacttgc tctagctcaa ggtgctataa gagtacttga tggagaagaa 8880 caagctaagg tttactagta ccgttcgtat aatgtatgct atacgaagtt atccttagaa 8940 gcaaacttaa gagtgtgttg atagtgcagt atcttaaaat tttgtgtata ataggaattg 9000 aagttaaatt agatgctaaa aatttgtaat taagaaggag ggattcgtca tgttggtatt 9060 ccaaatgcgt aatgtagata aaacatctac tgttttgaaa cagactaaaa acagtgatta 9120 cgcagataaa taaatacgtt agattaattc ctaccagtga ctaatcttat gactttttaa 9180 acagataact aaaattacaa acaaatcgtt taacttctgt atttatttac agatgtaatc 9240 acttcaggag taattacatg aacaaaaata taaaatattc tcaaaacttt ttaacgagtg 9300 aaaaagtact caaccaaata ataaaacaat tgaatttaaa agaaaccgat accgtttacg 9360 aaattggaac aggtaaaggg catttaacga cgaaactggc taaaataagt aaacaggtaa 9420 cgtctattga attagacagt catctattca acttatcgtc agaaaaatta aaactgaaca 9480 ttcgtgtcac tttaattcac caagatattc tacagtttca attccctaac aaacagaggt 9540 ataaaattgt tgggagtatt ccttaccatt taagcacaca aattattaaa aaagtggttt 9600 ttgaaagcca tgcgtctgac atctatctga ttgttgaaga aggattctac aagcgtacct 9660

tggatattca ccgaacacta gggttgctct tgcacactca agtctcgatt cagcaattgc 9720 ttaagctgcc agcggaatgc tttcatccta aaccaaaagt aaacagtgtc ttaataaaac 9780 ttacccgcca taccacagat gttccagata aatattggaa gctatatacg tactttgttt 9840 caaaatgggt caatcgagaa tatcgtcaac tgtttactaa aaatcagttt catcaagcaa 9900 tgaaacacgc caaagtaaac aatttaagta ccattactta tgagcaagta ttgtctattt 9960 ttaatagtta tctattattt aacgggagga aataattcta tgagtcgctt ttttaaattt 10020 ggaaagttac acgttactaa agggaatgga gataaattat tagatatact actgacagct 10080 tccaagaagc taaagaggtc ataacttcgt ataatgtatg ctatacgaac ggtagacttg 10140 acttttaatg ctcatctcta tataataggt tgtggctaat atatagaggt gagtgatatg 10200 aaattaaatg tatcagattt actaagtgaa gaagttgtta caaaggacat aaatgttaca 10260 gtagaagaaa agggattcta tgatggaagt gaatacataa agttattaga gcctctaaag 10320 tttagcggaa ctttaagtaa agaaggagat attcttctgt tggaaggaag aattaatact 10380 ttactagagc tcacttgttc acgatgtcta ggtaaattct cttatgctgt gaatgttgct 10440 attactgaaa aatttacaaa taataacaag gaaaataagg atgatgaagc catctttata 10500 gatagtaata tcattgatat tacggaaata atagaaaata acattatatt aattttacca 10560 attaagaggc tttgcagcga gaattgtaag gggttatgcc aacagtgcgg cactaactta 10620 aataattcta aatgtcagtg caaaagcgat gatattgatc cgagattggc aaagctaaaa 10680 gatatgtttt tcactgatta aggaggtgtt tactgtggga aatccagcca gcagaatatc 10740 aaaagcaaaa agagactcaa gaagagcaca gacttttaaa ttaggtttac caggtttagt 10800 tgagtgtcct cagtgccatg aaatgaaact tgcacataga gtttgtaaga attgtggata 10860 ttataagggt aaggaaatca tttcaactga aaataaataa aagaaagtca tttgactttc 10920 tttttttgtt catggggtct ataaaagtta gatcatatta agtaacaaaa ttaggtaaca 10980 aaggtccaga ttataggata ggatgtgaaa atatgataat tgctgtggat ggtatgggag 11040 gagattttgc accttgtgct gtagtggaag gtgtggtaga agcagttaaa aagcaaaacg 11100 taaatataat aataaccggc caaaaagagc aaattgaaaa tgaattagct aaatataatt 11160 atcctaagga caaaatagat attt 11184 <210> SEQ ID NO 161 <211> LENGTH: 42 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN22f <400> SEQUENCE: 161 tttacaaatt cggccggcca aagattgctc tatgtttaag ct 42 <210> SEQ ID NO 162 <211> LENGTH: 43 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN23r <400> SEQUENCE: 162 catcaaagtt tttactcatc aatttcatgt tcatttcctc cct 43 <210> SEQ ID NO 163 <211> LENGTH: 46 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN24f <400> SEQUENCE: 163 agggaggaaa tgaacatgaa attgatgagt aaaaactttg atgagt 46 <210> SEQ ID NO 164 <211> LENGTH: 50 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN25r <400> SEQUENCE: 164 gtatagcata cattatacga acggtactag taaaccttag cttgttcttc 50 <210> SEQ ID NO 165 <211> LENGTH: 50 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN26f <400> SEQUENCE: 165 gaagaacaag ctaaggttta ctagtaccgt tcgtataatg tatgctatac 50 <210> SEQ ID NO 166 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN27r <400> SEQUENCE: 166 agagatgagc attaaaagtc aagtctaccg ttcgtatagc ataca 45 <210> SEQ ID NO 167 <211> LENGTH: 45 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN28f <400> SEQUENCE: 167 tgtatgctat acgaacggta gacttgactt ttaatgctca tctct 45 <210> SEQ ID NO 168 <211> LENGTH: 47 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN29r <400> SEQUENCE: 168 catgagatta tcaaaaagga gtttaaatat ctattttgtc cttagga 47 <210> SEQ ID NO 169 <211> LENGTH: 47 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN30f <400> SEQUENCE: 169 tcctaaggac aaaatagata tttaaactcc tttttgataa tctcatg 47 <210> SEQ ID NO 170 <211> LENGTH: 42 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: SN31r <400> SEQUENCE: 170 agcttaaaca tagagcaatc tttggccggc cgaatttgta aa 42 <210> SEQ ID NO 171 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Og29f <400> SEQUENCE: 171 agccacatcc agtagattga acttt 25 <210> SEQ ID NO 172 <211> LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Og30r <400> SEQUENCE: 172 aattcgccct acgattaaag tggaa 25 <210> SEQ ID NO 173 <211> LENGTH: 35 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Pfdx-F1, forward <400> SEQUENCE: 173 aaaggtctcc ggccgcgctc actatctgcg gaacc 35 <210> SEQ ID NO 174 <211> LENGTH: 38 <212> TYPE: DNA

<213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: Pfdx-R1, reverse <400> SEQUENCE: 174 tttggtctcg aattctgtaa cacctcctta atttttag 38 <210> SEQ ID NO 175 <211> LENGTH: 52 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: aor1-F1, forward <400> SEQUENCE: 175 aaaggtctcg aattcaaaga tctatgtatg gttatgatgg taaagtatta ag 52 <210> SEQ ID NO 176 <211> LENGTH: 54 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: aor1-R1, reverse <400> SEQUENCE: 176 tttggtctcc tcgagtatgg atccctagaa cttacctata tattcatcta atcc 54 <210> SEQ ID NO 177 <211> LENGTH: 37 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-pta-ack - ack-DuetI2-R1 <400> SEQUENCE: 177 gggtacctta tttattttca actatttctt ttgtatc 37 <210> SEQ ID NO 178 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-pta-ack - DuetI2-ack-F1 <400> SEQUENCE: 178 ttgaaaataa ataaggtacc ctcgagtctg gtaaag 36 <210> SEQ ID NO 179 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-pta-ack - DuetI2-pta-R1 <400> SEQUENCE: 179 ttttttccat atgtatatct ccttcttata cttaac 36 <210> SEQ ID NO 180 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-pta-ack - pta-DuetI2-F1 <400> SEQUENCE: 180 aggagatata catatggaaa aaatttggag taaggc 36 <210> SEQ ID NO 181 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-tesB - DuetI2-tesB-F1 <400> SEQUENCE: 181 gaaatcataa ttaaggtacc ctcgagtctg gtaaag 36 <210> SEQ ID NO 182 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-tesB - DuetI2-tesB-R1 <400> SEQUENCE: 182 cctgactcat atgtatatct ccttcttata cttaac 36 <210> SEQ ID NO 183 <211> LENGTH: 36 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-tesB - tesB-DuetI2-F1 <400> SEQUENCE: 183 aagaaggaga tatacatatg agtcaggcac ttaaaa 36 <210> SEQ ID NO 184 <211> LENGTH: 35 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pETDuet-tesB - testB-DuetI2-R1 <400> SEQUENCE: 184 agggtacctt aattatgatt tctcataaca ccttc 35 <210> SEQ ID NO 185 <211> LENGTH: 7606 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pDUET-pta-ack, plasmid <400> SEQUENCE: 185 ggggaattgt gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag 60 gagatatacc atgggcagca gccatcacca tcatcaccac agccaggatc cgaattcgag 120 ctcggcgcgc ctgcaggtcg acaagcttgc ggccgcataa tgcttaagtc gaacagaaag 180 taatcgtatt gtacacggcc gcataatcga aattaatacg actcactata ggggaattgt 240 gagcggataa caattcccca tcttagtata ttagttaagt ataagaagga gatatacata 300 tggaaaaaat ttggagtaag gcaaaggaag acaaaaaaaa gattgtctta gctgaaggag 360 aagaagaaag aactcttcaa gcttgtgaaa aaataattaa agagggtatt gcaaatttaa 420 tccttgtagg gaatgaaaag gtaataaaag aaaaagcgtc aaaattaggt gtaagtttaa 480 atggagcaga aatagtagat ccagagattt cagataaact aaaggcatat gcagatgctt 540 tttatgaatt gagaaagaag aagggaataa cgccagaaaa agcggataaa atagtaagag 600 atccaatata ctttgctaca atgatggtta aacttggaga tgcagatgga ttggtttcag 660 gtgcggttca tactacaggc gatcttttga gaccaggact tcaaatagta aagacagctc 720 caggtacatc agtagtttcc agtacattta taatggaagt accaaattgt gagtatggtg 780 acaatggtgt acttctattt gctgattgtg ctgtaaatcc atgcccagat agtgatcaat 840 tggcttcaat tgcaataagt acagcagaaa ctgcaaagaa cttatgtgga atggatccaa 900 aagtagcaat gctttcattt tctactaagg gaagtgcaaa acacgaatta gtagacaaag 960 ttagaaatgc tgtagagatt gcaaaaaaag ctaaaccaga tttaagttta gacggagaat 1020 tacaattaga tgcctctatc gtagaaaagg ttgcaagttt aaaggctcct ggaagtgaag 1080 tagcaggaaa agcaaatgta cttgtatttc cagatctcca agcaggaaat ataggctata 1140 aactcgttca aagatttgca aaagcagatg ctataggacc tgtatgccaa ggatttgcaa 1200 aacctataaa tgatttgtca agaggatgta attctgatga tatagtaaat gtagtagctg 1260 taacagcagt tcaagcacaa gctcaaaagt aataacaaaa agcataaatg attcattttt 1320 aggaggaata ttaaacatga aaatattagt agtaaactgt ggaagttcat ctttaaaata 1380 tcaacttatt gatatgcaag atgaaagtgt tgtagcaaag ggtcttgtag aaagaatagg 1440 aatggacggt tcaattttaa cacacaaagt taatggagaa aagtttgtta cagagcaacc 1500 aatggaagac cacaaagttg ctatacaatt agtattaaat gctcttgtag ataaaaaaca 1560 tggtgtaata aaagacatgt cagaaatatc cgctgtagga catagagttt tgcacggtgg 1620 aaagaaatat gcagcatcca ttcttattga cgaaaatgta atgaaagcaa tagaagaatg 1680 tatcccacta ggaccactac ataatccagc taatataatg ggaatagatg cttgtaaaaa 1740 attaatgcca aatactccaa tggtagcagt atttgataca gcatttcatc agacaatgcc 1800 agattatgct tatacttatg caatacctta tgatatatct gaaaagtatg atatcagaaa 1860 atatggtttt catggaactt ctcatagatt cgtttcaatt gaagcagcta aattattaaa 1920 gaaagatcca aaagatctta agttaataac ttgtcattta ggaaatggag ctagcatatg 1980 tgcagtaaac caaggaaaag cagtagatac aactatggga cttactcctc ttgcaggact 2040 tgtaatggga actagatgcg gtgatataga tccagctata gtaccatttg taatgaaaag 2100 aacaggcatg tctgtagatg aagtggatac cttaatgaat aaaaagtcag gaatacttgg 2160 agtatcagga gtaagcagtg attttagaga tgtagaagaa gctgcaaatt caggaaatga 2220 tagagcaaaa cttgcattaa atatgtatta tcacaaagtt aaatctttca taggagctta 2280 tgttgcagtt ttaaatggag cagatgctat aatatttacg gcaggacttg gagaaaattc 2340

agcaactagc agatctgcta tatgtaatgg attaagctat tttggaatta aaatagatga 2400 agaaaagaat aagaaaaggg gagaggcact agaaataagc acacctgatt caaagataaa 2460 agtattagta attcctacaa atgaagaact tatgatagct agggatacaa aagaaatagt 2520 tgaaaataaa taaggtaccc tcgagtctgg taaagaaacc gctgctgcga aatttgaacg 2580 ccagcacatg gactcgtcta ctagcgcagc ttaattaacc taggctgctg ccaccgctga 2640 gcaataacta gcataacccc ttggggcctc taaacgggtc ttgaggggtt ttttgctgaa 2700 aggaggaact atatccggat tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg 2760 gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct 2820 cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta 2880 aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa 2940 cttgattagg gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct 3000 ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc 3060 aaccctatct cggtctattc ttttgattta taagggattt tgccgatttc ggcctattgg 3120 ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt 3180 acaatttctg gcggcacgat ggcatgagat tatcaaaaag gatcttcacc tagatccttt 3240 taaattaaaa atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca 3300 gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca 3360 tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc 3420 ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta tcagcaataa 3480 accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 3540 agtctattaa ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca 3600 acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat 3660 tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag 3720 cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac 3780 tcatggttat ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt 3840 ctgtgactgg tgagtactca accaagtcat tctgagaata gtgtatgcgg cgaccgagtt 3900 gctcttgccc ggcgtcaata cgggataata ccgcgccaca tagcagaact ttaaaagtgc 3960 tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat 4020 ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca 4080 gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 4140 cacggaaatg ttgaatactc atactcttcc tttttcaatc atgattgaag catttatcag 4200 ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggt 4260 catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc ccgtagaaaa 4320 gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct tgcaaacaaa 4380 aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa ctctttttcc 4440 gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag tgtagccgta 4500 gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc tgctaatcct 4560 gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg actcaagacg 4620 atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca cacagcccag 4680 cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat gagaaagcgc 4740 cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg tcggaacagg 4800 agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc ctgtcgggtt 4860 tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc ggagcctatg 4920 gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc cttttgctca 4980 catgttcttt cctgcgttat cccctgattc tgtggataac cgtattaccg cctttgagtg 5040 agctgatacc gctcgccgca gccgaacgac cgagcgcagc gagtcagtga gcgaggaagc 5100 ggaagagcgc ctgatgcggt attttctcct tacgcatctg tgcggtattt cacaccgcat 5160 atatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gtatacactc 5220 cgctatcgct acgtgactgg gtcatggctg cgccccgaca cccgccaaca cccgctgacg 5280 cgccctgacg ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg 5340 ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa acgcgcgagg cagctgcggt 5400 aaagctcatc agcgtggtcg tgaagcgatt cacagatgtc tgcctgttca tccgcgtcca 5460 gctcgttgag tttctccaga agcgttaatg tctggcttct gataaagcgg gccatgttaa 5520 gggcggtttt ttcctgtttg gtcactgatg cctccgtgta agggggattt ctgttcatgg 5580 gggtaatgat accgatgaaa cgagagagga tgctcacgat acgggttact gatgatgaac 5640 atgcccggtt actggaacgt tgtgagggta aacaactggc ggtatggatg cggcgggacc 5700 agagaaaaat cactcagggt caatgccagc gcttcgttaa tacagatgta ggtgttccac 5760 agggtagcca gcagcatcct gcgatgcaga tccggaacat aatggtgcag ggcgctgact 5820 tccgcgtttc cagactttac gaaacacgga aaccgaagac cattcatgtt gttgctcagg 5880 tcgcagacgt tttgcagcag cagtcgcttc acgttcgctc gcgtatcggt gattcattct 5940 gctaaccagt aaggcaaccc cgccagccta gccgggtcct caacgacagg agcacgatca 6000 tgctagtcat gccccgcgcc caccggaagg agctgactgg gttgaaggct ctcaagggca 6060 tcggtcgaga tcccggtgcc taatgagtga gctaacttac attaattgcg ttgcgctcac 6120 tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 6180 cggggagagg cggtttgcgt attgggcgcc agggtggttt ttcttttcac cagtgagacg 6240 ggcaacagct gattgccctt caccgcctgg ccctgagaga gttgcagcaa gcggtccacg 6300 ctggtttgcc ccagcaggcg aaaatcctgt ttgatggtgg ttaacggcgg gatataacat 6360 gagctgtctt cggtatcgtc gtatcccact accgagatgt ccgcaccaac gcgcagcccg 6420 gactcggtaa tggcgcgcat tgcgcccagc gccatctgat cgttggcaac cagcatcgca 6480 gtgggaacga tgccctcatt cagcatttgc atggtttgtt gaaaaccgga catggcactc 6540 cagtcgcctt cccgttccgc tatcggctga atttgattgc gagtgagata tttatgccag 6600 ccagccagac gcagacgcgc cgagacagaa cttaatgggc ccgctaacag cgcgatttgc 6660 tggtgaccca atgcgaccag atgctccacg cccagtcgcg taccgtcttc atgggagaaa 6720 ataatactgt tgatgggtgt ctggtcagag acatcaagaa ataacgccgg aacattagtg 6780 caggcagctt ccacagcaat ggcatcctgg tcatccagcg gatagttaat gatcagccca 6840 ctgacgcgtt gcgcgagaag attgtgcacc gccgctttac aggcttcgac gccgcttcgt 6900 tctaccatcg acaccaccac gctggcaccc agttgatcgg cgcgagattt aatcgccgcg 6960 acaatttgcg acggcgcgtg cagggccaga ctggaggtgg caacgccaat cagcaacgac 7020 tgtttgcccg ccagttgttg tgccacgcgg ttgggaatgt aattcagctc cgccatcgcc 7080 gcttccactt tttcccgcgt tttcgcagaa acgtggctgg cctggttcac cacgcgggaa 7140 acggtctgat aagagacacc ggcatactct gcgacatcgt ataacgttac tggtttcaca 7200 ttcaccaccc tgaattgact ctcttccggg cgctatcatg ccataccgcg aaaggttttg 7260 cgccattcga tggtgtccgg gatctcgacg ctctccctta tgcgactcct gcattaggaa 7320 gcagcccagt agtaggttga ggccgttgag caccgccgcc gcaaggaatg gtgcatgcaa 7380 ggagatggcg cccaacagtc ccccggccac ggggcctgcc accataccca cgccgaaaca 7440 agcgctcatg agcccgaagt ggcgagcccg atcttcccca tcggtgatgt cggcgatata 7500 ggcgccagca accgcacctg tggcgccggt gatgccggcc acgatgcgtc cggcgtagag 7560 gatcgagatc gatctcgatc ccgcgaaatt aatacgactc actata 7606 <210> SEQ ID NO 186 <211> LENGTH: 7492 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pDUET-ptb-buk, plasmid <400> SEQUENCE: 186 ggggaattgt gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag 60 gagatatacc atgggcagca gccatcacca tcatcaccac agccaggatc cgaattcgag 120 ctcggcgcgc ctgcaggtcg acaagcttgc ggccgcataa tgcttaagtc gaacagaaag 180 taatcgtatt gtacacggcc gcataatcga aattaatacg actcactata ggggaattgt 240 gagcggataa caattcccca tcttagtata ttagttaagt ataagaagga gatatacata 300 tgagtaaaaa ctttgatgag ttattatcaa gattaaagga agttccaaca aaaaaagtgg 360 ctgtagccgt agcacaagat gaaccagtat tagaggctat aaaagaagct acagaaaata 420 acatcgcaca agcaatattg gttggtgata aacaacaaat ccatgaaatc gcaaagaaaa 480 taaacttgga cttatctgat tatgaaataa tggatattaa agatccaaag aaagcaacat 540 tagaagcagt aaaattagtt tctagtggtc atgcagatat gttaatgaaa ggtctagttg 600 atactgcaac attcctaaga agcgtattaa acaaagaggt tggtcttaga acaggaaaat 660 taatgtccca tgtagctgtg tttgatgtgg aaggttggga tagactgtta tttttaactg 720 atgcagcatt taatacatat ccagaattta aggataaagt tggaatgata aataatgcag 780 ttgtagttgc tcatgcatgt ggaatagatg ttccaagagt agcacctata tgcccagttg 840 aagttgtaaa tacaagtatg caatcaacag ttgatgcagc attgttagct aaaatgagtg 900 acagggggca aattaaagga tgcgtaattg atggaccttt tgccttagat aatgcaatat 960 cagaagaagc agctcatcat aaaggtgtta caggatcagt agcaggtaaa gctgatatat 1020 tattattacc aaatatagaa gcagcaaatg taatgtataa aacattaaca tatttctcta 1080 aatcaagaaa tggtggactt ttagtaggta catcagcacc agtaatttta acttcaagag 1140 cagattcatt cgaaactaaa gttaattcaa ttgctcttgc agcattagtt gcagcaagaa 1200 ataagtaata aatcaatcca taataattaa tgcataatta atggagagat ttatatggaa 1260 tttgcaatgc actattagat tctataataa tttcttctga aaattatgca ttatgactgt 1320 atagaatgca ttaaatttaa gggggattca gaatgtcata taagctatta ataatcaatc 1380 caggttcaac atcaacaaag attggtgttt acgaaggaga aaaggaacta tttgaagaaa 1440 ctttgagaca cacaaatgaa gaaataaaga gatatgatac aatatatgat caatttgaat 1500 ttagaaaaga agttatatta aatgttctta aagaaaagaa ttttgatata aagactctaa 1560 gtgctattgt tggtagaggt ggaatgctta gaccagttga aggtggaaca tatgcagtaa 1620 atgatgcaat ggttgaagat ttaaaagttg gagttcaagg acctcatgct tctaaccttg 1680 gcggaataat tgccaagtca attggagatg aattaaatat tccatcattt atagtagatc 1740 cagttgttac agatgagtta gcagatgtag caagactatc tggagtacca gaactaccaa 1800 gaaaaagtaa attccatgct ttaaatcaaa aagcggtagc taaaagatat ggaaaagaaa 1860

gtggacaagg atatgaaaac ctaaatcttg tagttgtaca tatgggtgga ggcgtttcag 1920 ttggtgctca caatcatggg aaagttgtcg atgtaaataa tgcattagat ggagatggcc 1980 cattctcacc agaaagagct ggatcagttc caattggtga tttagttaaa atgtgtttta 2040 gtggaaaata tagtgaagca gaagtatatg gcaaggctgt aggaaaaggt ggatttgttg 2100 gttatctaaa cacaaatgat gtaaaaggtg ttattgataa gatggaagaa ggagataaag 2160 aatgtgaatc aatatacaaa gcatttgttt atcaaatttc aaaagcaatc ggagaaatgt 2220 cagttgtatt agaaggtaaa gttgatcaaa ttatttttac cggaggaatt gcatactcac 2280 caacacttgt tccagacctt aaagcaaaag ttgaatggat agccccagtt acagtttatc 2340 ctggagaaga tgaattactt gctctagctc aaggtgctat aagagtactt gatggagaag 2400 aacaagctaa ggtttactag gtaccctcga gtctggtaaa gaaaccgctg ctgcgaaatt 2460 tgaacgccag cacatggact cgtctactag cgcagcttaa ttaacctagg ctgctgccac 2520 cgctgagcaa taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt 2580 gctgaaagga ggaactatat ccggattggc gaatgggacg cgccctgtag cggcgcatta 2640 agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag cgccctagcg 2700 cccgctcctt tcgctttctt cccttccttt ctcgccacgt tcgccggctt tccccgtcaa 2760 gctctaaatc gggggctccc tttagggttc cgatttagtg ctttacggca cctcgacccc 2820 aaaaaacttg attagggtga tggttcacgt agtgggccat cgccctgata gacggttttt 2880 cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca aactggaaca 2940 acactcaacc ctatctcggt ctattctttt gatttataag ggattttgcc gatttcggcc 3000 tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattttaa caaaatatta 3060 acgtttacaa tttctggcgg cacgatggca tgagattatc aaaaaggatc ttcacctaga 3120 tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt 3180 ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt 3240 catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat 3300 ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag 3360 caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct 3420 ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt 3480 tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg 3540 cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca 3600 aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt 3660 tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat 3720 gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac 3780 cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa 3840 aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt 3900 tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt 3960 tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa 4020 gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatcatga ttgaagcatt 4080 tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 4140 ataggtcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 4200 agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 4260 aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 4320 ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 4380 gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 4440 aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 4500 aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 4560 gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 4620 aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 4680 aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 4740 cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 4800 cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 4860 tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 4920 tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 4980 ggaagcggaa gagcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca 5040 ccgcatatat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagtat 5100 acactccgct atcgctacgt gactgggtca tggctgcgcc ccgacacccg ccaacacccg 5160 ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg 5220 tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgaggcagc 5280 tgcggtaaag ctcatcagcg tggtcgtgaa gcgattcaca gatgtctgcc tgttcatccg 5340 cgtccagctc gttgagtttc tccagaagcg ttaatgtctg gcttctgata aagcgggcca 5400 tgttaagggc ggttttttcc tgtttggtca ctgatgcctc cgtgtaaggg ggatttctgt 5460 tcatgggggt aatgataccg atgaaacgag agaggatgct cacgatacgg gttactgatg 5520 atgaacatgc ccggttactg gaacgttgtg agggtaaaca actggcggta tggatgcggc 5580 gggaccagag aaaaatcact cagggtcaat gccagcgctt cgttaataca gatgtaggtg 5640 ttccacaggg tagccagcag catcctgcga tgcagatccg gaacataatg gtgcagggcg 5700 ctgacttccg cgtttccaga ctttacgaaa cacggaaacc gaagaccatt catgttgttg 5760 ctcaggtcgc agacgttttg cagcagcagt cgcttcacgt tcgctcgcgt atcggtgatt 5820 cattctgcta accagtaagg caaccccgcc agcctagccg ggtcctcaac gacaggagca 5880 cgatcatgct agtcatgccc cgcgcccacc ggaaggagct gactgggttg aaggctctca 5940 agggcatcgg tcgagatccc ggtgcctaat gagtgagcta acttacatta attgcgttgc 6000 gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc 6060 aacgcgcggg gagaggcggt ttgcgtattg ggcgccaggg tggtttttct tttcaccagt 6120 gagacgggca acagctgatt gcccttcacc gcctggccct gagagagttg cagcaagcgg 6180 tccacgctgg tttgccccag caggcgaaaa tcctgtttga tggtggttaa cggcgggata 6240 taacatgagc tgtcttcggt atcgtcgtat cccactaccg agatgtccgc accaacgcgc 6300 agcccggact cggtaatggc gcgcattgcg cccagcgcca tctgatcgtt ggcaaccagc 6360 atcgcagtgg gaacgatgcc ctcattcagc atttgcatgg tttgttgaaa accggacatg 6420 gcactccagt cgccttcccg ttccgctatc ggctgaattt gattgcgagt gagatattta 6480 tgccagccag ccagacgcag acgcgccgag acagaactta atgggcccgc taacagcgcg 6540 atttgctggt gacccaatgc gaccagatgc tccacgccca gtcgcgtacc gtcttcatgg 6600 gagaaaataa tactgttgat gggtgtctgg tcagagacat caagaaataa cgccggaaca 6660 ttagtgcagg cagcttccac agcaatggca tcctggtcat ccagcggata gttaatgatc 6720 agcccactga cgcgttgcgc gagaagattg tgcaccgccg ctttacaggc ttcgacgccg 6780 cttcgttcta ccatcgacac caccacgctg gcacccagtt gatcggcgcg agatttaatc 6840 gccgcgacaa tttgcgacgg cgcgtgcagg gccagactgg aggtggcaac gccaatcagc 6900 aacgactgtt tgcccgccag ttgttgtgcc acgcggttgg gaatgtaatt cagctccgcc 6960 atcgccgctt ccactttttc ccgcgttttc gcagaaacgt ggctggcctg gttcaccacg 7020 cgggaaacgg tctgataaga gacaccggca tactctgcga catcgtataa cgttactggt 7080 ttcacattca ccaccctgaa ttgactctct tccgggcgct atcatgccat accgcgaaag 7140 gttttgcgcc attcgatggt gtccgggatc tcgacgctct cccttatgcg actcctgcat 7200 taggaagcag cccagtagta ggttgaggcc gttgagcacc gccgccgcaa ggaatggtgc 7260 atgcaaggag atggcgccca acagtccccc ggccacgggg cctgccacca tacccacgcc 7320 gaaacaagcg ctcatgagcc cgaagtggcg agcccgatct tccccatcgg tgatgtcggc 7380 gatataggcg ccagcaaccg cacctgtggc gccggtgatg ccggccacga tgcgtccggc 7440 gtagaggatc gagatcgatc tcgatcccgc gaaattaata cgactcacta ta 7492 <210> SEQ ID NO 187 <211> LENGTH: 6233 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pDUET-tesB, plasmid <400> SEQUENCE: 187 ggggaattgt gagcggataa caattcccct ctagaaataa ttttgtttaa ctttaagaag 60 gagatatacc atgggcagca gccatcacca tcatcaccac agccaggatc cgaattcgag 120 ctcggcgcgc ctgcaggtcg acaagcttgc ggccgcataa tgcttaagtc gaacagaaag 180 taatcgtatt gtacacggcc gcataatcga aattaatacg actcactata ggggaattgt 240 gagcggataa caattcccca tcttagtata ttagttaagt ataagaagga gatatacata 300 tgagtcaggc acttaaaaat ttacttactt tacttaatct tgaaaaaata gaagaaggtt 360 tatttagagg acagtcagaa gatttaggat taagacaagt atttggaggt caagtagttg 420 gtcaggcact ttatgcagct aaagaaactg tacctgaaga aagacttgtt catagttttc 480 attcttattt tcttagacct ggagattcta aaaaaccaat tatatatgat gtagaaactc 540 ttagagatgg aaattcattt agtgcaagaa gagttgcagc tattcaaaat ggtaaaccta 600 tattttacat gacagcttct tttcaagcac cagaagctgg atttgaacat cagaaaacta 660 tgccttcagc acctgctcca gatggattac catcagaaac acaaatagca cagagtttag 720 ctcatttact tcctccagta cttaaagata aatttatttg tgatagacct ttagaagtta 780 gaccagttga atttcataat cctcttaaag gacatgtagc agaaccacat agacaagttt 840 ggataagagc taatggaagt gtaccagatg atcttagagt tcatcagtat cttcttggtt 900 atgcatctga tttaaatttt cttcctgtag ctttacaacc acatggaata ggttttcttg 960 aacctggaat acagatagca actatagatc attcaatgtg gtttcataga ccatttaatc 1020 ttaatgaatg gcttctttat agtgtagaat ctacatcagc aagttctgct agaggatttg 1080 ttaggggtga attttatact caagatggag tacttgttgc tagtacagta caggaaggtg 1140 ttatgagaaa tcataattaa ggtaccctcg agtctggtaa agaaaccgct gctgcgaaat 1200 ttgaacgcca gcacatggac tcgtctacta gcgcagctta attaacctag gctgctgcca 1260 ccgctgagca ataactagca taaccccttg gggcctctaa acgggtcttg aggggttttt 1320 tgctgaaagg aggaactata tccggattgg cgaatgggac gcgccctgta gcggcgcatt 1380 aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc 1440 gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca 1500

agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc 1560 caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat agacggtttt 1620 tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac 1680 aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgc cgatttcggc 1740 ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt 1800 aacgtttaca atttctggcg gcacgatggc atgagattat caaaaaggat cttcacctag 1860 atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 1920 tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt 1980 tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca 2040 tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca 2100 gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc 2160 tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt 2220 ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg 2280 gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc 2340 aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg 2400 ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga 2460 tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga 2520 ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta 2580 aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg 2640 ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact 2700 ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata 2760 agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatcatg attgaagcat 2820 ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca 2880 aataggtcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 2940 tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 3000 aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 3060 tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 3120 agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 3180 taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 3240 caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 3300 agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 3360 aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 3420 gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 3480 tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 3540 gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 3600 ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 3660 ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 3720 aggaagcgga agagcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac 3780 accgcatata tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagta 3840 tacactccgc tatcgctacg tgactgggtc atggctgcgc cccgacaccc gccaacaccc 3900 gctgacgcgc cctgacgggc ttgtctgctc ccggcatccg cttacagaca agctgtgacc 3960 gtctccggga gctgcatgtg tcagaggttt tcaccgtcat caccgaaacg cgcgaggcag 4020 ctgcggtaaa gctcatcagc gtggtcgtga agcgattcac agatgtctgc ctgttcatcc 4080 gcgtccagct cgttgagttt ctccagaagc gttaatgtct ggcttctgat aaagcgggcc 4140 atgttaaggg cggttttttc ctgtttggtc actgatgcct ccgtgtaagg gggatttctg 4200 ttcatggggg taatgatacc gatgaaacga gagaggatgc tcacgatacg ggttactgat 4260 gatgaacatg cccggttact ggaacgttgt gagggtaaac aactggcggt atggatgcgg 4320 cgggaccaga gaaaaatcac tcagggtcaa tgccagcgct tcgttaatac agatgtaggt 4380 gttccacagg gtagccagca gcatcctgcg atgcagatcc ggaacataat ggtgcagggc 4440 gctgacttcc gcgtttccag actttacgaa acacggaaac cgaagaccat tcatgttgtt 4500 gctcaggtcg cagacgtttt gcagcagcag tcgcttcacg ttcgctcgcg tatcggtgat 4560 tcattctgct aaccagtaag gcaaccccgc cagcctagcc gggtcctcaa cgacaggagc 4620 acgatcatgc tagtcatgcc ccgcgcccac cggaaggagc tgactgggtt gaaggctctc 4680 aagggcatcg gtcgagatcc cggtgcctaa tgagtgagct aacttacatt aattgcgttg 4740 cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc 4800 caacgcgcgg ggagaggcgg tttgcgtatt gggcgccagg gtggtttttc ttttcaccag 4860 tgagacgggc aacagctgat tgcccttcac cgcctggccc tgagagagtt gcagcaagcg 4920 gtccacgctg gtttgcccca gcaggcgaaa atcctgtttg atggtggtta acggcgggat 4980 ataacatgag ctgtcttcgg tatcgtcgta tcccactacc gagatgtccg caccaacgcg 5040 cagcccggac tcggtaatgg cgcgcattgc gcccagcgcc atctgatcgt tggcaaccag 5100 catcgcagtg ggaacgatgc cctcattcag catttgcatg gtttgttgaa aaccggacat 5160 ggcactccag tcgccttccc gttccgctat cggctgaatt tgattgcgag tgagatattt 5220 atgccagcca gccagacgca gacgcgccga gacagaactt aatgggcccg ctaacagcgc 5280 gatttgctgg tgacccaatg cgaccagatg ctccacgccc agtcgcgtac cgtcttcatg 5340 ggagaaaata atactgttga tgggtgtctg gtcagagaca tcaagaaata acgccggaac 5400 attagtgcag gcagcttcca cagcaatggc atcctggtca tccagcggat agttaatgat 5460 cagcccactg acgcgttgcg cgagaagatt gtgcaccgcc gctttacagg cttcgacgcc 5520 gcttcgttct accatcgaca ccaccacgct ggcacccagt tgatcggcgc gagatttaat 5580 cgccgcgaca atttgcgacg gcgcgtgcag ggccagactg gaggtggcaa cgccaatcag 5640 caacgactgt ttgcccgcca gttgttgtgc cacgcggttg ggaatgtaat tcagctccgc 5700 catcgccgct tccacttttt cccgcgtttt cgcagaaacg tggctggcct ggttcaccac 5760 gcgggaaacg gtctgataag agacaccggc atactctgcg acatcgtata acgttactgg 5820 tttcacattc accaccctga attgactctc ttccgggcgc tatcatgcca taccgcgaaa 5880 ggttttgcgc cattcgatgg tgtccgggat ctcgacgctc tcccttatgc gactcctgca 5940 ttaggaagca gcccagtagt aggttgaggc cgttgagcac cgccgccgca aggaatggtg 6000 catgcaagga gatggcgccc aacagtcccc cggccacggg gcctgccacc atacccacgc 6060 cgaaacaagc gctcatgagc ccgaagtggc gagcccgatc ttccccatcg gtgatgtcgg 6120 cgatataggc gccagcaacc gcacctgtgg cgccggtgat gccggccacg atgcgtccgg 6180 cgtagaggat cgagatcgat ctcgatcccg cgaaattaat acgactcact ata 6233 <210> SEQ ID NO 188 <211> LENGTH: 3120 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: codon optimized gene cassette containing the Wood-Ljungdahl promoter in front of the genes meaB, hcmA and hcmB <400> SEQUENCE: 188 atgacttatg taccatcatc agcactttta gaacaactta gagcaggaaa tacttgggct 60 ttaggaagac ttatatcaag agcagaagct ggagttgcag aagctagacc tgcacttgct 120 gaagtatata gacatgcagg ttcagctcat gttataggtt taacaggagt accaggatct 180 ggtaaatcaa ctcttgtagc aaaacttaca gcagctctta gaaaaagagg agaaaaagtt 240 ggtatagtag ctattgatcc tagttctcca tatagtggag gagcaatact tggagataga 300 attagaatga ctgaattagc aaatgattca ggagtattta taagaagtat ggcaactaga 360 ggtgctactg gaggaatggc tagagcagct cttgatgcag ttgatttact tgatgtagct 420 ggatatcata ctattatttt agaaacagtt ggagtaggtc aagatgaagt tgaagtagca 480 catgcttctg atactacagt agttgtatca gcacctggac ttggtgatga aatacaggca 540 attaaagctg gagttttaga aattgctgat attcatgttg taagtaaatg tgatagagat 600 gatgcaaata gaactcttac agatcttaaa caaatgctta ctttaggaac aatggtagga 660 cctaaaagag catgggctat accagttgta ggagtttcaa gttatacagg agaaggtgta 720 gatgatttac ttggtagaat tgcagctcat agacaagcaa ctgctgatac agaacttgga 780 agagaaagaa gaagaagagt agctgaattt agacttcaaa aaactgcaga aacattactt 840 ttagaaagat ttactacagg agcacagcct ttttcaccag cattagctga tagtctttct 900 aatagagcta gtgatcctta tgcagctgca agagaattaa tagctagaac tataagaaaa 960 gaatattcta atgatcttgc atgtgctaaa cttactataa catggttaga accacaaatt 1020 aaaagtcaac ttcagtctga aagaaaagat tgggaagcaa atgaagttgg agcatttctt 1080 aaaaaagcac ctgaaagaaa agaacaattt catacaattg gagattttcc agtacagaga 1140 acttatacag ctgcagatat agcagatact cctcttgaag atattggttt acctggaaga 1200 tatccattta ctagaggacc ttatccaaca atgtatagaa gtagaacttg gacaatgaga 1260 caaatagctg gatttggtac tggagaagat acaaataaaa gatttaaata tcttatagca 1320 cagggtcaga ctggaatatc aacagatttt gatatgccta cattaatggg atatgattca 1380 gatcatccaa tgagtgatgg tgaagttgga agagaaggtg tagctataga tacacttgca 1440 gatatggaag cacttcttgc tgatattgat ttagaaaaaa tttcagttag ttttactata 1500 aatccaagtg catggattct tttagcaatg tatgtagctt taggtgaaaa aagaggttat 1560 gatcttaata aactttctgg aacagtacaa gctgatatac ttaaagaata tatggcacag 1620 aaagaatata tttatcctat agctccaagt gttagaattg taagagatat aattacttat 1680 tctgcaaaaa atcttaaaag atataatcct attaatattt ctggatatca tatatcagaa 1740 gctggttctt caccattaca agaagctgca tttactcttg caaatcttat tacttatgta 1800 aatgaagtaa ctaaaacagg aatgcatgta gatgaatttg cacctagatt agcatttttc 1860 tttgttagtc aaggagattt ctttgaagaa gtagcaaaat ttagagcttt aagaagatgt 1920 tatgctaaaa taatgaaaga aagatttgga gcaagaaatc ctgaatctat gagacttaga 1980 tttcattgtc aaactgctgc agctactctt acaaaaccac agtatatggt taatgttgta 2040 agaacaagtc ttcaagcatt atctgctgta ttgggaggag cacaaagtct tcatactaat 2100 ggatatgatg aagcatttgc tatacctact gaagatgcaa tgaaaatggc tcttagaaca 2160 caacagatta tagctgaaga atctggagtt gcagatgtaa tagatcctct tggaggaagt 2220 tattatgttg aagcattaac tacagaatat gaaaagaaaa tatttgaaat tcttgaagaa 2280 gtagaaaaaa gaggtggaac tattaaactt attgaacaag gatggtttca aaaacagata 2340

gcagattttg cttatgaaac tgcacttaga aaacaatcag gacagaaacc tgttataggt 2400 gtaaatagat ttgttgaaaa tgaagaagat gtaaaaattg aaatacatcc ttatgataat 2460 actacagctg aaagacaaat atcaagaact agaagagtta gagcagaaag agatgaagca 2520 aaagtacaag ctatgcttga tcagttagtt gcagtagcta aagatgaaag tcagaatctt 2580 atgcctctta ctattgaatt agtaaaagca ggagctacaa tgggtgatat tgtagaaaaa 2640 cttaaaggta tttggggaac ttatagagaa acaccagtat tttaagcact agttggagag 2700 cttcccacga tggatcagat tcctattaga gtattattag caaaagtagg tttagatgga 2760 catgatagag gtgtaaaagt tgtagcaaga gcattaagag atgctggaat ggatgtaata 2820 tatagtggtc ttcatagaac tcctgaagaa gtagttaata cagcaattca agaagatgta 2880 gatgttttag gagttagttt actttctggt gtacagctta ctgtttttcc taaaattttt 2940 aaattacttg atgaaagagg agctggtgat ttaatagtaa ttgctggagg agtaatgcca 3000 gatgaagatg cagctgcaat aagaaaactt ggagtaagag aagttttact tcaagataca 3060 ccaccacagg caataataga ttcaataaga agtttagtag cagcaagagg agcaagataa 3120 <210> SEQ ID NO 189 <211> LENGTH: 894 <212> TYPE: PRT <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polypeptide <220> FEATURE: <221> NAME/KEY: MISC_FEATURE <223> OTHER INFORMATION: hcmA and meaB fusion <400> SEQUENCE: 189 Met Thr Tyr Val Pro Ser Ser Ala Leu Leu Glu Gln Leu Arg Ala Gly 1 5 10 15 Asn Thr Trp Ala Leu Gly Arg Leu Ile Ser Arg Ala Glu Ala Gly Val 20 25 30 Ala Glu Ala Arg Pro Ala Leu Ala Glu Val Tyr Arg His Ala Gly Ser 35 40 45 Ala His Val Ile Gly Leu Thr Gly Val Pro Gly Ser Gly Lys Ser Thr 50 55 60 Leu Val Ala Lys Leu Thr Ala Ala Leu Arg Lys Arg Gly Glu Lys Val 65 70 75 80 Gly Ile Val Ala Ile Asp Pro Ser Ser Pro Tyr Ser Gly Gly Ala Ile 85 90 95 Leu Gly Asp Arg Ile Arg Met Thr Glu Leu Ala Asn Asp Ser Gly Val 100 105 110 Phe Ile Arg Ser Met Ala Thr Arg Gly Ala Thr Gly Gly Met Ala Arg 115 120 125 Ala Ala Leu Asp Ala Val Asp Leu Leu Asp Val Ala Gly Tyr His Thr 130 135 140 Ile Ile Leu Glu Thr Val Gly Val Gly Gln Asp Glu Val Glu Val Ala 145 150 155 160 His Ala Ser Asp Thr Thr Val Val Val Ser Ala Pro Gly Leu Gly Asp 165 170 175 Glu Ile Gln Ala Ile Lys Ala Gly Val Leu Glu Ile Ala Asp Ile His 180 185 190 Val Val Ser Lys Cys Asp Arg Asp Asp Ala Asn Arg Thr Leu Thr Asp 195 200 205 Leu Lys Gln Met Leu Thr Leu Gly Thr Met Val Gly Pro Lys Arg Ala 210 215 220 Trp Ala Ile Pro Val Val Gly Val Ser Ser Tyr Thr Gly Glu Gly Val 225 230 235 240 Asp Asp Leu Leu Gly Arg Ile Ala Ala His Arg Gln Ala Thr Ala Asp 245 250 255 Thr Glu Leu Gly Arg Glu Arg Arg Arg Arg Val Ala Glu Phe Arg Leu 260 265 270 Gln Lys Thr Ala Glu Thr Leu Leu Leu Glu Arg Phe Thr Thr Gly Ala 275 280 285 Gln Pro Phe Ser Pro Ala Leu Ala Asp Ser Leu Ser Asn Arg Ala Ser 290 295 300 Asp Pro Tyr Ala Ala Ala Arg Glu Leu Ile Ala Arg Thr Ile Arg Lys 305 310 315 320 Glu Tyr Ser Asn Asp Leu Ala Cys Ala Lys Leu Thr Ile Thr Trp Leu 325 330 335 Glu Pro Gln Ile Lys Ser Gln Leu Gln Ser Glu Arg Lys Asp Trp Glu 340 345 350 Ala Asn Glu Val Gly Ala Phe Leu Lys Lys Ala Pro Glu Arg Lys Glu 355 360 365 Gln Phe His Thr Ile Gly Asp Phe Pro Val Gln Arg Thr Tyr Thr Ala 370 375 380 Ala Asp Ile Ala Asp Thr Pro Leu Glu Asp Ile Gly Leu Pro Gly Arg 385 390 395 400 Tyr Pro Phe Thr Arg Gly Pro Tyr Pro Thr Met Tyr Arg Ser Arg Thr 405 410 415 Trp Thr Met Arg Gln Ile Ala Gly Phe Gly Thr Gly Glu Asp Thr Asn 420 425 430 Lys Arg Phe Lys Tyr Leu Ile Ala Gln Gly Gln Thr Gly Ile Ser Thr 435 440 445 Asp Phe Asp Met Pro Thr Leu Met Gly Tyr Asp Ser Asp His Pro Met 450 455 460 Ser Asp Gly Glu Val Gly Arg Glu Gly Val Ala Ile Asp Thr Leu Ala 465 470 475 480 Asp Met Glu Ala Leu Leu Ala Asp Ile Asp Leu Glu Lys Ile Ser Val 485 490 495 Ser Phe Thr Ile Asn Pro Ser Ala Trp Ile Leu Leu Ala Met Tyr Val 500 505 510 Ala Leu Gly Glu Lys Arg Gly Tyr Asp Leu Asn Lys Leu Ser Gly Thr 515 520 525 Val Gln Ala Asp Ile Leu Lys Glu Tyr Met Ala Gln Lys Glu Tyr Ile 530 535 540 Tyr Pro Ile Ala Pro Ser Val Arg Ile Val Arg Asp Ile Ile Thr Tyr 545 550 555 560 Ser Ala Lys Asn Leu Lys Arg Tyr Asn Pro Ile Asn Ile Ser Gly Tyr 565 570 575 His Ile Ser Glu Ala Gly Ser Ser Pro Leu Gln Glu Ala Ala Phe Thr 580 585 590 Leu Ala Asn Leu Ile Thr Tyr Val Asn Glu Val Thr Lys Thr Gly Met 595 600 605 His Val Asp Glu Phe Ala Pro Arg Leu Ala Phe Phe Phe Val Ser Gln 610 615 620 Gly Asp Phe Phe Glu Glu Val Ala Lys Phe Arg Ala Leu Arg Arg Cys 625 630 635 640 Tyr Ala Lys Ile Met Lys Glu Arg Phe Gly Ala Arg Asn Pro Glu Ser 645 650 655 Met Arg Leu Arg Phe His Cys Gln Thr Ala Ala Ala Thr Leu Thr Lys 660 665 670 Pro Gln Tyr Met Val Asn Val Val Arg Thr Ser Leu Gln Ala Leu Ser 675 680 685 Ala Val Leu Gly Gly Ala Gln Ser Leu His Thr Asn Gly Tyr Asp Glu 690 695 700 Ala Phe Ala Ile Pro Thr Glu Asp Ala Met Lys Met Ala Leu Arg Thr 705 710 715 720 Gln Gln Ile Ile Ala Glu Glu Ser Gly Val Ala Asp Val Ile Asp Pro 725 730 735 Leu Gly Gly Ser Tyr Tyr Val Glu Ala Leu Thr Thr Glu Tyr Glu Lys 740 745 750 Lys Ile Phe Glu Ile Leu Glu Glu Val Glu Lys Arg Gly Gly Thr Ile 755 760 765 Lys Leu Ile Glu Gln Gly Trp Phe Gln Lys Gln Ile Ala Asp Phe Ala 770 775 780 Tyr Glu Thr Ala Leu Arg Lys Gln Ser Gly Gln Lys Pro Val Ile Gly 785 790 795 800 Val Asn Arg Phe Val Glu Asn Glu Glu Asp Val Lys Ile Glu Ile His 805 810 815 Pro Tyr Asp Asn Thr Thr Ala Glu Arg Gln Ile Ser Arg Thr Arg Arg 820 825 830 Val Arg Ala Glu Arg Asp Glu Ala Lys Val Gln Ala Met Leu Asp Gln 835 840 845 Leu Val Ala Val Ala Lys Asp Glu Ser Gln Asn Leu Met Pro Leu Thr 850 855 860 Ile Glu Leu Val Lys Ala Gly Ala Thr Met Gly Asp Ile Val Glu Lys 865 870 875 880 Leu Lys Gly Ile Trp Gly Thr Tyr Arg Glu Thr Pro Val Phe 885 890 <210> SEQ ID NO 190 <211> LENGTH: 849 <212> TYPE: DNA <213> ORGANISM: Clostridium acetobutylicum <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: hbd <400> SEQUENCE: 190 atgagtatta aaagtgtagc ggttttaggt agtggaacta tgtctcgtgg aattgtgcag 60 gcttttgcag aagcaggtat agatgtaatt atccgtggaa gaactgaagg tagtattgga 120 aaaggtctag cagcagtaaa gaaagcttat gataaaaaag tatcaaaggg gaaaatttcc 180 caggaagatg ctgataaaat agttggaaga gtaagtacaa caactgaact tgaaaaattg 240 gctgattgtg atcttataat agaagcagca tcagaggata tgaatataaa gaaagactat 300 tttggaaaat tagaagaaat atgcaagcct gaaacaattt ttgctactaa tacttcttca 360 ttatctataa ctgaagtagc aacagctaca aagagaccag ataaattcat aggaatgcat 420 ttctttaatc cagcaaatgt tatgaaatta gttgaaatca taagaggtat gaatacttca 480 caagaaactt ttgatattat aaaagaagct tccattaaaa taggaaaaac tcctgtagaa 540 gttgcagaag ctccaggatt tgttgtaaac aagatattag taccaatgat caatgaagca 600 gtaggaattt tggcagaagg aatagcttca gcagaagata tcgatacagc tatgaaatta 660 ggcgctaatc acccaatggg tcctttagca ttaggagatc ttattggact tgatgtagtt 720 cttgcagtta tggatgtact ttatagtgaa actggagatt caaaatatag agctcataca 780 ttacttagaa aatatgtaag agcaggatgg cttggaagaa aatcaggaaa aggattcttc 840 gcttattaa 849 <210> SEQ ID NO 191

<211> LENGTH: 10647 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL83155-thlA-hbd-Pwl-meaBhcmA-hcmB <400> SEQUENCE: 191 cctgcaggat aaaaaaattg tagataaatt ttataaaata gttttatcta caattttttt 60 atcaggaaac agctatgacc gcggccgcaa tatgatattt atgtccattg tgaaagggat 120 tatattcaac tattattcca gttacgttca tagaaatttt cctttctaaa atattttatt 180 ccatgtcaag aactctgttt atttcattaa agaactataa gtacaaagta taaggcattt 240 gaaaaaatag gctagtatat tgattgatta tttattttaa aatgcctaag tgaaatatat 300 acatattata acaataaaat aagtattagt gtaggatttt taaatagagt atctattttc 360 agattaaatt tttgattatt tgatttacat tatataatat tgagtaaagt attgactagc 420 aaaatttttt gatactttaa tttgtgaaat ttcttatcaa aagttatatt tttgaataat 480 ttttattgaa aaatacaact aaaaaggatt atagtataag tgtgtgtaat tttgtgttaa 540 atttaaaggg aggaaatgaa catgaaacat atgaaagaag ttgtaatagc tagtgcagta 600 agaacagcga ttggatctta tggaaagtct cttaaggatg taccagcagt agatttagga 660 gctacagcta taaaggaagc agttaaaaaa gcaggaataa aaccagagga tgttaatgaa 720 gtcattttag gaaatgttct tcaagcaggt ttaggacaga atccagcaag acaggcatct 780 tttaaagcag gattaccagt tgaaattcca gctatgacta ttaataaggt ttgtggttca 840 ggacttagaa cagttagctt agcagcacaa attataaaag caggagatgc tgacgtaata 900 atagcaggtg gtatggaaaa tatgtctaga gctccttact tagcgaataa cgctagatgg 960 ggatatagaa tgggaaacgc taaatttgtt gatgaaatga tcactgacgg attgtgggat 1020 gcatttaatg attaccacat gggaataaca gcagaaaaca tagctgagag atggaacatt 1080 tcaagagaag aacaagatga gtttgctctt gcatcacaaa aaaaagctga agaagctata 1140 aaatcaggtc aatttaaaga tgaaatagtt cctgtagtaa ttaaaggcag aaagggagaa 1200 actgtagttg atacagatga gcaccctaga tttggatcaa ctatagaagg acttgcaaaa 1260 ttaaaacctg ccttcaaaaa agatggaaca gttacagctg gtaatgcatc aggattaaat 1320 gactgtgcag cagtacttgt aatcatgagt gcagaaaaag ctaaagagct tggagtaaaa 1380 ccacttgcta agatagtttc ttatggttca gcaggagttg acccagcaat aatgggatat 1440 ggacctttct atgcaacaaa agcagctatt gaaaaagcag gttggacagt tgatgaatta 1500 gatttaatag aatcaaatga agcttttgca gctcaaagtt tagcagtagc aaaagattta 1560 aaatttgata tgaataaagt aaatgtaaat ggaggagcta ttgcccttgg tcatccaatt 1620 ggagcatcag gtgcaagaat actcgttact cttgtacacg caatgcaaaa aagagatgca 1680 aaaaaaggct tagcaacttt atgtataggt ggcggacaag gaacagcaat attgctagaa 1740 aagtgctagg aattctcaaa aattcggtta aataaaataa ttaggaggtt caatcatgtc 1800 tattaaatca gttgcagttt taggttcagg tacaatgtca agaggtattg ttcaagcatt 1860 tgctgaagca ggtatagatg taataattag aggtagaaca gaaggatcaa taggaaaagg 1920 acttgctgct gttaagaaag catacgataa aaaggtaagt aaaggaaaga tatcacaaga 1980 agatgctgat aaaatagttg gtagagtatc tactactaca gaattagaaa aattagcaga 2040 ttgcgacctt ataattgagg ctgcatcaga agatatgaac ataaagaaag attattttgg 2100 aaaacttgaa gaaatatgta aaccagaaac tatttttgct actaatacat caagtttaag 2160 tattacagaa gtagcaacag caactaaaag accagataag ttcataggaa tgcacttctt 2220 taatcctgct aatgtaatga agcttgtaga gattataaga ggtatgaata cttctcagga 2280 aacatttgat ataattaagg aagcaagtat taaaatagga aaaactcctg tagaagtagc 2340 agaagcacca ggatttgttg ttaataagat acttgttcct atgataaatg aggctgtagg 2400 tatacttgct gaaggtattg ctagtgctga agacatagac actgctatga agttaggtgc 2460 aaaccatcct atgggaccat tagcattagg tgatcttatt ggattagatg ttgttttagc 2520 agtaatggat gtactttatt ctgagacagg tgattctaaa tatagagctc atacacttct 2580 tagaaagtat gtaagagctg gttggttagg tagaaagtct ggtaaaggat ttttcgcata 2640 ttaaggtacc gcagatagtc ataatagttc cagaatagtt caatttagaa attagactaa 2700 acttcaaaat gtttgttaaa tatataccaa actagtatag atatttttta aatactggac 2760 ttaaacagta gtaatttgcc taaaaaattt tttcaatttt ttttaaaaaa tccttttcaa 2820 gttgtacatt gttatggtaa tatgtaattg aagaagttat gtagtaatat tgtaaacgtt 2880 tcttgatttt tttacatcca tgtagtgctt aaaaaaccaa aatatgtcac atgcaattgt 2940 atatttcaaa taacaatatt tattttctcg ttaaattcac aaataattta ttaataatat 3000 caataaccaa gattatactt aaatggatgt ttatttttta acacttttat agtaaatata 3060 tttattttat gtagtaaaaa ggttataatt ataattgtat ttattacaat taattaaaat 3120 aaaaaatagg gttttaggta aaattaagtt attttaagaa gtaattacaa taaaaattga 3180 agttatttct ttaaggaggg aattattcat atgacttatg taccatcatc agcactttta 3240 gaacaactta gagcaggaaa tacttgggct ttaggaagac ttatatcaag agcagaagct 3300 ggagttgcag aagctagacc tgcacttgct gaagtatata gacatgcagg ttcagctcat 3360 gttataggtt taacaggagt accaggatct ggtaaatcaa ctcttgtagc aaaacttaca 3420 gcagctctta gaaaaagagg agaaaaagtt ggtatagtag ctattgatcc tagttctcca 3480 tatagtggag gagcaatact tggagataga attagaatga ctgaattagc aaatgattca 3540 ggagtattta taagaagtat ggcaactaga ggtgctactg gaggaatggc tagagcagct 3600 cttgatgcag ttgatttact tgatgtagct ggatatcata ctattatttt agaaacagtt 3660 ggagtaggtc aagatgaagt tgaagtagca catgcttctg atactacagt agttgtatca 3720 gcacctggac ttggtgatga aatacaggca attaaagctg gagttttaga aattgctgat 3780 attcatgttg taagtaaatg tgatagagat gatgcaaata gaactcttac agatcttaaa 3840 caaatgctta ctttaggaac aatggtagga cctaaaagag catgggctat accagttgta 3900 ggagtttcaa gttatacagg agaaggtgta gatgatttac ttggtagaat tgcagctcat 3960 agacaagcaa ctgctgatac agaacttgga agagaaagaa gaagaagagt agctgaattt 4020 agacttcaaa aaactgcaga aacattactt ttagaaagat ttactacagg agcacagcct 4080 ttttcaccag cattagctga tagtctttct aatagagcta gtgatcctta tgcagctgca 4140 agagaattaa tagctagaac tataagaaaa gaatattcta atgatcttgc atgtgctaaa 4200 cttactataa catggttaga accacaaatt aaaagtcaac ttcagtctga aagaaaagat 4260 tgggaagcaa atgaagttgg agcatttctt aaaaaagcac ctgaaagaaa agaacaattt 4320 catacaattg gagattttcc agtacagaga acttatacag ctgcagatat agcagatact 4380 cctcttgaag atattggttt acctggaaga tatccattta ctagaggacc ttatccaaca 4440 atgtatagaa gtagaacttg gacaatgaga caaatagctg gatttggtac tggagaagat 4500 acaaataaaa gatttaaata tcttatagca cagggtcaga ctggaatatc aacagatttt 4560 gatatgccta cattaatggg atatgattca gatcatccaa tgagtgatgg tgaagttgga 4620 agagaaggtg tagctataga tacacttgca gatatggaag cacttcttgc tgatattgat 4680 ttagaaaaaa tttcagttag ttttactata aatccaagtg catggattct tttagcaatg 4740 tatgtagctt taggtgaaaa aagaggttat gatcttaata aactttctgg aacagtacaa 4800 gctgatatac ttaaagaata tatggcacag aaagaatata tttatcctat agctccaagt 4860 gttagaattg taagagatat aattacttat tctgcaaaaa atcttaaaag atataatcct 4920 attaatattt ctggatatca tatatcagaa gctggttctt caccattaca agaagctgca 4980 tttactcttg caaatcttat tacttatgta aatgaagtaa ctaaaacagg aatgcatgta 5040 gatgaatttg cacctagatt agcatttttc tttgttagtc aaggagattt ctttgaagaa 5100 gtagcaaaat ttagagcttt aagaagatgt tatgctaaaa taatgaaaga aagatttgga 5160 gcaagaaatc ctgaatctat gagacttaga tttcattgtc aaactgctgc agctactctt 5220 acaaaaccac agtatatggt taatgttgta agaacaagtc ttcaagcatt atctgctgta 5280 ttgggaggag cacaaagtct tcatactaat ggatatgatg aagcatttgc tatacctact 5340 gaagatgcaa tgaaaatggc tcttagaaca caacagatta tagctgaaga atctggagtt 5400 gcagatgtaa tagatcctct tggaggaagt tattatgttg aagcattaac tacagaatat 5460 gaaaagaaaa tatttgaaat tcttgaagaa gtagaaaaaa gaggtggaac tattaaactt 5520 attgaacaag gatggtttca aaaacagata gcagattttg cttatgaaac tgcacttaga 5580 aaacaatcag gacagaaacc tgttataggt gtaaatagat ttgttgaaaa tgaagaagat 5640 gtaaaaattg aaatacatcc ttatgataat actacagctg aaagacaaat atcaagaact 5700 agaagagtta gagcagaaag agatgaagca aaagtacaag ctatgcttga tcagttagtt 5760 gcagtagcta aagatgaaag tcagaatctt atgcctctta ctattgaatt agtaaaagca 5820 ggagctacaa tgggtgatat tgtagaaaaa cttaaaggta tttggggaac ttatagagaa 5880 acaccagtat tttaagcact agttggagag cttcccacga tggatcagat tcctattaga 5940 gtattattag caaaagtagg tttagatgga catgatagag gtgtaaaagt tgtagcaaga 6000 gcattaagag atgctggaat ggatgtaata tatagtggtc ttcatagaac tcctgaagaa 6060 gtagttaata cagcaattca agaagatgta gatgttttag gagttagttt actttctggt 6120 gtacagctta ctgtttttcc taaaattttt aaattacttg atgaaagagg agctggtgat 6180 ttaatagtaa ttgctggagg agtaatgcca gatgaagatg cagctgcaat aagaaaactt 6240 ggagtaagag aagttttact tcaagataca ccaccacagg caataataga ttcaataaga 6300 agtttagtag cagcaagagg agcaagataa ccatggagat ctcgaggcct gcagacatgc 6360 aagcttggca ctggccgtcg ttttacaacg tcgtgactgg gaaaaccctg gcgttaccca 6420 acttaatcgc cttgcagcac atcccccttt cgccagctgg cgtaatagcg aagaggcccg 6480 caccgatcgc ccttcccaac agttgcgcag cctgaatggc gaatggcgct agcataaaaa 6540 taagaagcct gcatttgcag gcttcttatt tttatggcgc gccgccatta tttttttgaa 6600 caattgacaa ttcatttctt attttttatt aagtgatagt caaaaggcat aacagtgctg 6660 aatagaaaga aatttacaga aaagaaaatt atagaattta gtatgattaa ttatactcat 6720 ttatgaatgt ttaattgaat acaaaaaaaa atacttgtta tgtattcaat tacgggttaa 6780 aatatagaca agttgaaaaa tttaataaaa aaataagtcc tcagctctta tatattaagc 6840 taccaactta gtatataagc caaaacttaa atgtgctacc aacacatcaa gccgttagag 6900 aactctatct atagcaatat ttcaaatgta ccgacataca agagaaacat taactatata 6960 tattcaattt atgagattat cttaacagat ataaatgtaa attgcaataa gtaagattta 7020 gaagtttata gcctttgtgt attggaagca gtacgcaaag gcttttttat ttgataaaaa 7080 ttagaagtat atttattttt tcataattaa tttatgaaaa tgaaaggggg tgagcaaagt 7140 gacagaggaa agcagtatct tatcaaataa caaggtatta gcaatatcat tattgacttt 7200

agcagtaaac attatgactt ttatagtgct tgtagctaag tagtacgaaa gggggagctt 7260 taaaaagctc cttggaatac atagaattca taaattaatt tatgaaaaga agggcgtata 7320 tgaaaacttg taaaaattgc aaagagttta ttaaagatac tgaaatatgc aaaatacatt 7380 cgttgatgat tcatgataaa acagtagcaa cctattgcag taaatacaat gagtcaagat 7440 gtttacataa agggaaagtc caatgtatta attgttcaaa gatgaaccga tatggatggt 7500 gtgccataaa aatgagatgt tttacagagg aagaacagaa aaaagaacgt acatgcatta 7560 aatattatgc aaggagcttt aaaaaagctc atgtaaagaa gagtaaaaag aaaaaataat 7620 ttatttatta atttaatatt gagagtgccg acacagtatg cactaaaaaa tatatctgtg 7680 gtgtagtgag ccgatacaaa aggatagtca ctcgcatttt cataatacat cttatgttat 7740 gattatgtgt cggtgggact tcacgacgaa aacccacaat aaaaaaagag ttcggggtag 7800 ggttaagcat agttgaggca actaaacaat caagctagga tatgcagtag cagaccgtaa 7860 ggtcgttgtt taggtgtgtt gtaatacata cgctattaag atgtaaaaat acggatacca 7920 atgaagggaa aagtataatt tttggatgta gtttgtttgt tcatctatgg gcaaactacg 7980 tccaaagccg tttccaaatc tgctaaaaag tatatccttt ctaaaatcaa agtcaagtat 8040 gaaatcataa ataaagttta attttgaagt tattatgata ttatgttttt ctattaaaat 8100 aaattaagta tatagaatag tttaataata gtatatactt aatgtgataa gtgtctgaca 8160 gtgtcacaga aaggatgatt gttatggatt ataagcggcc ggccagtggg caagttgaaa 8220 aattcacaaa aatgtggtat aatatctttg ttcattagag cgataaactt gaatttgaga 8280 gggaacttag atggtatttg aaaaaattga taaaaatagt tggaacagaa aagagtattt 8340 tgaccactac tttgcaagtg taccttgtac ctacagcatg accgttaaag tggatatcac 8400 acaaataaag gaaaagggaa tgaaactata tcctgcaatg ctttattata ttgcaatgat 8460 tgtaaaccgc cattcagagt ttaggacggc aatcaatcaa gatggtgaat tggggatata 8520 tgatgagatg ataccaagct atacaatatt tcacaatgat actgaaacat tttccagcct 8580 ttggactgag tgtaagtctg actttaaatc atttttagca gattatgaaa gtgatacgca 8640 acggtatgga aacaatcata gaatggaagg aaagccaaat gctccggaaa acatttttaa 8700 tgtatctatg ataccgtggt caaccttcga tggctttaat ctgaatttgc agaaaggata 8760 tgattatttg attcctattt ttactatggg gaaatattat aaagaagata acaaaattat 8820 acttcctttg gcaattcaag ttcatcacgc agtatgtgac ggatttcaca tttgccgttt 8880 tgtaaacgaa ttgcaggaat tgataaatag ttaacttcag gtttgtctgt aactaaaaac 8940 aagtatttaa gcaaaaacat cgtagaaata cggtgttttt tgttacccta agtttaaact 9000 cctttttgat aatctcatga ccaaaatccc ttaacgtgag ttttcgttcc actgagcgtc 9060 agaccccgta gaaaagatca aaggatcttc ttgagatcct ttttttctgc gcgtaatctg 9120 ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct 9180 accaactctt tttccgaagg taactggctt cagcagagcg cagataccaa atactgttct 9240 tctagtgtag ccgtagttag gccaccactt caagaactct gtagcaccgc ctacatacct 9300 cgctctgcta atcctgttac cagtggctgc tgccagtggc gataagtcgt gtcttaccgg 9360 gttggactca agacgatagt taccggataa ggcgcagcgg tcgggctgaa cggggggttc 9420 gtgcacacag cccagcttgg agcgaacgac ctacaccgaa ctgagatacc tacagcgtga 9480 gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg 9540 cagggtcgga acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta 9600 tagtcctgtc gggtttcgcc acctctgact tgagcgtcga tttttgtgat gctcgtcagg 9660 ggggcggagc ctatggaaaa acgccagcaa cgcggccttt ttacggttcc tggccttttg 9720 ctggcctttt gctcacatgt tctttcctgc gttatcccct gattctgtgg ataaccgtat 9780 taccgccttt gagtgagctg ataccgctcg ccgcagccga acgaccgagc gcagcgagtc 9840 agtgagcgag gaagcggaag agcgcccaat acgcagggcc ccctgcttcg gggtcattat 9900 agcgattttt tcggtatatc catccttttt cgcacgatat acaggatttt gccaaagggt 9960 tcgtgtagac tttccttggt gtatccaacg gcgtcagccg ggcaggatag gtgaagtagg 10020 cccacccgcg agcgggtgtt ccttcttcac tgtcccttat tcgcacctgg cggtgctcaa 10080 cgggaatcct gctctgcgag gctggccggc taccgccggc gtaacagatg agggcaagcg 10140 gatggctgat gaaaccaagc caaccaggaa gggcagccca cctatcaagg tgtactgcct 10200 tccagacgaa cgaagagcga ttgaggaaaa ggcggcggcg gccggcatga gcctgtcggc 10260 ctacctgctg gccgtcggcc agggctacaa aatcacgggc gtcgtggact atgagcacgt 10320 ccgcgagctg gcccgcatca atggcgacct gggccgcctg ggcggcctgc tgaaactctg 10380 gctcaccgac gacccgcgca cggcgcggtt cggtgatgcc acgatcctcg ccctgctggc 10440 gaagatcgaa gagaagcagg acgagcttgg caaggtcatg atgggcgtgg tccgcccgag 10500 ggcagagcca tgactttttt agccgctaaa acggccgggg ggtgcgcgtg attgccaagc 10560 acgtccccat gcgctccatc aagaagagcg acttcgcgga gctggtgaag tacatcaccg 10620 acgagcaagg caagaccgat cgggccc 10647 <210> SEQ ID NO 192 <211> LENGTH: 10539 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL83155-thlA-phaB-Pwl-meaBhcmA-hcmB <400> SEQUENCE: 192 cctgcaggat aaaaaaattg tagataaatt ttataaaata gttttatcta caattttttt 60 atcaggaaac agctatgacc gcggccgcaa tatgatattt atgtccattg tgaaagggat 120 tatattcaac tattattcca gttacgttca tagaaatttt cctttctaaa atattttatt 180 ccatgtcaag aactctgttt atttcattaa agaactataa gtacaaagta taaggcattt 240 gaaaaaatag gctagtatat tgattgatta tttattttaa aatgcctaag tgaaatatat 300 acatattata acaataaaat aagtattagt gtaggatttt taaatagagt atctattttc 360 agattaaatt tttgattatt tgatttacat tatataatat tgagtaaagt attgactagc 420 aaaatttttt gatactttaa tttgtgaaat ttcttatcaa aagttatatt tttgaataat 480 ttttattgaa aaatacaact aaaaaggatt atagtataag tgtgtgtaat tttgtgttaa 540 atttaaaggg aggaaatgaa catgaaacat atgaaagaag ttgtaatagc tagtgcagta 600 agaacagcga ttggatctta tggaaagtct cttaaggatg taccagcagt agatttagga 660 gctacagcta taaaggaagc agttaaaaaa gcaggaataa aaccagagga tgttaatgaa 720 gtcattttag gaaatgttct tcaagcaggt ttaggacaga atccagcaag acaggcatct 780 tttaaagcag gattaccagt tgaaattcca gctatgacta ttaataaggt ttgtggttca 840 ggacttagaa cagttagctt agcagcacaa attataaaag caggagatgc tgacgtaata 900 atagcaggtg gtatggaaaa tatgtctaga gctccttact tagcgaataa cgctagatgg 960 ggatatagaa tgggaaacgc taaatttgtt gatgaaatga tcactgacgg attgtgggat 1020 gcatttaatg attaccacat gggaataaca gcagaaaaca tagctgagag atggaacatt 1080 tcaagagaag aacaagatga gtttgctctt gcatcacaaa aaaaagctga agaagctata 1140 aaatcaggtc aatttaaaga tgaaatagtt cctgtagtaa ttaaaggcag aaagggagaa 1200 actgtagttg atacagatga gcaccctaga tttggatcaa ctatagaagg acttgcaaaa 1260 ttaaaacctg ccttcaaaaa agatggaaca gttacagctg gtaatgcatc aggattaaat 1320 gactgtgcag cagtacttgt aatcatgagt gcagaaaaag ctaaagagct tggagtaaaa 1380 ccacttgcta agatagtttc ttatggttca gcaggagttg acccagcaat aatgggatat 1440 ggacctttct atgcaacaaa agcagctatt gaaaaagcag gttggacagt tgatgaatta 1500 gatttaatag aatcaaatga agcttttgca gctcaaagtt tagcagtagc aaaagattta 1560 aaatttgata tgaataaagt aaatgtaaat ggaggagcta ttgcccttgg tcatccaatt 1620 ggagcatcag gtgcaagaat actcgttact cttgtacacg caatgcaaaa aagagatgca 1680 aaaaaaggct tagcaacttt atgtataggt ggcggacaag gaacagcaat attgctagaa 1740 aagtgctagg aattctcaaa aattcggtta aataaaataa ttaggaggtt caatcatgac 1800 tcagcgcatt gcgtatgtga ccggcggcat gggtggtatc ggaaccgcca tttgccagcg 1860 gctggccaag gatggctttc gtgtggtggc cggttgcggc cccaactcgc cgcgccgcga 1920 aaagtggctg gagcagcaga aggccctggg cttcgatttc attgcctcgg aaggcaatgt 1980 ggctgactgg gactcgacca agaccgcatt cgacaaggtc aagtccgagg tcggcgaggt 2040 tgatgtgctg atcaacaacg ccggtatcac ccgcgacgtg gtgttccgca agatgacccg 2100 cgccgactgg gatgcggtga tcgacaccaa cctgacctcg ctgttcaacg tcaccaagca 2160 ggtgatcgac ggcatggccg accgtggctg gggccgcatc gtcaacatct cgtcggtgaa 2220 cgggcagaag ggccagttcg gccagaccaa ctactccacc gccaaggccg gcctgcatgg 2280 cttcaccatg gcactggcgc aggaagtggc gaccaagggc gtgaccgtca acacggtctc 2340 tccgggctat atcgccaccg acatggtcaa ggcgatccgc caggacgtgc tcgacaagat 2400 cgtcgcgacg atcccggtca agcgcctggg cctgccggaa gagatcgcct cgatctgcgc 2460 ctggttgtcg tcggaggagt ccggtttctc gaccggcgcc gacttctcgc tcaacggcgg 2520 cctgcatatg ggctgaggta ccgcagatag tcataatagt tccagaatag ttcaatttag 2580 aaattagact aaacttcaaa atgtttgtta aatatatacc aaactagtat agatattttt 2640 taaatactgg acttaaacag tagtaatttg cctaaaaaat tttttcaatt ttttttaaaa 2700 aatccttttc aagttgtaca ttgttatggt aatatgtaat tgaagaagtt atgtagtaat 2760 attgtaaacg tttcttgatt tttttacatc catgtagtgc ttaaaaaacc aaaatatgtc 2820 acatgcaatt gtatatttca aataacaata tttattttct cgttaaattc acaaataatt 2880 tattaataat atcaataacc aagattatac ttaaatggat gtttattttt taacactttt 2940 atagtaaata tatttatttt atgtagtaaa aaggttataa ttataattgt atttattaca 3000 attaattaaa ataaaaaata gggttttagg taaaattaag ttattttaag aagtaattac 3060 aataaaaatt gaagttattt ctttaaggag ggaattattc atatgactta tgtaccatca 3120 tcagcacttt tagaacaact tagagcagga aatacttggg ctttaggaag acttatatca 3180 agagcagaag ctggagttgc agaagctaga cctgcacttg ctgaagtata tagacatgca 3240 ggttcagctc atgttatagg tttaacagga gtaccaggat ctggtaaatc aactcttgta 3300 gcaaaactta cagcagctct tagaaaaaga ggagaaaaag ttggtatagt agctattgat 3360 cctagttctc catatagtgg aggagcaata cttggagata gaattagaat gactgaatta 3420 gcaaatgatt caggagtatt tataagaagt atggcaacta gaggtgctac tggaggaatg 3480 gctagagcag ctcttgatgc agttgattta cttgatgtag ctggatatca tactattatt 3540 ttagaaacag ttggagtagg tcaagatgaa gttgaagtag cacatgcttc tgatactaca 3600 gtagttgtat cagcacctgg acttggtgat gaaatacagg caattaaagc tggagtttta 3660

gaaattgctg atattcatgt tgtaagtaaa tgtgatagag atgatgcaaa tagaactctt 3720 acagatctta aacaaatgct tactttagga acaatggtag gacctaaaag agcatgggct 3780 ataccagttg taggagtttc aagttataca ggagaaggtg tagatgattt acttggtaga 3840 attgcagctc atagacaagc aactgctgat acagaacttg gaagagaaag aagaagaaga 3900 gtagctgaat ttagacttca aaaaactgca gaaacattac ttttagaaag atttactaca 3960 ggagcacagc ctttttcacc agcattagct gatagtcttt ctaatagagc tagtgatcct 4020 tatgcagctg caagagaatt aatagctaga actataagaa aagaatattc taatgatctt 4080 gcatgtgcta aacttactat aacatggtta gaaccacaaa ttaaaagtca acttcagtct 4140 gaaagaaaag attgggaagc aaatgaagtt ggagcatttc ttaaaaaagc acctgaaaga 4200 aaagaacaat ttcatacaat tggagatttt ccagtacaga gaacttatac agctgcagat 4260 atagcagata ctcctcttga agatattggt ttacctggaa gatatccatt tactagagga 4320 ccttatccaa caatgtatag aagtagaact tggacaatga gacaaatagc tggatttggt 4380 actggagaag atacaaataa aagatttaaa tatcttatag cacagggtca gactggaata 4440 tcaacagatt ttgatatgcc tacattaatg ggatatgatt cagatcatcc aatgagtgat 4500 ggtgaagttg gaagagaagg tgtagctata gatacacttg cagatatgga agcacttctt 4560 gctgatattg atttagaaaa aatttcagtt agttttacta taaatccaag tgcatggatt 4620 cttttagcaa tgtatgtagc tttaggtgaa aaaagaggtt atgatcttaa taaactttct 4680 ggaacagtac aagctgatat acttaaagaa tatatggcac agaaagaata tatttatcct 4740 atagctccaa gtgttagaat tgtaagagat ataattactt attctgcaaa aaatcttaaa 4800 agatataatc ctattaatat ttctggatat catatatcag aagctggttc ttcaccatta 4860 caagaagctg catttactct tgcaaatctt attacttatg taaatgaagt aactaaaaca 4920 ggaatgcatg tagatgaatt tgcacctaga ttagcatttt tctttgttag tcaaggagat 4980 ttctttgaag aagtagcaaa atttagagct ttaagaagat gttatgctaa aataatgaaa 5040 gaaagatttg gagcaagaaa tcctgaatct atgagactta gatttcattg tcaaactgct 5100 gcagctactc ttacaaaacc acagtatatg gttaatgttg taagaacaag tcttcaagca 5160 ttatctgctg tattgggagg agcacaaagt cttcatacta atggatatga tgaagcattt 5220 gctataccta ctgaagatgc aatgaaaatg gctcttagaa cacaacagat tatagctgaa 5280 gaatctggag ttgcagatgt aatagatcct cttggaggaa gttattatgt tgaagcatta 5340 actacagaat atgaaaagaa aatatttgaa attcttgaag aagtagaaaa aagaggtgga 5400 actattaaac ttattgaaca aggatggttt caaaaacaga tagcagattt tgcttatgaa 5460 actgcactta gaaaacaatc aggacagaaa cctgttatag gtgtaaatag atttgttgaa 5520 aatgaagaag atgtaaaaat tgaaatacat ccttatgata atactacagc tgaaagacaa 5580 atatcaagaa ctagaagagt tagagcagaa agagatgaag caaaagtaca agctatgctt 5640 gatcagttag ttgcagtagc taaagatgaa agtcagaatc ttatgcctct tactattgaa 5700 ttagtaaaag caggagctac aatgggtgat attgtagaaa aacttaaagg tatttgggga 5760 acttatagag aaacaccagt attttaagca ctagttggag agcttcccac gatggatcag 5820 attcctatta gagtattatt agcaaaagta ggtttagatg gacatgatag aggtgtaaaa 5880 gttgtagcaa gagcattaag agatgctgga atggatgtaa tatatagtgg tcttcataga 5940 actcctgaag aagtagttaa tacagcaatt caagaagatg tagatgtttt aggagttagt 6000 ttactttctg gtgtacagct tactgttttt cctaaaattt ttaaattact tgatgaaaga 6060 ggagctggtg atttaatagt aattgctgga ggagtaatgc cagatgaaga tgcagctgca 6120 ataagaaaac ttggagtaag agaagtttta cttcaagata caccaccaca ggcaataata 6180 gattcaataa gaagtttagt agcagcaaga ggagcaagat aaccatggag atctcgaggc 6240 ctgcagacat gcaagcttgg cactggccgt cgttttacaa cgtcgtgact gggaaaaccc 6300 tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct ggcgtaatag 6360 cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg gcgaatggcg 6420 ctagcataaa aataagaagc ctgcatttgc aggcttctta tttttatggc gcgccgccat 6480 tatttttttg aacaattgac aattcatttc ttatttttta ttaagtgata gtcaaaaggc 6540 ataacagtgc tgaatagaaa gaaatttaca gaaaagaaaa ttatagaatt tagtatgatt 6600 aattatactc atttatgaat gtttaattga atacaaaaaa aaatacttgt tatgtattca 6660 attacgggtt aaaatataga caagttgaaa aatttaataa aaaaataagt cctcagctct 6720 tatatattaa gctaccaact tagtatataa gccaaaactt aaatgtgcta ccaacacatc 6780 aagccgttag agaactctat ctatagcaat atttcaaatg taccgacata caagagaaac 6840 attaactata tatattcaat ttatgagatt atcttaacag atataaatgt aaattgcaat 6900 aagtaagatt tagaagttta tagcctttgt gtattggaag cagtacgcaa aggctttttt 6960 atttgataaa aattagaagt atatttattt tttcataatt aatttatgaa aatgaaaggg 7020 ggtgagcaaa gtgacagagg aaagcagtat cttatcaaat aacaaggtat tagcaatatc 7080 attattgact ttagcagtaa acattatgac ttttatagtg cttgtagcta agtagtacga 7140 aagggggagc tttaaaaagc tccttggaat acatagaatt cataaattaa tttatgaaaa 7200 gaagggcgta tatgaaaact tgtaaaaatt gcaaagagtt tattaaagat actgaaatat 7260 gcaaaataca ttcgttgatg attcatgata aaacagtagc aacctattgc agtaaataca 7320 atgagtcaag atgtttacat aaagggaaag tccaatgtat taattgttca aagatgaacc 7380 gatatggatg gtgtgccata aaaatgagat gttttacaga ggaagaacag aaaaaagaac 7440 gtacatgcat taaatattat gcaaggagct ttaaaaaagc tcatgtaaag aagagtaaaa 7500 agaaaaaata atttatttat taatttaata ttgagagtgc cgacacagta tgcactaaaa 7560 aatatatctg tggtgtagtg agccgataca aaaggatagt cactcgcatt ttcataatac 7620 atcttatgtt atgattatgt gtcggtggga cttcacgacg aaaacccaca ataaaaaaag 7680 agttcggggt agggttaagc atagttgagg caactaaaca atcaagctag gatatgcagt 7740 agcagaccgt aaggtcgttg tttaggtgtg ttgtaataca tacgctatta agatgtaaaa 7800 atacggatac caatgaaggg aaaagtataa tttttggatg tagtttgttt gttcatctat 7860 gggcaaacta cgtccaaagc cgtttccaaa tctgctaaaa agtatatcct ttctaaaatc 7920 aaagtcaagt atgaaatcat aaataaagtt taattttgaa gttattatga tattatgttt 7980 ttctattaaa ataaattaag tatatagaat agtttaataa tagtatatac ttaatgtgat 8040 aagtgtctga cagtgtcaca gaaaggatga ttgttatgga ttataagcgg ccggccagtg 8100 ggcaagttga aaaattcaca aaaatgtggt ataatatctt tgttcattag agcgataaac 8160 ttgaatttga gagggaactt agatggtatt tgaaaaaatt gataaaaata gttggaacag 8220 aaaagagtat tttgaccact actttgcaag tgtaccttgt acctacagca tgaccgttaa 8280 agtggatatc acacaaataa aggaaaaggg aatgaaacta tatcctgcaa tgctttatta 8340 tattgcaatg attgtaaacc gccattcaga gtttaggacg gcaatcaatc aagatggtga 8400 attggggata tatgatgaga tgataccaag ctatacaata tttcacaatg atactgaaac 8460 attttccagc ctttggactg agtgtaagtc tgactttaaa tcatttttag cagattatga 8520 aagtgatacg caacggtatg gaaacaatca tagaatggaa ggaaagccaa atgctccgga 8580 aaacattttt aatgtatcta tgataccgtg gtcaaccttc gatggcttta atctgaattt 8640 gcagaaagga tatgattatt tgattcctat ttttactatg gggaaatatt ataaagaaga 8700 taacaaaatt atacttcctt tggcaattca agttcatcac gcagtatgtg acggatttca 8760 catttgccgt tttgtaaacg aattgcagga attgataaat agttaacttc aggtttgtct 8820 gtaactaaaa acaagtattt aagcaaaaac atcgtagaaa tacggtgttt tttgttaccc 8880 taagtttaaa ctcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt 8940 ccactgagcg tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct 9000 gcgcgtaatc tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc 9060 ggatcaagag ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc 9120 aaatactgtt cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc 9180 gcctacatac ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc 9240 gtgtcttacc gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg 9300 aacggggggt tcgtgcacac agcccagctt ggagcgaacg acctacaccg aactgagata 9360 cctacagcgt gagctatgag aaagcgccac gcttcccgaa gggagaaagg cggacaggta 9420 tccggtaagc ggcagggtcg gaacaggaga gcgcacgagg gagcttccag ggggaaacgc 9480 ctggtatctt tatagtcctg tcgggtttcg ccacctctga cttgagcgtc gatttttgtg 9540 atgctcgtca ggggggcgga gcctatggaa aaacgccagc aacgcggcct ttttacggtt 9600 cctggccttt tgctggcctt ttgctcacat gttctttcct gcgttatccc ctgattctgt 9660 ggataaccgt attaccgcct ttgagtgagc tgataccgct cgccgcagcc gaacgaccga 9720 gcgcagcgag tcagtgagcg aggaagcgga agagcgccca atacgcaggg ccccctgctt 9780 cggggtcatt atagcgattt tttcggtata tccatccttt ttcgcacgat atacaggatt 9840 ttgccaaagg gttcgtgtag actttccttg gtgtatccaa cggcgtcagc cgggcaggat 9900 aggtgaagta ggcccacccg cgagcgggtg ttccttcttc actgtccctt attcgcacct 9960 ggcggtgctc aacgggaatc ctgctctgcg aggctggccg gctaccgccg gcgtaacaga 10020 tgagggcaag cggatggctg atgaaaccaa gccaaccagg aagggcagcc cacctatcaa 10080 ggtgtactgc cttccagacg aacgaagagc gattgaggaa aaggcggcgg cggccggcat 10140 gagcctgtcg gcctacctgc tggccgtcgg ccagggctac aaaatcacgg gcgtcgtgga 10200 ctatgagcac gtccgcgagc tggcccgcat caatggcgac ctgggccgcc tgggcggcct 10260 gctgaaactc tggctcaccg acgacccgcg cacggcgcgg ttcggtgatg ccacgatcct 10320 cgccctgctg gcgaagatcg aagagaagca ggacgagctt ggcaaggtca tgatgggcgt 10380 ggtccgcccg agggcagagc catgactttt ttagccgcta aaacggccgg ggggtgcgcg 10440 tgattgccaa gcacgtcccc atgcgctcca tcaagaagag cgacttcgcg gagctggtga 10500 agtacatcac cgacgagcaa ggcaagaccg atcgggccc 10539 <210> SEQ ID NO 193 <211> LENGTH: 487 <212> TYPE: DNA <213> ORGANISM: Clostridium autoethanogenum <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: promoter region of phosphate acetyltransferase <400> SEQUENCE: 193 ggccgcaata tgatatttat gtccattgtg aaagggatta tattcaacta ttattccagt 60 tacgttcata gaaattttcc tttctaaaat attttattcc atgtcaagaa ctctgtttat 120 ttcattaaag aactataagt acaaagtata aggcatttga aaaaataggc tagtatattg 180 attgattatt tattttaaaa tgcctaagtg aaatatatac atattataac aataaaataa 240 gtattagtgt aggattttta aatagagtat ctattttcag attaaatttt tgattatttg 300

atttacatta tataatattg agtaaagtat tgactagcaa aattttttga tactttaatt 360 tgtgaaattt cttatcaaaa gttatatttt tgaataattt ttattgaaaa atacaactaa 420 aaaggattat agtataagtg tgtgtaattt tgtgttaaat ttaaagggag gaaatgaaca 480 tgaaaca 487 <210> SEQ ID NO 194 <211> LENGTH: 7884 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL82256-ptb-buk <400> SEQUENCE: 194 gagatctcga ggcctgcaga catgcaagct tggcactggc cgtcgtttta caacgtcgtg 60 actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 120 gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 180 atggcgaatg gcgctagcat aaaaataaga agcctgcatt tgcaggcttc ttatttttat 240 ggcgcgccgt tctgaatcct tagctaatgg ttcaacaggt aactatgacg aagatagcac 300 cctggataag tctgtaatgg attctaaggc atttaatgaa gacgtgtata taaaatgtgc 360 taatgaaaaa gaaaatgcgt taaaagagcc taaaatgagt tcaaatggtt ttgaaattga 420 ttggtagttt aatttaatat attttttcta ttggctatct cgatacctat agaatcttct 480 gttcactttt gtttttgaaa tataaaaagg ggctttttag cccctttttt ttaaaactcc 540 ggaggagttt cttcattctt gatactatac gtaactattt tcgatttgac ttcattgtca 600 attaagctag taaaatcaat ggttaaaaaa caaaaaactt gcatttttct acctagtaat 660 ttataatttt aagtgtcgag tttaaaagta taatttacca ggaaaggagc aagtttttta 720 ataaggaaaa atttttcctt ttaaaattct atttcgttat atgactaatt ataatcaaaa 780 aaatgaaaat aaacaagagg taaaaactgc tttagagaaa tgtactgata aaaaaagaaa 840 aaatcctaga tttacgtcat acatagcacc tttaactact aagaaaaata ttgaaaggac 900 ttccacttgt ggagattatt tgtttatgtt gagtgatgca gacttagaac attttaaatt 960 acataaaggt aatttttgcg gtaatagatt ttgtccaatg tgtagttggc gacttgcttg 1020 taaggatagt ttagaaatat ctattcttat ggagcattta agaaaagaag aaaataaaga 1080 gtttatattt ttaactctta caactccaaa tgtaaaaagt tatgatctta attattctat 1140 taaacaatat aataaatctt ttaaaaaatt aatggagcgt aaggaagtta aggatataac 1200 taaaggttat ataagaaaat tagaagtaac ttaccaaaag gaaaaataca taacaaagga 1260 tttatggaaa ataaaaaaag attattatca aaaaaaagga cttgaaattg gtgatttaga 1320 acctaatttt gatacttata atcctcattt tcatgtagtt attgcagtta ataaaagtta 1380 ttttacagat aaaaattatt atataaatcg agaaagatgg ttggaattat ggaagtttgc 1440 tactaaggat gattctataa ctcaagttga tgttagaaaa gcaaaaatta atgattataa 1500 agaggtttac gaacttgcga aatattcagc taaagacact gattatttaa tatcgaggcc 1560 agtatttgaa attttttata aagcattaaa aggcaagcag gtattagttt ttagtggatt 1620 ttttaaagat gcacacaaat tgtacaagca aggaaaactt gatgtttata aaaagaaaga 1680 tgaaattaaa tatgtctata tagtttatta taattggtgc aaaaaacaat atgaaaaaac 1740 tagaataagg gaacttacgg aagatgaaaa agaagaatta aatcaagatt taatagatga 1800 aatagaaata gattaaagtg taactatact ttatatatat atgattaaaa aaataaaaaa 1860 caacagccta ttaggttgtt gttttttatt ttctttatta atttttttaa tttttagttt 1920 ttagttcttt tttaaaataa gtttcagcct ctttttcaat attttttaaa gaaggagtat 1980 ttgcatgaat tgcctttttt ctaacagact taggaaatat tttaacagta tcttcttgcg 2040 ccggtgattt tggaacttca taacttacta atttataatt attattttct tttttaattg 2100 taacagttgc aaaagaagct gaacctgttc cttcaactag tttatcatct tcaatataat 2160 attcttgacc tatatagtat aaatatattt ttattatatt tttacttttt tctgaatcta 2220 ttattttata atcataaaaa gttttaccac caaaagaagg ttgtactcct tctggtccaa 2280 catatttttt tactatatta tctaaataat ttttgggaac tggtgttgta atttgattaa 2340 tcgaacaacc agttatactt aaaggaatta taactataaa aatatatagg attatctttt 2400 taaatttcat tattggcctc ctttttatta aatttatgtt accataaaaa ggacataacg 2460 ggaatatgta gaatattttt aatgtagaca aaattttaca taaatataaa gaaaggaagt 2520 gtttgtttaa attttatagc aaactatcaa aaattagggg gataaaaatt tatgaaaaaa 2580 aggttttcga tgttattttt atgtttaact ttaatagttt gtggtttatt tacaaattcg 2640 gccggccgaa gcaaacttaa gagtgtgttg atagtgcagt atcttaaaat tttgtataat 2700 aggaattgaa gttaaattag atgctaaaaa tttgtaatta agaaggagtg attacatgaa 2760 caaaaatata aaatattctc aaaacttttt aacgagtgaa aaagtactca accaaataat 2820 aaaacaattg aatttaaaag aaaccgatac cgtttacgaa attggaacag gtaaagggca 2880 tttaacgacg aaactggcta aaataagtaa acaggtaacg tctattgaat tagacagtca 2940 tctattcaac ttatcgtcag aaaaattaaa actgaatact cgtgtcactt taattcacca 3000 agatattcta cagtttcaat tccctaacaa acagaggtat aaaattgttg ggagtattcc 3060 ttaccattta agcacacaaa ttattaaaaa agtggttttt gaaagccatg cgtctgacat 3120 ctatctgatt gttgaagaag gattctacaa gcgtaccttg gatattcacc gaacactagg 3180 gttgctcttg cacactcaag tctcgattca gcaattgctt aagctgccag cggaatgctt 3240 tcatcctaaa ccaaaagtaa acagtgtctt aataaaactt acccgccata ccacagatgt 3300 tccagataaa tattggaagc tatatacgta ctttgtttca aaatgggtca atcgagaata 3360 tcgtcaactg tttactaaaa atcagtttca tcaagcaatg aaacacgcca aagtaaacaa 3420 tttaagtacc gttacttatg agcaagtatt gtctattttt aatagttatc tattatttaa 3480 cgggaggaaa taattctatg agtcgctttt gtaaatttgg aaagttacac gttactaaag 3540 ggaatgtgtt taaactcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 3600 cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 3660 ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 3720 tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 3780 taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 3840 caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 3900 agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 3960 gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 4020 gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 4080 ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 4140 acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 4200 tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 4260 ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt 4320 ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga 4380 ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc agggccccct 4440 gcttcggggt cattatagcg attttttcgg tatatccatc ctttttcgca cgatatacag 4500 gattttgcca aagggttcgt gtagactttc cttggtgtat ccaacggcgt cagccgggca 4560 ggataggtga agtaggccca cccgcgagcg ggtgttcctt cttcactgtc ccttattcgc 4620 acctggcggt gctcaacggg aatcctgctc tgcgaggctg gccggctacc gccggcgtaa 4680 cagatgaggg caagcggatg gctgatgaaa ccaagccaac caggaagggc agcccaccta 4740 tcaaggtgta ctgccttcca gacgaacgaa gagcgattga ggaaaaggcg gcggcggccg 4800 gcatgagcct gtcggcctac ctgctggccg tcggccaggg ctacaaaatc acgggcgtcg 4860 tggactatga gcacgtccgc gagctggccc gcatcaatgg cgacctgggc cgcctgggcg 4920 gcctgctgaa actctggctc accgacgacc cgcgcacggc gcggttcggt gatgccacga 4980 tcctcgccct gctggcgaag atcgaagaga agcaggacga gcttggcaag gtcatgatgg 5040 gcgtggtccg cccgagggca gagccatgac ttttttagcc gctaaaacgg ccggggggtg 5100 cgcgtgattg ccaagcacgt ccccatgcgc tccatcaaga agagcgactt cgcggagctg 5160 gtgaagtaca tcaccgacga gcaaggcaag accgatcggg ccccctgcag gataaaaaaa 5220 ttgtagataa attttataaa atagttttat ctacaatttt tttatcagga aacagctatg 5280 accgcggccg caaaatagtt gataataatg cagagttata aacaaaggtg aaaagcatta 5340 cttgtattct tttttatata ttattataaa ttaaaatgaa gctgtattag aaaaaataca 5400 cacctgtaat ataaaatttt aaattaattt ttaatttttt caaaatgtat tttacatgtt 5460 tagaattttg atgtatatta aaatagtaga atacataaga tacttaattt aattaaagat 5520 agttaagtac ttttcaatgt gcttttttag atgtttaata caaatcttta attgtaaaag 5580 aaatgctgta ctatttactg tactagtgac gggattaaac tgtattaatt ataaataaaa 5640 aataagtaca gttgtttaaa attatatttt gtattaaatc taatagtacg atgtaagtta 5700 ttttatacta ttgctagttt aataaaaaga tttaattata tgcttgaaaa ggagaggaat 5760 ccaatgagta aaaactttga tgagttatta tcaagattaa aggaagttcc aacaaaaaaa 5820 gtggctgtag ccgtagcaca agatgaacca gtattagagg ctataaaaga agctacagaa 5880 aataacatcg cacaagcaat attggttggt gataaacaac aaatccatga aatcgcaaag 5940 aaaataaact tggacttatc tgattatgaa ataatggata ttaaagatcc aaagaaagca 6000 acattagaag cagtaaaatt agtttctagt ggtcatgcag atatgttaat gaaaggtcta 6060 gttgatactg caacattcct aagaagcgta ttaaacaaag aggttggtct tagaacagga 6120 aaattaatgt cccatgtagc tgtgtttgat gtggaaggtt gggatagact gttattttta 6180 actgatgcag catttaatac atatccagaa tttaaggata aagttggaat gataaataat 6240 gcagttgtag ttgctcatgc atgtggaata gatgttccaa gagtagcacc tatatgccca 6300 gttgaagttg taaatacaag tatgcaatca acagttgatg cagcattgtt agctaaaatg 6360 agtgacaggg ggcaaattaa aggatgcgta attgatggac cttttgcctt agataatgca 6420 atatcagaag aagcagctca tcataaaggt gttacaggat cagtagcagg taaagctgat 6480 atattattat taccaaatat agaagcagca aatgtaatgt ataaaacatt aacatatttc 6540 tctaaatcaa gaaatggtgg acttttagta ggtacatcag caccagtaat tttaacttca 6600 agagcagatt cattcgaaac taaagttaat tcaattgctc ttgcagcatt agttgcagca 6660 agaaataagt aataaatcaa tccataataa ttaatgcata attaatggag agatttatat 6720 ggaatttgca atgcactatt agattctata ataatttctt ctgaaaatta tgcattatga 6780 ctgtatagaa tgcattaaat ttaaggggga ttcagaatgt catataagct attaataatc 6840 aatccaggtt caacatcaac aaagattggt gtttacgaag gagaaaagga actatttgaa 6900

gaaactttga gacacacaaa tgaagaaata aagagatatg atacaatata tgatcaattt 6960 gaatttagaa aagaagttat attaaatgtt cttaaagaaa agaattttga tataaagact 7020 ctaagtgcta ttgttggtag aggtggaatg cttagaccag ttgaaggtgg aacatatgca 7080 gtaaatgatg caatggttga agatttaaaa gttggagttc aaggacctca tgcttctaac 7140 cttggcggaa taattgccaa gtcaattgga gatgaattaa atattccatc atttatagta 7200 gatccagttg ttacagatga gttagcagat gtagcaagac tatctggagt accagaacta 7260 ccaagaaaaa gtaaattcca tgctttaaat caaaaagcgg tagctaaaag atatggaaaa 7320 gaaagtggac aaggatatga aaacctaaat cttgtagttg tacatatggg tggaggcgtt 7380 tcagttggtg ctcacaatca tgggaaagtt gtcgatgtaa ataatgcatt agatggagat 7440 ggcccattct caccagaaag agctggatca gttccaattg gtgatttagt taaaatgtgt 7500 tttagtggaa aatatagtga agcagaagta tatggcaagg ctgtaggaaa aggtggattt 7560 gttggttatc taaacacaaa tgatgtaaaa ggtgttattg ataagatgga agaaggagat 7620 aaagaatgtg aatcaatata caaagcattt gtttatcaaa tttcaaaagc aatcggagaa 7680 atgtcagttg tattagaagg taaagttgat caaattattt ttaccggagg aattgcatac 7740 tcaccaacac ttgttccaga ccttaaagca aaagttgaat ggatagcccc agttacagtt 7800 tatcctggag aagatgaatt acttgctcta gctcaaggtg ctataagagt acttgatgga 7860 gaagaacaag ctaaggttta ctag 7884 <210> SEQ ID NO 195 <211> LENGTH: 6624 <212> TYPE: DNA <213> ORGANISM: Artificial sequence <220> FEATURE: <223> OTHER INFORMATION: Synthetic polynucleotide <220> FEATURE: <221> NAME/KEY: misc_feature <223> OTHER INFORMATION: pMTL82256-tesB <400> SEQUENCE: 195 gagatctcga ggcctgcaga catgcaagct tggcactggc cgtcgtttta caacgtcgtg 60 actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 120 gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 180 atggcgaatg gcgctagcat aaaaataaga agcctgcatt tgcaggcttc ttatttttat 240 ggcgcgccgt tctgaatcct tagctaatgg ttcaacaggt aactatgacg aagatagcac 300 cctggataag tctgtaatgg attctaaggc atttaatgaa gacgtgtata taaaatgtgc 360 taatgaaaaa gaaaatgcgt taaaagagcc taaaatgagt tcaaatggtt ttgaaattga 420 ttggtagttt aatttaatat attttttcta ttggctatct cgatacctat agaatcttct 480 gttcactttt gtttttgaaa tataaaaagg ggctttttag cccctttttt ttaaaactcc 540 ggaggagttt cttcattctt gatactatac gtaactattt tcgatttgac ttcattgtca 600 attaagctag taaaatcaat ggttaaaaaa caaaaaactt gcatttttct acctagtaat 660 ttataatttt aagtgtcgag tttaaaagta taatttacca ggaaaggagc aagtttttta 720 ataaggaaaa atttttcctt ttaaaattct atttcgttat atgactaatt ataatcaaaa 780 aaatgaaaat aaacaagagg taaaaactgc tttagagaaa tgtactgata aaaaaagaaa 840 aaatcctaga tttacgtcat acatagcacc tttaactact aagaaaaata ttgaaaggac 900 ttccacttgt ggagattatt tgtttatgtt gagtgatgca gacttagaac attttaaatt 960 acataaaggt aatttttgcg gtaatagatt ttgtccaatg tgtagttggc gacttgcttg 1020 taaggatagt ttagaaatat ctattcttat ggagcattta agaaaagaag aaaataaaga 1080 gtttatattt ttaactctta caactccaaa tgtaaaaagt tatgatctta attattctat 1140 taaacaatat aataaatctt ttaaaaaatt aatggagcgt aaggaagtta aggatataac 1200 taaaggttat ataagaaaat tagaagtaac ttaccaaaag gaaaaataca taacaaagga 1260 tttatggaaa ataaaaaaag attattatca aaaaaaagga cttgaaattg gtgatttaga 1320 acctaatttt gatacttata atcctcattt tcatgtagtt attgcagtta ataaaagtta 1380 ttttacagat aaaaattatt atataaatcg agaaagatgg ttggaattat ggaagtttgc 1440 tactaaggat gattctataa ctcaagttga tgttagaaaa gcaaaaatta atgattataa 1500 agaggtttac gaacttgcga aatattcagc taaagacact gattatttaa tatcgaggcc 1560 agtatttgaa attttttata aagcattaaa aggcaagcag gtattagttt ttagtggatt 1620 ttttaaagat gcacacaaat tgtacaagca aggaaaactt gatgtttata aaaagaaaga 1680 tgaaattaaa tatgtctata tagtttatta taattggtgc aaaaaacaat atgaaaaaac 1740 tagaataagg gaacttacgg aagatgaaaa agaagaatta aatcaagatt taatagatga 1800 aatagaaata gattaaagtg taactatact ttatatatat atgattaaaa aaataaaaaa 1860 caacagccta ttaggttgtt gttttttatt ttctttatta atttttttaa tttttagttt 1920 ttagttcttt tttaaaataa gtttcagcct ctttttcaat attttttaaa gaaggagtat 1980 ttgcatgaat tgcctttttt ctaacagact taggaaatat tttaacagta tcttcttgcg 2040 ccggtgattt tggaacttca taacttacta atttataatt attattttct tttttaattg 2100 taacagttgc aaaagaagct gaacctgttc cttcaactag tttatcatct tcaatataat 2160 attcttgacc tatatagtat aaatatattt ttattatatt tttacttttt tctgaatcta 2220 ttattttata atcataaaaa gttttaccac caaaagaagg ttgtactcct tctggtccaa 2280 catatttttt tactatatta tctaaataat ttttgggaac tggtgttgta atttgattaa 2340 tcgaacaacc agttatactt aaaggaatta taactataaa aatatatagg attatctttt 2400 taaatttcat tattggcctc ctttttatta aatttatgtt accataaaaa ggacataacg 2460 ggaatatgta gaatattttt aatgtagaca aaattttaca taaatataaa gaaaggaagt 2520 gtttgtttaa attttatagc aaactatcaa aaattagggg gataaaaatt tatgaaaaaa 2580 aggttttcga tgttattttt atgtttaact ttaatagttt gtggtttatt tacaaattcg 2640 gccggccgaa gcaaacttaa gagtgtgttg atagtgcagt atcttaaaat tttgtataat 2700 aggaattgaa gttaaattag atgctaaaaa tttgtaatta agaaggagtg attacatgaa 2760 caaaaatata aaatattctc aaaacttttt aacgagtgaa aaagtactca accaaataat 2820 aaaacaattg aatttaaaag aaaccgatac cgtttacgaa attggaacag gtaaagggca 2880 tttaacgacg aaactggcta aaataagtaa acaggtaacg tctattgaat tagacagtca 2940 tctattcaac ttatcgtcag aaaaattaaa actgaatact cgtgtcactt taattcacca 3000 agatattcta cagtttcaat tccctaacaa acagaggtat aaaattgttg ggagtattcc 3060 ttaccattta agcacacaaa ttattaaaaa agtggttttt gaaagccatg cgtctgacat 3120 ctatctgatt gttgaagaag gattctacaa gcgtaccttg gatattcacc gaacactagg 3180 gttgctcttg cacactcaag tctcgattca gcaattgctt aagctgccag cggaatgctt 3240 tcatcctaaa ccaaaagtaa acagtgtctt aataaaactt acccgccata ccacagatgt 3300 tccagataaa tattggaagc tatatacgta ctttgtttca aaatgggtca atcgagaata 3360 tcgtcaactg tttactaaaa atcagtttca tcaagcaatg aaacacgcca aagtaaacaa 3420 tttaagtacc gttacttatg agcaagtatt gtctattttt aatagttatc tattatttaa 3480 cgggaggaaa taattctatg agtcgctttt gtaaatttgg aaagttacac gttactaaag 3540 ggaatgtgtt taaactcctt tttgataatc tcatgaccaa aatcccttaa cgtgagtttt 3600 cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga gatccttttt 3660 ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg gtggtttgtt 3720 tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc agagcgcaga 3780 taccaaatac tgttcttcta gtgtagccgt agttaggcca ccacttcaag aactctgtag 3840 caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc agtggcgata 3900 agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg cagcggtcgg 3960 gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac accgaactga 4020 gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga aaggcggaca 4080 ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt ccagggggaa 4140 acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag cgtcgatttt 4200 tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg gcctttttac 4260 ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta tcccctgatt 4320 ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc agccgaacga 4380 ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cccaatacgc agggccccct 4440 gcttcggggt cattatagcg attttttcgg tatatccatc ctttttcgca cgatatacag 4500 gattttgcca aagggttcgt gtagactttc cttggtgtat ccaacggcgt cagccgggca 4560 ggataggtga agtaggccca cccgcgagcg ggtgttcctt cttcactgtc ccttattcgc 4620 acctggcggt gctcaacggg aatcctgctc tgcgaggctg gccggctacc gccggcgtaa 4680 cagatgaggg caagcggatg gctgatgaaa ccaagccaac caggaagggc agcccaccta 4740 tcaaggtgta ctgccttcca gacgaacgaa gagcgattga ggaaaaggcg gcggcggccg 4800 gcatgagcct gtcggcctac ctgctggccg tcggccaggg ctacaaaatc acgggcgtcg 4860 tggactatga gcacgtccgc gagctggccc gcatcaatgg cgacctgggc cgcctgggcg 4920 gcctgctgaa actctggctc accgacgacc cgcgcacggc gcggttcggt gatgccacga 4980 tcctcgccct gctggcgaag atcgaagaga agcaggacga gcttggcaag gtcatgatgg 5040 gcgtggtccg cccgagggca gagccatgac ttttttagcc gctaaaacgg ccggggggtg 5100 cgcgtgattg ccaagcacgt ccccatgcgc tccatcaaga agagcgactt cgcggagctg 5160 gtgaagtaca tcaccgacga gcaaggcaag accgatcggg ccccctgcag gataaaaaaa 5220 ttgtagataa attttataaa atagttttat ctacaatttt tttatcagga aacagctatg 5280 accgcggccg caaaatagtt gataataatg cagagttata aacaaaggtg aaaagcatta 5340 cttgtattct tttttatata ttattataaa ttaaaatgaa gctgtattag aaaaaataca 5400 cacctgtaat ataaaatttt aaattaattt ttaatttttt caaaatgtat tttacatgtt 5460 tagaattttg atgtatatta aaatagtaga atacataaga tacttaattt aattaaagat 5520 agttaagtac ttttcaatgt gcttttttag atgtttaata caaatcttta attgtaaaag 5580 aaatgctgta ctatttactg tactagtgac gggattaaac tgtattaatt ataaataaaa 5640 aataagtaca gttgtttaaa attatatttt gtattaaatc taatagtacg atgtaagtta 5700 ttttatacta ttgctagttt aataaaaaga tttaattata tgcttgaaaa ggagaggaat 5760 ccaatgagtc aggcacttaa aaatttactt actttactta atcttgaaaa aatagaagaa 5820 ggtttattta gaggacagtc agaagattta ggattaagac aagtatttgg aggtcaagta 5880 gttggtcagg cactttatgc agctaaagaa actgtacctg aagaaagact tgttcatagt 5940 tttcattctt attttcttag acctggagat tctaaaaaac caattatata tgatgtagaa 6000 actcttagag atggaaattc atttagtgca agaagagttg cagctattca aaatggtaaa 6060 cctatatttt acatgacagc ttcttttcaa gcaccagaag ctggatttga acatcagaaa 6120

actatgcctt cagcacctgc tccagatgga ttaccatcag aaacacaaat agcacagagt 6180 ttagctcatt tacttcctcc agtacttaaa gataaattta tttgtgatag acctttagaa 6240 gttagaccag ttgaatttca taatcctctt aaaggacatg tagcagaacc acatagacaa 6300 gtttggataa gagctaatgg aagtgtacca gatgatctta gagttcatca gtatcttctt 6360 ggttatgcat ctgatttaaa ttttcttcct gtagctttac aaccacatgg aataggtttt 6420 cttgaacctg gaatacagat agcaactata gatcattcaa tgtggtttca tagaccattt 6480 aatcttaatg aatggcttct ttatagtgta gaatctacat cagcaagttc tgctagagga 6540 tttgttaggg gtgaatttta tactcaagat ggagtacttg ttgctagtac agtacaggaa 6600 ggtgttatga gaaatcataa ttaa 6624



User Contributions:

Comment about this patent or add new information about this topic:

CAPTCHA
New patent applications in this class:
DateTitle
2022-09-22Electronic device
2022-09-22Front-facing proximity detection using capacitive sensor
2022-09-22Touch-control panel and touch-control display apparatus
2022-09-22Sensing circuit with signal compensation
2022-09-22Reduced-size interfaces for managing alerts
New patent applications from these inventors:
DateTitle
2021-12-02A cell surface tag exchange (cste) system for tracing and manipulation of cells during recombinase mediated cassette exchange integration of nucleic acid sequences to engineered receiver cells
2021-11-04Fermentative production of -ketoadipate from gaseous substrates
2016-12-29Recombinant microorganisms and uses therefor
2016-06-09Recombinant microorganisms exhibiting increased flux through a fermentation pathway
Website © 2025 Advameg, Inc.