Patent application title: RECOMBINANT MICROORGANISM FOR IMPROVED PRODUCTION OF ALANINE
Inventors:
Jennifer Jaitzig (New York, NY, US)
Mukesh Kumar (Tarrytown, NY, US)
Matthew David Blankschien (Tarrytown, NY, US)
Shakir Ratani (Elmsford, NY, US)
Qingzhao Wang (Ardsley, NY, US)
IPC8 Class: AC12P1306FI
USPC Class:
1 1
Class name:
Publication date: 2021-02-04
Patent application number: 20210032667
Abstract:
The present invention relates to a recombinant nucleic acid molecule, a
recombinant microorganism, to a method for producing alanine and to the
use of the recombinant nucleic acid molecule or the recombinant
microorganism for the fermentative production of alanine.Claims:
1. A recombinant microorganism comprising a reduced, repressed or deleted
activity and/or expression of an asd gene or a gdhA gene.
2. The recombinant microorganism of claim 1 further comprising at least one of the features selected from the group of (a) an introduced, increased or enhanced activity and/or expression of an alaD gene encoding an alanine dehydrogenase, (b) a reduced, repressed or deleted activity and/or expression of a pflB gene encoding a pyruvate formate lyase I, (c) a reduced, repressed or deleted activity and/or expression of an adhE gene encoding a bifunctional acetaldehyde-CoA dehydrogenase/iron-dependent alcohol dehydrogenase/pyruvate-formate lyase deactivase, (d) a reduced, repressed or deleted activity and/or expression of a ldhA gene encoding a NAO-dependent fermentative D-lactate dehydrogenase, (e) a reduced, repressed or deleted activity and/or expression of a pta gene encoding a phosphate acetyltransferase and/or a reduced, repressed or deleted activity and/or expression of an ackA gene encoding an acetate kinase A and propionate kinase 2, (f) a reduced, repressed or deleted activity and/or expression of a frdA gene encoding a fumarate reductase, and/or (g) a reduced, repressed or deleted activity and/or expression of a dadX gene encoding an alanine racemase, wherein the reduction, repression, deletion, introduction, increase or enhancement of the activity and/or expression of a gene is determined compared to a respective reference microorganism.
3-8. (canceled)
9. The recombinant microorganism of claim 1, further comprising at least one feature selected from the group of: (a) a reduced, repressed or deleted activity and/or expression of a brnQ gene encoding a brnQ protein having a branched chain amino acid transporter activity, (b) a reduced, repressed or deleted activity and/or expression of a gcvB gene encoding a non-protein encoding RNA, (c) an increased and/or enhanced activity and/or expression of a zipA gene encoding cell division protein involved in Z ring assembly, (d) an increased and/or enhanced activity and/or expression of a lpd gene encoding a lipoamide dehydrogenase, a changed activity of a lpxD gene encoding encoding an UDP-3-O-(3-hydroxymyristoyl)glucosamine N-acyltransferase protein, an increased and/or enhanced activity and/or expression of a gcvA gene encoding a DNA-binding protein, and/or an increased and/or enhanced activity and/or expression of a ygaW gene encoding an alanine transporter.
10-15. (canceled)
16. The recombinant microorganism of claim 1, wherein the asd gene or the gdhA gene is selected from the group of (i) a nucleic acid molecule comprising a sequence of SEQ ID NO: 25 or SEQ ID NO: 37, or (ii) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 25 or SEQ ID NO: 37, or (iii) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 25 or SEQ ID NO: 37 under stringent conditions, or (iv) a nucleic acid molecule encoding the polypeptide of SEQ ID NO: 26 or SEQ ID NO: 38, or (v) a nucleic acid molecule encoding a polypeptide having at least 60% homology to the polypeptide of SEQ ID NO: 26 or SEQ ID NO: 38.
17. The recombinant microorganism of claim 2, wherein the alaD gene is selected from the group consisting of (AA) a nucleic acid molecule comprising a sequence of SEQ ID NO: 1, or (BB) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 1, or (CC) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 1 under stringent conditions, or (DD) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 2, or (EE) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 2.
18. The recombinant microorganism of claim 2, wherein the pflB gene is selected from the group consisting of (A) a nucleic acid molecule comprising a sequence of SEQ ID NO: 5, or (B) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 5, or (C) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 5 under stringent conditions, or (D) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 6, or (E) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 6.
19. The recombinant microorganism of claim 2, wherein the adhE gene is selected from the group consisting of (F) a nucleic acid molecule comprising a sequence of SEQ ID NO: 7, or (G) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 7, or (H) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 7 under stringent conditions, or (I) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 8, or (J) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 8.
20. The recombinant microorganism of claim 2, wherein the ldhA gene is selected from the group consisting of (K) a nucleic acid molecule comprising a sequence of SEQ ID NO: 9, or (L) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 9, or (M) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 9 under stringent conditions, or (N) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 10, or (O) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 10.
21. The recombinant microorganism of claim 2, wherein the pta gene is selected from the group consisting of (P1) a nucleic acid molecule comprising a sequence of SEQ ID NO: 11, or (Q1) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 11, or (R1) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 11 under stringent conditions, or (S1) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 12, or (T1) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 12 and wherein the ackA gene is selected from the group consisting of (P2) a nucleic acid molecule comprising a sequence of SEQ ID NO: 32, or (Q2) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 32, or (R2) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 32 under stringent conditions, or (S2) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 33, or (T2) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 33.
22. The recombinant microorganism of claim 2, wherein the frdA gene is selected from the group consisting of (U) a nucleic acid molecule comprising a sequence of SEQ ID NO: 13, or (V) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 13, or (W) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 13 under stringent conditions, or (X) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 14, or (Y) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 14.
23. The recombinant microorganism of claim 2, wherein the dadX gene is selected from the group consisting of (Z) a nucleic acid molecule comprising a sequence of SEQ ID NO: 15, or (FF) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 15, or (GG) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 15 under stringent conditions, or (HH) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 16, or (II) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 16.
24. The recombinant microorganism of claim 9, wherein the ygaW gene is selected from the group consisting of (JJ) a nucleic acid molecule comprising a sequence of SEQ ID NO: 17, or (KK) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 17, or (LL) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 17 under stringent conditions, or (MM) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 18, or (NN) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 18.
25. The recombinant microorganism of claim 9, wherein the zipA gene is selected from the group consisting of (OO) a nucleic acid molecule comprising a sequence of SEQ ID NO: 19, or (PP) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 19, or (QQ) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 19 under stringent conditions, or (RR) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 20, or (SS) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 20.
26. The recombinant microorganism of claim 9, wherein the lpd gene is selected from the group consisting of (TT) a nucleic acid molecule comprising a sequence of SEQ ID NO: 21, or (UU) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 21, or (VV) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 21 under stringent conditions, or (WW) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 22, or (XX) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 22.
27. The recombinant microorganism of claim 9, wherein the brnQ gene is selected from the group consisting of (YY) a nucleic acid molecule comprising a sequence of SEQ ID NO: 23, or (ZZ) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 23, or (AAA) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 23 under stringent conditions, or (BBB) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 24, or (CCC) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 24.
28. The recombinant microorganism of claim 9, wherein the lpxD gene is selected from the group consisting of (DDD) a nucleic acid molecule comprising a sequence of SEQ ID NO: 27, or (EEE) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 28.
29. The recombinant microorganism of claim 9, wherein the gcvA gene is selected from the group consisting of (FFF) a nucleic acid molecule comprising a sequence of SEQ ID NO: 29, or (GGG) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 29, or (HHH) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 29 under stringent conditions, or (III) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 30, or (JJJ) a nucleic acid molecule encoding a polypeptide having at least 60% homology to a polypeptide of SEQ ID NO: 30.
30. The recombinant microorganism of claim 9, wherein the gcvB gene is selected from the group consisting of (KKK) a nucleic acid molecule comprising a sequence of SEQ ID NO: 31, or (LLL) a nucleic acid molecule having at least 80% identity to a nucleic acid molecule of SEQ ID NO: 31, or (MMM) a nucleic acid molecule hybridizing to the complement of a nucleic acid molecule having SEQ ID NO: 31 under stringent conditions.
31. The recombinant microorganism of claim 1, wherein the microorganism is selected from a genus of the group consisting of Corynebacterium, Bacillus, Erwinia, Escherichia, Pantoea, Streptomyces, Zymomonas, Rhodococcus, Saccharomyces, Candida or Pichia.
32-36. (canceled)
37. A method of producing alanine comprising culturing one or more recombinant microorganism according to claim 1 under conditions that allow for the production of alanine
38-43. (canceled)
44. The method of claim 37, comprising I) growing the one or more microorganism in a fermenter to obtain a fermentation broth, and II) recovering alanine from the fermentation broth.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a continuation of U.S. application Ser. No. 15/735,875, filed Dec. 12, 2017, allowed, which is a national stage application (under 35 U.S.C. .sctn. 371) of PCT/EP2016/063172, filed Jun. 9, 2016, which claims benefit of U.S. Application Nos. 62/174,529, filed Jun. 12, 2015, and 62/180,081, filed Jun. 16, 2015, all of which are incorporated herein by reference in their entirety.
SEQUENCE LISTING
[0002] The Sequence Listing associated with this application is filed in electronic format via EFS-Web and hereby incorporated by reference into the specification in its entirety. The name of the text file containing the Sequence Listing is 074008_1829_01_SL. The size of the text file is 111,568 bytes and the text file was created on Sep. 14, 2020.
FIELD OF THE INVENTION
[0003] The present invention relates to a recombinant nucleic acid molecule, a recombinant microorganism, to a method for producing alanine and to the use of the recombinant nucleic acid molecule or the recombinant microorganism for the fermentative production of alanine.
DESCRIPTION OF THE INVENTION
[0004] Amino acids are organic compounds with a carboxy-group and an amino-group. The most important amino acids are the alpha-amino acids where the amino group is located next to the carboxy-group. Proteins are based on alpha-amino acids.
[0005] Alanine has drawn considerable interest because it has been used as an additive in the food, feed and pharmaceutical industries. Moreover alanine is a raw material for the industrial production of alanine, N,N-bis(carboxymethyl)-, trisodium salt (MGDA, trade name Trilon M) which is a strong chelating agent, showing an excellent performance at dissolving organic and inorganic scale (WO94/29421, WO2012/150155). Trilon M grades are readily biodegradable according to standard OECD tests. Due to the superb ecological and toxicological profile, Trilon M grades are particularly suitable for use in products for end-consumers and the demand for such biodegradable complex builders is constantly rising.
[0006] Alanine can be produced by fermentation with Coryneform bacteria (Hermann, 2003: Industrial production of amino acids by Coryneform bacteria, J. of Biotechnol, 104, 155-172.) or E. coli. (WO2007/120198, WO2008/119009).
[0007] Alanine production in E. coli is more efficient and widely used for industrial production of alanine as raw material for the chemical industry. As the demand of the chemical industry for alanine is increasing, there is a demand for improvement of productivity of fermentative production of alanine.
[0008] It is one object of the present invention to provide microorganisms which can be used in fermentative production of alanine with high yield and efficiency.
DETAILED DESCRIPTION OF THE INVENTION
[0009] A contribution to achieving the above mentioned aim is provided by a recombinant microorganism of the family of Escherichia coli (E. coli) having, compared to a respective reference microorganism, an introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase in said microorganism compared to a respective control microorganism not comprising said introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene.
[0010] Accordingly, one embodiment of the invention at hand is a recombinant microorganism comprising compared to a respective reference microorganism an introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase and having compared to a respective reference microorganism a higher yield and/or productivity of alanine in fermentative production.
[0011] The term "higher", "increase" or "enhanced" e.g. in reference to expression and/or activity of an enzyme or to yield or productivity, means a significantly higher, increased or enhanced expression and/or activity or yield or productivity as compared to a reference or control microorganism.
[0012] The term "reduced, repressed or deleted expression and/or activity of an enzyme", means a significantly reduced, repressed or deleted expression and/or activity and also encompasses an undetectable expression and/or activity of the respective enzymes in a reference or control microorganism.
[0013] The term "reference microorganism" as used herein means a control microorganism to which the recombinant microorganism is compared. This reference microorganism has substantially the same genotype as the recombinant microorganism with the exception of the difference to be analyzed. Preferably the reference microorganism is the strain from which the recombinant microorganism is originated. For example, a gene has been introduced into a wild type microorganism, thus creating a recombinant microorganism, in this case the wild type would be a suitable reference microorganism for this recombinant microorganism. It is also possible, that into a recombinant microorganism A a further mutation is introduced, thereby creating a recombinant microorganism B. The recombinant microorganism A would then be the suitable reference microorganism for recombinant microorganism B. In the event, the performance of a recombinant microorganism and the respective reference microorganism shall be compared both microorganisms are grown under substantially identical conditions.
[0014] It is obvious for the skilled person that a microorganism having an increased yield and/or productivity of alanine can also be used for the production of other metabolites that are closely related to alanine, for example metabolites that are intermediates in the alanine pathway, that share common intermediates with the alanine pathway or that are metabolites which use alanine as intermediate in their pathway. The microorganisms of the invention can also be easily adapted for having an increased yield and/or productivity of such related metabolites by increasing or introducing certain enzyme activities or by knocking out or decreasing certain enzyme activities. Such metabolites are for example pyruvate, succinate, aspartate, malate, lactate, valine and leucine.
[0015] For example, in order to use the recombinant microorganism of the invention to produce succinate, the genes ldh, pfl, pta and adhE have to be knocked out and a PEP carboxylase gene and/or a pyruvate carboxylase gene have to be introduced in the genome of the microorganism of the invention. The respective pathway and necessary mutations are described for example in Zhang et al. (2009), PNAS (106) pp 20180-20185.
[0016] Accordingly, another embodiment of the invention at hand is a recombinant microorganism comprising compared to a respective reference microorganism an introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase and having compared to a respective reference microorganism a higher yield and/or productivity of pyruvate, succinate, aspartate, malate, lactate, valine and/or leucine in fermentative production.
[0017] Furthermore another embodiment of the invention at hand is a recombinant microorganism comprising compared to a respective reference microorganism an introduced, increased or enhanced activity and/or expression of an asd and a gdhA gene and having compared to a respective reference microorganism a higher yield and/or productivity of alanine in fermentative production.
[0018] In some embodiments, the microorganism is a prokaryotic cell. Suitable prokaryotic cells include Gram-positive, Gram negative and Gram-variable bacterial cells, preferably Gram-negative.
[0019] Thus, microorganisms that can be used in the present invention include, but are not limited to, Gluconobacter oxydans, Gluconobacter asaii, Achromobacter delmarvae, Achromobacter viscosus, Achromobacter lacticum, Agrobacterium tumefaciens, Agrobacterium radiobacter, Alcaligenes faecalis, Arthrobacter citreus, Arthrobacter tumescens, Arthrobacter paraffineus, Arthrobacter hydrocarboglutamicus, Arthrobacter oxydans, Aureobacterium saperdae, Azotobacter indicus, Brevibacterium ammoniagenes, Brevibacterium divaricatum, Brevibacterium lactofermentum, Brevibacterium flavum, Brevibacterium globosum, Brevibacterium fuscum, Brevibacterium ketoglutamicum, Brevibacterium helcolum, Brevibacterium pusillum, Brevibacterium testaceum, Brevibacterium roseurn, Brevibacterium immariophilium, Brevibacterium linens, Brevibacterium protopharmiae, Corynebacterium acetophilum, Corynebacterium glutamicum, Corynebacterium callunae, Corynebacterium acetoacidophilum, Corynebacterium acetoglutamicum, Enterobacter aerogenes, Erwinia amylovora, Erwinia carotovora, Erwinia herbicola, Erwinia chrysanthemi, Flavobacterium peregrinum, Flavobacterium fucatum, Flavobacterium aurantinum, Flavobacterium rhenanum, Flavobacterium sewanense, Flavobacterium breve, Flavobacterium meningosepticum, Micrococcus sp. CCM825, Morganella morganii, Nocardia opaca, Nocardia rugosa, Planococcus eucinatus, Proteus rettgeri, Propionibacterium shermanii, Pseudomonas synxantha, Pseudomonas azotoformans, Pseudomonas fluorescens, Pseudomonas ovalis, Pseudomonas stutzeri, Pseudomonas acidovolans, Pseudomonas mucidolens, Pseudomonas testosterone, Pseudomonas aeruginosa, Rhodococcus erythropolis, Rhodococcus rhodochrous, Rhodococcus sp. ATCC 15592, Rhodococcus sp. ATCC 19070, Sporosarcina ureae, Staphylococcus aureus, Vibrio metschnikovii, Vibrio tyrogenes, Actinomadura madurae, Actinomyces violaceochromogenes, Kitasatosporia parulosa, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces flavelus, Streptomyces griseolus, Streptomyces lividans, Streptomyces olivaceus, Streptomyces tanashiensis, Streptomyces virginiae, Streptomyces antibioticus, Streptomyces cacaoi, Streptomyces lavendulae, Streptomyces viridochromogenes, Aeromonas salmonicida, Bacillus pumilus, Bacillus circulans, Bacillus thiaminolyticus, Escherichia freundii, Microbacterium ammoniaphilum, Serratia marcescens, Salmonella typhimurium, Salmonella schottmulleri, Xanthomonas citri and so forth.
[0020] In some embodiments, the microorganism is a eukaryotic cell. Suitable eukaryotic cells include yeast cells, as for example Saccharomyces spec, such as Saccharomyces cerevisiae, Hansenula spec, such as Hansenula polymorpha, Schizosaccharomyces spec, such as Schizosaccharomyces pombe, Kluyveromyces spec, such as Kluyveromyces lactis and Kluyveromyces marxianus, Yarrowia spec, such as Yarrowia lipolytica, Pichia spec, such as Pichia methanolica, Pichia stipites and Pichia pastoris, Zygosaccharomyces spec, such as Zygosaccharomyces rouxii and Zygosaccharomyces bailiff, Candida spec, such as Candida boidinii, Candida utilis, Candida freyschussii, Candida glabrata and Candida sonorensis, Schwanniomyces spec, such as Schwanniomyces occidentalis, Arxula spec, such as Arxula adeninivorans, Ogataea spec such as Ogataea minuta, Klebsiella spec, such as Klebsiella pneumonia.
[0021] Numerous bacterial industrial strains are especially suitable for use in the methods disclosed herein. In some embodiments, the microorganism is a species of the genus Corynebacterium, e.g. C. acetophilum, C. glutamicum, C. callunae, C. acetoacidophilum, C. acetoglutamicum. In some embodiments, the microorganism is a species of the genus Bacillus, e.g., B. thuringiensis, B. anthracis, B. megaterium, B. subtilis, B. lentils, B. circulans, B. pumilus, B. lautus, B. coagulans, B. brevis, B. firmus, B. alkaophius, B. licheniformis, B. clausii, B. stearothermophilus, B. halodurans, B. subtilis, B. pumilus, and B. amyloliquefaciens. In some embodiments, the microorganism is a species of the genus Erwinia, e.g., E. uredovora, E. carotovora, E. ananas, E. herbicola, E. punctata and E. terreus. In some embodiments, the microorganism is a species of the genus Escherichia, e.g., E. coli. In other embodiments the microorganism is a species of the genus Pantoea, e.g., P. citrea or P. agglomerans. In still other embodiments, the microorganism is a species of the genus Streptomyces, e.g., S. ambofaciens, S. achromogenes, S. avermitilis, S. coelicolor, S. aureofaciens, S. aureus, S. fungicidicus, S. griseus or S. lividans. In further embodiments, the microorganism is a species of the genus Zymomonas, e.g., Z. mobilis or Z. lipolytica. In further embodiments, the microorganism is a species of the genus Rhodococcus, e.g. R. opacus.
[0022] Preferably the microorganism is selected from the family of Enterobacteriaceae, preferably of the genus Escherichia, for example Escherichia coli (E. coli), preferably the strain E. coli W, which corresponds to DSMZ 1116, which corresponds to ATCC9637.
[0023] In addition to the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase, the recombinant microorganism of the invention may further comprise (a) a reduced, repressed or deleted activity and/or expression of a pflB gene encoding a pyruvate formate lyase I, wherein the reduction, repression or deletion of the activity and/or expression of the pflB gene is determined compared to a respective reference microorganism.
[0024] In addition to the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase, the recombinant microorganism of the invention may further comprise (b) a reduced, repressed or deleted activity and/or expression of a adhE gene encoding a bifunctional acetaldehyde-CoA dehydrogenase/iron-dependent alcohol dehydrogenase/pyruvate-formate lyase deactivase), wherein the reduction, repression or deletion of the activity and/or expression of the adhE gene is determined compared to a respective reference microorganism.
[0025] In addition to the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase, the recombinant microorganism of the invention may further comprise (c) a reduced, repressed or deleted activity and/or expression of a IdhA gene encoding a NAD-dependent fermentative D-lactate dehydrogenase, wherein the reduction, repression or deletion of the activity and/or expression of the IdhA gene is determined compared to a respective reference microorganism.
[0026] In addition to the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase, the recombinant microorganism of the invention may further comprise (d) a reduced, repressed or deleted activity and/or expression of a pta gene encoding a phosphate acetyltransferase and/or a reduced, repressed or deleted activity and/or expression of an ackA gene encoding an acetate kinase A and propionate kinase 2, wherein the reduction, repression or deletion of the activity and/or expression of the pta gene and/or the ackA gene is determined compared to a respective reference microorganism.
[0027] In addition to the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase, the recombinant microorganism of the invention may further comprise (e) a reduced, repressed or deleted activity and/or expression of a frdA gene encoding a fumarate reductase, wherein the reduction, repression or deletion of the activity and/or expression of the frdA gene is determined compared to a respective reference microorganism.
[0028] In addition to the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase, the recombinant microorganism of the invention may further comprise (f) an introduced, increased or enhanced activity and/or expression of an alaD gene encoding an alanine dehydrogenase, wherein the increase or enhancement of the activity and/or expression of the alaD gene is determined compared to a respective reference microorganism.
[0029] In addition to the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase, the recombinant microorganism of the invention may further comprise (g) a reduced, repressed or deleted activity and/or expression of a dadX gene encoding a alanine racemase, wherein the reduction, repression or deletion of the activity and/or expression of the dadX gene is determined compared to a respective reference microorganism.
[0030] Preferably, the recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase is additionally having at least two, more preferably at least three, even more preferably at least four, even more preferably at least five, most preferably all of the features selected from the group of
[0031] (a) a reduced, repressed or deleted activity and/or expression of a pflB gene encoding a pyruvate formate lyase I and
[0032] (b) a reduced, repressed or deleted activity and/or expression of a adhE gene encoding a bifunctional acetaldehyde-CoA dehydrogenase/iron-dependent alcohol dehydrogenase/pyruvate-formate lyase deactivase) and
[0033] (c) a reduced, repressed or deleted activity and/or expression of a IdhA gene encoding a NAD-dependent fermentative D-lactate dehydrogenase and
[0034] (d) a reduced, repressed or deleted activity and/or expression of a pta gene encoding a phosphate acetyltransferase and/or a reduced, repressed or deleted activity and/or expression of an ackA gene encoding an acetate kinase A and propionate kinase 2 and
[0035] (e) a reduced, repressed or deleted activity and/or expression of a frdA gene encoding a fumarate reductase and
[0036] (f) an introduced, increased or enhanced activity and/or expression of an alaD gene encoding an alanine dehydrogenase,
[0037] (g) a reduced, repressed or deleted activity and/or expression of a dadX gene encoding an alanine racemase wherein the reduction, repression, deletion, introduction, increase or enhancement of the activity and/or expression of a gene is determined compared to a respective reference microorganism.
[0038] The alaD gene may be derived from any organism or may be a synthetic gene designed by man, for example having codon usage optimized for expression in the recombinant microorganism of the invention or being optimized for enzyme activity, e.g. having improved Vmax or Km. Preferably the alaD gene is derived from a microorganism of one of the the geni Bacillus, Geobacillus, Paenibacillus, Halobacillus, Brevibacillus. In a more preferred embodiment the alaD gene is derived from a microorganism of the genus Geobacillus. In a most preferred embodiment, the alaD gene is derived from Geobacillus stearothermophilus.
[0039] In a preferred embodiment the alaD gene has been codon optimized for the expression in the recombinant microorganism of the invention.
[0040] The microorganism of the invention may comprise further genetic modifications, such as mutations, knock-outs or enhanced or introduced enzyme activities that further improve yield and/or productivity of alanine, pyruvate, succinate, aspartate, malate, lactate, valine and/or leucine, preferably succinate or alanine, more preferably alanine. For example, the microorganism of the invention may further comprise an enhanced or increased expression and/or activity of the ygaW gene from E. coli or homologs or functional equivalents thereof which has recently been described to improve alanine productivity of a microorganism when overexpressed (WO2012/172822).
[0041] In a further example, the microorganism of the invention may in addition comprise any one of, any combination of or all of the genes that are specified and described in detail in the applications PCT/IB2014/064426 and PCT/IB2014/066686 and that are beneficial for production of alanine, pyruvate, succinate, aspartate, malate, lactate, valine and/or leucine, preferably succinate or alanine, more preferably alanine.
[0042] In one embodiment the asd gene encoding an aspartate-beta-semialdehyde dehydrogenase, is selected from the group of
[0043] (i) a nucleic acid molecule comprising a sequence of SEQ ID NO: 25, or
[0044] (ii) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 25, or
[0045] (iii) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 25 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0046] (iv) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 26, or
[0047] (v) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 26, wherein the polypeptide encoded by (ii), (iii) or (v) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 26.
[0048] In one embodiment the gdhA gene encoding a glutamate dehydrogenase, is selected from the group of
[0049] (i) a nucleic acid molecule comprising a sequence of SEQ ID NO: 37, or
[0050] (ii) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 37, or
[0051] (iii) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 37 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0052] (iv) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 38, or
[0053] (v) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 38, wherein the polypeptide encoded by (ii), (iii) or (v) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 38.
[0054] The recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase may further comprise any one, two, three, four, five, six or all of the features as defined above under (a) to (g),
wherein the pflB gene is selected from the group consisting of
[0055] (A) a nucleic acid molecule comprising a sequence of SEQ ID NO: 5, or
[0056] (B) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 5, or
[0057] (C) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 5 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0058] (D) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 6, or
[0059] (E) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 6, wherein the polypeptide encoded by (B), (C) or (E) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 6 and wherein the adhE gene is selected from the group consisting of
[0060] (F) a nucleic acid molecule comprising a sequence of SEQ ID NO: 7, or
[0061] (G) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 7, or
[0062] (H) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 7 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0063] (I) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 8, or
[0064] (J) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 8, wherein the polypeptide encoded by (G), (H) or (J) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 8 and wherein the IdhA gene is selected from the group consisting of
[0065] (K) a nucleic acid molecule comprising a sequence of SEQ ID NO: 9, or
[0066] (L) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 9, or
[0067] (M) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 9 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0068] (N) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 10, or
[0069] (O) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 10, wherein the polypeptide encoded by (L), (M) or (0) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 10 and wherein the pta gene is selected from the group consisting of
[0070] (P1) a nucleic acid molecule comprising a sequence of SEQ ID NO: 11, or
[0071] (Q1) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 11, or
[0072] (R1) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 11 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0073] (S1) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 12, or
[0074] (T1) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 12, wherein the polypeptide encoded by (Q1), (R1) or (T1) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 12 and wherein the ackA gene is selected from the group consisting of
[0075] (P2) a nucleic acid molecule comprising a sequence of SEQ ID NO: 32, or
[0076] (Q2) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 32, or
[0077] (R2) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 32 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0078] (S2) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 33, or
[0079] (T2) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 33, wherein the polypeptide encoded by (Q2), (R2) or (T2) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 33 and wherein the frdA gene is selected from the group consisting of
[0080] (U) a nucleic acid molecule comprising a sequence of SEQ ID NO: 13, or
[0081] (V) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 13, or
[0082] (W) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 13 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0083] (X) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 14, or
[0084] (Y) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 14, wherein the polypeptide encoded by (V), (W) or (Y) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 14 and wherein the alaD gene is selected from the group consisting of
[0085] (Z) a nucleic acid molecule comprising a sequence of SEQ ID NO: 1, or
[0086] (AA) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 1, or
[0087] (BB) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 1 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0088] (CC) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 2, or
[0089] (DD) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 2, wherein the polypeptide encoded by (AA), (BB) or (DD) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 2 and wherein the dadX gene is selected from the group consisting of
[0090] (EE) a nucleic acid molecule comprising a sequence of SEQ ID NO: 15, or
[0091] (FF) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 15, or
[0092] (GG) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 15 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0093] (HH) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 16, or
[0094] (II) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 16, wherein the polypeptide encoded by (FF), (GG) or (II) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 16.
[0095] In a further example, the microorganism of the invention may in addition comprise any one of, any combination of or all of the genes that are specified and described in detail in the applications WO2015/044818 and PCT/IB2014/066686 and that are beneficial for production of alanine.
[0096] The recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase may further comprise (h) a reduced, repressed or deleted activity and/or expression of a brnQ gene encoding a brnQ protein having a branched chain amino acid transporter activity, wherein the reduction, repression or deletion of the activity and/or expression of the brnQ gene is determined compared to a respective reference microorganism.
[0097] The recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase may further comprise (i) a reduced, repressed or deleted activity and/or expression of a gcvB gene encoding a non-protein encoding RNA, wherein the reduction, repression or deletion of the activity and/or expression of the gcvB gene is determined compared to a respective reference microorganism.
[0098] The recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase may further comprise (j) an increased or enhanced activity and/or expression of a zipA gene encoding a cell division protein involved in Z ring assembly, wherein the an increased or enhanced activity and/or expression of the zipA gene is determined compared to a respective reference microorganism.
[0099] The recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase may further comprise (k) an increased or enhanced activity and/or expression of a lpd gene encoding a lipoamide dehydrogenase, wherein the an increased or enhanced activity and/or expression of the lpd gene is determined compared to a respective reference microorganism.
[0100] The recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase may further comprise (I) a mutated IpxD gene encoding an UDP-3-O-(3-hydroxymyristoyl)-glucosamine N-acyltransferase protein, wherein IpxD gene is mutated compared to a respective reference microorganism.
[0101] The recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase may further comprise (m) an increased or enhanced activity and/or expression of a gcvA gene encoding a DNA-binding protein, wherein the an increased or enhanced activity and/or expression of the gcvA gene is determined compared to a respective reference microorganism.
[0102] The recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase may further comprise (n) an increased or enhanced activity and/or expression of an ygaW gene encoding an alanine transporter, wherein the an increased or enhanced activity and/or expression of the ygaW gene is determined compared to a respective reference microorganism.
[0103] Preferably, the recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase having the features as defined above under a) to g) is additionally having at least one, preferably at least two, more preferably at least three, more preferably at least four, more preferably at least five, even more preferably at least six, most preferably all of the features selected from the group of
[0104] (h) a reduced, repressed or deleted activity and/or expression of a brnQ gene encoding a brnQ protein having a branched chain amino acid transporter activity and
[0105] (i) a reduced, repressed or deleted activity and/or expression of a gcvB gene encoding a non-protein encoding RNA and
[0106] (j) an increased and/or enhanced activity and/or expression of a zipA gene encoding cell division protein involved in Z ring assembly and
[0107] (k) an increased and/or enhanced activity and/or expression of a lpd gene encoding a lipoamide dehydrogenase and
[0108] (l) a changed activity of a IpxD gene encoding encoding an UDP-3-O-(3-hydroxymyristoyI)-glucosamine N-acyltransferase protein and
[0109] (m) an increased and/or enhanced activity and/or expression of a gcvA gene encoding a DNA-binding protein and
[0110] (n) an increased and/or enhanced activity and/or expression of a ygaW gene encoding an alanine transporter, wherein the reduction, repression, deletion, increase, enhancement or change of the activity and/or expression of a gene is determined compared to a respective reference microorganism.
[0111] The recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase and comprising any one, two, three, four, five, six or all of the features as defined above under (a) to (g) and/or (A) to (II), may further comprise any one, two, three, four, five, six or all of the features as defined above under (h) to (n)
wherein the brnQ gene is selected from the group consisting of
[0112] (1) a nucleic acid molecule comprising a sequence of SEQ ID NO: 23, or
[0113] (2) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 23, or
[0114] (3) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 23 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0115] (4) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 24, or
[0116] (5) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 24, wherein the polypeptide encoded by (2), (3) or (5) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 23 and wherein the gcvB gene is selected from the group consisting of
[0117] (6) a nucleic acid molecule comprising a sequence of SEQ ID NO: 31, or
[0118] (7) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 31, or wherein the nucleic acid molecule under (7) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the nucleic acid molecule having SEQ ID NO: 31 and wherein the zipA gene is selected from the group consisting of
[0119] (8) a nucleic acid molecule comprising a sequence of SEQ ID NO: 19, or
[0120] (9) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 19, or
[0121] (10) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 19 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0122] (11) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 20, or
[0123] (12) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 20, wherein the polypeptide encoded by (9), (10) or (12) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 20 and wherein the lpd gene is selected from the group consisting of
[0124] (13) a nucleic acid molecule comprising a sequence of SEQ ID NO: 21, or
[0125] (14) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 21, or
[0126] (15) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 21 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0127] (16) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 22, or
[0128] (17) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 22, wherein the polypeptide encoded by (14), (15) or (17) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 22 and wherein the IpxD gene is selected from the group consisting of
[0129] (23) a nucleic acid molecule comprising a sequence of SEQ ID NO: 27, or
[0130] (24) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 27, or
[0131] (25) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 27 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0132] (26) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 28, or
[0133] (27) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 28, wherein the polypeptide encoded by (24), (25) or (27) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 28 and wherein the polypeptide encoded by (24) to (27) comprises at the position equivalent to position 15 of SEQ ID NO: 28 a threonine, and wherein the gcvA gene is selected from the group consisting of
[0134] (28) a nucleic acid molecule comprising a sequence of SEQ ID NO: 29, or
[0135] (29) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 29, or
[0136] (30) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 29 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0137] (31) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 30, or
[0138] (32) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 30, wherein the polypeptide encoded by (29), (30) or (32) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 30 and wherein the ygaW gene is selected from the group consisting of
[0139] (33) a nucleic acid molecule comprising a sequence of SEQ ID NO: 17, or
[0140] (34) a nucleic acid molecule having at least 80%, preferably at least 85% for example at least 90%, more preferably at least 95% for example at least 96%, even more preferably at least 97% for example at least 98%, most preferably at least 99% identity to a nucleic acid molecule of SEQ ID NO: 17, or
[0141] (35) a nucleic acid molecule hybridizing to a nucleic acid molecule having SEQ ID NO: 17 under medium stringent conditions, more preferably under high stringent conditions, most preferably under very high stringent conditions, or
[0142] (36) a nucleic acid molecule encoding a polypeptide of SEQ ID NO: 18, or
[0143] (37) a nucleic acid molecule encoding a polypeptide having at least 60% preferably at least 70% for example at least 75%, more preferably at least 80% for example at least 85%, even more preferably at least 90% for example at least 95%, most preferably at least 96%, at least 97%, at least 98% or at least 99% homology to a polypeptide of SEQ ID NO: 18, wherein the polypeptide encoded by (34), (35) or (37) is having at least 10%, 20% preferably at least 30% or 50%, more preferably at least 60% or 70%, even more preferably at least 75%, 80%, 85% or 90%, most preferred at least 95% of the activity as the polypeptide having SEQ ID NO: 18 and wherein the polypeptide encoded by (34) to (37) preferably comprises at the position equivalent to position 5 of SEQ ID NO: 18 preferably a histidine, asparagine, arginine or tyrosine.
[0144] Preferably the recombinant microorganism of the invention comprising the introduced, increased or enhanced activity and/or expression of an asd or a gdhA gene encoding an aspartate-beta-semialdehyde dehydrogenase or a glutamate dehydrogenase is comprising all of the features as defined above under (a) to (g) or (A) to (II) and (j), (k) and (n) or (13) to (17) and (33) to (37).
[0145] A further embodiment of the invention is a composition comprising one or more recombinant microorganisms of the invention as defined above. The composition may further comprise a medium that allows grow of the recombinant microorganism of the invention. The medium may additionally comprise a carbon source such as hexoses, pentoses or polyols for example sucrose, glucose, fructose, galactose, mannose, raffinose, xylose, arabinose, xylulose, glycerol, mannitol, arabitol, xylitol, starch, cellulose, lignocellulose or combinations thereof. Preferably the carbon source is glucose of sucrose, more preferably the carbon source is glucose.
[0146] In a preferred embodiment the composition comprises the microorganism of the invention and NBS medium, AM1 medium or PPM01 medium. More preferably the composition further comprises a carbon source, preferably a sugar. The ingredients of these media are known to a skilled person.
[0147] Preferably NBS medium comprises per liter
1-5 g, preferably 3.5 g KH.sub.2PO.sub.4 and 1-10 g, preferably 5.0 g K.sub.2HPO.sub.4 and 1-5 g, preferably 3.5 g (NH.sub.4).sub.2HPO.sub.4 and 0.1-1 g, preferably 0.25 g MgSO.sub.4-7 H.sub.2O and 5-25 mg, preferably 15 mg CaCL.sub.2-2 H.sub.2O and 0.1-1 mg, preferably 0.5 mg Thiamine and 0.1-5 ml, preferably 1 ml trace metal stock, wherein the trace metal stock comprises 0.5-5 g, preferably 1.6 g FeCL.sub.3-6 H.sub.2O; 0.05-0.5 g, preferably 0.2 g CoCl.sub.2-6 H.sub.2O; 0.01-0.5 g, preferably 0.1 g CuCl.sub.2-2 H.sub.2O; 0.1-0.5 g, preferably 0.2 g ZnCl.sub.2; 0.05-0.5 g, preferably 0.2 g NaMoO.sub.4-2 H.sub.2O; 0.001-0.1 g, preferably 0.05 g H.sub.3BO.sub.3 per liter 0.01-1 M, preferably 0.1 M HCL.
[0148] The preferred carbon source in the NBS medium is glucose or sucrose, preferably 2%18% glucose or 2%-16% sucrose.
[0149] Preferably AM 1 medium comprises per liter 0.1-10 mM, preferably 1 mM betain solution
1-10 g, preferably 2.6 g (NH4).sub.2HPO.sub.4 and 0.1-5 g, preferably 0.87 g NH.sub.4H.sub.2PO.sub.4 and 0.05-2.5 g, preferably 0.15 g KCl and 0.05-5 g, preferably 0.37 g MgSO.sub.4-7H.sub.2O and 0.1-5 ml, preferably 1 ml trace metal stock, wherein the trace metal stock comprises per liter 0.01-1 M, preferably 0.12 M HCL, 1-5 g, preferably 2.4 g FeCL.sub.3-6H.sub.2O; 0.1-1 g, preferably 0.3 g CoCl.sub.2-6H.sub.2O; 0.1-1 g, preferably 0.21 g CuCl.sub.2-2 H.sub.2O; 0.1-1 g, preferably 0.3 g ZnCl.sub.2; 0.1-1 g, preferably 0.27 g NaMoO4-2 H.sub.2O; 0.01-0.5 g, preferably 0.068 g H.sub.3BO.sub.3 and 0.1-1 g, preferably 0.5 g MnCl.sub.2-4 H.sub.2O, and optionally 1-30 g, preferably 15 g (NH.sub.4).sub.2SO.sub.4.
[0150] The preferred carbon source in the NBS medium is glucose or sucrose, preferably 2%18% glucose or 2%-16% sucrose.
[0151] Preferably PPM01 medium comprises per liter
0.05-5 g, preferably 0.37 g MgSO.sub.4-7 H.sub.2O and 0.1-10 g, preferably 1 g (NH.sub.4).sub.2SO.sub.4 and 0.05-5 g, preferably 0.46 g betaine and 0.001-0.5 g, preferably 0.05 g Cyanocobalamin (B12) and 1-10 g, preferably 3.74 g KH.sub.2PO.sub.4 and 0.1-5 ml, preferably 1 ml trace metal stock, wherein the trace metal stock comprises per liter 10-100 mM, preferably 60 mM sulfuric acid, 1-10 g, preferably 3.48 g (NH.sub.4).sub.2Fe(II)(SO.sub.4).sub.2-7 H.sub.2O; 0.1-1 g, preferably 0.35 g CoSO.sub.4-7 H.sub.2O; 0.1-1 g, preferably 0.31 g CuSO.sub.4-5 H.sub.2O; 0.1-5 g, preferably 0.63 g ZnSO.sub.4-7 H.sub.2O; 0.1-1 g, preferably 0.27 g MnSO.sub.4--H.sub.2O; 0.01-1 g, preferably 0.07 g NaMoO.sub.4-2 H.sub.2O and 0.1-5 g, preferably 0.43 g H.sub.3BO.sub.3.
[0152] The preferred carbon source in the PPM01 medium is glucose monohydrate, preferably 10-500 g, more preferably 140 g glucose monohydrate per liter medium.
[0153] A further embodiment of the invention is a method for producing a recombinant microorganism with enhanced alanine yield or productivity, which comprises the following steps:
[0154] (I) introducing, increasing or enhancing of one or more activity and/or expression of the asd gene or the gdhA gene or as defined above under (i) to (v) in a microorganism; and
[0155] (II) generating, identifying and isolating a recombinant microorganism with enhanced alanine yield or productivity compared to a corresponding microorganism without introduced, increased or enhanced activity and/or expression of the asd gene or the gdhA gene or as defined above under (i) to (v).
[0156] In a preferred embodiment of the method for producing a recombinant microorganism of the invention the method further comprises the step of reducing, repressing or deleting the activity and/or expression of at least one, at least two, at least three, at least four, at least five, at least six or all of the pflB gene, adhE gene, IdhA gene, pta gene, ackA gene, frdA gene or dadX gene for example as defined above under (A) to (Y) and (EE) to (II) and/or the step of introducing, increasing or enhancing activity and/or expression of an alaD gene, lpd gene, zipA gene and ygaW gene for example as defined above under (Z) to (DD), (8) to (17) and (33) to 37).
[0157] A more preferred method for producing a recombinant microorganism of the invention comprises the step of reducing, repressing or deleting the activity and/or expression of all of the pflB gene, adhE gene, IdhA gene, ackA gene and frdA gene and the step of introducing, increasing or enhancing activity and/or expression of an alaD gene, lpd gene, zipA gene and ygaW gene.
[0158] In one embodiment of the method for producing a recombinant microorganism of the invention the microorganism is selected from the group consisting of species of the genus Corynebacterium, e.g. C. acetophllum, C. glutamicum, C. callunae, C. acetoacidophllum, C. acetoglutamicum, species of the genus Bacillus, e.g., B. thuringiensis, B. anthracis, B. megaterium, B. subtilis, B. lentils, B. circulars, B. pumilus, B. lautus, B. coagulans, B. brevis, B. firmus, B. alkaophius, B. licheniformis, B. clausii, B. stearothermophilus, B. halodurans, B. subtilis, B. pumilus, and B. amyloliquefaciens, species of the genus Erwinia, e.g., E. uredovora, E. carotovora, E. ananas, E. herbicola, E. punctate, E. terreus, species of the genus Escherichia, e.g., E. coli, species of the genus Pantoea, e.g., P. citrea, P. agglomerans, species of the genus Streptomyces, e.g., S. ambofaciens, S. achromogenes, S. avermitllis, S. coelicolor, S. aureofaciens, S. aureus, S. fungicidicus, S. griseus, S. lividans, species of the genus Zymomonas, e.g., Z. mobilis or Z. lipolytica species of the genus Rhodococcus, e.g. R. opacus, species of the genus Saccharomyces spec, such as Saccharomyces cerevisiae, species of the genus Hansenula spec, such as Hansenula polymorpha, species of the genus Schizosaccharomyces spec, such as Schizosaccharomyces pombe, species of the genus Kluyveromyces spec, such as Kluyveromyces lactis and Kluyveromyces marxianus, species of the genus Yarrowia spec, such as Yarrowia lipolytica, species of the genus Pichia spec, such as Pichia methanolica, Pichia stipites and Pichia pastoris, species of the genus Zygosaccharomyces spec, such as Zygosaccharomyces rouxii and Zygosaccharomyces bailiff, species of the genus Candida spec, such as Candida boidinii, Candida utilis, Candida freyschussii, Candida glabrata and Candida sonorensis, species of the genus Schwanniomyces spec, such as Schwanniomyces occidentalis, species of the genus Arxula spec, such as Arxula adeninivorans, species of the genus Ogataea spec such as Ogataea minuta, species of the genus Klebsiella spec, such as Klebsiella pneumonia.
[0159] Preferably the microorganism is selected from the family of Enterobacteriaceae, preferably of the genus Escherichia, for example Escherichia coli (E. coli), preferably the strain E. coli W, which corresponds to DSMZ 1116, which corresponds to ATCC9637.
[0160] A further embodiment of the invention is a method of producing alanine, preferably L-alanine, comprising culturing one or more recombinant microorganism as defined above under conditions that allow for the production of alanine, preferably L-alanine.
[0161] In some embodiments, the recombinant microorganisms encompassed by the invention are grown under batch or continuous fermentations conditions. Classical batch fermentation is a closed system, wherein the compositions of the medium is set at the beginning of the fermentation and is not subject to artificial alterations during the fermentation. A variation of the batch system is a fed-batch fermentation. In this variation, the substrate is added in increments as the fermentation progresses. Fed-batch systems are useful when catabolite repression is likely to inhibit the metabolism of the cells and where it is desirable to have limited amounts of substrate in the medium. Batch and fed-batch fermentations are common and well known in the art. Continuous fermentation which also finds use in the present invention is a system where a defined fermentation medium is added continuously to a bioreactor and an equal amount of conditioned medium (e.g., containing the desired end-products) is removed simultaneously for processing. Continuous fermentation generally maintains the cultures at a constant high density where cells are primarily in the growth phase where production of end products is enhanced. Continuous fermentation systems strive to maintain steady state growth conditions. Methods for modulating nutrients and growth factors for continuous fermentation processes as well as techniques for maximizing the rate of product formation are well known in the art of industrial microbiology.
[0162] In some embodiments, fermentations are carried out in a temperature within the range of from about 10.degree. C. to about 60.degree. C., from about 15.degree. C. to about 50.degree. C., from about 20.degree. C. to about 45.degree. C., from about 25.degree. C. to about 45.degree. C., from about 30.degree. C. to about 45.degree. C. and from about 25.degree. C. to about 40.degree. C. In a preferred embodiment the temperature is about 34.degree. C., 35.degree. C. or 36.degree. C. In a most preferred embodiment the temperature is about 37.degree. C. or 38.degree. C.
[0163] In some other embodiments, the fermentation is carried out for a period of time within the range of from about 8 hours to 240 hours, from about 8 hours to about 168 hours, from about 10 hours to about 144 hours, from about 15 hours to about 120 hours, or from about 20 hours to about 72 hours. Preferably the fermentation is carried out from about 20 hours to about 40 hours.
[0164] In some other embodiments, the fermentation is carried out at a pH in the range of about 4 to about 9, in the range of about 4.5 to about 8.5, in the range of about 5 to about 8, or in the range of about 5.5 to about 7.5. Preferably the fermentation will be carried out at a pH of 7.
[0165] In one embodiment of the method of producing alanine, more preferably L-alanine, the microorganism is cultured in a medium comprising between 1% and 30% (w/v) of a sugar, between 5% and 25% (w/v) of a sugar, between 10% and 20% (w/v) of a sugar, between 12% and 18% (w/v) of a sugar. Preferably the microorganism is cultured in a medium comprising between 13% and 15% (w/v) of a sugar.
[0166] In another embodiment of the method for producing alanine, more preferably L-alanine the yield of alanine is at least 80% for example at least 81%, at least 82%, at least 83%, at least 84% or at least 85%. Preferably the yield is at least 86%, at least 87%, at least 88%, at least 89% or at least 90%. More preferably the yield is at least 90.5%, at least 91%, at least 91.5%, at least 92%, at least 92.5%, at least 93%, at least 93.5%, at least 94% or at least 94.5%. In an even more preferred embodiment the yield is at least 95% or at least 95.5%. In a most preferred embodiment, the yield is at least 96%. The percent yield is calculated as gram product produced from gram glucose in the medium. Hence, when the medium contained 100 g glucose and the fermentation yielded 98 g alanine, the yield would be 98%.
[0167] In another embodiment of the method for producing alanine preferably L-alanine is produced, wherein the chiral purity of L-alanine is at least 90%, at least 91%, at least 92%, at least 93% or at least 94%. In a preferred embodiment the chiral purity of L-alanine is at least 95% or at least 95.5%. In a more preferred embodiment, the chiral purity of L-alanine is at least 96% or at least 96.5% or at least 97%. In an even more preferred embodiment the chiral purity of L-alanine is at least 97.5%, at least 98% or at least 98.5% for example at least 99%. Even more preferably the chiral purity of L-alanine is at least 99.5% or at least 99.6% for example at least 99.7%, at least 99.8%, or at least 99.9%. In a most preferred embodiment chiral pure L-alanine is produced.
[0168] Another embodiment of the invention is a method of culturing or growing any of the genetically modified microorganisms as defined above, the method comprising inoculating a culture medium with one or more genetically modified microorganism and culturing or growing said genetically modified microorganism in culture medium under conditions as defined above.
[0169] The use of a recombinant microorganism as defined above or a composition as defined above for the fermentative production of alanine, preferably L-alanine is an additional embodiment of the invention.
[0170] The recombinant microorganism according to the present invention is characterized in that, compared to a respective reference microorganism for example a wild type or an alanine high production strain, the expression and/or the activity of the enzyme that is encoded by the asd gene or the gdhA gene is increased or enhanced.
[0171] Furthermore the recombinant microorganism according to the present invention is characterized in that, compared to a respective reference microorganism for example a wild type or an alanine high production strain the expression and/or the activity of both enzymes encoded by the asd gene and the gdhA gene are increased or enhanced.
[0172] In one embodiment the decrease of the expression and/or activity of a genes is achieved by a deactivation, mutation or knock-out of the gene. This could be done by deletion of part or total of the coding region and/or the promoter of the gene, by mutation of the gene such as insertion or deletion of a number of nucleotides for example one or two nucleotides leading to a frameshift in the coding region of the gene, introduction of stop codons in the coding region, inactivation of the promoter of the gene by for example deleting or mutating promoter boxes such as ribosomal entry sides, the TATA box and the like. The decrease may also be achieved by degrading the transcript of the gene for example by means of introduction of ribozymes, dsRNA, antisense RNA or antisense oligonucleotides. The decrease of the activity of a gene may be achieved by expressing antibodies or aptamers in the cell specifically binding the target enzyme. Other methods for the decrease of the expression and/or activity of a gene are known to a skilled person.
[0173] The reduced expression and/or activity of the enzymes disclosed herein can be a reduction of the expression and/or enzymatic activity by at least 50%, compared to the expression and/or activity of said enzyme in a respective reference microorganism for example the wild type of the microorganism, or a reduction of the expression and/or enzymatic activity by at least 90%, or more preferably a reduction of expression and/or the enzymatic activity by at least 95%, or more preferably an expression and/or reduction of the enzymatic activity by at least 98%, or even more preferably a reduction of the expression and/or enzymatic activity by at least 99% or even more preferably a reduction of the expression and/or the enzymatic activity by at least 99.9%. In a most preferred embodiment the expression and/or activity of the enzymes is not detectable in the microorganism of the invention.
[0174] The enhanced or increased expression and/or activity of the enzymes disclosed herein can be an increase of the expression and/or enzymatic activity by at least 25%, compared to the expression and/or activity of said enzyme in a respective reference microorganism for example the wild type of the microorganism, or an increase of the expression and/or enzymatic activity by at least 50%, or more preferably an increase of expression and/or the enzymatic activity by at least 100%, or more preferably an increase of the expression and/or of the enzymatic activity by at least 3 fold, for example at least 5 fold, or even more preferably an increase of the expression and/or enzymatic activity by at least 10 fold or even more preferably an increase of the expression and/or the enzymatic activity by at least 20 fold.
[0175] The increase of the expression and/or activity of the asd gene or the gdhA gene leads to an improved yield and/or productivity of alanine in the recombinant microorganism of the invention compared to a respective reference microorganism. Therefore the increase of the expression and/or activity of the asd gene or the gdhA gene may be determined by measuring alanine yield or productivity of the recombinant microorganism of the invention compared to a respective reference microorganism. Methods for fermentative production of metabolites, for example alanine are known to a skilled person and also described herein. Improved yield of e.g. alanine in fermentation by the microorganism of the invention compared to yield of alanine in fermentation by a respective reference microorganism is a measure for the increase of the expression and or activity of the asd gene or the gdhA gene.
[0176] Methods for determining the lactate dehydrogenase (IdhA) expression or activity are, for example, disclosed by Bunch et al. in "The IdhA gene encoding the fermentative lactate de hydrogenase of Escherichia coli", Microbiology (1997), Vol. 143, pages 187-155; or Bergmeyer, H. U., Bergmeyer J. and Grassl, M. (1983-1986) in "Methods of Enzymatic Analysis", 3rd Edition, Volume III, pages 126-133, Verlag Chemie, Weinheim; or Enzymes in Industry: Production and Applications, Second Edition (2004), Wolfgang Aehle, page 23. Preferred is the last method.
[0177] Methods for determining the pyruvate formate lyase I (pflB) expression or activity are, for example, disclosed in Knappe J, Blaschkowski H P, Grobner P, Schmitt T (1974). "Pyruvate formate-lyase of Escherichia coli: the acetyl-enzyme intermediate." Eur J Biochem 1974; 50(1); 253-63. PMID: 4615902; in KNAPPE, Joachim, et al. "Pyruvate Formate--Lyase of Escherichia coli: the Acetyl-Enzyme Intermediate." European Journal of Biochemistry 50.1 (1974): 253-263; in Wong, Kenny K., et al. "Molecular properties of pyruvate formate-lyase activating enzyme." Biochemistry 32.51 (1993): 14102-14110 and in Nnyepi, Mbako R., Yi Peng, and Joan B. Broderick. "Inactivation of E. coli pyruvate formate-lyase: Role of AdhE and small molecules." Archives of biochemistry and biophysics 459.1 (2007): 1-9.
[0178] Methods for determining the bifunctional acetaldehyde-CoA dehydrogenase/iron-dependent alcohol dehydrogenase/pyruvate-formate lyase deactivase (adhE) expression or activity are, for example, disclosed in Membrillo-Hernandez, Jorge, et al. "Evolution of the adhE Gene Product of Escherichia coli from a Functional Reductase to a Dehydrogenase GENETIC AND BIOCHEMICAL STUDIES OF THE MUTANT PROTEINS." Journal of Biological Chemistry 275.43 (2000): 33869-33875 and in Mbako R. Nnyepi, Yi Peng, Joan B. Broderick, Inactivation of E. coli pyruvate formate-lyase: Role of AdhE and small molecules, Archives of Biochemistry and Biophysics, Volume 459, Issue 1, 1 Mar. 2007, Pages 1-9.
[0179] Methods for determining the phosphate acetyltransferase (pta) expression or activity are, for example, disclosed in Dittrich, Cheryl R., George N. Bennett, and Ka-Yiu San. "Characterization of the Acetate-Producing Pathways in Escherichia coli." Biotechnology progress 21.4 (2005): 1062-1067 and in Brown, T. D. K., M. C. Jones-Mortimer, and H. L. Kornberg. "The enzymic interconversion of acetate and acetyl-coenzyme A in Escherichia coli." Journal of general microbiology 102.2 (1977): 327-336.
[0180] Methods for determining the fumarate reductase (frdA) expression or activity are, for example, disclosed in Dickie, Peter, and Joel H. Weiner. "Purification and characterization of membrane-bound fumarate reductase from anaerobically grown Escherichia coli." Canadian journal of biochemistry 57.6 (1979): 813-821; in Cecchini, Gary, et al. "Reconstitution of quinone reduction and characterization of Escherichia coli fumarate reductase activity." Journal of Biological Chemistry 261.4 (1986): 1808-1814 or in Schroder, I., et al. "Identification of active site residues of Escherichia coli fumarate reductase by site-directed mutagenesis." Journal of Biological Chemistry 266.21 (1991): 13572-13579.
[0181] Methods for determining the alanine dehydrogenase (alaD) expression or activity are, for example, disclosed in Sakamoto, Y., Nagata, S., Esaki, N., Tanaka, H., Soda, K. "Gene cloning, purification and characterization of thermostable alanine dehydrogenase of Bacillus stearothermophilus" J Fermen. Bioeng. 69 (1990):154-158.
[0182] The term "reduced expression of an enzyme" includes, for example, the expression of the enzyme by said genetically manipulated (e.g., genetically engineered) microorganism at a lower level than that expressed by a respective reference microorganism for example the wild type of said microorganism. Genetic manipulations for reducing the expression of an enzyme can include, but are not limited to, deleting the gene or parts thereof encoding for the enzyme, altering or modifying regulatory sequences or sites associated with expression of the gene encoding the enzyme (e.g., by removing strong promoters or repressible promoters), modifying proteins (e.g., regulatory proteins, suppressors, enhancers, transcriptional activators and the like) involved in transcription of the gene encoding the enzyme and/or the translation of the gene product, or any other conventional means of decreasing expression of a particular gene routine in the art (including, but not limited to, the use of antisense nucleic acid molecules or other methods to knock-out or block expression of the target protein). Further on, one may introduce destabilizing elements into the mRNA or introduce genetic modifications leading to deterioration of ribosomal binding sites (RBS) of the RNA. It is also possible to change the codon usage of the gene in a way, that the translation efficiency and speed is decreased.
[0183] A reduced activity of an enzyme can also be obtained by introducing one or more deleterious gene mutations which lead to a reduced activity of the enzyme. Furthermore, a reduction of the activity of an enzyme may also include an inactivation (or the reduced expression) of activating enzymes which are necessary in order to activate the enzyme the activity of which is to be reduced. By the latter approach the enzyme the activity of which is to be reduced is preferably kept in an inactivated state.
[0184] A deleterious mutation according to this application is any mutation within a gene comprising promoter and coding region that lead to a decreased or deleted protein activity of the protein encoded by the coding region of the gene. Such deleterious mutations comprise for example frameshifts, introduction of stop-codons in the coding region, mutation of promoter elements such as the TATA box that prevent transcription and the like.
[0185] Microorganisms having an increased or enhanced expression and/or activity of the enzyme encoded by the asd gene or the gdhA gene may occur naturally, i.e. due to spontaneous mutations. A microorganism can be modified to have significantly increased activity of the enzyme that is encoded by one or more of said genes by various techniques, such as chemical treatment or radiation. To this end, microorganisms will be treated by, e.g., a mutagenizing chemical agent, X-rays, or UV light. In a subsequent step, those microorganisms which have an increased expression and/or activity of the enzyme that is encoded by one or more of said genes will be selected. Recombinant microorganisms are also obtainable by homologous recombination techniques which aim to substitute one or more of said genes with a corresponding gene that encodes for an enzyme which, compared to the enzyme encoded by the wild type gene, has an increased expression and/or activity.
[0186] According to one embodiment of the recombinant microorganism according to the present invention, an increase of the expression and/or activity of the enzyme encoded by the asd gene or the gdhA gene may be achieved by a modification of the asd gene or the gdhA gene, wherein this/these gene modification(s) is(are) preferably realized by multiplication of the copy-number of the asd gene or the gdhA gene in the genome of the microorganism, by introducing the gene on a self-replicating expression vector into the microorganism, by exchanging the promoter of the asd gene or the gdhA gene against a stronger promoter or by converting the endogenous promoter of the gene into a stronger promoter, e.g. by introducing point-mutations into the promoter sequence.
[0187] Further the activity of the asd gene or the gdhA gene may be enhanced by mutating the gene in order to achieve amino acid exchanges in the protein which improve activity of the gene. Such methods are known to a skilled person.
[0188] A mutation into the above-gene can be introduced, for example, by site-directed or random mutagenesis, followed by an introduction of the modified gene into the genome of the microorganism by recombination. Variants of the genes can be are generated by mutating the gene sequences by means of PCR. The "Quickchange Site-directed Mutagenesis Kit" (Stratagene) can be used to carry out a directed mutagenesis. A random mutagenesis over the entire coding sequence, or else only part thereof, can be performed with the aid of the "GeneMorph II Random Mutagenesis Kit" (Stratagene). The mutagenesis rate is set to the desired amount of mutations via the amount of the template DNA used. Multiple mutations are generated by the targeted combination of individual mutations or by the sequential performance of several mutagenesis cycles.
[0189] In the following, a suitable technique for recombination, in particular for introducing a mutation or for deleting sequences, is described.
[0190] This technique is also sometimes referred to as the "Campbell recombination" herein (Leenhouts et al., Appl Env Microbiol. (1989), Vol. 55, pages 394-400). "Campbell in", as used herein, refers to a transformant of an original host cell in which an entire circular double stranded DNA molecule (for example a plasmid) has integrated into a chromosome by a single homologous recombination event (a cross in event), and that effectively results in the insertion of a linearized version of said circular DNA molecule into a first DNA sequence of the chromosome that is homologous to a first DNA sequence of the said circular DNA molecule. "Campbelled in" refers to the linearized DNA sequence that has been integrated into the chromosome of a "Campbell in" transformant. A "Campbell in" contains a duplication of the first homologous DNA sequence, each copy of which includes and surrounds a copy of the homologous recombination crossover point.
[0191] "Campbell out", as used herein, refers to a cell descending from a "Campbell in" transformant, in which a second homologous recombination event (a cross out event) has occurred between a second DNA sequence that is contained on the linearized inserted DNA of the "Campbelled in" DNA, and a second DNA sequence of chromosomal origin, which is homologous to the second DNA sequence of said linearized insert, the second recombination event resulting in the deletion (jettisoning) of a portion of the integrated DNA sequence, but, importantly, also resulting in a portion (this can be as little as a single base) of the integrated Campbelled in DNA remaining in the chromosome, such that compared to the original host cell, the "Campbell out" cell contains one or more intentional changes in the chromosome (for example, a single base substitution, multiple base substitutions, insertion of a heterologous gene or DNA sequence, insertion of an additional copy or copies of a homologous gene or a modified homologous gene, or insertion of a DNA sequence comprising more than one of these aforementioned examples listed above). A "Campbell out" cell is, preferably, obtained by a counter-selection against a gene that is contained in a portion (the portion that is desired to be jettisoned) of the "Campbelled in" DNA sequence, for example the Bacillus subtilis sacB-gene, which is lethal when expressed in a cell that is grown in the presence of about 5% to 10% sucrose. Either with or without a counter-selection, a desired "Campbell out" cell can be obtained or identified by screening for the desired cell, using any screenable phenotype, such as, but not limited to, colony morphology, colony color, presence or absence of antibiotic resistance, presence or absence of a given DNA sequence by polymerase chain reaction, presence or absence of an auxotrophy, presence or absence of an enzyme, colony nucleic acid hybridization, antibody screening, etc. The term "Campbell in" and "Campbell out" can also be used as verbs in various tenses to refer to the method or process described above.
[0192] It is understood that the homologous recombination events that leads to a "Campbell in" or "Campbell out" can occur over a range of DNA bases within the homologous DNA sequence, and since the homologous sequences will be identical to each other for at least part of this range, it is not usually possible to specify exactly where the crossover event occurred. In other words, it is not possible to specify precisely which sequence was originally from the inserted DNA, and which was originally from the chromosomal DNA. Moreover, the first homologous DNA sequence and the second homologous DNA sequence are usually separated by a region of partial non-homology, and it is this region of non-homology that remains deposited in a chromosome of the "Campbell out" cell.
[0193] Preferably, first and second homologous DNA sequence are at least about 200 base pairs in length, and can be up to several thousand base pairs in length. However, the procedure can be made to work with shorter or longer sequences. For example, a length for the first and second homologous sequences can range from about 500 to 2000 bases, and the obtaining of a "Campbell out" from a "Campbell in" is facilitated by arranging the first and second homologous sequences to be approximately the same length, preferably with a difference of less than 200 base pairs and most preferably with the shorter of the two being at least 70% of the length of the longer in base pairs.
[0194] In one embodiment the induction of the expression and/or activity of asd is achieved by an activation of the asd gene which encodes the aspartate-beta-semialdehyde dehydrogenase.
[0195] In one embodiment the induction of the expression and/or activity of gdhA is achieved by an activation of the gdhA gene which encodes the glutamate dehydrogenase.
[0196] The terms "alanine, pyruvate, succinate, aspartate, malate, lactate, valine and/or leucine", as used in the context of the present invention, has to be understood in their broadest sense and also encompasses salts thereof, as for example alkali metal salts, like Na.sup.+ and K.sup.+-salts, or earth alkali salts, like Mg.sup.2+ and Ca.sup.2+-salts, or ammonium salts or anhydrides of alanine, pyruvate, succinate, aspartate, malate, lactate, valine and/or leucine.
[0197] Preferably, alanine, pyruvate, succinate, aspartate, malate, lactate, valine and/or leucine, preferably succinate or alanine, more preferably alanine is produced under microaerobic conditions. Aerobic or anaerobic conditions may be also used.
[0198] Microaerobic means that the concentration of oxygen is less than that in air. According to one embodiment microaerobic means oxygen tension between 5 and 27 mm Hg, preferably between 10 and 20 Hg (Megan Falsetta et al. (2011), The composition and metabolic phenotype of Neisseria gonorrhoeae biofilms, Frontiers in Microbiology, Vol 2, page 1 to 11). Preferably the microaerobic conditions are established with 0.1 to 1 vvm air flow.
[0199] Anaerobic conditions may be established by means of conventional techniques, as for example by degassing the constituents of the reaction medium and maintaining anaerobic conditions by introducing carbon dioxide or nitrogen or mixtures thereof and optionally hydrogen at a flow rate of, for example, 0.1 to 1 or 0.2 to 0.5 vvm. Aerobic conditions may be established by means of conventional techniques, as for example by introducing air or oxygen at a flow rate of, for example, 0.1 to 1 or 0.2 to 0.5 vvm. If appropriate, a slight over pressure of 0.1 to 1.5 bar may be applied in the process.
[0200] According to one embodiment of the process according to the present invention the assimilable carbon source may be glucose, glycerin, glucose, maltose, maltodextrin, fructose, galactose, mannose, xylose, sucrose, arabinose, lactose, raffinose and combinations thereof.
[0201] In a preferred embodiment the assimilable carbon source is glucose, sucrose, xylose, arabinose, glycerol or combinations thereof. Preferred carbon sources are glucose, sucrose, glucose and sucrose, glucose and xylose and/or glucose, arabinose and xylose. According to one embodiment of the process according to the present invention the assimilable carbon source may be sucrose, glycerin and/or glucose.
[0202] The initial concentration of the assimilable carbon source, preferably the initial concentration is, preferably, adjusted to a value in a range of 5 to 250 g/l, preferably 50 to 200 g/l and more preferably 125 to 150 g/l, most preferably about 140 g/l and may be maintained in said range during cultivation. The pH of the reaction medium may be controlled by addition of suitable bases as for example, gaseous ammonia, NH.sub.4OH, NH.sub.4HCO.sub.3, (NH.sub.4).sub.2CO.sub.3, NaOH, Na.sub.2CO.sub.3, NaHCO.sub.3, KOH, K.sub.2CO.sub.3, KHCO.sub.3, Mg(OH).sub.2, MgCO.sub.3, Mg(HCO.sub.3).sub.2, Ca(OH).sub.2, CaCO.sub.3, Ca(HCO.sub.3).sub.2, CaO, CH.sub.6N.sub.2O.sub.2, C.sub.2H.sub.7N and/or mixtures thereof.
[0203] Another embodiment of the invention is a process for fermentative production of alanine, pyruvate, succinate, aspartate, malate, lactate, valine and/or leucine, preferably succinate or alanine, more preferably alanine, most preferably L-alanine comprising the steps of
[0204] I) growing the microorganism according to the invention as defined above in a fermenter and
[0205] II) recovering alanine, pyruvate, succinate, aspartate, malate, lactate, valine and/or leucine, preferably succinate or alanine, more preferably alanine, most preferably L-alanine from the fermentation broth obtained in I).
[0206] The fermentation step I) according to the present invention can, for example, be performed in stirred fermenters, bubble columns and loop reactors. A comprehensive overview of the possible method types including stirrer types and geometric designs can be found in Chmiel: "Bioprozesstechnik: Einfuhrung in die Bioverfahrenstechnik", Volume 1. In the process according to the present invention, typical variants available are the following variants known to those skilled in the art or explained, for example, in Chmiel, Hammes and Bailey: "Biochemical Engineering", such as batch, fed-batch, repeated fed-batch or else continuous fermentation with and without recycling of the biomass. Depending on the production strain, sparging with air, oxygen, carbon dioxide, hydrogen, nitrogen or appropriate gas mixtures may be effected in order to achieve good yield (YP/S).
[0207] Particularly preferred conditions for producing alanine, pyruvate, succinate, aspartate, malate, lactate, valine and/or leucine, preferably succinate or alanine, more preferably alanine, most preferably L-alanine in process step I) are:
[0208] Assimilable carbon source: glucose
[0209] Temperature: 30 to 45.degree. C.
[0210] pH: 6.0 to 7.0
[0211] Microaerobic conditions
[0212] In process step II) the product is recovered from the fermentation broth obtained in process step I).
[0213] Usually, the recovery process comprises the step of separating the recombinant microorganisms from the fermentation broth as the so called "biomass". Processes for removing the biomass are known to those skilled in the art, and comprise filtration, sedimentation, flotation or combinations thereof. Consequently, the biomass can be removed, for example, with centrifuges, separators, decanters, filters or in a flotation apparatus. For maximum recovery of the product of value, washing of the biomass is often advisable, for example in the form of a diafiltration. The selection of the method is dependent upon the biomass content in the fermentation broth and the properties of the biomass, and also the interaction of the biomass with the organic compound (e. the product of value). In one embodiment, the fermentation broth can be sterilized or pasteurized. In a further embodiment, the fermentation broth is concentrated. Depending on the requirement, this concentration can be done batch wise or continuously.
[0214] The pressure and temperature range should be selected such that firstly no product damage occurs, and secondly minimal use of apparatus and energy is necessary. The skillful selection of pressure and temperature levels for a multistage evaporation in particular enables saving of energy.
[0215] The recovery process may further comprise additional purification steps in which the fermentation product is further purified. If, however, the fermentation product is converted into a secondary organic product by chemical reactions, a further purification of the fermentation product might, depending on the kind of reaction and the reaction conditions, not necessarily be required. For the purification of the fermentation product obtained in process step II) methods known to the person skilled in the art can be used, as for example crystallization, filtration, electrodialysis and chromatography. The resulting solution may be further purified by means of ion exchange chromatography in order to remove undesired residual ions.
Definitions
[0216] It is to be understood that this invention is not limited to the particular methodology or protocols. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention which will be limited only by the appended claims. It must be noted that as used herein and in the appended claims, the singular forms "a," "and," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, reference to "a vector" is a reference to one or more vectors and includes equivalents thereof known to those skilled in the art, and so forth. The term "about" is used herein to mean approximately, roughly, around, or in the region of. When the term "about" is used in conjunction with a numerical range, it modifies that range by extending the boundaries above and below the numerical values set forth. In general, the term "about" is used herein to modify a numerical value above and below the stated value by a variance of 20 percent, preferably 10 percent up or down (higher or lower). As used herein, the word "or" means any one member of a particular list and also includes any combination of members of that list. The words "comprise," "comprising," "include," "including," and "includes" when used in this specification and in the following claims are intended to specify the presence of one or more stated features, integers, components, or steps, but they do not preclude the presence or addition of one or more other features, integers, components, steps, or groups thereof. For clarity, certain terms used in the specification are defined and used as follows:
[0217] Coding region: As used herein the term "coding region" when used in reference to a structural gene refers to the nucleotide sequences which encode the amino acids found in the nascent polypeptide as a result of translation of a mRNA molecule. The coding region is bounded, in eukaryotes, on the 5'-side by the nucleotide triplet "ATG" which encodes the initiator methionine, prokaryotes also use the triplets "GTG" and "TTG" as startcodon. On the 3'-side it is bounded by one of the three triplets which specify stop codons (i.e., TAA, TAG, TGA). In addition a gene may include sequences located on both the 5'- and 3'-end of the sequences which are present on the RNA transcript. These sequences are referred to as "flanking" sequences or regions (these flanking sequences are located 5' or 3' to the non-translated sequences present on the mRNA transcript). The 5'-flanking region may contain regulatory sequences such as promoters and enhancers which control or influence the transcription of the gene. The 3'-flanking region may contain sequences which direct the termination of transcription, post-transcriptional cleavage and polyadenylation.
[0218] Complementary: "Complementary" or "complementarity" refers to two nucleotide sequences which comprise antiparallel nucleotide sequences capable of pairing with one another (by the base-pairing rules) upon formation of hydrogen bonds between the complementary base residues in the antiparallel nucleotide sequences. For example, the sequence 5'-AGT-3' is complementary to the sequence 5'-ACT-3'. Complementarity can be "partial" or "total." "Partial" complementarity is where one or more nucleic acid bases are not matched according to the base pairing rules. "Total" or "complete" complementarity between nucleic acid molecules is where each and every nucleic acid base is matched with another base under the base pairing rules. The degree of complementarity between nucleic acid molecule strands has significant effects on the efficiency and strength of hybridization between nucleic acid molecule strands. A "complement" of a nucleic acid sequence as used herein refers to a nucleotide sequence whose nucleic acid molecules show total complementarity to the nucleic acid molecules of the nucleic acid sequence.
[0219] Endogenous: An "endogenous" nucleotide sequence refers to a nucleotide sequence, which is present in the genome of a wild type microorganism.
[0220] Enhanced expression: "enhance" or "increase" the expression of a nucleic acid molecute in a microorganism are used equivalently herein and mean that the level of expression of a nucleic acid molecule in a microorganism is higher compared to a reference microorganism, for example a wild type. The terms "enhanced" or "increased" as used herein mean herein higher, preferably significantly higher expression of the nucleic acid molecule to be expressed. As used herein, an "enhancement" or "increase" of the level of an agent such as a protein, mRNA or RNA means that the level is increased relative to a substantially identical microorganism grown under substantially identical conditions. As used herein, "enhancement" or "increase" of the level of an agent, such as for example a preRNA, mRNA, rRNA, tRNA, expressed by the target gene and/or of the protein product encoded by it, means that the level is increased 50% or more, for example 100% or more, preferably 200% or more, more preferably 5 fold or more, even more preferably 10 fold or more, most preferably 20 fold or more for example 50 fold relative to a suitable reference microorganism. The enhancement or increase can be determined by methods with which the skilled worker is familiar. Thus, the enhancement or increase of the nucleic acid or protein quantity can be determined for example by an immunological detection of the protein. Moreover, techniques such as protein assay, fluorescence, Northern hybridization, nuclease protection assay, reverse transcription (quantitative RT-PCR), ELISA (enzyme-linked immunosorbent assay), Western blotting, radioimmunoassay (RIA) or other immunoassays and fluorescence-activated cell analysis (FACS) can be employed to measure a specific protein or RNA in a microorganism. Depending on the type of the induced protein product, its activity or the effect on the phenotype of the microorganism may also be determined. Methods for determining the protein quantity are known to the skilled worker. Examples, which may be mentioned, are: the micro-Biuret method (Goa J (1953) Scand J Clin Lab Invest 5:218-222), the Folin-Ciocalteau method (Lowry O H et al. (1951) J Biol Chem 193:265-275) or measuring the absorption of CBB G-250 (Bradford MM (1976) Analyt Biochem 72:248-254).
[0221] Expression: "Expression" refers to the biosynthesis of a gene product, preferably to the transcription and/or translation of a nucleotide sequence, for example an endogenous gene or a heterologous gene, in a cell. For example, in the case of a structural gene, expression involves transcription of the structural gene into mRNA and--optionally--the subsequent translation of mRNA into one or more polypeptides. In other cases, expression may refer only to the transcription of the DNA harboring an RNA molecule.
[0222] Foreign: The term "foreign" refers to any nucleic acid molecule (e.g., gene sequence) which is introduced into a cell by experimental manipulations and may include sequences found in that cell as long as the introduced sequence contains some modification (e.g., a point mutation, the presence of a selectable marker gene, etc.) and is therefore different relative to the naturally-occurring sequence.
[0223] Functional linkage: The term "functional linkage" or "functionally linked" is equivalent to the term "operable linkage" or "operably linked" and is to be understood as meaning, for example, the sequential arrangement of a regulatory element (e.g. a promoter) with a nucleic acid sequence to be expressed and, if appropriate, further regulatory elements (such as e.g., a terminator) in such a way that each of the regulatory elements can fulfill its intended function to allow, modify, facilitate or otherwise influence expression of said nucleic acid sequence. As a synonym the wording "operable linkage" or "operably linked" may be used. The expression may result depending on the arrangement of the nucleic acid sequences in relation to sense or antisense RNA. To this end, direct linkage in the chemical sense is not necessarily required. Genetic control sequences such as, for example, enhancer sequences, can also exert their function on the target sequence from positions which are further away, or indeed from other DNA molecules. Preferred arrangements are those in which the nucleic acid sequence to be expressed recombinantly is positioned behind the sequence acting as promoter, so that the two sequences are linked covalently to each other. In a preferred embodiment, the nucleic acid sequence to be transcribed is located behind the promoter in such a way that the transcription start is identical with the desired beginning of the chimeric RNA of the invention. Functional linkage, and an expression construct, can be generated by means of customary recombination and cloning techniques as described (e.g., in Maniatis T, Fritsch E F and Sambrook J (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor (N.Y.); Silhavy et al. (1984) Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor (N.Y.); Ausubel et al. (1987) Current Protocols in Molecular Biology, Greene Publishing Assoc. and Wiley Interscience; Gelvin et al. (Eds) (1990) Plant Molecular Biology Manual; Kluwer Academic Publisher, Dordrecht, The Netherlands). However, further sequences, which, for example, act as a linker with specific cleavage sites for restriction enzymes, or as a signal peptide, may also be positioned between the two sequences. The insertion of sequences may also lead to the expression of fusion proteins. Preferably, the expression construct, consisting of a linkage of a regulatory reBion for example a promoter and nucleic acid sequence to be expressed, can exist in a vector-integrated form or can be inserted into the genome, for example by transformation.
[0224] Gene: The term "gene" refers to a region operably linked to appropriate regulatory sequences capable of regulating the expression of the gene product (e.g., a polypeptide or a functional RNA) in some manner. A gene includes untranslated regulatory regions of DNA (e.g., promoters, enhancers, repressors, etc.) preceding (up-stream) and following (downstream) the coding region (open reading frame, ORF). The term "structural gene" as used herein is intended to mean a DNA sequence that is transcribed into mRNA which is then translated into a sequence of amino acids characteristic of a specific polypeptide.
[0225] Genome and genomic DNA: The terms "genome" or "genomic DNA" is referring to the heritable genetic information of a host organism. Said genomic DNA comprises the DNA of the nucleoid but also the DNA of the self replicating plasmid.
[0226] Heterologous: The term "heterologous" with respect to a nucleic acid molecule or DNA refers to a nucleic acid molecule which is operably linked to, or is manipulated to become operably linked to, a second nucleic acid molecule to which it is not operably linked in nature, or to which it is operably linked at a different location in nature. A heterologous expression construct comprising a nucleic acid molecule and one or more regulatory nucleic acid molecule (such as a promoter or a transcription termination signal) linked thereto for example is a constructs originating by experimental manipulations in which either a) said nucleic acid molecule, or b) said regulatory nucleic acid molecule or c) both (i.e. (a) and (b)) is not located in its natural (native) genetic environment or has been modified by experimental manipulations, an example of a modification being a substitution, addition, deletion, inversion or insertion of one or more nucleotide residues. Natural genetic environment refers to the natural genomic locus in the organism of origin, or to the presence in a genomic library. In the case of a genomic library, the natural genetic environment of the sequence of the nucleic acid molecule is preferably retained, at least in part. The environment flanks the nucleic acid sequence at least at one side and has a sequence of at least 50 bp, preferably at least 500 bp, especially preferably at least 1,000 bp, very especially preferably at least 5,000 bp, in length. A naturally occurring expression construct--for example the naturally occurring combination of a promoter with the corresponding gene--becomes a transgenic expression construct when it is modified by non-natural, synthetic "artificial" methods such as, for example, mutagenization. Such methods have been described (U.S. Pat. No. 5,565,350; WO 00/15815). For example a protein encoding nucleic acid molecule operably linked to a promoter, which is not the native promoter of this molecule, is considered to be heterologous with respect to the promoter. Preferably, heterologous DNA is not endogenous to or not naturally associated with the cell into which it is introduced, but has been obtained from another cell or has been synthesized. Heterologous DNA also includes an endogenous DNA sequence, which contains some modification, non-naturally occurring, multiple copies of an endogenous DNA sequence, or a DNA sequence which is not naturally associated with another DNA sequence physically linked thereto. Generally, although not necessarily, heterologous DNA encodes RNA or proteins that are not normally produced by the cell into which it is expressed.
[0227] Hybridization: The term "hybridization" as used herein includes "any process by which a strand of nucleic acid molecule joins with a complementary strand through base pairing." (J. Coombs (1994) Dictionary of Biotechnology, Stockton Press, New York). Hybridization and the strength of hybridization (i.e., the strength of the association between the nucleic acid molecules) is impacted by such factors as the degree of complementarity between the nucleic acid molecules, stringency of the conditions involved, the Tm of the formed hybrid, and the G:C ratio within the nucleic acid molecules. As used herein, the term "Tm" is used in reference to the "melting temperature." The melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half dissociated into single strands. The equation for calculating the Tm of nucleic acid molecules is well known in the art. As indicated by standard references, a simple estimate of the Tm value may be calculated by the equation: Tm=81.5+0.41(% G+C), when a nucleic acid molecule is in aqueous solution at 1 M NaCl [see e.g., Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid Hybridization (1985)]. Other references include more sophisticated computations, which take structural as well as sequence characteristics into account for the calculation of Tm. Stringent conditions, are known to those skilled in the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6.
[0228] Suitable hybridization conditions are for example hybridizing under conditions equivalent to hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50.degree. C. with washing in 2.times.SSC, 0.1% SDS at 50.degree. C. (low stringency) to a nucleic acid molecule comprising at least 50, preferably at least 100, more preferably at least 150, even more preferably at least 200, most preferably at least 250 consecutive nucleotides of the complement of a sequence. Other suitable hybridizing conditions are hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50.degree. C. with washing in 1.times.SSC, 0.1% SDS at 50.degree. C. (medium stringency) or 65.degree. C. (high stringency) to a nucleic acid molecule comprising at least 50, preferably at least 100, more preferably at least 150, even more preferably at least 200, most preferably at least 250 consecutive nucleotides of a complement of a sequence. Other suitable hybridization conditions are hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4, 1 mM EDTA at 50.degree. C. with washing in 0.1.times.SSC, 0.1% SDS at 65.degree. C. (very high stringency) to a nucleic acid molecule comprising at least 50, preferably at least 100, more preferably at least 150, even more preferably at least 200, most preferably at least 250 consecutive nucleotides of a complement of a sequence.
[0229] "Identity": "Identity" when used in respect to the comparison of two or more nucleic acid or amino acid molecules means that the sequences of said molecules share a certain degree of sequence similarity, the sequences being partially identical.
[0230] To determine the percentage identity (homology is herein used interchangeably if referring to nucleic acid sequences) of two amino acid sequences or of two nucleic acid molecules, the sequences are written one underneath the other for an optimal comparison (for example gaps may be inserted into the sequence of a protein or of a nucleic acid in order to generate an optimal alignment with the other protein or the other nucleic acid).
[0231] The amino acid residues or nucleic acid molecules at the corresponding amino acid positions or nucleotide positions are then compared. If a position in one sequence is occupied by the same amino acid residue or the same nucleic acid molecule as the corresponding position in the other sequence, the molecules are identical at this position. The percentage identity between the two sequences is a function of the number of identical positions shared by the sequences (i.e. % identity=number of identical positions/total number of positions.times.100). The terms "homology" and "identity" are thus to be considered as synonyms when referring to nucleic acid sequences. When referring to amino acid sequences the term identity refers to identical amino acids at a specific position in a sequence, the term homology refers to homologous amino acids at a specific position in a sequence. Homologous amino acids are amino acids having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art.
[0232] A nucleic acid molecule encoding a protein homologous to a protein of the invention can be created by introducing one or more nucleotide substitutions, additions or deletions into a nucleotide sequence such that one or more amino acid substitutions, additions or deletions are introduced into the encoded protein. Mutations can be introduced into one of the sequences of the invention by standard techniques, such as site-directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are made at one or more predicted non-essential amino acid residues. A "conservative amino acid substitution" is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue in a protein of the invention is preferably replaced with another amino acid residue from the same side chain family.
[0233] Alternatively, in another embodiment, mutations can be introduced randomly along all or part of a coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened for the respective activity described herein to identify mutants that retain their activity. Following mutagenesis of one of the sequences of the invention, the encoded protein can be expressed recombinantly and the activity of the protein can be determined using, for example, assays described herein.
[0234] For the determination of the percentage identity of two or more amino acids or of two or more nucleotide sequences several computer software programs have been developed. The identity of two or more sequences can be calculated with for example the software fasta, which presently has been used in the version fasta 3 (W. R. Pearson and D. J. Lipman, PNAS 85, 2444(1988); W. R. Pearson, Methods in Enzymology 183, 63 (1990); W. R. Pearson and D. J. Lipman, PNAS 85, 2444 (1988); W. R. Pearson, Enzymology 183, 63 (1990)). Another useful program for the calculation of identities of different sequences is the standard blast program, which is included in the Biomax pedant software (Biomax, Munich, Federal Republic of Germany). This leads unfortunately sometimes to suboptimal results since blast does not always include complete sequences of the subject and the query. Nevertheless as this program is very efficient it can be used for the comparison of a huge number of sequences. The following settings are typically used for such a comparisons of sequences:
[0235] --p Program Name [String]; --d Database [String]; default=nr; --i Query File [File In]; default=stdin; --e Expectation value (E) [Real]; default=10.0; --m alignment view options: 0=pairwise; 1=query-anchored showing identities; 2=query-anchored no identities; 3=flat query-anchored, show identities; 4=flat query-anchored, no identities; 5=query-anchored no identities and blunt ends; 6=flat query-anchored, no identities and blunt ends; 7=XML Blast output; 8=tabular; 9 tabular with comment lines [Integer]; default=0; --o BLAST report Output File [File Out] Optional; default=stdout; --F Filter query sequence (DUST with blastn, SEG with others) [String]; default=T; --G Cost to open a gap (zero invokes default behavior) [Integer]; default=0; --E Cost to extend a gap (zero invokes default behavior) [Integer]; default=0; --X X dropoff value for gapped alignment (in bits) (zero invokes default behavior); blastn 30, megablast 20, tblastx 0, all others 15 [Integer]; default=0; --I Show GI's in deflines [T/F]; default=F; --q Penalty for a nucleotide mismatch (blastn only) [Integer]; default=-3; --r Reward for a nucleotide match (blastn only) [Integer]; default=1; --v Number of database sequences to show one-line descriptions for (V) [Integer]; default=500; --b Number of database sequence to show alignments for (B) [Integer]; default=250; --f Threshold for extending hits, default if zero; blastp 11, blastn 0, blastx 12, tblastn 13; tblastx 13, megablast 0 [Integer]; default=0; --g Perfom gapped alignment (not available with tblastx) [T/F]; default=T; --Q Query Genetic code to use [Integer]; default=1; --D DB Genetic code (for tblast[nx] only) [Integer]; default=1; --a Number of processors to use [Integer]; default=1; --O SeqAlign file [File Out] Optional; --J Believe the query defline [T/F]; default=F; --M Matrix [String]; default=BLOSUM62; --W Word size, default if zero (blastn 11, megablast 28, all others 3) [Integer]; default=0; --z Effective length of the database (use zero for the real size) [Real]; default=0; --K Number of best hits from a region to keep (off by default, if used a value of 100 is recommended) [Integer]; default=0; --P 0 for multiple hit, 1 for single hit [Integer]; default=0; --Y Effective length of the search space (use zero for the real size) [Real]; default=0; --S Query strands to search against database (for blast[nx], and tblastx); 3 is both, 1 is top, 2 is bottom [Integer]; default=3; --T Produce HTML output [T/F]; default=F; --I Restrict search of database to list of GI's [String] Optional; --U Use lower case filtering of FASTA sequence [T/F] Optional; default=F; --y X dropoff value for ungapped extensions in bits (0.0 invokes default behavior); blastn 20, megablast 10, all others 7 [Real]; default=0.0; --Z X dropoff value for final gapped alignment in bits (0.0 invokes default behavior); blastn/megablast 50, tblastx 0, all others 25 [Integer]; default=0; --R PSI-TBLASTN checkpoint file [File In] Optional; --n MegaBlast search [T/F]; default=F; --L Location on query sequence [String] Optional; --A Multiple Hits window size, default if zero (blastn/megablast 0, all others 40 [Integer]; default=0; --w Frame shift penalty (OOF algorithm for blastx) [Integer]; default=0; --t Length of the largest intron allowed in tblastn for linking HSPs (0 disables linking) [Integer]; default=0.
[0236] Results of high quality are reached by using the algorithm of Needleman and Wunsch or Smith and Waterman. Therefore programs based on said algorithms are preferred. Advantageously the comparisons of sequences can be done with the program PileUp (J. Mol. Evolution., 25, 351 (1987), Higgins et al., CABIOS 5, 151 (1989)) or preferably with the programs "Gap" and "Needle", which are both based on the algorithms of Needleman and Wunsch (J. Mol. Biol. 48; 443 (1970)), and "BestFit", which is based on the algorithm of Smith and Waterman (Adv. Appl. Math. 2; 482 (1981)). "Gap" and "BestFit" are part of the GCG software-package (Genetics Computer Group, 575 Science Drive, Madison, Wis., USA 53711 (1991); Altschul et al., (Nucleic Acids Res. 25, 3389 (1997)), "Needle" is part of the The European Molecular Biology Open Software Suite (EMBOSS) (Trends in Genetics 16 (6), 276 (2000)). Therefore preferably the calculations to determine the percentages of sequence identity are done with the programs "Gap" or "Needle" over the whole range of the sequences. The following standard adjustments for the comparison of nucleic acid sequences were used for "Needle": matrix: EDNAFULL, Gap_penalty: 10.0, Extend_penalty: 0.5. The following standard adjustments for the comparison of nucleic acid sequences were used for "Gap": gap weight: 50, length weight: 3, average match: 10.000, average mismatch: 0.000.
[0237] For example a sequence, which is said to have 80% identity with sequence SEQ ID NO: 1 at the nucleic acid level is understood as meaning a sequence which, upon comparison with the sequence represented by SEQ ID NO: 1 by the above program "Needle" with the above parameter set, has a 80% identity. Preferably the identity is calculated on the complete length of the query sequence, for example SEQ ID NO: 1.
[0238] Isolated: The term "isolated" as used herein means that a material has been removed by the hand of man and exists apart from its original, native environment and is therefore not a product of nature. An isolated material or molecule (such as a DNA molecule or enzyme) may exist in a purified form or may exist in a non-native environment such as, for example, in a transgenic host cell. For example, a naturally occurring nucleic acid molecule or polypeptide present in a living cell is not isolated, but the same nucleic acid molecule or polypeptide, separated from some or all of the coexisting materials in the natural system, is isolated. Such nucleic acid molecules can be part of a vector and/or such nucleic acid molecules or polypeptides could be part of a composition, and would be isolated in that such a vector or composition is not part of its original environment. Preferably, the term "isolated" when used in relation to a nucleic acid molecule, as in "an isolated nucleic acid sequence" refers to a nucleic acid sequence that is identified and separated from at least one contaminant nucleic acid molecule with which it is ordinarily associated in its natural source. Isolated nucleic acid molecule is nucleic acid molecule present in a form or setting that is different from that in which it is found in nature. In contrast, non-isolated nucleic acid molecules are nucleic acid molecules such as DNA and RNA, which are found in the state they exist in nature. For example, a given DNA sequence (e.g., a gene) is found on the host cell chromosome in proximity to neighboring genes; RNA sequences, such as a specific mRNA sequence encoding a specific protein, are found in the cell as a mixture with numerous other mRNAs, which encode a multitude of proteins. However, an isolated nucleic acid sequence comprising for example SEQ ID NO: 1 includes, by way of example, such nucleic acid sequences in cells which ordinarily contain SEQ ID NO: 1 where the nucleic acid sequence is in a genomic or plasmid location different from that of natural cells, or is otherwise flanked by a different nucleic acid sequence than that found in nature. The isolated nucleic acid sequence may be present in single-stranded or double-stranded form. When an isolated nucleic acid sequence is to be utilized to express a protein, the nucleic acid sequence will contain at a minimum at least a portion of the sense or coding strand (i.e., the nucleic acid sequence may be single-stranded). Alternatively, it may contain both the sense and anti-sense strands (i.e., the nucleic acid sequence may be double-stranded).
[0239] Non-coding: The term "non-coding" refers to sequences of nucleic acid molecules that do not encode part or all of an expressed protein. Non-coding sequences include but are not limited enhancers, promoter regions, 3' untranslated regions, and 5' untranslated regions.
[0240] Nucleic acids and nucleotides: The terms "nucleic acids" and "Nucleotides" refer to naturally occurring or synthetic or artificial nucleic acid or nucleotides. The terms "nucleic acids" and "nucleotides" comprise deoxyribonucleotides or ribonucleotides or any nucleotide analogue and polymers or hybrids thereof in either single- or double-stranded, sense or antisense form. Unless otherwise indicated, a particular nucleic acid sequence also implicitly encompasses conservatively modified variants thereof (e.g., degenerate codon substitutions) and complementary sequences, as well as the sequence explicitly indicated. The term "nucleic acid" is used inter-changeably herein with "gene", "cDNA, "mRNA", "oligonucleotide," and "nucleic acid molecule". Nucleotide analogues include nucleotides having modifications in the chemical structure of the base, sugar and/or phosphate, including, but not limited to, 5-position pyrimidine modifications, 8-position purine modifications, modifications at cytosine exocyclic amines, substitution of 5-bromo-uracil, and the like; and 2'-position sugar modifications, including but not limited to, sugar-modified ribonucleotides in which the 2'-OH is replaced by a group selected from H, OR, R, halo, SH, SR, NH2, NHR, NR2, or CN. Short hairpin RNAs (shRNAs) also can comprise non-natural elements such as non-natural bases, e.g., ionosin and xanthine, non-natural sugars, e.g., 2'-methoxy ribose, or non-natural phosphodiester linkages, e.g., methylphosphonates, phosphorothioates and peptides.
[0241] Nucleic acid sequence: The phrase "nucleic acid sequence" refers to a single or double-stranded polymer of deoxyribonucleotide or ribonucleotide bases read from the 5'- to the 3'-end. It includes chromosomal DNA, self-replicating plasmids, infectious polymers of DNA or RNA and DNA or RNA that performs a primarily structural role. "Nucleic acid sequence" also refers to a consecutive list of abbreviations, letters, characters or words, which represent nucleotides. In one embodiment, a nucleic acid can be a "probe" which is a relatively short nucleic acid, usually less than 100 nucleotides in length. Often a nucleic acid probe is from about 50 nucleotides in length to about 10 nucleotides in length. A "target region" of a nucleic acid is a portion of a nucleic acid that is identified to be of interest. A "coding region" of a nucleic acid is the portion of the nucleic acid, which is transcribed and translated in a sequence-specific manner to produce into a particular polypeptide or protein when placed under the control of appropriate regulatory sequences. The coding region is said to encode such a polypeptide or protein.
[0242] Oligonucleotide: The term "oligonucleotide" refers to an oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) or mimetics thereof, as well as oligonucleotides having non-naturally-occurring portions which function similarly. Such modified or substituted oligonucleotides are often preferred over native forms because of desirable properties such as, for example, enhanced cellular uptake, enhanced affinity for nucleic acid target and increased stability in the presence of nucleases. An oligonucleotide preferably includes two or more nucleomonomers covalently coupled to each other by linkages (e.g., phosphodiesters) or substitute linkages.
[0243] Overhang: An "overhang" is a relatively short single-stranded nucleotide sequence on the 5'- or 3'-hydroxyl end of a double-stranded oligonucleotide molecule (also referred to as an "extension," "protruding end," or "sticky end").
[0244] Polypeptide: The terms "polypeptide", "peptide", "oligopeptide", "polypeptide", "gene product", "expression product" and "protein" are used interchangeably herein to refer to a polymer or oligomer of consecutive amino acid residues.
[0245] Promoter: The terms "promoter", or "promoter sequence" are equivalents and as used herein, refer to a DNA sequence which when operably linked to a nucleotide sequence of interest is capable of controlling the transcription of the nucleotide sequence of interest into RNA. A promoter is located 5' (i.e., upstream), proximal to the transcriptional start site of a nucleotide sequence of interest whose transcription into mRNA it controls, and provides a site for specific binding by RNA polymerase and other transcription factors for initiation of transcription. The promoter does not comprise coding regions or 5' untranslated regions. The promoter may for example be heterologous or homologous to the respective cell. A nucleic acid molecule sequence is "heterologous to" an organism or a second nucleic acid molecule sequence if it originates from a foreign species, or, if from the same species, is modified from its original form. For example, a promoter operably linked to a heterologous coding sequence refers to a coding sequence from a species different from that from which the promoter was derived, or, if from the same species, a coding sequence which is not naturally associated with the promoter (e.g. a genetically engineered coding sequence or an allele from a different ecotype or variety). Suitable promoters can be derived from genes of the host cells where expression should occur or from pathogens for this host.
[0246] Purified: As used herein, the term "purified" refers to molecules, either nucleic or amino acid sequences that are removed from their natural environment, isolated or separated. "Substantially purified" molecules are at least 60% free, preferably at least 75% free, and more preferably at least 90% free from other components with which they are naturally associated. A purified nucleic acid sequence may be an isolated nucleic acid sequence.
[0247] Recombinant: The term "recombinant" with respect to nucleic acid molecules refers to nucleic acid molecules produced by recombinant DNA techniques. The term also comprises nucleic acid molecules which as such does not exist in nature but are modified, changed, mutated or otherwise manipulated by man. Preferably, a "recombinant nucleic acid molecule" is a non-naturally occurring nucleic acid molecule that differs in sequence from a naturally occurring nucleic acid molecule by at least one nucleic acid. A "recombinant nucleic acid molecule" may also comprise a "recombinant construct" which comprises, preferably operably linked, a sequence of nucleic acid molecules not naturally occurring in that order. Preferred methods for producing said recombinant nucleic acid molecule may comprise cloning techniques, directed or non-directed mutagenesis, synthesis or recombination techniques.
[0248] Significant increase: An increase for example in enzymatic activity, gene expression, productivity or yield of a certain product, that is larger than the margin of error inherent in the measurement technique, preferably an increase by about 10% or 25% preferably by 50% or 75%, more preferably 2-fold or-5 fold or greater of the activity, expression, productivity or yield of the control enzyme or expression in the control cell, productivity or yield of the control cell, even more preferably an increase by about 10-fold or greater.
[0249] Significant decrease: A decrease for example in enzymatic activity, gene expression, productivity or yield of a certain product, that is larger than the margin of error inherent in the measurement technique, preferably a decrease by at least about 5% or 10%, preferably by at least about 20% or 25%, more preferably by at least about 50% or 75%, even more preferably by at least about 80% or 85%, most preferably by at least about 90%, 95%, 97%, 98% or 99%.
[0250] Substantially complementary: In its broadest sense, the term "substantially complementary", when used herein with respect to a nucleotide sequence in relation to a reference or target nucleotide sequence, means a nucleotide sequence having a percentage of identity between the substantially complementary nucleotide sequence and the exact complementary sequence of said reference or target nucleotide sequence of at least 60%, more desirably at least 70%, more desirably at least 80% or 85%, preferably at least 90%, more preferably at least 93%, still more preferably at least 95% or 96%, yet still more preferably at least 97% or 98%, yet still more preferably at least 99% or most preferably 100% (the latter being equivalent to the term "identical" in this context). Preferably identity is assessed over a length of at least 19 nucleotides, preferably at least 50 nucleotides, more preferably the entire length of the nucleic acid sequence to said reference sequence (if not specified otherwise below). Sequence comparisons are carried out using default GAP analysis with the University of Wisconsin GCG, SEQWEB application of GAP, based on the algorithm of Needleman and Wunsch (Needleman and Wunsch (1970) J Mol. Biol. 48: 443-453; as defined above). A nucleotide sequence "substantially complementary" to a reference nucleotide sequence hybridizes to the reference nucleotide sequence under low stringency conditions, preferably medium stringency conditions, most preferably high stringency conditions (as defined above).
[0251] Transgene: The term "transgene" as used herein refers to any nucleic acid sequence, which is introduced into the genome of a cell by experimental manipulations. A transgene may be an "endogenous DNA sequence," or a "heterologous DNA sequence" (i.e., "foreign DNA"). The term "endogenous DNA sequence" refers to a nucleotide sequence, which is naturally found in the cell into which it is introduced so long as it does not contain some modification (e.g., a point mutation, the presence of a selectable marker gene, etc.) relative to the naturally-occurring sequence.
[0252] Transgenic: The term transgenic when referring to an organism means transformed, preferably stably transformed, with a recombinant DNA molecule that preferably comprises a suitable promoter operatively linked to a DNA sequence of interest.
[0253] Vector: As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting another nucleic acid molecule to which it has been linked. One type of vector is a genomic integrated vector, or "integrated vector", which can become integrated into the genomic DNA of the host cell. Another type of vector is an episomal vector, i.e., a plasmid or a nucleic acid molecule capable of extra-chromosomal replication. Vectors capable of directing the expression of genes to which they are operatively linked are referred to herein as "expression vectors". In the present specification, "plasmid" and "vector" are used interchangeably unless otherwise clear from the context.
[0254] Wild type: The term "wild type", "natural" or "natural origin" means with respect to an organism that said organism is not changed, mutated, or otherwise manipulated by man. With respect to a polypeptide or nucleic acid sequence, that the polypeptide or nucleic acid sequence is naturally occurring or available in at least one naturally occurring organism which is not changed, mutated, or otherwise manipulated by man. A wild type of a microorganism refers to a microorganism whose genome is present in a state as before the introduction of a genetic modification of a certain gene. The genetic modification may be e.g. a deletion of a gene or a part thereof or a point mutation or the introduction of a gene.
[0255] The terms "production" or "productivity" are art-recognized and include the concentration of the fermentation product (for example, alanine) formed within a given time and a given fermentation volume (e.g., kg product per hour per liter). The term "efficiency of production" includes the time required for a particular level of production to be achieved (for example, how long it takes for the cell to attain a particular rate of output of a fine chemical).
[0256] The term "yield" or "product/carbon yield" is art-recognized and includes the efficiency of the conversion of the carbon source into the product (i.e., fine chemical). This is generally written as, for example, kg product per kg carbon source. By increasing the yield or production of the compound, the quantity of recovered molecules or of useful recovered molecules of that compound in a given amount of culture over a given amount of time is increased.
[0257] The term "recombinant microorganism" includes microorganisms which have been genetically modified such that they exhibit an altered or different genotype and/or phenotype (e. g., when the genetic modification affects coding nucleic acid sequences of the microorganism) as compared to the wild type microorganism from which it was derived. A recombinant microorganism comprises at least one recombinant DNA molecule.
[0258] The term "recombinant" with respect to DNA refers to DNA molecules produced by man using recombinant DNA techniques. The term comprises DNA molecules which as such do not exist in nature or do not exist in the organism from which the DNA is derived, but are modified, changed, mutated or otherwise manipulated by man. Preferably, a "recombinant DNA molecule" is a non-naturally occurring nucleic acid molecule that differs in sequence from a naturally occurring nucleic acid molecule by at least one nucleic acid. A "recombinant DNA molecule" may also comprise a "recombinant construct" which comprises, preferably operably linked, a sequence of nucleic acid molecules not naturally occurring in that order. Preferred methods for producing said recombinant DNA molecule may comprise cloning techniques, directed or non-directed mutagenesis, gene synthesis or recombination techniques.
[0259] The term "directed evolution" is used synonymously with the term "metabolic evolution" herein and involves applying a selection pressure that favors the growth of mutants with the traits of interest. The selection pressure can be based on different culture conditions, ATP and growth coupled selection and redox related selection. The selection pressure can be carried out with batch fermentation with serial transferring inoculation or continuous culture with the same pressure.
[0260] An example of such a recombinant DNA is a plasmid into which a heterologous DNAsequence has been inserted or a gene or promoter which has been mutated compared to gene or promoter from which the recombinant DNA derived. The mutation may be introduced by means of directed mutagenesis technologies known in the art or by random mutagenesis technologies such as chemical, UV light or x-ray mutagenesis or directed evolution technologies.
[0261] The term "expression" or "gene expression" means the transcription of a specific gene(s) or specific genetic vector construct. The term "expression" or "gene expression" in particular means the transcription of gene(s) or genetic vector construct into mRNA. The process includes transcription of DNA and may include processing of the resulting RNA-product. The term "expression" or "gene expression" may also include the translation of the mRNA and therewith the synthesis of the encoded protein, i.e. protein expression.
[0262] FIG. 1
[0263] Alanine formation in a batch fermentation of E. coli QZ20 with empty plasmid control and F coliQZ20/pACYC-asd in 500 mL AM 1 medium with 120 g/L glucose. The fermentation was controlled at 37 C, 400 rpm, at pH 6.8 with 5 N NH.sub.4OH without aeration.
[0264] FIG. 2
[0265] Alanine formation in a batch fermentation of F col i QZ20 with empty plasmid control and F coliQZ20/pACYC-gdhA in 500 mL AM 1 medium with 120 g/L glucose. The fermentation was controlled at 37 C, 400 rpm, at pH 6.8 with 5 N NH.sub.4OH without aeration.
EXAMPLES
[0266] Chemicals and Common Methods
[0267] Unless indicated otherwise, cloning procedures carried out for the purposes of the present invention including restriction digest, agarose gel electrophoresis, purification of nucleic acids, ligation of nucleic acids, transformation, selection and cultivation of bacterial cells are performed as described (Sambrook et al., 1989). Sequence analyses of recombinant DNA are performed with a laser fluorescence DNA sequencer (Applied Biosystems, Foster City, Calif., USA) using the Sanger technology (Sanger et al., 1977).
[0268] Unless described otherwise, chemicals and reagents are obtained from Sigma Aldrich (Sigma Aldrich, St. Louis, USA), from Promega (Madison, Wis., USA), Duchefa (Haarlem, The Netherlands) or Invitrogen (Carlsbad, Calif., USA). Restriction endonucleases are from New England Biolabs (Ipswich, Mass., USA) or Roche Diagnostics GmbH (Penzberg, Germany). Oligonucleotides are synthesized by Eurofins MWG Operon (Ebersberg, Germany).
Example 1
[0269] E. coli W (LU17032) was engineered for L-alanine production by inactivation of the ackA, adhE, frdABCD and pflB ORFs and replacement of the IdhA ORF by a codon-optimized variant of the alaD gene (alaD-gstear).
[0270] The ackA, adhE, frdABCD and pflB ORFs were inactivated by insertion of an FRT-flanked kanamycin resistance cassette, followed by removal of the antibiotic resistance cassette by FLP recombination.
[0271] The IdhA gene was replaced by alaD-gstear and a downstream FRT-flanked zeocin resistance cassette, which was finally removed by FLP recombination.
[0272] The procedure has been described previously for example in WO2015/044818 which is hereby incorporated by reference.
[0273] Disruption of ackA has been described previously for example in Causey et al (2004), PNAS, 101 (8) pages 2235-2240.
[0274] Further, in said strain comprising an inactivated ackA, adhE, frdABCD, pflB and IdhA and an introduced alaD activity, the activity of the genes lpd, zipA and ygaW have been increased as described previously in WO2015/044818 and the activity of the alaD has been further enhanced by mutating the promoter controlling said gene as described previously in WO2015/044818.
[0275] Bacterial Culture
[0276] E. coli W (LU17032) was cultured in Luria-Bertani (LB) liquid medium or on Luria-Bertani solid medium. Occasionally, clones were passaged over M9 minimal agar containing 10 mM Sucrose to confirm W strain identity. Antibiotics were added to the liquid and solid media as appropriate, to final concentrations of 15 .mu.g/ml (kanamycin, chloramphenicol), 25 .mu.g/ml (zeocin) or 3 .mu.g/ml (tetracyclin).
[0277] Red/ET Recombination
[0278] Red/ET recombination was performed using standard protocols of Gene Bridges GmbH (www.genebridges.com). Briefly, Red/ET-proficient E. coli W was aerobically grown at 30.degree. C. to an OD600 nm of -0.3. Expression of red genes was induced by adding 50 .mu.I of 10% (w/v) L-arabinose, followed by a temperature increase to 37.degree. C. Arabinose was omitted from uninduced control cultures. After 35 min of incubation at 37.degree. C. the cells were washed twice with ice cold 10% (v/v) glycerol and electroporated with 500 ng of PCR product at 1.35 kV, 10 .mu.F, 6000. The cells were then resuspended in 1 ml ice-cold LB medium and aerobically grown at 37.degree. C. for approximately 1.5 h. Cultures were then plated on LB agar containing 15 .mu.g/ml kanamycin (knockouts) or 25 .mu.g/ml zeocin (knockin).
[0279] FLP Recombination
[0280] Flanking FRT sites allowed removal of antibiotic resistance markers by FLP recombination following modification of the E. coli chromosome. FLP recombination leaves a single FRT site (34 bp) as well as short flanking sequences (approx. 20 bp each) which are used as primer binding sites in the amplification of the cassettes.
[0281] To perform FLP recombination, plasmid 708-FLPe (Tab. 1) encoding FLP recombinase was introduced into the Red/ET recombinants by electroporation. KanR CmR or ZeoR CmR transformants were used to inoculate 0.2 ml LB cultures, which were incubated at 30.degree. C. for 3 h. FLP activity was then induced by a temperature shift to 37.degree. C., followed by a three-hour incubation at 37.degree. C. Single colonies obtained from these cultures were subsequently screened for a CmS and KanS or ZeoS phenotype.
[0282] DNA Preparation and Analysis
[0283] E. coli genomic DNA (gDNA) was isolated from overnight cultures with the Gentra Puregene Yeast/Bact. Kit B (Qiagen, Hilden, Germany). PCR products harbouring knockout or knockin cassettes were amplified from template plasmids with PRECISOR high-fidelity DNA polymerase (BioCat, Heidelberg) and analytical PCR reactions were performed with the PCR Extender System (5PRIME GmbH, Hamburg, Germany), according to the manufacturer's recommendations. PCR amplicons were purified using the GeneJET PCR Purification Kit or the GeneJET Gel Extraction Kit (Fermentas, St. Leon-Rot, Germany) and sequencing was performed by GATC BioTech (Konstanz, Germany) or BioSpring (Frankfurt am Main, Germany).
TABLE-US-00001 TABLE 1 Plasmids and primers Relevant characteristics/oligo sequences plasmids (5' .fwdarw. 3') Source pRed/ET red expression plasmid, Gene pSC101-based, Tc.sup.R Bridges 708-FLPe FLP recombinase expression plasmid, Gene pSC101-based, Cm.sup.R Bridges primers (BioSpring) SEQ ID NO pACYC-asd-F 35 pACYC-asd-R 36 pACYC-gdhA-F 40 pACYC-gdhA-R 41
Example 2 HPLC Detection and Quantification of Alanine
[0284] The following HPLC method for the alanine detection in the cell culture media was used:
[0285] Column: Aminex HPX-87C column (Bio-Rad), 300.times.7.8 mm, i.d. particle size 9 .mu.m
[0286] Mobile phase: Ca(NO3)2 at 0.1 mol/L 90%, Acetonitrile 10%
[0287] Flow rate: 0.6 mL/min
[0288] Column temperature: 60.degree. C.
[0289] Detection: Refractive index detector
[0290] Under above method, major estimated components in the cell culture sample matrix can be well separated from alanine, without interfering alanine's quantitation.
[0291] The amount of the alanine in the sample was determined by external standard calibration method. Standard samples containing alanine from 0.5 to 10.0 g/L were injected and the peak areas were used for calibration. Linear regression coefficient of the calibration curve was 0.9995.
[0292] Samples are injected once at 20 .mu.L. Peak areas are used to calculate the amount presenting in the sample by Waters LC Millenium software.
Example 3 HPLC Detection and Quantification of of Glucose, Succinate, Lactate, Formate, Acetate and Ethanol
[0293] HPLC Method Used
[0294] Column: Aminex HPX-87H column (Bio-Rad), 300 x7.8 mm, i.d. particle size 9 .mu.m
[0295] Mobile phase: H2SO4 4 mM
[0296] Flow rate: 0.4 mL/min
[0297] Column temperature: 45.degree. C.
[0298] Detection: Refractive index detector
[0299] The amount of the analytes was determined by external standard calibration method. Standard samples containing glucose from 0.1 to 38.0 g/L, succinate, lactate, formate, acetate and ethanol from 0.05 to 10.0 g/L were injected and the peak areas were used for calibration. Linear regression coefficients for all six calibration curves were better than 0.999.
[0300] Samples are injected once at 20 .mu.L. Peak areas are used to calculate the amount presenting in the sample by Waters LC Millenium software.
Example 4 Effect of the Increased Expression of the Asd Gene on Alanine Productivity
[0301] An additional copy of the asd gene (SEQ ID NO: 25) was introduced into the pACYC184 plasmid under the control of an IPTG-inducible Ptrc promoter. The vector, designated as pACYC-asd (SEQ ID NO: 34), was constructed via commercial InFusion cloning technology (Clontech, Mountain View, Calif., USA). The pACYC184 vector (NEB) was linearized with HindIII and SalI restriction endonucleases (NEB). The generated vector backbone was purified by agarose gel extraction. The asd gene was PCR amplified from wild-type E. coli W genomic DNA with primers asd-pACYC_F (SEQ ID NO: 35) and asd-pACYC_R (SEQ ID NO: 36). The primers contained additional 15 bp homologous overhangs to the vector backbone and a double-stranded DNA fragment with the Ptrc promoter that was synthesized by IDT. The amplified asd gene, the upstream Ptrc promoter and the pACYC184 vector backbone were cloned together according to the InFusion cloning manual. The resulting pACYC-asd vector was transformed into E. coli via electroporation and selected for on LB chloramphenicol plates. Positive constructs were confirmed by DNA sequencing.
[0302] The effect of asd overexpression on alanine productivity was tested by comparative cultivation of E. coli with an empty control pACYC vector and E. coli harbouring the asd overexpression plasmid pACYC-asd (SEQ ID NO: 34). Precultures were grown in shake flasks with LB medium, 20% filling volume at 37 C and 200 rpm overnight. The fermentation was performed in the DASGIP 1.5 L parallel bioreactor system (Eppendorf) in 500 mL AM 1 medium (2.6 g/L (NH4)2HPO4, 0.87 g/L NH4H2PO4, 0.15 g/L Kill, 0.37 g/L MgSO4.7 H2O, 15 g/L (NH4)2SO4, 1 mM betaine, 1 ml/L trace metal stock solution). The trace metal stock comprised 1.6 g/L FeCL3.6 H2O; 0.2 g/L CoCl2.6 H2O; 0.1 g/L CuCl2.2 H2O; 0.2 g/L ZnCl2; 0.2 g/L NaMoO4.2 H2O; 0.05 g/L H3BO3, 0.1 M HCL. 120 g/L Glucose were used as carbon source in the fermentation medium. 25 ug/mL chloramphenicol were added to stably maintain the plasmid. Expression of the asd gene from the Ptrc promoter was induced with 250 uM isopropyl .beta.-D-1-thiogalactopyranoside (IPTG) during the early logarithmic growth phase. Each strain was run in duplicates at 37 C and 400 rpm stirrer speed. 5N NH.sub.4OH was used to control the pH to 6.8 and provide the culture with ammonium as an alanine precursor throughout the fermentation. No air was sparged during the fermentation and the vessel was not pressurized so that after the initial consumption of dissolved oxygen in the medium by the cells the fermentation was run under microaerobic conditions. Samples were taken throughout the fermentation and analyzed by HPLC for alanine and glucose concentrations.
[0303] After 72 h of fermentation time E coliQZ20 in which the asd gene (SEQ ID NO: 25) was overexpressed from the pACYC-asd plasmid (SEQ ID NO: 34) reached a significantly higher L-alanine titer of 66.58.+-.0.50 g/L compared to the strain harbouring the empty control plasmid with 46.00.+-.1.85 g/L (FIG. 1).
Example 5 Effect of the Increased Expression of the gdhA Gene on Alanine Productivity
[0304] An additional copy of the gdhA gene (SEQ ID NO: 37) was introduced into the pACYC184 plasmid under the control of an IPTG-inducible Ptrc promoter. The vector, designated as pACYC-gdhA (SEQ ID NO: 39), was constructed via commercial InFusion cloning technology (Clontech, Mountain View, Calif., USA). The pACYC184 vector (NEB) was linearized with HindIII and SalI restriction endonucleases (NEB). The generated vector backbone was purified by agarose gel extraction. The gdhA gene was PCR amplified from wild-type E. coli W genomic DNA with primers gdhA-pACYC_F (SEQ ID NO: 40) and gdhA-pACYC_R (SEQ ID NO: 41). The primers contained additional 15 bp homologous overhangs to the vector backbone and a double-stranded DNA fragment with the Ptrc promoter that was synthesized by IDT. The amplified gdhA gene, the upstream Ptrc promoter and the pACYC184 vector backbone were cloned together according to the InFusion cloning manual. The resulting pACYC-gdhA vector was transformed into E. coli via electroporation and selected for on LB chloramphenicol plates. Positive constructs were confirmed by DNA sequencing.
[0305] The effect of gdhA overexpression on alanine productivity was tested by comparative cultivation of E. coli with an empty control pACYC vector and E. coli harbouring the gdhA overexpression plasmid pACYC-gdhA (SEQ ID NO: 39). Precultures were grown in shake flasks with LB medium, 20% filling volume at 37 C and 200 rpm overnight. The fermentation was performed in the DASGIP 1.5 L parallel bioreactor system (Eppendorf) in 500 mL AM 1 medium (2.6 g/L (NH4)2HPO4, 0.87 g/L NH4H2PO4, 0.15 g/L Kill, 0.37 g/L MgSO4.7 H2O, 15 g/L (NH4)2504, 1 mM betaine, 1 ml/L trace metal stock solution). The trace metal stock comprised 1.6 g/L FeCL3.6 H2O; 0.2 g/L CoCl2.6 H2O; 0.1 g/L CuCl2.2 H2O; 0.2 g/L ZnCl2; 0.2 g/L NaMoO4.2 H2O; 0.05 g/L H3BO3, 0.1 M HCL. 120 g/L Glucose were used as carbon source in the fermentation medium. 25 ug/mL chloramphenicol were added to stably maintain the plasmid. Expression of the gdhA gene from the Ptrc promoter was induced with 250 uM isopropyl .beta.-D-1-thiogalactopyranoside (IPTG) during the early logarithmic growth phase. Each strain was run in duplicates at 37 C and 400 rpm stirrer speed. 5N NH.sub.4OH was used to control the pH to 6.8 and provide the culture with ammonium as an alanine precursor throughout the fermentation. No air was sparged during the fermentation and the vessel was not pressurized so that after the initial consumption of dissolved oxygen in the medium by the cells the fermentation was run under microaerobic conditions. Samples were taken throughout the fermentation and analyzed by HPLC for alanine and glucose concentrations.
[0306] After 72 h of fermentation time E. coli QZ20 in which the gdhA gene (SEQ ID NO: 37) was overexpressed from the pACYC-gdhA plasmid (SEQ ID NO: 39) reached a significantly higher L-alanine titer of 62.18.+-.0.37 g/L compared to the strain harbouring the empty control plasmid with 50.96.+-.3.41 g/L (FIG. 2).
Sequence CWU
1
1
4111119DNAG.stearothermophilusmisc_feature(1)..(1119)alaD 1atgaaaattg
gcatccctaa agagattaag aacaatgaaa accgtgtagc aatcaccccg 60gcaggtgtta
tgactctggt taaagcgggc cacgatgtgt acgtcgaaac cgaagcgggt 120gccggcagcg
gcttcagcga cagcgagtat gagaaggcgg gtgcggttat tgtgactaag 180gcggaggacg
cttgggcagc cgaaatggtt ctgaaggtga aagaaccgct ggcggaggag 240tttcgctatt
ttcgtccggg tctgattttg ttcacctacc tgcacctggc tgcggccgag 300gcgctgacca
aggcactggt ggagcagaag gttgttggca tcgcgtacga aacggttcaa 360ctggcgaatg
gttccctgcc gctgctgacc cctatgtctg aagttgcggg tcgcatgagc 420gttcaagtcg
gcgctcagtt tctggagaaa ccgcacggtg gcaagggcat tttgctgggt 480ggtgttccgg
gtgtccgccg tggtaaagtg acgatcattg gcggtggtac ggccggtacg 540aacgcggcca
agattgccgt aggtctgggt gcagatgtga ccattctgga catcaacgcg 600gaacgtttgc
gtgagctgga cgacctgttt ggcgaccaag tcaccaccct gatgagcaac 660agctaccaca
tcgcggagtg cgtccgtgaa agcgatttgg tcgttggtgc ggtgctgatc 720ccgggtgcaa
aagccccgaa actggtgacc gaggagatgg tccgtagcat gaccccgggt 780tcggttctgg
tcgacgtggc aattgaccag ggcggtatct tcgaaaccac cgaccgcgtc 840acgacccatg
atgacccgac ctatgtgaaa catggcgtgg ttcactatgc ggtcgcgaat 900atgccgggtg
cagtgccgcg cacgtccacg ttcgcgctga cgaacgtgac gattccatac 960gctctgcaga
tcgccaataa gggctatcgt gcggcgtgtc tggataatcc ggcattgctg 1020aaaggcatca
ataccctgga tggtcatatc gtttacgagg ctgtggctgc agcacacaac 1080atgccgtaca
ctgatgtcca tagcttgctg caaggctaa
11192372PRTG.stearothermophilusMISC_FEATURE(1)..(372)alaD 2Met Lys Ile
Gly Ile Pro Lys Glu Ile Lys Asn Asn Glu Asn Arg Val1 5
10 15Ala Ile Thr Pro Ala Gly Val Met Thr
Leu Val Lys Ala Gly His Asp 20 25
30Val Tyr Val Glu Thr Glu Ala Gly Ala Gly Ser Gly Phe Ser Asp Ser
35 40 45Glu Tyr Glu Lys Ala Gly Ala
Val Ile Val Thr Lys Ala Glu Asp Ala 50 55
60Trp Ala Ala Glu Met Val Leu Lys Val Lys Glu Pro Leu Ala Glu Glu65
70 75 80Phe Arg Tyr Phe
Arg Pro Gly Leu Ile Leu Phe Thr Tyr Leu His Leu 85
90 95Ala Ala Ala Glu Ala Leu Thr Lys Ala Leu
Val Glu Gln Lys Val Val 100 105
110Gly Ile Ala Tyr Glu Thr Val Gln Leu Ala Asn Gly Ser Leu Pro Leu
115 120 125Leu Thr Pro Met Ser Glu Val
Ala Gly Arg Met Ser Val Gln Val Gly 130 135
140Ala Gln Phe Leu Glu Lys Pro His Gly Gly Lys Gly Ile Leu Leu
Gly145 150 155 160Gly Val
Pro Gly Val Arg Arg Gly Lys Val Thr Ile Ile Gly Gly Gly
165 170 175Thr Ala Gly Thr Asn Ala Ala
Lys Ile Ala Val Gly Leu Gly Ala Asp 180 185
190Val Thr Ile Leu Asp Ile Asn Ala Glu Arg Leu Arg Glu Leu
Asp Asp 195 200 205Leu Phe Gly Asp
Gln Val Thr Thr Leu Met Ser Asn Ser Tyr His Ile 210
215 220Ala Glu Cys Val Arg Glu Ser Asp Leu Val Val Gly
Ala Val Leu Ile225 230 235
240Pro Gly Ala Lys Ala Pro Lys Leu Val Thr Glu Glu Met Val Arg Ser
245 250 255Met Thr Pro Gly Ser
Val Leu Val Asp Val Ala Ile Asp Gln Gly Gly 260
265 270Ile Phe Glu Thr Thr Asp Arg Val Thr Thr His Asp
Asp Pro Thr Tyr 275 280 285Val Lys
His Gly Val Val His Tyr Ala Val Ala Asn Met Pro Gly Ala 290
295 300Val Pro Arg Thr Ser Thr Phe Ala Leu Thr Asn
Val Thr Ile Pro Tyr305 310 315
320Ala Leu Gln Ile Ala Asn Lys Gly Tyr Arg Ala Ala Cys Leu Asp Asn
325 330 335Pro Ala Leu Leu
Lys Gly Ile Asn Thr Leu Asp Gly His Ile Val Tyr 340
345 350Glu Ala Val Ala Ala Ala His Asn Met Pro Tyr
Thr Asp Val His Ser 355 360 365Leu
Leu Gln Gly 3703269DNAE.colimisc_feature(1)..(269)ldhA mut Promoter
3catagtaaat tcccccacca gtttaaccgg cggctgattt tcaaacgcga cgacatccag
60ttcgctgact gtaagttgtt gccctttcag ctggccttga aatttaactt tttcgccctg
120ataacgcagt tgctggatat cagaggttaa tgcgagagag agttttccct gccattcctg
180ccagggagaa aaaatcagtt tatcgatatt gattttgtaa aatattttta gtagcttaaa
240tgtgattcaa catcactgga gaaagtctt
269450DNAE.colimisc_feature(1)..(50)ldhA mut Promoter fragment
4tattgatttt gtaaaatatt tttagtagct taaatgtgat tcaacatcac
5052283DNAE.colimisc_feature(1)..(2283)pflB 5atgtccgagc ttaatgaaaa
gttagccaca gcctgggaag gttttaccaa aggtgactgg 60cagaatgaag taaacgtccg
tgacttcatt cagaaaaact acactccgta cgagggtgac 120gagtccttcc tggctggcgc
tactgaagcg accaccaccc tgtgggacaa agtaatggaa 180ggcgttaaac tggaaaaccg
cactcacgcg ccagttgact ttgacaccgc tgttgcttcc 240accatcacct ctcacgacgc
tggctacatc aacaaagcgt tggaaaaagt tgttggtcta 300cagactgaag ctccgctgaa
acgtgctctt atcccgttcg gtggtatcaa aatgatcgag 360ggttcctgca aagcgtacaa
ccgcgaactg gacccgatga tcaaaaaaat cttcactgaa 420taccgtaaaa ctcacaacca
gggcgtgttc gacgtttaca ctccggacat cctgcgttgc 480cgtaaatccg gtgttctgac
cggtctgcca gatgcttatg gccgtggccg tatcatcggt 540gactaccgtc gcgttgcgct
gtacggtatc gactacctga tgaaagacaa atacgctcag 600ttcacctctc tgcaggctga
tctggaaaac ggcgtaaacc tggaacagac tatccgtctg 660cgcgaagaaa tcgctgaaca
gcaccgcgct ctgggtcaga tgaaagaaat ggctgcgaaa 720tacggctacg acatctctgg
tccggctacc aacgctcagg aagctatcca gtggacttac 780ttcggctacc tggctgctgt
taagtctcag aacggtgctg caatgtcctt cggtcgtacc 840tccaccttcc tggatgtgta
catcgaacgt gacctgaaag ctggcaagat caccgaacaa 900gaagcgcagg aaatggttga
ccacctggtc atgaaactgc gtatggttcg cttcctgcgt 960actccggaat acgatgaact
gttctctggc gacccgatct gggcaaccga atctatcggt 1020ggtatgggcc tcgacggtcg
taccctggtt accaaaaaca gcttccgttt cctgaacacc 1080ctgtacacca tgggtccgtc
tccggaaccg aacatgacca ttctgtggtc tgaaaaactg 1140ccgctgaact tcaagaaatt
cgccgctaaa gtgtccatcg acacctcttc tctgcaatat 1200gagaacgatg acctgatgcg
tccggacttc aacaacgatg actacgctat tgcttgctgc 1260gtaagcccga tgatcgttgg
taaacaaatg cagttcttcg gtgcgcgtgc aaacctggcg 1320aaaaccatgc tgtacgcaat
caacggcggc gttgacgaaa aactgaaaat gcaggttggt 1380ccgaagtctg aaccgatcaa
aggcgatgtc ctgaactatg atgaagtgat ggagcgcatg 1440gatcacttca tggactggct
ggctaaacag tacatcactg cactgaacat catccactac 1500atgcacgaca agtacagcta
cgaagcctct ctgatggcgc tgcacgaccg tgacgttatc 1560cgcaccatgg cgtgtggtat
cgctggtctg tccgttgctg ctgactccct gtctgcaatc 1620aaatatgcga aagttaaacc
gattcgtgac gaagacggtc tggctatcga cttcgaaatc 1680gaaggcgaat acccgcagtt
tggtaacaat gatccgcgtg tagatgacct ggctgttgac 1740ctggtagaac gtttcatgaa
gaaaattcag aaactgcaca cctaccgtga cgctatcccg 1800actcagtctg ttctgaccat
cacttctaac gttgtgtatg gtaagaaaac tggtaacacc 1860ccagacggtc gtcgtgctgg
cgcgccgttc ggaccgggtg ctaacccgat gcacggtcgt 1920gaccagaaag gtgctgtagc
gtctctgact tccgttgcta aactaccgtt tgcttacgct 1980aaagatggta tctcctacac
cttctctatc gttccgaacg cactgggtaa agacgacgaa 2040gttcgtaaga ccaacctggc
tggtctgatg gatggttact tccaccacga agcatccatc 2100gaaggtggtc agcacctgaa
cgttaacgtg atgaaccgtg aaatgctgct cgacgcgatg 2160gaaaacccgg aaaaatatcc
gcagctgacc atccgtgtat ctggctacgc agtacgtttc 2220aactcgctga ctaaagaaca
gcagcaggac gttattactc gtaccttcac tcaatctatg 2280taa
22836760PRTE.coliMISC_FEATURE(1)..(760)pflB 6Met Ser Glu Leu Asn Glu Lys
Leu Ala Thr Ala Trp Glu Gly Phe Thr1 5 10
15Lys Gly Asp Trp Gln Asn Glu Val Asn Val Arg Asp Phe
Ile Gln Lys 20 25 30Asn Tyr
Thr Pro Tyr Glu Gly Asp Glu Ser Phe Leu Ala Gly Ala Thr 35
40 45Glu Ala Thr Thr Thr Leu Trp Asp Lys Val
Met Glu Gly Val Lys Leu 50 55 60Glu
Asn Arg Thr His Ala Pro Val Asp Phe Asp Thr Ala Val Ala Ser65
70 75 80Thr Ile Thr Ser His Asp
Ala Gly Tyr Ile Asn Lys Ala Leu Glu Lys 85
90 95Val Val Gly Leu Gln Thr Glu Ala Pro Leu Lys Arg
Ala Leu Ile Pro 100 105 110Phe
Gly Gly Ile Lys Met Ile Glu Gly Ser Cys Lys Ala Tyr Asn Arg 115
120 125Glu Leu Asp Pro Met Ile Lys Lys Ile
Phe Thr Glu Tyr Arg Lys Thr 130 135
140His Asn Gln Gly Val Phe Asp Val Tyr Thr Pro Asp Ile Leu Arg Cys145
150 155 160Arg Lys Ser Gly
Val Leu Thr Gly Leu Pro Asp Ala Tyr Gly Arg Gly 165
170 175Arg Ile Ile Gly Asp Tyr Arg Arg Val Ala
Leu Tyr Gly Ile Asp Tyr 180 185
190Leu Met Lys Asp Lys Tyr Ala Gln Phe Thr Ser Leu Gln Ala Asp Leu
195 200 205Glu Asn Gly Val Asn Leu Glu
Gln Thr Ile Arg Leu Arg Glu Glu Ile 210 215
220Ala Glu Gln His Arg Ala Leu Gly Gln Met Lys Glu Met Ala Ala
Lys225 230 235 240Tyr Gly
Tyr Asp Ile Ser Gly Pro Ala Thr Asn Ala Gln Glu Ala Ile
245 250 255Gln Trp Thr Tyr Phe Gly Tyr
Leu Ala Ala Val Lys Ser Gln Asn Gly 260 265
270Ala Ala Met Ser Phe Gly Arg Thr Ser Thr Phe Leu Asp Val
Tyr Ile 275 280 285Glu Arg Asp Leu
Lys Ala Gly Lys Ile Thr Glu Gln Glu Ala Gln Glu 290
295 300Met Val Asp His Leu Val Met Lys Leu Arg Met Val
Arg Phe Leu Arg305 310 315
320Thr Pro Glu Tyr Asp Glu Leu Phe Ser Gly Asp Pro Ile Trp Ala Thr
325 330 335Glu Ser Ile Gly Gly
Met Gly Leu Asp Gly Arg Thr Leu Val Thr Lys 340
345 350Asn Ser Phe Arg Phe Leu Asn Thr Leu Tyr Thr Met
Gly Pro Ser Pro 355 360 365Glu Pro
Asn Met Thr Ile Leu Trp Ser Glu Lys Leu Pro Leu Asn Phe 370
375 380Lys Lys Phe Ala Ala Lys Val Ser Ile Asp Thr
Ser Ser Leu Gln Tyr385 390 395
400Glu Asn Asp Asp Leu Met Arg Pro Asp Phe Asn Asn Asp Asp Tyr Ala
405 410 415Ile Ala Cys Cys
Val Ser Pro Met Ile Val Gly Lys Gln Met Gln Phe 420
425 430Phe Gly Ala Arg Ala Asn Leu Ala Lys Thr Met
Leu Tyr Ala Ile Asn 435 440 445Gly
Gly Val Asp Glu Lys Leu Lys Met Gln Val Gly Pro Lys Ser Glu 450
455 460Pro Ile Lys Gly Asp Val Leu Asn Tyr Asp
Glu Val Met Glu Arg Met465 470 475
480Asp His Phe Met Asp Trp Leu Ala Lys Gln Tyr Ile Thr Ala Leu
Asn 485 490 495Ile Ile His
Tyr Met His Asp Lys Tyr Ser Tyr Glu Ala Ser Leu Met 500
505 510Ala Leu His Asp Arg Asp Val Ile Arg Thr
Met Ala Cys Gly Ile Ala 515 520
525Gly Leu Ser Val Ala Ala Asp Ser Leu Ser Ala Ile Lys Tyr Ala Lys 530
535 540Val Lys Pro Ile Arg Asp Glu Asp
Gly Leu Ala Ile Asp Phe Glu Ile545 550
555 560Glu Gly Glu Tyr Pro Gln Phe Gly Asn Asn Asp Pro
Arg Val Asp Asp 565 570
575Leu Ala Val Asp Leu Val Glu Arg Phe Met Lys Lys Ile Gln Lys Leu
580 585 590His Thr Tyr Arg Asp Ala
Ile Pro Thr Gln Ser Val Leu Thr Ile Thr 595 600
605Ser Asn Val Val Tyr Gly Lys Lys Thr Gly Asn Thr Pro Asp
Gly Arg 610 615 620Arg Ala Gly Ala Pro
Phe Gly Pro Gly Ala Asn Pro Met His Gly Arg625 630
635 640Asp Gln Lys Gly Ala Val Ala Ser Leu Thr
Ser Val Ala Lys Leu Pro 645 650
655Phe Ala Tyr Ala Lys Asp Gly Ile Ser Tyr Thr Phe Ser Ile Val Pro
660 665 670Asn Ala Leu Gly Lys
Asp Asp Glu Val Arg Lys Thr Asn Leu Ala Gly 675
680 685Leu Met Asp Gly Tyr Phe His His Glu Ala Ser Ile
Glu Gly Gly Gln 690 695 700His Leu Asn
Val Asn Val Met Asn Arg Glu Met Leu Leu Asp Ala Met705
710 715 720Glu Asn Pro Glu Lys Tyr Pro
Gln Leu Thr Ile Arg Val Ser Gly Tyr 725
730 735Ala Val Arg Phe Asn Ser Leu Thr Lys Glu Gln Gln
Gln Asp Val Ile 740 745 750Thr
Arg Thr Phe Thr Gln Ser Met 755
76072676DNAE.colimisc_feature(1)..(2676)adhE 7atggctgtta ctaatgtcgc
tgaacttaac gcactcgtag agcgtgtaaa aaaagcccag 60cgtgaatatg ccagtttcac
tcaagagcaa gtagacaaaa tcttccgcgc cgccgctctg 120gctgctgcag atgctcgaat
cccactcgcg aaaatggccg ttgccgaatc cggcatgggt 180atcgtcgaag ataaagtgat
caaaaaccac tttgcttctg aatatatcta caacgcctat 240aaagatgaaa aaacctgtgg
tgttctgtct gaagacgaca cttttggtac catcactatc 300gctgaaccaa tcggtattat
ttgcggtatc gttccgacca ctaacccgac ttcaactgct 360atcttcaaat cgctgatcag
tctgaagacc cgtaacgcca ttatcttctc cccgcacccg 420cgtgcaaaag atgccaccaa
caaagcggct gatatcgttc tgcaggctgc tatcgctgcc 480ggtgctccga aagatctgat
cggctggatc gatcaacctt ctgttgaact gtctaacgca 540ctgatgcacc acccagacat
caacctgatc ctcgcgactg gtggtccggg catggttaaa 600gccgcataca gctccggtaa
accagctatc ggtgtaggcg cgggcaacac tccagttgtt 660atcgatgaaa ctgctgatat
caaacgtgca gttgcatctg tactgatgtc caaaaccttc 720gacaacggcg taatctgtgc
ttctgaacag tctgttgttg ttgttgactc tgtttatgac 780gctgtacgtg aacgttttgc
aacccacggc ggctatctgt tgcagggtaa agagctgaaa 840gctgttcagg atgttatcct
gaaaaacggt gcgctgaacg cggctatcgt tggtcagcca 900gcctataaaa ttgctgaact
ggcaggcttc tctgtaccag aaaacaccaa gattctgatc 960ggtgaagtga ccgttgttga
tgaaagcgaa ccgttcgcac atgaaaaact gtccccgact 1020ctggcaatgt accgcgctaa
agatttcgaa gacgcggtag aaaaagcaga gaaactggtt 1080gctatgggcg gtatcggtca
tacctcttgc ctgtacactg accaggataa ccaaccggct 1140cgcgtttctt acttcggtca
gaaaatgaaa acggctcgta tcctgattaa caccccagcg 1200tctcagggtg gtatcggtga
cctgtataac ttcaaactcg caccttccct gactctgggt 1260tgtggttctt ggggtggtaa
ctccatctct gaaaacgttg gtccgaaaca cctgatcaac 1320aagaaaaccg ttgctaagcg
agctgaaaac atgttgtggc acaaacttcc gaaatctatc 1380tacttccgcc gtggctccct
gccaatcgcg ctggatgaag tgattactga tggccacaaa 1440cgtgcgctca tcgtgactga
ccgcttcctg ttcaacaatg gttatgctga tcagatcact 1500tccgtactga aagcagcagg
cgttgaaact gaagtcttct tcgaagtaga agcggacccg 1560accctgagca tcgttcgtaa
aggtgcagaa ctggcaaact ccttcaaacc agacgtgatt 1620atcgcgctgg gtggtggttc
cccgatggac gccgcgaaga tcatgtgggt tatgtacgaa 1680catccggaaa ctcacttcga
agagctggcg ctgcgcttta tggatatccg taaacgtatc 1740tacaagttcc cgaaaatggg
cgtgaaagcg aaaatgatcg ctgtcaccac cacttctggt 1800acaggttctg aagtcactcc
gtttgcggtt gtaactgacg acgctactgg tcagaaatat 1860ccgctggcag actatgcgct
gactccggat atggcgattg tcgacgccaa cctggttatg 1920gacatgccga agtccctgtg
tgctttcggt ggtctggacg cagtaactca cgccatggaa 1980gcttatgttt ctgtactggc
atctgagttc tctgatggtc aggctctgca ggcactgaaa 2040ctgctgaaag aatatctgcc
agcgtcctac cacgaagggt ctaaaaatcc ggtagcgcgt 2100gaacgtgttc acagtgcagc
gactatcgcg ggtatcgcgt ttgcgaacgc cttcctgggt 2160gtatgtcact caatggcgca
caaactgggt tcccagttcc atattccgca cggtctggca 2220aacgccctgc tgatttgtaa
cgttattcgc tacaatgcga acgacaaccc gaccaagcag 2280actgcattca gccagtatga
ccgtccgcag gctcgccgtc gttatgctga aattgccgac 2340cacttgggtc tgagcgcacc
gggcgaccgt actgctgcta agatcgagaa actgctggca 2400tggctggaaa cgctgaaagc
tgaactgggt attccgaaat ctatccgtga agctggcgtt 2460caggaagcag acttcctggc
gaacgtggat aaactgtctg aagatgcgtt cgatgaccag 2520tgcaccggcg ctaacccgcg
ttacccgctg atctccgagc tgaaacagat cctgctggat 2580acctactacg gtcgtgatta
tgtagaaggt gaaactgcag cgaaaaaaga agccgctccg 2640gctaaagctg agaaaaaagc
gaaaaaatcc gcttaa
26768891PRTE.coliMISC_FEATURE(1)..(891)adhE 8Met Ala Val Thr Asn Val Ala
Glu Leu Asn Ala Leu Val Glu Arg Val1 5 10
15Lys Lys Ala Gln Arg Glu Tyr Ala Ser Phe Thr Gln Glu
Gln Val Asp 20 25 30Lys Ile
Phe Arg Ala Ala Ala Leu Ala Ala Ala Asp Ala Arg Ile Pro 35
40 45Leu Ala Lys Met Ala Val Ala Glu Ser Gly
Met Gly Ile Val Glu Asp 50 55 60Lys
Val Ile Lys Asn His Phe Ala Ser Glu Tyr Ile Tyr Asn Ala Tyr65
70 75 80Lys Asp Glu Lys Thr Cys
Gly Val Leu Ser Glu Asp Asp Thr Phe Gly 85
90 95Thr Ile Thr Ile Ala Glu Pro Ile Gly Ile Ile Cys
Gly Ile Val Pro 100 105 110Thr
Thr Asn Pro Thr Ser Thr Ala Ile Phe Lys Ser Leu Ile Ser Leu 115
120 125Lys Thr Arg Asn Ala Ile Ile Phe Ser
Pro His Pro Arg Ala Lys Asp 130 135
140Ala Thr Asn Lys Ala Ala Asp Ile Val Leu Gln Ala Ala Ile Ala Ala145
150 155 160Gly Ala Pro Lys
Asp Leu Ile Gly Trp Ile Asp Gln Pro Ser Val Glu 165
170 175Leu Ser Asn Ala Leu Met His His Pro Asp
Ile Asn Leu Ile Leu Ala 180 185
190Thr Gly Gly Pro Gly Met Val Lys Ala Ala Tyr Ser Ser Gly Lys Pro
195 200 205Ala Ile Gly Val Gly Ala Gly
Asn Thr Pro Val Val Ile Asp Glu Thr 210 215
220Ala Asp Ile Lys Arg Ala Val Ala Ser Val Leu Met Ser Lys Thr
Phe225 230 235 240Asp Asn
Gly Val Ile Cys Ala Ser Glu Gln Ser Val Val Val Val Asp
245 250 255Ser Val Tyr Asp Ala Val Arg
Glu Arg Phe Ala Thr His Gly Gly Tyr 260 265
270Leu Leu Gln Gly Lys Glu Leu Lys Ala Val Gln Asp Val Ile
Leu Lys 275 280 285Asn Gly Ala Leu
Asn Ala Ala Ile Val Gly Gln Pro Ala Tyr Lys Ile 290
295 300Ala Glu Leu Ala Gly Phe Ser Val Pro Glu Asn Thr
Lys Ile Leu Ile305 310 315
320Gly Glu Val Thr Val Val Asp Glu Ser Glu Pro Phe Ala His Glu Lys
325 330 335Leu Ser Pro Thr Leu
Ala Met Tyr Arg Ala Lys Asp Phe Glu Asp Ala 340
345 350Val Glu Lys Ala Glu Lys Leu Val Ala Met Gly Gly
Ile Gly His Thr 355 360 365Ser Cys
Leu Tyr Thr Asp Gln Asp Asn Gln Pro Ala Arg Val Ser Tyr 370
375 380Phe Gly Gln Lys Met Lys Thr Ala Arg Ile Leu
Ile Asn Thr Pro Ala385 390 395
400Ser Gln Gly Gly Ile Gly Asp Leu Tyr Asn Phe Lys Leu Ala Pro Ser
405 410 415Leu Thr Leu Gly
Cys Gly Ser Trp Gly Gly Asn Ser Ile Ser Glu Asn 420
425 430Val Gly Pro Lys His Leu Ile Asn Lys Lys Thr
Val Ala Lys Arg Ala 435 440 445Glu
Asn Met Leu Trp His Lys Leu Pro Lys Ser Ile Tyr Phe Arg Arg 450
455 460Gly Ser Leu Pro Ile Ala Leu Asp Glu Val
Ile Thr Asp Gly His Lys465 470 475
480Arg Ala Leu Ile Val Thr Asp Arg Phe Leu Phe Asn Asn Gly Tyr
Ala 485 490 495Asp Gln Ile
Thr Ser Val Leu Lys Ala Ala Gly Val Glu Thr Glu Val 500
505 510Phe Phe Glu Val Glu Ala Asp Pro Thr Leu
Ser Ile Val Arg Lys Gly 515 520
525Ala Glu Leu Ala Asn Ser Phe Lys Pro Asp Val Ile Ile Ala Leu Gly 530
535 540Gly Gly Ser Pro Met Asp Ala Ala
Lys Ile Met Trp Val Met Tyr Glu545 550
555 560His Pro Glu Thr His Phe Glu Glu Leu Ala Leu Arg
Phe Met Asp Ile 565 570
575Arg Lys Arg Ile Tyr Lys Phe Pro Lys Met Gly Val Lys Ala Lys Met
580 585 590Ile Ala Val Thr Thr Thr
Ser Gly Thr Gly Ser Glu Val Thr Pro Phe 595 600
605Ala Val Val Thr Asp Asp Ala Thr Gly Gln Lys Tyr Pro Leu
Ala Asp 610 615 620Tyr Ala Leu Thr Pro
Asp Met Ala Ile Val Asp Ala Asn Leu Val Met625 630
635 640Asp Met Pro Lys Ser Leu Cys Ala Phe Gly
Gly Leu Asp Ala Val Thr 645 650
655His Ala Met Glu Ala Tyr Val Ser Val Leu Ala Ser Glu Phe Ser Asp
660 665 670Gly Gln Ala Leu Gln
Ala Leu Lys Leu Leu Lys Glu Tyr Leu Pro Ala 675
680 685Ser Tyr His Glu Gly Ser Lys Asn Pro Val Ala Arg
Glu Arg Val His 690 695 700Ser Ala Ala
Thr Ile Ala Gly Ile Ala Phe Ala Asn Ala Phe Leu Gly705
710 715 720Val Cys His Ser Met Ala His
Lys Leu Gly Ser Gln Phe His Ile Pro 725
730 735His Gly Leu Ala Asn Ala Leu Leu Ile Cys Asn Val
Ile Arg Tyr Asn 740 745 750Ala
Asn Asp Asn Pro Thr Lys Gln Thr Ala Phe Ser Gln Tyr Asp Arg 755
760 765Pro Gln Ala Arg Arg Arg Tyr Ala Glu
Ile Ala Asp His Leu Gly Leu 770 775
780Ser Ala Pro Gly Asp Arg Thr Ala Ala Lys Ile Glu Lys Leu Leu Ala785
790 795 800Trp Leu Glu Thr
Leu Lys Ala Glu Leu Gly Ile Pro Lys Ser Ile Arg 805
810 815Glu Ala Gly Val Gln Glu Ala Asp Phe Leu
Ala Asn Val Asp Lys Leu 820 825
830Ser Glu Asp Ala Phe Asp Asp Gln Cys Thr Gly Ala Asn Pro Arg Tyr
835 840 845Pro Leu Ile Ser Glu Leu Lys
Gln Ile Leu Leu Asp Thr Tyr Tyr Gly 850 855
860Arg Asp Tyr Val Glu Gly Glu Thr Ala Ala Lys Lys Glu Ala Ala
Pro865 870 875 880Ala Lys
Ala Glu Lys Lys Ala Lys Lys Ser Ala 885
8909990DNAE.colimisc_feature(1)..(990)ldhA 9atgaaactcg ccgtttatag
cacaaaacag tacgacaaga agtacctgca acaggtgaac 60gagtcctttg gctttgagct
ggaatttttt gactttctgc tgacggaaaa aaccgctaaa 120actgccaatg gctgcgaagc
ggtatgtatt ttcgtaaacg atgacggcag ccgcccggtg 180ctggaagagc tgaaaaagca
cggcgttaaa tatatcgccc tgcgctgtgc cggtttcaat 240aacgtcgacc ttgacgcggc
aaaagaactg gggctgaaag tagtccgtgt tccagcctat 300gatccagagg ccgttgctga
acacgccatc ggtatgatga tgacgctgaa ccgccgtatt 360caccgcgcgt atcagcgtac
ccgtgacgct aacttctctc tggaaggtct gaccggcttt 420actatgtatg gcaaaacggc
aggcgttatc ggtaccggta aaatcggtgt ggcgatgctg 480cgcattctga aaggttttgg
tatgcgtctg ctggcgttcg atccgtatcc aagtgcagcg 540gcgctggaac tcggtgtgga
gtatgtcgat ctgccaaccc tgttctctga atcagacgtt 600atctctctgc actgcccgct
gacaccggaa aactaccatc tgttgaacga agccgccttc 660gatcagatga aaaatggcgt
gatgatcgtc aataccagtc gcggtgcatt gattgattct 720caggcagcaa ttgaagcgct
gaaaaatcag aaaattggtt cgttgggtat ggacgtgtat 780gagaacgaac gcgatctatt
ctttgaagat aaatccaacg acgtaattca ggatgacgta 840ttccgtcgcc tgtctgcctg
ccacaacgtg ctatttaccg ggcaccaggc attcctgaca 900gcagaagctc tgaccagtat
ttctcagact acgctgcaaa acttaagcaa tctggaaaaa 960ggcgaaacct gcccgaacga
actggtttaa
99010329PRTE.coliMISC_FEATURE(1)..(329)ldhA 10Met Lys Leu Ala Val Tyr Ser
Thr Lys Gln Tyr Asp Lys Lys Tyr Leu1 5 10
15Gln Gln Val Asn Glu Ser Phe Gly Phe Glu Leu Glu Phe
Phe Asp Phe 20 25 30Leu Leu
Thr Glu Lys Thr Ala Lys Thr Ala Asn Gly Cys Glu Ala Val 35
40 45Cys Ile Phe Val Asn Asp Asp Gly Ser Arg
Pro Val Leu Glu Glu Leu 50 55 60Lys
Lys His Gly Val Lys Tyr Ile Ala Leu Arg Cys Ala Gly Phe Asn65
70 75 80Asn Val Asp Leu Asp Ala
Ala Lys Glu Leu Gly Leu Lys Val Val Arg 85
90 95Val Pro Ala Tyr Asp Pro Glu Ala Val Ala Glu His
Ala Ile Gly Met 100 105 110Met
Met Thr Leu Asn Arg Arg Ile His Arg Ala Tyr Gln Arg Thr Arg 115
120 125Asp Ala Asn Phe Ser Leu Glu Gly Leu
Thr Gly Phe Thr Met Tyr Gly 130 135
140Lys Thr Ala Gly Val Ile Gly Thr Gly Lys Ile Gly Val Ala Met Leu145
150 155 160Arg Ile Leu Lys
Gly Phe Gly Met Arg Leu Leu Ala Phe Asp Pro Tyr 165
170 175Pro Ser Ala Ala Ala Leu Glu Leu Gly Val
Glu Tyr Val Asp Leu Pro 180 185
190Thr Leu Phe Ser Glu Ser Asp Val Ile Ser Leu His Cys Pro Leu Thr
195 200 205Pro Glu Asn Tyr His Leu Leu
Asn Glu Ala Ala Phe Asp Gln Met Lys 210 215
220Asn Gly Val Met Ile Val Asn Thr Ser Arg Gly Ala Leu Ile Asp
Ser225 230 235 240Gln Ala
Ala Ile Glu Ala Leu Lys Asn Gln Lys Ile Gly Ser Leu Gly
245 250 255Met Asp Val Tyr Glu Asn Glu
Arg Asp Leu Phe Phe Glu Asp Lys Ser 260 265
270Asn Asp Val Ile Gln Asp Asp Val Phe Arg Arg Leu Ser Ala
Cys His 275 280 285Asn Val Leu Phe
Thr Gly His Gln Ala Phe Leu Thr Ala Glu Ala Leu 290
295 300Thr Ser Ile Ser Gln Thr Thr Leu Gln Asn Leu Ser
Asn Leu Glu Lys305 310 315
320Gly Glu Thr Cys Pro Asn Glu Leu Val
325112145DNAE.colimisc_feature(1)..(2145)pta 11gtgtcccgta ttattatgct
gatccctacc ggaaccagcg tcggtctgac cagcgtcagc 60cttggcgtga tccgtgcaat
ggaacgcaaa ggcgttcgtc tgagcgtttt caaacctatc 120gctcagccgc gtaccggtgg
cgatgcgccc gatcagacta cgactatcgt gcgtgcgaac 180tcttccacca cgacggccgc
tgaaccgctg aaaatgagct acgttgaagg tctgctttcc 240agcaatcaga aagatgtgct
gatggaagag atcgtcgcaa actaccacgc taacaccaaa 300gacgctgaag tcgttctggt
tgaaggtctg gtcccgacac gtaagcacca gtttgcccag 360tctctgaact acgaaatcgc
taaaacgctg aatgcggaaa tcgtcttcgt tatgtctcag 420ggcactgaca ccccggaaca
gctgaaagag cgtatcgaac tgacccgcaa cagcttcggc 480ggtgccaaaa acaccaacat
caccggcgtt atcgttaaca aactgaacgc accggttgat 540gaacagggtc gtactcgccc
ggatctgtcc gagattttcg acgactcttc caaagctaaa 600gtaaacaatg ttgatccggc
gaagctgcaa gaatccagcc cgctgccggt tctcggcgct 660gtgccgtgga gctttgacct
gatcgcgact cgtgcgatcg atatggctcg ccacctgaat 720gcgaccatca tcaacgaagg
cgacatcaat actcgccgcg ttaaatccgt cactttctgc 780gcacgcagca ttccgcacat
gctggagcac ttccgtgccg gttctctgct ggtgacttcc 840gcagaccgtc ctgacgtgct
ggtggccgct tgcctggcag ccatgaacgg cgtagaaatc 900ggtgccctgc tgctgactgg
cggctacgaa atggacgcgc gcatttctaa actgtgcgaa 960cgtgctttcg ctaccggcct
gccggtattt atggtgaaca ccaacacctg gcagacctct 1020ctgagcctgc agagcttcaa
cctggaagtt ccggttgacg atcacgagcg tatcgagaaa 1080gttcaggaat acgttgctaa
ctacatcaac gctgactgga tcgaatctct gactgccact 1140tctgagcgca gccgtcgtct
gtctccgcct gcgttccgtt atcagctgac tgaacttgcg 1200cgcaaagcgg gcaaacgtat
cgtactgccg gaaggtgacg aaccgcgtac cgttaaagca 1260gccgctatct gtgctgaacg
tggtatcgca acttgcgtac tgctgggtaa tccggcagag 1320atcaaccgtg ttgcagcgtc
tcagggtgta gaactgggtg cagggattga aatcgttgat 1380ccagaagtgg ttcgcgaaag
ctatgttggt cgtctggtcg aactgcgtaa gaacaaaggc 1440atgaccgaaa ccgttgcccg
cgaacagctg gaagacaacg tggtgctcgg tacgctgatg 1500ctggaacagg atgaagttga
tggtctggtt tccggtgctg ttcacactac cgcaaacacc 1560atccgtccgc cgctgcagct
gatcaaaact gcaccgggca gctccctggt atcttccgtg 1620ttcttcatgc tgctgccgga
acaggtttac gtttacggtg actgtgcgat caacccggat 1680ccgaccgctg aacagctggc
agaaatcgcg attcagtccg ctgattccgc tgcggccttc 1740ggtatcgaac cgcgcgttgc
tatgctctcc tactccaccg gtacttctgg tgcaggtagc 1800gacgtagaaa aagttcgcga
agcaactcgt ctggcgcagg aaaaacgtcc tgacctgatg 1860atcgacggtc cgctgcagta
cgacgctgcg gtaatggctg acgttgcgaa atccaaagcg 1920ccgaactctc cggttgcagg
tcgcgctacc gtgttcatct tcccggatct gaacaccggt 1980aacaccacct acaaagcggt
acagcgttct gccgacctga tctccatcgg gccgatgctg 2040cagggtatgc gcaagccggt
taacgacctg tcccgtggcg cactggttga cgatatcgtc 2100tacaccatcg cgctgactgc
gattcagtct gcacagcagc agtaa
214512714PRTE.coliMISC_FEATURE(1)..(714)pta 12Met Ser Arg Ile Ile Met Leu
Ile Pro Thr Gly Thr Ser Val Gly Leu1 5 10
15Thr Ser Val Ser Leu Gly Val Ile Arg Ala Met Glu Arg
Lys Gly Val 20 25 30Arg Leu
Ser Val Phe Lys Pro Ile Ala Gln Pro Arg Thr Gly Gly Asp 35
40 45Ala Pro Asp Gln Thr Thr Thr Ile Val Arg
Ala Asn Ser Ser Thr Thr 50 55 60Thr
Ala Ala Glu Pro Leu Lys Met Ser Tyr Val Glu Gly Leu Leu Ser65
70 75 80Ser Asn Gln Lys Asp Val
Leu Met Glu Glu Ile Val Ala Asn Tyr His 85
90 95Ala Asn Thr Lys Asp Ala Glu Val Val Leu Val Glu
Gly Leu Val Pro 100 105 110Thr
Arg Lys His Gln Phe Ala Gln Ser Leu Asn Tyr Glu Ile Ala Lys 115
120 125Thr Leu Asn Ala Glu Ile Val Phe Val
Met Ser Gln Gly Thr Asp Thr 130 135
140Pro Glu Gln Leu Lys Glu Arg Ile Glu Leu Thr Arg Asn Ser Phe Gly145
150 155 160Gly Ala Lys Asn
Thr Asn Ile Thr Gly Val Ile Val Asn Lys Leu Asn 165
170 175Ala Pro Val Asp Glu Gln Gly Arg Thr Arg
Pro Asp Leu Ser Glu Ile 180 185
190Phe Asp Asp Ser Ser Lys Ala Lys Val Asn Asn Val Asp Pro Ala Lys
195 200 205Leu Gln Glu Ser Ser Pro Leu
Pro Val Leu Gly Ala Val Pro Trp Ser 210 215
220Phe Asp Leu Ile Ala Thr Arg Ala Ile Asp Met Ala Arg His Leu
Asn225 230 235 240Ala Thr
Ile Ile Asn Glu Gly Asp Ile Asn Thr Arg Arg Val Lys Ser
245 250 255Val Thr Phe Cys Ala Arg Ser
Ile Pro His Met Leu Glu His Phe Arg 260 265
270Ala Gly Ser Leu Leu Val Thr Ser Ala Asp Arg Pro Asp Val
Leu Val 275 280 285Ala Ala Cys Leu
Ala Ala Met Asn Gly Val Glu Ile Gly Ala Leu Leu 290
295 300Leu Thr Gly Gly Tyr Glu Met Asp Ala Arg Ile Ser
Lys Leu Cys Glu305 310 315
320Arg Ala Phe Ala Thr Gly Leu Pro Val Phe Met Val Asn Thr Asn Thr
325 330 335Trp Gln Thr Ser Leu
Ser Leu Gln Ser Phe Asn Leu Glu Val Pro Val 340
345 350Asp Asp His Glu Arg Ile Glu Lys Val Gln Glu Tyr
Val Ala Asn Tyr 355 360 365Ile Asn
Ala Asp Trp Ile Glu Ser Leu Thr Ala Thr Ser Glu Arg Ser 370
375 380Arg Arg Leu Ser Pro Pro Ala Phe Arg Tyr Gln
Leu Thr Glu Leu Ala385 390 395
400Arg Lys Ala Gly Lys Arg Ile Val Leu Pro Glu Gly Asp Glu Pro Arg
405 410 415Thr Val Lys Ala
Ala Ala Ile Cys Ala Glu Arg Gly Ile Ala Thr Cys 420
425 430Val Leu Leu Gly Asn Pro Ala Glu Ile Asn Arg
Val Ala Ala Ser Gln 435 440 445Gly
Val Glu Leu Gly Ala Gly Ile Glu Ile Val Asp Pro Glu Val Val 450
455 460Arg Glu Ser Tyr Val Gly Arg Leu Val Glu
Leu Arg Lys Asn Lys Gly465 470 475
480Met Thr Glu Thr Val Ala Arg Glu Gln Leu Glu Asp Asn Val Val
Leu 485 490 495Gly Thr Leu
Met Leu Glu Gln Asp Glu Val Asp Gly Leu Val Ser Gly 500
505 510Ala Val His Thr Thr Ala Asn Thr Ile Arg
Pro Pro Leu Gln Leu Ile 515 520
525Lys Thr Ala Pro Gly Ser Ser Leu Val Ser Ser Val Phe Phe Met Leu 530
535 540Leu Pro Glu Gln Val Tyr Val Tyr
Gly Asp Cys Ala Ile Asn Pro Asp545 550
555 560Pro Thr Ala Glu Gln Leu Ala Glu Ile Ala Ile Gln
Ser Ala Asp Ser 565 570
575Ala Ala Ala Phe Gly Ile Glu Pro Arg Val Ala Met Leu Ser Tyr Ser
580 585 590Thr Gly Thr Ser Gly Ala
Gly Ser Asp Val Glu Lys Val Arg Glu Ala 595 600
605Thr Arg Leu Ala Gln Glu Lys Arg Pro Asp Leu Met Ile Asp
Gly Pro 610 615 620Leu Gln Tyr Asp Ala
Ala Val Met Ala Asp Val Ala Lys Ser Lys Ala625 630
635 640Pro Asn Ser Pro Val Ala Gly Arg Ala Thr
Val Phe Ile Phe Pro Asp 645 650
655Leu Asn Thr Gly Asn Thr Thr Tyr Lys Ala Val Gln Arg Ser Ala Asp
660 665 670Leu Ile Ser Ile Gly
Pro Met Leu Gln Gly Met Arg Lys Pro Val Asn 675
680 685Asp Leu Ser Arg Gly Ala Leu Val Asp Asp Ile Val
Tyr Thr Ile Ala 690 695 700Leu Thr Ala
Ile Gln Ser Ala Gln Gln Gln705
710131809DNAE.colimisc_feature(1)..(1809)frdA 13gtgcaaacct ttcaagccga
tcttgccatt gtaggcgccg gtggcgcggg attacgtgct 60gcaattgctg ccgcgcaggc
aaatccgaat gcaaaaatcg cactaatctc aaaagtatac 120ccgatgcgta gccataccgt
tgctgcagaa gggggctccg ccgctgtcgc gcaggatcat 180gacagcttcg aatatcactt
tcacgataca gtagcgggtg gcgactggtt gtgtgagcag 240gatgtcgtgg attatttcgt
ccaccactgc ccaaccgaaa tgacccaact ggaactgtgg 300ggatgcccat ggagccgtcg
cccggatggt agcgtcaacg tacgtcgctt cggcggcatg 360aaaatcgagc gcacctggtt
cgccgccgat aagaccggct tccatatgct gcacacgctg 420ttccagacct ctctgcaatt
cccgcagatc cagcgttttg acgaacattt cgtgctggat 480attctggttg atgatggtca
tgttcgcggc ctggtagcaa tgaacatgat ggaaggcacg 540ctggtgcaga tccgtgctaa
cgcggtcgtt atggctactg gcggtgcggg tcgcgtttat 600cgttacaaca ccaacggcgg
catcgttacc ggtgacggta tgggtatggc gctaagccac 660ggcgttccgc tgcgtgacat
ggaattcgtt cagtatcacc caaccggtct gccaggttcc 720ggtatcctga tgaccgaagg
ctgccgcggt gaaggcggta ttctggtcaa caaaaatggc 780taccgttatc tgcaagatta
cggcatgggc ccggaaactc cgctgggcga gccgaaaaac 840aaatatatgg aactgggtcc
acgcgacaaa gtttctcagg ccttctggca cgaatggcgt 900aaaggcaaca ccatctccac
gccgcgtggc gatgtggttt atctcgacct gcgtcacctc 960ggcgagaaaa aactgcatga
acgtctgccg ttcatctgcg aactggcgaa agcgtacgtt 1020ggcgtcgatc cggttaaaga
accgattccg gtacgtccga ccgcacacta caccatgggc 1080ggtatcgaaa ccgatcagaa
ctgtgaaacc cgcattaaag gtctgttcgc cgtgggtgaa 1140tgttcctctg ttggtctgca
cggtgcaaac cgtctgggtt ctaactccct ggcggaactg 1200gtggtcttcg gccgtctggc
cggtgaacaa gcgacagagc gtgcagcaac tgccggtaat 1260ggcaacgaag cggcaattga
agcgcaggca gctggcgttg aacaacgtct gaaagatctg 1320gttaaccagg atggcggcga
aaactgggcg aagatccgcg acgaaatggg cctggcaatg 1380gaagaaggtt gcggtatcta
ccgtacgccg gaactgatgc agaaaaccat cgacaagctg 1440gcagagctgc aggaacgctt
caagcgcgtg cgcatcaccg acacctccag cgtgttcaac 1500accgacctgc tctacaccat
tgaactgggc cacggtctga acgttgctga atgtatggcg 1560cactccgcaa tggcacgtaa
agagtcccgc ggcgcacacc agcgtctgga cgaaggttgc 1620accgagcgtg acgacgtcaa
cttcctcaaa cacaccctcg ccttccgcga tgctgatggc 1680acgactcgcc tggagtacag
cgacgtgaag attactacgc tgccgccagc taaacgcgtt 1740tacggtggcg aagcggatgc
agccgataag gcggaagcag ccaataagaa ggagaaggcg 1800aatggctga
180914602PRTE.coliMISC_FEATURE(1)..(602)frdA 14Met Gln Thr Phe Gln Ala
Asp Leu Ala Ile Val Gly Ala Gly Gly Ala1 5
10 15Gly Leu Arg Ala Ala Ile Ala Ala Ala Gln Ala Asn
Pro Asn Ala Lys 20 25 30Ile
Ala Leu Ile Ser Lys Val Tyr Pro Met Arg Ser His Thr Val Ala 35
40 45Ala Glu Gly Gly Ser Ala Ala Val Ala
Gln Asp His Asp Ser Phe Glu 50 55
60Tyr His Phe His Asp Thr Val Ala Gly Gly Asp Trp Leu Cys Glu Gln65
70 75 80Asp Val Val Asp Tyr
Phe Val His His Cys Pro Thr Glu Met Thr Gln 85
90 95Leu Glu Leu Trp Gly Cys Pro Trp Ser Arg Arg
Pro Asp Gly Ser Val 100 105
110Asn Val Arg Arg Phe Gly Gly Met Lys Ile Glu Arg Thr Trp Phe Ala
115 120 125Ala Asp Lys Thr Gly Phe His
Met Leu His Thr Leu Phe Gln Thr Ser 130 135
140Leu Gln Phe Pro Gln Ile Gln Arg Phe Asp Glu His Phe Val Leu
Asp145 150 155 160Ile Leu
Val Asp Asp Gly His Val Arg Gly Leu Val Ala Met Asn Met
165 170 175Met Glu Gly Thr Leu Val Gln
Ile Arg Ala Asn Ala Val Val Met Ala 180 185
190Thr Gly Gly Ala Gly Arg Val Tyr Arg Tyr Asn Thr Asn Gly
Gly Ile 195 200 205Val Thr Gly Asp
Gly Met Gly Met Ala Leu Ser His Gly Val Pro Leu 210
215 220Arg Asp Met Glu Phe Val Gln Tyr His Pro Thr Gly
Leu Pro Gly Ser225 230 235
240Gly Ile Leu Met Thr Glu Gly Cys Arg Gly Glu Gly Gly Ile Leu Val
245 250 255Asn Lys Asn Gly Tyr
Arg Tyr Leu Gln Asp Tyr Gly Met Gly Pro Glu 260
265 270Thr Pro Leu Gly Glu Pro Lys Asn Lys Tyr Met Glu
Leu Gly Pro Arg 275 280 285Asp Lys
Val Ser Gln Ala Phe Trp His Glu Trp Arg Lys Gly Asn Thr 290
295 300Ile Ser Thr Pro Arg Gly Asp Val Val Tyr Leu
Asp Leu Arg His Leu305 310 315
320Gly Glu Lys Lys Leu His Glu Arg Leu Pro Phe Ile Cys Glu Leu Ala
325 330 335Lys Ala Tyr Val
Gly Val Asp Pro Val Lys Glu Pro Ile Pro Val Arg 340
345 350Pro Thr Ala His Tyr Thr Met Gly Gly Ile Glu
Thr Asp Gln Asn Cys 355 360 365Glu
Thr Arg Ile Lys Gly Leu Phe Ala Val Gly Glu Cys Ser Ser Val 370
375 380Gly Leu His Gly Ala Asn Arg Leu Gly Ser
Asn Ser Leu Ala Glu Leu385 390 395
400Val Val Phe Gly Arg Leu Ala Gly Glu Gln Ala Thr Glu Arg Ala
Ala 405 410 415Thr Ala Gly
Asn Gly Asn Glu Ala Ala Ile Glu Ala Gln Ala Ala Gly 420
425 430Val Glu Gln Arg Leu Lys Asp Leu Val Asn
Gln Asp Gly Gly Glu Asn 435 440
445Trp Ala Lys Ile Arg Asp Glu Met Gly Leu Ala Met Glu Glu Gly Cys 450
455 460Gly Ile Tyr Arg Thr Pro Glu Leu
Met Gln Lys Thr Ile Asp Lys Leu465 470
475 480Ala Glu Leu Gln Glu Arg Phe Lys Arg Val Arg Ile
Thr Asp Thr Ser 485 490
495Ser Val Phe Asn Thr Asp Leu Leu Tyr Thr Ile Glu Leu Gly His Gly
500 505 510Leu Asn Val Ala Glu Cys
Met Ala His Ser Ala Met Ala Arg Lys Glu 515 520
525Ser Arg Gly Ala His Gln Arg Leu Asp Glu Gly Cys Thr Glu
Arg Asp 530 535 540Asp Val Asn Phe Leu
Lys His Thr Leu Ala Phe Arg Asp Ala Asp Gly545 550
555 560Thr Thr Arg Leu Glu Tyr Ser Asp Val Lys
Ile Thr Thr Leu Pro Pro 565 570
575Ala Lys Arg Val Tyr Gly Gly Glu Ala Asp Ala Ala Asp Lys Ala Glu
580 585 590Ala Ala Asn Lys Lys
Glu Lys Ala Asn Gly 595 600151071DNAE.
colimisc_feature(1)..(1071)dadX 15atgacccgtc cgatacaggc cagcctcgat
ctgcaggcat taaaacagaa tctgtccatt 60gtccgccagg ccgcgccgca cgcgcgcgtc
tggtcggtgg taaaagcgaa cgcttacggg 120catggtattg agcgtatctg gagcgcgctc
ggggccaccg atggctttgc attacttaac 180ctggaagagg caataacgtt acgtgagcgc
ggctggaagg ggccgatcct gatgctggaa 240ggatttttcc atgctcagga tctggagatt
tatgaccagc accgcctgac cacctgcgta 300cacagcaact ggcagctcaa agcactgcaa
aatgcgcggc taaaagcacc gttggatatt 360tatcttaaag tgaacagtgg gatgaatcgg
ttgggcttcc agcccgatcg cgtgcttacc 420gtctggcagc agttgcgggc aatggcgaat
gttggcgaga tgaccctgat gtcgcatttt 480gccgaagcgg aacatcctga tggaatttcc
agcgcgatgg cgcgtattga gcaggcggcg 540gaagggctgg agtgtcggcg ttcgttgtcc
aattcggcgg cgactctgtg gcacccggaa 600gcgcattttg actgggttcg gcctggcatt
attttgtatg gcgcttcgcc gtccggtcag 660tggcgtgata tcgccaatac cggattacgt
ccggtgatga cgctaagcag tgagattatt 720ggtgtccaga cgctaaaagc gggtgagcgt
gtgggctacg gcggtcgcta tactgcgcgc 780gatgaacagc gaatcggcat tgtcgccgca
gggtacgccg acggttatcc gcgccacgcg 840cctaccggta cccctgtttt agtggacggc
gtgcgcacca tgacggtggg gaccgtctcg 900atggatatgc tagcggtcga tttaacgcct
tgcccgcagg cggggattgg tacgccggtt 960gagctgtggg gcaaggagat caaaattgat
gatgtcgccg ccgctgccgg aacggtgggc 1020tatgagttga tgtgcgcgct ggcgttacgc
gtcccggttg tgacagtgta a 107116356PRTE.
coliMISC_FEATURE(1)..(356)dadX 16Met Thr Arg Pro Ile Gln Ala Ser Leu Asp
Leu Gln Ala Leu Lys Gln1 5 10
15Asn Leu Ser Ile Val Arg Gln Ala Ala Pro His Ala Arg Val Trp Ser
20 25 30Val Val Lys Ala Asn Ala
Tyr Gly His Gly Ile Glu Arg Ile Trp Ser 35 40
45Ala Leu Gly Ala Thr Asp Gly Phe Ala Leu Leu Asn Leu Glu
Glu Ala 50 55 60Ile Thr Leu Arg Glu
Arg Gly Trp Lys Gly Pro Ile Leu Met Leu Glu65 70
75 80Gly Phe Phe His Ala Gln Asp Leu Glu Ile
Tyr Asp Gln His Arg Leu 85 90
95Thr Thr Cys Val His Ser Asn Trp Gln Leu Lys Ala Leu Gln Asn Ala
100 105 110Arg Leu Lys Ala Pro
Leu Asp Ile Tyr Leu Lys Val Asn Ser Gly Met 115
120 125Asn Arg Leu Gly Phe Gln Pro Asp Arg Val Leu Thr
Val Trp Gln Gln 130 135 140Leu Arg Ala
Met Ala Asn Val Gly Glu Met Thr Leu Met Ser His Phe145
150 155 160Ala Glu Ala Glu His Pro Asp
Gly Ile Ser Ser Ala Met Ala Arg Ile 165
170 175Glu Gln Ala Ala Glu Gly Leu Glu Cys Arg Arg Ser
Leu Ser Asn Ser 180 185 190Ala
Ala Thr Leu Trp His Pro Glu Ala His Phe Asp Trp Val Arg Pro 195
200 205Gly Ile Ile Leu Tyr Gly Ala Ser Pro
Ser Gly Gln Trp Arg Asp Ile 210 215
220Ala Asn Thr Gly Leu Arg Pro Val Met Thr Leu Ser Ser Glu Ile Ile225
230 235 240Gly Val Gln Thr
Leu Lys Ala Gly Glu Arg Val Gly Tyr Gly Gly Arg 245
250 255Tyr Thr Ala Arg Asp Glu Gln Arg Ile Gly
Ile Val Ala Ala Gly Tyr 260 265
270Ala Asp Gly Tyr Pro Arg His Ala Pro Thr Gly Thr Pro Val Leu Val
275 280 285Asp Gly Val Arg Thr Met Thr
Val Gly Thr Val Ser Met Asp Met Leu 290 295
300Ala Val Asp Leu Thr Pro Cys Pro Gln Ala Gly Ile Gly Thr Pro
Val305 310 315 320Glu Leu
Trp Gly Lys Glu Ile Lys Ile Asp Asp Val Ala Ala Ala Ala
325 330 335Gly Thr Val Gly Tyr Glu Leu
Met Cys Ala Leu Ala Leu Arg Val Pro 340 345
350Val Val Thr Val
35517450DNAE.colimisc_feature(1)..(450)ygaW 17atgttctcac cgcagtcacg
cttgcgtcat gcagttgcag atacgttcgc gatggttgtt 60tactgttctg tcgtgaacat
gtgtattgaa gttttcctct ccggaatgag cttcgaacag 120tctttttatt ccagattggt
agcgattccg gtgaacatct taattgcatg gccatacggt 180atgtaccgtg atctgtttat
gcgcgcggca cgcaaagtta gcccgtcggg ctggataaaa 240aatctggctg atatcctggc
ttatgtgacg ttccagtcac cggtgtatgt ggcgatcttg 300ttagtggtgg gcgcagactg
gcatcagatt atggcggcgg tcagttcaaa catcgttgtt 360tcgatgttga tgggggcggt
ttatggctac ttcctcgatt attgccgccg actgtttaaa 420gtcagccgtt accagcaggt
aaaagcctga
45018149PRTE.coliMISC_FEATURE(1)..(149)ygaW 18Met Phe Ser Pro Gln Ser Arg
Leu Arg His Ala Val Ala Asp Thr Phe1 5 10
15Ala Met Val Val Tyr Cys Ser Val Val Asn Met Cys Ile
Glu Val Phe 20 25 30Leu Ser
Gly Met Ser Phe Glu Gln Ser Phe Tyr Ser Arg Leu Val Ala 35
40 45Ile Pro Val Asn Ile Leu Ile Ala Trp Pro
Tyr Gly Met Tyr Arg Asp 50 55 60Leu
Phe Met Arg Ala Ala Arg Lys Val Ser Pro Ser Gly Trp Ile Lys65
70 75 80Asn Leu Ala Asp Ile Leu
Ala Tyr Val Thr Phe Gln Ser Pro Val Tyr 85
90 95Val Ala Ile Leu Leu Val Val Gly Ala Asp Trp His
Gln Ile Met Ala 100 105 110Ala
Val Ser Ser Asn Ile Val Val Ser Met Leu Met Gly Ala Val Tyr 115
120 125Gly Tyr Phe Leu Asp Tyr Cys Arg Arg
Leu Phe Lys Val Ser Arg Tyr 130 135
140Gln Gln Val Lys Ala14519987DNAE.colimisc_feature(1)..(987)zipA
19atgatgcagg atttgcgtct gatattaatc attgttggcg cgatcgccat aatcgcttta
60ctggtacatg gtttctggac cagccgtaaa gaacgatctt ctatgttccg cgatcggcca
120ttaaaacgaa tgaagtcaaa acgtgacgac gattcttatg acgaggatgt cgaagatgat
180gagggcgttg gtgaggttcg tgttcaccgc gtgaatcatg ccccggctaa cgctcaggag
240catgaggctg ctcgtccgtc gccgcaacac cagtaccaac cgccttatgc gtctgcgcag
300ccgcgtcaac cggtccagca gccgcctgaa gcgcaggtac cgccgcaaca tgctccgcgt
360ccagcgcagc cggtgcagca gcctgcctat cagccgcagc ctgaacagcc gttgcagcag
420ccagtttcgc cacaggtcgc gccagcgccg cagcctgtgc attcagcacc gcaaccggca
480caacaggctt tccagcctgc agaacccgta gcggcaccac agcctgagcc tgtagcggaa
540ccggctccag ttatggataa accgaagcgc aaagaagcgg tgattatcat gaacgtcgcg
600gcgcatcacg gtagcgagct aaacggtgaa ctgcttctta acagcattca acaagcgggc
660ttcatttttg gcgatatgaa tatttaccat cgtcatctta gcccggatgg cagcggcccg
720gcgttattca gcctggcgaa tatggtgaaa ccgggaacct ttgatcctga aatgaaggat
780ttcactactc cgggtgtcac catctttatg caggtaccgt cttacggtga cgagctgcag
840aacttcaagc tgatgctgca atctgcgcag catattgccg atgaagtggg cggtgtcgtg
900cttgacgatc agcgccgtat gatgactccg cagaaattgc gcgagtacca ggacatcatc
960cgcgaagtca aagacgccaa cgcctga
98720328PRTE.coliMISC_FEATURE(1)..(328)zipA 20Met Met Gln Asp Leu Arg Leu
Ile Leu Ile Ile Val Gly Ala Ile Ala1 5 10
15Ile Ile Ala Leu Leu Val His Gly Phe Trp Thr Ser Arg
Lys Glu Arg 20 25 30Ser Ser
Met Phe Arg Asp Arg Pro Leu Lys Arg Met Lys Ser Lys Arg 35
40 45Asp Asp Asp Ser Tyr Asp Glu Asp Val Glu
Asp Asp Glu Gly Val Gly 50 55 60Glu
Val Arg Val His Arg Val Asn His Ala Pro Ala Asn Ala Gln Glu65
70 75 80His Glu Ala Ala Arg Pro
Ser Pro Gln His Gln Tyr Gln Pro Pro Tyr 85
90 95Ala Ser Ala Gln Pro Arg Gln Pro Val Gln Gln Pro
Pro Glu Ala Gln 100 105 110Val
Pro Pro Gln His Ala Pro Arg Pro Ala Gln Pro Val Gln Gln Pro 115
120 125Ala Tyr Gln Pro Gln Pro Glu Gln Pro
Leu Gln Gln Pro Val Ser Pro 130 135
140Gln Val Ala Pro Ala Pro Gln Pro Val His Ser Ala Pro Gln Pro Ala145
150 155 160Gln Gln Ala Phe
Gln Pro Ala Glu Pro Val Ala Ala Pro Gln Pro Glu 165
170 175Pro Val Ala Glu Pro Ala Pro Val Met Asp
Lys Pro Lys Arg Lys Glu 180 185
190Ala Val Ile Ile Met Asn Val Ala Ala His His Gly Ser Glu Leu Asn
195 200 205Gly Glu Leu Leu Leu Asn Ser
Ile Gln Gln Ala Gly Phe Ile Phe Gly 210 215
220Asp Met Asn Ile Tyr His Arg His Leu Ser Pro Asp Gly Ser Gly
Pro225 230 235 240Ala Leu
Phe Ser Leu Ala Asn Met Val Lys Pro Gly Thr Phe Asp Pro
245 250 255Glu Met Lys Asp Phe Thr Thr
Pro Gly Val Thr Ile Phe Met Gln Val 260 265
270Pro Ser Tyr Gly Asp Glu Leu Gln Asn Phe Lys Leu Met Leu
Gln Ser 275 280 285Ala Gln His Ile
Ala Asp Glu Val Gly Gly Val Val Leu Asp Asp Gln 290
295 300Arg Arg Met Met Thr Pro Gln Lys Leu Arg Glu Tyr
Gln Asp Ile Ile305 310 315
320Arg Glu Val Lys Asp Ala Asn Ala
325211425DNAE.colimisc_feature(1)..(1425)lpd 21atgagtactg aaatcaaaac
tcaggtcgtg gtacttgggg caggccccgc aggttactcc 60gctgccttcc gttgcgctga
tttaggtctg gaaaccgtaa tcgtagaacg ttacaacacc 120cttggcggtg tttgcctgaa
cgtcggctgt atcccttcta aagcactgct gcacgtagca 180aaagttatcg aagaagccaa
agcgctggct gaacacggta tcgtcttcgg cgaaccgaaa 240accgatattg acaagattcg
tacctggaaa gagaaagtga tcaatcagct gaccggtggt 300ctggctggta tggcgaaagg
ccgcaaagtc aaagtggtca acggtctggg taaatttacc 360ggggctaaca ccctggaagt
tgaaggtgag aacggtaaaa ccgtgatcaa cttcgacaac 420gcgatcattg cagcgggttc
tcgcccgatc caactgccgt ttattccgca tgaagatccg 480cgtatctggg actccactga
cgcgctggaa ctgaaagaag taccagaacg cctgctggta 540atgggtggcg gtatcatcgg
tctggaaatg ggcaccgtat accacgcgct gggttcacag 600attgacgtgg ttgaaatgtt
cgaccaggtt atcccggcag ctgacaaaga catcgttaaa 660gtcttcacca agcgtatcag
caagaaattc aacctgatgc tggaaaccaa agttaccgcc 720gttgaagcga aagaagacgg
tatttatgtg acgatggaag gcaaaaaagc acccgctgaa 780ccgcagcgtt acgacgccgt
gctggtagcg attggtcgtg tgccgaacgg taaaaacctc 840gacgcaggca aagctggcgt
ggaagtggac gaccgtggtt tcatccgcgt tgacaaacag 900ctgcgtacca acgtaccgca
catctttgct atcggcgata tcgtcggtca gccgatgctg 960gcacacaaag gtgttcacga
aggtcacgtt gccgctgaag ttatcgccgg taagaaacac 1020tacttcgatc cgaaagttat
cccgtccatc gcctataccg aaccagaagt tgcatgggta 1080ggtctgactg agaaagaagc
gaaagagaaa ggcatcagct atgaaaccgc caccttcccg 1140tgggctgctt ctggtcgtgc
tatcgcttcc gactgcgcag acggtatgac caagctgatt 1200ttcgacaaag aatctcaccg
tgtgatcggt ggtgcgattg tcggtaccaa cggcggcgag 1260ctgctgggtg aaatcggcct
ggcaatcgaa atgggttgtg atgctgaaga catcgcactg 1320accatccacg cgcacccgac
tctgcacgag tctgtgggcc tggcggcaga agtgttcgaa 1380ggtagcatta ccgacctgcc
gaacccgaaa gcgaagaaga agtaa
142522474PRTE.coliMISC_FEATURE(1)..(474)lpd 22Met Ser Thr Glu Ile Lys Thr
Gln Val Val Val Leu Gly Ala Gly Pro1 5 10
15Ala Gly Tyr Ser Ala Ala Phe Arg Cys Ala Asp Leu Gly
Leu Glu Thr 20 25 30Val Ile
Val Glu Arg Tyr Asn Thr Leu Gly Gly Val Cys Leu Asn Val 35
40 45Gly Cys Ile Pro Ser Lys Ala Leu Leu His
Val Ala Lys Val Ile Glu 50 55 60Glu
Ala Lys Ala Leu Ala Glu His Gly Ile Val Phe Gly Glu Pro Lys65
70 75 80Thr Asp Ile Asp Lys Ile
Arg Thr Trp Lys Glu Lys Val Ile Asn Gln 85
90 95Leu Thr Gly Gly Leu Ala Gly Met Ala Lys Gly Arg
Lys Val Lys Val 100 105 110Val
Asn Gly Leu Gly Lys Phe Thr Gly Ala Asn Thr Leu Glu Val Glu 115
120 125Gly Glu Asn Gly Lys Thr Val Ile Asn
Phe Asp Asn Ala Ile Ile Ala 130 135
140Ala Gly Ser Arg Pro Ile Gln Leu Pro Phe Ile Pro His Glu Asp Pro145
150 155 160Arg Ile Trp Asp
Ser Thr Asp Ala Leu Glu Leu Lys Glu Val Pro Glu 165
170 175Arg Leu Leu Val Met Gly Gly Gly Ile Ile
Gly Leu Glu Met Gly Thr 180 185
190Val Tyr His Ala Leu Gly Ser Gln Ile Asp Val Val Glu Met Phe Asp
195 200 205Gln Val Ile Pro Ala Ala Asp
Lys Asp Ile Val Lys Val Phe Thr Lys 210 215
220Arg Ile Ser Lys Lys Phe Asn Leu Met Leu Glu Thr Lys Val Thr
Ala225 230 235 240Val Glu
Ala Lys Glu Asp Gly Ile Tyr Val Thr Met Glu Gly Lys Lys
245 250 255Ala Pro Ala Glu Pro Gln Arg
Tyr Asp Ala Val Leu Val Ala Ile Gly 260 265
270Arg Val Pro Asn Gly Lys Asn Leu Asp Ala Gly Lys Ala Gly
Val Glu 275 280 285Val Asp Asp Arg
Gly Phe Ile Arg Val Asp Lys Gln Leu Arg Thr Asn 290
295 300Val Pro His Ile Phe Ala Ile Gly Asp Ile Val Gly
Gln Pro Met Leu305 310 315
320Ala His Lys Gly Val His Glu Gly His Val Ala Ala Glu Val Ile Ala
325 330 335Gly Lys Lys His Tyr
Phe Asp Pro Lys Val Ile Pro Ser Ile Ala Tyr 340
345 350Thr Glu Pro Glu Val Ala Trp Val Gly Leu Thr Glu
Lys Glu Ala Lys 355 360 365Glu Lys
Gly Ile Ser Tyr Glu Thr Ala Thr Phe Pro Trp Ala Ala Ser 370
375 380Gly Arg Ala Ile Ala Ser Asp Cys Ala Asp Gly
Met Thr Lys Leu Ile385 390 395
400Phe Asp Lys Glu Ser His Arg Val Ile Gly Gly Ala Ile Val Gly Thr
405 410 415Asn Gly Gly Glu
Leu Leu Gly Glu Ile Gly Leu Ala Ile Glu Met Gly 420
425 430Cys Asp Ala Glu Asp Ile Ala Leu Thr Ile His
Ala His Pro Thr Leu 435 440 445His
Glu Ser Val Gly Leu Ala Ala Glu Val Phe Glu Gly Ser Ile Thr 450
455 460Asp Leu Pro Asn Pro Lys Ala Lys Lys
Lys465 470231320DNAE. colimisc_feature(1)..(1320)brnQ
23atgacccatc aattaagatc gcgcgatatc atcgctctgg gctttatgac atttgcgttg
60ttcgtcggcg caggtaacat tattttccct ccaatggtcg gcttgcaggc aggcgaacac
120gtctggactg cggcattcgg cttcctcatt actgccgttg gcctgccggt attaacggta
180gtggcgctgg caaaagttgg cggcggtgtt gacagcctca gcacgccaat cggtaaagtc
240gctggcgtgc tgctggcaac ggtttgttac ctggcggtgg ggccgctttt cgctacgccg
300cgtacagcta ccgtttcctt tgaagtgggg attgcgccgc tgacgggtga ttccgcgctg
360ccgctgttta tctacagcct ggtctatttc gctatcgtta ttctggtttc gctctatccg
420ggcaagctgc tggataccgt gggcaacttc cttgcgccgc tgaaaattat cgcgctggtc
480atcctgtctg ttgccgcgat tgtctggccg gcgggttcta tcagcacggc gactgaggct
540tatcaaaacg ctgcgttttc taacggcttc gttaacggct atctgaccat ggatacgctg
600ggcgcaatgg tgtttggtat cgttattgtt aacgcggcgc gttctcgtgg cgttaccgaa
660gcgcgtctgc tgacccgtta taccgtctgg gctggcctga tggcgggtgt tggtctgact
720ctgctgtacc tggcgctgtt ccgtctgggt tcagacagcg cgtcgctggt cgatcagtct
780gcaaacggcg ctgctattct gcatgcttac gttcagcaca cctttggcgg cggcggtagc
840ttcctgctgg cggcgttaat cttcatcgcc tgcctggtaa cggcagttgg cctgacctgt
900gcttgtgcag aattctttgc ccagtacgta ccgctctctt atcgtacgct ggtgtttatc
960ctcggcggct tctcgatggt ggtttctaac ctcggcttaa gccagctgat ccagatctcc
1020gtaccggtgc tgaccgctat ttatccgccg tgtatcgcac tggttgtatt aagttttaca
1080cgctcatggt ggcataattc gtcccgcgtg attgctccgc cgatgtttat cagcctgctt
1140tttggtattc tcgacgggat caaagcatct gcattcagcg atatcttacc gtcctgggcg
1200cagcgtttac cgctggccga acaaggtctg gcgtggttaa tgccaacagt ggtgatggtg
1260gttctggcca ttatctggga tcgcgcggca ggtcgtcagg tgacctccag cgctcactaa
132024439PRTE. coliMISC_FEATURE(1)..(439)brnQ 24Met Thr His Gln Leu Arg
Ser Arg Asp Ile Ile Ala Leu Gly Phe Met1 5
10 15Thr Phe Ala Leu Phe Val Gly Ala Gly Asn Ile Ile
Phe Pro Pro Met 20 25 30Val
Gly Leu Gln Ala Gly Glu His Val Trp Thr Ala Ala Phe Gly Phe 35
40 45Leu Ile Thr Ala Val Gly Leu Pro Val
Leu Thr Val Val Ala Leu Ala 50 55
60Lys Val Gly Gly Gly Val Asp Ser Leu Ser Thr Pro Ile Gly Lys Val65
70 75 80Ala Gly Val Leu Leu
Ala Thr Val Cys Tyr Leu Ala Val Gly Pro Leu 85
90 95Phe Ala Thr Pro Arg Thr Ala Thr Val Ser Phe
Glu Val Gly Ile Ala 100 105
110Pro Leu Thr Gly Asp Ser Ala Leu Pro Leu Phe Ile Tyr Ser Leu Val
115 120 125Tyr Phe Ala Ile Val Ile Leu
Val Ser Leu Tyr Pro Gly Lys Leu Leu 130 135
140Asp Thr Val Gly Asn Phe Leu Ala Pro Leu Lys Ile Ile Ala Leu
Val145 150 155 160Ile Leu
Ser Val Ala Ala Ile Val Trp Pro Ala Gly Ser Ile Ser Thr
165 170 175Ala Thr Glu Ala Tyr Gln Asn
Ala Ala Phe Ser Asn Gly Phe Val Asn 180 185
190Gly Tyr Leu Thr Met Asp Thr Leu Gly Ala Met Val Phe Gly
Ile Val 195 200 205Ile Val Asn Ala
Ala Arg Ser Arg Gly Val Thr Glu Ala Arg Leu Leu 210
215 220Thr Arg Tyr Thr Val Trp Ala Gly Leu Met Ala Gly
Val Gly Leu Thr225 230 235
240Leu Leu Tyr Leu Ala Leu Phe Arg Leu Gly Ser Asp Ser Ala Ser Leu
245 250 255Val Asp Gln Ser Ala
Asn Gly Ala Ala Ile Leu His Ala Tyr Val Gln 260
265 270His Thr Phe Gly Gly Gly Gly Ser Phe Leu Leu Ala
Ala Leu Ile Phe 275 280 285Ile Ala
Cys Leu Val Thr Ala Val Gly Leu Thr Cys Ala Cys Ala Glu 290
295 300Phe Phe Ala Gln Tyr Val Pro Leu Ser Tyr Arg
Thr Leu Val Phe Ile305 310 315
320Leu Gly Gly Phe Ser Met Val Val Ser Asn Leu Gly Leu Ser Gln Leu
325 330 335Ile Gln Ile Ser
Val Pro Val Leu Thr Ala Ile Tyr Pro Pro Cys Ile 340
345 350Ala Leu Val Val Leu Ser Phe Thr Arg Ser Trp
Trp His Asn Ser Ser 355 360 365Arg
Val Ile Ala Pro Pro Met Phe Ile Ser Leu Leu Phe Gly Ile Leu 370
375 380Asp Gly Ile Lys Ala Ser Ala Phe Ser Asp
Ile Leu Pro Ser Trp Ala385 390 395
400Gln Arg Leu Pro Leu Ala Glu Gln Gly Leu Ala Trp Leu Met Pro
Thr 405 410 415Val Val Met
Val Val Leu Ala Ile Ile Trp Asp Arg Ala Ala Gly Arg 420
425 430Gln Val Thr Ser Ser Ala His
435251104DNAE. colimisc_feature(1)..(1104)asd 25atgaaaaatg ttggttttat
cggctggcgc ggtatggtcg gctccgttct catgcaacgc 60atggttgaag agcgcgactt
cgacgccatt cgccctgtct tcttttctac ttctcagctt 120ggccaggctg cgccgtcttt
tggcggaacc actggcacac ttcaggatgc ctttgatctg 180gaggcgctaa aggccctcga
tatcattgtg acctgtcagg gcggcgatta taccaacgaa 240atctatccaa agcttcgtga
aagcggatgg caaggttact ggattgacgc agcatcgtct 300ctgcgcatga aagatgacgc
catcatcatt cttgaccccg tcaatcagga cgtcattacc 360gacggattaa ataatggcat
caggactttt gttggcggta actgtaccgt aagcctgatg 420ttgatgtcgt tgggtggttt
attcgccaat gatcttgttg attgggtgtc cgttgcaacc 480taccaggccg cttccggcgg
tggtgcgcga catatgcgtg agttattaac ccagatgggc 540catctgtatg gccatgtggc
agatgaactc gcgaccccgt cctctgctat tctcgatatc 600gaacgcaaag tcacaacctt
aacccgtagc ggtgagctgc cggtggataa ctttggcgtg 660ccgctggcgg gtagcctgat
tccgtggatc gacaaacagc tcgataacgg tcagagccgc 720gaagagtgga aagggcaggc
ggaaaccaac aagatcctca acacatcttc cgtaattccg 780gtagatggtt tatgtgtgcg
tgtcggggca ttgcgctgcc acagccaggc attcactatt 840aaattgaaaa aagatgtgtc
tattccgacc gtggaagaac tgctggctgc gcacaatccg 900tgggcgaaag tcgttccgaa
cgatcgggaa atcactatgc gtgagctaac cccagctgcc 960gttaccggca cgctgaccac
gccggtaggc cgcctgcgta agctgaatat gggaccagag 1020ttcctgtcag cctttaccgt
gggcgaccag ctgctgtggg gggccgcgga gccgctgcgt 1080cggatgcttc gtcaactggc
gtaa 110426367PRTE.
coliMISC_FEATURE(1)..(367)asd 26Met Lys Asn Val Gly Phe Ile Gly Trp Arg
Gly Met Val Gly Ser Val1 5 10
15Leu Met Gln Arg Met Val Glu Glu Arg Asp Phe Asp Ala Ile Arg Pro
20 25 30Val Phe Phe Ser Thr Ser
Gln Leu Gly Gln Ala Ala Pro Ser Phe Gly 35 40
45Gly Thr Thr Gly Thr Leu Gln Asp Ala Phe Asp Leu Glu Ala
Leu Lys 50 55 60Ala Leu Asp Ile Ile
Val Thr Cys Gln Gly Gly Asp Tyr Thr Asn Glu65 70
75 80Ile Tyr Pro Lys Leu Arg Glu Ser Gly Trp
Gln Gly Tyr Trp Ile Asp 85 90
95Ala Ala Ser Ser Leu Arg Met Lys Asp Asp Ala Ile Ile Ile Leu Asp
100 105 110Pro Val Asn Gln Asp
Val Ile Thr Asp Gly Leu Asn Asn Gly Ile Arg 115
120 125Thr Phe Val Gly Gly Asn Cys Thr Val Ser Leu Met
Leu Met Ser Leu 130 135 140Gly Gly Leu
Phe Ala Asn Asp Leu Val Asp Trp Val Ser Val Ala Thr145
150 155 160Tyr Gln Ala Ala Ser Gly Gly
Gly Ala Arg His Met Arg Glu Leu Leu 165
170 175Thr Gln Met Gly His Leu Tyr Gly His Val Ala Asp
Glu Leu Ala Thr 180 185 190Pro
Ser Ser Ala Ile Leu Asp Ile Glu Arg Lys Val Thr Thr Leu Thr 195
200 205Arg Ser Gly Glu Leu Pro Val Asp Asn
Phe Gly Val Pro Leu Ala Gly 210 215
220Ser Leu Ile Pro Trp Ile Asp Lys Gln Leu Asp Asn Gly Gln Ser Arg225
230 235 240Glu Glu Trp Lys
Gly Gln Ala Glu Thr Asn Lys Ile Leu Asn Thr Ser 245
250 255Ser Val Ile Pro Val Asp Gly Leu Cys Val
Arg Val Gly Ala Leu Arg 260 265
270Cys His Ser Gln Ala Phe Thr Ile Lys Leu Lys Lys Asp Val Ser Ile
275 280 285Pro Thr Val Glu Glu Leu Leu
Ala Ala His Asn Pro Trp Ala Lys Val 290 295
300Val Pro Asn Asp Arg Glu Ile Thr Met Arg Glu Leu Thr Pro Ala
Ala305 310 315 320Val Thr
Gly Thr Leu Thr Thr Pro Val Gly Arg Leu Arg Lys Leu Asn
325 330 335Met Gly Pro Glu Phe Leu Ser
Ala Phe Thr Val Gly Asp Gln Leu Leu 340 345
350Trp Gly Ala Ala Glu Pro Leu Arg Arg Met Leu Arg Gln Leu
Ala 355 360 365271026DNAE.
colimisc_feature(1)..(1026)lpxD mutated 27atgccttcaa ttcgactggc
tgatttagcg cagcagttgg atacagaact acacggtgat 60ggcgatatcg tcatcaccgg
cgttgcgtcc atgcaatctg cacaaacagg tcacattacg 120ttcatggtta acccaaaata
ccgtgagcat ttaggcttgt gccaggcgtc cgcggttgtc 180atgacccagg acgatcttcc
tttcgcgaaa agtgccgcgc tggtagtgaa gaatccctac 240ctgacttacg cgcgcatggc
gcaaatttta gataccacgc cgcagcccgc gcagaacatt 300gcacccagtg cggtgatcga
cgcgacggcg aagctgggta acaacgtatc gattggcgct 360aacgcggtga ttgagtccgg
cgttgaactg ggcgataacg tgattatcgg tgccggttgc 420ttcgtaggta aaaacagcaa
aatcggtgca ggttcgcgtc tctgggcgaa cgtaaccatt 480taccatgaga tccagatcgg
tcagaattgc ctgatccagt ccggaacagt ggtaggcgca 540gacggctttg gttatgccaa
cgatcgtggt aactgggtga agatcccaca gattggtcgc 600gtaattattg gcgatcgcgt
ggagatcggt gcctgcacaa ccatcgatcg cggcgcgctg 660gatgacacta ttattggcaa
tggcgtgatc attgataacc agtgccagat tgcacataac 720gtcgtgattg gcgacaatac
ggcggttgcc ggtggcgtca ttatggcggg cagcctgaaa 780attggtcgtt actgcatgat
cggcggagcc agcgtaatca acgggcatat ggaaatatgc 840gacaaagtga cggttacggg
catgggtatg gtgatgcgtc ccatcactga accaggcgtc 900tattcctcag gcattccgct
gcaacccaac aaagtctggc gcaaaaccgc tgcactggtg 960atgaacattg atgacatgag
caagcgtctg aaatcgcttg agcgcaaggt taatcaacaa 1020gactaa
102628341PRTE.
coliMISC_FEATURE(1)..(341)lpxD mutated 28Met Pro Ser Ile Arg Leu Ala Asp
Leu Ala Gln Gln Leu Asp Thr Glu1 5 10
15Leu His Gly Asp Gly Asp Ile Val Ile Thr Gly Val Ala Ser
Met Gln 20 25 30Ser Ala Gln
Thr Gly His Ile Thr Phe Met Val Asn Pro Lys Tyr Arg 35
40 45Glu His Leu Gly Leu Cys Gln Ala Ser Ala Val
Val Met Thr Gln Asp 50 55 60Asp Leu
Pro Phe Ala Lys Ser Ala Ala Leu Val Val Lys Asn Pro Tyr65
70 75 80Leu Thr Tyr Ala Arg Met Ala
Gln Ile Leu Asp Thr Thr Pro Gln Pro 85 90
95Ala Gln Asn Ile Ala Pro Ser Ala Val Ile Asp Ala Thr
Ala Lys Leu 100 105 110Gly Asn
Asn Val Ser Ile Gly Ala Asn Ala Val Ile Glu Ser Gly Val 115
120 125Glu Leu Gly Asp Asn Val Ile Ile Gly Ala
Gly Cys Phe Val Gly Lys 130 135 140Asn
Ser Lys Ile Gly Ala Gly Ser Arg Leu Trp Ala Asn Val Thr Ile145
150 155 160Tyr His Glu Ile Gln Ile
Gly Gln Asn Cys Leu Ile Gln Ser Gly Thr 165
170 175Val Val Gly Ala Asp Gly Phe Gly Tyr Ala Asn Asp
Arg Gly Asn Trp 180 185 190Val
Lys Ile Pro Gln Ile Gly Arg Val Ile Ile Gly Asp Arg Val Glu 195
200 205Ile Gly Ala Cys Thr Thr Ile Asp Arg
Gly Ala Leu Asp Asp Thr Ile 210 215
220Ile Gly Asn Gly Val Ile Ile Asp Asn Gln Cys Gln Ile Ala His Asn225
230 235 240Val Val Ile Gly
Asp Asn Thr Ala Val Ala Gly Gly Val Ile Met Ala 245
250 255Gly Ser Leu Lys Ile Gly Arg Tyr Cys Met
Ile Gly Gly Ala Ser Val 260 265
270Ile Asn Gly His Met Glu Ile Cys Asp Lys Val Thr Val Thr Gly Met
275 280 285Gly Met Val Met Arg Pro Ile
Thr Glu Pro Gly Val Tyr Ser Ser Gly 290 295
300Ile Pro Leu Gln Pro Asn Lys Val Trp Arg Lys Thr Ala Ala Leu
Val305 310 315 320Met Asn
Ile Asp Asp Met Ser Lys Arg Leu Lys Ser Leu Glu Arg Lys
325 330 335Val Asn Gln Gln Asp
34029918DNAE. colimisc_feature(1)..(918)gcvA 29atgtctaaac gattaccacc
gctaaatgcc ttacgagttt ttgatgccgc agcacgccat 60ttaagtttca ctcgcgcagc
agaagagctt tttgtgaccc aggccgcagt aagtcatcaa 120atcaagtctc ttgaggattt
tctggggcta aaactgttcc gccgccgtaa tcgttcactc 180ctgctgaccg aggaaggtca
aagctatttc ctcgatatca aagagatatt ttcgcaatta 240accgaagcga cgcgtaaact
ccaggcccgt agcgccaagg gggcgttgac ggtcagttta 300ctccccagtt tcgccattca
ttggttggtt ccgcgacttt ccagctttaa ttcagcttat 360ccgggaattg acgttcgaat
ccaggcggtt gatcgtcagg aagataagct ggcggatgat 420gttgatgtgg cgatatttta
tggtcggggc aactggccgg ggctacgggt ggaaaaactg 480tacgccgaat atttattgcc
ggtgtgttcg ccgctactgc tgacaggcga aaaacccttg 540aagaccccgg aagatctggc
taaacatacg ttattacatg atgcgtcacg ccgtgactgg 600cagacatata cccgacagtt
ggggttaaat catatcaacg ttcagcaagg gccaattttt 660agtcatagcg ccatggtgct
gcaagcggct attcacgggc agggagtggc gctggcaaat 720aacgtgatgg cgcaatctga
aatcgaggcc ggacgtcttg tttgcccgtt taatgatgtt 780ctggtcagta aaaacgcttt
ttatctggtt tgtcatgaca gccaggcaga actgggtaaa 840atagccgcct ttcgccaatg
gatcctggcg aaagccgctg ctgaacaaga aaaattccgc 900tttcgttatg aacaataa
91830305PRTE.
coliMISC_FEATURE(1)..(305)gcvA 30Met Ser Lys Arg Leu Pro Pro Leu Asn Ala
Leu Arg Val Phe Asp Ala1 5 10
15Ala Ala Arg His Leu Ser Phe Thr Arg Ala Ala Glu Glu Leu Phe Val
20 25 30Thr Gln Ala Ala Val Ser
His Gln Ile Lys Ser Leu Glu Asp Phe Leu 35 40
45Gly Leu Lys Leu Phe Arg Arg Arg Asn Arg Ser Leu Leu Leu
Thr Glu 50 55 60Glu Gly Gln Ser Tyr
Phe Leu Asp Ile Lys Glu Ile Phe Ser Gln Leu65 70
75 80Thr Glu Ala Thr Arg Lys Leu Gln Ala Arg
Ser Ala Lys Gly Ala Leu 85 90
95Thr Val Ser Leu Leu Pro Ser Phe Ala Ile His Trp Leu Val Pro Arg
100 105 110Leu Ser Ser Phe Asn
Ser Ala Tyr Pro Gly Ile Asp Val Arg Ile Gln 115
120 125Ala Val Asp Arg Gln Glu Asp Lys Leu Ala Asp Asp
Val Asp Val Ala 130 135 140Ile Phe Tyr
Gly Arg Gly Asn Trp Pro Gly Leu Arg Val Glu Lys Leu145
150 155 160Tyr Ala Glu Tyr Leu Leu Pro
Val Cys Ser Pro Leu Leu Leu Thr Gly 165
170 175Glu Lys Pro Leu Lys Thr Pro Glu Asp Leu Ala Lys
His Thr Leu Leu 180 185 190His
Asp Ala Ser Arg Arg Asp Trp Gln Thr Tyr Thr Arg Gln Leu Gly 195
200 205Leu Asn His Ile Asn Val Gln Gln Gly
Pro Ile Phe Ser His Ser Ala 210 215
220Met Val Leu Gln Ala Ala Ile His Gly Gln Gly Val Ala Leu Ala Asn225
230 235 240Asn Val Met Ala
Gln Ser Glu Ile Glu Ala Gly Arg Leu Val Cys Pro 245
250 255Phe Asn Asp Val Leu Val Ser Lys Asn Ala
Phe Tyr Leu Val Cys His 260 265
270Asp Ser Gln Ala Glu Leu Gly Lys Ile Ala Ala Phe Arg Gln Trp Ile
275 280 285Leu Ala Lys Ala Ala Ala Glu
Gln Glu Lys Phe Arg Phe Arg Tyr Glu 290 295
300Gln30531206DNAE. colimisc_feature(1)..(206)gcvB 31acttcctgag
ccggaacgaa aagttttatc ggaatgcgtg ttctggtgaa cttttggctt 60acggttgtga
tgttgtgttg ttgtgtttgc aattggtctg cgattcagac catggtagca 120aagctacctt
ttttcacttc ctgtacattt accctgtctg tccatagtga ttaatgtagc 180accgcctaat
tgcggtgctt tttttt
206321203DNAEscherichia colimisc_feature(1)..(1203)ackA 32atgtcgagta
agttagtact ggttctgaac tgcggtagtt cttcactgaa atttgccatc 60atcgatgcag
taaatggtga agagtacctt tctggtttag ccgaatgttt ccacctgccc 120gaagcacgta
tcaaatggaa aatggacggc aataaacagg aagcggcttt aggtgcaggc 180gccgctcaca
gcgaagcgct caactttatc gttaatacta ttctggcaca aaaaccagaa 240ctgtctgcgc
agctgactgc tatcggtcac cgtatcgtac acggcggcga aaagtatacc 300agctccgtag
tgatcgatga gtctgttatt cagggtatca aagatgcagc ttcttttgca 360ccgctgcaca
acccggctca cctgatcggt atcgaagaag ctctgaaatc tttcccacag 420ctgaaagaca
aaaacgttgc tgtatttgac accgcgttcc accagactat gccggaagag 480tcttacctct
acgccctgcc ttacaacctg tacaaagagc acggcatccg tcgttacggc 540gcgcacggca
ccagccactt ctatgtaacc caggaagcgg caaaaatgct gaacaaaccg 600gtagaagaac
tgaacatcat cacctgccac ctgggcaacg gtggttccgt ttctgctatc 660cgcaacggta
aatgcgttga cacctctatg ggcctgaccc cgctggaagg tctggtcatg 720ggtacccgtt
ctggtgatat cgatccggcg atcatcttcc acctgcacga caccctgggc 780atgagcgttg
acgcaatcaa caaactgctg accaaagagt ctggcctgct gggtctgacc 840gaagtgacca
gcgactgccg ctatgttgaa gacaactacg cgacgaaaga agacgcgaag 900cgcgcaatgg
acgtttactg ccaccgcctg gcgaaataca tcggtgccta cactgcgctg 960atggatggtc
gtctggacgc tgttgtattc actggtggta tcggtgaaaa tgccgcaatg 1020gttcgtgaac
tgtctctggg caaactgggc gtgctgggct ttgaagttga tcatgaacgc 1080aacctggctg
cacgtttcgg caaatctggt ttcatcaaca aagaaggtac ccgtcctgcg 1140gtggttatcc
caaccaacga agaactggtt atcgcgcaag acgcgagccg cctgactgcc 1200tga
120333400PRTEscherichia coliMISC_FEATURE(1)..(400)ackA 33Met Ser Ser Lys
Leu Val Leu Val Leu Asn Cys Gly Ser Ser Ser Leu1 5
10 15Lys Phe Ala Ile Ile Asp Ala Val Asn Gly
Glu Glu Tyr Leu Ser Gly 20 25
30Leu Ala Glu Cys Phe His Leu Pro Glu Ala Arg Ile Lys Trp Lys Met
35 40 45Asp Gly Asn Lys Gln Glu Ala Ala
Leu Gly Ala Gly Ala Ala His Ser 50 55
60Glu Ala Leu Asn Phe Ile Val Asn Thr Ile Leu Ala Gln Lys Pro Glu65
70 75 80Leu Ser Ala Gln Leu
Thr Ala Ile Gly His Arg Ile Val His Gly Gly 85
90 95Glu Lys Tyr Thr Ser Ser Val Val Ile Asp Glu
Ser Val Ile Gln Gly 100 105
110Ile Lys Asp Ala Ala Ser Phe Ala Pro Leu His Asn Pro Ala His Leu
115 120 125Ile Gly Ile Glu Glu Ala Leu
Lys Ser Phe Pro Gln Leu Lys Asp Lys 130 135
140Asn Val Ala Val Phe Asp Thr Ala Phe His Gln Thr Met Pro Glu
Glu145 150 155 160Ser Tyr
Leu Tyr Ala Leu Pro Tyr Asn Leu Tyr Lys Glu His Gly Ile
165 170 175Arg Arg Tyr Gly Ala His Gly
Thr Ser His Phe Tyr Val Thr Gln Glu 180 185
190Ala Ala Lys Met Leu Asn Lys Pro Val Glu Glu Leu Asn Ile
Ile Thr 195 200 205Cys His Leu Gly
Asn Gly Gly Ser Val Ser Ala Ile Arg Asn Gly Lys 210
215 220Cys Val Asp Thr Ser Met Gly Leu Thr Pro Leu Glu
Gly Leu Val Met225 230 235
240Gly Thr Arg Ser Gly Asp Ile Asp Pro Ala Ile Ile Phe His Leu His
245 250 255Asp Thr Leu Gly Met
Ser Val Asp Ala Ile Asn Lys Leu Leu Thr Lys 260
265 270Glu Ser Gly Leu Leu Gly Leu Thr Glu Val Thr Ser
Asp Cys Arg Tyr 275 280 285Val Glu
Asp Asn Tyr Ala Thr Lys Glu Asp Ala Lys Arg Ala Met Asp 290
295 300Val Tyr Cys His Arg Leu Ala Lys Tyr Ile Gly
Ala Tyr Thr Ala Leu305 310 315
320Met Asp Gly Arg Leu Asp Ala Val Val Phe Thr Gly Gly Ile Gly Glu
325 330 335Asn Ala Ala Met
Val Arg Glu Leu Ser Leu Gly Lys Leu Gly Val Leu 340
345 350Gly Phe Glu Val Asp His Glu Arg Asn Leu Ala
Ala Arg Phe Gly Lys 355 360 365Ser
Gly Phe Ile Asn Lys Glu Gly Thr Arg Pro Ala Val Val Ile Pro 370
375 380Thr Asn Glu Glu Leu Val Ile Ala Gln Asp
Ala Ser Arg Leu Thr Ala385 390 395
400345003DNAArtificialPlasmid
pACYC-asdmisc_feature(1)..(5003)Plasmid pACYC-asd 34gaattccgga tgagcattca
tcaggcgggc aagaatgtga ataaaggccg gataaaactt 60gtgcttattt ttctttacgg
tctttaaaaa ggccgtaata tccagctgaa cggtctggtt 120ataggtacat tgagcaactg
actgaaatgc ctcaaaatgt tctttacgat gccattggga 180tatatcaacg gtggtatatc
cagtgatttt tttctccatt ttagcttcct tagctcctga 240aaatctcgat aactcaaaaa
atacgcccgg tagtgatctt atttcattat ggtgaaagtt 300ggaacctctt acgtgccgat
caacgtctca ttttcgccaa aagttggccc agggcttccc 360ggtatcaaca gggacaccag
gatttattta ttctgcgaag tgatcttccg tcacaggtat 420ttattcggcg caaagtgcgt
cgggtgatgc tgccaactta ctgatttagt gtatgatggt 480gtttttgagg tgctccagtg
gcttctgttt ctatcagctg tccctcctgt tcagctactg 540acggggtggt gcgtaacggc
aaaagcaccg ccggacatca gcgctagcgg agtgtatact 600ggcttactat gttggcactg
atgagggtgt cagtgaagtg cttcatgtgg caggagaaaa 660aaggctgcac cggtgcgtca
gcagaatatg tgatacagga tatattccgc ttcctcgctc 720actgactcgc tacgctcggt
cgttcgactg cggcgagcgg aaatggctta cgaacggggc 780ggagatttcc tggaagatgc
caggaagata cttaacaggg aagtgagagg gccgcggcaa 840agccgttttt ccataggctc
cgcccccctg acaagcatca cgaaatctga cgctcaaatc 900agtggtggcg aaacccgaca
ggactataaa gataccaggc gtttccccct ggcggctccc 960tcgtgcgctc tcctgttcct
gcctttcggt ttaccggtgt cattccgctg ttatggccgc 1020gtttgtctca ttccacgcct
gacactcagt tccgggtagg cagttcgctc caagctggac 1080tgtatgcacg aaccccccgt
tcagtccgac cgctgcgcct tatccggtaa ctatcgtctt 1140gagtccaacc cggaaagaca
tgcaaaagca ccactggcag cagccactgg taattgattt 1200agaggagtta gtcttgaagt
catgcgccgg ttaaggctaa actgaaagga caagttttgg 1260tgactgcgct cctccaagcc
agttacctcg gttcaaagag ttggtagctc agagaacctt 1320cgaaaaaccg ccctgcaagg
cggttttttc gttttcagag caagagatta cgcgcagacc 1380aaaacgatct caagaagatc
atcttattaa tcagataaaa tatttctaga tttcagtgca 1440atttatctct tcaaatgtag
cacctgaagt cagccccata cgatataagt tgtaattctc 1500atgtttgaca gcttatcatc
gataagcttc gactgcacgg tgcaccaatg cttctggcgt 1560caggcagcca tcggaagctg
tggtatggct gtgcaggtcg taaatcactg cataattcgt 1620gtcgctcaag gcgcactccc
gttctggata atgttttttg cgccgacatc ataacggttc 1680tggcaaatat tctgaaatga
gctgttgaca attaatcatc cggctcgtat aatgtgtgga 1740attgtgagcg gataacaatt
tcagaattca aaagatcttt taagaaggag atatacatat 1800gaaaaatgtt ggttttatcg
gctggcgcgg tatggtcggc tccgttctca tgcaacgcat 1860ggttgaagag cgcgacttcg
acgccattcg ccctgtcttc ttttctactt ctcagcttgg 1920ccaggctgcg ccgtcttttg
gcggaaccac tggcacactt caggatgcct ttgatctgga 1980ggcgctaaag gccctcgata
tcattgtgac ctgtcagggc ggcgattata ccaacgaaat 2040ctatccaaag cttcgtgaaa
gcggatggca aggttactgg attgacgcag catcgtctct 2100gcgcatgaaa gatgacgcca
tcatcattct tgaccccgtc aatcaggacg tcattaccga 2160cggattaaat aatggcatca
ggacttttgt tggcggtaac tgtaccgtaa gcctgatgtt 2220gatgtcgttg ggtggtttat
tcgccaatga tcttgttgat tgggtgtccg ttgcaaccta 2280ccaggccgct tccggcggtg
gtgcgcgaca tatgcgtgag ttattaaccc agatgggcca 2340tctgtatggc catgtggcag
atgaactcgc gaccccgtcc tctgctattc tcgatatcga 2400acgcaaagtc acaaccttaa
cccgtagcgg tgagctgccg gtggataact ttggcgtgcc 2460gctggcgggt agcctgattc
cgtggatcga caaacagctc gataacggtc agagccgcga 2520agagtggaaa gggcaggcgg
aaaccaacaa gatcctcaac acatcttccg taattccggt 2580agatggttta tgtgtgcgtg
tcggggcatt gcgctgccac agccaggcat tcactattaa 2640attgaaaaaa gatgtgtcta
ttccgaccgt ggaagaactg ctggctgcgc acaatccgtg 2700ggcgaaagtc gttccgaacg
atcgggaaat cactatgcgt gagctaaccc cagctgccgt 2760taccggcacg ctgaccacgc
cggtaggccg cctgcgtaag ctgaatatgg gaccagagtt 2820cctgtcagcc tttaccgtgg
gcgaccagct gctgtggggg gccgcggagc cgctgcgtcg 2880gatgcttcgt caactggcgt
aatgtcgacc gatgcccttg agagccttca acccagtcag 2940ctccttccgg tgggcgcggg
gcatgactat cgtcgccgca cttatgactg tcttctttat 3000catgcaactc gtaggacagg
tgccggcagc gctctgggtc attttcggcg aggaccgctt 3060tcgctggagc gcgacgatga
tcggcctgtc gcttgcggta ttcggaatct tgcacgccct 3120cgctcaagcc ttcgtcactg
gtcccgccac caaacgtttc ggcgagaagc aggccattat 3180cgccggcatg gcggccgacg
cgctgggcta cgtcttgctg gcgttcgcga cgcgaggctg 3240gatggccttc cccattatga
ttcttctcgc ttccggcggc atcgggatgc ccgcgttgca 3300ggccatgctg tccaggcagg
tagatgacga ccatcaggga cagcttcaag gatcgctcgc 3360ggctcttacc agcctaactt
cgatcactgg accgctgatc gtcacggcga tttatgccgc 3420ctcggcgagc acatggaacg
ggttggcatg gattgtaggc gccgccctat accttgtctg 3480cctccccgcg ttgcgtcgcg
gtgcatggag ccgggccacc tcgacctgaa tggaagccgg 3540cggcacctcg ctaacggatt
caccactcca agaattggag ccaatcaatt cttgcggaga 3600actgtgaatg cgcaaaccaa
cccttggcag aacatatcca tcgcgtccgc catctccagc 3660agccgcacgc ggcgcatctc
gggcagcgtt gggtcctggc cacgggtgcg catgatcgtg 3720ctcctgtcgt tgaggacccg
gctaggctgg cggggttgcc ttactggtta gcagaatgaa 3780tcaccgatac gcgagcgaac
gtgaagcgac tgctgctgca aaacgtctgc gacctgagca 3840acaacatgaa tggtcttcgg
tttccgtgtt tcgtaaagtc tggaaacgcg gaagtcccct 3900acgtgctgct gaagttgccc
gcaacagaga gtggaaccaa ccggtgatac cacgatacta 3960tgactgagag tcaacgccat
gagcggcctc atttcttatt ctgagttaca acagtccgca 4020ccgctgtccg gtagctcctt
ccggtgggcg cggggcatga ctatcgtcgc cgcacttatg 4080actgtcttct ttatcatgca
actcgtagga caggtgccgg cagcgcccaa cagtcccccg 4140gccacggggc ctgccaccat
acccacgccg aaacaagcgc cctgcaccat tatgttccgg 4200atctgcatcg caggatgctg
ctggctaccc tgtggaacac ctacatctgt attaacgaag 4260cgctaaccgt ttttatcagg
ctctgggagg cagaataaat gatcatatcg tcaattatta 4320cctccacggg gagagcctga
gcaaactggc ctcaggcatt tgagaagcac acggtcacac 4380tgcttccggt agtcaataaa
ccggtaaacc agcaatagac ataagcggct atttaacgac 4440cctgccctga accgacgacc
gggtcgaatt tgctttcgaa tttctgccat tcatccgctt 4500attatcactt attcaggcgt
agcaccaggc gtttaagggc accaataact gccttaaaaa 4560aattacgccc cgccctgcca
ctcatcgcag tactgttgta attcattaag cattctgccg 4620acatggaagc catcacagac
ggcatgatga acctgaatcg ccagcggcat cagcaccttg 4680tcgccttgcg tataatattt
gcccatggtg aaaacggggg cgaagaagtt gtccatattg 4740gccacgttta aatcaaaact
ggtgaaactc acccagggat tggctgagac gaaaaacata 4800ttctcaataa accctttagg
gaaataggcc aggttttcac cgtaacacgc cacatcttgc 4860gaatatatgt gtagaaactg
ccggaaatcg tcgtggtatt cactccagag cgatgaaaac 4920gtttcagttt gctcatggaa
aacggtgtaa caagggtgaa cactatccca tatcaccagc 4980tcaccgtctt tcattgccat
acg
50033538DNAArtificialpACYC-asd-Fmisc_feature(1)..(38)pACYA-asd-F
35gaaggagata tacatatgaa aaatgttggt tttatcgg
383636DNAArtificialpACYC-asd_Rmisc_feature(1)..(36)pACYA-asd-R
36aagggcatcg gtcgacatta cgccagttga cgaagc
36371344DNAE. colimisc_feature(1)..(1344)gdhA 37atggatcaga catattctct
ggagtcattc ctcaaccatg tccaaaagcg cgacccgaat 60caaaccgagt tcgcgcaagc
cgttcgtgaa gtaatgacca cactctggcc ttttcttgaa 120caaaatccaa aatatcgcca
gatgtcatta ctggagcgtc tggttgaacc ggagcgcgtg 180atccagtttc gcgtggtatg
ggttgatgat cgcaatcagg tacaggtcaa ccgtgcatgg 240cgtgtgcagt tcagctctgc
catcggcccg tacaaaggcg gtatgcgctt ccatccgtca 300gttaacctct ccattctcaa
attcctcggc ttcgaacaaa ccttcaaaaa tgccctgact 360accctgccga tgggcggtgg
taaaggcggt agcgattttg acccgaaagg taaaagcgaa 420ggcgaagtga tgcgtttctg
ccaggcgctg atgaccgagc tttatcgtca tctgggcgcg 480gataccgacg ttccggcagg
tgatatcggc gttggtggtc gtgaagtcgg ctttatggcg 540gggatgatga aaaagctctc
caacaatacc gcctgcgtct tcaccggtaa gggcctttca 600tttggcggta gtcttattcg
cccggaagct accggctacg gtctggttta tttcacagaa 660gcaatgttaa aacgccacgg
tatgggtttt gaagggatgc gcgtttccgt ttctggctcc 720ggcaacgtcg cccagtacgc
tatcgaaaaa gcgatggaat ttggtgctcg tgtgatcact 780gcgtcagact ccagcggcac
tgtagttgat gaaagcggat tcacgaaaga gaaactggca 840cgtcttatcg aaatcaaagc
cagccgcgat ggtcgagtgg cagattacgc caaagaattt 900ggtctggtct atctcgaagg
ccaacagccg tggtctgtac cggttgatat cgccctgcct 960tgcgccaccc agaacgaact
ggatgttgac gccgcgcatc agcttatcgc caatggcgtt 1020aaagccgtcg ctgaaggggc
aaatatgccg accaccatcg aagcgactga actgttccag 1080caggcaggcg tactgtttgc
accgggtaaa gcggctaatg ctggcggtgt agcaacgtcg 1140ggcctggaaa tggcacaaaa
cgctgcgcgc ctgggctgga aagccgagaa agttgacgca 1200cgtttgcatc acatcatgct
ggatatccac catgcctgtg ttgagcatgg tggtgaaggt 1260gagcaaacca actacgtgca
gggcgcgaac attgccggtt ttgtgaaggt tgccgatgcg 1320atgctggcgc agggtgtgat
ttaa 134438447PRTE.
coliMISC_FEATURE(1)..(447)gdhA 38Met Asp Gln Thr Tyr Ser Leu Glu Ser Phe
Leu Asn His Val Gln Lys1 5 10
15Arg Asp Pro Asn Gln Thr Glu Phe Ala Gln Ala Val Arg Glu Val Met
20 25 30Thr Thr Leu Trp Pro Phe
Leu Glu Gln Asn Pro Lys Tyr Arg Gln Met 35 40
45Ser Leu Leu Glu Arg Leu Val Glu Pro Glu Arg Val Ile Gln
Phe Arg 50 55 60Val Val Trp Val Asp
Asp Arg Asn Gln Val Gln Val Asn Arg Ala Trp65 70
75 80Arg Val Gln Phe Ser Ser Ala Ile Gly Pro
Tyr Lys Gly Gly Met Arg 85 90
95Phe His Pro Ser Val Asn Leu Ser Ile Leu Lys Phe Leu Gly Phe Glu
100 105 110Gln Thr Phe Lys Asn
Ala Leu Thr Thr Leu Pro Met Gly Gly Gly Lys 115
120 125Gly Gly Ser Asp Phe Asp Pro Lys Gly Lys Ser Glu
Gly Glu Val Met 130 135 140Arg Phe Cys
Gln Ala Leu Met Thr Glu Leu Tyr Arg His Leu Gly Ala145
150 155 160Asp Thr Asp Val Pro Ala Gly
Asp Ile Gly Val Gly Gly Arg Glu Val 165
170 175Gly Phe Met Ala Gly Met Met Lys Lys Leu Ser Asn
Asn Thr Ala Cys 180 185 190Val
Phe Thr Gly Lys Gly Leu Ser Phe Gly Gly Ser Leu Ile Arg Pro 195
200 205Glu Ala Thr Gly Tyr Gly Leu Val Tyr
Phe Thr Glu Ala Met Leu Lys 210 215
220Arg His Gly Met Gly Phe Glu Gly Met Arg Val Ser Val Ser Gly Ser225
230 235 240Gly Asn Val Ala
Gln Tyr Ala Ile Glu Lys Ala Met Glu Phe Gly Ala 245
250 255Arg Val Ile Thr Ala Ser Asp Ser Ser Gly
Thr Val Val Asp Glu Ser 260 265
270Gly Phe Thr Lys Glu Lys Leu Ala Arg Leu Ile Glu Ile Lys Ala Ser
275 280 285Arg Asp Gly Arg Val Ala Asp
Tyr Ala Lys Glu Phe Gly Leu Val Tyr 290 295
300Leu Glu Gly Gln Gln Pro Trp Ser Val Pro Val Asp Ile Ala Leu
Pro305 310 315 320Cys Ala
Thr Gln Asn Glu Leu Asp Val Asp Ala Ala His Gln Leu Ile
325 330 335Ala Asn Gly Val Lys Ala Val
Ala Glu Gly Ala Asn Met Pro Thr Thr 340 345
350Ile Glu Ala Thr Glu Leu Phe Gln Gln Ala Gly Val Leu Phe
Ala Pro 355 360 365Gly Lys Ala Ala
Asn Ala Gly Gly Val Ala Thr Ser Gly Leu Glu Met 370
375 380Ala Gln Asn Ala Ala Arg Leu Gly Trp Lys Ala Glu
Lys Val Asp Ala385 390 395
400Arg Leu His His Ile Met Leu Asp Ile His His Ala Cys Val Glu His
405 410 415Gly Gly Glu Gly Glu
Gln Thr Asn Tyr Val Gln Gly Ala Asn Ile Ala 420
425 430Gly Phe Val Lys Val Ala Asp Ala Met Leu Ala Gln
Gly Val Ile 435 440
445395242DNAArtificialSynthetic DNAmisc_feature(1)..(5242)pACYC-gdhA
plasmid 39gaattccgga tgagcattca tcaggcgggc aagaatgtga ataaaggccg
gataaaactt 60gtgcttattt ttctttacgg tctttaaaaa ggccgtaata tccagctgaa
cggtctggtt 120ataggtacat tgagcaactg actgaaatgc ctcaaaatgt tctttacgat
gccattggga 180tatatcaacg gtggtatatc cagtgatttt tttctccatt ttagcttcct
tagctcctga 240aaatctcgat aactcaaaaa atacgcccgg tagtgatctt atttcattat
ggtgaaagtt 300ggaacctctt acgtgccgat caacgtctca ttttcgccaa aagttggccc
agggcttccc 360ggtatcaaca gggacaccag gatttattta ttctgcgaag tgatcttccg
tcacaggtat 420ttattcggcg caaagtgcgt cgggtgatgc tgccaactta ctgatttagt
gtatgatggt 480gtttttgagg tgctccagtg gcttctgttt ctatcagctg tccctcctgt
tcagctactg 540acggggtggt gcgtaacggc aaaagcaccg ccggacatca gcgctagcgg
agtgtatact 600ggcttactat gttggcactg atgagggtgt cagtgaagtg cttcatgtgg
caggagaaaa 660aaggctgcac cggtgcgtca gcagaatatg tgatacagga tatattccgc
ttcctcgctc 720actgactcgc tacgctcggt cgttcgactg cggcgagcgg aaatggctta
cgaacggggc 780ggagatttcc tggaagatgc caggaagata cttaacaggg aagtgagagg
gccgcggcaa 840agccgttttt ccataggctc cgcccccctg acaagcatca cgaaatctga
cgctcaaatc 900agtggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct
ggcggctccc 960tcgtgcgctc tcctgttcct gcctttcggt ttaccggtgt cattccgctg
ttatggccgc 1020gtttgtctca ttccacgcct gacactcagt tccgggtagg cagttcgctc
caagctggac 1080tgtatgcacg aaccccccgt tcagtccgac cgctgcgcct tatccggtaa
ctatcgtctt 1140gagtccaacc cggaaagaca tgcaaaagca ccactggcag cagccactgg
taattgattt 1200agaggagtta gtcttgaagt catgcgccgg ttaaggctaa actgaaagga
caagttttgg 1260tgactgcgct cctccaagcc agttacctcg gttcaaagag ttggtagctc
agagaacctt 1320cgaaaaaccg ccctgcaagg cggttttttc gttttcagag caagagatta
cgcgcagacc 1380aaaacgatct caagaagatc atcttattaa tcagataaaa tatttctaga
tttcagtgca 1440atttatctct tcaaatgtag cacctgaagt cagccccata cgatataagt
tgtaattctc 1500atgtttgaca gcttatcatc gataagcttc gactgcacgg tgcaccaatg
cttctggcgt 1560caggcagcca tcggaagctg tggtatggct gtgcaggtcg taaatcactg
cataattcgt 1620gtcgctcaag gcgcactccc gttctggata atgttttttg cgccgacatc
ataacggttc 1680tggcaaatat tctgaaatga gctgttgaca attaatcatc cggctcgtat
aatgtgtgga 1740attgtgagcg gataacaatt tcagaattca aaagatcttt taagaaggag
atatacatat 1800ggatcagaca tattctctgg agtcattcct caaccatgtc caaaagcgcg
acccgaatca 1860aaccgagttc gcgcaagccg ttcgtgaagt aatgaccaca ctctggcctt
ttcttgaaca 1920aaatccaaaa tatcgccaga tgtcattact ggagcgtctg gttgaaccgg
agcgcgtgat 1980ccagtttcgc gtggtatggg ttgatgatcg caatcaggta caggtcaacc
gtgcatggcg 2040tgtgcagttc agctctgcca tcggcccgta caaaggcggt atgcgcttcc
atccgtcagt 2100taacctctcc attctcaaat tcctcggctt cgaacaaacc ttcaaaaatg
ccctgactac 2160cctgccgatg ggcggtggta aaggcggtag cgattttgac ccgaaaggta
aaagcgaagg 2220cgaagtgatg cgtttctgcc aggcgctgat gaccgagctt tatcgtcatc
tgggcgcgga 2280taccgacgtt ccggcaggtg atatcggcgt tggtggtcgt gaagtcggct
ttatggcggg 2340gatgatgaaa aagctctcca acaataccgc ctgcgtcttc accggtaagg
gcctttcatt 2400tggcggtagt cttattcgcc cggaagctac cggctacggt ctggtttatt
tcacagaagc 2460aatgttaaaa cgccacggta tgggttttga agggatgcgc gtttccgttt
ctggctccgg 2520caacgtcgcc cagtacgcta tcgaaaaagc gatggaattt ggtgctcgtg
tgatcactgc 2580gtcagactcc agcggcactg tagttgatga aagcggattc acgaaagaga
aactggcacg 2640tcttatcgaa atcaaagcca gccgcgatgg tcgagtggca gattacgcca
aagaatttgg 2700tctggtctat ctcgaaggcc aacagccgtg gtctgtaccg gttgatatcg
ccctgccttg 2760cgccacccag aacgaactgg atgttgacgc cgcgcatcag cttatcgcca
atggcgttaa 2820agccgtcgct gaaggggcaa atatgccgac caccatcgaa gcgactgaac
tgttccagca 2880ggcaggcgta ctgtttgcac cgggtaaagc ggctaatgct ggcggtgtag
caacgtcggg 2940cctggaaatg gcacaaaacg ctgcgcgcct gggctggaaa gccgagaaag
ttgacgcacg 3000tttgcatcac atcatgctgg atatccacca tgcctgtgtt gagcatggtg
gtgaaggtga 3060gcaaaccaac tacgtgcagg gcgcgaacat tgccggtttt gtgaaggttg
ccgatgcgat 3120gctggcgcag ggtgtgattt aagtcgaccg atgcccttga gagccttcaa
cccagtcagc 3180tccttccggt gggcgcgggg catgactatc gtcgccgcac ttatgactgt
cttctttatc 3240atgcaactcg taggacaggt gccggcagcg ctctgggtca ttttcggcga
ggaccgcttt 3300cgctggagcg cgacgatgat cggcctgtcg cttgcggtat tcggaatctt
gcacgccctc 3360gctcaagcct tcgtcactgg tcccgccacc aaacgtttcg gcgagaagca
ggccattatc 3420gccggcatgg cggccgacgc gctgggctac gtcttgctgg cgttcgcgac
gcgaggctgg 3480atggccttcc ccattatgat tcttctcgct tccggcggca tcgggatgcc
cgcgttgcag 3540gccatgctgt ccaggcaggt agatgacgac catcagggac agcttcaagg
atcgctcgcg 3600gctcttacca gcctaacttc gatcactgga ccgctgatcg tcacggcgat
ttatgccgcc 3660tcggcgagca catggaacgg gttggcatgg attgtaggcg ccgccctata
ccttgtctgc 3720ctccccgcgt tgcgtcgcgg tgcatggagc cgggccacct cgacctgaat
ggaagccggc 3780ggcacctcgc taacggattc accactccaa gaattggagc caatcaattc
ttgcggagaa 3840ctgtgaatgc gcaaaccaac ccttggcaga acatatccat cgcgtccgcc
atctccagca 3900gccgcacgcg gcgcatctcg ggcagcgttg ggtcctggcc acgggtgcgc
atgatcgtgc 3960tcctgtcgtt gaggacccgg ctaggctggc ggggttgcct tactggttag
cagaatgaat 4020caccgatacg cgagcgaacg tgaagcgact gctgctgcaa aacgtctgcg
acctgagcaa 4080caacatgaat ggtcttcggt ttccgtgttt cgtaaagtct ggaaacgcgg
aagtccccta 4140cgtgctgctg aagttgcccg caacagagag tggaaccaac cggtgatacc
acgatactat 4200gactgagagt caacgccatg agcggcctca tttcttattc tgagttacaa
cagtccgcac 4260cgctgtccgg tagctccttc cggtgggcgc ggggcatgac tatcgtcgcc
gcacttatga 4320ctgtcttctt tatcatgcaa ctcgtaggac aggtgccggc agcgcccaac
agtcccccgg 4380ccacggggcc tgccaccata cccacgccga aacaagcgcc ctgcaccatt
atgttccgga 4440tctgcatcgc aggatgctgc tggctaccct gtggaacacc tacatctgta
ttaacgaagc 4500gctaaccgtt tttatcaggc tctgggaggc agaataaatg atcatatcgt
caattattac 4560ctccacgggg agagcctgag caaactggcc tcaggcattt gagaagcaca
cggtcacact 4620gcttccggta gtcaataaac cggtaaacca gcaatagaca taagcggcta
tttaacgacc 4680ctgccctgaa ccgacgaccg ggtcgaattt gctttcgaat ttctgccatt
catccgctta 4740ttatcactta ttcaggcgta gcaccaggcg tttaagggca ccaataactg
ccttaaaaaa 4800attacgcccc gccctgccac tcatcgcagt actgttgtaa ttcattaagc
attctgccga 4860catggaagcc atcacagacg gcatgatgaa cctgaatcgc cagcggcatc
agcaccttgt 4920cgccttgcgt ataatatttg cccatggtga aaacgggggc gaagaagttg
tccatattgg 4980ccacgtttaa atcaaaactg gtgaaactca cccagggatt ggctgagacg
aaaaacatat 5040tctcaataaa ccctttaggg aaataggcca ggttttcacc gtaacacgcc
acatcttgcg 5100aatatatgtg tagaaactgc cggaaatcgt cgtggtattc actccagagc
gatgaaaacg 5160tttcagtttg ctcatggaaa acggtgtaac aagggtgaac actatcccat
atcaccagct 5220caccgtcttt cattgccata cg
52424040DNAArtificialSynthetic DNAmisc_feature(1)..(40)Primer
pACYC-gdhA-F 40gaaggagata tacatatgga tcagacatat tctctggagt
404135DNAArtificialSynthetic DNAmisc_feature(1)..(35)Primer
pACYC-gdhA-R 41aagggcatcg gtcgacttaa atcacaccct gcgcc
35
User Contributions:
Comment about this patent or add new information about this topic: