Patent application title: METHODS FOR DEGRADING OR CONVERTING CELLULOSIC MATERIAL
Inventors:
IPC8 Class: AC12P1900FI
USPC Class:
1 1
Class name:
Publication date: 2019-03-21
Patent application number: 20190085365
Abstract:
The present invention relates to methods for degrading or converting a
cellulosic material, comprising: treating the cellulosic material with an
enzyme composition in the presence of a polypeptide having catalase
activity; and enzyme compositions used for degrading or converting a
cellulosic material comprising one or more (e.g., several) enzymes having
cellulolytic and/or hemicellulolytic activity and a polypeptide having
catalase activity.Claims:
1-25. (canceled)
26. A method for degrading or converting a cellulosic material to fermentable sugars, comprising: treating the cellulosic material with an enzyme composition in the presence of a GH61 polypeptide having cellulolytic enhancing activity, one or more oxidoreductases selected from the group consisting of a laccase and a peroxidase, and a polypeptide having catalase activity, wherein the presence of the polypeptide having catalase activity increases the production of fermentable sugars, as compared to a degradation or conversion of a cellulosic material not in the presence of a polypeptide having catalase activity.
27. The method of claim 26, wherein the enzyme composition comprises one or more enzymes selected from the group consisting of an esterase, an expansin, a ligninolytic enzyme, a pectinase, a protease, and a swollenin.
28. The method of claim 26, wherein the cellulosic material is selected from the group consisting of agricultural residue, herbaceous material, municipal solid waste, pulp and paper mill residue, waste paper, wood, arundo, bagasse, bamboo, corn cob, corn fiber, corn stover, miscanthus, orange peel, rice straw, switchgrass, wheat straw, eucalyptus, fir, pine, poplar, spruce, willow, algal cellulose, bacterial cellulose, cotton linter, filter paper, microcrystalline cellulose, and phosphoric-acid treated cellulose.
29. The method of claim 26, wherein the cellulosic material is pretreated by chemical pretreatment, physical pretreatment, or biochemical pretreatment.
30. The method of claim 26, further comprising fermenting the fermentable sugars with one or more fermenting microorganisms to produce a fermentation product.
31. The method of claim 30, wherein the fermentation product comprises ethanol.
32. A method for producing a fermentation product comprising: (a) saccharifying a cellulosic material an enzyme composition in the presence of a GH61 polypeptide having cellulolytic enhancing activity, one or more oxidoreductases selected from the group consisting of a laccase and a peroxidase, and a polypeptide having catalase activity, wherein the presence of the polypeptide having catalase activity increases the production of fermentable sugars, as compared to a degradation or conversion of a cellulosic material not in the presence of a polypeptide having catalase; and (b) fermenting the saccharified cellulosic material with one or more fermenting microorganisms to produce the fermentation product.
33. The method of claim 32, wherein the enzyme composition comprises one or more enzymes selected from the group consisting of an esterase, an expansin, a ligninolytic enzyme, a pectinase, a protease, and a swollenin.
34. The method of claim 32, wherein the cellulosic material is selected from the group consisting of agricultural residue, herbaceous material, municipal solid waste, pulp and paper mill residue, waste paper, wood, arundo, bagasse, bamboo, corn cob, corn fiber, corn stover, miscanthus, orange peel, rice straw, switchgrass, wheat straw, eucalyptus, fir, pine, poplar, spruce, willow, algal cellulose, bacterial cellulose, cotton linter, filter paper, microcrystalline cellulose, and phosphoric-acid treated cellulose.
35. The method of claim 32, wherein the cellulosic material is pretreated by chemical pretreatment, physical pretreatment, or biochemical pretreatment.
36. The method of claim 32, wherein the fermentation product comprises ethanol.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a divisional of U.S. application Ser. No. 15/847,037 filed on Dec. 19, 2017 which is a divisional of U.S. application Ser. No. 13/982,857 filed on Jul. 31, 2013 which is a 35 U.S.C. 371 national application of PCT/CN2012/73040 filed Mar. 26, 2012 which claims priority or the benefit under 35 U.S.C. 119 of Chinese application nos. PCT/CN2011/072143 and PCT/CN2011/084230 filed Mar. 25, 2011 and Dec. 19, 2011 and U.S. provisional application No. 61/477,418 filed Apr. 20, 2011 the contents of which are fully incorporated herein by reference.
REFERENCE TO A SEQUENCE LISTING
[0002] This application contains a Sequence Listing in computer readable form. The computer readable form is incorporated herein by reference.
BACKGROUND OF THE INVENTION
Field of the Invention
[0003] The present invention relates to methods for degrading or converting cellulosic material and enzyme composition for degrading or converting cellulosic material.
Description of the Related Art
[0004] Catalases [hydrogen peroxide: hydrogen peroxide oxidoreductases (EC 1.11.1.6)] are enzymes which catalyze the conversion of hydrogen peroxide (H.sub.2O.sub.2) to oxygen (O.sub.2) and water (H.sub.2O). These ubiquitous enzymes have been purified from a variety of animal tissues, plants and microorganisms (Chance and Maehly, 1955, Methods Enzymol. 2: 764-791).
[0005] Catalase preparations are used commercially for diagnostic enzyme kits, for the enzymatic production of sodium gluconate from glucose, for the neutralization of H.sub.2O.sub.2 waste, for removal of H.sub.2O.sub.2 from textile fabrics, and for the removal of H.sub.2O.sub.2 and/or generation of O.sub.2 in foods and beverages.
[0006] Cellulose is a polymer of simple sugars covalently linked by beta-1,4-bonds. Many microorganisms produce enzymes that hydrolyze beta-linked glucans. These enzymes include endoglucanases, cellobiohydrolases, and beta-glucosidases. Endoglucanases digest the cellulose polymer at random locations, opening it to attack by cellobiohydrolases. Cellobiohydrolases sequentially release molecules of cellobiose from the ends of the cellulose polymer. Cellobiose is a water-soluble beta-1,4-linked dimer of glucose. Beta-glucosidases hydrolyze cellobiose to glucose.
[0007] The conversion of lignocellulosic material has the advantages of the ready availability of large amounts of feedstock and the desirability of avoiding burning or land filling the materials. Wood, agricultural residues, herbaceous crops, and municipal solid wastes have been considered as feedstocks. These materials primarily consist of cellulose, hemicellulose, and lignin. Once the lignocellulose is converted to simple sugars, e.g., glucose, the simple sugars can further be converted to many useful substances, e.g., fuel, potable ethanol, fermentation products and/or chemicals (e.g., acids, alcohols, ketones, gases, and the like).
[0008] It would be advantageous in the art to improve methods for degrading or converting a cellulosic material.
SUMMARY OF THE INVENTION
[0009] The present invention relates to methods for degrading or converting a cellulosic material, comprising: treating the cellulosic material with an enzyme composition in the presence of a polypeptide having catalase activity.
[0010] The present invention also relates to methods for producing a fermentation product, comprising:
[0011] (a) saccharifying a cellulosic material with an enzyme composition in the presence of a polypeptide having catalase activity;
[0012] (b) fermenting the saccharified cellulosic material with one or more (e.g., several) fermenting microorganisms to produce the fermentation product; and
[0013] (c) recovering the fermentation product from the fermentation.
[0014] The present invention further relates to methods of fermenting a cellulosic material, comprising: fermenting the cellulosic material with one or more (e.g., several) fermenting microorganisms, wherein the cellulosic material is hydrolyzed with an enzyme composition in the presence of a polypeptide having catalase activity.
[0015] The present invention even further relates to an enzyme composition for degrading or converting a cellulosic material comprising enzymes having cellulolytic activity and/or xylan degrading activity and a polypeptide having catalase activity; and the uses thereof.
BRIEF DESCRIPTION OF THE FIGURES
[0016] FIG. 1 shows the genomic DNA sequence (SEQ ID NO: 3) and the amino acid sequence (SEQ ID NO: 4) of a Talaromyces stipitatus catalase gene.
[0017] FIG. 2 shows the genomic DNA sequence (SEQ ID NO: 5) and the amino acid sequence (SEQ ID NO: 6) of a Humicola insolens catalase gene.
[0018] FIG. 3 shows the genomic DNA sequence (SEQ ID NO: 7) and the amino acid sequence (SEQ ID NO: 8) of a Penicillium emersonii catalase gene.
DEFINITIONS
[0019] Catalase activity: The term "catalase activity" is defined herein as a hydrogen-peroxide:hydrogen-peroxide oxidoreductase activity (EC 1.11.1.6) that catalyzes the conversion of 2 H.sub.2O.sub.2 to O.sub.2+2 H.sub.2O. For purposes of the present invention, catalase activity is determined according to U.S. Pat. No. 5,646,025. One unit of catalase activity equals the amount of enzyme that catalyzes the oxidation of 1 .mu.mole of hydrogen peroxide under the assay conditions.
[0020] In one aspect, the catalases used in the present invention have at least 20%, e.g., at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 100% of the catalase activity of the mature polypeptide of SEQ ID NO: 2, the mature polypeptide of SEQ ID NO: 4, the mature polypeptide of SEQ ID NO: 6, or the mature polypeptide of SEQ ID NO: 8.
[0021] Acetylxylan esterase: The term "acetylxylan esterase" means a carboxylesterase (EC 3.1.1.72) that catalyzes the hydrolysis of acetyl groups from polymeric xylan, acetylated xylose, acetylated glucose, alpha-napthyl acetate, and p-nitrophenyl acetate. For purposes of the present invention, acetylxylan esterase activity is determined using 0.5 mM p-nitrophenylacetate as substrate in 50 mM sodium acetate pH 5.0 containing 0.01% TWEEN.TM. 20 (polyoxyethylene sorbitan monolaurate). One unit of acetylxylan esterase is defined as the amount of enzyme capable of releasing 1 .mu.mole of p-nitrophenolate anion per minute at pH 5, 25.degree. C.
[0022] Alpha-L-arabinofuranosidase: The term "alpha-L-arabinofuranosidase" means an alpha-L-arabinofuranoside arabinofuranohydrolase (EC 3.2.1.55) that catalyzes the hydrolysis of terminal non-reducing alpha-L-arabinofuranoside residues in alpha-L-arabinosides. The enzyme acts on alpha-L-arabinofuranosides, alpha-L-arabinans containing (1,3)- and/or (1,5)-linkages, arabinoxylans, and arabinogalactans. Alpha-L-arabinofuranosidase is also known as arabinosidase, alpha-arabinosidase, alpha-L-arabinosidase, alpha-arabinofuranosidase, polysaccharide alpha-L-arabinofuranosidase, alpha-L-arabinofuranoside hydrolase, L-arabinosidase, or alpha-L-arabinanase. For purposes of the present invention, alpha-L-arabinofuranosidase activity is determined using 5 mg of medium viscosity wheat arabinoxylan (Megazyme International Ireland, Ltd., Bray, Co. Wicklow, Ireland) per ml of 100 mM sodium acetate pH 5 in a total volume of 200 .mu.l for 30 minutes at 40.degree. C. followed by arabinose analysis by AMINEX.RTM. HPX-87H column chromatography (Bio-Rad Laboratories, Inc., Hercules, Calif., USA).
[0023] Alpha-glucuronidase: The term "alpha-glucuronidase" means an alpha-D-glucosiduronate glucuronohydrolase (EC 3.2.1.139) that catalyzes the hydrolysis of an alpha-D-glucuronoside to D-glucuronate and an alcohol. For purposes of the present invention, alpha-glucuronidase activity is determined according to de Vries, 1998, J. Bacteriol. 180: 243-249. One unit of alpha-glucuronidase equals the amount of enzyme capable of releasing 1 .mu.mole of glucuronic or 4-O-methylglucuronic acid per minute at pH 5, 40.degree. C.
[0024] Beta-glucosidase: The term "beta-glucosidase" means a beta-D-glucoside glucohydrolase (E.C. 3.2.1.21) that catalyzes the hydrolysis of terminal non-reducing beta-D-glucose residues with the release of beta-D-glucose. For purposes of the present invention, beta-glucosidase activity is determined using p-nitrophenyl-beta-D-glucopyranoside as substrate according to the procedure of Venturi et al., 2002, Extracellular beta-D-glucosidase from Chaetomium thermophilum var. coprophilum: production, purification and some biochemical properties, J. Basic Microbiol. 42: 55-66. One unit of beta-glucosidase is defined as 1.0 .mu.mole of p-nitrophenolate anion produced per minute at 25.degree. C., pH 4.8 from 1 mM p-nitrophenyl-beta-D-glucopyranoside as substrate in 50 mM sodium citrate containing 0.01% TWEEN.RTM. 20.
[0025] Beta-xylosidase: The term "beta-xylosidase" means a beta-D-xyloside xylohydrolase (E.C. 3.2.1.37) that catalyzes the exo-hydrolysis of short beta (1.fwdarw.4)-xylooligosaccharides to remove successive D-xylose residues from non-reducing termini. For purposes of the present invention, one unit of beta-xylosidase is defined as 1.0 .mu.mole of p-nitrophenolate anion produced per minute at 40.degree. C., pH 5 from 1 mM p-nitrophenyl-beta-D-xyloside as substrate in 100 mM sodium citrate containing 0.01% TWEEN.RTM. 20.
[0026] cDNA: The term "cDNA" means a DNA molecule that can be prepared by reverse transcription from a mature, spliced, mRNA molecule obtained from a eukaryotic or prokaryotic cell. cDNA lacks intron sequences that may be present in the corresponding genomic DNA. The initial, primary RNA transcript is a precursor to mRNA that is processed through a series of steps, including splicing, before appearing as mature spliced mRNA.
[0027] Cellobiohydrolase: The term "cellobiohydrolase" means a 1,4-beta-D-glucan cellobiohydrolase (E.C. 3.2.1.91 and E.C. 3.2.1.176) that catalyzes the hydrolysis of 1,4-beta-D-glucosidic linkages in cellulose, cellooligosaccharides, or any beta-1,4-linked glucose containing polymer, releasing cellobiose from the reducing or non-reducing ends of the chain (Teeri, 1997, Crystalline cellulose degradation: New insight into the function of cellobiohydrolases, Trends in Biotechnology 15: 160-167; Teeri et al., 1998, Trichoderma reesei cellobiohydrolases: why so efficient on crystalline cellulose?, Biochem. Soc. Trans. 26: 173-178). Cellobiohydrolase activity is determined according to the procedures described by Lever et al., 1972, Anal. Biochem. 47: 273-279; van Tilbeurgh et al., 1982, FEBS Letters, 149: 152-156; van Tilbeurgh and Claeyssens, 1985, FEBS Letters, 187: 283-288; and Tomme et al., 1988, Eur. J. Biochem. 170: 575-581. In the present invention, the Tomme et al. method can be used to determine cellobiohydrolase activity.
[0028] Cellulosic material: The term "cellulosic material" means any material containing cellulose. The predominant polysaccharide in the primary cell wall of biomass is cellulose, the second most abundant is hemicellulose, and the third is pectin. The secondary cell wall, produced after the cell has stopped growing, also contains polysaccharides and is strengthened by polymeric lignin covalently cross-linked to hemicellulose. Cellulose is a homopolymer of anhydrocellobiose and thus a linear beta-(1-4)-D-glucan, while hemicelluloses include a variety of compounds, such as xylans, xyloglucans, arabinoxylans, and mannans in complex branched structures with a spectrum of substituents. Although generally polymorphous, cellulose is found in plant tissue primarily as an insoluble crystalline matrix of parallel glucan chains. Hemicelluloses usually hydrogen bond to cellulose, as well as to other hemicelluloses, which help stabilize the cell wall matrix.
[0029] Cellulose is generally found, for example, in the stems, leaves, hulls, husks, and cobs of plants or leaves, branches, and wood of trees. The cellulosic material can be, but is not limited to, agricultural residue, herbaceous material (including energy crops), municipal solid waste, pulp and paper mill residue, waste paper, and wood (including forestry residue) (see, for example, Wiselogel et al., 1995, in Handbook on Bioethanol (Charles E. Wyman, editor), pp. 105-118, Taylor & Francis, Washington D.C.; Wyman, 1994, Bioresource Technology 50: 3-16; Lynd, 1990, Applied Biochemistry and Biotechnology 24/25: 695-719; Mosier et al., 1999, Recent Progress in Bioconversion of Lignocellulosics, in Advances in Biochemical Engineering/Biotechnology, T. Scheper, managing editor, Volume 65, pp. 23-40, Springer-Verlag, N.Y.). It is understood herein that the cellulose may be in the form of lignocellulose, a plant cell wall material containing lignin, cellulose, and hemicellulose in a mixed matrix. In a preferred aspect, the cellulosic material is any biomass material. In another preferred aspect, the cellulosic material is lignocellulose, which comprises cellulose, hemicelluloses, and lignin.
[0030] In one aspect, the cellulosic material is agricultural residue. In another aspect, the cellulosic material is herbaceous material (including energy crops). In another aspect, the cellulosic material is municipal solid waste. In another aspect, the cellulosic material is pulp and paper mill residue. In another aspect, the cellulosic material is waste paper. In another aspect, the cellulosic material is wood (including forestry residue).
[0031] In another aspect, the cellulosic material is arundo. In another aspect, the cellulosic material is bagasse. In another aspect, the cellulosic material is bamboo. In another aspect, the cellulosic material is corn cob. In another aspect, the cellulosic material is corn fiber. In another aspect, the cellulosic material is corn stover. In another aspect, the cellulosic material is miscanthus. In another aspect, the cellulosic material is orange peel. In another aspect, the cellulosic material is rice straw. In another aspect, the cellulosic material is switchgrass. In another aspect, the cellulosic material is wheat straw.
[0032] In another aspect, the cellulosic material is aspen. In another aspect, the cellulosic material is eucalyptus. In another aspect, the cellulosic material is fir. In another aspect, the cellulosic material is pine. In another aspect, the cellulosic material is poplar. In another aspect, the cellulosic material is spruce. In another aspect, the cellulosic material is willow.
[0033] In another aspect, the cellulosic material is algal cellulose. In another aspect, the cellulosic material is bacterial cellulose. In another aspect, the cellulosic material is cotton linter. In another aspect, the cellulosic material is filter paper. In another aspect, the cellulosic material is microcrystalline cellulose. In another aspect, the cellulosic material is phosphoric-acid treated cellulose.
[0034] In another aspect, the cellulosic material is an aquatic biomass. As used herein the term "aquatic biomass" means biomass produced in an aquatic environment by a photosynthesis process. The aquatic biomass can be algae, emergent plants, floating-leaf plants, or submerged plants.
[0035] The cellulosic material may be used as is or may be subjected to pretreatment, using conventional methods known in the art, as described herein. In a preferred aspect, the cellulosic material is pretreated.
[0036] Cellulolytic enzyme or cellulase: The term "cellulolytic enzyme" or "cellulase" means one or more (e.g., several) enzymes that hydrolyze a cellulosic material. Such enzymes include endoglucanase(s), cellobiohydrolase(s), beta-glucosidase(s), or combinations thereof. The two basic approaches for measuring cellulolytic activity include: (1) measuring the total cellulolytic activity, and (2) measuring the individual cellulolytic activities (endoglucanases, cellobiohydrolases, and beta-glucosidases) as reviewed in Zhang et al., Outlook for cellulase improvement: Screening and selection strategies, 2006, Biotechnology Advances 24: 452-481. Total cellulolytic activity is usually measured using insoluble substrates, including Whatman No1 filter paper, microcrystalline cellulose, bacterial cellulose, algal cellulose, cotton, pretreated lignocellulose, etc. The most common total cellulolytic activity assay is the filter paper assay using Whatman No1 filter paper as the substrate. The assay was established by the International Union of Pure and Applied Chemistry (IUPAC) (Ghose, 1987, Measurement of cellulase activities, Pure Appl. Chem. 59: 257-68).
[0037] For purposes of the present invention, cellulolytic enzyme activity is determined by measuring the increase in hydrolysis of a cellulosic material by cellulolytic enzyme(s) under the following conditions: 1-50 mg of cellulolytic enzyme protein/g of cellulose in PCS (or other pretreated cellulosic material) for 3-7 days at a suitable temperature, e.g., 50.degree. C., 55.degree. C., or 60.degree. C., compared to a control hydrolysis without addition of cellulolytic enzyme protein. Typical conditions are 1 ml reactions, washed or unwashed PCS, 5% insoluble solids, 50 mM sodium acetate pH 5, 1 mM MnSO.sub.4, 50.degree. C., 55.degree. C., or 60.degree. C., 72 hours, sugar analysis by AMINEX.RTM. HPX-87H column (Bio-Rad Laboratories, Inc., Hercules, Calif., USA).
[0038] Coding sequence: The term "coding sequence" means a polynucleotide, which directly specifies the amino acid sequence of a polypeptide. The boundaries of the coding sequence are generally determined by an open reading frame, which begins with a start codon such as ATG, GTG, or TTG and ends with a stop codon such as TAA, TAG, or TGA. The coding sequence may be a genomic DNA, cDNA, synthetic DNA, or a combination thereof.
[0039] Control sequences: The term "control sequences" means nucleic acid sequences necessary for expression of a polynucleotide encoding a mature polypeptide of the present invention. Each control sequence may be native (i.e., from the same gene) or foreign (i.e., from a different gene) to the polynucleotide encoding the polypeptide or native or foreign to each other. Such control sequences include, but are not limited to, a leader, polyadenylation sequence, propeptide sequence, promoter, signal peptide sequence, and transcription terminator. At a minimum, the control sequences include a promoter, and transcriptional and translational stop signals. The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the polynucleotide encoding a polypeptide.
[0040] Endoglucanase: The term "endoglucanase" means an endo-1,4-(1,3;1,4)-beta-D-glucan 4-glucanohydrolase (E.C. 3.2.1.4) that catalyzes endohydrolysis of 1,4-beta-D-glycosidic linkages in cellulose, cellulose derivatives (such as carboxymethyl cellulose and hydroxyethyl cellulose), lichenin, beta-1,4 bonds in mixed beta-1,3 glucans such as cereal beta-D-glucans or xyloglucans, and other plant material containing cellulosic components. Endoglucanase activity can be determined by measuring reduction in substrate viscosity or increase in reducing ends determined by a reducing sugar assay (Zhang et al., 2006, Biotechnology Advances 24: 452-481). For purposes of the present invention, endoglucanase activity is determined using carboxymethyl cellulose (CMC) as substrate according to the procedure of Ghose, 1987, Pure and Appl. Chem. 59: 257-268, at pH 5, 40.degree. C.
[0041] Expression: The term "expression" includes any step involved in the production of a polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
[0042] Expression vector: The term "expression vector" means a linear or circular DNA molecule that comprises a polynucleotide encoding a polypeptide and is operably linked to control sequences that provide for its expression.
[0043] Family 61 glycoside hydrolase: The term "Family 61 glycoside hydrolase" or "Family GH61" or "GH61" means a polypeptide falling into the glycoside hydrolase Family 61 according to Henrissat B., 1991, A classification of glycosyl hydrolases based on amino-acid sequence similarities, Biochem. J. 280: 309-316, and Henrissat B., and Bairoch A., 1996, Updating the sequence-based classification of glycosyl hydrolases, Biochem. J. 316: 695-696. The enzymes in this family were originally classified as a glycoside hydrolase family based on measurement of very weak endo-1,4-beta-D-glucanase activity in one family member. The structure and mode of action of these enzymes are non-canonical and they cannot be considered as bona fide glycosidases. However, they are kept in the CAZy classification on the basis of their capacity to enhance the breakdown of lignocellulose when used in conjunction with a cellulase or a mixture of cellulases.
[0044] Feruloyl esterase: The term "feruloyl esterase" means a 4-hydroxy-3-methoxycinnamoyl-sugar hydrolase (EC 3.1.1.73) that catalyzes the hydrolysis of 4-hydroxy-3-methoxycinnamoyl (feruloyl) groups from esterified sugar, which is usually arabinose in "natural" substrates, to produce ferulate (4-hydroxy-3-methoxycinnamate). Feruloyl esterase is also known as ferulic acid esterase, hydroxycinnamoyl esterase, FAE-III, cinnamoyl ester hydrolase, FAEA, cinnAE, FAE-I, or FAE-II. For purposes of the present invention, feruloyl esterase activity is determined using 0.5 mM p-nitrophenylferulate as substrate in 50 mM sodium acetate pH 5.0. One unit of feruloyl esterase equals the amount of enzyme capable of releasing 1 .mu.mole of p-nitrophenolate anion per minute at pH 5, 25.degree. C.
[0045] Fragment: The term "fragment" means a polypeptide having one or more (e.g., several) amino acids absent from the amino and/or carboxyl terminus of a mature polypeptide main; wherein the fragment has catalase activity. In one aspect, a fragment contains at least 632 amino acid residues, e.g., at least 670 amino acid residues or at least 708 amino acid residues of SEQ ID NO: 2. In another aspect, a fragment contains at least 622 amino acid residues, e.g., at least 659 amino acid residues or at least 696 amino acid residues of SEQ ID NO: 4. In another aspect, a fragment contains at least 652 amino acid residues, e.g., at least 689 amino acid residues or at least 727 amino acid residues of SEQ ID NO: 6. In another aspect, a fragment contains at least 614 amino acid residues, e.g., at least 650 amino acid residues or at least 686 amino acid residues of SEQ ID NO: 8.
[0046] Hemicellulolytic enzyme or hemicellulase: The term "hemicellulolytic enzyme" or "hemicellulase" means one or more (e.g., several) enzymes that hydrolyze a hemicellulosic material. See, for example, Shallom, D. and Shoham, Y. Microbial hemicellulases. Current Opinion In Microbiology, 2003, 6(3): 219-228). Hemicellulases are key components in the degradation of plant biomass. Examples of hemicellulases include, but are not limited to, an acetylmannan esterase, an acetylxylan esterase, an arabinanase, an arabinofuranosidase, a coumaric acid esterase, a feruloyl esterase, a galactosidase, a glucuronidase, a glucuronoyl esterase, a mannanase, a mannosidase, a xylanase, and a xylosidase. The substrates of these enzymes, the hemicelluloses, are a heterogeneous group of branched and linear polysaccharides that are bound via hydrogen bonds to the cellulose microfibrils in the plant cell wall, crosslinking them into a robust network. Hemicelluloses are also covalently attached to lignin, forming together with cellulose a highly complex structure. The variable structure and organization of hemicelluloses require the concerted action of many enzymes for its complete degradation. The catalytic modules of hemicellulases are either glycoside hydrolases (GHs) that hydrolyze glycosidic bonds, or carbohydrate esterases (CEs), which hydrolyze ester linkages of acetate or ferulic acid side groups. These catalytic modules, based on homology of their primary sequence, can be assigned into GH and CE families. Some families, with an overall similar fold, can be further grouped into clans, marked alphabetically (e.g., GH-A). A most informative and updated classification of these and other carbohydrate active enzymes is available in the Carbohydrate-Active Enzymes (CAZy) database. Hemicellulolytic enzyme activities can be measured according to Ghose and Bisaria, 1987, Pure & Appl. Chem. 59: 1739-1752, at a suitable temperature, e.g., 50.degree. C., 55.degree. C., or 60.degree. C., and pH, e.g., 5.0 or 5.5.
[0047] High stringency conditions: The term "high stringency conditions" means for probes of at least 100 nucleotides in length, prehybridization and hybridization at 42.degree. C. in 5.times.SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmon sperm DNA, and 50% formamide, following standard Southern blotting procedures for 12 to 24 hours. The carrier material is finally washed three times each for 15 minutes using 2.times.SSC, 0.2% SDS at 65.degree. C.
[0048] Host cell: The term "host cell" means any cell type that is susceptible to transformation, transfection, transduction, or the like with a nucleic acid construct or expression vector comprising a catalase of the present invention. The term "host cell" encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication.
[0049] Low stringency conditions: The term "low stringency conditions" means for probes of at least 100 nucleotides in length, prehybridization and hybridization at 42.degree. C. in 5.times.SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmon sperm DNA, and 25% formamide, following standard Southern blotting procedures for 12 to 24 hours. The carrier material is finally washed three times each for 15 minutes using 2.times.SSC, 0.2% SDS at 50.degree. C.
[0050] Mature polypeptide: The term "mature polypeptide" means a polypeptide in its final form following translation and any post-translational modifications, such as N-terminal processing, C-terminal truncation, glycosylation, phosphorylation, etc. In one embodiment, the mature polypeptide is amino acids 1 to 746 of SEQ ID NO: 2. In another embodiment, the mature polypeptide is amino acids 20 to 733 of SEQ ID NO: 4 based on the SignalP program that predicts amino acids 1 to 19 of SEQ ID NO: 4 are a signal peptide. In another embodiment, the mature polypeptide is amino acids 20 to 765 of SEQ ID NO: 6 based on the SignalP program that predicts amino acids 1 to 19 of SEQ ID NO: 6 are a signal peptide. In another embodiment, the mature polypeptide is amino acids 20 to 741 of SEQ ID NO: 8 based on the SignalP program that predicts amino acids 1 to 19 of SEQ ID NO: 8 are a signal peptide. It is known in the art that a host cell may produce a mixture of two of more different mature polypeptides (i.e., with a different C-terminal and/or N-terminal amino acid) expressed by the same polynucleotide.
[0051] Mature polypeptide coding sequence: The term "mature polypeptide coding sequence" means a polynucleotide that encodes a mature polypeptide having catalase activity. In one embodiment, the mature polypeptide coding sequence is nucleotides 1 to 2351 of SEQ ID NO: 1 or the cDNA sequence thereof. In another embodiment, the mature polypeptide coding sequence is nucleotides 58 to 2418 of SEQ ID NO: 3 or the cDNA sequence thereof based on the SignalP program that predicts nucleotides 1 to 57 of SEQ ID NO: 3 encode a signal peptide. In another embodiment, the mature polypeptide coding sequence is nucleotides 58 to 3040 of SEQ ID NO: 5 or the cDNA sequence thereof based on the SignalP program that predicts nucleotides 1 to 57 of SEQ ID NO: 5 encode a signal peptide. In another embodiment, the mature polypeptide coding sequence is nucleotides 58 to 2476 of SEQ ID NO: 7 or the cDNA sequence thereof based on the SignalP program that predicts nucleotides 1 to 57 of SEQ ID NO: 7 encode a signal peptide.
[0052] Medium stringency conditions: The term "medium stringency conditions" means for probes of at least 100 nucleotides in length, prehybridization and hybridization at 42.degree. C. in 5.times.SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmon sperm DNA, and 35% formamide, following standard Southern blotting procedures for 12 to 24 hours. The carrier material is finally washed three times each for 15 minutes using 2.times.SSC, 0.2% SDS at 55.degree. C.
[0053] Medium-high stringency conditions: The term "medium-high stringency conditions" means for probes of at least 100 nucleotides in length, prehybridization and hybridization at 42.degree. C. in 5.times.SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmon sperm DNA, and either 35% formamide, following standard Southern blotting procedures for 12 to 24 hours. The carrier material is finally washed three times each for 15 minutes using 2.times.SSC, 0.2% SDS at 60.degree. C.
[0054] Nucleic acid construct: The term "nucleic acid construct" means a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or is modified to contain segments of nucleic acids in a manner that would not otherwise exist in nature or which is synthetic, which comprises one or more control sequences.
[0055] Operably linked: The term "operably linked" means a configuration in which a control sequence is placed at an appropriate position relative to the coding sequence of a polynucleotide such that the control sequence directs expression of the coding sequence.
[0056] Polypeptide having cellulolytic enhancing activity: The term "polypeptide having cellulolytic enhancing activity" means a GH61 polypeptide that catalyzes the enhancement of the hydrolysis of a cellulosic material by enzyme having cellulolytic activity. For purposes of the present invention, cellulolytic enhancing activity is determined by measuring the increase in reducing sugars or the increase of the total of cellobiose and glucose from the hydrolysis of a cellulosic material by cellulolytic enzyme under the following conditions: 1-50 mg of total protein/g of cellulose in PCS, wherein total protein is comprised of 50-99.5% w/w cellulolytic enzyme protein and 0.5-50% w/w protein of a GH61 polypeptide having cellulolytic enhancing activity for 1-7 days at a suitable temperature, e.g., 50.degree. C., 55.degree. C., or 60.degree. C., and pH, e.g., 5.0 or 5.5, compared to a control hydrolysis with equal total protein loading without cellulolytic enhancing activity (1-50 mg of cellulolytic protein/g of cellulose in PCS). In a preferred aspect, a mixture of CELLUCLAST.RTM. 1.5L (Novozymes A/S, Bagsv.ae butted.rd, Denmark) in the presence of 2-3% of total protein weight Aspergillus oryzae beta-glucosidase (recombinantly produced in Aspergillus oryzae according to WO 02/095014) or 2-3% of total protein weight Aspergillus fumigatus beta-glucosidase (recombinantly produced in Aspergillus oryzae as described in WO 2002/095014) of cellulase protein loading is used as the source of the cellulolytic activity.
[0057] The GH61 polypeptides having cellulolytic enhancing activity enhance the hydrolysis of a cellulosic material catalyzed by enzyme having cellulolytic activity by reducing the amount of cellulolytic enzyme required to reach the same degree of hydrolysis preferably at least 1.01-fold, e.g., at least 1.05-fold, at least 1.10-fold, at least 1.25-fold, at least 1.5-fold, at least 2-fold, at least 3-fold, at least 4-fold, at least 5-fold, at least 10-fold, or at least 20-fold.
[0058] Pretreated corn stover: The term "PCS" or "Pretreated Corn Stover" means a cellulosic material derived from corn stover by treatment with heat and dilute sulfuric acid, alkaline pretreatment, or neutral pretreatment.
[0059] Sequence identity: The relatedness between two amino acid sequences or between two nucleotide sequences is described by the parameter "sequence identity".
[0060] For purposes of the present invention, the sequence identity between two amino acid sequences is determined using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443-453) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, Trends Genet. 16: 276-277), preferably version 5.0.0 or later. The parameters used are gap open penalty of 10, gap extension penalty of 0.5, and the EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix. The output of Needle labeled "longest identity" (obtained using the -nobrief option) is used as the percent identity and is calculated as follows:
(Identical Residues.times.100)/(Length of Alignment-Total Number of Gaps in Alignment)
[0061] For purposes of the present invention, the sequence identity between two deoxyribonucleotide sequences is determined using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, supra) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, supra), preferably version 5.0.0 or later. The parameters used are gap open penalty of 10, gap extension penalty of 0.5, and the EDNAFULL (EMBOSS version of NCBI NUC4.4) substitution matrix. The output of Needle labeled "longest identity" (obtained using the -nobrief option) is used as the percent identity and is calculated as follows:
(Identical Deoxyribonucleotides.times.100)/(Length of Alignment-Total Number of Gaps in Alignment)
[0062] Variant: The term "variant" means a polypeptide having catalase activity comprising an alteration, i.e., a substitution, insertion, and/or deletion, at one or more (e.g., several) positions. A substitution means replacement of the amino acid occupying a position with a different amino acid; a deletion means removal of the amino acid occupying a position; and an insertion means adding an amino acid adjacent to and immediately following the amino acid occupying a position.
[0063] Very high stringency conditions: The term "very high stringency conditions" means for probes of at least 100 nucleotides in length, prehybridization and hybridization at 42.degree. C. in 5.times.SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmon sperm DNA, and 50% formamide, following standard Southern blotting procedures for 12 to 24 hours. The carrier material is finally washed three times each for 15 minutes using 2.times.SSC, 0.2% SDS at 70.degree. C.
[0064] Very low stringency conditions: The term "very low stringency conditions" means for probes of at least 100 nucleotides in length, prehybridization and hybridization at 42.degree. C. in 5.times.SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured salmon sperm DNA, and 25% formamide, following standard Southern blotting procedures for 12 to 24 hours. The carrier material is finally washed three times each for 15 minutes using 2.times.SSC, 0.2% SDS at 45.degree. C.
[0065] Xylan-containing material: The term "xylan-containing material" means any material comprising a plant cell wall polysaccharide containing a backbone of beta-(1-4)-linked xylose residues. Xylans of terrestrial plants are heteropolymers possessing a beta-(1-4)-D-xylopyranose backbone, which is branched by short carbohydrate chains. They comprise D-glucuronic acid or its 4-O-methyl ether, L-arabinose, and/or various oligosaccharides, composed of D-xylose, L-arabinose, D- or L-galactose, and D-glucose. Xylan-type polysaccharides can be divided into homoxylans and heteroxylans, which include glucuronoxylans, (arabino)glucuronoxylans, (glucurono)arabinoxylans, arabinoxylans, and complex heteroxylans. See, for example, Ebringerova et al., 2005, Adv. Polym. Sci. 186: 1-67.
[0066] In the processes of the present invention, any material containing xylan may be used. In a preferred aspect, the xylan-containing material is lignocellulose.
[0067] Xylan degrading activity or xylanolytic activity: The term "xylan degrading activity" or "xylanolytic activity" means a biological activity that hydrolyzes xylan-containing material. The two basic approaches for measuring xylanolytic activity include: (1) measuring the total xylanolytic activity, and (2) measuring the individual xylanolytic activities (e.g., endoxylanases, beta-xylosidases, arabinofuranosidases, alpha-glucuronidases, acetylxylan esterases, feruloyl esterases, and alpha-glucuronyl esterases). Recent progress in assays of xylanolytic enzymes was summarized in several publications including Biely and Puchard, Recent progress in the assays of xylanolytic enzymes, 2006, Journal of the Science of Food and Agriculture 86(11): 1636-1647; Spanikova and Biely, 2006, Glucuronoyl esterase--Novel carbohydrate esterase produced by Schizophyllum commune, FEBS Letters 580(19): 4597-4601; Herrmann, Vrsanska, Jurickova, Hirsch, Biely, and Kubicek, 1997, The beta-D-xylosidase of Trichoderma reesei is a multifunctional beta-D-xylan xylohydrolase, Biochemical Journal 321: 375-381.
[0068] Total xylan degrading activity can be measured by determining the reducing sugars formed from various types of xylan, including, for example, oat spelt, beechwood, and larchwood xylans, or by photometric determination of dyed xylan fragments released from various covalently dyed xylans. The most common total xylanolytic activity assay is based on production of reducing sugars from polymeric 4-O-methyl glucuronoxylan as described in Bailey, Biely, Poutanen, 1992, Interlaboratory testing of methods for assay of xylanase activity, Journal of Biotechnology 23(3): 257-270. Xylanase activity can also be determined with 0.2% AZCL-arabinoxylan as substrate in 0.01% TRITON.RTM. X-100 (4-(1,1,3,3-tetramethylbutyl)phenyl-polyethylene glycol) and 200 mM sodium phosphate buffer pH 6 at 37.degree. C. One unit of xylanase activity is defined as 1.0 .mu.mole of azurine produced per minute at 37.degree. C., pH 6 from 0.2% AZCL-arabinoxylan as substrate in 200 mM sodium phosphate pH 6 buffer.
[0069] For purposes of the present invention, xylan degrading activity is determined by measuring the increase in hydrolysis of birchwood xylan (Sigma Chemical Co., Inc., St. Louis, Mo., USA) by xylan-degrading enzyme(s) under the following typical conditions: 1 ml reactions, 5 mg/ml substrate (total solids), 5 mg of xylanolytic protein/g of substrate, 50 mM sodium acetate pH 5, 50.degree. C., 24 hours, sugar analysis using p-hydroxybenzoic acid hydrazide (PHBAH) assay as described by Lever, 1972, A new reaction for colorimetric determination of carbohydrates, Anal. Biochem 47: 273-279.
[0070] Xylanase: The term "xylanase" means a 1,4-beta-D-xylan-xylohydrolase (E.C. 3.2.1.8) that catalyzes the endohydrolysis of 1,4-beta-D-xylosidic linkages in xylans. For purposes of the present invention, xylanase activity is determined with 0.2% AZCL-arabinoxylan as substrate in 0.01% TRITON.RTM. X-100 and 200 mM sodium phosphate buffer pH 6 at 37.degree. C. One unit of xylanase activity is defined as 1.0 .mu.mole of azurine produced per minute at 37.degree. C., pH 6 from 0.2% AZCL-arabinoxylan as substrate in 200 mM sodium phosphate pH 6 buffer.
DETAILED DESCRIPTION OF THE INVENTION
Methods of Processing Cellulosic Material
[0071] The present invention relates to methods for degrading or converting a cellulosic material, comprising: treating the cellulosic material with an enzyme composition in the presence of a polypeptide having catalase activity. In one aspect, the method further comprises recovering the degraded or converted cellulosic material.
[0072] The present invention also relates to methods for producing a fermentation product, comprising:
[0073] (a) saccharifying a cellulosic material with an enzyme composition in the presence of a polypeptide having catalase activity;
[0074] (b) fermenting the saccharified cellulosic material with one or more (e.g., several) fermenting microorganisms to produce the fermentation product; and
[0075] (c) recovering the fermentation product from the fermentation.
[0076] The present invention further relates to methods of fermenting a cellulosic material, comprising: fermenting the cellulosic material with one or more (e.g., several) fermenting microorganisms, wherein the cellulosic material is hydrolyzed with an enzyme composition in the presence of a polypeptide having catalase activity. In one aspect, the fermenting of the cellulosic material produces a fermentation product. In another aspect, the method further comprises recovering the fermentation product from the fermentation.
[0077] In the methods described above, the presence of the polypeptide having catalase activity increases the hydrolysis of the cellulosic material compared to the absence of the polypeptide having catalase activity.
[0078] The methods of the present invention can be used to saccharify the cellulosic material to fermentable sugars and to convert the fermentable sugars to many useful fermentation products, e.g., fuel, potable ethanol, and/or platform chemicals (e.g., acids, alcohols, ketones, gases, and the like). The production of a desired fermentation product from the cellulosic material typically involves pretreatment, enzymatic hydrolysis (saccharification), and fermentation.
[0079] The processing of the cellulosic material according to the present invention can be accomplished using processes conventional in the art. Moreover, the methods of the present invention can be implemented using any conventional biomass processing apparatus configured to operate in accordance with the invention.
[0080] Hydrolysis (saccharification) and fermentation, separate or simultaneous, include, but are not limited to, separate hydrolysis and fermentation (SHF); simultaneous saccharification and fermentation (SSF); simultaneous saccharification and co-fermentation (SSCF); hybrid hydrolysis and fermentation (HHF); separate hydrolysis and co-fermentation (SHCF); hybrid hydrolysis and co-fermentation (HHCF); and direct microbial conversion (DMC), also sometimes called consolidated bioprocessing (CBP). SHF uses separate process steps to first enzymatically hydrolyze the cellulosic material to fermentable sugars, e.g., glucose, cellobiose, cellotriose, and pentose monomers, and then ferment the fermentable sugars to ethanol. In SSF, the enzymatic hydrolysis of the cellulosic material and the fermentation of sugars to ethanol are combined in one step (Philippidis, G. P., 1996, Cellulose bioconversion technology, in Handbook on Bioethanol: Production and Utilization, Wyman, C. E., ed., Taylor & Francis, Washington, D.C., 179-212). SSCF involves the co-fermentation of multiple sugars (Sheehan, J., and Himmel, M., 1999, Enzymes, energy and the environment: A strategic perspective on the U.S. Department of Energy's research and development activities for bioethanol, Biotechnol. Prog. 15: 817-827). HHF involves a separate hydrolysis step, and in addition a simultaneous saccharification and hydrolysis step, which can be carried out in the same reactor. The steps in an HHF process can be carried out at different temperatures, i.e., high temperature enzymatic saccharification followed by SSF at a lower temperature that the fermentation strain can tolerate. DMC combines all three processes (enzyme production, hydrolysis, and fermentation) in one or more (e.g., several) steps where the same organism is used to produce the enzymes for conversion of the cellulosic material to fermentable sugars and to convert the fermentable sugars into a final product (Lynd, L. R., Weimer, P. J., van Zyl, W. H., and Pretorius, I. S., 2002, Microbial cellulose utilization: Fundamentals and biotechnology, Microbiol. Mol. Biol. Reviews 66: 506-577). It is understood herein that any method known in the art comprising pretreatment, enzymatic hydrolysis (saccharification), fermentation, or a combination thereof can be used in the practicing the methods of the present invention.
[0081] A conventional apparatus can include a fed-batch stirred reactor, a batch stirred reactor, a continuous flow stirred reactor with ultrafiltration, and/or a continuous plug-flow column reactor (Fernanda de Castilhos Corazza, Flavio Faria de Moraes, Gisella Maria Zanin and Ivo Neitzel, 2003, Optimal control in fed-batch reactor for the cellobiose hydrolysis, Acta Scientiarum. Technology 25: 33-38; Gusakov, A. V., and Sinitsyn, A. P., 1985, Kinetics of the enzymatic hydrolysis of cellulose: 1. A mathematical model for a batch reactor process, Enz. Microb. Technol. 7: 346-352), an attrition reactor (Ryu, S. K., and Lee, J. M., 1983, Bioconversion of waste cellulose by using an attrition bioreactor, Biotechnol. Bioeng. 25: 53-65), or a reactor with intensive stirring induced by an electromagnetic field (Gusakov, A. V., Sinitsyn, A. P., Davydkin, I. Y., Davydkin, V. Y., Protas, O. V., 1996, Enhancement of enzymatic cellulose hydrolysis using a novel type of bioreactor with intensive stirring induced by electromagnetic field, Appl. Biochem. Biotechnol. 56: 141-153). Additional reactor types include: fluidized bed, upflow blanket, immobilized, and extruder type reactors for hydrolysis and/or fermentation.
[0082] Pretreatment. In practicing the methods of the present invention, any pretreatment process known in the art can be used to disrupt plant cell wall components of cellulosic material (Chandra et al., 2007, Substrate pretreatment: The key to effective enzymatic hydrolysis of lignocellulosics? Adv. Biochem. Engin./Biotechnol. 108: 67-93; Galbe and Zacchi, 2007, Pretreatment of lignocellulosic materials for efficient bioethanol production, Adv. Biochem. Engin./Biotechnol. 108: 41-65; Hendriks and Zeeman, 2009, Pretreatments to enhance the digestibility of lignocellulosic biomass, Bioresource Technol. 100: 10-18; Mosier et al., 2005, Features of promising technologies for pretreatment of lignocellulosic biomass, Bioresource Technol. 96: 673-686; Taherzadeh and Karimi, 2008, Pretreatment of lignocellulosic wastes to improve ethanol and biogas production: A review, Int. J. of Mol. Sci. 9: 1621-1651; Yang and Wyman, 2008, Pretreatment: the key to unlocking low-cost cellulosic ethanol, Biofuels Bioproducts and Biorefining-Biofpr. 2: 26-40).
[0083] The cellulosic material can also be subjected to particle size reduction, pre-soaking, wetting, washing, and/or conditioning prior to pretreatment using methods known in the art.
[0084] Conventional pretreatments include, but are not limited to, steam pretreatment (with or without explosion), dilute acid pretreatment, hot water pretreatment, alkaline pretreatment, lime pretreatment, wet oxidation, wet explosion, ammonia fiber explosion, organosolv pretreatment, and biological pretreatment. Additional pretreatments include ammonia percolation, ultrasound, electroporation, microwave, supercritical CO.sub.2, supercritical H.sub.2O, ozone, ionic liquid, and gamma irradiation pretreatments.
[0085] The cellulosic material can be pretreated before hydrolysis and/or fermentation. Pretreatment is preferably performed prior to the hydrolysis. Alternatively, the pretreatment can be carried out simultaneously with enzyme hydrolysis to release fermentable sugars, such as glucose, xylose, and/or cellobiose. In most cases the pretreatment step itself results in some conversion of biomass to fermentable sugars (even in absence of enzymes).
[0086] Steam Pretreatment. In steam pretreatment, cellulosic material is heated to disrupt the plant cell wall components, including lignin, hemicellulose, and cellulose to make the cellulose and other fractions, e.g., hemicellulose, accessible to enzymes. Cellulosic material is passed to or through a reaction vessel where steam is injected to increase the temperature to the required temperature and pressure and is retained therein for the desired reaction time. Steam pretreatment is preferably done at 140-230.degree. C., more preferably 160-200.degree. C., and most preferably 170-190.degree. C., where the optimal temperature range depends on any addition of a chemical catalyst. Residence time for the steam pretreatment is preferably 1-30 minutes, more preferably 1-15 minutes, even more preferably 3-12 minutes, and most preferably 4-10 minutes, where the optimal residence time depends on temperature range and any addition of a chemical catalyst. Steam pretreatment allows for relatively high solids loadings, so that cellulosic material is generally only moist during the pretreatment. The steam pretreatment is often combined with an explosive discharge of the material after the pretreatment, which is known as steam explosion, that is, rapid flashing to atmospheric pressure and turbulent flow of the material to increase the accessible surface area by fragmentation (Duff and Murray, 1996, Bioresource Technology 855: 1-33; Galbe and Zacchi, 2002, Appl. Microbiol. Biotechnol. 59: 618-628; U.S. Patent Application No. 20020164730). During steam pretreatment, hemicellulose acetyl groups are cleaved and the resulting acid autocatalyzes partial hydrolysis of the hemicellulose to monosaccharides and oligosaccharides. Lignin is removed to only a limited extent.
[0087] A catalyst such as H.sub.2SO.sub.4 or SO.sub.2 (typically 0.3 to 3% w/w) is often added prior to steam pretreatment, which decreases the time and temperature, increases the recovery, and improves enzymatic hydrolysis (Ballesteros et al., 2006, Appl. Biochem. Biotechnol. 129-132: 496-508; Varga et al., 2004, Appl. Biochem. Biotechnol. 113-116: 509-523; Sassner et al., 2006, Enzyme Microb. Technol. 39: 756-762).
[0088] Chemical Pretreatment: The term "chemical treatment" refers to any chemical pretreatment that promotes the separation and/or release of cellulose, hemicellulose, and/or lignin. Examples of suitable chemical pretreatment processes include, for example, dilute acid pretreatment, lime pretreatment, wet oxidation, ammonia fiber/freeze explosion (AFEX), ammonia percolation (APR), ionic liquid, and organosolv pretreatments.
[0089] In dilute acid pretreatment, cellulosic material is mixed with dilute acid, typically H.sub.2SO.sub.4, and water to form a slurry, heated by steam to the desired temperature, and after a residence time flashed to atmospheric pressure. The dilute acid pretreatment can be performed with a number of reactor designs, e.g., plug-flow reactors, counter-current reactors, or continuous counter-current shrinking bed reactors (Duff and Murray, 1996, supra; Schell et al., 2004, Bioresource Technol. 91: 179-188; Lee et al., 1999, Adv. Biochem. Eng. Biotechnol. 65: 93-115).
[0090] Several methods of pretreatment under alkaline conditions can also be used. These alkaline pretreatments include, but are not limited to, lime pretreatment, wet oxidation, ammonia percolation (APR), and ammonia fiber/freeze explosion (AFEX).
[0091] Lime pretreatment is performed with calcium carbonate, sodium hydroxide, or ammonia at low temperatures of 85-150.degree. C. and residence times from 1 hour to several days (Wyman et al., 2005, Bioresource Technol. 96: 1959-1966; Mosier et al., 2005, Bioresource Technol. 96: 673-686). WO 2006/110891, WO 2006/110899, WO 2006/110900, and WO 2006/110901 disclose pretreatment methods using ammonia.
[0092] Wet oxidation is a thermal pretreatment performed typically at 180-200.degree. C. for 5-15 minutes with addition of an oxidative agent such as hydrogen peroxide or over-pressure of oxygen (Schmidt and Thomsen, 1998, Bioresource Technol. 64: 139-151; Palonen et al., 2004, Appl. Biochem. Biotechnol. 117: 1-17; Varga et al., 2004, Biotechnol. Bioeng. 88: 567-574; Martin et al., 2006, J. Chem. Technol. Biotechnol. 81: 1669-1677). The pretreatment is performed at preferably 1-40% dry matter, more preferably 2-30% dry matter, and most preferably 5-20% dry matter, and often the initial pH is increased by the addition of alkali such as sodium carbonate.
[0093] A modification of the wet oxidation pretreatment method, known as wet explosion (combination of wet oxidation and steam explosion), can handle dry matter up to 30%. In wet explosion, the oxidizing agent is introduced during pretreatment after a certain residence time. The pretreatment is then ended by flashing to atmospheric pressure (WO 2006/032282).
[0094] Ammonia fiber explosion (AFEX) involves treating cellulosic material with liquid or gaseous ammonia at moderate temperatures such as 90-100.degree. C. and high pressure such as 17-20 bar for 5-10 minutes, where the dry matter content can be as high as 60% (Gollapalli et al., 2002, Appl. Biochem. Biotechnol. 98: 23-35; Chundawat et al., 2007, Biotechnol. Bioeng. 96: 219-231; Alizadeh et al., 2005, Appl. Biochem. Biotechnol. 121: 1133-1141; Teymouri et al., 2005, Bioresource Technol. 96: 2014-2018). AFEX pretreatment results in the depolymerization of cellulose and partial hydrolysis of hemicellulose. Lignin-carbohydrate complexes are cleaved.
[0095] Organosolv pretreatment delignifies cellulosic material by extraction using aqueous ethanol (40-60% ethanol) at 160-200.degree. C. for 30-60 minutes (Pan et al., 2005, Biotechnol. Bioeng. 90: 473-481; Pan et al., 2006, Biotechnol. Bioeng. 94: 851-861; Kurabi et al., 2005, Appl. Biochem. Biotechnol. 121: 219-230). Sulphuric acid is usually added as a catalyst. In organosolv pretreatment, the majority of hemicellulose is removed.
[0096] Other examples of suitable pretreatment methods are described by Schell et al., 2003, Appl. Biochem. and Biotechnol. Vol. 105-108, p. 69-85, and Mosier et al., 2005, Bioresource Technology 96: 673-686, and U.S. Published Application 2002/0164730.
[0097] In one aspect, the chemical pretreatment is preferably carried out as an acid treatment, and more preferably as a continuous dilute and/or mild acid treatment. The acid is typically sulfuric acid, but other acids can also be used, such as acetic acid, citric acid, nitric acid, phosphoric acid, tartaric acid, succinic acid, hydrogen chloride, or mixtures thereof. Mild acid treatment is conducted in the pH range of preferably 1-5, more preferably 1-4, and most preferably 1-3. In one aspect, the acid concentration is in the range from preferably 0.01 to 20 wt % acid, more preferably 0.05 to 10 wt % acid, even more preferably 0.1 to 5 wt % acid, and most preferably 0.2 to 2.0 wt % acid. The acid is contacted with cellulosic material and held at a temperature in the range of preferably 160-220.degree. C., and more preferably 165-195.degree. C., for periods ranging from seconds to minutes to, e.g., 1 second to 60 minutes.
[0098] In another aspect, pretreatment is carried out as an ammonia fiber explosion step (AFEX pretreatment step).
[0099] In another aspect, pretreatment takes place in an aqueous slurry. In preferred aspects, cellulosic material is present during pretreatment in amounts preferably between 10-80 wt %, more preferably between 20-70 wt %, and most preferably between 30-60 wt %, such as around 50 wt %. The pretreated cellulosic material can be unwashed or washed using any method known in the art, e.g., washed with water.
[0100] Mechanical Pretreatment: The term "mechanical pretreatment" refers to various types of grinding or milling (e.g., dry milling, wet milling, or vibratory ball milling).
[0101] Physical Pretreatment: The term "physical pretreatment" refers to any pretreatment that promotes the separation and/or release of cellulose, hemicellulose, and/or lignin from cellulosic material. For example, physical pretreatment can involve irradiation (e.g., microwave irradiation), steaming/steam explosion, hydrothermolysis, and combinations thereof.
[0102] Physical pretreatment can involve high pressure and/or high temperature (steam explosion). In one aspect, high pressure means pressure in the range of preferably about 300 to about 600 psi, more preferably about 350 to about 550 psi, and most preferably about 400 to about 500 psi, such as around 450 psi. In another aspect, high temperature means temperatures in the range of about 100 to about 300.degree. C., preferably about 140 to about 235.degree. C. In a preferred aspect, mechanical pretreatment is performed in a batch-process, steam gun hydrolyzer system that uses high pressure and high temperature as defined above, e.g., a Sunds Hydrolyzer available from Sunds Defibrator AB, Sweden.
[0103] Combined Physical and Chemical Pretreatment: Cellulosic material can be pretreated both physically and chemically. For instance, the pretreatment step can involve dilute or mild acid treatment and high temperature and/or pressure treatment. The physical and chemical pretreatments can be carried out sequentially or simultaneously, as desired. A mechanical pretreatment can also be included.
[0104] Accordingly, in a preferred aspect, cellulosic material is subjected to mechanical, chemical, or physical pretreatment, or any combination thereof, to promote the separation and/or release of cellulose, hemicellulose, and/or lignin.
[0105] Biological Pretreatment: The term "biological pretreatment" refers to any biological pretreatment that promotes the separation and/or release of cellulose, hemicellulose, and/or lignin from cellulosic material. Biological pretreatment techniques can involve applying lignin-solubilizing microorganisms (see, for example, Hsu, T.-A., 1996, Pretreatment of biomass, in Handbook on Bioethanol: Production and Utilization, Wyman, C. E., ed., Taylor & Francis, Washington, D.C., 179-212; Ghosh and Singh, 1993, Physicochemical and biological treatments for enzymatic/microbial conversion of cellulosic biomass, Adv. Appl. Microbiol. 39: 295-333; McMillan, J. D., 1994, Pretreating lignocellulosic biomass: a review, in Enzymatic Conversion of Biomass for Fuels Production, Himmel, M. E., Baker, J. O., and Overend, R. P., eds., ACS Symposium Series 566, American Chemical Society, Washington, D.C., chapter 15; Gong, C. S., Cao, N. J., Du, J., and Tsao, G. T., 1999, Ethanol production from renewable resources, in Advances in Biochemical Engineering/Biotechnology, Scheper, T., ed., Springer-Verlag Berlin Heidelberg, Germany, 65: 207-241; Olsson and Hahn-Hagerdal, 1996, Fermentation of lignocellulosic hydrolysates for ethanol production, Enz. Microb. Tech. 18: 312-331; and Vallander and Eriksson, 1990, Production of ethanol from lignocellulosic materials: State of the art, Adv. Biochem. Eng./Biotechnol. 42: 63-95).
[0106] Saccharification. In the hydrolysis step, also known as saccharification, the cellulosic material, e.g., pretreated, is hydrolyzed to break down cellulose and alternatively also hemicellulose to sugars, such as glucose, cellobiose, xylose, xylulose, arabinose, mannose, galactose, and/or soluble oligosaccharides. The sugars, and/or soluble oligosaccharides can further be further used to produce an alcohol (e.g., arabinitol, n-butanol, isobutanol, ethanol, glycerol, methanol, ethylene glycol, 1,3-propanediol [propylene glycol], butanediol, glycerin, sorbitol, and xylitol); an alkane (e.g., pentane, hexane, heptane, octane, nonane, decane, undecane, and dodecane), a cycloalkane (e.g., cyclopentane, cyclohexane, cycloheptane, and cyclooctane), an alkene (e.g. pentene, hexene, heptene, and octene); an amino acid (e.g., aspartic acid, glutamic acid, glycine, lysine, serine, and threonine); a gas (e.g., methane, hydrogen (H.sub.2), carbon dioxide (CO2), and carbon monoxide (CO)); isoprene; a ketone (e.g., acetone); an organic acid (e.g., acetic acid, acetonic acid, adipic acid, ascorbic acid, citric acid, 2,5-diketo-D-gluconic acid, formic acid, fumaric acid, glucaric acid, gluconic acid, glucuronic acid, glutaric acid, 3-hydroxypropionic acid, itaconic acid, lactic acid, malic acid, malonic acid, oxalic acid, oxaloacetic acid, propionic acid, succinic acid, and xylonic acid); and polyketide.
[0107] The hydrolysis is performed enzymatically by an enzyme composition in the presence of a polypeptide having catalase activity of the present invention. The enzymes of the compositions and the polypeptide having catalase activity can also be added simultaneously or sequentially.
[0108] Enzymatic hydrolysis is preferably carried out in a suitable aqueous environment under conditions that can be readily determined by one skilled in the art. In one aspect, hydrolysis is performed under conditions suitable for the activity of the enzyme(s), i.e., optimal for the enzyme(s). The hydrolysis can be carried out as a fed batch or continuous process where the cellulosic material is fed gradually to, for example, an enzyme containing hydrolysis solution.
[0109] The saccharification is generally performed in stirred-tank reactors or fermentors under controlled pH, temperature, and mixing conditions. Suitable process time, temperature and pH conditions can readily be determined by one skilled in the art. For example, the saccharification can last up to 200 hours, but is typically performed for preferably about 12 to about 120 hours, e.g., about 16 to about 72 hours or about 24 to about 48 hours. The temperature is in the range of preferably about 25.degree. C. to about 70.degree. C., e.g., about 30.degree. C. to about 65.degree. C., about 40.degree. C. to about 60.degree. C., or about 50.degree. C. to about 55.degree. C. The pH is in the range of preferably about 3 to about 8, e.g., about 3.5 to about 7, about 4 to about 6, or about 5.0 to about 5.5. The dry solids content is in the range of preferably about 5 to about 50 wt %, e.g., about 10 to about 40 wt % or about 20 to about 30 wt %.
Enzyme Composition
[0110] The enzyme compositions can comprise any protein that is useful in degrading or converting cellulosic material. The compositions may comprise one enzyme as the major enzymatic component, e.g., a mono-component composition, or multiple enzymes. The compositions may be prepared in accordance with methods known in the art and may be in the form of a liquid or a dry composition. The compositions may be stabilized in accordance with methods known in the art.
[0111] In one aspect, an enzyme composition for degrading or converting a cellulosic material comprises one or more (e.g., several) enzymes having cellulolytic and/or hemicellulolytic activity and a polypeptide having catalase activity.
[0112] In an embodiment, the enzyme composition comprises or further comprises one or more (e.g., several) proteins selected from the group consisting of a cellulase, a GH61 polypeptide having cellulolytic enhancing activity, a hemicellulase, an esterase, an expansin, a laccase, a ligninolytic enzyme, a pectinase, a peroxidase, a protease, and a swollenin. In another aspect, the cellulase is preferably one or more (e.g., several) enzymes selected from the group consisting of an endoglucanase, a cellobiohydrolase, and a beta-glucosidase. In another aspect, the hemicellulase is preferably one or more (e.g., several) enzymes selected from the group consisting of an acetylmannan esterase, an acetylxylan esterase, an arabinanase, an arabinofuranosidase, a coumaric acid esterase, a feruloyl esterase, a galactosidase, a glucuronidase, a glucuronoyl esterase, a mannanase, a mannosidase, a xylanase, and a xylosidase.
[0113] In another embodiment, the enzyme composition comprises one or more (e.g., several) cellulolytic enzymes. In another aspect, the enzyme composition comprises or further comprises one or more (e.g., several) hemicellulolytic enzymes. In another aspect, the enzyme composition comprises one or more (e.g., several) cellulolytic enzymes and one or more (e.g., several) hemicellulolytic enzymes. In another aspect, the enzyme composition comprises one or more (e.g., several) enzymes selected from the group of cellulolytic enzymes and hemicellulolytic enzymes. In another aspect, the enzyme composition comprises an endoglucanase. In another aspect, the enzyme composition comprises a cellobiohydrolase. In another aspect, the enzyme composition comprises a beta-glucosidase. In another aspect, the enzyme composition comprises a polypeptide having cellulolytic enhancing activity. In another aspect, the enzyme composition comprises an endoglucanase and a polypeptide having cellulolytic enhancing activity. In another aspect, the enzyme composition comprises a cellobiohydrolase and a polypeptide having cellulolytic enhancing activity. In another aspect, the enzyme composition comprises a beta-glucosidase and a polypeptide having cellulolytic enhancing activity. In another aspect, the enzyme composition comprises an endoglucanase and a cellobiohydrolase. In another aspect, the enzyme composition comprises an endoglucanase and a beta-glucosidase. In another aspect, the enzyme composition comprises a cellobiohydrolase and a beta-glucosidase. In another aspect, the enzyme composition comprises an endoglucanase, a cellobiohydrolase, and a polypeptide having cellulolytic enhancing activity. In another aspect, the enzyme composition comprises an endoglucanase, a beta-glucosidase, and a polypeptide having cellulolytic enhancing activity. In another aspect, the enzyme composition comprises a cellobiohydrolase, a beta-glucosidase, and a polypeptide having cellulolytic enhancing activity. In another aspect, the enzyme composition comprises an endoglucanase, a cellobiohydrolase, and a beta-glucosidase. In another aspect, the enzyme composition comprises an endoglucanase, a cellobiohydrolase, a beta-glucosidase, and a polypeptide having cellulolytic enhancing activity.
[0114] In another embodiment, the enzyme composition comprises an acetylmannan esterase. In another aspect, the enzyme composition comprises an acetylxylan esterase. In another aspect, the enzyme composition comprises an arabinanase (e.g., alpha-L-arabinanase). In another aspect, the enzyme composition comprises an arabinofuranosidase (e.g., alpha-L-arabinofuranosidase). In another aspect, the enzyme composition comprises a coumaric acid esterase. In another aspect, the enzyme composition comprises a feruloyl esterase. In another aspect, the enzyme composition comprises a galactosidase (e.g., alpha-galactosidase and/or beta-galactosidase). In another aspect, the enzyme composition comprises a glucuronidase (e.g., alpha-D-glucuronidase). In another aspect, the enzyme composition comprises a glucuronoyl esterase. In another aspect, the enzyme composition comprises a mannanase. In another aspect, the enzyme composition comprises a mannosidase (e.g., beta-mannosidase). In another aspect, the enzyme composition comprises a xylanase. In a preferred aspect, the xylanase is a Family 10 xylanase. In another aspect, the enzyme composition comprises a xylosidase (e.g., beta-xylosidase).
[0115] In another embodiment, the enzyme composition comprises an esterase. In another aspect, the enzyme composition comprises an expansin. In another aspect, the enzyme composition comprises a laccase. In another aspect, the enzyme composition comprises a ligninolytic enzyme. In a preferred aspect, the ligninolytic enzyme is a manganese peroxidase. In another preferred aspect, the ligninolytic enzyme is a lignin peroxidase. In another preferred aspect, the ligninolytic enzyme is a H.sub.2O.sub.2-producing enzyme. In another aspect, the enzyme composition comprises a pectinase. In another aspect, the enzyme composition comprises a peroxidase. In another aspect, the enzyme composition comprises a protease. In another aspect, the enzyme composition comprises a swollenin.
[0116] In the methods of the present invention, the enzyme(s) can be added prior to or during saccharification, saccharification and fermentation, or fermentation. The enzymes having cellulolytic and/or hemicellulolytic activity and a polypeptide having catalase activity can be added simultaneously or sequentially.
[0117] One or more (e.g., several) components of the enzyme composition may be wild-type proteins, recombinant proteins, or a combination of wild-type proteins and recombinant proteins. For example, one or more (e.g., several) components may be native proteins of a cell, which is used as a host cell to express recombinantly one or more (e.g., several) other components of the enzyme composition. One or more (e.g., several) components of the enzyme composition may be produced as monocomponents, which are then combined to form the enzyme composition. The enzyme composition may be a combination of multicomponent and monocomponent protein preparations.
[0118] The enzymes used in the methods of the present invention may be in any form suitable for use, such as, for example, a fermentation broth formulation or a cell composition, a cell lysate with or without cellular debris, a semi-purified or purified enzyme preparation, or a host cell as a source of the enzymes. The enzyme composition may be a dry powder or granulate, a non-dusting granulate, a liquid, a stabilized liquid, or a stabilized protected enzyme. Liquid enzyme preparations may, for instance, be stabilized by adding stabilizers such as a sugar, a sugar alcohol or another polyol, and/or lactic acid or another organic acid according to established processes.
[0119] The optimum amounts of the enzymes and polypeptide(s) having catalase activity depend on several factors including, but not limited to, the mixture of component cellulolytic enzymes, the cellulosic material, the concentration of cellulosic material, the pretreatment(s) of the cellulosic material, temperature, time, pH, and inclusion of fermenting organism (e.g., yeast for Simultaneous Saccharification and Fermentation).
[0120] In a preferred aspect, an effective amount of cellulolytic or hemicellulolytic enzyme to the cellulosic material is about 0.5 to about 50 mg, preferably about 0.5 to about 40 mg, more preferably about 0.5 to about 25 mg, more preferably about 0.75 to about 20 mg, more preferably about 0.75 to about 15 mg, even more preferably about 0.5 to about 10 mg, and most preferably about 1.0 to about 10 mg per g of the cellulosic material.
[0121] In another preferred aspect, an effective amount of polypeptide(s) having catalase activity to the cellulosic material is about 0.001 to about 100.0 mg, preferably about 0.01 to about 50 mg, more preferably about 0.01 to about 40 mg, more preferably about 0.01 to about 30 mg, more preferably about 0.01 to about 20 mg, more preferably about 0.01 to about 10 mg, more preferably about 0.025 to about 8 mg, more preferably bout 0.05 to about 6 mg, more preferably about 0.075 to about 5 mg, more preferably about 0.1 to about 4 mg, even more preferably about 0.15 to about 3 mg, and most preferably about 0.25 to about 1.0 mg per g of the cellulosic material.
[0122] In another preferred aspect, an effective amount of polypeptide(s) having catalase activity to cellulolytic or hemicellulolytic enzyme is about 0.005 to about 1.0 g, preferably about 0.01 to about 1.0 g, more preferably about 0.15 to about 0.75 g, more preferably about 0.15 to about 0.5 g, more preferably about 0.1 to about 0.5 g, even more preferably about 0.1 to about 0.5 g, and most preferably about 0.05 to about 0.2 g per g of cellulolytic or hemicellulolytic enzyme.
[0123] In another aspect, an effective amount of a GH61 polypeptide having cellulolytic enhancing activity to cellulosic material is about 0.01 to about 50.0 mg, preferably about 0.01 to about 40 mg, more preferably about 0.01 to about 30 mg, more preferably about 0.01 to about 20 mg, more preferably about 0.01 to about 10 mg, more preferably about 0.01 to about 5 mg, more preferably about 0.025 to about 1.5 mg, more preferably about 0.05 to about 1.25 mg, more preferably about 0.075 to about 1.25 mg, more preferably about 0.1 to about 1.25 mg, even more preferably about 0.15 to about 1.25 mg, and most preferably about 0.25 to about 1.0 mg per g of cellulosic material.
[0124] In another aspect, an effective amount of a GH61 polypeptide having cellulolytic enhancing activity to cellulolytic enzyme protein is about 0.005 to about 1.0 g, preferably about 0.01 to about 1.0 g, more preferably about 0.15 to about 0.75 g, more preferably about 0.15 to about 0.5 g, more preferably about 0.1 to about 0.5 g, even more preferably about 0.1 to about 0.5 g, and most preferably about 0.05 to about 0.2 g per g of cellulolytic enzyme protein.
[0125] The polypeptides having cellulolytic enzyme activity or hemicellulolytic enzyme activity as well as other proteins/polypeptides useful in the degradation of the cellulosic material, e.g., polypeptides having cellulolytic enhancing activity (collectively hereinafter "polypeptides having enzyme activity") can be derived or obtained from any suitable origin, including, bacterial, fungal, yeast, plant, or mammalian origin. The term "obtained" also means herein that the enzyme may have been produced recombinantly in a host organism employing methods described herein, wherein the recombinantly produced enzyme is either native or foreign to the host organism or has a modified amino acid sequence, e.g., having one or more (e.g., several) amino acids that are deleted, inserted and/or substituted, i.e., a recombinantly produced enzyme that is a mutant and/or a fragment of a native amino acid sequence or an enzyme produced by nucleic acid shuffling processes known in the art. Encompassed within the meaning of a native enzyme are natural variants and within the meaning of a foreign enzyme are variants obtained recombinantly, such as by site-directed mutagenesis or shuffling.
[0126] A polypeptide having enzyme activity may be a bacterial polypeptide. For example, the polypeptide may be a gram positive bacterial polypeptide such as a Bacillus, Streptococcus, Streptomyces, Staphylococcus, Enterococcus, Lactobacillus, Lactococcus, Clostridium, Geobacillus, Caldicellulosiruptor, Acidothermus, Thermobifidia, or Oceanobacillus polypeptide having enzyme activity, or a Gram negative bacterial polypeptide such as an E. coli, Pseudomonas, Salmonella, Campylobacter, Helicobacter, Flavobacterium, Fusobacterium, Ilyobacter, Neisseria, or Ureaplasma polypeptide having enzyme activity.
[0127] In one aspect, the polypeptide is a Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis polypeptide having enzyme activity.
[0128] In another aspect, the polypeptide is a Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, or Streptococcus equi subsp. Zooepidemicus polypeptide having enzyme activity.
[0129] In another aspect, the polypeptide is a Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, or Streptomyces lividans polypeptide having enzyme activity.
[0130] The polypeptide having enzyme activity may also be a fungal polypeptide, and more preferably a yeast polypeptide such as a Candida, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia polypeptide having enzyme activity; or more preferably a filamentous fungal polypeptide such as an Acremonium, Agaricus, Alternaria, Aspergillus, Aureobasidium, Botryospaeria, Ceriporiopsis, Chaetomidium, Chrysosporium, Claviceps, Cochliobolus, Coprinopsis, Coptotermes, Corynascus, Cryphonectria, Cryptococcus, Diplodia, Exidia, Filibasidium, Fusarium, Gibberella, Holomastigotoides, Humicola, Irpex, Lentinula, Leptospaeria, Magnaporthe, Melanocarpus, Meripilus, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Piromyces, Poitrasia, Pseudoplectania, Pseudotrichonympha, Rhizomucor, Schizophyllum, Scytalidium, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trichoderma, Trichophaea, Verticillium, Volvariella, or Xylaria polypeptide having enzyme activity.
[0131] In one aspect, the polypeptide is a Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, or Saccharomyces oviformis polypeptide having enzyme activity.
[0132] In one aspect, the polypeptide is an Acremonium cellulolyticus, Aspergillus aculeatus, Aspergillus awamori, Aspergillus fumigatus, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium tropicum, Chrysosporium merdarium, Chrysosporium inops, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium zonatum, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola grisea, Humicola insolens, Humicola lanuginosa, Irpex lacteus, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium funiculosum, Penicillium purpurogenum, Phanerochaete chrysosporium, Thielavia achromatica, Thielavia albomyces, Thielavia albopilosa, Thielavia australeinsis, Thielavia fimeti, Thielavia microspora, Thielavia ovispora, Thielavia peruviana, Thielavia spededonium, Thielavia setosa, Thielavia subthermophila, Thielavia terrestris, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, Trichoderma viride, or Trichophaea saccata polypeptide having enzyme activity.
[0133] Chemically modified or protein engineered mutants of polypeptides having enzyme activity may also be used.
[0134] One or more (e.g., several) components of the enzyme composition may be a recombinant component, i.e., produced by cloning of a DNA sequence encoding the single component and subsequent cell transformed with the DNA sequence and expressed in a host (see, for example, WO 91/17243 and WO 91/17244). The host is preferably a heterologous host (enzyme is foreign to host), but the host may under certain conditions also be a homologous host (enzyme is native to host). Monocomponent cellulolytic proteins may also be prepared by purifying such a protein from a fermentation broth.
[0135] In one aspect, the one or more (e.g., several) cellulolytic enzymes comprise a commercial cellulolytic enzyme preparation. Examples of commercial cellulolytic enzyme preparations suitable for use in the present invention include, for example, CELLIC.RTM. CTec Ctec3 (Novozymes A/S), CELLIC.RTM. CTec CTec2 (Novozymes A/S), CELLIC.RTM. CTec (Novozymes A/S), CELLUCLAST.TM. (Novozymes A/S), NOVOZYM.TM. 188 (Novozymes A/S), CELLUZYME.TM. (Novozymes A/S), CEREFLO.TM. (Novozymes A/S), and ULTRAFLO.TM. (Novozymes A/S), ACCELERASE.TM. (Genencor Int.), LAMINEX.TM. (Genencor Int.), SPEZYME.TM. CP (Genencor Int.), FILTRASE.RTM. NL (DSM); METHAPLUS.RTM. S/L 100 (DSM), ROHAMENT.TM. 7069 W (Roohm GmbH), FIBREZYME.RTM. LDI (Dyadic International, Inc.), FIBREZYME.RTM. LBR (Dyadic International, Inc.), or VISCOSTAR.RTM. 150L (Dyadic International, Inc.). The cellulase enzymes are added in amounts effective from about 0.001 to about 5.0 wt % of solids, e.g., about 0.025 to about 4.0 wt % of solids or about 0.005 to about 2.0 wt % of solids.
[0136] Examples of bacterial endoglucanases that can be used in the method of the present invention, include, but are not limited to, an Acidothermus cellulolyticus endoglucanase (WO 91/05039; WO 93/15186; U.S. Pat. No. 5,275,944; WO 96/02551; U.S. Pat. No. 5,536,655, WO 00/70031, WO 05/093050); Thermobifida fusca endoglucanase III (WO 05/093050); and Thermobifida fusca endoglucanase V (WO 05/093050).
[0137] Examples of fungal endoglucanases that can be used in the present invention, include, but are not limited to, a Trichoderma reesei endoglucanase I (Penttila et al., 1986, Gene 45: 253-263; Trichoderma reesei Cel7B endoglucanase I (GENBANK.TM. accession no. M15665); Trichoderma reesei endoglucanase II (Saloheimo, et al., 1988, Gene 63:11-22; Trichoderma reesei Cel5A endoglucanase II (GENBANK.TM. accession no. M19373); Trichoderma reesei endoglucanase III (Okada et al., 1988, Appl. Environ. Microbiol. 64: 555-563; GENBANK.TM. accession no. AB003694); Trichoderma reesei endoglucanase V (Saloheimo et al., 1994, Molecular Microbiology 13: 219-228; GENBANK.TM. accession no. Z33381); Aspergillus aculeatus endoglucanase (Ooi et al., 1990, Nucleic Acids Research 18: 5884); Aspergillus kawachii endoglucanase (Sakamoto et al., 1995, Current Genetics 27: 435-439); Erwinia carotovara endoglucanase (Saarilahti et al., 1990, Gene 90: 9-14); Fusarium oxysporum endoglucanase (GENBANK.TM. accession no. L29381); Humicola grisea var. thermoidea endoglucanase (GENBANK.TM. accession no. AB003107); Melanocarpus albomyces endoglucanase (GENBANK.TM. accession no. MAL515703); Neurospora crassa endoglucanase (GENBANK.TM. accession no. XM_324477); Humicola insolens endoglucanase V; Myceliophthora thermophila CBS 117.65 endoglucanase; basidiomycete CBS 495.95 endoglucanase; basidiomycete CBS 494.95 endoglucanase; Thielavia terrestris NRRL 8126 CEL6B endoglucanase; Thielavia terrestris NRRL 8126 CEL6C endoglucanase; Thielavia terrestris NRRL 8126 CEL7C endoglucanase; Thielavia terrestris NRRL 8126 CEL7E endoglucanase; Thielavia terrestris NRRL 8126 CEL7F endoglucanase; Cladorrhinum foecundissimum ATCC 62373 CEL7A endoglucanase; and Trichoderma reesei strain No. VTT-D-80133 endoglucanase (GENBANK.TM. accession no. M15665).
[0138] Examples of cellobiohydrolases useful in the present invention include, but are not limited to, Trichoderma reesei cellobiohydrolase I; Trichoderma reesei cellobiohydrolase II; Humicola insolens cellobiohydrolase I; Myceliophthora thermophila cellobiohydrolase II; Thielavia terrestris cellobiohydrolase II (CEL6A); Chaetomium thermophilum cellobiohydrolase I; and Chaetomium thermophilum cellobiohydrolase II.
[0139] Examples of beta-glucosidases useful in the present invention include, but are not limited to, Aspergillus oryzae beta-glucosidase; Aspergillus fumigatus beta-glucosidase; Penicillium brasilianum IBT 20888 beta-glucosidase; Aspergillus niger beta-glucosidase; and Aspergillus aculeatus beta-glucosidase.
[0140] The Aspergillus oryzae beta-glucosidase can be obtained according to WO 2002/095014. The Aspergillus fumigatus beta-glucosidase can be obtained according to WO 2005/047499. The Penicillium brasilianum beta-glucosidase can be obtained according to WO 2007/019442. The Aspergillus niger beta-glucosidase can be obtained according to Dan et al., 2000, J. Biol. Chem. 275: 4973-4980. The Aspergillus aculeatus beta-glucosidase can be obtained according to Kawaguchi et al., 1996, Gene 173: 287-288.
[0141] The beta-glucosidase may be a fusion protein. In one aspect, the beta-glucosidase is the Aspergillus oryzae beta-glucosidase variant BG fusion protein or the Aspergillus oryzae beta-glucosidase fusion protein obtained according to WO 2008/057637.
[0142] Other useful endoglucanases, cellobiohydrolases, and beta-glucosidases are disclosed in numerous Glycosyl Hydrolase families using the classification according to Henrissat B., 1991, A classification of glycosyl hydrolases based on amino-acid sequence similarities, Biochem. J. 280: 309-316, and Henrissat B., and Bairoch A., 1996, Updating the sequence-based classification of glycosyl hydrolases, Biochem. J. 316: 695-696.
[0143] Other cellulolytic enzymes that may be used in the present invention are described in EP 495,257, EP 531,315, EP 531,372, WO 89/09259, WO 94/07998, WO 95/24471, WO 96/11262, WO 96/29397, WO 96/034108, WO 97/14804, WO 98/08940, WO 98/012307, WO 98/13465, WO 98/015619, WO 98/015633, WO 98/028411, WO 99/06574, WO 99/10481, WO 99/025846, WO 99/025847, WO 99/031255, WO 2000/009707, WO 2002/050245, WO 2002/0076792, WO 2002/101078, WO 2003/027306, WO 2003/052054, WO 2003/052055, WO 2003/052056, WO 2003/052057, WO 2003/052118, WO 2004/016760, WO 2004/043980, WO 2004/048592, WO 2005/001065, WO 2005/028636, WO 2005/093050, WO 2005/093073, WO 2006/074005, WO 2006/117432, WO 2007/071818, WO 2007/071820, WO 2008/008070, WO 2008/008793, U.S. Pat. Nos. 4,435,307, 5,457,046, 5,648,263, 5,686,593, 5,691,178, 5,763,254, and 5,776,757.
[0144] In the methods of the present invention, any GH61 polypeptide having cellulolytic enhancing activity can be used.
[0145] In a first aspect, the GH61 polypeptide having cellulolytic enhancing activity comprises the following motifs:
[ILMV]-P-X(4,5)-G-X-Y-[ILMV]-X-R-X-[EQ]-X(4)-[HNQ] and [FW]-[TF]-K-[AIV],
[0146] wherein X is any amino acid, X(4,5) is any amino acid at 4 or 5 contiguous positions, and X(4) is any amino acid at 4 contiguous positions.
[0147] The polypeptide comprising the above-noted motifs may further comprise:
H-X(1,2)-G-P-X(3)-[YW]-[AILMV],
[EQ]-X-Y-X(2)-C-X-[EHQN]-[FILV]-X-[ILV], or
H-X(1,2)-G- P-X(3)-[YW]-[AILMV] and [EQ]-X-Y-X(2)-C-X-[EHQN]-[FILV]-X-[ILV],
[0148] wherein X is any amino acid, X(1,2) is any amino acid at 1 position or 2 contiguous positions, X(3) is any amino acid at 3 contiguous positions, and X(2) is any amino acid at 2 contiguous positions. In the above motifs, the accepted IUPAC single letter amino acid abbreviation is employed.
[0149] In a preferred aspect, the GH61 polypeptide having cellulolytic enhancing activity further comprises H-X(1,2)-G-P-X(3)-[YW]-[AILMV]. In another preferred aspect, the GH61 polypeptide having cellulolytic enhancing activity further comprises [EQ]-X-Y-X(2)-C-X-[EHQN]-[FILV]-X-[ILV]. In another preferred aspect, the GH61 polypeptide having cellulolytic enhancing activity further comprises H-X(1,2)-G-P-X(3)-[YW]-[AILMV] and [EQ]-X-Y-X(2)-C-X-[EHQN]-[FILV]-X-[ILV].
[0150] In a second aspect, the GH61 polypeptide having cellulolytic enhancing activity comprises the following motif:
[ILMV]-P-x(4,5)-G-x-Y-[ILMV]-x-R-x-[EQ]-x(3)-A-[HNQ],
[0151] wherein x is any amino acid, x(4,5) is any amino acid at 4 or 5 contiguous positions, and x(3) is any amino acid at 3 contiguous positions. In the above motif, the accepted IUPAC single letter amino acid abbreviation is employed.
[0152] Examples of GH61 polypeptides having cellulolytic enhancing activity useful in the methods of the present invention include, but are not limited to, polypeptides having cellulolytic enhancing activity from Thielavia terrestris (WO 2005/074647, WO/2008/148131 and WO 2011/035027); polypeptides having cellulolytic enhancing activity from Thermoascus aurantiacus (WO 2005/074656 and WO 2010/065830); polypeptides having cellulolytic enhancing activity from Trichoderma reesei (WO 2007/089290); and polypeptides having cellulolytic enhancing activity from Myceliophthora thermophila (WO 2009/085935; WO 2009/085859; WO 2009/085864; and WO 2009/085868); polypeptides having cellulolytic enhancing activity from Aspergillus fumigatus (WO 2010/138754); and polypeptides having cellulolytic enhancing activity from Penicillium pinophilum (WO 2011/005867), Thermoascus sp. (WO 2011/039319), Penicillium sp. (WO 2011/041397), and Thermoascus crustaceous (WO 2011/041504).
[0153] In one aspect, the GH61 polypeptide having cellulolytic enhancing activity is used in the presence of a soluble activating divalent metal cation according to WO 2008/151043, e.g., manganese sulfate.
[0154] In one aspect, the GH61 polypeptide having cellulolytic enhancing activity is used in the presence of a dioxy compound, a bicylic compound, a heterocyclic compound, a nitrogen-containing compound, a quinone compound, a sulfur-containing compound, or a liquor obtained from a pretreated cellulosic material such as pretreated corn stover (PCS).
[0155] The dioxy compound may include any suitable compound containing two or more oxygen atoms. In some aspects, the dioxy compounds contain a substituted aryl moiety as described herein. The dioxy compounds may comprise one or more (e.g., several) hydroxyl and/or hydroxyl derivatives, but also include substituted aryl moieties lacking hydroxyl and hydroxyl derivatives. Non-limiting examples of dioxy compounds include pyrocatechol or catechol; caffeic acid; 3,4-dihydroxybenzoic acid; 4-tert-butyl-5-methoxy-1,2-benzenediol; pyrogallol; gallic acid; methyl-3,4,5-trihydroxybenzoate; 2,3,4-trihydroxybenzophenone; 2,6-dimethoxyphenol; sinapinic acid; 3,5-dihydroxybenzoic acid; 4-chloro-1,2-benzenediol; 4-nitro-1,2-benzenediol; tannic acid; ethyl gallate; methyl glycolate; dihydroxyfumaric acid; 2-butyne-1,4-diol; (croconic acid; 1,3-propanediol; tartaric acid; 2,4-pentanediol; 3-ethyoxy-1,2-propanediol; 2,4,4'-trihydroxybenzophenone; cis-2-butene-1,4-diol; 3,4-dihydroxy-3-cyclobutene-1,2-dione; dihydroxyacetone; acrolein acetal; methyl-4-hydroxybenzoate; 4-hydroxybenzoic acid; and methyl-3,5-dimethoxy-4-hydroxybenzoate; or a salt or solvate thereof.
[0156] The bicyclic compound may include any suitable substituted fused ring system as described herein. The compounds may comprise one or more (e.g., several) additional rings, and are not limited to a specific number of rings unless otherwise stated. In one aspect, the bicyclic compound is a flavonoid. In another aspect, the bicyclic compound is an optionally subsituted isoflavonoid. In another aspect, the bicyclic compound is an optionally substituted flavylium ion, such as an optionally substituted anthocyanidin or optionally substituted anthocyanin, or derivative thereof. Non-limiting examples of bicyclic compounds include epicatechin; quercetin; myricetin; taxifolin; kaempferol; morin; acacetin; naringenin; isorhamnetin; apigenin; cyanidin; cyanin; kuromanin; keracyanin; or a salt or solvate thereof.
[0157] The heterocyclic compound may be any suitable compound, such as an optionally substituted aromatic or non-aromatic ring comprising a heteroatom, as described herein. In one aspect, the heterocyclic is a compound comprising an optionally substituted heterocycloalkyl moiety or an optionally substituted heteroaryl moiety. In another aspect, the optionally substituted heterocycloalkyl moiety or optionally substituted heteroaryl moiety is an optionally substituted 5-membered heterocycloalkyl or an optionally substituted 5-membered heteroaryl moiety. In another aspect, the optionally substituted heterocycloalkyl or optionally substituted heteroaryl moiety is an optionally substituted moiety selected from pyrazolyl, furanyl, imidazolyl, isoxazolyl, oxadiazolyl, oxazolyl, pyrrolyl, pyridyl, pyrimidyl, pyridazinyl, thiazolyl, triazolyl, thienyl, dihydrothieno-pyrazolyl, thianaphthenyl, carbazolyl, benzimidazolyl, benzothienyl, benzofuranyl, indolyl, quinolinyl, benzotriazolyl, benzothiazolyl, benzooxazolyl, benzimidazolyl, isoquinolinyl, isoindolyl, acridinyl, benzoisazolyl, dimethylhydantoin, pyrazinyl, tetrahydrofuranyl, pyrrolinyl, pyrrolidinyl, morpholinyl, indolyl, diazepinyl, azepinyl, thiepinyl, piperidinyl, and oxepinyl. In another aspect, the optionally substituted heterocycloalkyl moiety or optionally substituted heteroaryl moiety is an optionally substituted furanyl. Non-limiting examples of heterocyclic compounds include (1,2-dihydroxyethyl)-3,4-dihydroxyfuran-2(5H)-one; 4-hydroxy-5-methyl-3-furanone; 5-hydroxy-2(5H)-furanone; [1,2-dihydroxyethyl]furan-2,3,4(5H)-trione; .alpha.-hydroxy-y-butyrolactone; ribonic .gamma.-lactone; aldohexuronicaldohexuronic acid .gamma.-lactone; gluconic acid .delta.-lactone; 4-hydroxycoumarin; dihydrobenzofuran; 5-(hydroxymethyl)furfural; furoin; 2(5H)-furanone; 5,6-dihydro-2H-pyran-2-one; and 5,6-dihydro-4-hydroxy-6-methyl-2H-pyran-2-one; or a salt or solvate thereof.
[0158] The nitrogen-containing compound may be any suitable compound with one or more nitrogen atoms. In one aspect, the nitrogen-containing compound comprises an amine, imine, hydroxylamine, or nitroxide moiety. Non-limiting examples of nitrogen-containing compounds include acetone oxime; violuric acid; pyridine-2-aldoxime; 2-aminophenol; 1,2-benzenediamine; 2,2,6,6-tetramethyl-1-piperidinyloxy; 5,6,7,8-tetrahydrobiopterin; 6,7-dimethyl-5,6,7,8-tetrahydropterine; and maleamic acid; or a salt or solvate thereof.
[0159] The quinone compound may be any suitable compound comprising a quinone moiety as described herein. Non-limiting examples of quinone compounds include 1,4-benzoquinone; 1,4-naphthoquinone; 2-hydroxy-1,4-naphthoquinone; 2,3-dimethoxy-5-methyl-1,4-benzoquinone or coenzyme Q.sub.0; 2,3,5,6-tetramethyl-1,4-benzoquinone or duroquinone; 1,4-dihydroxyanthraquinone; 3-hydroxy-1-methyl-5,6-indolinedione or adrenochrome; 4-tert-butyl-5-methoxy-1,2-benzoquinone; pyrroloquinoline quinone; or a salt or solvate thereof.
[0160] The sulfur-containing compound may be any suitable compound comprising one or more sulfur atoms. In one aspect, the sulfur-containing comprises a moiety selected from thionyl, thioether, sulfinyl, sulfonyl, sulfamide, sulfonamide, sulfonic acid, and sulfonic ester. Non-limiting examples of sulfur-containing compounds include ethanethiol; 2-propanethiol; 2-propene-1-thiol; 2-mercaptoethanesulfonic acid; benzenethiol; benzene-1,2-dithiol; cysteine; methionine; glutathione; cystine; or a salt or solvate thereof.
[0161] In one aspect, an effective amount of such a compound described above to cellulosic material as a molar ratio to glucosyl units of cellulose is about 10.sup.-6 to about 10, e.g., about 10.sup.-6 to about 7.5, about 10.sup.-6 to about 5, about 10.sup.-6 to about 2.5, about 10.sup.-6 to about 1, about 10.sup.-5 to about 1, about 10.sup.-5 to about 10.sup.-1, about 10.sup.-4 to about 10.sup.-1, about 10.sup.-3 to about 10.sup.-1, and about 10.sup.-3 to about 10.sup.-2. In another aspect, an effective amount of such a compound described above is about 0.1 .mu.M to about 1 M, e.g., about 0.5 .mu.M to about 0.75 M, about 0.75 .mu.M to about 0.5 M, about 1 .mu.M to about 0.25 M, about 1 .mu.M to about 0.1 M, about 5 .mu.M to about 50 mM, about 10 .mu.M to about 25 mM, about 50 .mu.M to about 25 mM, about 10 .mu.M to about 10 mM, about 5 .mu.M to about 5 mM, and about 0.1 mM to about 1 mM.
[0162] The term "liquor" means the solution phase, either aqueous, organic, or a combination thereof, arising from treatment of a lignocellulose and/or hemicellulose material in a slurry, or monosaccharides thereof, e.g., xylose, arabinose, mannose, etc., under conditions as described herein, and the soluble contents thereof. A liquor for cellulolytic enhancement of a GH61 polypeptide can be produced by treating a lignocellulose or hemicellulose material (or feedstock) by applying heat and/or pressure, optionally in the presence of a catalyst, e.g., acid, optionally in the presence of an organic solvent, and optionally in combination with physical disruption of the material, and then separating the solution from the residual solids. Such conditions determine the degree of cellulolytic enhancement obtainable through the combination of liquor and a GH61 polypeptide during hydrolysis of a cellulosic substrate by a cellulase preparation. The liquor can be separated from the treated material using methods standard in the art, such as filtration, sedimentation, or centifugation.
[0163] In one aspect, an effective amount of the liquor to cellulose is about 10.sup.-6 to about 10 g per g of cellulose, e.g., about 10.sup.-6 to about 7.5 g, about 10.sup.-6 to about 5, about 10.sup.-6 to about 2.5 g, about 10.sup.-6 to about 1 g, about 10.sup.-5 to about 1 g, about 10.sup.-5 to about 10.sup.-1 g, about 10.sup.-4 to about 10.sup.-1 g, about 10.sup.-3 to about 10.sup.-1 g, and about 10.sup.-3 to about 10.sup.-2 g per g of cellulose.
[0164] In one embodiment, the one or more (e.g., several) hemicellulolytic enzymes comprise a commercial hemicellulolytic enzyme preparation. Examples of commercial hemicellulolytic enzyme preparations suitable for use in the present invention include, for example, SHEARZYME.TM. (Novozymes A/S), CELLIC.RTM. HTec (Novozymes A/S), CELLIC.RTM. HTec2 (Novozymes A/S), VISCOZYME.RTM. (Novozymes A/S), ULTRAFLO.RTM. (Novozymes A/S), PULPZYME.RTM. HC (Novozymes A/S), MULTIFECT.RTM. Xylanase (Genencor), ACCELLERASE.RTM. XY (Genencor), ACCELLERASE.RTM. XC (Genencor), ECOPULP.RTM. TX-200A (AB Enzymes), HSP 6000 Xylanase (DSM), DEPOL.TM. 333P (Biocatalysts Limit, Wales, UK), DEPOL.TM. 740L. (Biocatalysts Limit, Wales, UK), and DEPOL.TM. 762P (Biocatalysts Limit, Wales, UK).
[0165] Examples of xylanases useful in the methods of the present invention include, but are not limited to, Aspergillus aculeatus xylanase (GeneSeqP: AAR63790; WO 94/21785); Aspergillus fumigatus xylanases (WO 2006/078256); Penicillium pinophilum (WO 2011/041405); Penicillium sp. (WO 2010/126772); Thielavia terrestris NRRL 8126 (WO 2009/079210); and Trichophaea saccata GH10 (WO 2011/057083).
[0166] Examples of beta-xylosidases useful in the methods of the present invention include, but are not limited to, Trichoderma reesei beta-xylosidase (UniProtKB/TrEMBL accession number Q92458); Talaromyces emersonii (SwissProt accession number Q8X212); and Neurospora crassa (SwissProt accession number Q7SOW4).
[0167] Examples of acetylxylan esterases useful in the methods of the present invention include, but are not limited to, acetylxylan esterases from Aspergillus aculeatus (WO 2010/108918); Chaetomium globosum (Uniprot accession number Q2GWX4); Chaetomium gracile (GeneSeqP accession number AAB82124); Humicola insolens DSM 1800 (WO 2009/073709); Hypocrea jecorina (WO 2005/001036); Myceliophthora thermophila (WO 2010/014880); Neurospora crassa
[0168] (UniProt accession number q7s259); Phaeosphaeria nodorum (Uniprot accession number Q0UHJ1); and Thielavia terrestris NRRL 8126 (WO 2009/042846).
[0169] Examples of ferulic acid esterases useful in the methods of the present invention include, but are not limited to, Humicola insolens DSM 1800 feruloyl esterase (WO 2009/076122), Neurospora crassa feruloyl esterase (UniProt accession number Q9HGR3), and Neosartorya fischeri feruloyl esterase (UniProt Accession number A1D9T4).
[0170] Examples of arabinofuranosidases useful in the methods of the present invention include, but are not limited to, arabinofuranosidases from Aspergillus niger (GeneSeqP accession number AAR94170); Humicola insolens DSM 1800 (WO 2006/114094 and WO 2009/073383); and M. giganteus (WO 2006/114094).
[0171] Examples of alpha-glucuronidases useful in the methods of the present invention include, but are not limited to, alpha-glucuronidases from Aspergillus clavatus (UniProt accession number alcc12); Aspergillus fumigatus (SwissProt accession number Q4WW45); Aspergillus niger (Uniprot accession number Q96WX9); Aspergillus terreus (SwissProt accession number Q0CJP9); Humicola insolens (WO 2010/014706); Penicillium aurantiogriseum (WO 2009/068565); Talaromyces emersonii (UniProt accession number Q8X211); and Trichoderma reesei (Uniprot accession number Q99024).
[0172] The polypeptides having enzyme activity used in the methods of the present invention may be produced by fermentation of the above-noted microbial strains on a nutrient medium containing suitable carbon and nitrogen sources and inorganic salts, using procedures known in the art (see, e.g., Bennett, J. W. and LaSure, L. (eds.), More Gene Manipulations in Fungi, Academic Press, CA, 1991). Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). Temperature ranges and other conditions suitable for growth and enzyme production are known in the art (see, e.g., Bailey, J. E., and Ollis, D. F., Biochemical Engineering Fundamentals, McGraw-Hill Book Company, NY, 1986).
[0173] The fermentation can be any method of cultivation of a cell resulting in the expression or isolation of an enzyme or protein. Fermentation may, therefore, be understood as comprising shake flask cultivation, or small- or large-scale fermentation (including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors performed in a suitable medium and under conditions allowing the enzyme to be expressed or isolated. The resulting enzymes produced by the methods described above may be recovered from the fermentation medium and purified by conventional procedures.
[0174] The compositions may be a fermentation broth formulation or a cell composition, as described herein. In some embodiments, the composition is a cell-killed whole broth containing organic acid(s), killed cells and/or cell debris, and culture medium.
[0175] In one aspect, the present invention relates to a whole broth formulation or cell culture composition comprising one or more (e.g., several) enzymes having cellulolytic and/or hemicellulolytic activity and a polypeptide having catalase activity.
[0176] The term "fermentation broth" as used herein refers to a preparation produced by cellular fermentation that undergoes no or minimal recovery and/or purification. For example, fermentation broths are produced when microbial cultures are grown to saturation, incubated under carbon-limiting conditions to allow protein synthesis (e.g., expression of enzymes by host cells) and secretion into cell culture medium. The fermentation broth can contain unfractionated or fractionated contents of the fermentation materials derived at the end of the fermentation. Typically, the fermentation broth is unfractionated and comprises the spent culture medium and cell debris present after the microbial cells (e.g., filamentous fungal cells) are removed, e.g., by centrifugation. In some embodiments, the fermentation broth contains spent cell culture medium, extracellular enzymes, and viable and/or nonviable microbial cells.
[0177] In an embodiment, the fermentation broth formulation and cell compositions comprise a first organic acid component comprising at least one 1-5 carbon organic acid and/or a salt thereof and a second organic acid component comprising at least one 6 or more carbon organic acid and/or a salt thereof. In a specific embodiment, the first organic acid component is acetic acid, formic acid, propionic acid, a salt thereof, or a mixture of two or more of the foregoing and the second organic acid component is benzoic acid, cyclohexanecarboxylic acid, 4-methylvaleric acid, phenylacetic acid, a salt thereof, or a mixture of two or more of the foregoing.
[0178] In one aspect, the composition contains an organic acid(s), and optionally further contains killed cells and/or cell debris. In one embodiment, the killed cells and/or cell debris are removed from a cell-killed whole broth to provide a composition that is free of these components.
[0179] The fermentation broth formulations or cell compositions may further comprise a preservative and/or anti-microbial (e.g., bacteriostatic) agent, including, but not limited to, sorbitol, sodium chloride, potassium sorbate, and others known in the art.
[0180] The cell-killed whole broth or composition may contain the unfractionated contents of the fermentation materials derived at the end of the fermentation. Typically, the cell-killed whole broth or composition contains the spent culture medium and cell debris present after the microbial cells (e.g., filamentous fungal cells) are grown to saturation, incubated under carbon-limiting conditions to allow protein synthesis (e.g., expression of cellulase and/or glucosidase enzyme(s)). In some embodiments, the cell-killed whole broth or composition contains the spent cell culture medium, extracellular enzymes, and killed filamentous fungal cells. In some embodiments, the microbial cells present in the cell-killed whole broth or composition can be permeabilized and/or lysed using methods known in the art.
[0181] A whole broth or cell composition as described herein is typically a liquid, but may contain insoluble components, such as killed cells, cell debris, culture media components, and/or insoluble enzyme(s). In some embodiments, insoluble components may be removed to provide a clarified liquid composition.
[0182] The whole broth formulations and cell compositions may be produced by a method described in WO 90/15861 or WO 2010/096673.
[0183] In one aspect, the present invention relates to use of the enzyme composition of the present invention in degrading or converting a cellulosic material.
[0184] Fermentation. The fermentable sugars obtained from the hydrolyzed cellulosic material can be fermented by one or more (e.g., several) fermenting microorganisms capable of fermenting the sugars directly or indirectly into a desired fermentation product. "Fermentation" or "fermentation process" refers to any fermentation process or any process comprising a fermentation step. Fermentation processes also include fermentation processes used in the consumable alcohol industry (e.g., beer and wine), dairy industry (e.g., fermented dairy products), leather industry, and tobacco industry. The fermentation conditions depend on the desired fermentation product and fermenting organism and can easily be determined by one skilled in the art.
[0185] In the fermentation step, sugars, released from the cellulosic material as a result of the pretreatment and enzymatic hydrolysis steps, are fermented to a product, e.g., ethanol, by a fermenting organism, such as yeast. Hydrolysis (saccharification) and fermentation can be separate or simultaneous, as described herein.
[0186] Any suitable hydrolyzed cellulosic material can be used in the fermentation step in practicing the present invention. The material is generally selected based on the desired fermentation product, i.e., the substance to be obtained from the fermentation, and the process employed, as is well known in the art.
[0187] The term "fermentation medium" is understood herein to refer to a medium before the fermenting microorganism(s) is(are) added, such as, a medium resulting from a saccharification process, as well as a medium used in a simultaneous saccharification and fermentation process (SSF).
[0188] "Fermenting microorganism" refers to any microorganism, including bacterial and fungal organisms, suitable for use in a desired fermentation process to produce a fermentation product. The fermenting organism can be hexose and/or pentose fermenting organisms, or a combination thereof. Both hexose and pentose fermenting organisms are well known in the art. Suitable fermenting microorganisms are able to ferment, i.e., convert, sugars, such as glucose, xylose, xylulose, arabinose, maltose, mannose, galactose, and/or oligosaccharides, directly or indirectly into the desired fermentation product. Examples of bacterial and fungal fermenting organisms producing ethanol are described by Lin et al., 2006, Appl. Microbiol. Biotechnol. 69: 627-642.
[0189] Examples of fermenting microorganisms that can ferment hexose sugars include bacterial and fungal organisms, such as yeast. Preferred yeast includes strains of Candida, Kluyveromyces, and Saccharomyces, e.g., Candida sonorensis, Kluyveromyces marxianus, and Saccharomyces cerevisiae.
[0190] Examples of fermenting organisms that can ferment pentose sugars in their native state include bacterial and fungal organisms, such as some yeast. Preferred xylose fermenting yeast include strains of Candida, preferably C. sheatae or C. sonorensis; and strains of Pichia, preferably P. stipitis, such as P. stipitis CBS 5773. Preferred pentose fermenting yeast include strains of Pachysolen, preferably P. tannophilus. Organisms not capable of fermenting pentose sugars, such as xylose and arabinose, may be genetically modified to do so by methods known in the art.
[0191] Examples of bacteria that can efficiently ferment hexose and pentose to ethanol include, for example, Bacillus coagulans, Clostridium acetobutylicum, Clostridium thermocellum, Clostridium phytofermentans, Geobacillus sp., Thermoanaerobacter saccharolyticum, and Zymomonas mobilis (Philippidis, 1996, supra).
[0192] Other fermenting organisms include strains of Bacillus, such as Bacillus coagulans; Candida, such as C. sonorensis, C. methanosorbosa, C. diddensiae, C. parapsilosis, C. naedodendra, C. blankii, C. entomophilia, C. brassicae, C. pseudotropicalis, C. boidinii, C. utils, and C. scehatae; Clostridium, such as C. acetobutylicum, C. thermocellum, and C. phytofermentans; E. coli, especially E. coli strains that have been genetically modified to improve the yield of ethanol; Geobacillus sp.; Hansenula, such as Hansenula anomala; Klebsiella, such as K. oxytoca; Kluyveromyces, such as K. marxianus, K. lactis, K. thermotolerans, and K. fragilis; Schizosaccharomyces, such as S. pombe; Thermoanaerobacter, such as Thermoanaerobacter saccharolyticum; and Zymomonas, such as Zymomonas mobilis.
[0193] In a preferred aspect, the yeast is a Bretannomyces. In a more preferred aspect, the yeast is Bretannomyces clausenii. In another preferred aspect, the yeast is a Candida. In another more preferred aspect, the yeast is Candida sonorensis. In another more preferred aspect, the yeast is Candida boidinii. In another more preferred aspect, the yeast is Candida blankii. In another more preferred aspect, the yeast is Candida brassicae. In another more preferred aspect, the yeast is Candida diddensii. In another more preferred aspect, the yeast is Candida entomophiliia. In another more preferred aspect, the yeast is Candida pseudotropicalis. In another more preferred aspect, the yeast is Candida scehatae. In another more preferred aspect, the yeast is Candida utilis. In another preferred aspect, the yeast is a Clavispora. In another more preferred aspect, the yeast is Clavispora lusitaniae. In another more preferred aspect, the yeast is Clavispora opuntiae. In another preferred aspect, the yeast is a Kluyveromyces. In another more preferred aspect, the yeast is Kluyveromyces fragilis. In another more preferred aspect, the yeast is Kluyveromyces marxianus. In another more preferred aspect, the yeast is Kluyveromyces thermotolerans. In another preferred aspect, the yeast is a Pachysolen. In another more preferred aspect, the yeast is Pachysolen tannophilus. In another preferred aspect, the yeast is a Pichia. In another more preferred aspect, the yeast is a Pichia stipitis. In another preferred aspect, the yeast is a Saccharomyces spp. In another more preferred aspect, the yeast is Saccharomyces cerevisiae. In another more preferred aspect, the yeast is Saccharomyces distaticus. In another more preferred aspect, the yeast is Saccharomyces uvarum.
[0194] In a preferred aspect, the bacterium is a Bacillus. In a more preferred aspect, the bacterium is Bacillus coagulans. In another preferred aspect, the bacterium is a Clostridium. In another more preferred aspect, the bacterium is Clostridium acetobutylicum. In another more preferred aspect, the bacterium is Clostridium phytofermentans. In another more preferred aspect, the bacterium is Clostridium thermocellum. In another more preferred aspect, the bacterium is Geobacilus sp. In another more preferred aspect, the bacterium is a Thermoanaerobacter. In another more preferred aspect, the bacterium is Thermoanaerobacter saccharolyticum. In another preferred aspect, the bacterium is a Zymomonas. In another more preferred aspect, the bacterium is Zymomonas mobilis.
[0195] Commercially available yeast suitable for ethanol production include, e.g., BIOFERM.TM. AFT and XR (NABC--North American Bioproducts Corporation, GA, USA), ETHANOL RED.TM. yeast (Fermentis/Lesaffre, USA), FALI.TM. (Fleischmann's Yeast, USA), FERMIOL.TM. (DSM Specialties), GERT STRAND.TM. (Gert Strand AB, Sweden), and SUPERSTART.TM. and THERMOSACC.TM. fresh yeast (Ethanol Technology, WI, USA).
[0196] In a preferred aspect, the fermenting microorganism has been genetically modified to provide the ability to ferment pentose sugars, such as xylose utilizing, arabinose utilizing, and xylose and arabinose co-utilizing microorganisms.
[0197] The cloning of heterologous genes into various fermenting microorganisms has led to the construction of organisms capable of converting hexoses and pentoses to ethanol (co-fermentation) (Chen and Ho, 1993, Cloning and improving the expression of Pichia stipitis xylose reductase gene in Saccharomyces cerevisiae, Appl. Biochem. Biotechnol. 39-40: 135-147; Ho et al., 1998, Genetically engineered Saccharomyces yeast capable of effectively cofermenting glucose and xylose, Appl. Environ. Microbiol. 64: 1852-1859; Kotter and Ciriacy, 1993, Xylose fermentation by Saccharomyces cerevisiae, Appl. Microbiol. Biotechnol. 38: 776-783; Walfridsson et al., 1995, Xylose-metabolizing Saccharomyces cerevisiae strains overexpressing the TKL1 and TAL1 genes encoding the pentose phosphate pathway enzymes transketolase and transaldolase, Appl. Environ. Microbiol. 61: 4184-4190; Kuyper et al., 2004, Minimal metabolic engineering of Saccharomyces cerevisiae for efficient anaerobic xylose fermentation: a proof of principle, FEMS Yeast Research 4: 655-664; Beall et al., 1991, Parametric studies of ethanol production from xylose and other sugars by recombinant Escherichia coli, Biotech. Bioeng. 38: 296-303; Ingram et al., 1998, Metabolic engineering of bacteria for ethanol production, Biotechnol. Bioeng. 58: 204-214; Zhang et al., 1995, Metabolic engineering of a pentose metabolism pathway in ethanologenic Zymomonas mobilis, Science 267: 240-243; Deanda et al., 1996, Development of an arabinose-fermenting Zymomonas mobilis strain by metabolic pathway engineering, Appl. Environ. Microbiol. 62: 4465-4470; WO 2003/062430, xylose isomerase).
[0198] In a preferred aspect, the genetically modified fermenting microorganism is Candida sonorensis. In another preferred aspect, the genetically modified fermenting microorganism is Escherichia coli. In another preferred aspect, the genetically modified fermenting microorganism is Klebsiella oxytoca. In another preferred aspect, the genetically modified fermenting microorganism is Kluyveromyces marxianus. In another preferred aspect, the genetically modified fermenting microorganism is Saccharomyces cerevisiae. In another preferred aspect, the genetically modified fermenting microorganism is Zymomonas mobilis.
[0199] It is well known in the art that the organisms described above can also be used to produce other substances, as described herein.
[0200] The fermenting microorganism is typically added to the degraded cellulosic material or hydrolysate and the fermentation is performed for about 8 to about 96 hours, e.g., about 24 to about 60 hours. The temperature is typically between about 26.degree. C. to about 60.degree. C., e.g., about 32.degree. C. or 50.degree. C., and about pH 3 to about pH 8, e.g., pH 4-5, 6, or 7.
[0201] In one aspect, the yeast and/or another microorganism are applied to the degraded cellulosic material and the fermentation is performed for about 12 to about 96 hours, such as typically 24-60 hours. In another aspect, the temperature is preferably between about 20.degree. C. to about 60.degree. C., e.g., about 25.degree. C. to about 50.degree. C., about 32.degree. C. to about 50.degree. C., or about 32.degree. C. to about 50.degree. C., and the pH is generally from about pH 3 to about pH 7, e.g., about pH 4 to about pH 7. However, some fermenting organisms, e.g., bacteria, have higher fermentation temperature optima. Yeast or another microorganism is preferably applied in amounts of approximately 10.sup.5 to 10.sup.12, preferably from approximately 10.sup.7 to 10.sup.10, especially approximately 2.times.10.sup.8 viable cell count per ml of fermentation broth. Further guidance in respect of using yeast for fermentation can be found in, e.g., "The Alcohol Textbook" (Editors K. Jacques, T. P. Lyons and D. R. Kelsall, Nottingham University Press, United Kingdom 1999), which is hereby incorporated by reference.
[0202] For ethanol production, following the fermentation the fermented slurry is distilled to extract the ethanol. The ethanol obtained according to the methods of the invention can be used as, e.g., fuel ethanol, drinking ethanol, i.e., potable neutral spirits, or industrial ethanol.
[0203] A fermentation stimulator can be used in combination with any of the processes described herein to further improve the fermentation process, and in particular, the performance of the fermenting microorganism, such as, rate enhancement and ethanol yield. A "fermentation stimulator" refers to stimulators for growth of the fermenting microorganisms, in particular, yeast. Preferred fermentation stimulators for growth include vitamins and minerals. Examples of vitamins include multivitamins, biotin, pantothenate, nicotinic acid, meso-inositol, thiamine, pyridoxine, para-aminobenzoic acid, folic acid, riboflavin, and Vitamins A, B, C, D, and E. See, for example, Alfenore et al., Improving ethanol production and viability of Saccharomyces cerevisiae by a vitamin feeding strategy during fed-batch process, Springer-Verlag (2002), which is hereby incorporated by reference. Examples of minerals include minerals and mineral salts that can supply nutrients comprising P, K, Mg, S, Ca, Fe, Zn, Mn, and Cu.
[0204] Fermentation products: A fermentation product can be any substance derived from the fermentation. The fermentation product can be, without limitation, an alcohol (e.g., arabinitol, n-butanol, isobutanol, ethanol, glycerol, methanol, ethylene glycol, 1,3-propanediol [propylene glycol], butanediol, glycerin, sorbitol, and xylitol); an alkane (e.g., pentane, hexane, heptane, octane, nonane, decane, undecane, and dodecane), a cycloalkane (e.g., cyclopentane, cyclohexane, cycloheptane, and cyclooctane), an alkene (e.g. pentene, hexene, heptene, and octene); an amino acid (e.g., aspartic acid, glutamic acid, glycine, lysine, serine, and threonine); a gas (e.g., methane, hydrogen (H.sub.2), carbon dioxide (CO.sub.2), and carbon monoxide (CO)); isoprene; a ketone (e.g., acetone); an organic acid (e.g., acetic acid, acetonic acid, adipic acid, ascorbic acid, citric acid, 2,5-diketo-D-gluconic acid, formic acid, fumaric acid, glucaric acid, gluconic acid, glucuronic acid, glutaric acid, 3-hydroxypropionic acid, itaconic acid, lactic acid, malic acid, malonic acid, oxalic acid, oxaloacetic acid, propionic acid, succinic acid, and xylonic acid); and polyketide. The fermentation product can also be protein as a high value product.
[0205] In a preferred aspect, the fermentation product is an alcohol. It will be understood that the term "alcohol" encompasses a substance that contains one or more hydroxyl moieties. In a more preferred aspect, the alcohol is n-butanol. In another more preferred aspect, the alcohol is isobutanol. In another more preferred aspect, the alcohol is ethanol. In another more preferred aspect, the alcohol is methanol. In another more preferred aspect, the alcohol is arabinitol. In another more preferred aspect, the alcohol is butanediol. In another more preferred aspect, the alcohol is ethylene glycol. In another more preferred aspect, the alcohol is glycerin. In another more preferred aspect, the alcohol is glycerol. In another more preferred aspect, the alcohol is 1,3-propanediol. In another more preferred aspect, the alcohol is sorbitol. In another more preferred aspect, the alcohol is xylitol. See, for example, Gong, C. S., Cao, N. J., Du, J., and Tsao, G. T., 1999, Ethanol production from renewable resources, in Advances in Biochemical Engineering/Biotechnology, Scheper, T., ed., Springer-Verlag Berlin Heidelberg, Germany, 65: 207-241; Silveira, M. M., and Jonas, R., 2002, The biotechnological production of sorbitol, Appl. Microbiol. Biotechnol. 59: 400-408; Nigam, P., and Singh, D., 1995, Processes for fermentative production of xylitol--a sugar substitute, Process Biochemistry 30 (2): 117-124; Ezeji, T. C., Qureshi, N. and Blaschek, H. P., 2003, Production of acetone, butanol and ethanol by Clostridium beijerinckii BA101 and in situ recovery by gas stripping, World Journal of Microbiology and Biotechnology 19 (6): 595-603.
[0206] In another preferred aspect, the fermentation product is an alkane. The alkane can be an unbranched or a branched alkane. In another more preferred aspect, the alkane is pentane. In another more preferred aspect, the alkane is hexane. In another more preferred aspect, the alkane is heptane. In another more preferred aspect, the alkane is octane. In another more preferred aspect, the alkane is nonane. In another more preferred aspect, the alkane is decane. In another more preferred aspect, the alkane is undecane. In another more preferred aspect, the alkane is dodecane.
[0207] In another preferred aspect, the fermentation product is a cycloalkane. In another more preferred aspect, the cycloalkane is cyclopentane. In another more preferred aspect, the cycloalkane is cyclohexane. In another more preferred aspect, the cycloalkane is cycloheptane. In another more preferred aspect, the cycloalkane is cyclooctane.
[0208] In another preferred aspect, the fermentation product is an alkene. The alkene can be an unbranched or a branched alkene. In another more preferred aspect, the alkene is pentene. In another more preferred aspect, the alkene is hexene. In another more preferred aspect, the alkene is heptene. In another more preferred aspect, the alkene is octene.
[0209] In another preferred aspect, the fermentation product is an amino acid. In another more preferred aspect, the organic acid is aspartic acid. In another more preferred aspect, the amino acid is glutamic acid. In another more preferred aspect, the amino acid is glycine. In another more preferred aspect, the amino acid is lysine. In another more preferred aspect, the amino acid is serine. In another more preferred aspect, the amino acid is threonine. See, for example, Richard, A., and Margaritis, A., 2004, Empirical modeling of batch fermentation kinetics for poly(glutamic acid) production and other microbial biopolymers, Biotechnology and Bioengineering 87 (4): 501-515.
[0210] In another preferred aspect, the fermentation product is a gas. In another more preferred aspect, the gas is methane. In another more preferred aspect, the gas is H.sub.2. In another more preferred aspect, the gas is CO.sub.2. In another more preferred aspect, the gas is CO. See, for example, Kataoka, N., A. Miya, and K. Kiriyama, 1997, Studies on hydrogen production by continuous culture system of hydrogen-producing anaerobic bacteria, Water Science and Technology 36 (6-7): 41-47; and Gunaseelan V. N. in Biomass and Bioenergy, Vol. 13 (1-2), pp. 83-114, 1997, Anaerobic digestion of biomass for methane production: A review.
[0211] In another preferred aspect, the fermentation product is isoprene.
[0212] In another preferred aspect, the fermentation product is a ketone. It will be understood that the term "ketone" encompasses a substance that contains one or more ketone moieties. In another more preferred aspect, the ketone is acetone. See, for example, Qureshi and Blaschek, 2003, supra.
[0213] In another preferred aspect, the fermentation product is an organic acid. In another more preferred aspect, the organic acid is acetic acid. In another more preferred aspect, the organic acid is acetonic acid. In another more preferred aspect, the organic acid is adipic acid. In another more preferred aspect, the organic acid is ascorbic acid. In another more preferred aspect, the organic acid is citric acid. In another more preferred aspect, the organic acid is 2,5-diketo-D-gluconic acid. In another more preferred aspect, the organic acid is formic acid. In another more preferred aspect, the organic acid is fumaric acid. In another more preferred aspect, the organic acid is glucaric acid. In another more preferred aspect, the organic acid is gluconic acid. In another more preferred aspect, the organic acid is glucuronic acid. In another more preferred aspect, the organic acid is glutaric acid. In another preferred aspect, the organic acid is 3-hydroxypropionic acid. In another more preferred aspect, the organic acid is itaconic acid. In another more preferred aspect, the organic acid is lactic acid. In another more preferred aspect, the organic acid is malic acid. In another more preferred aspect, the organic acid is malonic acid. In another more preferred aspect, the organic acid is oxalic acid. In another more preferred aspect, the organic acid is propionic acid. In another more preferred aspect, the organic acid is succinic acid. In another more preferred aspect, the organic acid is xylonic acid. See, for example, Chen, R., and Lee, Y. Y., 1997, Membrane-mediated extractive fermentation for lactic acid production from cellulosic biomass, Appl. Biochem. Biotechnol. 63-65: 435-448.
[0214] In another preferred aspect, the fermentation product is polyketide.
[0215] Recovery. The fermentation product(s) can be optionally recovered from the fermentation medium using any method known in the art including, but not limited to, chromatography, electrophoretic procedures, differential solubility, distillation, or extraction. For example, alcohol is separated from the fermented cellulosic material and purified by conventional methods of distillation. Ethanol with a purity of up to about 96 vol. % can be obtained, which can be used as, for example, fuel ethanol, drinking ethanol, i.e., potable neutral spirits, or industrial ethanol.
Polypeptides Having Catalase Activity
[0216] In the methods of the present invention, the polypeptide having catalase activity can be any polypeptide having catalase activity. The polypeptide having catalase activity may be present as an enzyme in the enzyme composition and/or as one or more protein components added to the composition. In a preferred aspect, the polypeptide having catalase activity is foreign to one or more components of the cellulolytic enzyme composition.
[0217] The polypeptide having catalase activity may be obtained from microorganisms of any genus. In one aspect, the polypeptide obtained from a given source is secreted extracellularly.
[0218] The polypeptide having catalase activity may be a bacterial polypeptide. For example, the polypeptide may be a gram positive bacterial polypeptide such as a Bacillus, Streptococcus, Streptomyces, Staphylococcus, Enterococcus, Lactobacillus, Lactococcus, Clostridium, Geobacillus, or Oceanobacillus polypeptide having catalase activity, or a Gram negative bacterial polypeptide such as an E. coli, Pseudomonas, Salmonella, Campylobacter, Helicobacter, Flavobacterium, Fusobacterium, Ilyobacter, Neisseria, or Ureaplasma polypeptide having catalase activity.
[0219] In one aspect, the polypeptide having catalase activity is a Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, or Bacillus thuringiensis polypeptide having catalase activity.
[0220] In another aspect, the polypeptide having catalase activity is a Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, or Streptococcus equi subsp. Zooepidemicus polypeptide having catalase activity.
[0221] In another aspect, the polypeptide having catalase activity is a Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, or Streptomyces lividans polypeptide having catalase activity.
[0222] The polypeptide having catalase activity may also be a fungal polypeptide, and more preferably a yeast polypeptide such as a Candida, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia polypeptide having catalase activity; or more preferably a filamentous fungal polypeptide such as an Acremonium, Agaricus, Alternaria, Aspergillus, Aureobasidium, Botryospaeria, Ceriporiopsis, Chaetomidium, Chrysosporium, Claviceps, Cochliobolus, Coprinopsis, Coptotermes, Corynascus, Cryphonectria, Cryptococcus, Diplodia, Exidia, Filibasidium, Fusarium, Gibberella, Holomastigotoides, Humicola, Irpex, Lentinula, Leptospaeria, Magnaporthe, Melanocarpus, Meripilus, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Piromyces, Poitrasia, Pseudoplectania, Pseudotrichonympha, Rhizomucor, Schizophyllum, Scytalidium, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trichoderma, Trichophaea, Verticillium, Volvariella, or Xylaria polypeptide having catalase activity.
[0223] In another aspect, the polypeptide is a Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, or Saccharomyces oviformis polypeptide having catalase activity.
[0224] In another aspect, the polypeptide is an Acremonium cellulolyticus, Aspergillus aculeatus, Aspergillus awamori, Aspergillus fumigatus, Aspergillus foetidus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium tropicum, Chrysosporium merdarium, Chrysosporium inops, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium zonatum, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola grisea, Humicola insolens, Humicola lanuginosa, Irpex lacteus, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium emersonii, Penicillium funiculosum, Penicillium purpurogenum, Phanerochaete chrysosporium, Talaromyces stipitatus, Thermoascus aurantiacus, Thielavia achromatica, Thielavia albomyces, Thielavia albopilosa, Thielavia australeinsis, Thielavia fimeti, Thielavia microspora, Thielavia ovispora, Thielavia peruviana, Thielavia spededonium, Thielavia setosa, Thielavia subthermophila, Thielavia terrestris, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride polypeptide having catalase activity.
[0225] In a preferable embodiment, the polypeptide having catalase activity is a catalase from Thermoascus, Talaromyces, Humicola, or Penicillium. In a more preferable embodiment, the polypeptide having catalase activity is a catalase from Thermoascus aurantiacus, Talaromyces stipitatus, Humicola insolens, or Penicillium emersonii.
[0226] Non-limiting examples of suitable catalases and coding sequence thereof are listed below.
[0227] SEQ ID NOs: 1 and 2: a polynucleotide and a polypeptide of a catalase from Thermoascus aurantiacus, as described in JP 2004261137A.
[0228] SEQ ID NOs: 3 and 4: a polynucleotide and a polypeptide of a catalase from Talaromyces stipitatus, which can be prepared as shown in Examples 9-13.
[0229] SEQ ID NOs: 5 and 6: a polynucleotide and a polypeptide of a catalase from Humicola insolens, which can be prepared as shown in Examples 14-20.
[0230] SEQ ID NOs: 7 and 8: a polynucleotide and a polypeptide of a catalase from Penicillium emersonii, which can be prepared as shown in Examples 21-27.
[0231] SEQ ID NOs: 9 and 10: a polynucleotide and a polypeptide of a catalase from Thermus brockianus, as described in WO 2005/044994.
[0232] SEQ ID NOs: 11 and 12: a polynucleotide and a polypeptide of a catalase from Saccharomyces pastorianus, as described in WO 2007/105350.
[0233] SEQ ID NOs: 13 and 14: a polynucleotide and a polypeptide of a catalase from Saccharomyces pastorianus as described in WO 2007/105350.
[0234] SEQ ID NOs: 15 and 16: a polynucleotide and a polypeptide of a catalase from Penicillium pinophilum, as described in WO 2009/104622.
[0235] SEQ ID NOs: 17 and 18: a polynucleotide and a polypeptide of a catalase from Humicola grisea, as described in WO 2009/104622.
[0236] SEQ ID NOs: 19 and 20: a polynucleotide and a polypeptide of a catalase from Thielavia terrestris, as described in WO 2010/074972.
[0237] SEQ ID NOs: 21 and 22: a polynucleotide and a polypeptide of a catalase from Bacillus thermoglucosidasius, as described in JP 11243961A.
[0238] SEQ ID NOs: 23 and 24: a polynucleotide and a polypeptide of a catalase from Aspergillus oryzae, as described in JP 2002223772A.
[0239] SEQ ID NOs: 25 and 26: a polynucleotide and a polypeptide of a catalase from Thermoascus aurantiacus, as described in JP 2007143405A.
[0240] SEQ ID NOs: 27 and 28: a polynucleotide and a polypeptide of a catalase from Bacillus thermoglucosidasius, as described in U.S. Pat. No. 6,022,721.
[0241] SEQ ID NOs: 29 and 30: a polynucleotide and a polypeptide of a catalase from Bacillus thermoglucosidasius, as described in U.S. Pat. No. 6,022,721.
[0242] SEQ ID NOs: 31 and 32: a polynucleotide and a polypeptide of a catalase from Alcaligenes aquamarinus, as described in WO 98/00526.
[0243] SEQ ID NOs: 33 and 34: a polynucleotide and a polypeptide of a catalase from Microscilla furvescens, as described in WO 98/00526.
[0244] SEQ ID NOs: 35 and 36: a polynucleotide and a polypeptide of a catalase from Aspergillus niger, as described in U.S. Pat. No. 5,360,901.
[0245] SEQ ID NO 37: a polypeptide of Humicola grisea thermotolerant catalase (GENESEQP: AXQ55105, disclosed in WO2009104622).
[0246] In an embodiment, the catalases used in the present invention have a sequence identity to the mature polypeptide of the mature polypeptide of SEQ ID NO: 2, the mature polypeptide of SEQ ID NO: 4, the mature polypeptide of SEQ ID NO: 6, the mature polypeptide of SEQ ID NO: 8, the mature polypeptide of SEQ ID NO: 10, the mature polypeptide of SEQ ID NO: 12, the mature polypeptide of SEQ ID NO: 14, the mature polypeptide of SEQ ID NO: 16, the mature polypeptide of SEQ ID NO: 18, the mature polypeptide of SEQ ID NO: 20, the mature polypeptide of SEQ ID NO: 22, the mature polypeptide of SEQ ID NO: 24, the mature polypeptide of SEQ ID NO: 26, the mature polypeptide of SEQ ID NO: 28, the mature polypeptide of SEQ ID NO: 30, the mature polypeptide of SEQ ID NO: 32, the mature polypeptide of SEQ ID NO: 34, the mature polypeptide of SEQ ID NO: 36, the mature polypeptide of SEQ ID NO: 37, of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 78%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%, which have catalase activity.
[0247] In one embodiment, the mature polypeptide is amino acids 1 to 746 of SEQ ID NO: 2. In another aspect, the mature polypeptide is amino acids 20 to 733 of SEQ ID NO: 4 based on the SignalP program that predicts amino acids 1 to 19 of SEQ ID NO: 4 are a signal peptide. In another aspect, the mature polypeptide is amino acids 20 to 765 of SEQ ID NO: 6 based on the SignalP program that predicts amino acids 1 to 19 of SEQ ID NO: 6 are a signal peptide. In another aspect, the mature polypeptide is amino acids 20 to 741 of SEQ ID NO: 8 based on the SignalP program that predicts amino acids 1 to 19 of SEQ ID NO: 8 are a signal peptide. It is known in the art that a host cell may produce a mixture of two of more different mature polypeptides (i.e., with a different C-terminal and/or N-terminal amino acid) expressed by the same polynucleotide.
[0248] In another embodiment, the catalases used in the present invention are encoded by a polynucleotide that hybridizes under low stringency conditions, medium stringency conditions, medium-high stringency conditions, high stringency conditions, or very high stringency conditions with (i) the mature polypeptide coding sequence of SEQ ID NO: 1, the mature polypeptide coding sequence of SEQ ID NO: 3, the mature polypeptide coding sequence of SEQ ID NO: 5, or the mature polypeptide coding sequence of SEQ ID NO: 7, (ii) the cDNA sequence thereof, or (iii) the full-length complement of (i) or (ii) (Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 2d edition, Cold Spring Harbor, N.Y.).
[0249] For purposes of the present invention, hybridization indicates that the polynucleotide hybridizes to a labeled nucleic acid probe corresponding to (i) SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID NO: 7; (ii) the mature polypeptide coding sequence of SEQ ID NO: 1, the mature polypeptide coding sequence of SEQ ID NO: 3, the mature polypeptide coding sequence of SEQ ID NO: 5, or the mature polypeptide coding sequence of SEQ ID NO: 7; (iii) the cDNA sequence thereof; (iv) the full-length complement thereof; or (v) a subsequence thereof; under very low to very high stringency conditions. Molecules to which the nucleic acid probe hybridizes under these conditions can be detected using, for example, X-ray film or any other detection means known in the art.
[0250] In an embodiment, the catalases used in the present invention are encoded by a polynucleotide having at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 78%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% sequence identity to the mature polypeptide coding sequence of SEQ ID NO: 1, the mature polypeptide coding sequence of SEQ ID NO: 3, the mature polypeptide coding sequence of SEQ ID NO: 5, or the mature polypeptide coding sequence of SEQ ID NO: 7; or the cDNA sequence thereof.
[0251] In one embodiment, the mature polypeptide coding sequence is nucleotides 1 to 2351 of SEQ ID NO: 1 or the cDNA sequence thereof. In another embodiment, the mature polypeptide coding sequence is nucleotides 58 to 2418 of SEQ ID NO: 3 or the cDNA sequence thereof based on the SignalP program that predicts nucleotides 1 to 57 of SEQ ID NO: 3 encode a signal peptide. In another embodiment, the mature polypeptide coding sequence is nucleotides 58 to 3040 of SEQ ID NO: 5 or the cDNA sequence thereof based on the SignalP program that predicts nucleotides 1 to 57 of SEQ ID NO: 5 encode a signal peptide. In another embodiment, the mature polypeptide coding sequence is nucleotides 58 to 2476 of SEQ ID NO: 7 or the cDNA sequence thereof based on the SignalP program that predicts nucleotides 1 to 57 of SEQ ID NO: 7 encode a signal peptide.
[0252] In another embodiment, the catalases used in the present invention relates to variants of the mature polypeptide of SEQ ID NO: 2, variants of the mature polypeptide of SEQ ID NO: 4, variants of the mature polypeptide of SEQ ID NO: 6, or variants of the mature polypeptide of SEQ ID NO: 8, comprising a substitution, deletion, and/or insertion at one or more (e.g., several) positions. In an embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the mature polypeptide of SEQ ID NO: 2, the mature polypeptide of SEQ ID NO: 4, the mature polypeptide of SEQ ID NO: 6, or the mature polypeptide of SEQ ID NO: 8 is up to 10, e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10. The amino acid changes may be of a minor nature, that is conservative amino acid substitutions or insertions that do not significantly affect the folding and/or activity of the protein; small deletions, typically of 1-30 amino acids; small amino- or carboxyl-terminal extensions, such as an amino-terminal methionine residue; a small linker peptide of up to 20-25 residues; or a small extension that facilitates purification by changing net charge or another function, such as a poly-histidine tract, an antigenic epitope or a binding domain.
[0253] Examples of conservative substitutions are within the groups of basic amino acids (arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic acid), polar amino acids (glutamine and asparagine), hydrophobic amino acids (leucine, isoleucine and valine), aromatic amino acids (phenylalanine, tryptophan and tyrosine), and small amino acids (glycine, alanine, serine, threonine and methionine). Amino acid substitutions that do not generally alter specific activity are known in the art and are described, for example, by H. Neurath and R. L. Hill, 1979, In, The Proteins, Academic Press, New York. Common substitutions are Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/Ile, Leu/Val, Ala/Glu, and Asp/Gly.
[0254] Alternatively, the amino acid changes are of such a nature that the physico-chemical properties of the polypeptides are altered. For example, amino acid changes may improve the thermal stability of the polypeptide, alter the substrate specificity, change the pH optimum, and the like.
[0255] Essential amino acids in a polypeptide can be identified according to procedures known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, 1989, Science 244: 1081-1085). In the latter technique, single alanine mutations are introduced at every residue in the molecule, and the resultant mutant molecules are tested for catalase activity to identify amino acid residues that are critical to the activity of the molecule. See also, Hilton et al., 1996, J. Biol. Chem. 271: 4699-4708. The active site of the enzyme or other biological interaction can also be determined by physical analysis of structure, as determined by such techniques as nuclear magnetic resonance, crystallography, electron diffraction, or photoaffinity labeling, in conjunction with mutation of putative contact site amino acids. See, for example, de Vos et al., 1992, Science 255: 306-312; Smith et al., 1992, J. Mol. Biol. 224: 899-904; Wlodaver et al., 1992, FEBS Lett. 309: 59-64. The identity of essential amino acids can also be inferred from an alignment with a related polypeptide.
[0256] Single or multiple amino acid substitutions, deletions, and/or insertions can be made and tested using known methods of mutagenesis, recombination, and/or shuffling, followed by a relevant screening procedure, such as those disclosed by Reidhaar-Olson and Sauer, 1988, Science 241: 53-57; Bowie and Sauer, 1989, Proc. Natl. Acad. Sci. USA 86: 2152-2156; WO 95/17413; or WO 95/22625. Other methods that can be used include error-prone PCR, phage display (e.g., Lowman et al., 1991, Biochemistry 30: 10832-10837; U.S. Pat. No. 5,223,409; WO 92/06204), and region-directed mutagenesis (Derbyshire et al., 1986, Gene 46: 145; Ner et al., 1988, DNA 7: 127).
[0257] Mutagenesis/shuffling methods can be combined with high-throughput, automated screening methods to detect activity of cloned, mutagenized polypeptides expressed by host cells (Ness et al., 1999, Nature Biotechnology 17: 893-896). Mutagenized DNA molecules that encode active polypeptides can be recovered from the host cells and rapidly sequenced using standard methods in the art. These methods allow the rapid determination of the importance of individual amino acid residues in a polypeptide.
[0258] The polypeptide may be a hybrid polypeptide in which a region of one polypeptide is fused at the N-terminus or the C-terminus of a region of another polypeptide.
[0259] The polypeptide may be a fusion polypeptide or cleavable fusion polypeptide in which another polypeptide is fused at the N-terminus or the C-terminus of the polypeptide of the present invention. A fusion polypeptide is produced by fusing a polynucleotide encoding another polypeptide to a polynucleotide of the present invention. Techniques for producing fusion polypeptides are known in the art, and include ligating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fusion polypeptide is under control of the same promoter(s) and terminator. Fusion polypeptides may also be constructed using intein technology in which fusion polypeptides are created post-translationally (Cooper et al., 1993, EMBO J. 12: 2575-2583; Dawson et al., 1994, Science 266: 776-779).
[0260] A fusion polypeptide can further comprise a cleavage site between the two polypeptides. Upon secretion of the fusion protein, the site is cleaved releasing the two polypeptides. Examples of cleavage sites include, but are not limited to, the sites disclosed in Martin et al., 2003, J. Ind. Microbiol. Biotechnol. 3: 568-576; Svetina et al., 2000, J. Biotechnol. 76: 245-251; Rasmussen-Wilson et al., 1997, Appl. Environ. Microbiol. 63: 3488-3493; Ward et al., 1995, Biotechnology 13: 498-503; and Contreras et al., 1991, Biotechnology 9: 378-381; Eaton et al., 1986, Biochemistry 25: 505-512; Collins-Racie et al., 1995, Biotechnology 13: 982-987; Carter et al., 1989, Proteins: Structure, Function, and Genetics 6: 240-248; and Stevens, 2003, Drug Discovery World 4: 35-48.
[0261] Examples of commercial catalase preparations suitable for use in the present invention include, for example, Terminox Ultra 50L/200L (Novozymes A/S), Catazyme 25L (Novozymes A/S), GC118 (Danisco A/S), Oxygone T100/T400 (Danisco A/S), ASC Super 200L (Mitsubishi Chemicals, Japan), and Reyonet 200L (Nagase, Japan).
Nucleic Acid Constructs
[0262] An isolated polynucleotide encoding a polypeptide, e.g., a cellulolytic enzyme, a polypeptide having catalase activity, a polypeptide having cellulolytic enhancing activity, etc., may be manipulated in a variety of ways to provide for expression of the polypeptide by constructing a nucleic acid construct comprising an isolated polynucleotide encoding the polypeptide operably linked to one or more (e.g., several) control sequences that direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences. A polynucleotide may be manipulated in a variety of ways to provide for expression of the polypeptide. Manipulation of the polynucleotideprior to its insertion into a vector may be desirable or necessary depending on the expression vector. The techniques for modifying polynucleotide sequences utilizing recombinant DNA methods are well known in the art.
[0263] The control sequence may be a promoter, a polynucleotide that is recognized by a host cell for expression of a polynucleotide encoding a polypeptide. The promoter contains transcriptional control sequences that mediate the expression of the polypeptide. The promoter may be any polynucleotide that shows transcriptional activity in the host cell including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell.
[0264] Examples of suitable promoters for directing transcription of the nucleic acid constructs of the present invention in a bacterial host cell are the promoters obtained from the Bacillus amyloliquefaciens alpha-amylase gene (amyQ), Bacillus licheniformis alpha-amylase gene (amyL), Bacillus licheniformis penicillinase gene (penP), Bacillus stearothermophilus maltogenic amylase gene (amyM), Bacillus subtilis levansucrase gene (sacB), Bacillus subtilis xylA and xylB genes, Bacillus thuringiensis cryIIIA gene (Agaisse and Lereclus, 1994, Molecular Microbiology 13: 97-107), E. coli lac operon, E. coli trc promoter (Egon et al., 1988, Gene 69: 301-315), Streptomyces coelicolor agarase gene (dagA), and prokaryotic beta-lactamase gene (Villa-Kamaroff et al., 1978, Proc. Natl. Acad. Sci. USA 75: 3727-3731), as well as the tac promoter (DeBoer et al., 1983, Proc. Natl. Acad. Sci. USA 80: 21-25). Further promoters are described in "Useful proteins from recombinant bacteria" in Gilbert et al., 1980, Scientific American 242: 74-94; and in Sambrook et al., 1989, supra. Examples of tandem promoters are disclosed in WO 99/43835.
[0265] Examples of suitable promoters for directing transcription of the nucleic acid constructs of the present invention in a filamentous fungal host cell are promoters obtained from the genes for Aspergillus nidulans acetamidase, Aspergillus niger neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase, Aspergillus niger or Aspergillus awamori glucoamylase (glaA), Aspergillus oryzae TAKA amylase, Aspergillus oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase, Fusarium oxysporum trypsin-like protease (WO 96/00787), Fusarium venenatum amyloglucosidase (WO 00/56900), Fusarium venenatum Daria (WO 00/56900), Fusarium venenatum Quinn (WO 00/56900), Rhizomucor miehei lipase, Rhizomucor miehei aspartic proteinase, Trichoderma reesei beta-glucosidase, Trichoderma reesei cellobiohydrolase I, Trichoderma reesei cellobiohydrolase II, Trichoderma reesei endoglucanase I, Trichoderma reesei endoglucanase II, Trichoderma reesei endoglucanase III, Trichoderma reesei endoglucanase IV, Trichoderma reesei endoglucanase V, Trichoderma reesei xylanase I, Trichoderma reesei xylanase II, Trichoderma reesei beta-xylosidase, as well as the NA2-tpi promoter (a modified promoter from an Aspergillus neutral alpha-amylase gene in which the untranslated leader has been replaced by an untranslated leader from an Aspergillus triose phosphate isomerase gene; non-limiting examples include modified promoters from an Aspergillus niger neutral alpha-amylase gene in which the untranslated leader has been replaced by an untranslated leader from an Aspergillus nidulans or Aspergillus oryzae triose phosphate isomerase gene); and mutant, truncated, and hybrid promoters thereof.
[0266] In a yeast host, useful promoters are obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae galactokinase (GAL1), Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH1, ADH2/GAP), Saccharomyces cerevisiae triose phosphate isomerase (TPI), Saccharomyces cerevisiae metallothionein (CUP1), and Saccharomyces cerevisiae 3-phosphoglycerate kinase. Other useful promoters for yeast host cells are described by Romanos et al., 1992, Yeast 8: 423-488.
[0267] The control sequence may also be a transcription terminator, which is recognized by a host cell to terminate transcription. The terminator is operably linked to the 3'-terminus of the polynucleotide encoding the polypeptide. Any terminator that is functional in the host cell may be used in the present invention.
[0268] Preferred terminators for bacterial host cells are obtained from the genes for Bacillus clausii alkaline protease (aprH), Bacillus licheniformis alpha-amylase (amyL), and Escherichia coli ribosomal RNA (rrnB).
[0269] Preferred terminators for filamentous fungal host cells are obtained from the genes for Aspergillus nidulans anthranilate synthase, Aspergillus niger glucoamylase, Aspergillus niger alpha-glucosidase, Aspergillus oryzae TAKA amylase, and Fusarium oxysporum trypsin-like protease.
[0270] Preferred terminators for yeast host cells are obtained from the genes for Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other useful terminators for yeast host cells are described by Romanos et al., 1992, supra.
[0271] The control sequence may also be an mRNA stabilizer region downstream of a promoter and upstream of the coding sequence of a gene which increases expression of the gene.
[0272] Examples of suitable mRNA stabilizer regions are obtained from a Bacillus thuringiensis cryIIIA gene (WO 94/25612) and a Bacillus subtilis SP82 gene (Hue et al., 1995, Journal of Bacteriology 177: 3465-3471).
[0273] The control sequence may also be a leader, a nontranslated region of an mRNA that is important for translation by the host cell. The leader is operably linked to the 5'-terminus of the polynucleotide encoding the polypeptide. Any leader that is functional in the host cell may be used.
[0274] Preferred leaders for filamentous fungal host cells are obtained from the genes for Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase.
[0275] Suitable leaders for yeast host cells are obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP).
[0276] The control sequence may also be a polyadenylation sequence, a sequence operably linked to the 3'-terminus of the polynucleotide and, when transcribed, is recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any polyadenylation sequence that is functional in the host cell may be used.
[0277] Preferred polyadenylation sequences for filamentous fungal host cells are obtained from the genes for Aspergillus nidulans anthranilate synthase, Aspergillus niger glucoamylase, Aspergillus niger alpha-glucosidase Aspergillus oryzae TAKA amylase, and Fusarium oxysporum trypsin-like protease
[0278] Useful polyadenylation sequences for yeast host cells are described by Guo and Sherman, 1995, Mol. Cellular Biol. 15: 5983-5990.
[0279] The control sequence may also be a signal peptide coding region that encodes a signal peptide linked to the N-terminus of a polypeptide and directs the polypeptide into the cell's secretory pathway. The 5'-end of the coding sequence of the polynucleotide may inherently contain a signal peptide coding sequence naturally linked in translation reading frame with the segment of the coding sequence that encodes the polypeptide. Alternatively, the 5'-end of the coding sequence may contain a signal peptide coding sequence that is foreign to the coding sequence. A foreign signal peptide coding sequence may be required where the coding sequence does not naturally contain a signal peptide coding sequence. Alternatively, a foreign signal peptide coding sequence may simply replace the natural signal peptide coding sequence in order to enhance secretion of the polypeptide. However, any signal peptide coding sequence that directs the expressed polypeptide into the secretory pathway of a host cell may be used.
[0280] Effective signal peptide coding sequences for bacterial host cells are the signal peptide coding sequences obtained from the genes for Bacillus NCIB 11837 maltogenic amylase, Bacillus licheniformis subtilisin, Bacillus licheniformis beta-lactamase, Bacillus stearothermophilus alpha-amylase, Bacillus stearothermophilus neutral proteases (nprT, nprS, nprM), and Bacillus subtilis prsA. Further signal peptides are described by Simonen and Palva, 1993, Microbiological Reviews 57: 109-137.
[0281] Effective signal peptide coding sequences for filamentous fungal host cells are the signal peptide coding sequences obtained from the genes for Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, Aspergillus oryzae TAKA amylase, Humicola insolens cellulase, Humicola insolens endoglucanase V, Humicola lanuginosa lipase, and Rhizomucor miehei aspartic proteinase.
[0282] Useful signal peptides for yeast host cells are obtained from the genes for Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal peptide coding sequences are described by Romanos et al., 1992, supra.
[0283] The control sequence may also be a propeptide coding sequence that encodes a propeptide positioned at the N-terminus of a polypeptide. The resultant polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases). A propolypeptide is generally inactive and can be converted to an active polypeptide by catalytic or autocatalytic cleavage of the propeptide from the propolypeptide. The propeptide coding sequence may be obtained from the genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), Myceliophthora thermophila laccase (WO 95/33836), Rhizomucor miehei aspartic proteinase, and Saccharomyces cerevisiae alpha-factor.
[0284] Where both signal peptide and propeptide sequences are present, the propeptide sequence is positioned next to the N-terminus of a polypeptide and the signal peptide sequence is positioned next to the N-terminus of the propeptide sequence.
[0285] It may also be desirable to add regulatory sequences that regulate expression of the polypeptide relative to the growth of the host cell. Examples of regulatory systems are those that cause expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound. Regulatory systems in prokaryotic systems include the lac, tac, and trp operator systems. In yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the Aspergillus niger glucoamylase promoter, Aspergillus oryzae TAKA alpha-amylase promoter, and Aspergillus oryzae glucoamylase promoter may be used. Other examples of regulatory sequences are those that allow for gene amplification. In eukaryotic systems, these regulatory sequences include the dihydrofolate reductase gene that is amplified in the presence of methotrexate, and the metallothionein genes that are amplified with heavy metals. In these cases, the polynucleotide encoding the polypeptide would be operably linked with the regulatory sequence.
Expression Vectors
[0286] The various nucleotide and control sequences described above may be joined together to produce a recombinant expression vector that may include one or more (e.g., several) convenient restriction sites to allow for insertion or substitution of a polynucleotide encoding a polypeptide, e.g., a cellulolytic enzyme, a polypeptide having catalase activity, a polypeptide having cellulolytic enhancing activity, etc., at such sites. Alternatively, the polynucleotide may be expressed by inserting the polynucleotide or a nucleic acid construct comprising the polynucleotide into an appropriate vector for expression. In creating the expression vector, the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression.
[0287] The recombinant expression vector may be any vector (e.g., a plasmid or virus) that can be conveniently subjected to recombinant DNA procedures and can bring about expression of the polynucleotide. The choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced. The vector may be a linear or closed circular plasmid.
[0288] The vector may be an autonomously replicating vector, i.e., a vector that exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome. The vector may contain any means for assuring self-replication. Alternatively, the vector may be one that, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated. Furthermore, a single vector or plasmid or two or more vectors or plasmids that together contain the total DNA to be introduced into the genome of the host cell, or a transposon, may be used.
[0289] The vector preferably contains one or more (e.g., several) selectable markers that permit easy selection of transformed, transfected, transduced, or the like cells. A selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like.
[0290] Examples of bacterial selectable markers are Bacillus licheniformis or Bacillus subtilis dal genes, or markers that confer antibiotic resistance such as ampicillin, chloramphenicol, kanamycin, neomycin, spectinomycin, or tetracycline resistance. Suitable markers for yeast host cells include, but are not limited to, ADE2, HIS3, LEU2, LYS2, MET3, TRP1, and URA3. Selectable markers for use in a filamentous fungal host cell include, but are not limited to, amdS (acetamidase), argB (ornithine carbamoyltransferase), bar (phosphinothricin acetyltransferase), hph (hygromycin phosphotransferase), niaD (nitrate reductase), pyrG (orotidine-5'-phosphate decarboxylase), sC (sulfate adenyltransferase), and trpC (anthranilate synthase), as well as equivalents thereof. Preferred for use in an Aspergillus cell are Aspergillus nidulans or Aspergillus oryzae amdS and pyrG genes and a Streptomyces hygroscopicus bar gene.
[0291] The vector preferably contains an element(s) that permits integration of the vector into the host cell's genome or autonomous replication of the vector in the cell independent of the genome.
[0292] For integration into the host cell genome, the vector may rely on the polynucleotide encoding the polypeptide or any other element of the vector for integration into the genome by homologous or non-homologous recombination. Alternatively, the vector may contain additional polynucleotides for directing integration by homologous recombination into the genome of the host cell at a precise location(s) in the chromosome(s). To increase the likelihood of integration at a precise location, the integrational elements should contain a sufficient number of nucleic acids, such as 100 to 10,000 base pairs, 400 to 10,000 base pairs, and 800 to 10,000 base pairs, which have a high degree of sequence identity to the corresponding target sequence to enhance the probability of homologous recombination. The integrational elements may be any sequence that is homologous with the target sequence in the genome of the host cell. Furthermore, the integrational elements may be non-encoding or encoding polynucleotides. On the other hand, the vector may be integrated into the genome of the host cell by non-homologous recombination.
[0293] For autonomous replication, the vector may further comprise an origin of replication enabling the vector to replicate autonomously in the host cell in question. The origin of replication may be any plasmid replicator mediating autonomous replication that functions in a cell. The term "origin of replication" or "plasmid replicator" means a polynucleotide that enables a plasmid or vector to replicate in vivo.
[0294] Examples of bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19, pACYC177, and pACYC184 permitting replication in E. coli, and pUB110, pE194, pTA1060, and pAM 1 permitting replication in Bacillus.
[0295] Examples of origins of replication for use in a yeast host cell are the 2 micron origin of replication, ARS1, ARS4, the combination of ARS1 and CEN3, and the combination of ARS4 and CEN6.
[0296] Examples of origins of replication useful in a filamentous fungal cell are AMA1 and ANS1 (Gems et al., 1991, Gene 98: 61-67; Cullen et al., 1987, Nucleic Acids Res. 15: 9163-9175; WO 00/24883). Isolation of the AMA1 gene and construction of plasmids or vectors comprising the gene can be accomplished according to the methods disclosed in WO 00/24883.
[0297] More than one copy of a polynucleotide may be inserted into a host cell to increase production of a polypeptide. An increase in the copy number of the polynucleotide can be obtained by integrating at least one additional copy of the sequence into the host cell genome or by including an amplifiable selectable marker gene with the polynucleotide where cells containing amplified copies of the selectable marker gene, and thereby additional copies of the polynucleotide, can be selected for by cultivating the cells in the presence of the appropriate selectable agent.
[0298] The procedures used to ligate the elements described above to construct the recombinant expression vectors are well known to one skilled in the art (see, e.g., Sambrook et al., 1989, supra).
Host Cells
[0299] Recombinant host cells comprising a polynucleotide encoding a polypeptide, e.g., a cellulolytic enzyme, a polypeptide having catalase activity, a polypeptide having cellulolytic enhancing activity, etc., can be advantageously used in the recombinant production of the polypeptide. A construct or vector comprising a polynucleotide is introduced into a host cell so that the construct or vector is maintained as a chromosomal integrant or as a self-replicating extra-chromosomal vector as described earlier. The term "host cell" encompasses any progeny of a parent cell that is not identical to the parent cell due to mutations that occur during replication. The choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source.
[0300] The host cell may be any cell useful in the recombinant production of a polypeptide, e.g., a prokaryote or a eukaryote.
[0301] The prokaryotic host cell may be any Gram-positive or Gram-negative bacterium. Gram-positive bacteria include, but not limited to, Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, and Streptomyces. Gram-negative bacteria include, but not limited to, Campylobacter, E. coli, Flavobacterium, Fusobacterium, Helicobacter, Ilyobacter, Neisseria, Pseudomonas, Salmonella, and Ureaplasma.
[0302] The bacterial host cell may be any Bacillus cell including, but not limited to, Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis cells.
[0303] The bacterial host cell may also be any Streptococcus cell including, but not limited to, Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, and Streptococcus equi subsp. Zooepidemicus cells.
[0304] The bacterial host cell may also be any Streptomyces cell including, but not limited to, Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, and Streptomyces lividans cells.
[0305] The introduction of DNA into a Bacillus cell may be effected by protoplast transformation (see, e.g., Chang and Cohen, 1979, Mol. Gen. Genet. 168: 111-115), using competent cells transformation (see, e.g., Young and Spizizen, 1961, J. Bacteriol. 81: 823-829, or Dubnau and Davidoff-Abelson, 1971, J. Mol. Biol. 56: 209-221), electroporation (see, e.g., Shigekawa and Dower, 1988, Biotechniques 6: 742-751), or conjugation (see, e.g., Koehler and Thorne, 1987, J. Bacteriol. 169: 5271-5278). The introduction of DNA into an E. coli cell may be effected by protoplast transformation (see, e.g., Hanahan, 1983, J. Mol. Biol. 166: 557-580) or electroporation (see, e.g., Dower et al., 1988, Nucleic Acids Res. 16: 6127-6145). The introduction of DNA into a Streptomyces cell may be effected by protoplast transformation and electroporation (see, e.g., Gong et al., 2004, Folia Microbiol. (Praha) 49: 399-405), conjugation (see, e.g., Mazodier et al., 1989, J. Bacteriol. 171: 3583-3585), or transduction (see, e.g., Burke et al., 2001, Proc. Natl. Acad. Sci. USA 98: 6289-6294). The introduction of DNA into a Pseudomonas cell may be effected by electroporation (see, e.g., Choi et al., 2006, J. Microbiol. Methods 64: 391-397) or conjugation (see, e.g., Pinedo and Smets, 2005, Appl. Environ. Microbiol. 71: 51-57). The introduction of DNA into a Streptococcus cell may be effected by natural competence (see, e.g., Perry and Kuramitsu, 1981, Infect. Immun. 32: 1295-1297), protoplast transformation (see, e.g., Catt and Jollick, 1991, Microbios 68: 189-207), electroporation (see, e.g., Buckley et al., 1999, Appl. Environ. Microbiol. 65: 3800-3804) or conjugation (see, e.g., Clewell, 1981, Microbiol. Rev. 45: 409-436). However, any method known in the art for introducing DNA into a host cell can be used.
[0306] The host cell may also be a eukaryote, such as a mammalian, insect, plant, or fungal cell. "Fungi" as used herein includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota as well as the Oomycota and all mitosporic fungi (as defined by Hawksworth et al., In, Ainsworth and Bisby's Dictionary of The Fungi, 8th edition, 1995, CAB International, University Press, Cambridge, UK).
[0307] The fungal host cell may be a yeast cell. "Yeast" as used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). Since the classification of yeast may change in the future, for the purposes of this invention, yeast shall be defined as described in Biology and Activities of Yeast (Skinner, F. A., Passmore, S. M., and Davenport, R. R., eds, Soc. App. Bacteriol. Symposium Series No. 9, 1980).
[0308] The yeast host cell may be a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell.
[0309] The fungal host cell may be a filamentous fungal cell. "Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra). The filamentous fungi are generally characterized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermentative.
[0310] The filamentous fungal host cell may be an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Filibasidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell.
[0311] For example, the filamentous fungal host cell may be an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zonatum, Coprinus cinereus, Coriolus hirsutus, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium suiphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phiebia radiata, Pleurotus eryngii, Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell.
[0312] Fungal cells may be transformed by a process involving protoplast formation, transformation of the protoplasts, and regeneration of the cell wall in a manner known per se. Suitable procedures for transformation of Aspergillus and Trichoderma host cells are described in EP 238023, Yelton et al., 1984, Proc. Natl. Acad. Sci. USA 81: 1470-1474, and Christensen et al., 1988, Bio/Technology 6: 1419-1422. Suitable methods for transforming Fusarium species are described by Malardier et al., 1989, Gene 78: 147-156, and WO 96/00787. Yeast may be transformed using the procedures described by Becker and Guarente, In Abelson, J. N. and Simon, M. I., editors, Guide to Yeast Genetics and Molecular Biology, Methods in Enzymology, Volume 194, pp 182-187, Academic Press, Inc., New York; Ito et al., 1983, J. Bacteriol. 153: 163; and Hinnen et al., 1978, Proc. Natl. Acad. Sci. USA 75: 1920.
Methods of Production
[0313] Methods for producing a polypeptide, e.g., a cellulolytic enzyme, a polypeptide having catalase activity, a polypeptide having cellulolytic enhancing activity, etc., comprise (a) cultivating a cell, which in its wild-type form produces the polypeptide, under conditions conducive for production of the polypeptide; and (b) recovering the polypeptide. In a preferred aspect, the cell is an Aspergillus, Thermoascus, Talaromyces, Trichoderma, Humicola, or Penicillium cell. In a more preferred aspect, the cell is an Aspergillus niger, Aspergillus oryaze, Aspergillus fumigatus, Thermoascus aurantiacus, Talaromyces stipitatus, Trichoderma reesei, Humicola insolens, or Penicillium emersonii cell.
[0314] Alternatively, methods for producing a polypeptide, e.g., a cellulolytic enzyme, a polypeptide having catalase activity, a polypeptide having cellulolytic enhancing activity, etc., comprise (a) cultivating a recombinant host cell under conditions conducive for production of the polypeptide; and (b) recovering the polypeptide.
[0315] The host cells are cultivated in a nutrient medium suitable for production of the polypeptide using methods known in the art. For example, the cell may be cultivated by shake flask cultivation, or small-scale or large-scale fermentation (including continuous, batch, fed-batch, or solid state fermentations) in laboratory or industrial fermentors in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated. The cultivation takes place in a suitable nutrient medium comprising carbon and nitrogen sources and inorganic salts, using procedures known in the art. Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). If the polypeptide is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium. If the polypeptide is not secreted, it can be recovered from cell lysates.
[0316] The polypeptide may be detected using methods known in the art that are specific for the polypeptides. These detection methods include, but are not limited to, use of specific antibodies, formation of an enzyme product, or disappearance of an enzyme substrate. For example, an enzyme assay may be used to determine the activity of the polypeptide. The polypeptides having cellulolytic enhancing activity are detected using the methods described herein.
[0317] The polypeptide may be recovered using methods known in the art. For example, the polypeptide may be recovered from the nutrient medium by conventional procedures including, but not limited to, collection, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation. In one aspect, the whole fermentation broth is recovered.
[0318] The polypeptide may be purified by a variety of procedures known in the art including, but not limited to, chromatography (e.g., ion exchange, affinity, hydrophobic, chromatofocusing, and size exclusion), electrophoretic procedures (e.g., preparative isoelectric focusing), differential solubility (e.g., ammonium sulfate precipitation), SDS-PAGE, or extraction (see, e.g., Protein Purification, Janson and Ryden, editors, VCH Publishers, New York, 1989) to obtain substantially pure polypeptides.
[0319] In an alternative aspect, the polypeptide is not recovered, but rather a host cell expressing a polypeptide is used as a source of the polypeptide.
[0320] The present invention is further described by the following examples that should not be construed as limiting the scope of the invention.
EXAMPLES
Strain
[0321] The fungal strain NN70 was obtained from Centraalbureau voor Schimmelcultures named as CBS 375.48. The strain NN70 was identified as Talaromyces stipitatus, based on both morphological characteristics and ITS rDNA sequence.
[0322] The fungal strain NN38 was isolated from a soil sample collected from China by the dilution plate method with PDA medium at 45.degree. C. It was then purified by transferring a single conidium onto a YG agar plate. The strain NN38 was identified as Humicola insolens, based on both morphological characteristics and ITS rDNA sequence.
[0323] The fungal strain NN051602 was isolated from a compost sample collected from China by the dilution plate method with PDA medium at 45.degree. C. It was then purified by transferring a single conidium onto a YG agar plate. The strain NN051602 was identified as Penicillium emersonii, based on both morphological characteristics and ITS rDNA sequence.
Media
[0324] PDA medium was composed of 39 grams of potato dextrose agar and deionized water to 1 liter.
[0325] YG agar plate was composed of 5.0 g of yeast extract, 10.0 g of glucose, 20.0 g of agar, and deionized water to 1 liter.
[0326] YPM medium contained 1% yeast extract, 2% of peptone, and 2% of maltose in deionized water.
[0327] YPG medium contained 0.4% of yeast extract, 0.1% of KH.sub.2PO.sub.4, 0.05% of MgSO.sub.4.7H.sub.2O, 1.5% glucose in deionized water.
[0328] Minimal medium plates were composed of 342 g of sucrose, 20 ml of salt solution, 20 g of agar, and deionized water to 1 liter. The salt solution was composed of 2.6% KCl, 2.6% MgSO.sub.4.7H.sub.2O, 7.6% KH.sub.2PO.sub.4, 2 ppm Na.sub.2B.sub.4O.sub.7.10H.sub.2O, 20 ppm CuSO.sub.4.5H2O, 40 ppm FeSO.sub.4.7H.sub.2O, 40 ppm MnSO.sub.4.2H.sub.2O, 40 ppm Na.sub.2MoO.sub.4.2H.sub.2O, and 400 ppm ZnSO.sub.4.7H.sub.2O.
Example 1: Boosting Effect of Thermoascus aurantiacus Catalase Or Talaromyces stipitatus Catalase on Hydrolysis of Pretreated Corn Stover (PCS)
[0329] Catalase from Thermoascus aurantiacus (as shown in SEQ ID NO: 2) was expressed by Aspergillus niger and purified as described in J.P. Publication 2004261137A. Catalase from Talaromyces stipitatus (as shown in SEQ ID NO: 4) was cloned, expressed and purified as Examples 9-13. Corn stover was pretreated at the U.S. Department of Energy National Renewable Energy Laboratory (NREL) using dilute sulfuric acid at conditions of 190.degree. C., 1 minute residence time, 0.05 g acid/g dry biomass, and at a 30% total solid concentration in a pretreatment reactor.
[0330] PCS was hydrolyzed at an initial total solid (TS) of 10% and total weight of hydrolysis system of 20 g. Trichoderma reesei cellulase composition (CELLIC.RTM. CTec2 available from Novozymes A/S, Bagsvaerd, Denmark) was added into the PCS for enzymatic hydrolysis with a ratio of the Trichoderma reesei cellulase composition to cellulose of 0.5% (w/w), namely 5 mg/g cellulose based on protein amount. Thermoascus aurantiacus catalase or Talaromyces stipitatus catalase at dosages shown in Table 1 below was added into the hydrolysis system. The hydrolysis system without the addition of catalase was used as a control. The flasks were incubated at 50.degree. C. for 72 hours, with shaking at 130 rpm. Unless specified otherwise, the total hydrolysis time was 72 hours. After hydrolysis was completed, the sugar was analyzed by High Performance Liquid Chromatography (HPLC).
[0331] For HPLC measurement, the collected samples were filtered using 0.22 .mu.m syringe filters (Millipore, Bedford, Mass., USA) and the filtrates were analyzed for sugar content as described below. The sugar concentrations of samples diluted in 0.005 M H.sub.2SO.sub.4 were measured using a 7.8.times.300 mm AMINEX.RTM. HPX-87H column (Bio-Rad Laboratories, Inc., Hercules, Calif., USA) by elution with 0.005 M H.sub.2SO.sub.4 at 65.degree. C. at a flow rate of 0.7 ml per minute, and quantification by integration of the glucose signal from refractive index detection (CHEMSTATION.RTM., AGILENT.RTM. 1100 HPLC, Agilent Technologies, Santa Clara, Calif., USA) calibrated by pure sugar samples. The resultant glucose was used to calculate the percentage of glucose yield from glucans for each reaction. Measured sugar concentrations were adjusted for the appropriate dilution factor. The net concentrations of enzymatically-produced sugars were determined by adjusting the measured sugar concentrations for corresponding background sugar concentrations in unwashed biomass at zero time point. All HPLC data processing was performed using MICROSOFT EXCEL.TM. software (Microsoft, Richland, Wash., USA).
[0332] The degree of glucose conversion to glucose was calculated according to the publication by Zhu, Y., et al. 2010, Bioresource Technology. 102(3): 2897-2903.
[0333] The results as shown in Table 1 demonstrated that PCS conversion to glucose can be improved significantly by adding small amounts of catalase.
TABLE-US-00001 TABLE 1 Effect of catalase from T. aurantiacus or catalase from T. stipitatus on the glucose conversion of PCS T. aurantiacus catalase T. stipitatus catalase Dosage of catalase (.mu.g/g cellulose) 0 (control) 50 250 500 50 250 500 Glucose conversion (%) 49.7 .+-. 0.4 54.9 .+-. 0.1 61.7 .+-. 1.8 63 .+-. 0.7 56.7 .+-. 0.4 60.6 .+-. 1.9 60.4 .+-. 2.5
Example 2: Boosting Effect of Talaromyces stipitatus Catalase on Mono-Components of Cellulases
[0334] Avicel.RTM. PH-101 (Fluka 11365, Sigma-Aldrich (Shanghai), Shanghai, China), a kind of microcrystalline cellulose, was hydrolyzed at a final concentration of 5 g/l and total volume of the hydrolysis system of 0.5 ml. The pH was adjusted and kept at 5.0 by 50 mM sodium acetate. In addition, ascorbic acid was present with a final concentration of 5 mM or absent in the hydrolysis system. Manganese (II) sulfate was present with a final concentration of 1 mM in the hydrolysis system.
[0335] Cellobiohydrolase (CBH) I from Aspergillus fumigatus (WO 2011/057140), cellobiohydrolase (CBH) II from Aspergillus fumigatus (WO 2011/057140), endoglucanase (EG) I from Trichoderma reesei (WO 2011/057140) and beta-glucanase (BG) from Aspergillus oryaze (WO 02/095014) were cloned, expressed and purified. These mono-components were applied to the hydrolysis of Avicel.RTM. individually. 10 mg mono-component of cellulase/g Avicel.RTM. and 5 mg catalase/g Avicel.RTM. were used. The tubes were incubated at 50.degree. C. for 72 hours, with shaking at 600 rpm. All experiments were performed in triplicates.
[0336] HPLC analysis of the extent of hydrolysis was performed according to the procedure described in Example 1.
[0337] The extent of cellulose conversion was calculated based on the mass ratio of solubilized glucosyl units to the initial mass of insoluble cellulose. Only glucose and cellobiose were measured for soluble sugars, as cellodextrins longer than cellobiose were present in negligible concentrations (due to enzymatic hydrolysis). The extent of total cellulose conversion was calculated using the following equation:
% conversion = [ cellobiose ] ( mg / ml ) / 1.053 + ( [ glucose ] ( mg / ml ) / 1.111 ) [ Cellulose ] ( mg / ml ) .times. 100 ( Equation 1 ) ##EQU00001##
[0338] The 1.111 and 1.053 factors for glucose and cellobiose, respectively, take into account the increase in mass when the glucosyl units in cellulose (average molecular mass of 162 daltons) are converted to glucose (molecular mass of 180 daltons) or cellobiose glucosyl units (average molecular mass of 171 daltons).
TABLE-US-00002 TABLE 2 Effect of catalase from T. stipitatus on glucose conversion of Avicel. Mono-component without Ascorbic acid Mono-component with Ascorbic acid Control BG CBH I CBH II EG Control BG CBH I CBH II EG Control (%) 0.0 .+-. 0.0 0.0 .+-. 0.0 13.7 .+-. 0.2 9.0 .+-. 0.3 2.5 .+-. 0.0 0.0 .+-. 0.0 0.0 .+-. 0.0 11.4 .+-. 0.1 8.0 .+-. 0.3 1.9 .+-. 0.1 Catalase (%) 0.0% .+-. 0.0 0.0 .+-. 0.0 13.4 .+-. 0.2 9.1 .+-. 0.1 2.4 .+-. 0.1 0.0 .+-. 0.0 2.4 .+-. 0.3 15.8 .+-. 1.3 12.7 .+-. 0.3 4.8 .+-. 0.0
[0339] As shown in Table 2, in the presence of ascorbic acid, the hydrolysis of every mono-component of cellulase can be boosted by T. stipitatus catalase.
Example 3: Boosting Effect of Humicola insolens Catalase on Hydrolysis of PCS
[0340] Preparation of PCS and set-up of hydrolysis system were the same as example 1. Catalase from Humicola insolens was cloned, expressed and purified as shown in Examples 14-20.
[0341] PCS was hydrolyzed at an initial TS of 10% and total weight of hydrolysis system of 20 g. Trichoderma reesei cellulase composition (CELLIC.RTM. CTec2 available from Novozymes A/S, Bagsvaerd, Denmark) was utilized for enzymatic hydrolysis. Five percent by weight Ctec2 was replaced by H. insolens catalase based on protein amount and the total enzyme dose was 4 mg/g cellulose. The hydrolysis system with 4 mg Trichoderma reesei cellulase composition/g cellulose but without catalase was used as a control. The flasks were incubated at 50.degree. C. for 72 hours, with shaking at 130 rpm. The total hydrolysis time was 72 hours.
[0342] The calculation of glucose conversion was the same as example 1 and the boosting effect was shown in table 3.
TABLE-US-00003 TABLE 3 Effect of catalase from Humicola insolens on glucose conversion of PCS. H. insolens Control Catalase Glucose conversion (%) 50.4 .+-. 1.1 58.4 .+-. 0.9
Example 4: Boosting Effect of Humicola insolens Catalase on Hydrolysis of PCS
[0343] Preparation of PCS and set-up of hydrolysis system were the same as example 1. Catalase from Humicola insolens was cloned, expressed and purified and purified as shown in Examples 14-20.
[0344] PCS was hydrolyzed at an initial TS of 10% and total weight of hydrolysis system of 20 g. Trichoderma reesei cellulase composition (CELLIC.RTM. CTec3 available from Novozymes A/S, Bagsvaerd, Denmark) was utilized for enzymatic hydrolysis. Five percent by weight Ctec3 was replaced by H. insolens catalase based on protein amount and the total enzyme dose was 4 mg/g cellulose. The hydrolysis system with 4 mg Trichoderma reesei cellulase composition/g cellulose but without catalase was used as a control. The flasks were incubated at 50.degree. C. for 72 hours, with shaking at 130 rpm. The total hydrolysis time was 72 hours.
[0345] The calculation of glucose conversion was the same as example 1 and the boosting effect was shown in table 4.
TABLE-US-00004 TABLE 4 Effect of catalase from Humicola insolens on glucose conversion of PCS. H. insolens Control Catalase Glucose conversion (%) 70.9 .+-. 1.4 80.1 .+-. 1.2
Example 5: Synergetic Effect of Thermoascus aurantiacus Catalase and Thermoascus aurantiacus GH61A on Hydrolysis of PCS
[0346] PCS was prepared according to the procedure as described in Example 1, and hydrolyzed at initial TS of 10% and total weight of hydrolysis system of 20 g. The pH was adjusted to 5.0 using 10 M sodium hydroxide. Trichoderma reesei cellulase composition (CELLUCLAST.RTM. in the presence of 10% of total protein weight Aspergillus fumigatus beta-glucosidase (WO 2005/047499), available from Novozymes A/S, Bagsvaerd, Denmark) was added into PCS for enzymatic hydrolysis with a ratio of the Trichoderma reesei cellulase composition to cellulose of 0.8% (w/w). Catalase from Thermoascus aurantiacus, GH61A polypeptide from Thermoascus aurantiacus (WO 2005/074656), or the combination thereof were added into the hydrolysis system, respectively. The dosage of catalase, GH61A polypeptide, or the combination thereof was calculated based on the weight of cellulose. The hydrolysis system with Trichoderma reesei cellulase composition but without catalase and GH61 was used as a control. The flasks were incubated at 50.degree. C. for 72 hours, with shaking at 130 rpm. All experiments were performed in triplicates. HPLC analysis of the extent of hydrolysis was performed according to the procedure described in Example 1. The conversion of PCS to glucose after 72-hour hydrolysis was shown in Table 5 below.
TABLE-US-00005 TABLE 5 Synergetic effect of Thermoascus aurantiacus catalase and Thermoascus aurantiacus GH61A on hydrolysis of PCS Catalase alone GH61A alone Catalase + GH61A (.mu.g/g cellulose) (.mu.g/g cellulose) (.mu.g/g cellulose) Control 80 160 80 160 80 + 80 160 + 160 Glucose conversion (%) 66.9 .+-. 1.1 75.9 .+-. 0.0 82.8 .+-. 0.4 70.6 .+-. 0.2 71.8 .+-. 0.2 88.7 .+-. 3.0 93.2 .+-. 0.8
[0347] As shown in Table 5, catalase or GH61A polypeptide alone boosted the hydrolysis of PCS. It was surprisingly found that, when catalase and GH61A polypeptide were used simultaneously, the hydrolysis was improved significantly. The results indicated that catalase and GH61A polypeptide have a significant synergistic effect on the hydrolysis of PCS.
Example 6: Boosting Effect of Penicillium emersonii Catalase on Hydrolysis of PCS
[0348] Preparation of PCS and set-up of hydrolysis system were the same as example 1. Catalase from P. emersonii was cloned, expressed and purified as shown in Examples 21-27.
[0349] PCS was hydrolyzed at an initial TS of 10% and total weight of hydrolysis system of 20 g. Trichoderma reesei cellulase composition (CELLIC.RTM. CTec2 available from Novozymes A/S, Bagsvaerd, Denmark) was added into PCS for enzymatic hydrolysis. Five percent by weight of Ctec2 was replaced with P. emersonii catalase based on protein amount and the total enzyme dose was 4 mg/g cellulose. The hydrolysis system with Trichoderma reesei cellulase composition but without catalase was used as a control. The flasks were incubated at 50.degree. C. for 72 hours, with shaking at 130 rpm.
[0350] The calculation of glucose conversion was the same as example 1 and the boosting effect was shown in table 6.
TABLE-US-00006 TABLE 6 Effect of catalase from P. emersonii on glucose conversion of PCS. P. emersonii Control Catalase Glucose conversion (%) 48.6 .+-. 0.7 54.3 .+-. 0.8
Example 7: Boosting Effect of Thermoascus aurantiacus Catalase on Hydrolysis of PCS in Relatively High TS
[0351] Preparation of PCS and set-up of hydrolysis system were the same as example 1. PCS was hydrolyzed at an initial TS of 20% and total weight of hydrolysis system of 20 g. Trichoderma reesei cellulase composition (CELLIC.RTM. CTec2 available from Novozymes A/S, Bagsvaerd, Denmark) was added into PCS for enzymatic hydrolysis. Five percent by weight of Ctec2 was replaced by T. aurantiacus catalase based on protein amount and the total enzyme dose was 7 mg/g cellulose. The hydrolysis system with Trichoderma reesei cellulase composition but without catalase was used as a control. The flasks were incubated at 50.degree. C. for 72 hours, with shaking at 130 rpm. The calculation of glucose conversion was the same as example 1 and the boosting effect of catalases was shown in table 7.
TABLE-US-00007 TABLE 7 Effect of catalase from Thermoascus aurantiacus on glucose conversion of PCS. 5% T. aurantiacus Control catalase replacement Glucose conversion (%) 58.6 .+-. 1.4 64.8 .+-. 0.8
Example 8: Boosting Effect of Thermoascus aurantiacus Catalase on Hydrolysis of PCS in Relatively High TS
[0352] Preparation of PCS and set-up of hydrolysis system were the same as example 1. PCS was hydrolyzed at an initial TS of 20% and total weight of hydrolysis system of 20 g. Trichoderma reesei cellulase composition (CELLIC.RTM. CTec3 available from Novozymes A/S, Bagsvaerd, Denmark) was utilized for enzymatic hydrolysis. Five percent by weight of Ctec3 was replaced by T. aurantiacus catalase based on protein amount and the total enzyme dose was 6 mg/g cellulose. The hydrolysis system with Trichoderma reesei cellulase composition but without catalase was used as a control. The flasks were incubated at 50.degree. C. for 72 hours, with shaking at 130 rpm.
[0353] The calculation of glucose conversion was the same as example 1 and the boosting effect of catalases was shown in table 8.
TABLE-US-00008 TABLE 8 Effect of catalase from Thermoascus aurantiacus on glucose conversion of PCS. 5% T. aurantiacus Control catalase replacement Glucose conversion (%) 72.0.+-. 80.5 .+-. 0.8
Example 9: Talaromyces stipitatus Genomic DNA Extraction
[0354] Talaromyces stipitatus strain NN70 was grown in PDA agar plate at 45.degree. C. for 3 days. Mycelia were collected directly from the agar plate into a sterilized mortar and frozen under liquid nitrogen. Frozen mycelia were ground, by mortar and pestle, to a fine powder, and genomic DNA was isolated using a DNeasy.RTM. Plant Mini Kit (QIAGEN Inc., Valencia, Calif., USA).
Example 10: Cloning of the Talaromyces stipitatus Catalase Gene from Genomic DNA
[0355] Based on the DNA information of European Molecular Biology Laboratory (EMBL):EQ962660 (i.e., SEQ ID NO: 3) and protein sequence SWISSPROT:B8MT74 (i.e., SEQ ID NO: 4), oligonucleotide primers, shown in below, were designed to amplify the catalase gene from the genomic DNA of Talaromyces stipitatus NN70. Primers were fabricated by Invitrogen (Invitrogen, Beijing, China).
TABLE-US-00009 Forward primer: (SEQ ID NO: 38) 5' ACACAACTGGGGATCC ACC atgcgaggggcatactctctc 3' Reverse primer: (SEQ ID NO: 39) 5' GTCACCCTCTAGATCT aacaagttactcgtgttaatcgtggaa 3'
[0356] Lowercase characters represent the sequences of the gene, while capitalized parts were homologous to the insertion sites of pPFJO355 vector which has been described in US2010306879.
[0357] The expression vector pPFJO355 contains the TAKA-amylase promoter derived from Aspergillus oryzae and the Aspergillus niger glucoamylase terminator elements. Furthermore pPFJO355 has pUC18 derived sequences for selection and propagation in E. coli, and a pyrG gene, which encodes an orotidine decarboxylase derived from Aspergillus nidulans for selection of a transformant of a pyrG mutant Aspergillus strain.
[0358] Twenty picomoles of primer pair (forward and reverse) were used in a PCR reaction composed of 2 .mu.l of Talaromyces stipitatus NN70 genomic DNA, 10 .mu.l of 5.times.GC Buffer, 1.5 .mu.l of DMSO, 2.5 mM each of dATP, dTTP, dGTP, and dCTP, and 0.6 unit of PHUSION.TM. High-Fidelity DNA Polymerase (Finnzymes Oy, Espoo, Finland) in a final volume of 50 .mu.l. The amplification was performed using a Peltier Thermal Cycler (MJ Research Inc., South San Francisco, Calif., USA) programmed for denaturing at 98.degree. C. for 40 seconds; 8 cycles of denaturing at 98.degree. C. for 15 seconds, annealing at 70.degree. C. for 30 seconds, with 1.degree. C. decrease per cycle and elongation at 72.degree. C. for 80 seconds; and another 23 cycles each at 98.degree. C. for 15 seconds, 62.degree. C. for 30 seconds and 72.degree. C. for 80 seconds; final extension at 72.degree. C. for 5 minutes. The heat block then went to a 4.degree. C. soak cycle.
[0359] The PCR reaction products were isolated by 1.0% agarose gel electrophoresis using 90 mM Tris-borate and 1 mM EDTA (TBE) buffer where a single product band of the expected size, approximate 2.4 kb, was visualized under UV light, and then purified from solution using an ILLUSTRA.TM. GFX.TM. PCR DNA and Gel Band Purification Kit (GE Healthcare, Buckinghamshire, UK) according to the manufacturer's instructions.
[0360] Plasmid pPFJO355 was digested with Bam HI and Bgl II, isolated by 1.0% agarose gel electrophoresis using TBE buffer, and purified using an ILLUSTRA.TM. GFXT.TM. PCR DNA and Gel Band Purification Kit according to the manufacturer's instructions.
[0361] An IN-FUSION.TM. CF Dry-down Cloning Kit (Clontech Laboratories, Inc., Mountain View, Calif., USA) was used to clone the fragment directly into the expression vector pPFJO355, without the need for restriction digestion and ligation.
[0362] The PCR reaction product and the digested vector were ligated together using an IN-FUSION.TM. CF Dry-down PCR Cloning resulting in plasmid plasmid pTs in which the transcription of Talaromyces stipitatus catalase gene was under the control of a promoter from the gene for Aspergillus oryzae alpha-amylase. The cloning operation was conducted according to the manufacturer's instruction. In brief, 30 ng of pPFJO355 digested with Bam HI and Bgl II, and 60 ng of the purified Talaromyces stipitatus catalase PCR reaction products were added to the reaction vial and resuspended the powder in a final volume of 10 .mu.l with addition of deionized water. The reaction was incubated at 37.degree. C. for 15 minutes and then 50.degree. C. for 15 minutes. Three .mu.l of the reaction products were used to transform E. coli TOP10 competent cells (TIANGEN Biotech (Beijing) Co. Ltd., Beijing, China). E. coli transformants containing expression constructs were detected by colony PCR which is a method for quick screening of plasmid inserts directly from E. coli colonies. Briefly, in the premixed PCR solution aliquot in each PCR tube, including PCR buffer, MgCl.sub.2, dNTP and primer pairs for which the PCR fragment generated, a single colony was added by picking up with a sterile tip and twirling the tip in the reaction solution. Normaly 7-10 colonies were screened. After the PCR program, reactions were checked on agarose gel. The colony giving the amplification of expected size was possibly to contain the correct insert. The plasmid DNA was prepared using a QlAprep Spin Miniprep Kit (QIAGEN Inc., Valencia, Calif., USA). The Talaromyces stipitatus catalase gene inserted in plasmid pTs was confirmed by DNA sequencing using a 3730XL DNA Analyzers (Applied Biosystems Inc, Foster City, Calif., USA).
Example 11: Expression of Talaromyces stipitatus Catalase Gene in Aspergillus oryzae
[0363] Aspergillus oryzae HowB101 (described in patent WO 9535385 example 1) protoplasts were prepared according to the method of Christensen et al., 1988, Bio/Technology 6: 1419-1422. Three .mu.g of plasmid pTs were used to transform Aspergillus oryzae HowB101.
[0364] The transformation of Aspergillus oryzae HowB101 with plasmid pTs yielded about 50 transformants for each transformation. Eight transformants were isolated to individual Minimal medium plates.
[0365] Four transformants from each transformation were inoculated separately into 3 ml of YPM in 24-well plate and incubated at 30.degree. C., 150 rpm. After 3 days incubation, 20 .mu.l of supernatant from each culture were analyzed on NuPAGE Novex 4-12% Bis-Tris Gel with 2-(N-morpholino) ethanesulfonic acid (MES) (Invitrogen Corporation, Carlsbad, Calif., USA) according to the manufacturer's instructions. The resulting gel was stained with INSTANT BLUE.TM. (Expedeon Ltd., Babraham Cambridge, UK). SDS-PAGE profiles of the cultures showed expression with protein bands detected. The size of major band of the gene was around 92 KD. The expression strain was designated as EXP84.
Example 12: Fermentation of the Expression Strain EXP84
[0366] A slant of the expression strain, EXP84, was washed with 10 ml of YPM and inoculated into eight 2-liter flasks containing 400 ml of YPM medium to generate broth for characterization of the enzyme. The culture was harvested on day 3 and filtered using a 0.45 .mu.m DURAPORE Membrane (Millipore, Bedford, Mass., USA).
Example 13: Purification of Recombinant Talaromyces stipitatus Catalase from Aspergillus oryzae EXP84
[0367] 3200 ml supernatant of the recombinant strain EXP84 was precipitated with ammonium sulfate (80% saturation) and re-dissolved in 50 ml 20 mM Tris-HCl buffer, pH 7.5, then dialyzed against the same buffer and filtered through a 0.45 mm filter, the final volume was 100 ml. The solution was applied to a 40 ml Q SEPHAROSE.RTM. Fast Flow column (GE Healthcare, Buckinghamshire, UK) equilibrated in 20 mM Tris-HCl buffer, pH 7.5. Fractions eluted with 0.08-0.2M NaCl were collected and further purified on a 40 ml Q SEPHAROSE.RTM. Fast Flow column (GE
[0368] Healthcare, Buckinghamshire, UK) with a linear NaCl gradient (0.14-0.2M). Fractions were evaluated by SDS-PAGE (NP0336BOX, NUPAGE 4-12% BT GEL 1.5MM 15WW). Fractions containing a band of approximately 92 kDa were pooled. Then the pooled solution was concentrated by ultrafiltration.
Example 14: Humicola insolens Genomic DNA Extraction
[0369] Humicola insolens strain NN38 was inoculated onto a PDA plate and incubated for 3 days at 45.degree. C. in the darkness. Several mycelia-PDA plugs were inoculated into 500 ml shake flasks containing 100 ml of YPG medium. The flasks were incubated for 3 days at 45.degree. C. with shaking at 160 rpm. The mycelia were collected by filtration through MIRACLOTH.RTM. (Calbiochem, La Jolla, Calif., USA) and frozen in liquid nitrogen. Frozen mycelia were ground, by a mortar and a pestle, to a fine powder, and genomic DNA was isolated using DNeasy.RTM. Plant Maxi Kit (QIAGEN Inc., Valencia, Calif., USA) following the manufacturer's instruction.
Example 15: Genome Sequencing, Assembly and Annotation of Humicola insolens Strain NN38
[0370] The extracted genomic DNA samples were delivered to Beijing Genome Institute (BGI, Shenzhen, China) for genome sequencing using ILLUMINA.RTM. GA2 System (Illumina, Inc., San Diego, Calif., USA). The raw reads were assembled at BGI using program SOAPdenovo (Li et al., 2010, Genome Research 20(2): 265-72). The assembled sequences were analyzed using standard bioinformatics methods for gene finding and functional prediction. Briefly, geneID (Parra et al., 2000, Genome Research 10(4):511-515) was used for gene prediction. Blastall version 2.2.10 ((Altschul et al., 1990, J. Mol. Biol. 215 (3): 403-410; National Center for Biotechnology Information (NCBI), Bethesda, Md., USA) and HMMER version 2.1.1 (National Center for Biotechnology Information (NCBI), Bethesda, Md., USA) were used to predict function based on structural homology. The catalase gene, (SEQ ID NO: 5 for DNA sequence, SEQ ID NO: 6 for protein sequence), was identified directly by analysis of the Blast results. The Agene program (Munch and Krogh, 2006, BMC Bioinformatics 7:263) and SignalP program (Nielsen et al., 1997, Protein Engineering 10: 1-6) were used to identify starting codons. SignalP was further used to predict the signal peptide. Pepstats (European Bioinformatics Institute, Hinxton, Cambridge CB10 1SD, UK) was used to estimate isoelectric point of proteins, and molecular weight.
Example 16: Cloning of the Humicola insolens Catalase Gene from Genomic DNA
[0371] Based on the DNA information of the Humicola insolens catalase, oligonucleotide primers, shown in below, were designed to amplify the catalase gene from the genomic DNA of Humicola insolens NN38. Primers were fabricated by Invitrogen (Invitrogen, Beijing, China).
TABLE-US-00010 Forward primer: (SEQ ID NO: 40) 5' ACACAACTGGGGATCC ACC atgaacagagtcacgaat ctcctcg 3' Reverse primer: (SEQ ID NO: 41) 5' GTCACCCTCTAGATCT ggtacaactcccaccctattcc ttctc 3'
[0372] Lowercase characters represent the sequences of the gene in the forward primer and the flanking region of the 3' end of the gene in the reverse primer, while capitalized parts were homologous to the insertion sites of pPFJO355 vector which has been described in US2010306879.
[0373] The expression vector pPFJO355 contains the TAKA-amylase promoter derived from Aspergillus oryzae and the Aspergillus niger glucoamylase terminator elements. Furthermore pPFJO355 has pUC18 derived sequences for selection and propagation in E. coli, and a pyrG gene, which encodes an orotidine decarboxylase derived from Aspergillus nidulans for selection of a transformant of a pyrG mutant Aspergillus strain.
[0374] Twenty picomoles of primer pair (forward and reverse) were used in a PCR reaction composed of 2 82 l of Humicola insolens NN38 genomic DNA, 10 .mu.l of 5.times. GC Buffer, 1.5 .mu.l of DMSO, 2.5 mM each of dATP, dTTP, dGTP, and dCTP, and 0.6 unit of PHUSION.TM. High-Fidelity DNA Polymerase (Finnzymes Oy, Espoo, Finland) in a final volume of 50 .mu.l. The amplification was performed using a Peltier Thermal Cycler (MJ Research Inc., South San Francisco, Calif., USA) programmed for denaturing at 98.degree. C. for 1 minute; 6 cycles of denaturing at 98.degree. C. for 15 seconds, annealing at 63.degree. C. for 30 seconds, with 1.degree. C. decrease per cycle and elongation at 72.degree. C. for 3 minutes; and another 22 cycles each at 98.degree. C. for 15 seconds, 62.degree. C. for 30 seconds and 72.degree. C. for 3 minutes; final extension at 72.degree. C. for 7 minutes. The heat block then went to a 4.degree. C. soak cycle.
[0375] The PCR reaction products were isolated by 1.0% agarose gel electrophoresis using 90 mM Tris-borate and 1 mM EDTA (TBE) buffer where a single product band of the expected size, approximate 3.1 kb, was visualized under UV light and then purified from solution using an ILLUSTRA.TM. GFX.TM. PCR DNA and Gel Band Purification Kit (GE Healthcare, Buckinghamshire, UK) according to the manufacturer's instructions.
[0376] Plasmid pPFJO355 was digested with Bam HI and Bgl II, isolated by 1.0% agarose gel electrophoresis using TBE buffer, and purified using an ILLUSTRA.TM. GFX.TM. PCR DNA and Gel Band Purification Kit according to the manufacturer's instructions.
[0377] An IN-FUSION.TM. CF Dry-down Cloning Kit (Clontech Laboratories, Inc., Mountain View, Calif., USA) was used to clone the fragment directly into the expression vector pPFJO355, without the need for restriction digestion and ligation.
[0378] The PCR reaction product and the digested vector were ligated together using an IN-FUSION.TM. CF Dry-down PCR Cloning resulting in plasmid pHi in which the transcription of Humicola insolens catalase gene was under the control of a promoter from the gene for Aspergillus oryzae alpha-amylase. The cloning operation was conducted according to the manufacturer's instruction. In brief, 30 ng of pPFJO355 digested with Bam HI and Bgl II, and 60 ng of the purified Humicola insolens catalase PCR product were added to the reaction vial and resuspended the powder in a final volume of 10 .mu.l with addition of deionized water. The reaction was incubated at 37.degree. C. for 15 minutes and then 50.degree. C. for 15 minutes. Three .mu.l of the reaction products were used to transform E. coli TOP10 competent cells (TIANGEN Biotech (Beijing) Co. Ltd., Beijing, China). E. coli transformants containing expression constructs were detected by colony PCR which is a method for quick screening of plasmid inserts directly from E. coli colonies. Briefly, in the premixed PCR solution aliquot in each PCR tube, including PCR buffer, MgCl.sub.2, dNTP and primer pairs for which the PCR fragment generated, a single colony was added by picking up with a sterile tip and twirling the tip in the reaction solution. Normaly 7-10 colonies were screened. After the PCR program, reactions were checked on agarose gel. The colony giving the amplification of expected size was possibly to contain the correct insert. The plasmid DNA was prepared using a QIAprep Spin Miniprep Kit (QIAGEN Inc., Valencia, Calif., USA). The Humicola insolens catalase gene inserted in plasmid pHi was confirmed by DNA sequencing using a 3730XL DNA Analyzers (Applied Biosystems Inc, Foster City, Calif., USA).
Example 17: Expression of Humicola insolens Catalase Gene in Aspergillus oryzae
[0379] Aspergillus oryzae HowB101 (described in patent WO9535385 example 1) protoplasts were prepared according to the method of Christensen et al., 1988, Bio/Technology 6: 1419-1422. Three .mu.g of plasmid pHi were used to transform Aspergillus oryzae HowB101.
[0380] The transformation of Aspergillus oryzae HowB101 with plasmid pHi yielded about 50 transformants for each transformation. Eight transformants were isolated to individual Minimal medium plates.
[0381] Four transformants from each transformation were inoculated separately into 3 ml of YPM medium in 24-well plate and incubated at 30.degree. C., 150 rpm. After 3 days incubation, 20 .mu.l of supernatant from each culture were analyzed on NuPAGE Novex 4-12% Bis-Tris Gel w/MES (Invitrogen Corporation, Carlsbad, Calif., USA) according to the manufacturer's instructions. The resulting gel was stained with INSTANT BLUE.TM. (Expedeon Ltd., Babraham Cambridge, UK). SDS-PAGE profiles of the cultures showed expression with protein bands detected. The size of major band of the gene was around 80 KD. The expression strain was designated as O5.
Example 18: Fermentation of Expression Strain O5
[0382] A slant of the expression strain, O5, was washed with 10 ml of YPM and inoculated into twelve 2-liter flasks each containing 400 ml of YPM medium to generate broth for characterization of the enzyme. The culture was harvested on day 3 and filtered using a 0.45 pm DURAPORE Membrane (Millipore, Bedford, Mass., USA).
Example 19: Purification of Recombinant Humicola insolens Catalase from Aspergillus oryzae O5
[0383] 4000 ml supernatant of the recombinant strain O5 was precipitated with ammonium sulfate (80% saturation) and re-dissolved in 50 ml 20 mM Bis-Tris buffer, pH 6.0, then dialyzed against the same buffer and filtered through a 0.45 mm filter, the final volume was 140 ml. The solution was applied to a 40 ml Q SEPHAROSE.RTM. Fast Flow column (GE Healthcare, Buckinghamshire, UK) equilibrated in 20 mM Bis-Tris buffer, pH 6.0, and the proteins was eluted with a linear NaCl gradient (0-0.25M). Fractions eluted with 0.2-0.5M NaCl were collected and further purified on a 40 ml Q SEPHAROSE.RTM. Fast Flow column (GE Healthcare, Buckinghamshire, UK) equilibrated in 20 mM Bis-Tris buffer, pH 6.0, and the proteins was eluted with a linear NaCl gradient (0.2-0.5M). Fractions were evaluated by SDS-PAGE (NP0336BOX, NUPAGE 4-12% BT GEL 1.5MM15W). Fractions containing a band of approximately 80 kDa were pooled. Then the pooled solution was concentrated by ultrafiltration.
[0384] The mature polypeptide of the Humicola insolens catalase shares 99.25% identity with the mature polypeptide of Humicola grisea thermotolerant catalase protein (WO2009104622-A1).
Example 20: Catalase Activity Assay
[0385] The purified Humicola insolens catalase was checked for catalase activity by using the following protocol.
[0386] The substrate was prepared by 1000 times dilution of 30% H.sub.2O.sub.2 (from Xilong Chemical, Guangdong, China) with double distilled H.sub.2O (ddH.sub.2O), the final concentration was 10.3 mM. The reaction was started by adding 1 .mu.l of purified Humicola insolens catalase sample into 1000 .mu.l of substrate. The optical density (OD) at 240 nm was read by Ultrospec 3300 (GE Healthcare, Buckinghamshire, UK) at second of 0 and 16 respectively, and the decrease of the OD (from 0.400 to 0.102) showed the relative activity of the Humicola insolens catalase.
Example 21: Penicillium emersonii Genomic DNA Extraction
[0387] Penicillium emersonii strain NN051602 was inoculated onto a PDA plate and incubated for 3 days at 45.degree. C. in the darkness. Several mycelia-PDA plugs were inoculated into 500 ml shake flasks containing 100 ml of YPG medium. The flasks were incubated for 3 days at 45.degree. C. with shaking at 160 rpm. The mycelia were collected by filtration through MIRACLOTH.RTM. (Calbiochem, La Jolla, Calif., USA) and frozen in liquid nitrogen. Frozen mycelia were ground, by a mortar and a pestle, to a fine powder, and genomic DNA was isolated using a Large-Scale Column Fungal DNAout (Baoman Biotechnology, Shanghai, China) according to the manufacturer's instructions.
Example 22: Genome Sequencing, Assembly and Annotation
[0388] The extracted genomic DNA samples were delivered to Beijing Genome Institute (BGI, Shenzhen, China) for genome sequencing using ILLUMINA.RTM. GA2 System (Illumina, Inc., San Diego, Calif., USA). The raw reads were assembled at BGI using program SOAPdenovo (Li et al., 2010, Genome Res 20: 265-72). The assembled sequences were analyzed using standard bioinformatics methods for gene identification and functional prediction. Briefly, geneID (Parra et al., 2000, Genome Research 10(4):511-515) was used for gene prediction. Blastall version 2.2.10 (Altschul et al., 1990, J. Mol. Biol. 215 (3): 403-410, http://blast.ncbi.nlm.nih.gov/Blast.cgi) and HMMER version 2.1.1 (National Center for Biotechnology Information (NCBI), Bethesda, Md., USA, http://hmmer.janelia.org) were used to predict function based on structural homology. The catalase was identified by analysis of the Blast results. The Agene program (Munch and Krogh, 2006, BMC Bioinformatics 7:263) and SignalP program (Nielsen et al., 1997, Protein Engineering 10: 1-6) were used to identify start codons. SignalP was further used to predict the signal peptide. Pepstats (Rice et al., 2000, Trends Genet. 16(6): 276-277) was used to estimate isoelectric point of proteins, and molecular weight.
Example 23: Cloning of the Penicillium emersonii Catalase from Genomic DNA
[0389] One catalase gene, PE04230007241 (SEQ ID NO: 7), was selected for expression cloning.
[0390] Based on the gene information obtained by genome sequencing, oligonucleotide primers as shown below, were designed to amplify the catalase gene, PE04230007241, from the genomic DNA of Penicillium emersonii. Primers were fabricated by Invitrogen (Invitrogen, Beijing, China).
TABLE-US-00011 Forward 5' ACACAACTGGGGATCC ACC SEQ ID NO: 42 primer atgcgcgcagtgcagct 3' Reverse 5' GTCACCCTCTAGATCT SEQ ID NO: 43 primer gtcgactattccaaccttcct atatggacac 3'
[0391] Lowercase characters of the forward primer represent the coding regions of the gene and lowercase characters of the reverse primer represent the flanking region of the gene, while capitalized parts were homologous to the insertion sites of pPFJO355 vector which has been described in US2010306879.
[0392] An IN-FUSION.TM. CF Dry-down Cloning Kit (Clontech Laboratories, Inc., Mountain View, Calif., USA) was used to clone the fragment directly into the expression vector pPFJO355 which has been described in US2010306879, without the need for restriction digestion and ligation.
[0393] The expression vector pPFJO355 contains the TAKA-amylase promoter derived from Aspergillus oryzae and the Aspergillus niger glucoamylase terminator elements. Furthermore pPFJO355 has pUC18 derived sequences for selection and propagation in E. coli, and a pyrG gene, which encodes an orotidine decarboxylase derived from Aspergillus nidulans for selection of a transformant of a pyrG mutant Aspergillus strain.
[0394] Twenty picomoles of each of the primers above were used in a PCR reaction composed of 2 .mu.l of Penicillium emersonii genomic DNA, 10 .mu.l of 5.times.GC Buffer, 1.5 .mu.l of DMSO, 2.5 mM each of dATP, dTTP, dGTP, and dCTP, and 0.6 unit of PHUSION.TM. High-Fidelity DNA Polymerase (Finnzymes Oy, Espoo, Finland) in a final volume of 50 .mu.l. The amplification was performed using a Peltier Thermal Cycler (MJ Research Inc., South San Francisco, Calif., USA) programmed for denaturing at 98.degree. C. for 1 minutes; 8 cycles of denaturing at 98.degree. C. for 15 seconds, annealing at 65.degree. C. for 30 seconds, with 1.degree. C. decrease per cycle and elongation at 72.degree. C. for 3 minute 15 second; and another 22 cycles each at 98.degree. C. for 15 seconds, 58C for 30 seconds and 72.degree. C. for 3 minute 15 second; final extension at 72.degree. C. for 10 minutes. The heat block then went to a 4.degree. C. soak cycle.
[0395] The reaction product was isolated by 1.0% agarose gel electrophoresis using 90 mM Tris-borate and 1 mM EDTA (TBE) buffer where an approximate 2.5 kb product band was excised from the gel, and purified using an ILLUSTRA.TM. GFX.TM. PCR DNA and Gel Band Purification Kit (GE Healthcare, Buckinghamshire, UK) according to the manufacturer's instructions.
[0396] Plasmid pPFJO355 was digested with Bam HI and Bgl II, isolated by 1.0% agarose gel electrophoresis using TBE buffer, and purified using an ILLUSTRA.TM. GFX.TM. PCR DNA and Gel Band Purification Kit according to the manufacturer's instructions.
[0397] The PCR product and the digested vector were ligated together using an IN-FUSION.TM. CF Dry-down PCR Cloning resulting in pCat_PE04230007241 in which the transcription of the Penicillium emersonii catalase gene was under the control of a promoter from the gene for Aspergillus oryzae alpha-amylase. The cloning operation was conducted according to the manufacturer's instruction. In brief, 30 ng of pPFJO355 digested with Bam HI and Bgl II, and 60 ng of the purified Penicillium emersonii catalase gene PCR product were added to the reaction vial and resuspended the powder in a final volume of 10 .mu.l with addition of deionized water. The reaction was incubated at 37.degree. C. for 15 minutes and then 50.degree. C. for 15 minutes. Three .mu.l of the reaction products were used to transform E. coli TOP10 competent cells (TIANGEN Biotech (Beijing) Co. Ltd., Beijing, China). An E. coli transformant containing pCat_PE04230007241 was detected by colony PCR. Colony PCR is a method for quick screening of plasmid inserts directly from E. coli colonies. Briefly, in the premixed PCR solution aliquot in each PCR tube, including PCR buffer, MgCl.sub.2, dNTPs, and primer pairs from which the PCR fragment was generated, a single colony was added by picking with a sterile tip and twirling the tip in the reaction solution. Normally 7-10 colonies were screened. After the PCR, reactions were analyzed by 1.0% agarose gel electrophoresis using TBE buffer. The plasmid DNA was prepared using a QIAprep Spin Miniprep Kit (QIAGEN Inc., Valencia, Calif., USA). The Penicillium emersonii catalase gene inserted in pCat_PE04230007241 was confirmed by DNA sequencing using a 3730XL DNA Analyzer (Applied Biosystems Inc, Foster City, Calif., USA).
Example 24: Expression of Penicillium emersonii Catalase Gene in Aspergillus oryzae
[0398] Aspergillus oryzae HowB101 (described in patent WO9535385 example 1) protoplasts were prepared according to the method of Christensen et al., 1988, Bio/Technology 6: 1419-1422. Three .mu.g of pCat_PE04230007241 were used to transform Aspergillus oryzae HowB101.
[0399] The transformation of Aspergillus oryzae HowB101 with pCat_PE04230007241 yielded about 50 transformants. Four transformants were isolated to individual Minimal medium plates.
[0400] Four transformants were inoculated separately into 3 ml of YPM medium in 24-well plate and incubated at 30.degree. C., 150 rpm. After 3 days incubation, 20 .mu.l of supernatant from each culture were analyzed on NuPAGE Novex 4-12% Bis-Tris Gel with 2-(N-morpholino)ethanesulfonic acid (MES) (Invitrogen Corporation, Carlsbad, Calif., USA) according to the manufacturer's instructions. The resulting gel was stained with INSTANT BLUE.TM. (Expedeon Ltd., Babraham Cambridge, UK). SDS-PAGE profiles of the cultures showed that all transformants had a band of approximately 80 kDa. The expression strain was designated as O6YTS.
Example 25: Fermentation of Aspergillus oryzae Expression Strain O6YTS
[0401] A slant of the expression strain, O6YTS, was washed with 10 ml of YPM and inoculated into 7 2-liter flasks containing 400 ml of YPM medium to generate broth for characterization of the enzyme. The culture was harvested on day 3 and filtered using a 0.45 pm DURAPORE Membrane (Millipore, Bedford, Mass., USA).
Example 26: Purification of Recombinant Penicillium emersonii Catalase from Aspergillus oryzae O6YTS
[0402] 2800 ml supernatant of the recombinant strain O6YTS was precipitated with ammonium sulfate (80% saturation) and re-dissolved in 50 ml 20 mM Tris-HCl buffer, pH 8.0, then dialyzed against the same buffer and filtered through a 0.45 mm filter, the final volume was 80 ml. The solution was applied to a 40 ml Q SEPHAROSE.RTM. Fast Flow column (GE Healthcare, Buckinghamshire, UK) equilibrated in 20 mM Tris-HCl buffer, pH 8.0. Fractions eluted with 0.18-0.25M NaCl were evaluated by SDS-PAGE (NP0336BOX, NUPAGE 4-12% BT GEL 1.5MM15W). Fractions containing a band of approximately 80 kDa were pooled. Then the pooled solution was concentrated by ultrafiltration.
Example 27: Catalase Activity Assay
[0403] The purified Penicillium emersonii catalase was checked for catalase activity by using the following protocol.
[0404] The substrate was prepared by 1000 times dilution of 30% H.sub.2O.sub.2 (from Xilong Chemical, Guangdong, China) with double distilled H.sub.2O (ddH.sub.2O), the final concentration was 10.3 mM. The reaction was started by adding 1 .mu.l of purified Penicillium emersonii catalase sample into 1000 .mu.l of substrate. The optical density (OD) at 240 nm was read by Ultrospec 3300 (GE Healthcare, Buckinghamshire, UK) at second of 0 and 16 respectively, and the decrease of the OD (from 0.505 to 0.284) showed the relative activity of the Penicillium emersonii catalase.
[0405] The present invention is further described by the following numbered paragraphs:
[0406] [1] A method for degrading or converting a cellulosic material, comprising: treating the cellulosic material with an enzyme composition in the presence of a polypeptide having catalase activity.
[0407] [2] The method of paragraph 1, wherein the enzyme composition comprises one or more (e.g., several) enzymes selected from the group consisting of a cellulase, a GH61 polypeptide having cellulolytic enhancing activity, a hemicellulase, an esterase, an expansin, a laccase, a ligninolytic enzyme, a pectinase, a peroxidase, a protease, and a swollenin.
[0408] [3] The method of paragraph 2, wherein the cellulase is one or more (e.g., several) enzymes selected from the group consisting of an endoglucanase, a cellobiohydrolase, and a beta-glucosidase.
[0409] [4] The method of paragraph 2, wherein the hemicellulase is one or more enzymes selected from the group consisting of a xylanase, an acetylxylan esterase, a feruloyl esterase, an arabinofuranosidase, a xylosidase, and a glucuronidase.
[0410] [5] The method of any of paragraphs 1-4, wherein the cellulosic material is selected from the group consisting of agricultural residue, herbaceous material, municipal solid waste, pulp and paper mill residue, waste paper, and wood; preferably, arundo, bagasse, bamboo, corn cob, corn fiber, corn stover, miscanthus, orange peel, rice straw, switchgrass, wheat straw, eucalyptus, fir, pine, poplar, spruce, willow, algal cellulose, bacterial cellulose, cotton linter, filter paper, microcrystalline cellulose, or phosphoric-acid treated cellulose.
[0411] [6] The method of any of paragraphs 1-5, wherein the cellulosic material is pretreated, especially by chemical pretreatment, physical pretreatment, or biochemical pretreatment.
[0412] [7] The method of any of paragraphs 1-6, further comprising recovering the degraded cellulosic material.
[0413] [8] The method of paragraph 7, wherein the degraded cellulosic material is a sugar.
[0414] [9] The method of paragraph 8, wherein the sugar is selected from the group consisting of glucose, xylose, mannose, galactose, and arabinose.
[0415] [10] The method of any of paragraphs 1-9, wherein the presence of the polypeptide having catalase activity increases the hydrolysis of the cellulosic material compared to the absence of the polypeptide having catalase activity.
[0416] [11]. The method of any of paragraphs 1-10, wherein the polypeptide having catalase activity is selected from the group consisting of:
[0417] (a) a polypeptide having at least 60% sequence identity to the mature polypeptide of SEQ ID NO: 2, the mature polypeptide of SEQ ID NO: 4, the mature polypeptide of SEQ ID NO: 6, or the mature polypeptide of SEQ ID NO: 8;
[0418] (b) a polypeptide encoded by a polynucleotide that hybridizes under low, medium, medium-high, high, or very high stringency conditions with (i) the mature polypeptide coding sequence of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID NO: 7, (ii) the cDNA sequence thereof, or (iii) the full-length complement of (i) or (ii);
[0419] (c) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the mature polypeptide coding sequence of SEQ ID NO: 1, the mature polypeptide coding sequence of SEQ ID NO: 3, the mature polypeptide coding sequence of SEQ ID NO: 5, or the mature polypeptide coding sequence of SEQ ID NO: 7; or the cDNA sequence thereof;
[0420] (d) a variant of the mature polypeptide of SEQ ID NO: 2, a variant of the mature polypeptide of SEQ ID NO: 4, a variant of the mature polypeptide of SEQ ID NO: 6, or a variant of the mature polypeptide of SEQ ID NO: 8, comprising a substitution, deletion, and/or insertion at one or more (e.g., several) positions; and
[0421] (e) a fragment of the polypeptide of (a), (b), (c), or (d) that has catalase activity.
[0422] [12] The method of any of paragraphs 1-11, wherein the polypeptide having catalase activity is a catalase from Thermoascus, Talaromyces, Humicola, or Penicillium.
[0423] [13] The method of paragraph 12, wherein the polypeptide having catalase activity is a catalase from Thermoascus aurantiacus, Talaromyces stipitatus, Humicola insolens, or Penicillium emersonii.
[0424] [14] A method for producing a fermentation product, comprising:
[0425] (a) saccharifying a cellulosic material with an enzyme composition in the presence of a polypeptide having catalase activity;
[0426] (b) fermenting the saccharified cellulosic material with one or more (e.g., several) fermenting microorganisms to produce the fermentation product; and
[0427] (c) recovering the fermentation product from the fermentation.
[0428] [15] The method of paragraph 14, wherein the enzyme composition comprises one or more (e.g., several) enzymes selected from the group consisting of a cellulase, a GH61 polypeptide having cellulolytic enhancing activity, a hemicellulase, an esterase, an expansin, a laccase, a ligninolytic enzyme, a pectinase, a peroxidase, a protease, and a swollenin.
[0429] [16] The method of paragraph 15, wherein the cellulase is one or more (e.g., several) enzymes selected from the group consisting of an endoglucanase, a cellobiohydrolase, and a beta-glucosidase.
[0430] [17] The method of paragraph 15, wherein the hemicellulase is one or more enzymes selected from the group consisting of a xylanase, an acetylxylan esterase, a feruloyl esterase, an arabinofuranosidase, a xylosidase, and a glucuronidase.
[0431] [18] The method of any of paragraphs 14-17, wherein the cellulosic material is selected from the group consisting of agricultural residue, herbaceous material, municipal solid waste, pulp and paper mill residue, waste paper, and wood; preferably, arundo, bagasse, bamboo, corn cob, corn fiber, corn stover, miscanthus, orange peel, rice straw, switchgrass, wheat straw, eucalyptus, fir, pine, poplar, spruce, willow, algal cellulose, bacterial cellulose, cotton linter, filter paper, microcrystalline cellulose, or phosphoric-acid treated cellulose.
[0432] [19] The method of any of paragraphs 14-18, wherein the cellulosic material is pretreated, especially by chemical pretreatment, physical pretreatment, or biochemical pretreatment; or wherein steps (a) and (b) are performed simultaneously in a simultaneous saccharification and fermentation.
[0433] [20] The method of any of paragraphs 14-19, wherein the fermentation product is an alcohol, an alkane, a cycloalkane, an alkene, an amino acid, a gas, isoprene, a ketone, an organic acid, or polyketide.
[0434] [21] The method of any of paragraphs 14-20, wherein the presence of the polypeptide having catalase activity increases the hydrolysis of the cellulosic material compared to the absence of the polypeptide having catalase activity.
[0435] [22]. The method of any of paragraphs 14-21, wherein the polypeptide having catalase activity is selected from the group consisting of:
[0436] (a) a polypeptide having at least 60% sequence identity to the mature polypeptide of SEQ ID NO: 2, the mature polypeptide of SEQ ID NO: 4, the mature polypeptide of SEQ ID NO: 6, or the mature polypeptide of SEQ ID NO: 8;
[0437] (b) a polypeptide encoded by a polynucleotide that hybridizes under low, medium, medium-high, high, or very high stringency conditions with (i) the mature polypeptide coding sequence of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID NO: 7, (ii) the cDNA sequence thereof, or (iii) the full-length complement of (i) or (ii);
[0438] (c) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the mature polypeptide coding sequence of SEQ ID NO: 1, the mature polypeptide coding sequence of SEQ ID NO: 3, the mature polypeptide coding sequence of SEQ ID NO: 5, or the mature polypeptide coding sequence of SEQ ID NO: 7; or the cDNA sequence thereof;
[0439] (d) a variant of the mature polypeptide of SEQ ID NO: 2, a variant of the mature polypeptide of SEQ ID NO: 4, a variant of the mature polypeptide of SEQ ID NO: 6, or a variant of the mature polypeptide of SEQ ID NO: 8, comprising a substitution, deletion, and/or insertion at one or more (e.g., several) positions; and
[0440] (e) a fragment of the polypeptide of (a), (b), (c), or (d) that has catalase activity.
[0441] [23] The method of any of paragraphs 14-22, wherein the polypeptide having catalase activity is a catalase from Thermoascus, Talaromyces, Humicola, or Penicillium.
[0442] [24] The method of paragraph 23, wherein the polypeptide having catalase activity is a catalase from Thermoascus aurantiacus, Talaromyces stipitatus, Humicola insolens, or Penicillium emersonii.
[0443] [25] A method of fermenting a cellulosic material, comprising: fermenting the cellulosic material with one or more (e.g., several) fermenting microorganisms, wherein the cellulosic material is hydrolyzed with an enzyme composition in the presence of a polypeptide having catalase activity.
[0444] [26] The method of paragraph 25, wherein the enzyme composition comprises one or more (e.g., several) enzymes selected from the group consisting of a cellulase, a GH61 polypeptide having cellulolytic enhancing activity, a hemicellulase, an esterase, an expansin, a laccase, a ligninolytic enzyme, a pectinase, a peroxidase, a protease, and a swollenin.
[0445] [27] The method of paragraph 26, wherein the cellulase is one or more (e.g., several) enzymes selected from the group consisting of an endoglucanase, a cellobiohydrolase, and a beta-glucosidase.
[0446] [28] The method of paragraph 26, wherein the hemicellulase is one or more enzymes selected from the group consisting of a xylanase, an acetylxylan esterase, a feruloyl esterase, an arabinofuranosidase, a xylosidase, and a glucuronidase.
[0447] [29] The method of any of paragraphs 25-28, wherein the cellulosic material is selected from the group consisting of agricultural residue, herbaceous material, municipal solid waste, pulp and paper mill residue, waste paper, and wood; preferably, arundo, bagasse, bamboo, corn cob, corn fiber, corn stover, miscanthus, orange peel, rice straw, switchgrass, wheat straw, eucalyptus, fir, pine, poplar, spruce, willow, algal cellulose, bacterial cellulose, cotton linter, filter paper, microcrystalline cellulose, or phosphoric-acid treated cellulose.
[0448] [30] The method of any of paragraphs 25-29, wherein the cellulosic material is pretreated, especially by chemical pretreatment, physical pretreatment, or biochemical pretreatment.
[0449] [31] The method of any of paragraphs 25-30, wherein the fermentation produces a fermentation product.
[0450] [32] The method of paragraph 31, further comprising recovering the fermentation product.
[0451] [33] The method of paragraph 32, wherein the fermentation product is an alcohol, an alkane, a cycloalkane, an alkene, an amino acid, a gas, isoprene, a ketone, an organic acid, or polyketide.
[0452] [34] The method of any of paragraphs 25-33, wherein the presence of the polypeptide having catalase activity increases the hydrolysis of the cellulosic material compared to the absence of the polypeptide having catalase activity.
[0453] [35] The method of any of paragraphs 25-34, wherein the polypeptide having catalase activity is selected from the group consisting of:
[0454] (a) a polypeptide having at least 60% sequence identity to the mature polypeptide of SEQ ID NO: 2, the mature polypeptide of SEQ ID NO: 4, the mature polypeptide of SEQ ID NO: 6, or the mature polypeptide of SEQ ID NO: 8;
[0455] (b) a polypeptide encoded by a polynucleotide that hybridizes under low, medium, medium-high, high, or very high stringency conditions with (i) the mature polypeptide coding sequence of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID NO: 7, (ii) the cDNA sequence thereof, or (iii) the full-length complement of (i) or (ii);
[0456] (c) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the mature polypeptide coding sequence of SEQ ID NO: 1, the mature polypeptide coding sequence of SEQ ID NO: 3, the mature polypeptide coding sequence of SEQ ID NO: 5, or the mature polypeptide coding sequence of SEQ ID NO: 7; or the cDNA sequence thereof;
[0457] (d) a variant of the mature polypeptide of SEQ ID NO: 2, a variant of the mature polypeptide of SEQ ID NO: 4, a variant of the mature polypeptide of SEQ ID NO: 6, or a variant of the mature polypeptide of SEQ ID NO: 8, comprising a substitution, deletion, and/or insertion at one or more (e.g., several) positions; and
[0458] (e) a fragment of the polypeptide of (a), (b), (c), or (d) that has catalase activity.
[0459] [36] The method of any of paragraphs 25-35, wherein the polypeptide having catalase activity is a catalase from Thermoascus, Talaromyces, Humicola, or Penicillium.
[0460] [37] The method of paragraph 36, wherein the polypeptide having catalase activity is a catalase from Thermoascus aurantiacus, Talaromyces stipitatus, Humicola insolens, or Penicillium emersonii.
[0461] [38] An enzyme composition for degrading or converting a cellulosic material comprising one or more (e.g., several) enzymes having cellulolytic and/or hemicellulolytic activity and a polypeptide having catalase activity.
[0462] [39] The enzyme composition of paragraph 38, further comprising one or more (e.g., several) enzymes selected from the group consisting of a GH61 polypeptide having cellulolytic enhancing activity, an esterase, an expansin, a laccase, a ligninolytic enzyme, a pectinase, a peroxidase, a protease, and a swollenin.
[0463] [40] The enzyme composition of paragraph 38 or 39, wherein the enzymes having cellulolytic activity are selected from the group consisting of an endoglucanase, a cellobiohydrolase, and a beta-glucosidase.
[0464] [41] The enzyme composition of any of paragraphs 38-40, wherein the enzymes having hemicellulolytic activity are selected from the group consisting of a xylanase, an acetylxylan esterase, a feruloyl esterase, an arabinofuranosidase, a xylosidase, and a glucuronidase.
[0465] [42] The enzyme composition of any of paragraphs 38-41, wherein the polypeptide having catalase activity is selected from the group consisting of:
[0466] (a) a polypeptide having at least 60% sequence identity to the mature polypeptide of SEQ ID NO: 2, the mature polypeptide of SEQ ID NO: 4, the mature polypeptide of SEQ ID NO: 6, or the mature polypeptide of SEQ ID NO: 8;
[0467] (b) a polypeptide encoded by a polynucleotide that hybridizes under low, medium, medium-high, high, or very high stringency conditions with (i) the mature polypeptide coding sequence of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 5, or SEQ ID NO: 7, (ii) the cDNA sequence thereof, or (iii) the full-length complement of (i) or (ii);
[0468] (c) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the mature polypeptide coding sequence of SEQ ID NO: 1, the mature polypeptide coding sequence of SEQ ID NO: 3, the mature polypeptide coding sequence of SEQ ID NO: 5, or the mature polypeptide coding sequence of SEQ ID NO: 7; or the cDNA sequence thereof;
[0469] (d) a variant of the mature polypeptide of SEQ ID NO: 2, a variant of the mature polypeptide of SEQ ID NO: 4, a variant of the mature polypeptide of SEQ ID NO: 6, or a variant of the mature polypeptide of SEQ ID NO: 8, comprising a substitution, deletion, and/or insertion at one or more (e.g., several) positions; and
[0470] (e) a fragment of the polypeptide of (a), (b), (c), or (d) that has catalase activity.
[0471] [43] The enzyme composition of any of paragraphs 38-42, wherein the polypeptide having catalase activity is a catalase from Thermoascus, Talaromyces, Humicola, or Penicillium.
[0472] [44] The enzyme composition of paragraph 43, wherein the polypeptide having catalase activity is a catalase from Thermoascus aurantiacus, Talaromyces stipitatus, Humicola insolens, or Penicillium emersonii.
[0473] [45] Use of the enzyme composition of any of paragraphs 38-44 in degrading or converting a cellulosic material.
[0474] [46] The use of paragraph 45, wherein the cellulosic material is selected from the group consisting of agricultural residue, herbaceous material, municipal solid waste, pulp and paper mill residue, waste paper, and wood; preferably, arundo, bagasse, bamboo, corn cob, corn fiber, corn stover, miscanthus, orange peel, rice straw, switchgrass, wheat straw, eucalyptus, fir, pine, poplar, spruce, willow, algal cellulose, bacterial cellulose, cotton linter, filter paper, microcrystalline cellulose, or phosphoric-acid treated cellulose.
[0475] [47] The use of paragraph 45 or 46, wherein the cellulosic material is pretreated, especially by chemical pretreatment, physical pretreatment, or biochemical pretreatment.
[0476] [48] A whole broth formulation or cell culture composition comprising one or more (e.g., several) enzymes having cellulolytic and/or hemicellulolytic activity and a polypeptide having catalase activity.
[0477] The invention described and claimed herein is not to be limited in scope by the specific aspects herein disclosed, since these aspects are intended as illustrations of several aspects of the invention. Any equivalent aspects are intended to be within the scope of this invention. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims. In the case of conflict, the present disclosure including definitions will control.
Sequence CWU
1
1
4312354DNAThermoascus aurantiacus 1atggctgcta caatcgctgg tgggctacac
aaggcccagg acctcatcaa gaacaccaca 60tcgagggaca agaagctggt ggacttggag
cgcgacactg acaatatcca cacaaagtgc 120ccctttacgt cagaccatgg cgttgctatt
agcaataccg acaattggct gaaggctgtg 180gattaccagc atactggtcc atccctcctg
gaagatcagt ttgctagaga gaaggtatat 240ccaatttctc ttcccgaatg aaagctcggt
tgcagcattt gactcgcatc cagattcacc 300gcttcgatca cgagcgcatt ccagaaagag
tggttcatgc acgaggagca ggtgccttcg 360gaaacttcag actcaaggag agcgcggcgg
atgttaccca cgcaggagta ctgacggaca 420cgtcgcgaaa caccccagtc tttgtccgtt
tctccactgt ccaagggagt aggggtagtg 480ccgacacagt ccgcgacgtt cgaggattcg
cggttaagtt ctacaccgac gagggtaatt 540gggatatagt gggcaacaac atccctgtct
tcttcatcca ggaagctata aagtttcccg 600atgtcggtac gcgatctcgt attctacccg
ttgtcaggtc tagcaactaa ctgcactaag 660ttcacgctgt caagccggag ccacacaatg
aggtccctac agcgcagacg gcccataata 720acttctggga cttcgtatat atgcatcccg
aggcgacaca catgttcatg tggatcatgt 780ctgatagagc aattccacga tcgttccgta
tgatgcaagg attcagtgtg aacactttcg 840ttctcgttaa caaagagggc aaacgctcct
ttgtcaagtt ccattggatc cccaagcttg 900gcctgcactc actcgtttgg gatgaggccc
agaaaattgc tggccaggat cccgattttc 960accggaaaga cctctgggag gccattgcca
acaagatcta ccccaaatgg gattttggta 1020tccaggtgat cccagaggag aaacagcatg
acttcgaatt tgatctcctc gatgccacca 1080agctctggcc cgaggatctc atccccgtcc
gtatcatcgg agagctggag cttaaccgca 1140atccggacga gttcttcacc cagacagagc
aggttgcttt ctgcaccagc cacattgtcc 1200ccggcataga attctctgat gaccctctac
tccatggccg caacttctcc tacttcgata 1260cccagattac cagactggga attaactggg
aagagcttcc aatcaaccgt cccgtatgtc 1320ctgttttcaa ccacaacagg gacggccagt
cgcgacatag gatcacacag ggaactgtta 1380actactggcc caaccgcttt gaagcttgtc
caccgaccaa acccgaggaa ggtgggttcg 1440taacataccc gtcgacattc cacggcctca
aacagcgcac cttgagttat aagttccgtg 1500aacatcataa ccaagcgcag atgttctata
actccctgtc ggagcacgaa aagctccatc 1560ttaccaaggc attcagcttc gagcttgacc
actgcgacga cccaaccgtc tacagtcggt 1620tggccggcga acgcctggcc gaaatcgacc
tcgggctcgc ccagaaagtc gccgagatgg 1680tcggcgcccc gatcccgacg agagccctca
aggagaacaa aggccagcga gcggtccggc 1740tgtccttcac ggaattctat cctcctaacc
caaccatcga gtcccgccgt gtggctatca 1800taatcggcga cggctacgac ccgatagcgt
tcttgggcat caaagcagcc gtcgaggccg 1860ccaacgctct gcccttcatc atcggcacca
agcgccagcc catgttcgcc gaaggcgaag 1920acagaaccac ctccaagggc gtcatcccgg
accaccagta cgacggacag cgctcgaccc 1980tgttcgacgc gaccttcatc ccaggcggct
ctcacgtcaa gacgctggcc cagaacgggc 2040agatccggta ctggatcacc gagaccttcg
ggcacctgaa ggctctggcc gcgacgggcg 2100aggctgtcga ccttgtcaag caggtgctga
gcgtcgttcc cggtctgcag ttcgccactc 2160caaacgagcc caacgtcgtc gagtcatacg
gcgtcgtcac cgtgggcggg aagcagaagc 2220ctgagagctt cagcgagggg ttcaagatcc
tcaaggatgc gaaggacttt gtgggccagt 2280tcttctacca gatctcacag catcggaact
ttaagcgtga gctggaaggg ctgcatgcca 2340ccgttgcgtt ttaa
23542746PRTThermoascus aurantiacus 2Met
Ala Ala Thr Ile Ala Gly Gly Leu His Lys Ala Gln Asp Leu Ile 1
5 10 15 Lys Asn Thr Thr Ser Arg
Asp Lys Lys Leu Val Asp Leu Glu Arg Asp 20
25 30 Thr Asp Asn Ile His Thr Lys Cys Pro Phe
Thr Ser Asp His Gly Val 35 40
45 Ala Ile Ser Asn Thr Asp Asn Trp Leu Lys Ala Val Asp Tyr
Gln His 50 55 60
Thr Gly Pro Ser Leu Leu Glu Asp Gln Phe Ala Arg Glu Lys Ile His 65
70 75 80 Arg Phe Asp His Glu
Arg Ile Pro Glu Arg Val Val His Ala Arg Gly 85
90 95 Ala Gly Ala Phe Gly Asn Phe Arg Leu Lys
Glu Ser Ala Ala Asp Val 100 105
110 Thr His Ala Gly Val Leu Thr Asp Thr Ser Arg Asn Thr Pro Val
Phe 115 120 125 Val
Arg Phe Ser Thr Val Gln Gly Ser Arg Gly Ser Ala Asp Thr Val 130
135 140 Arg Asp Val Arg Gly Phe
Ala Val Lys Phe Tyr Thr Asp Glu Gly Asn 145 150
155 160 Trp Asp Ile Val Gly Asn Asn Ile Pro Val Phe
Phe Ile Gln Glu Ala 165 170
175 Ile Lys Phe Pro Asp Val Val His Ala Val Lys Pro Glu Pro His Asn
180 185 190 Glu Val
Pro Thr Ala Gln Thr Ala His Asn Asn Phe Trp Asp Phe Val 195
200 205 Tyr Met His Pro Glu Ala Thr
His Met Phe Met Trp Ile Met Ser Asp 210 215
220 Arg Ala Ile Pro Arg Ser Phe Arg Met Met Gln Gly
Phe Ser Val Asn 225 230 235
240 Thr Phe Val Leu Val Asn Lys Glu Gly Lys Arg Ser Phe Val Lys Phe
245 250 255 His Trp Ile
Pro Lys Leu Gly Leu His Ser Leu Val Trp Asp Glu Ala 260
265 270 Gln Lys Ile Ala Gly Gln Asp Pro
Asp Phe His Arg Lys Asp Leu Trp 275 280
285 Glu Ala Ile Ala Asn Lys Ile Tyr Pro Lys Trp Asp Phe
Gly Ile Gln 290 295 300
Val Ile Pro Glu Glu Lys Gln His Asp Phe Glu Phe Asp Leu Leu Asp 305
310 315 320 Ala Thr Lys Leu
Trp Pro Glu Asp Leu Ile Pro Val Arg Ile Ile Gly 325
330 335 Glu Leu Glu Leu Asn Arg Asn Pro Asp
Glu Phe Phe Thr Gln Thr Glu 340 345
350 Gln Val Ala Phe Cys Thr Ser His Ile Val Pro Gly Ile Glu
Phe Ser 355 360 365
Asp Asp Pro Leu Leu His Gly Arg Asn Phe Ser Tyr Phe Asp Thr Gln 370
375 380 Ile Thr Arg Leu Gly
Ile Asn Trp Glu Glu Leu Pro Ile Asn Arg Pro 385 390
395 400 Val Cys Pro Val Phe Asn His Asn Arg Asp
Gly Gln Ser Arg His Arg 405 410
415 Ile Thr Gln Gly Thr Val Asn Tyr Trp Pro Asn Arg Phe Glu Ala
Cys 420 425 430 Pro
Pro Thr Lys Pro Glu Glu Gly Gly Phe Val Thr Tyr Pro Ser Thr 435
440 445 Phe His Gly Leu Lys Gln
Arg Thr Leu Ser Tyr Lys Phe Arg Glu His 450 455
460 His Asn Gln Ala Gln Met Phe Tyr Asn Ser Leu
Ser Glu His Glu Lys 465 470 475
480 Leu His Leu Thr Lys Ala Phe Ser Phe Glu Leu Asp His Cys Asp Asp
485 490 495 Pro Thr
Val Tyr Ser Arg Leu Ala Gly Glu Arg Leu Ala Glu Ile Asp 500
505 510 Leu Gly Leu Ala Gln Lys Val
Ala Glu Met Val Gly Ala Pro Ile Pro 515 520
525 Thr Arg Ala Leu Lys Glu Asn Lys Gly Gln Arg Ala
Val Arg Leu Ser 530 535 540
Phe Thr Glu Phe Tyr Pro Pro Asn Pro Thr Ile Glu Ser Arg Arg Val 545
550 555 560 Ala Ile Ile
Ile Gly Asp Gly Tyr Asp Pro Ile Ala Phe Leu Gly Ile 565
570 575 Lys Ala Ala Val Glu Ala Ala Asn
Ala Leu Pro Phe Ile Ile Gly Thr 580 585
590 Lys Arg Gln Pro Met Phe Ala Glu Gly Glu Asp Arg Thr
Thr Ser Lys 595 600 605
Gly Val Ile Pro Asp His Gln Tyr Asp Gly Gln Arg Ser Thr Leu Phe 610
615 620 Asp Ala Thr Phe
Ile Pro Gly Gly Ser His Val Lys Thr Leu Ala Gln 625 630
635 640 Asn Gly Gln Ile Arg Tyr Trp Ile Thr
Glu Thr Phe Gly His Leu Lys 645 650
655 Ala Leu Ala Ala Thr Gly Glu Ala Val Asp Leu Val Lys Gln
Val Leu 660 665 670
Ser Val Val Pro Gly Leu Gln Phe Ala Thr Pro Asn Glu Pro Asn Val
675 680 685 Val Glu Ser Tyr
Gly Val Val Thr Val Gly Gly Lys Gln Lys Pro Glu 690
695 700 Ser Phe Ser Glu Gly Phe Lys Ile
Leu Lys Asp Ala Lys Asp Phe Val 705 710
715 720 Gly Gln Phe Phe Tyr Gln Ile Ser Gln His Arg Asn
Phe Lys Arg Glu 725 730
735 Leu Glu Gly Leu His Ala Thr Val Ala Phe 740
745 3 2452DNATalaromyces stipitatus 3atgcgagggg
catactctct cggcgccttt gccagtctca tcgcggtagc ttcggctgcc 60tgcccaatgc
tgactggcga aatcccagca ggcagcattg caaaccctca tcaccttgga 120agccgcgctg
actcgaatgc ttccgacgaa acagaagcct ttctgtccga attctacctt 180aatgacaaca
acagcttcct cactaccgat gtgggcggcc cgatagaaga ccaaaacagt 240ctcaaggccg
gcattcgcgg atcaacgctc ttggaggatt tcatctttcg ccagaagatt 300cagcgctttg
atcacgagcg tgtaagttct tgaaatcata tgactacttc gatgtgtact 360tacgacttct
aggtgcccga acgcgctgtg catgctcgag gtgctggtgc tcatggtgta 420ttcacatcgt
atgctgattg gtccaacatc accgctgctt cattcctagg agctgccgga 480aaggaaacgc
ccacctttgt acgcttctcg actgttgccg gcagtcgtgg tagtgccgat 540accgctcgtg
atgttcacgg ctttgctacc cgcttctata ctgacgaagg caactacggt 600aagatctatc
catggtcata gcagcctata catttgctaa ctcacagcag atatcgttgg 660aaacaacatt
cccgtcttct tcatccaaga cgctattcag ttccctgacc tcattcatgc 720agtcaagcca
cagccagcca gtgaaatccc acaggccgct actgcccacg acactgctta 780tgatttcttc
ggccagcagc ctagtacctt gcataccctc ttctgggcaa tggcaggtca 840tggtatcccg
cggtctttcc gccatgttga cggattcggc gttcacgctt accgatttgt 900gactgacgac
ggctcttcaa agctagtcaa attccactgg aagtccttac agggtcgtgc 960aagcttagtc
tgggaagaag ctcaagccac tgctggcaag aatgctgatt tcatgaggca 1020agatctgttt
gacaatattg cagctggccg gtacccagaa tgggaggtga gtataagttg 1080cctgtcctcc
gaagaatttc actaacatga atagctcggc gtgcaactta tcgaggaacc 1140agaccagctc
agctacggat ttgatctgct tgatcccacc aagatactcc cagttgaaca 1200agttccaatc
accccgctcg gaaaaatgca actcaaccgt aacccgctaa actactttgc 1260tgagaccgag
caagtaatgg tacgttaact tctcttctcc ccctcccccc aaacaactcc 1320gggtacagct
catgctgatc attttagttc caacctggtc acattgttcg tggcattgat 1380ttcacagagg
accctcttct ccaaggccgt ctattttctt acctcgacac tcagcttaat 1440cgcaacggtg
gccccaactt tgagcagatt ccgatcaacc gtcctcgtgt tcctatccac 1500aataacaacc
gagacggttt tggccagatg tttattccac tcaacgatgc ggcatactcg 1560ccaaacaccc
taagcgatgg caaccctaag caggcaaatg agactgttgg aaatggtttc 1620tttactactc
caggacgcac tgccaatgga aacctcgtcc gcgccaaaag cccaacgttt 1680gcggatgtgt
ggtcccaacc tggcctcttt tacaactcct tgacagccac cgaacaacag 1740tttgtcatca
atgctctgcg gttcgagcta gccaatgtag caagtgagac tgtgaagaat 1800aacttcatca
cccagatcaa tcgcgtaaac aacaccttgg caacacttgt agccactgca 1860attggtgtca
atgctcctga acccgacccg acttactacc accacaacaa gacgtctgat 1920gtgggaacat
tcggtactcc tctgaagaag attgatggtc tcaaggtcgg agtccttgct 1980tctgtcaacg
atgaaaacag tatttccgag ggacagtctc tagcacgaag cttggcggat 2040ttgaatgtgg
acgtcgttat tgtcgctgaa cgacttgctg gtaatgtctc agctacatac 2100tccgcatctg
acgctatcaa cttcgatgct gttattgtca cttcaggggc taagggtctc 2160tttggacctc
aaaccttcac cgccgtatcc aacaccactc tttatcccgt gggccgtccc 2220acgcagattt
tggtcgacgc tttccgctac ggcaagccgg ttggagcagt gggtagtgca 2280agcgaagcgc
tgactgtttc ggacattgat actgaccgca gtggtgtgat tactggtgat 2340ttgaacgacg
agtttgtgaa gcaactgtcg gaggaccttg caacattcaa gttcttggac 2400cgcttcaccg
tcgacgagta gacgtttcca cgattaacac gagtaacttg tt
24524733PRTTalaromyces stipitatus 4Met Arg Gly Ala Tyr Ser Leu Gly Ala
Phe Ala Ser Leu Ile Ala Val 1 5 10
15 Ala Ser Ala Ala Cys Pro Met Leu Thr Gly Glu Ile Pro Ala
Gly Ser 20 25 30
Ile Ala Asn Pro His His Leu Gly Ser Arg Ala Asp Ser Asn Ala Ser
35 40 45 Asp Glu Thr Glu
Ala Phe Leu Ser Glu Phe Tyr Leu Asn Asp Asn Asn 50
55 60 Ser Phe Leu Thr Thr Asp Val Gly
Gly Pro Ile Glu Asp Gln Asn Ser 65 70
75 80 Leu Lys Ala Gly Ile Arg Gly Ser Thr Leu Leu Glu
Asp Phe Ile Phe 85 90
95 Arg Gln Lys Ile Gln Arg Phe Asp His Glu Arg Val Pro Glu Arg Ala
100 105 110 Val His Ala
Arg Gly Ala Gly Ala His Gly Val Phe Thr Ser Tyr Ala 115
120 125 Asp Trp Ser Asn Ile Thr Ala Ala
Ser Phe Leu Gly Ala Ala Gly Lys 130 135
140 Glu Thr Pro Thr Phe Val Arg Phe Ser Thr Val Ala Gly
Ser Arg Gly 145 150 155
160 Ser Ala Asp Thr Ala Arg Asp Val His Gly Phe Ala Thr Arg Phe Tyr
165 170 175 Thr Asp Glu Gly
Asn Tyr Asp Ile Val Gly Asn Asn Ile Pro Val Phe 180
185 190 Phe Ile Gln Asp Ala Ile Gln Phe Pro
Asp Leu Ile His Ala Val Lys 195 200
205 Pro Gln Pro Ala Ser Glu Ile Pro Gln Ala Ala Thr Ala His
Asp Thr 210 215 220
Ala Tyr Asp Phe Phe Gly Gln Gln Pro Ser Thr Leu His Thr Leu Phe 225
230 235 240 Trp Ala Met Ala Gly
His Gly Ile Pro Arg Ser Phe Arg His Val Asp 245
250 255 Gly Phe Gly Val His Ala Tyr Arg Phe Val
Thr Asp Asp Gly Ser Ser 260 265
270 Lys Leu Val Lys Phe His Trp Lys Ser Leu Gln Gly Arg Ala Ser
Leu 275 280 285 Val
Trp Glu Glu Ala Gln Ala Thr Ala Gly Lys Asn Ala Asp Phe Met 290
295 300 Arg Gln Asp Leu Phe Asp
Asn Ile Ala Ala Gly Arg Tyr Pro Glu Trp 305 310
315 320 Glu Leu Gly Val Gln Leu Ile Glu Glu Pro Asp
Gln Leu Ser Tyr Gly 325 330
335 Phe Asp Leu Leu Asp Pro Thr Lys Ile Leu Pro Val Glu Gln Val Pro
340 345 350 Ile Thr
Pro Leu Gly Lys Met Gln Leu Asn Arg Asn Pro Leu Asn Tyr 355
360 365 Phe Ala Glu Thr Glu Gln Val
Met Phe Gln Pro Gly His Ile Val Arg 370 375
380 Gly Ile Asp Phe Thr Glu Asp Pro Leu Leu Gln Gly
Arg Leu Phe Ser 385 390 395
400 Tyr Leu Asp Thr Gln Leu Asn Arg Asn Gly Gly Pro Asn Phe Glu Gln
405 410 415 Ile Pro Ile
Asn Arg Pro Arg Val Pro Ile His Asn Asn Asn Arg Asp 420
425 430 Gly Phe Gly Gln Met Phe Ile Pro
Leu Asn Asp Ala Ala Tyr Ser Pro 435 440
445 Asn Thr Leu Ser Asp Gly Asn Pro Lys Gln Ala Asn Glu
Thr Val Gly 450 455 460
Asn Gly Phe Phe Thr Thr Pro Gly Arg Thr Ala Asn Gly Asn Leu Val 465
470 475 480 Arg Ala Lys Ser
Pro Thr Phe Ala Asp Val Trp Ser Gln Pro Gly Leu 485
490 495 Phe Tyr Asn Ser Leu Thr Ala Thr Glu
Gln Gln Phe Val Ile Asn Ala 500 505
510 Leu Arg Phe Glu Leu Ala Asn Val Ala Ser Glu Thr Val Lys
Asn Asn 515 520 525
Phe Ile Thr Gln Ile Asn Arg Val Asn Asn Thr Leu Ala Thr Leu Val 530
535 540 Ala Thr Ala Ile Gly
Val Asn Ala Pro Glu Pro Asp Pro Thr Tyr Tyr 545 550
555 560 His His Asn Lys Thr Ser Asp Val Gly Thr
Phe Gly Thr Pro Leu Lys 565 570
575 Lys Ile Asp Gly Leu Lys Val Gly Val Leu Ala Ser Val Asn Asp
Glu 580 585 590 Asn
Ser Ile Ser Glu Gly Gln Ser Leu Ala Arg Ser Leu Ala Asp Leu 595
600 605 Asn Val Asp Val Val Ile
Val Ala Glu Arg Leu Ala Gly Asn Val Ser 610 615
620 Ala Thr Tyr Ser Ala Ser Asp Ala Ile Asn Phe
Asp Ala Val Ile Val 625 630 635
640 Thr Ser Gly Ala Lys Gly Leu Phe Gly Pro Gln Thr Phe Thr Ala Val
645 650 655 Ser Asn
Thr Thr Leu Tyr Pro Val Gly Arg Pro Thr Gln Ile Leu Val 660
665 670 Asp Ala Phe Arg Tyr Gly Lys
Pro Val Gly Ala Val Gly Ser Ala Ser 675 680
685 Glu Ala Leu Thr Val Ser Asp Ile Asp Thr Asp Arg
Ser Gly Val Ile 690 695 700
Thr Gly Asp Leu Asn Asp Glu Phe Val Lys Gln Leu Ser Glu Asp Leu 705
710 715 720 Ala Thr Phe
Lys Phe Leu Asp Arg Phe Thr Val Asp Glu 725
730 53076DNAHumicola insolens 5atgaacagag tcacgaatct
cctcgcctgg gccggcgcga tagggctcgc ccaagcaaca 60tgcccctttg cggaccctgc
cgctctgtat aggcgtcaag atactaccag cggccagtcg 120ccacttgcag catacgaggt
ggatgacagc accggatacc tgacctccga tgttggcggg 180cccattcagg accagaccag
cctcaaggca ggcatccggg gtccgaccct tcttgaggac 240tttatgttcc gccagaagat
ccagcacttc gaccatgaac gggtaaggac ataatgctca 300cacgagcggc tgcgtaccta
tttatttccg agacattggg ctggctggct ggctgtgact 360gcctgagttt ggggacatac
ggagtacctt actgacgcgc tgatccactc caggttcccg 420aaagggcggt ccatgctcga
ggcgctggag cacacgggac cttcacgagt tacgccgact 480ggagtaacat cacagcggcg
tcctttctga acgccacagg aaagcagacg ccggtgtttg 540tccggttctc gaccgttgct
gggtctcgag ggagcgcaga cacggcgaga gacgttcatg 600gtttcgcgac gcggttgtaa
gttttgttgt gtttcattcg ttccggtctg tagaggaggg 660ttaggatatg agctaatgtg
tgtgtgtgtg tgtgtgtgtg tgtgaagtta cactgatgaa 720ggcaactttg gtacgtccca
tgcatggtcc tcaattctct tatctggcag cgatgtggtc 780attgtcgacg ttgctaactt
gcgtagatat cgtcggaaac aacatcccgg tattcttcat 840tcaagatgca atccagttcc
ctgaccttat ccactcggtc aagccgcgtc ccgacaacga 900gattccccaa gcggcgacgg
ctcatgattc agcttgggac ttcttcagcc agcagccaag 960caccatggta agcaatggac
caaggagccg cacctggggt gacatgccag ggagtacacg 1020gagcgttccg atgactctcg
tgtgaccaag gcagtacaac actccacgga ggactcgaag 1080agattcggaa atatggaaca
cagaactgac aggatggtag cacacgttgt tctgggccat 1140gtctggccac ggaatccctc
gcagctatcg ccatatggta cgtttgcctg gctgagatga 1200ccgtgaatcc atttctaacc
tcaagtccag gatggcttcg gcgtccacac gttccggttt 1260gtcaaagatg acggctcgtc
caagttgatc aagtggcatt tcaagtcacg ccagggaaag 1320gcgagtctag tctgggaaga
ggcgcaggtt ctgtctggca agaatgccga cttccaccgt 1380caggacctct gggatgctat
tgagtccggg aacggaccag aatgggatgt ctgcgtccag 1440attgtcgatg agtcccaggc
gcaagccttt ggcttcgact tgctggaccc gacaaagatc 1500atccccgagg agtacgcccc
cttgacgaag ctgggcctct tgaaactgga tcgcaatccg 1560accaactact tcgccgagac
ggagcaggtc atgttccaac ccggtcatat cgtccgcggc 1620atcgacttca cggaggatcc
cctgctacag ggacgtctct tctcgtacct tgacacgcag 1680ctgaaccgga atggcgggcc
caactttgag cagctgccca tcaacatgcc gcgggtgccg 1740attcacaaca ataatcgcga
cggcgccggc cagatgttca tccacaggaa caagtatcct 1800tgtaagtacc tcttttgcct
cgatcgttgt ggtgccggct tgctgacaga cgcagacact 1860cccaacaccc tgaacagtgg
ttatccgcgg caagccaacc aaaatgccgg acgcggattc 1920ttcacagcgc ctggccgtac
cgtcagcggt gccctcgtcc gtgaggtgtc gccaacattc 1980aacgaccact ggtcgcagcc
ccgtctcttc ttcaactccc tcactcccgt cgaacagcag 2040ttcctcgtca acgccatgcg
cttcgaaatc agccttgtga agtcggaaga agtcaggaag 2100aacgtgctca cccagctcaa
ccgcgtcagc catgatgtgg ccgggcgcgt ggccgccgct 2160atcggcctcg ccgcgcccga
cgcggacgac acatactacc acaacaacaa gacggctggc 2220gtctcgatcc ttggaagcgg
gcccttgcct accatcaaga ctctccgcgt cggcatcctg 2280gctaccacga gcgagtcgag
cgcgctggat caggcagccc agctccgcac ccgtctggaa 2340aaggacgggc ttgtggtcac
ggttgtggct gaaacgctgc gcgagggggt agaccagaca 2400tactcgacgg cggatgccac
gggtttcgac ggcgttgttg ttgtggacgg ggcggcggcg 2460ctgtttgcca gcaccgcgtc
gtcgccgttg ttcccgacgg gcaggccgtt gcagatcttt 2520gtggacgcgt atcggtgggg
aaagccggtc ggtgtgtgtg gtgggaagtc gagcgaggtg 2580ttggatgcgg cggatgttcc
ggaaaatggg gacggggtgt attcggagga gtcggtggac 2640aagtttgtgg aggagtttga
gaaggggttg gctactttca gggtgagtct tggtgccttt 2700gttttttgag atgttattgt
tttgtttcgt ctcggacttt gtgaaagaat gacggactga 2760cgtctttggt atctagttta
ccgatcggtt tgctctcgac tcttaggagg acgaatggac 2820agaaagtgag accgagagtg
actcagagac tgagttggag tcggaatagg aatcggaatt 2880ggaatcggag tcagggtcgg
agtcaggatc agagaatgaa ttttgtatcc aggaagtcga 2940tggggtgtat cagatttgta
atcaccagga ggatgaggat tctgaggata gcacttcgga 3000agaggaggga gatgaagttg
agcctgtcat ttcatgggga tgaggagggg agaaggaata 3060gggtgggagt tgtacc
30766765PRTHumicola insolens
6Met Asn Arg Val Thr Asn Leu Leu Ala Trp Ala Gly Ala Ile Gly Leu 1
5 10 15 Ala Gln Ala Thr
Cys Pro Phe Ala Asp Pro Ala Ala Leu Tyr Arg Arg 20
25 30 Gln Asp Thr Thr Ser Gly Gln Ser Pro
Leu Ala Ala Tyr Glu Val Asp 35 40
45 Asp Ser Thr Gly Tyr Leu Thr Ser Asp Val Gly Gly Pro Ile
Gln Asp 50 55 60
Gln Thr Ser Leu Lys Ala Gly Ile Arg Gly Pro Thr Leu Leu Glu Asp 65
70 75 80 Phe Met Phe Arg Gln
Lys Ile Gln His Phe Asp His Glu Arg Val Pro 85
90 95 Glu Arg Ala Val His Ala Arg Gly Ala Gly
Ala His Gly Thr Phe Thr 100 105
110 Ser Tyr Ala Asp Trp Ser Asn Ile Thr Ala Ala Ser Phe Leu Asn
Ala 115 120 125 Thr
Gly Lys Gln Thr Pro Val Phe Val Arg Phe Ser Thr Val Ala Gly 130
135 140 Ser Arg Gly Ser Ala Asp
Thr Ala Arg Asp Val His Gly Phe Ala Thr 145 150
155 160 Arg Phe Tyr Thr Asp Glu Gly Asn Phe Asp Ile
Val Gly Asn Asn Ile 165 170
175 Pro Val Phe Phe Ile Gln Asp Ala Ile Gln Phe Pro Asp Leu Ile His
180 185 190 Ser Val
Lys Pro Arg Pro Asp Asn Glu Ile Pro Gln Ala Ala Thr Ala 195
200 205 His Asp Ser Ala Trp Asp Phe
Phe Ser Gln Gln Pro Ser Thr Met His 210 215
220 Thr Leu Phe Trp Ala Met Ser Gly His Gly Ile Pro
Arg Ser Tyr Arg 225 230 235
240 His Met Asp Gly Phe Gly Val His Thr Phe Arg Phe Val Lys Asp Asp
245 250 255 Gly Ser Ser
Lys Leu Ile Lys Trp His Phe Lys Ser Arg Gln Gly Lys 260
265 270 Ala Ser Leu Val Trp Glu Glu Ala
Gln Val Leu Ser Gly Lys Asn Ala 275 280
285 Asp Phe His Arg Gln Asp Leu Trp Asp Ala Ile Glu Ser
Gly Asn Gly 290 295 300
Pro Glu Trp Asp Val Cys Val Gln Ile Val Asp Glu Ser Gln Ala Gln 305
310 315 320 Ala Phe Gly Phe
Asp Leu Leu Asp Pro Thr Lys Ile Ile Pro Glu Glu 325
330 335 Tyr Ala Pro Leu Thr Lys Leu Gly Leu
Leu Lys Leu Asp Arg Asn Pro 340 345
350 Thr Asn Tyr Phe Ala Glu Thr Glu Gln Val Met Phe Gln Pro
Gly His 355 360 365
Ile Val Arg Gly Ile Asp Phe Thr Glu Asp Pro Leu Leu Gln Gly Arg 370
375 380 Leu Phe Ser Tyr Leu
Asp Thr Gln Leu Asn Arg Asn Gly Gly Pro Asn 385 390
395 400 Phe Glu Gln Leu Pro Ile Asn Met Pro Arg
Val Pro Ile His Asn Asn 405 410
415 Asn Arg Asp Gly Ala Gly Gln Met Phe Ile His Arg Asn Lys Tyr
Pro 420 425 430 Tyr
Thr Pro Asn Thr Leu Asn Ser Gly Tyr Pro Arg Gln Ala Asn Gln 435
440 445 Asn Ala Gly Arg Gly Phe
Phe Thr Ala Pro Gly Arg Thr Val Ser Gly 450 455
460 Ala Leu Val Arg Glu Val Ser Pro Thr Phe Asn
Asp His Trp Ser Gln 465 470 475
480 Pro Arg Leu Phe Phe Asn Ser Leu Thr Pro Val Glu Gln Gln Phe Leu
485 490 495 Val Asn
Ala Met Arg Phe Glu Ile Ser Leu Val Lys Ser Glu Glu Val 500
505 510 Arg Lys Asn Val Leu Thr Gln
Leu Asn Arg Val Ser His Asp Val Ala 515 520
525 Gly Arg Val Ala Ala Ala Ile Gly Leu Ala Ala Pro
Asp Ala Asp Asp 530 535 540
Thr Tyr Tyr His Asn Asn Lys Thr Ala Gly Val Ser Ile Leu Gly Ser 545
550 555 560 Gly Pro Leu
Pro Thr Ile Lys Thr Leu Arg Val Gly Ile Leu Ala Thr 565
570 575 Thr Ser Glu Ser Ser Ala Leu Asp
Gln Ala Ala Gln Leu Arg Thr Arg 580 585
590 Leu Glu Lys Asp Gly Leu Val Val Thr Val Val Ala Glu
Thr Leu Arg 595 600 605
Glu Gly Val Asp Gln Thr Tyr Ser Thr Ala Asp Ala Thr Gly Phe Asp 610
615 620 Gly Val Val Val
Val Asp Gly Ala Ala Ala Leu Phe Ala Ser Thr Ala 625 630
635 640 Ser Ser Pro Leu Phe Pro Thr Gly Arg
Pro Leu Gln Ile Phe Val Asp 645 650
655 Ala Tyr Arg Trp Gly Lys Pro Val Gly Val Cys Gly Gly Lys
Ser Ser 660 665 670
Glu Val Leu Asp Ala Ala Asp Val Pro Glu Asn Gly Asp Gly Val Tyr
675 680 685 Ser Glu Glu Ser
Val Asp Lys Phe Val Glu Glu Phe Glu Lys Gly Leu 690
695 700 Ala Thr Phe Arg Glu Ser Glu Leu
Glu Ser Glu Ser Gly Ser Glu Ser 705 710
715 720 Gly Ser Glu Asn Glu Phe Cys Ile Gln Glu Val Asp
Gly Val Tyr Gln 725 730
735 Ile Cys Asn His Gln Glu Asp Glu Asp Ser Glu Asp Ser Thr Ser Glu
740 745 750 Glu Glu Gly
Asp Glu Val Glu Pro Val Ile Ser Trp Gly 755 760
765 7 2479DNAPenicillium emersonii 7atgcgcgcag
tgcagcttct gcccagcctc gccggcctga ttggcgctgc ctctgccgtt 60ggatgtccgt
atctgacggg ccagctcgat gccagagacg tgcacaatcc gcacgagttc 120cagcgtcgac
aggatcccgg agatgcggct gcgtccacag agcagttcct gtcccagttc 180tatctcaatg
acagcaacag ctacatgacc actgatgtcg gcggccccat ctcggatcag 240aacagtttga
aggccggaga gcgcggtcca accctgttgg aggacttcat cttccgtcag 300aagatccagc
actttgatca cgagcgggta ggttgtacca tccatgcgag agagatcgat 360cgatgttgac
gtggtggcag gtcccagaac gcgcagtcca tgctcgagga gccggcgccc 420acggaacgtt
cacttcctac ggaaactggt ccaacatcac tgcggcctcc ttcctgagcg 480ctgaagggaa
ggagaccccc gtgtttgtgc gcttctccac cgtggccgga agtcgaggca 540gtgcggacac
ggcgcgcgat gtgcatggct ttgccaccag gttctacact gacgagggca 600actttggtac
gtcgtctcac aatcctctcg actggcatcg tctgaccgct gagcagatat 660cgtcggcaac
aacattccag tcttcttcat ccaggacgcc attctcttcc ctgatctgat 720ccatgctgtc
aagcccagcc ccgacaacga gatcccccag gctgcgactg ctcatgacac 780ggcctgggac
ttcttcagcc agcagcccag tgcgttgcac acgctcttct gggctatgtc 840cggccatgga
atccctcgct cttttcgcca catggacggc tttggcgtcc acactttccg 900attcgtgact
gacgacggcg cctccaagct ggtcaaattc cactggacct cgctgcaggg 960ccgggccagc
ctggtctggg aggaggcgca agcggcagcg ggaaagaacc tggactatat 1020gcgccaggac
ctctatgaca acatcgaagc cggtcgatat cctgaatggg aggtaggtgg 1080ccgcattttc
tcggcatata tatgtccatg ctgacgttcc tagctgggca ttcaaatcgt 1140cgacgaggag
gatcagctca agtttggatt tgatctgctg gatccaacca agatcattcc 1200tgttgaatat
gtccccatca cgccgcttgg gaagctgcag ctcaaccgga atccgctcaa 1260ctatttcgcc
gagacggagc agataatggt atgtaaacag tttgttgttc gattctttgc 1320agtagactga
cgatacatag ttccaacccg gccatattgt gcgcggaatt gactttaccg 1380aagaccccct
tctccaggga cggctcttct cctatctcga cacgcagttg aatcggaatg 1440gaggccccaa
tttcgagcag cttcccatca atcgtcctag ggtgccatgg cataacaaca 1500accgtgatgg
attcagtaag tttacccccc tgcgctgact ctctgcatgc taactccacc 1560aggccaagcg
tttatccccc tgaacaaggc ggcctacagc ccgaacacgc tcaacaatgg 1620caaccccaag
caggcgaacc agactgtggg cgatggattc ttcaccactc ccggacgtac 1680gaccagtggc
cggctcatgc gcaccgtcag ttcgaccttc tccgacgtct ggtcgcagcc 1740tcggctgttc
tacaactcgc tggtgccggc cgagcagcag ttcctcgtca acgccatccg 1800tttcgagaac
tccaacgtca agagcgaagt ggtccggaac aatgtcatca tccagctcaa 1860ccgcgtcgat
aacgacctcg cccgccgggt tgctcgggtc attggcgttg cagaacccga 1920gcccgatcca
acctattatc acaacaacaa gacggccaac gtgggtacgt ttggcacgcc 1980gctcaagcgg
atcgacggtc tcaaagtcgg tgtgcttgcc acagttggcg acccagacag 2040tatcagtcag
ggccagagcc tcagtgacgc gctctcggac tccaaggtcg atgtcactgt 2100cgttgctgag
tctttcacgg acggggtcga tgcgctctac accaactcgg acgcgaccgg 2160cttcgacgcc
gttatcgtgg ctgatggcgc cgaagggctt tttaccccga gtagcttcac 2220agccaaaccg
acgaactcat tctcgacgac aacgctttat ccggccggtc gtccgctgca 2280gatcctggtc
gacgccttcc ggttcggcaa gcccgtcggc gctctgggca gcggagctaa 2340ggcgcttgat
gcggcaggta tctcgactag ccggcctggt gtgtacgtcg ccaactcgac 2400cagcgaggcg
ttcacggacg atatcgagga tggtttgcga acgttcaagt tcctcgaccg 2460gtttgcgctg
gatgagtga
24798741PRTPenicillium emersonii 8Met Arg Ala Val Gln Leu Leu Pro Ser Leu
Ala Gly Leu Ile Gly Ala 1 5 10
15 Ala Ser Ala Val Gly Cys Pro Tyr Leu Thr Gly Gln Leu Asp Ala
Arg 20 25 30 Asp
Val His Asn Pro His Glu Phe Gln Arg Arg Gln Asp Pro Gly Asp 35
40 45 Ala Ala Ala Ser Thr Glu
Gln Phe Leu Ser Gln Phe Tyr Leu Asn Asp 50 55
60 Ser Asn Ser Tyr Met Thr Thr Asp Val Gly
Gly Pro Ile Ser Asp Gln 65 70 75
80 Asn Ser Leu Lys Ala Gly Glu Arg Gly Pro Thr Leu Leu Glu Asp
Phe 85 90 95 Ile
Phe Arg Gln Lys Ile Gln His Phe Asp His Glu Arg Val Pro Glu
100 105 110 Arg Ala Val His Ala
Arg Gly Ala Gly Ala His Gly Thr Phe Thr Ser 115
120 125 Tyr Gly Asn Trp Ser Asn Ile Thr Ala
Ala Ser Phe Leu Ser Ala Glu 130 135
140 Gly Lys Glu Thr Pro Val Phe Val Arg Phe Ser Thr Val
Ala Gly Ser 145 150 155
160 Arg Gly Ser Ala Asp Thr Ala Arg Asp Val His Gly Phe Ala Thr Arg
165 170 175 Phe Tyr Thr Asp
Glu Gly Asn Phe Asp Ile Val Gly Asn Asn Ile Pro 180
185 190 Val Phe Phe Ile Gln Asp Ala Ile Leu
Phe Pro Asp Leu Ile His Ala 195 200
205 Val Lys Pro Ser Pro Asp Asn Glu Ile Pro Gln Ala Ala Thr
Ala His 210 215 220
Asp Thr Ala Trp Asp Phe Phe Ser Gln Gln Pro Ser Ala Leu His Thr 225
230 235 240 Leu Phe Trp Ala Met
Ser Gly His Gly Ile Pro Arg Ser Phe Arg His 245
250 255 Met Asp Gly Phe Gly Val His Thr Phe Arg
Phe Val Thr Asp Asp Gly 260 265
270 Ala Ser Lys Leu Val Lys Phe His Trp Thr Ser Leu Gln Gly Arg
Ala 275 280 285 Ser
Leu Val Trp Glu Glu Ala Gln Ala Ala Ala Gly Lys Asn Leu Asp 290
295 300 Tyr Met Arg Gln Asp Leu
Tyr Asp Asn Ile Glu Ala Gly Arg Tyr Pro 305 310
315 320 Glu Trp Glu Leu Gly Ile Gln Ile Val Asp Glu
Glu Asp Gln Leu Lys 325 330
335 Phe Gly Phe Asp Leu Leu Asp Pro Thr Lys Ile Ile Pro Val Glu Tyr
340 345 350 Val Pro
Ile Thr Pro Leu Gly Lys Leu Gln Leu Asn Arg Asn Pro Leu 355
360 365 Asn Tyr Phe Ala Glu Thr Glu
Gln Ile Met Phe Gln Pro Gly His Ile 370 375
380 Val Arg Gly Ile Asp Phe Thr Glu Asp Pro Leu Leu
Gln Gly Arg Leu 385 390 395
400 Phe Ser Tyr Leu Asp Thr Gln Leu Asn Arg Asn Gly Gly Pro Asn Phe
405 410 415 Glu Gln Leu
Pro Ile Asn Arg Pro Arg Val Pro Trp His Asn Asn Asn 420
425 430 Arg Asp Gly Phe Ser Gln Ala Phe
Ile Pro Leu Asn Lys Ala Ala Tyr 435 440
445 Ser Pro Asn Thr Leu Asn Asn Gly Asn Pro Lys Gln Ala
Asn Gln Thr 450 455 460
Val Gly Asp Gly Phe Phe Thr Thr Pro Gly Arg Thr Thr Ser Gly Arg 465
470 475 480 Leu Met Arg Thr
Val Ser Ser Thr Phe Ser Asp Val Trp Ser Gln Pro 485
490 495 Arg Leu Phe Tyr Asn Ser Leu Val Pro
Ala Glu Gln Gln Phe Leu Val 500 505
510 Asn Ala Ile Arg Phe Glu Asn Ser Asn Val Lys Ser Glu Val
Val Arg 515 520 525
Asn Asn Val Ile Ile Gln Leu Asn Arg Val Asp Asn Asp Leu Ala Arg 530
535 540 Arg Val Ala Arg Val
Ile Gly Val Ala Glu Pro Glu Pro Asp Pro Thr 545 550
555 560 Tyr Tyr His Asn Asn Lys Thr Ala Asn Val
Gly Thr Phe Gly Thr Pro 565 570
575 Leu Lys Arg Ile Asp Gly Leu Lys Val Gly Val Leu Ala Thr Val
Gly 580 585 590 Asp
Pro Asp Ser Ile Ser Gln Gly Gln Ser Leu Ser Asp Ala Leu Ser 595
600 605 Asp Ser Lys Val Asp Val
Thr Val Val Ala Glu Ser Phe Thr Asp Gly 610 615
620 Val Asp Ala Leu Tyr Thr Asn Ser Asp Ala Thr
Gly Phe Asp Ala Val 625 630 635
640 Ile Val Ala Asp Gly Ala Glu Gly Leu Phe Thr Pro Ser Ser Phe Thr
645 650 655 Ala Lys
Pro Thr Asn Ser Phe Ser Thr Thr Thr Leu Tyr Pro Ala Gly 660
665 670 Arg Pro Leu Gln Ile Leu Val
Asp Ala Phe Arg Phe Gly Lys Pro Val 675 680
685 Gly Ala Leu Gly Ser Gly Ala Lys Ala Leu Asp Ala
Ala Gly Ile Ser 690 695 700
Thr Ser Arg Pro Gly Val Tyr Val Ala Asn Ser Thr Ser Glu Ala Phe 705
710 715 720 Thr Asp Asp
Ile Glu Asp Gly Leu Arg Thr Phe Lys Phe Leu Asp Arg 725
730 735 Phe Ala Leu Asp Glu
740 9 860DNAThermus Brockianus 9tagaccgcct gcagattgag
ttgcccatgc ccaaggagca ggaccccaac gccgccgccg 60cggtgcaggc ccttctgggc
ggtcgcttcg gggagatgtc caccctgatg aactacatgt 120accagtcctt caacttccgg
gggaagaagg cgcttaagcc ctactacgac ctcatcgcca 180acatcgccac ggaggagctt
gggcacattg agctcgtttc cgccaccatc aacagcctcc 240tggccaaaaa ccccggaaag
gacctggagg aaggggtaga ccccgtgagc gcccccttgg 300gcttctccaa ggacgcccgc
aacgccgccc acttcatcgc cgggggggcc aacaccctgg 360tgatgggggc catgggagag
cactggcacg gggagtacgt cttcaccagc ggcaacctca 420tcctggacct tctccacaac
ttcttcctgg aggtggcggc ccgcacccac aagctccggg 480tctacgagat gacggataac
cccgtggccc gggagatgat cggctacctc ctggtgcggg 540gtggggtcca cgccgccgcc
tacggcaagg ccctggaaac ccttaccggg gtggagatga 600ccaagatgct ccccatcccc
cggattgaca acagcaagat ccccgaggcc aagaagtaca 660tggacctggg cttccaccgc
aacctctacc gctttagccc ctccgattac caggacctgg 720gccttatctg gaatggggct
tcccccgagg acgggagcga ggtggtggtg gtggacggcc 780cccccacggg cggccccgtg
tttgacgccg gccacgacgc cgccgagttc gcccccgagt 840tccaccccgc cgagctctac
86010286PRTThermus
Brockianus 10Asp Arg Leu Gln Ile Glu Leu Pro Met Pro Lys Glu Gln Asp Pro
Asn 1 5 10 15 Ala
Ala Ala Ala Val Gln Ala Leu Leu Gly Gly Arg Phe Gly Glu Met
20 25 30 Ser Thr Leu Met Asn
Tyr Met Tyr Gln Ser Phe Asn Phe Arg Gly Lys 35
40 45 Lys Ala Leu Lys Pro Tyr Tyr Asp Leu
Ile Ala Asn Ile Ala Thr Glu 50 55
60 Glu Leu Gly His Ile Glu Leu Val Ser Ala Thr Ile Asn
Ser Leu Leu 65 70 75
80 Ala Lys Asn Pro Gly Lys Asp Leu Glu Glu Gly Val Asp Pro Val Ser
85 90 95 Ala Pro Leu Gly
Phe Ser Lys Asp Ala Arg Asn Ala Ala His Phe Ile 100
105 110 Ala Gly Gly Ala Asn Thr Leu Val Met
Gly Ala Met Gly Glu His Trp 115 120
125 His Gly Glu Tyr Val Phe Thr Ser Gly Asn Leu Ile Leu Asp
Leu Leu 130 135 140
His Asn Phe Phe Leu Glu Val Ala Ala Arg Thr His Lys Leu Arg Val 145
150 155 160 Tyr Glu Met Thr Asp
Asn Pro Val Ala Arg Glu Met Ile Gly Tyr Leu 165
170 175 Leu Val Arg Gly Gly Val His Ala Ala Ala
Tyr Gly Lys Ala Leu Glu 180 185
190 Thr Leu Thr Gly Val Glu Met Thr Lys Met Leu Pro Ile Pro Arg
Ile 195 200 205 Asp
Asn Ser Lys Ile Pro Glu Ala Lys Lys Tyr Met Asp Leu Gly Phe 210
215 220 His Arg Asn Leu Tyr Arg
Phe Ser Pro Ser Asp Tyr Gln Asp Leu Gly 225 230
235 240 Leu Ile Trp Asn Gly Ala Ser Pro Glu Asp Gly
Ser Glu Val Val Val 245 250
255 Val Asp Gly Pro Pro Thr Gly Gly Pro Val Phe Asp Ala Gly His Asp
260 265 270 Ala Ala
Glu Phe Ala Pro Glu Phe His Pro Ala Glu Leu Tyr 275
280 285 11 1542DNASaccharomyces pastorianus
11atgtcaggac aagaggagaa taaagtaaat tcttctgacg taagaaagga tagagttgtg
60acgaactcta ctggtaatcc catcaatgag ccatttgtca cccagcgtgt tggggagcac
120gggcctttgc ttttacaaga ttataaccta ctcgattctt tggcgcattt taacagggag
180aatattcctc aaagaaatcc tcacgcccac ggttctgggg ccttcggtta ttttgaagtg
240acagacgata ttacagatgt ttgtgggtct gccatgttta gcaagatcgg taagagaacg
300aagtgtctga caagattctc cactgtgggt ggtgataaag gtagtgccga tactgttcgt
360gacccaagag ggtttgcaac taaattctac acagaagaag gtaatttgga ttgggtctac
420aacaatacac ctgtattttt tatcagggat ccttcgaaat tcccccattt tatccacacg
480cagaagagaa acccgcaaac taatctaaga gacgctgata tgttttggga tttccttacg
540actccagaga atcaagtggc catccatcaa gtcatgattc tcttttcaga ccgtggtact
600cctgcgagct atcgtaacat gcacggatat tctggtcata cttataaatg gtcaagtaaa
660aacggcgatt ggcgttatgt gcaagtccat attaaaacca atcaaggggt caagaatttg
720actatagacg aagccactaa aatcgcaggg tccaacccag attactgcca aaaagacttg
780tttgaatcta tccaaagcgg taactatcca tcgtggactg tttatattca aacaatgact
840gaacaggagg ccaagaattt accattttcg gtctttgact tgaccaaggt atggcctcaa
900aagcaattcc cattacgtcg tgtaggcaaa cttgttctga atgaaaatcc actgaatttc
960ttcgcacaag tggaacaagc agcgtttgcc cctagtacta ctgtcccata ccaagaagcc
1020agtgctgatc cggtgctaca agctcgatta ttttcttatg cagatgctca cagatacaga
1080ctgggcccca atttccatca aatacccgtc aactgtccct atgcctccaa gttttttaac
1140cctgccatca gagatggccc aatgaacgta aatggaaatt ttggttcaga acctacctat
1200ttagccaacg acaaatcata ctcgtatatt cagcaagaaa gacctattca acaacatcaa
1260gaagtatgga acggacccgc tatcccttac cactgggcaa catctccagg tgatgtcgat
1320tatgttcaag ctaggaattt gtaccgcgtc ttagggaagc aacctggaca acaaaagaac
1380ctagctcaca acatcggtat ccatgtagag ggcgcctgcc ctggaatcca gcaacgggtt
1440tacgatatgt ttgcccgcgt agataaggga ctatctgatg cgatcaagaa agaagcagag
1500gcaaaacacg ctgctgaact ttcaaataac tctaagtttt ga
154212513PRTSaccharomyces pastorianus 12Met Ser Gly Gln Glu Glu Asn Lys
Val Asn Ser Ser Asp Val Arg Lys 1 5 10
15 Asp Arg Val Val Thr Asn Ser Thr Gly Asn Pro Ile Asn
Glu Pro Phe 20 25 30
Val Thr Gln Arg Val Gly Glu His Gly Pro Leu Leu Leu Gln Asp Tyr
35 40 45 Asn Leu Leu Asp
Ser Leu Ala His Phe Asn Arg Glu Asn Ile Pro Gln 50
55 60 Arg Asn Pro His Ala His Gly Ser
Gly Ala Phe Gly Tyr Phe Glu Val 65 70
75 80 Thr Asp Asp Ile Thr Asp Val Cys Gly Ser Ala Met
Phe Ser Lys Ile 85 90
95 Gly Lys Arg Thr Lys Cys Leu Thr Arg Phe Ser Thr Val Gly Gly Asp
100 105 110 Lys Gly Ser
Ala Asp Thr Val Arg Asp Pro Arg Gly Phe Ala Thr Lys 115
120 125 Phe Tyr Thr Glu Glu Gly Asn Leu
Asp Trp Val Tyr Asn Asn Thr Pro 130 135
140 Val Phe Phe Ile Arg Asp Pro Ser Lys Phe Pro His Phe
Ile His Thr 145 150 155
160 Gln Lys Arg Asn Pro Gln Thr Asn Leu Arg Asp Ala Asp Met Phe Trp
165 170 175 Asp Phe Leu Thr
Thr Pro Glu Asn Gln Val Ala Ile His Gln Val Met 180
185 190 Ile Leu Phe Ser Asp Arg Gly Thr Pro
Ala Ser Tyr Arg Asn Met His 195 200
205 Gly Tyr Ser Gly His Thr Tyr Lys Trp Ser Ser Lys Asn Gly
Asp Trp 210 215 220
Arg Tyr Val Gln Val His Ile Lys Thr Asn Gln Gly Val Lys Asn Leu 225
230 235 240 Thr Ile Asp Glu Ala
Thr Lys Ile Ala Gly Ser Asn Pro Asp Tyr Cys 245
250 255 Gln Lys Asp Leu Phe Glu Ser Ile Gln Ser
Gly Asn Tyr Pro Ser Trp 260 265
270 Thr Val Tyr Ile Gln Thr Met Thr Glu Gln Glu Ala Lys Asn Leu
Pro 275 280 285 Phe
Ser Val Phe Asp Leu Thr Lys Val Trp Pro Gln Lys Gln Phe Pro 290
295 300 Leu Arg Arg Val Gly Lys
Leu Val Leu Asn Glu Asn Pro Leu Asn Phe 305 310
315 320 Phe Ala Gln Val Glu Gln Ala Ala Phe Ala Pro
Ser Thr Thr Val Pro 325 330
335 Tyr Gln Glu Ala Ser Ala Asp Pro Val Leu Gln Ala Arg Leu Phe Ser
340 345 350 Tyr Ala
Asp Ala His Arg Tyr Arg Leu Gly Pro Asn Phe His Gln Ile 355
360 365 Pro Val Asn Cys Pro Tyr Ala
Ser Lys Phe Phe Asn Pro Ala Ile Arg 370 375
380 Asp Gly Pro Met Asn Val Asn Gly Asn Phe Gly Ser
Glu Pro Thr Tyr 385 390 395
400 Leu Ala Asn Asp Lys Ser Tyr Ser Tyr Ile Gln Gln Glu Arg Pro Ile
405 410 415 Gln Gln His
Gln Glu Val Trp Asn Gly Pro Ala Ile Pro Tyr His Trp 420
425 430 Ala Thr Ser Pro Gly Asp Val Asp
Tyr Val Gln Ala Arg Asn Leu Tyr 435 440
445 Arg Val Leu Gly Lys Gln Pro Gly Gln Gln Lys Asn Leu
Ala His Asn 450 455 460
Ile Gly Ile His Val Glu Gly Ala Cys Pro Gly Ile Gln Gln Arg Val 465
470 475 480 Tyr Asp Met Phe
Ala Arg Val Asp Lys Gly Leu Ser Asp Ala Ile Lys 485
490 495 Lys Glu Ala Glu Ala Lys His Ala Ala
Glu Leu Ser Asn Asn Ser Lys 500 505
510 Phe 131548DNASaccharomyces pastorianus 13atgtcgaaat
tgggacaaga aaaaaatgaa gtaaattcct ctgatgtaag agaggataga 60gttgtgacaa
actccactgg taatccaatc aatgaaccat ttgtcaccca acgtattgga 120gaacatggcc
ctttgctttt gcaagattat aacttaattg attctttggc tcatttcaac 180agggaaaata
ttcctcaaag gaatccacat gctcatggtt ctggtgcctt cggctatttt 240gaagtaaccg
atgacattac tgatatctgc gggtctgcta tgtttagtaa aattgggaaa 300agaacgaaat
gtctaacaag attttcgact gtgggtggtg ataaaggtag tgccgacacg 360gttcgtgatc
caagggggtt tgccaccaaa ttctacactg aagaaggtaa tttagattgg 420gtctacaata
atacaccggt attctttatc agagaccctt ccaagttccc tcactttatc 480cacacacaga
agagaaaccc acaaaccaac ctaagggatg ctgacatgtt ttgggatttc 540ctcaccactc
ctgaaaatca ggtggccatt catcaagtaa tgatcctttt ttcagaccgt 600ggtacccctg
ccaactaccg tagtatgcat ggttattctg gtcataccta taaatggtcc 660aataaaaacg
gagattggca ttatgtgcaa gttcatatca aaaccgatca aggaataaag 720aatttgacca
tagaagaggc taccaaaatt gcgggatcca atccagatta ctgccagcag 780gatttatttg
aggctattca gaatggaaac tatccttcct ggacagttta tattcaaaca 840atgaccgaac
gcgatgccaa aaaattacca ttttcagtct ttgatttgac taaagtatgg 900cctcaggggc
aattcccttt acggcgtgtg ggtaagattg ttttgaacga gaatccactg 960aacttcttcg
cacaggtgga acaagctgcc ttcgccccca gtaccacggt tccttaccaa 1020gaagcaagcg
ctgatccagt attacaggcc cgtttgtttt catatgcgga tgctcataga 1080tacaggctag
gtcctaactt ccatcaaata cccgtaaact gtccatatgc atctaaattt 1140ttcaatcccg
ctatcagaga tggaccgatg aatgttaacg gcaacttcgg ctcagaacct 1200acatatttgg
ccaatgataa atcgtacacg tatatccaac aggacagacc cattcaacaa 1260caccaagagg
tatggaatgg gccagctatc ccttatcatt gggcaacatc cccaggtgat 1320gtagatttcg
tgcaagcaag aaatctctat cgcgttttgg gtaaacaacc tggacagcaa 1380aagaacttgg
catataacat cggcattcat gtagaaggcg cctgtcctca aatacagcag 1440cgcgtttatg
atatgtttgc tcgtgttgat aagggactat ctgaggcaat taaaaaagta 1500gctgaggcaa
aacatgcttc tgagctttcg agtaactcca aattttga
154814515PRTSaccharomyces pastorianus 14Met Ser Lys Leu Gly Gln Glu Lys
Asn Glu Val Asn Ser Ser Asp Val 1 5 10
15 Arg Glu Asp Arg Val Val Thr Asn Ser Thr Gly Asn Pro
Ile Asn Glu 20 25 30
Pro Phe Val Thr Gln Arg Ile Gly Glu His Gly Pro Leu Leu Leu Gln
35 40 45 Asp Tyr Asn Leu
Ile Asp Ser Leu Ala His Phe Asn Arg Glu Asn Ile 50
55 60 Pro Gln Arg Asn Pro His Ala His
Gly Ser Gly Ala Phe Gly Tyr Phe 65 70
75 80 Glu Val Thr Asp Asp Ile Thr Asp Ile Cys Gly Ser
Ala Met Phe Ser 85 90
95 Lys Ile Gly Lys Arg Thr Lys Cys Leu Thr Arg Phe Ser Thr Val Gly
100 105 110 Gly Asp Lys
Gly Ser Ala Asp Thr Val Arg Asp Pro Arg Gly Phe Ala 115
120 125 Thr Lys Phe Tyr Thr Glu Glu Gly
Asn Leu Asp Trp Val Tyr Asn Asn 130 135
140 Thr Pro Val Phe Phe Ile Arg Asp Pro Ser Lys Phe Pro
His Phe Ile 145 150 155
160 His Thr Gln Lys Arg Asn Pro Gln Thr Asn Leu Arg Asp Ala Asp Met
165 170 175 Phe Trp Asp Phe
Leu Thr Thr Pro Glu Asn Gln Val Ala Ile His Gln 180
185 190 Val Met Ile Leu Phe Ser Asp Arg Gly
Thr Pro Ala Asn Tyr Arg Ser 195 200
205 Met His Gly Tyr Ser Gly His Thr Tyr Lys Trp Ser Asn Lys
Asn Gly 210 215 220
Asp Trp His Tyr Val Gln Val His Ile Lys Thr Asp Gln Gly Ile Lys 225
230 235 240 Asn Leu Thr Ile Glu
Glu Ala Thr Lys Ile Ala Gly Ser Asn Pro Asp 245
250 255 Tyr Cys Gln Gln Asp Leu Phe Glu Ala Ile
Gln Asn Gly Asn Tyr Pro 260 265
270 Ser Trp Thr Val Tyr Ile Gln Thr Met Thr Glu Arg Asp Ala Lys
Lys 275 280 285 Leu
Pro Phe Ser Val Phe Asp Leu Thr Lys Val Trp Pro Gln Gly Gln 290
295 300 Phe Pro Leu Arg Arg Val
Gly Lys Ile Val Leu Asn Glu Asn Pro Leu 305 310
315 320 Asn Phe Phe Ala Gln Val Glu Gln Ala Ala Phe
Ala Pro Ser Thr Thr 325 330
335 Val Pro Tyr Gln Glu Ala Ser Ala Asp Pro Val Leu Gln Ala Arg Leu
340 345 350 Phe Ser
Tyr Ala Asp Ala His Arg Tyr Arg Leu Gly Pro Asn Phe His 355
360 365 Gln Ile Pro Val Asn Cys Pro
Tyr Ala Ser Lys Phe Phe Asn Pro Ala 370 375
380 Ile Arg Asp Gly Pro Met Asn Val Asn Gly Asn Phe
Gly Ser Glu Pro 385 390 395
400 Thr Tyr Leu Ala Asn Asp Lys Ser Tyr Thr Tyr Ile Gln Gln Asp Arg
405 410 415 Pro Ile Gln
Gln His Gln Glu Val Trp Asn Gly Pro Ala Ile Pro Tyr 420
425 430 His Trp Ala Thr Ser Pro Gly Asp
Val Asp Phe Val Gln Ala Arg Asn 435 440
445 Leu Tyr Arg Val Leu Gly Lys Gln Pro Gly Gln Gln Lys
Asn Leu Ala 450 455 460
Tyr Asn Ile Gly Ile His Val Glu Gly Ala Cys Pro Gln Ile Gln Gln 465
470 475 480 Arg Val Tyr Asp
Met Phe Ala Arg Val Asp Lys Gly Leu Ser Glu Ala 485
490 495 Ile Lys Lys Val Ala Glu Ala Lys His
Ala Ser Glu Leu Ser Ser Asn 500 505
510 Ser Lys Phe 515 15 2403DNAPenicillium
pinophilum 15atgcgaggat tatactccct cggcgccttc gccagtctca ttgcggcggc
ttcggctgca 60tgcccaatgc tgactggcga aatcccagct ggtagtgttg ccaatcctca
tcatcacgga 120aagcgtgacg attcaaatgc ttcctccgaa acagaagcct ttctgtccga
gttctacctc 180aacgacaacg atgcctatct caccaccgat gtaggcggtc cgatcgagga
tcaaaacagt 240ttgaaggccg gcattcgtgg atcgaccctc ttggaagact tcatcttccg
tcagaaaatc 300cagcattttg atcatgagcg tgtaggttat ccattctatc acgtacttca
ggggtagttc 360tgacatgccc aggtcccgga acgtgccgtg catgctcgag gtgcaggtgc
tcatggtgta 420tttacttcat atgccgactg gtccaacatc actgctgctt catttttggg
agcttccgga 480aaggaaacgc ccacatttgt ccgcttctcg actgttgcag gcagccgagg
aagtgccgac 540accgctcgtg acgttcacgg atttgctact cgcttctata ctgacgaggg
aaactatggt 600agcctttctc tttgactcgt ccatagatag ggatgtaact gacttcaaca
gacattgttg 660gaaacaacat tcctgtcttc ttcatccaag atgctatctt attcccagat
ctcatccata 720gcgttaagcc acagccagcc aatgaaatcc cacaggctgc tactgcacac
gacacggcct 780atgacttctt tggtcaacag ccaagcactc tgcataccct cttctgggca
atggcaggcc 840atggtatccc acggtctttc cgtcatgttg acggattcgg tgtccacacc
tatcggttcg 900tgacagatga tggctcgtcc aagttggtca aatttcactg gacatcgctg
caaggtcggg 960ccagtctggt ctgggaggaa gctcaggcca ctgctggcaa aaatgccgac
tttatgagac 1020aggatctgta tgatagcatt gaggctggcc gttatccaga gtgggaggta
tgtaccaccg 1080aattcatgga aagtactcga ctaacgtgaa cagctcggcg tgcaaataat
tgaggagtcg 1140gatgtcttaa gctacggatt tgacctgttg gatccaacca agattcttcc
ggttgaaaaa 1200gttccaatta ctgcgctcgg aaaaatgcaa ctcaaccgta atccattgaa
ttactttgcc 1260gagacagagc aagtcatggt aagtcgacct tccggcactc gagtcatttc
ctactaacgt 1320ggatagttcc aacctggcca cattgttcgt ggtatcgact tcacctatta
tcctcttctc 1380cagggtcgtt tattctccta cctcgatact cagctgaatc gcaatggtgg
tcccaacttt 1440gaacaaattc caatcaatcg tccgcgtgtt cctatccaca acaacaaccg
cgatggattc 1500gcccaaatgt ttattccttt gaaccaggca gcatattcac ccaacacctt
gaataatggc 1560tctcctcgac aagccaacga gactgtcgga aatggcttct ttaccgcccc
cgggcgctcc 1620gcagatggac accttgttcg cgctacgagc ccaacatttg ccgacgtgtg
gtctcagcct 1680ggcttgtttt acaactcctt gacggctacc gaacaacagt tcgtgatcaa
tgctttgcgt 1740ttcgaattgt ctaatgtaaa gagcgaggat gttaaaagca atttcatcac
acagataaat 1800cgcgtaaaca acacgttagc aacacttgtg gcttctgcaa ttggagtctc
cgcgcccgaa 1860cccgactcta catactacca cagcaataag acgtctaatg tcggaacatt
cggtactccg 1920ttgaaaaagc ttgacggtct caaggtcgga gtccttgctt cggtgaacgg
tgaaagtagt 1980attgccgagg gacaagcatt ggcacaaagc ctagcgggct cgaacgtgga
cgtcgttatc 2040gtcgccgagc atcttacttc gaacgtgtca gctacatact ctggatcaga
cgcaacgaac 2100tttgatgctg ttattgtcag ctcaggggct gaaggtctct ttggacctca
aacctttaca 2160gccgaatcca atacaacact ttatccggca ggccgtccta gccagatttt
ggtcgatgcc 2220ttccgctttg gcaagccggt tggagcagtt ggtggtgcca gtgcagctct
gtcagcggtg 2280gatatcagta ctgatcgtag tggtgtgatt actggtgatt ccgtcagtga
cgactttgtc 2340aagcagctaa cggaggacct tgccacattc aaattcttgg accgattcgc
tgtggatgag 2400tag
240316692PRTPenicillium pinophilum 16Met Arg Gly Leu Tyr Ser
Leu Gly Ala Phe Ala Ser Leu Ile Ala Ala 1 5
10 15 Ala Ser Ala Ala Cys Pro Met Leu Thr Gly Glu
Ile Pro Ala Gly Ser 20 25
30 Val Ala Asn Pro His His His Gly Lys Arg Asp Asp Ser Asn Ala
Ser 35 40 45 Ser
Glu Thr Glu Ala Phe Leu Ser Glu Phe Tyr Leu Asn Asp Asn Asp 50
55 60 Ala Tyr Leu Thr Thr Asp
Val Gly Gly Pro Ile Glu Asp Gln Asn Ser 65 70
75 80 Leu Lys Ala Gly Ile Arg Gly Ser Thr Leu Leu
Glu Asp Phe Ile Phe 85 90
95 Arg Gln Lys Ile Gln His Phe Asp His Glu Arg Val Pro Glu Arg Ala
100 105 110 Val His
Ala Arg Gly Ala Gly Ala His Gly Val Phe Thr Ser Tyr Ala 115
120 125 Asp Trp Ser Asn Ile Thr Ala
Ala Ser Phe Leu Gly Ala Ser Gly Lys 130 135
140 Glu Thr Pro Thr Phe Val Arg Phe Ser Thr Val Ala
Gly Ser Arg Gly 145 150 155
160 Ser Ala Asp Thr Ala Arg Asp Val His Gly Phe Ala Thr Arg Phe Tyr
165 170 175 Thr Asp Glu
Gly Asn Tyr Asp Ile Val Gly Asn Asn Ile Pro Val Phe 180
185 190 Phe Ile Gln Asp Ala Ile Leu Phe
Pro Asp Leu Ile His Ser Val Lys 195 200
205 Pro Gln Pro Ala Asn Glu Ile Pro Gln Ala Ala Thr Ala
His Asp Thr 210 215 220
Ala Tyr Asp Phe Phe Gly Gln Gln Pro Ser Thr Leu His Thr Leu Phe 225
230 235 240 Trp Ala Met Ala
Gly His Gly Ile Pro Arg Ser Phe Arg His Val Asp 245
250 255 Gly Phe Gly Val His Thr Tyr Arg Phe
Val Thr Asp Asp Gly Ser Ser 260 265
270 Lys Leu Val Lys Phe His Trp Thr Ser Leu Gln Gly Arg Ala
Ser Leu 275 280 285
Val Trp Glu Glu Ala Gln Ala Thr Ala Gly Lys Asn Ala Asp Phe Met 290
295 300 Arg Gln Asp Leu Tyr
Asp Ser Ile Glu Ala Gly Arg Tyr Pro Glu Trp 305 310
315 320 Glu Leu Gly Val Gln Ile Ile Glu Glu Ser
Asp Val Leu Ser Tyr Gly 325 330
335 Phe Asp Leu Leu Asp Pro Thr Lys Ile Leu Pro Val Glu Lys Val
Pro 340 345 350 Ile
Thr Ala Leu Gly Lys Met Gln Leu Asn Arg Asn Pro Leu Asn Tyr 355
360 365 Phe Ala Glu Thr Glu Gln
Val Met Phe Gln Pro Gly His Ile Val Arg 370 375
380 Gly Ile Asp Phe Thr Tyr Tyr Pro Leu Leu Gln
Gly Arg Leu Phe Ser 385 390 395
400 Tyr Leu Asp Thr Gln Leu Asn Arg Asn Gly Gly Pro Asn Phe Glu Gln
405 410 415 Ile Pro
Ile Asn Arg Pro Arg Val Pro Ile His Asn Asn Asn Arg Asp 420
425 430 Gly Phe Ala Gln Met Phe Ile
Pro Leu Asn Gln Ala Ala Tyr Ser Pro 435 440
445 Asn Thr Leu Asn Asn Gly Ser Pro Arg Gln Ala Asn
Glu Thr Val Gly 450 455 460
Asn Gly Phe Phe Thr Ala Pro Gly Arg Ser Ala Asp Gly His Leu Val 465
470 475 480 Arg Ala Thr
Ser Pro Thr Phe Ala Asp Val Trp Ser Gln Pro Gly Leu 485
490 495 Phe Tyr Asn Ser Leu Thr Ala Thr
Glu Gln Gln Phe Val Ile Asn Ala 500 505
510 Leu Arg Phe Glu Leu Ser Asn Val Lys Ser Glu Asp Val
Lys Ser Asn 515 520 525
Phe Ile Thr Gln Ile Asn Arg Val Asn Asn Thr Leu Ala Thr Leu Val 530
535 540 Ala Ser Ala Ile
Gly Val Ser Ala Pro Glu Pro Asp Ser Thr Tyr Tyr 545 550
555 560 His Ser Asn Lys Thr Ser Asn Val Gly
Thr Phe Gly Thr Pro Leu Lys 565 570
575 Lys Leu Asp Gly Leu Lys Val Gly Val Leu Ala Ser Val Asn
Gly Glu 580 585 590
Ser Ser Ile Ala Glu Gly Gln Ala Leu Ala Gln Ser Leu Ala Gly Ser
595 600 605 Asn Val Asp Val
Val Ile Val Ala Glu His Leu Thr Ser Asn Val Ser 610
615 620 Ala Thr Tyr Ser Gly Ser Asp Ala
Thr Asn Phe Asp Ala Val Ile Val 625 630
635 640 Ser Ser Gly Ala Glu Gly Leu Phe Gly Pro Gln Thr
Phe Thr Ala Glu 645 650
655 Ser Asn Thr Thr Leu Tyr Pro Ala Gly Arg Pro Ser Gln Ile Leu Val
660 665 670 Asp Ala Phe
Arg Phe Gly Lys Pro Val Gly Ala Val Gly Gly Ala Ser 675
680 685 Ala Ala Leu Ser 690
172749DNAHumicola grisea 17atgaacagag tcacgaatct cctcgcctgg gccggcgcga
tagggctcgc ccaagcaaca 60tgccccttcg cggaccctgc cgctctgtat aggcgtcagg
atactaccag cggccagtcg 120ccacttgcag catacgaggt ggatgacagc accggatacc
tgacctccga tgttggcggg 180cccattcagg accagaccag cctcaaggca ggcatccggg
gtccgaccct tcttgaggac 240tttatgttcc gccagaagat ccagcacttc gaccatgaac
gggtaaggac ataatgctca 300cacgagcggc tgcgtaccta tttattttga acgggtaagg
acataatgct cacacgagcg 360gctgcgtacc tatttatttc cgagagatgg gctggctggc
tggctgtgat gcctgagttt 420ggggacatac ggagtacctt actgacgcgc taatccactc
caggttcccg aaagggcggt 480ccatgctcga ggcgctggag cacacgggac cttcacgagt
tacgccgact ggagtaacat 540caccgcggcg tcctttctga acgccacagg aaagcagacg
ccggtgtttg tccggttctc 600gaccgttgct gggtctcgag ggagcgcaga cacggcgaga
gacgttcatg gtttcgcgac 660gcggttgtaa gttttgttgt gtttcattcg ttccggtctg
tagaggaggg ttaggatatg 720agctaacgtg tgtgtgtgtg tgtgaagtta cactgatgaa
ggcaactttg gtacgtccca 780cgcatggtcc tcaattctct tatctggcag cgatgtggtc
attgtcgacg ttgctaactt 840gcgtagatat cgtcggaaac aacatcccgg tattcttcat
tcaagatgca atccagttcc 900ctgaccttat ccactcggtc aagccgagtc cagacaacga
gattccccaa gcggcgacgg 960ctcatgattc agcttgggac ttcttcagcc agcagccaag
cgccatggta agcaatggac 1020caaggagccg cacctggggt gacataccag ggagtacacg
gggcgttccg atgaccctcg 1080tgtgaccaag gcagtacaac actccacgga ggactcgaag
agattcggaa atatggaaca 1140cagaactgac aggatggtag cacacgttgt tctgggccat
gtctggccac ggaatccctc 1200gcagctatcg ccatatggta cgtttgcctg gctgagatga
ccgtgaatcc atttctaacc 1260tcaagtccag gatggcttcg gcgtccacac gttccggttt
gtcaaagatg acggctcgtc 1320caagttgatc aagtggcatt tcaagtcacg ccagggaaag
gcgagtctag tctgggaaga 1380ggcgcaggtt ctttctggca agaatgccga cttccaccgt
caggacctct gggatgctat 1440tgagtccggg aacggaccag aatgggatgt ctgcgtccag
attgtcgatg agtcccaggc 1500gcaagccttt ggcttcgact tgctggaccc gacaaagatc
atccccgagg agtacgcccc 1560cttgacgaaa ctggggctct tgaagctgga tcgcaatccg
accaactact tcgccgagac 1620ggagcaggtc atgttccaac ccggtcatat agtccgcggc
gtcgacttca cggaggatcc 1680cctgctacag ggacgtctct tctcgtacct tgacacgcag
ctgaaccgga atggcgggcc 1740caactttgag cagctgccca tcaacatgcc gcgggtgccg
attcacaaca ataatcgcga 1800cggcgccggc cagatgttca tccacaggaa caagtatcct
tgtaagtacc tcttttgcct 1860cgatcgttgt ggtgccggct tgctgacaga cgcagacact
cccaacaccc tgaacagtgg 1920ttatccgcgg caagccaacc aaaatgccgg acgcggattc
ttcacagcgc ctggccgtac 1980cgtcagcggt gccctcgtcc gtgaggtgtc gccaacattc
aacgaccact ggtcgcagcc 2040ccgtctcttc ttcaactccc tcactcccgt cgaacagcag
ttcctcgtca acgccatgcg 2100cttcgaaatc agccttgtga agtcggaaga atgcaggaag
aacgtgctca cccagctcaa 2160ccgcgtcagc catgatgtgg ccgtgcgcgt ggccgccgct
atcggcctcg ccgcgcccga 2220cgcggacgac acatactacc acaacaacaa gacggctggc
gtctcgatcc ttggaagcgg 2280gcccttgcct accatcaaga ctctccgcgt cggcatcctg
gctaccacga gcgagtcgag 2340cgcgctggat caggcagccc agctccgcac ccgtctggaa
aaggacgggc ttgtggtcac 2400ggttgtggct gaaacgctgc gcgagggggt agaccagaca
tactcgacgg cggatgccac 2460gggtttcgac ggcgttgttg ttgtggacgg ggcggcggcg
ctgtttgcca gcaccgcgtc 2520gtcgccgttg ttcccgacgg gcaggccgtt gcagatcttt
gtggacgcgt atcggtgggg 2580aaagccggtc ggtgtgtgtg gtgggaagtc gagcgaggtg
ttggatgcgg cggatgttcc 2640ggaaaatggg gacggggtgt attcggagga gtcggtggac
aagtttgtgg aggagtttga 2700gaaggggttg gctactttca gggtgagtct tggtgccttt
gttttttga 274918684PRTHumicola grisea 18Met Asn Arg Val Thr
Asn Leu Leu Ala Trp Ala Gly Ala Ile Gly Leu 1 5
10 15 Ala Gln Ala Thr Cys Pro Phe Ala Asp Pro
Ala Ala Leu Tyr Arg Arg 20 25
30 Gln Asp Thr Thr Ser Gly Gln Ser Pro Leu Ala Ala Tyr Glu Val
Asp 35 40 45 Asp
Ser Thr Gly Tyr Leu Thr Ser Asp Val Gly Gly Pro Ile Gln Asp 50
55 60 Gln Thr Ser Leu Lys Ala
Gly Ile Arg Gly Pro Thr Leu Leu Glu Asp 65 70
75 80 Phe Met Phe Arg Gln Lys Ile Gln His Phe Asp
His Glu Arg Val Pro 85 90
95 Glu Arg Ala Val His Ala Arg Gly Ala Gly Ala His Gly Thr Phe Thr
100 105 110 Ser Tyr
Ala Asp Trp Ser Asn Ile Thr Ala Ala Ser Phe Leu Asn Ala 115
120 125 Thr Gly Lys Gln Thr Pro Val
Phe Val Arg Phe Ser Thr Val Ala Gly 130 135
140 Ser Arg Gly Ser Ala Asp Thr Ala Arg Asp Val His
Gly Phe Ala Thr 145 150 155
160 Arg Phe Tyr Thr Asp Glu Gly Asn Phe Asp Ile Val Gly Asn Asn Ile
165 170 175 Pro Val Phe
Phe Ile Gln Asp Ala Ile Gln Phe Pro Asp Leu Ile His 180
185 190 Ser Val Lys Pro Ser Pro Asp Asn
Glu Ile Pro Gln Ala Ala Thr Ala 195 200
205 His Asp Ser Ala Trp Asp Phe Phe Ser Gln Gln Pro Ser
Ala Met His 210 215 220
Thr Leu Phe Trp Ala Met Ser Gly His Gly Ile Pro Arg Ser Tyr Arg 225
230 235 240 His Met Asp Gly
Phe Gly Val His Thr Phe Arg Phe Val Lys Asp Asp 245
250 255 Gly Ser Ser Lys Leu Ile Lys Trp His
Phe Lys Ser Arg Gln Gly Lys 260 265
270 Ala Ser Leu Val Trp Glu Glu Ala Gln Val Leu Ser Gly Lys
Asn Ala 275 280 285
Asp Phe His Arg Gln Asp Leu Trp Asp Ala Ile Glu Ser Gly Asn Gly 290
295 300 Pro Glu Trp Asp Val
Cys Val Gln Ile Val Asp Glu Ser Gln Ala Gln 305 310
315 320 Ala Phe Gly Phe Asp Leu Leu Asp Pro Thr
Lys Ile Ile Pro Glu Glu 325 330
335 Tyr Ala Pro Leu Thr Lys Leu Gly Leu Leu Lys Leu Asp Arg Asn
Pro 340 345 350 Thr
Asn Tyr Phe Ala Glu Thr Glu Gln Val Met Phe Gln Pro Gly His 355
360 365 Ile Val Arg Gly Val Asp
Phe Thr Glu Asp Pro Leu Leu Gln Gly Arg 370 375
380 Leu Phe Ser Tyr Leu Asp Thr Gln Leu Asn Arg
Asn Gly Gly Pro Asn 385 390 395
400 Phe Glu Gln Leu Pro Ile Asn Met Pro Arg Val Pro Ile His Asn Asn
405 410 415 Asn Arg
Asp Gly Ala Gly Gln Met Phe Ile His Arg Asn Lys Tyr Pro 420
425 430 Tyr Thr Pro Asn Thr Leu Asn
Ser Gly Tyr Pro Arg Gln Ala Asn Gln 435 440
445 Asn Ala Gly Arg Gly Phe Phe Thr Ala Pro Gly Arg
Thr Val Ser Gly 450 455 460
Ala Leu Val Arg Glu Val Ser Pro Thr Phe Asn Asp His Trp Ser Gln 465
470 475 480 Pro Arg Leu
Phe Phe Asn Ser Leu Thr Pro Val Glu Gln Gln Phe Leu 485
490 495 Val Asn Ala Met Arg Phe Glu Ile
Ser Leu Val Lys Ser Glu Glu Cys 500 505
510 Arg Lys Asn Val Leu Thr Gln Leu Asn Arg Val Ser His
Asp Val Ala 515 520 525
Val Arg Val Ala Ala Ala Ile Gly Leu Ala Ala Pro Asp Ala Asp Asp 530
535 540 Thr Tyr Tyr His
Asn Asn Lys Thr Ala Gly Val Ser Ile Leu Gly Ser 545 550
555 560 Gly Pro Leu Pro Thr Ile Lys Thr Leu
Arg Val Gly Ile Leu Ala Thr 565 570
575 Thr Ser Glu Ser Ser Ala Leu Asp Gln Ala Ala Gln Leu Arg
Thr Arg 580 585 590
Leu Glu Lys Asp Gly Leu Val Val Thr Val Val Ala Glu Thr Leu Arg
595 600 605 Glu Gly Val Asp
Gln Thr Tyr Ser Thr Ala Asp Ala Thr Gly Phe Asp 610
615 620 Gly Val Val Val Val Asp Gly Ala
Ala Ala Leu Phe Ala Ser Thr Ala 625 630
635 640 Ser Ser Pro Leu Phe Pro Thr Gly Arg Pro Leu Gln
Ile Phe Val Asp 645 650
655 Ala Tyr Arg Trp Gly Lys Pro Val Gly Val Cys Gly Gly Lys Ser Ser
660 665 670 Glu Val Leu
Asp Ala Ala Asp Val Pro Glu Asn Gly 675 680
19 2362DNAThielavia terrestris 19cgcaagatcg aagcggtcca
gattcatgac cttgtcccac gcggccacga agtccttgac 60gaacttttcc tgaccccctg
cgctggcata cacctcggcg atggcgcgca gctcggcatg 120cgagccaaac acgagatcgg
cccgcgtggc cgtccacttc ttggcgcccg tcgcacggtc 180ggtgccctcg aacagctcgc
cgtcgaccga cttccacgcc gtgcgcatgt ccaggaggtt 240cacgaagaag tcgttggtca
gcttgccggg ccgggtggtg aagacgccgt gcgaggagcc 300gtcgtagttg gcgcccagga
cgcgcagccc gcccacgagc accgtcagct ccggcggcgt 360cagcgtcagc agctgcgccc
ggtcgacgag gaggtgctcc gtcggcacgc gcgccgtgcc 420gcggccgtag ttgcggaagc
cgtcggcata cggctcgagg tgcgcgaacg acgccacgtc 480ggtctgctcc tgcgacgcgt
cggtgcggcc gggccggaag ggcaccggca cgccggccgc 540ctgctcgagc gccgccacgc
cgcccagcac gatcaggtcc gccagcgaca ccttcttgcc 600gcccgccgcc gacgcgttga
acttggcctg cacgccctcg agcgcctgca gcacctcggc 660cagctgcggc gggttgttga
ccttccagtc cttctgcggc gccagccgga tgcgcgcgcc 720gttggcgccg ccgcgcttgt
cgctgccgcg gaacgtcgac gccgacgccc acgccaccga 780gatgagcttg gccggggcca
cgcccgtggc caggatgtcg cgcttcagcg cggcgatgtc 840gctgtcgtcg acgagcgggt
ggtcgaccgg cggcacgtag tcctcccaga ggagcacctc 900ggacgggatc tccgggccga
gccagcgcga acgcgggccc atgtcgcggt gcagcagctt 960gaaccaggcg cgcgcgaacg
cgtcggcgaa ctggtctggg tgctcgaggt agtggcgcgc 1020gatcttctcg tacaccgggt
cgaagcgcag cgccaggtcc gtcgtgagca tgcgcggccg 1080gtgcttcttg ctcgggtcgt
acgcgtccgg aatgaaagcg tcggcgttct tggccaccca 1140ctggttggcg ccggcggggc
tcttggtgag ctcccactcg aacttgaaga ggtactccaa 1200gaagttggtg ctccaccggg
tcggcgtctt ggtccagatg acctcgagcc cactggtgat 1260ggtatcaggg cccttgccgg
agccgtgctt gttggcccag ccgagaccct gctgctccag 1320gccggccccc tcgggctcct
tgccgacgtt gtccgagggg gccgcgccgt gcgtcttgcc 1380gaacgtgtgg ccgccggcga
tcagggccac cgtctcctcg tcgttcatgg ccatgcggct 1440gaaggtcgtg cggatgtccc
gcgcggccgc cacggggtcg gggatgccgt ccggaccctc 1500ggggttgacg tagatcaggc
ccatgtgggc ggccgccaag ggcgactcga gatcgcgcga 1560gtggatgtcc ttgttgacct
tcttggactc gtcgccgccc gtgacgccgt cgccggcgat 1620gcccgcctgg ccgtccgagt
agcggacatc gttgccgagc cacgtcgtct cgccgcccca 1680gtaggtcgac tcgtcagcct
cccacgtgtc cggacggccg ccggcaaagc cgaacgtctt 1740gaggcccatg gactcgaggg
cgacgttgcc agtcagcagc agcaggtcgg cccacgagat 1800cttgtcgccg tacttctgct
tgatgggcca cagcaggcgg cgggccttgt cgagactgac 1860gttgtcgggc cagctgttga
gtggcgcaaa ccgctgctgg ccctggccgc cgccgccgcg 1920gccgtcgaag acgcggtagg
tgccggcgct gtgccaggcc atgcggatga acagaccgcc 1980gtagtggccg aagtcggccg
gccaccagtc ctgcgagtcg gtcatgagcg cccgcaggtc 2040ctgcttcagc gcgtcatagt
caagcgactt gaaggcggcc ctatagtcga agtccttgta 2100cgggctcgac gccggctggt
gctggcggag gatgtggagc ggcagccggt tcggccacca 2160gtcggtgttt cgagtaccgc
cgccggcggc gttggcgaac ctgttcgggc actcacccat 2220cttctcgctt gttcaatctt
ccgtcgctgt ggctgctggg tgcttgtgcg tgtgtgtgtg 2280tgtgtgtgtg agtcggagtg
tgtatgtgtc tgtttgtttg tgtgggttgc cagaacgtaa 2340gctgcgaaac aaaccgccac
tg 236220740PRTThielavia
terrestris 20Met Arg Ala Lys His Glu Ile Gly Pro Arg Gly Arg Pro Leu Leu
Gly 1 5 10 15 Ala
Arg Arg Thr Val Gly Ala Leu Glu Gln Leu Ala Val Asp Arg Leu
20 25 30 Pro Arg Arg Ala His
Val Gln Glu Val His Glu Glu Val Val Gly Gln 35
40 45 Leu Ala Gly Pro Gly Gly Glu Asp Ala
Val Arg Gly Ala Val Val Val 50 55
60 Gly Ala Gln Asp Ala Gln Pro Ala His Glu His Arg Gln
Leu Arg Arg 65 70 75
80 Arg Gln Arg Gln Gln Leu Arg Pro Val Asp Glu Glu Val Leu Arg Arg
85 90 95 His Ala Arg Arg
Ala Ala Ala Val Val Ala Glu Ala Val Gly Ile Arg 100
105 110 Leu Glu Val Arg Glu Arg Arg His Val
Gly Leu Leu Leu Arg Arg Val 115 120
125 Gly Ala Ala Gly Pro Glu Gly His Arg His Ala Gly Arg Leu
Leu Glu 130 135 140
Arg Arg His Ala Ala Gln His Asp Gln Val Arg Gln Arg His Leu Leu 145
150 155 160 Ala Ala Arg Arg Arg
Arg Val Glu Leu Gly Leu His Ala Leu Glu Arg 165
170 175 Leu Gln His Leu Gly Gln Leu Arg Arg Val
Val Asp Leu Pro Val Leu 180 185
190 Leu Arg Arg Gln Pro Asp Ala Arg Ala Val Gly Ala Ala Ala Leu
Val 195 200 205 Ala
Ala Ala Glu Arg Arg Arg Arg Arg Pro Arg His Arg Asp Glu Leu 210
215 220 Gly Arg Gly His Ala Arg
Gly Gln Asp Val Ala Leu Gln Arg Gly Asp 225 230
235 240 Val Ala Val Val Asp Glu Arg Val Val Asp Arg
Arg His Val Val Leu 245 250
255 Pro Glu Glu His Leu Gly Arg Asp Leu Arg Ala Glu Pro Ala Arg Thr
260 265 270 Arg Ala
His Val Ala Val Gln Gln Leu Glu Pro Gly Ala Arg Glu Arg 275
280 285 Val Gly Glu Leu Val Trp Val
Leu Glu Val Val Ala Arg Asp Leu Leu 290 295
300 Val His Arg Val Glu Ala Gln Arg Gln Val Arg Arg
Glu His Ala Arg 305 310 315
320 Pro Val Leu Leu Ala Arg Val Val Arg Val Arg Asn Glu Ser Val Gly
325 330 335 Val Leu Gly
His Pro Leu Val Gly Ala Gly Gly Ala Leu Gly Glu Leu 340
345 350 Pro Leu Glu Leu Glu Glu Val Leu
Gln Glu Val Gly Ala Pro Pro Gly 355 360
365 Arg Arg Leu Gly Pro Asp Asp Leu Glu Pro Thr Gly Asp
Gly Ile Arg 370 375 380
Ala Leu Ala Gly Ala Val Leu Val Gly Pro Ala Glu Thr Leu Leu Leu 385
390 395 400 Gln Ala Gly Pro
Leu Gly Leu Leu Ala Asp Val Val Arg Gly Gly Arg 405
410 415 Ala Val Arg Leu Ala Glu Arg Val Ala
Ala Gly Asp Gln Gly His Arg 420 425
430 Leu Leu Val Val His Gly His Ala Ala Glu Gly Arg Ala Asp
Val Pro 435 440 445
Arg Gly Arg His Gly Val Gly Asp Ala Val Arg Thr Leu Gly Val Asp 450
455 460 Val Asp Gln Ala His
Val Gly Gly Arg Gln Gly Arg Leu Glu Ile Ala 465 470
475 480 Arg Val Asp Val Leu Val Asp Leu Leu Gly
Leu Val Ala Ala Arg Asp 485 490
495 Ala Val Ala Gly Asp Ala Arg Leu Ala Val Arg Val Ala Asp Ile
Val 500 505 510 Ala
Glu Pro Arg Arg Leu Ala Ala Pro Val Gly Arg Leu Val Ser Leu 515
520 525 Pro Arg Val Arg Thr Ala
Ala Gly Lys Ala Glu Arg Leu Glu Ala His 530 535
540 Gly Leu Glu Gly Asp Val Ala Ser Gln Gln Gln
Gln Val Gly Pro Arg 545 550 555
560 Asp Leu Val Ala Val Leu Leu Leu Asp Gly Pro Gln Gln Ala Ala Gly
565 570 575 Leu Val
Glu Thr Asp Val Val Gly Pro Ala Val Glu Trp Arg Lys Pro 580
585 590 Leu Leu Ala Leu Ala Ala Ala
Ala Ala Ala Val Glu Asp Ala Val Gly 595 600
605 Ala Gly Ala Val Pro Gly His Ala Asp Glu Gln Thr
Ala Val Val Ala 610 615 620
Glu Val Gly Arg Pro Pro Val Leu Arg Val Gly His Glu Arg Pro Gln 625
630 635 640 Val Leu Leu
Gln Arg Val Ile Val Lys Arg Leu Glu Gly Gly Pro Ile 645
650 655 Val Glu Val Leu Val Arg Ala Arg
Arg Arg Leu Val Leu Ala Glu Asp 660 665
670 Val Glu Arg Gln Pro Val Arg Pro Pro Val Gly Val Ser
Ser Thr Ala 675 680 685
Ala Gly Gly Val Gly Glu Pro Val Arg Ala Leu Thr His Leu Leu Ala 690
695 700 Cys Ser Ile Phe
Arg Arg Cys Gly Cys Trp Val Leu Val Arg Val Cys 705 710
715 720 Val Cys Val Cys Glu Ser Glu Cys Val
Cys Val Cys Leu Phe Val Trp 725 730
735 Val Ala Arg Thr 740 21
1449DNABacillus thermoglucosidasius 21atgagttcaa ataaactgac aactagctgg
ggagcacctg ttggcgataa ccaaaactcg 60ataacggccg gcaatcctgg cccgacatta
atccaagacg tacatcttat cgaaaaatta 120gcacacttca atagagaacg tgtcccagaa
cgtgttgtcc atgcgaaagg cgctggtgcg 180cacggctatt tcgaagtaac aaacgatatg
tcgaaataca caaaagcgaa agtgtttaac 240ggtgttggca aacgcacgcc tgtattcgtc
cgcttctcta ctgtcgccgg tgaattggga 300tctgcggata cagtccgcga cccgcgcggt
tttgccgtca aattttatac cgaagaagga 360aactatgaca tcgttggcaa caacacaccg
attttcttca ttcgtgatgc gattaaattc 420tcggatttta tccatacaca aaaacgcgac
ccgcgcaccc atttgattta tccgacagca 480atgtgggatt tcttgtcttt atctccggaa
tctttgcacc aagtcactta tttattcggg 540gatcgcggca tcccattgac ataccgccat
atgaacggat acggaagcca tacattcaaa 600tgggtgaatg aaaaaggcga agcggtatgg
gtaaaatacc actttaaaac aaaccaaggc 660gtgaaaaaca tggatccgga actagcggtt
aaaatcgccg gagaaaatcc ggattaccat 720acggaagatt tatataacgc catcgaaaaa
ggcgactatc catcttggac attatatgtg 780caaattatgc cgttagaaga cgcaaaaaca
taccgtttca atccatttga tgtcacaaaa 840gtttggtcac ataaagatta tccgttaatt
gaagtcggcc gtatggtatt aaaccgcaat 900ccagaaaatt attttgccga agtcgaacaa
gcgacattct ctcctggaaa ccttgttcct 960ggcgttgaac catcgccgga taaaatcttg
caagcccgtt tgttcgctta tgcggatgcg 1020caccgttacc gcgtcggcgt gaaccataac
ttgcttccga tcaaccgccc gcgcgtggaa 1080gtaaacaatt atcaacgtga cggcttcatg
cgctttgaca ataatggcgg cggttcggtc 1140aactacgaac caaacagctt cggcggaccg
acagaagtgc cagaacataa aacgacccca 1200ttcccggtat ccggcgtggc agaaagcgtg
ccatatgacg acgatgatca ttatacgcaa 1260gcaggcgact tataccgtct catgagcgaa
gaagaaaaag cgcgccttgt gaaaaacatt 1320gtcgaatcat tgaaacaagt aacaaaagaa
gaaattaaac ttcgccaaat ccgccacttc 1380tacaaagcag accctgacta cggccgccgc
gttgccgaag gtcttggatt gccgattaaa 1440aaagattct
144922483PRTBacillus thermoglucosidasius
22Met Ser Ser Asn Lys Leu Thr Thr Ser Trp Gly Ala Pro Val Gly Asp 1
5 10 15 Asn Gln Asn Ser
Ile Thr Ala Gly Asn Pro Gly Pro Thr Leu Ile Gln 20
25 30 Asp Val His Leu Ile Glu Lys Leu Ala
His Phe Asn Arg Glu Arg Val 35 40
45 Pro Glu Arg Val Val His Ala Lys Gly Ala Gly Ala His Gly
Tyr Phe 50 55 60
Glu Val Thr Asn Asp Met Ser Lys Tyr Thr Lys Ala Lys Val Phe Asn 65
70 75 80 Gly Val Gly Lys Arg
Thr Pro Val Phe Val Arg Phe Ser Thr Val Ala 85
90 95 Gly Glu Leu Gly Ser Ala Asp Thr Val Arg
Asp Pro Arg Gly Phe Ala 100 105
110 Val Lys Phe Tyr Thr Glu Glu Gly Lys Tyr Asp Ile Val Gly Asn
Asn 115 120 125 Thr
Pro Ile Phe Phe Ile Arg Asp Ala Ile Lys Phe Ser Asp Phe Ile 130
135 140 His Thr Gln Lys Arg Asp
Pro Arg Thr His Leu Ile Tyr Pro Thr Ala 145 150
155 160 Met Trp Asp Phe Leu Ser Leu Ser Pro Glu Ser
Leu His Gln Val Thr 165 170
175 Tyr Leu Phe Gly Asp Arg Gly Ile Pro Leu Thr Tyr Arg His Met Asn
180 185 190 Gly Tyr
Gly Ser His Thr Phe Lys Trp Val Asn Glu Lys Gly Glu Ala 195
200 205 Val Trp Val Lys Tyr His Phe
Lys Thr Asn Gln Gly Val Lys Asn Met 210 215
220 Asp Pro Glu Leu Ala Val Lys Ile Ala Gly Glu Asn
Pro Asp Tyr His 225 230 235
240 Thr Glu Asp Leu Tyr Asn Ala Ile Glu Lys Gly Asp Tyr Pro Ser Trp
245 250 255 Thr Leu Tyr
Val Gln Ile Met Pro Leu Glu Asp Ala Lys Thr Tyr Arg 260
265 270 Phe Asn Pro Glu Asp Val Thr Lys
Val Trp Ser His Lys Asp Tyr Pro 275 280
285 Leu Ile Glu Val Gly Arg Met Val Leu Asn Arg Asn Pro
Glu Asn Tyr 290 295 300
Phe Ala Glu Val Glu Gln Ala Thr Phe Ser Pro Gly Asn Leu Val Pro 305
310 315 320 Gly Val Glu Pro
Ser Pro Asp Lys Ile Leu Gln Ala Arg Leu Phe Ala 325
330 335 Tyr Ala Asp Ala His Arg Tyr Arg Val
Gly Val Asn His Asn Leu Leu 340 345
350 Pro Ile Asn Arg Pro Arg Val Glu Val Asn Asn Tyr Gln Arg
Asp Gly 355 360 365
Phe Met Arg Phe Asp Asn Asn Gly Gly Gly Ser Val Asn Tyr Glu Pro 370
375 380 Asn Ser Phe Gly Gly
Pro Thr Glu Val Pro Glu His Lys Thr Thr Pro 385 390
395 400 Phe Pro Val Ser Gly Val Ala Glu Ser Val
Pro Tyr Asp Asp Asp Asp 405 410
415 His Tyr Thr Gln Ala Gly Asp Leu Tyr Arg Leu Met Ser Glu Glu
Glu 420 425 430 Lys
Ala Arg Leu Val Lys Asn Ile Val Glu Ser Leu Lys Gln Val Thr 435
440 445 Lys Glu Glu Ile Lys Leu
Arg Gln Ile Arg His Phe Tyr Lys Ala Asp 450 455
460 Pro Asp Tyr Gly Arg Arg Val Ala Glu Gly Leu
Gly Leu Pro Ile Lys 465 470 475
480 Lys Asp Ser 233210DNAAspergillus oryzae 23ccccccagaa gggtccaaag
gcccagtccc ttccccttcg gtcttaacaa aagacggacc 60tttacgacgg aatttcgaag
taaggggcca gggcattcgt tccctggaaa gaagagcgag 120cttctggtgg agactctcga
taagcccgat aagaaaagca ctcgactctc cttcccgatg 180acgaagttca taaacaagga
acataatcag taagaagata ttctgaccaa taccaatgga 240ttcgaataat cattactcat
catcttgtat tcttcataag agaaaacagt atcagaaaaa 300gaaaaaaaac cagttcgatg
tcaacgtgac gttcatcaac cctgcgacgt cattttgacg 360tgcgggaccc atcacaacat
ccattcagaa gtttttcatt actggaaaag ctataagaag 420ctgaagaata atacatttct
tgttctcatg caagtaatga ccgtttcatg aacatagcct 480cgatcccacc ttaatctatc
tccgactatc ttatcgtccc cataatcatc atatccatca 540gaccccaatc atggccaata
ttgtggctgg gggcctccac aaggttcaag aagcagtgca 600gggcgctgct tccaaggata
agaagctagt tgacctagca cccgacaccc ataatgtaca 660gtccagcaag gagccactga
ccaccgacca tggtgtgcgt atcagcgata cggaccactg 720gctgaaggag gtgaatgaca
accacaccgg tcctatgatg cttgaggacc agattgcacg 780agagaaggta tgattccccg
aatcggtatg ggtcagaacc atagattgaa cgaatcgcaa 840cccagattca tcgtttcgat
catgagcgca ttcccgagag agtcgtccat gcgcgtggca 900ccgctgcatt cggaaacttc
aagctccatg agagcgctga agatgtatcc tacgctggta 960tcttgacgga tacctcaagg
aacactccgg ttttccttcg tttctccacg gtccagggca 1020gtaaaggaag tgccgacacc
gtccgtgacg ttcgtgggtt tgccgtgaaa ttctacaccg 1080acgaaggaaa ttgggatctg
gttggaaaca acatccccgt tttctttatc caagatgcga 1140ttaagttccc ggattttggt
acgtacctcc tcccaactat gagtccaaaa ctctagagct 1200aacgagtgta gtccatgctg
ttaagcccga gccgcacaac gaggtaccac aggcccaaac 1260tgctcacaac aacttctggg
actttgtcta tcttcacccg gaagccaccc atatgttcat 1320gtgggccatg tctgatcggg
ccattcctcg gtcataccgt atgatgcagg gtttcggtgt 1380caacacattc agtctcatca
acaaggaagg aaagcgccat tttgtcaagt tccatttcat 1440cccccacctg ggagtgcact
ctttggtgtg ggacgaggct ctgaaactgg ctggccagga 1500ccccgatttc catcgcaagg
atctcatgga ggccattgat aacggcgcat acccgaaatg 1560ggacttcgcc atccaggtca
tccctgagga gaaacaggat gacttcgaat ttgacatttt 1620cgacgcgacg aagatctggc
ccgaggagct cgtgcctctg cgcgtgatcg gcgaactgga 1680actgaaccgc aacgtcgacg
agttcttccc tcaaaccgag caagtcgcct tctgcaccag 1740ccacatcgtc cccggcattg
acttcagtga cgacccgctt ctccagggcc gtaacttctc 1800ctacttcgac actcagatca
gtcgactggg catcaactgg gaagaaatcc ccatcaaccg 1860ccccgtctgc cccgttctga
accacaaccg agacggcgcc aaacgccacc gcatcgccca 1920gggcactgtc actacttggt
cgaaccggtc cgaggccgga ccacccgcac cagtagaaca 1980tggtggcttc gcgtcctacc
ctgcgaaact gaacggtatc aagaagcgcg gcctgagccc 2040caagttccgc gagcaccaca
accaggctca actcttctac aactctctct ccgagcacga 2100gaaggtccac gtcaagaagg
ccttcggctt cgaactggac cactgcgacg accccatcgt 2160ctacgagcgc ctcgccggcc
accgtctcgc cgagatcgat ctcactctcg cccaggaagt 2220cgccgagctc gtcggcgccc
cgatcccaga caaggcactt cgcccgaacc atggaaagcg 2280cagcaagcat ctttcgcaga
ccgagttccc gggtaagcag ccgacgatcg ccagtcgccg 2340aatcgccatc attatcggcg
acggatacga ccccgtcgct ttcaatggcc tcaagggcgc 2400catcacggcg gttggagcct
taccgttcgt cattggcacc aagcggtcac ctatctacgc 2460cgacggtgag gacaaatcat
cttccaaggg cgtgatcgcc gaccaccagt atgacggaca 2520gcgttcgacg atgtttgacg
ctaccttcat ccctggcggt ccgcacgtcg aaagcctcaa 2580ggccaatggc cagatccggt
actggatcat tgagacattc ggtcatctca aggctctggg 2640cgccactggt gaagcggcgg
ctttcatcaa ggaagccctg ggctccgcgc ttgatgtgaa 2700ggtcgctacg tctgataacc
cccagccggt tgagtggtat ggtgttgtca cggctggaaa 2760gatccacaaa cctgagagct
tcaaggaggg tatccagatt gtcaaggatg cgaaggattt 2820cattagcacc ttcttctacc
agatcagtca gcatcggaac tacaagcgtg aactggatgg 2880cctcgcctcg acagttgcat
tctaaatgct ttcgtgattg gttgaggaca tggaggcttg 2940tgttaacgca aaagtggcat
tttagttaat gtcatccttg taatgaatta tgtctctaac 3000tgtggatggc cagaatgtac
gctaatatga atcatgaaaa tactctattc taattgtgaa 3060tgtgaaagtg aaacggcgtc
gaaaggtagt tatcaatgtt atcctgaggt atctaatata 3120caacatcttt gatattgtag
gaaagaaacg taagaaagga tcgtacatag tggggtatca 3180taatctggta cagcgtccaa
ggctcggctt 321024747PRTAspergillus
oryzae 24Met Ala Asn Ile Val Ala Gly Gly Leu His Lys Val Gln Glu Ala Val
1 5 10 15 Gln Gly
Ala Ala Ser Lys Asp Lys Lys Leu Val Asp Leu Ala Pro Asp 20
25 30 Thr His Asn Val Gln Ser Ser
Lys Glu Pro Leu Thr Thr Asp His Gly 35 40
45 Val Arg Ile Ser Asp Thr Asp His Trp Leu Lys Glu
Val Asn Asp Asn 50 55 60
His Thr Gly Pro Met Met Leu Glu Asp Gln Ile Ala Arg Glu Lys Ile 65
70 75 80 His Arg Phe
Asp His Glu Arg Ile Pro Glu Arg Val Val His Ala Arg 85
90 95 Gly Thr Ala Ala Phe Gly Asn Phe
Lys Leu His Glu Ser Ala Glu Asp 100 105
110 Val Ser Tyr Ala Gly Ile Leu Thr Asp Thr Ser Arg Asn
Thr Pro Val 115 120 125
Phe Leu Arg Phe Ser Thr Val Gln Gly Ser Lys Gly Ser Ala Asp Thr 130
135 140 Val Arg Asp Val
Arg Gly Phe Ala Val Lys Phe Tyr Thr Asp Glu Gly 145 150
155 160 Asn Trp Asp Leu Val Gly Asn Asn Ile
Pro Val Phe Phe Ile Gln Asp 165 170
175 Ala Ile Lys Phe Pro Asp Phe Val His Ala Val Lys Pro Glu
Pro His 180 185 190
Asn Glu Val Pro Gln Ala Gln Thr Ala His Asn Asn Phe Trp Asp Phe
195 200 205 Val Tyr Leu His
Pro Glu Ala Thr His Met Phe Met Trp Ala Met Ser 210
215 220 Asp Arg Ala Ile Pro Arg Ser Tyr
Arg Met Met Gln Gly Phe Gly Val 225 230
235 240 Asn Thr Phe Ser Leu Ile Asn Lys Glu Gly Lys Arg
His Phe Val Lys 245 250
255 Phe His Phe Ile Pro His Leu Gly Val His Ser Leu Val Trp Asp Glu
260 265 270 Ala Leu Lys
Leu Ala Gly Gln Asp Pro Asp Phe His Arg Lys Asp Leu 275
280 285 Met Glu Ala Ile Asp Asn Gly Ala
Tyr Pro Lys Trp Asp Phe Ala Ile 290 295
300 Gln Val Ile Pro Glu Glu Lys Gln Asp Asp Phe Glu Phe
Asp Ile Phe 305 310 315
320 Asp Ala Thr Lys Ile Trp Pro Glu Glu Leu Val Pro Leu Arg Val Ile
325 330 335 Gly Glu Leu Glu
Leu Asn Arg Asn Val Asp Glu Phe Phe Pro Gln Thr 340
345 350 Glu Gln Val Ala Phe Cys Thr Ser His
Ile Val Pro Gly Ile Asp Phe 355 360
365 Ser Asp Asp Pro Leu Leu Gln Gly Arg Asn Phe Ser Tyr Phe
Asp Thr 370 375 380
Gln Ile Ser Arg Leu Gly Ile Asn Trp Glu Glu Ile Pro Ile Asn Arg 385
390 395 400 Pro Val Cys Pro Val
Leu Asn His Asn Arg Asp Gly Ala Lys Arg His 405
410 415 Arg Ile Ala Gln Gly Thr Val Thr Thr Trp
Ser Asn Arg Ser Glu Ala 420 425
430 Gly Pro Pro Ala Pro Val Glu His Gly Gly Phe Ala Ser Tyr Pro
Ala 435 440 445 Lys
Leu Asn Gly Ile Lys Lys Arg Gly Leu Ser Pro Lys Phe Arg Glu 450
455 460 His His Asn Gln Ala Gln
Leu Phe Tyr Asn Ser Leu Ser Glu His Glu 465 470
475 480 Lys Val His Val Lys Lys Ala Phe Gly Phe Glu
Leu Asp His Cys Asp 485 490
495 Asp Pro Ile Val Tyr Glu Arg Leu Ala Gly His Arg Leu Ala Glu Ile
500 505 510 Asp Leu
Thr Leu Ala Gln Glu Val Ala Glu Leu Val Gly Ala Pro Ile 515
520 525 Pro Asp Lys Ala Leu Arg Pro
Asn His Gly Lys Arg Ser Lys His Leu 530 535
540 Ser Gln Thr Glu Phe Pro Gly Lys Gln Pro Thr Ile
Ala Ser Arg Arg 545 550 555
560 Ile Ala Ile Ile Ile Gly Asp Gly Tyr Asp Pro Val Ala Phe Asn Gly
565 570 575 Leu Lys Gly
Ala Ile Thr Ala Val Gly Ala Leu Pro Phe Val Ile Gly 580
585 590 Thr Lys Arg Ser Pro Ile Tyr Ala
Asp Gly Glu Asp Lys Ser Ser Ser 595 600
605 Lys Gly Val Ile Ala Asp His Gln Tyr Asp Gly Gln Arg
Ser Thr Met 610 615 620
Phe Asp Ala Thr Phe Ile Pro Gly Gly Pro His Val Glu Ser Leu Lys 625
630 635 640 Ala Asn Gly Gln
Ile Arg Tyr Trp Ile Ile Glu Thr Phe Gly His Leu 645
650 655 Lys Ala Leu Gly Ala Thr Gly Glu Ala
Ala Ala Phe Ile Lys Glu Ala 660 665
670 Leu Gly Ser Ala Leu Asp Val Lys Val Ala Thr Ser Asp Asn
Pro Gln 675 680 685
Pro Val Glu Trp Tyr Gly Val Val Thr Ala Gly Lys Ile His Lys Pro 690
695 700 Glu Ser Phe Lys Glu
Gly Ile Gln Ile Val Lys Asp Ala Lys Asp Phe 705 710
715 720 Ile Ser Thr Phe Phe Tyr Gln Ile Ser Gln
His Arg Asn Tyr Lys Arg 725 730
735 Glu Leu Asp Gly Leu Ala Ser Thr Val Ala Phe 740
745 25 2502DNAThermoascus aurantiacus
25atgcgcgcaa ttggacttct gccaggcatc atcggcattg ctggtgctgc ctgtccttac
60atgacaggcg agctgccgcg ctccttcgcc gagaaccctc atgctatcaa ccgtcgtgct
120gagggtggtg gtggtgccgc tgccgagacg gagaagttcc tgtctcagtt ctacctgaac
180gacaacgaca ccttcatgac caccgatgtt ggcggtccaa ttgaggatca gaacagtctc
240agcgctggtg acagaggtcc taccctgctg gaggacttca tcctccgtca aaagatccag
300cgctttgacc atgagcgggt aggttgatct ttactttcgg ccttcttcga gcggggtgat
360attaaaacag gtaataggtg cccgagcgtg ctgtccatgc ccgaggagcg ggagcgcatg
420gcgtgttcac atcctacgca gactggtcca acatcactgc cgcttccttc ctgtctgctg
480caggaaagga gacacctgtc tttgtccggt tctccactgt agcaggaagc agaggaagcg
540cagacacggc gcgtgacgtg cacggtttcg cgacgaggtt ctacacggat gaagggaact
600tcggtaggca actatcatgc tctctttaaa tgttctcgat ctgacagcca gcagacattg
660tcggcaacaa catccctgtc ttcttcattc aagatgcgat ccagttcccc gacctgatcc
720atgctgtcaa gcccagcccg aacaacgaga tccctcaggc cgcaaccgcc catgactctg
780cctgggactt tttcagccag cagccgagct ctttgcatac tctgttctgg gctatggccg
840gtcatggcat tcctcgttcc tacaggaaca tggatggctt cggcatccac accttccgct
900ttgtgacgga cgatggagct tccaagctcg tcaagttcca ctggacgtcg ctgcagggca
960aggcgagcct tgtgtgggaa gaggcacagg ccgtggctgg aaagaacgcg gactatcacc
1020gccaggactt gtgggacgca atcgaggctg gaaggtaccc tgagtgggag gtaggctctc
1080cctgctatgt atggatgtgc cagaagctta ataatggcct agctcggcgt gcaaatcatg
1140gatgaggaag accagctgcg ctttggcttc gatctgttgg acccgaccaa gatcgttccc
1200gaggaatacg tgcccatcac gaagctcgga aagatgcagc tcaaccgcaa cccgctgaac
1260tacttcgccg agactgaaca gatcatggtc agttcgccac cgtgttcggt tgctcgttgc
1320tgaagtgcta acttgcaaca gttccaaccg ggtcacgttg tccgtggcat tgatttcacc
1380gaggaccctc tgctccaggg acgtctcttc tcttacctcg acacccagct caaccgccac
1440ggaggtccga acttcgagca gatccccatc aaccggccac gcactccaat tcacaacaac
1500aaccgtgacg gagccggtat gctagcccat gtattccttt ctttatgcat ttttatatga
1560tgcgttctaa cggcaacagc gcaaatgtac atccccctga acaaggcggc gtacaccccc
1620aacactctga acaacggctc ccccaagcag gccaaccaga cggtcggaaa gggcttcttc
1680acgactccag gccggacggc aagcggcagg cttgtgcgcg ccgtcagctc aaccttcgcc
1740gacgtctggt cgcagcctcg tctgttctac aactccctcg tgccggcgga gcagcagttc
1800ctgatcaacg cgatccgctt tgagacggcc cacatcacga gcgacgtcgt gaagaacaac
1860gtcatcatcc agctgaaccg cgtgagcaac aacctcgcca agagagtcgc ccgggccatc
1920ggtgtcgcgg agcccgagcc agacccaacc ttgtaccaca acaacaagac cgccaacgtc
1980ggggtgttcg gcaagccgct cgccagactc gacggcctgc aggtcggggt cctcgccacc
2040gtcaacaagc ccgactcgat caagcaggcc gccagcctga aggccagctt cgcggcggac
2100aacgtcgacg tcaaggtcgt cgcggagcgc ctcgccgacg gcgtcgacga gacctactcg
2160gccgccgacg cggtcaactt cgacgccatc ctggtcgcca acggcgctga gggcctcttc
2220gcgcgcgaca gcttcaccgc caggccggcc aactcgacca ccgcgacgct ctaccccgcg
2280ggccgcccgc tccagatcct ggtcgacggg ttccgctacg gcaagccggt cggggcgctc
2340ggcagcggcg ccaaggcgct cgacgcagcg gagatttcga cgacccgggc cggcgtgtac
2400gtcgccaact cgacgaccga cagcttcatc aatggcgtca gggacggtct gcggacgttc
2460aagttcctgg accggttcgc gattgacgag gatgctgagt ga
250226740PRTThermoascus aurantiacus 26Met Arg Ala Ile Gly Leu Leu Pro Gly
Ile Ile Gly Ile Ala Gly Ala 1 5 10
15 Ala Cys Pro Tyr Met Thr Gly Glu Leu Pro Arg Ser Phe Ala
Glu Asn 20 25 30
Pro His Ala Ile Asn Arg Arg Ala Glu Gly Gly Gly Gly Ala Ala Ala
35 40 45 Glu Thr Glu Lys
Phe Leu Ser Gln Phe Tyr Leu Asn Asp Asn Asp Thr 50
55 60 Phe Met Thr Thr Asp Val Gly Gly
Pro Ile Glu Asp Gln Asn Ser Leu 65 70
75 80 Ser Ala Gly Asp Arg Gly Pro Thr Leu Leu Glu Asp
Phe Ile Leu Arg 85 90
95 Gln Lys Ile Gln Arg Phe Asp His Glu Arg Val Pro Glu Arg Ala Val
100 105 110 His Ala Arg
Gly Ala Gly Ala His Gly Val Phe Thr Ser Tyr Ala Asp 115
120 125 Trp Ser Asn Ile Thr Ala Ala Ser
Phe Leu Ser Ala Ala Gly Lys Glu 130 135
140 Thr Pro Val Phe Val Arg Phe Ser Thr Val Ala Gly Ser
Arg Gly Ser 145 150 155
160 Ala Asp Thr Ala Arg Asp Val His Gly Phe Ala Thr Arg Phe Tyr Thr
165 170 175 Asp Glu Gly Asn
Phe Asp Ile Val Gly Asn Asn Ile Pro Val Phe Phe 180
185 190 Ile Gln Asp Ala Ile Gln Phe Pro Asp
Leu Ile His Ala Val Lys Pro 195 200
205 Ser Pro Asn Asn Glu Ile Pro Gln Ala Ala Thr Ala His Asp
Ser Ala 210 215 220
Trp Asp Phe Phe Ser Gln Gln Pro Ser Ser Leu His Thr Leu Phe Trp 225
230 235 240 Ala Met Ala Gly His
Gly Ile Pro Arg Ser Tyr Arg Asn Met Asp Gly 245
250 255 Phe Gly Ile His Thr Phe Arg Phe Val Thr
Asp Asp Gly Ala Ser Lys 260 265
270 Leu Val Lys Phe His Trp Thr Ser Leu Gln Gly Lys Ala Ser Leu
Val 275 280 285 Trp
Glu Glu Ala Gln Ala Val Ala Gly Lys Asn Ala Asp Tyr His Arg 290
295 300 Gln Asp Leu Trp Asp Ala
Ile Glu Ala Gly Arg Tyr Pro Glu Trp Glu 305 310
315 320 Leu Gly Val Gln Ile Met Asp Glu Glu Asp Gln
Leu Arg Phe Gly Phe 325 330
335 Asp Leu Leu Asp Pro Thr Lys Ile Val Pro Glu Glu Tyr Val Pro Ile
340 345 350 Thr Lys
Leu Gly Lys Met Gln Leu Asn Arg Asn Pro Leu Asn Tyr Phe 355
360 365 Ala Glu Thr Glu Gln Ile Met
Phe Gln Pro Gly His Val Val Arg Gly 370 375
380 Ile Asp Phe Thr Glu Asp Pro Leu Leu Gln Gly Arg
Leu Phe Ser Tyr 385 390 395
400 Leu Asp Thr Gln Leu Asn Arg His Gly Gly Pro Asn Phe Glu Gln Ile
405 410 415 Pro Ile Asn
Arg Pro Arg Thr Pro Ile His Asn Asn Asn Arg Asp Gly 420
425 430 Ala Ala Gln Met Tyr Ile Pro Leu
Asn Lys Ala Ala Tyr Thr Pro Asn 435 440
445 Thr Leu Asn Asn Gly Ser Pro Lys Gln Ala Asn Gln Thr
Val Gly Lys 450 455 460
Gly Phe Phe Thr Thr Pro Gly Arg Thr Ala Ser Gly Arg Leu Val Arg 465
470 475 480 Ala Val Ser Ser
Thr Phe Ala Asp Val Trp Ser Gln Pro Arg Leu Phe 485
490 495 Tyr Asn Ser Leu Val Pro Ala Glu Gln
Gln Phe Leu Ile Asn Ala Ile 500 505
510 Arg Phe Glu Thr Ala His Ile Thr Ser Asp Val Val Lys Asn
Asn Val 515 520 525
Ile Ile Gln Leu Asn Arg Val Ser Asn Asn Leu Ala Lys Arg Val Ala 530
535 540 Arg Ala Ile Gly Val
Ala Glu Pro Glu Pro Asp Pro Thr Leu Tyr His 545 550
555 560 Asn Asn Lys Thr Ala Asn Val Gly Val Phe
Gly Lys Pro Leu Ala Arg 565 570
575 Leu Asp Gly Leu Gln Val Gly Val Leu Ala Thr Val Asn Lys Pro
Asp 580 585 590 Ser
Ile Lys Gln Ala Ala Ser Leu Lys Ala Ser Phe Ala Ala Asp Asn 595
600 605 Val Asp Val Lys Val Val
Ala Glu Arg Leu Ala Asp Gly Val Asp Glu 610 615
620 Thr Tyr Ser Ala Ala Asp Ala Val Asn Phe Asp
Ala Ile Leu Val Ala 625 630 635
640 Asn Gly Ala Glu Gly Leu Phe Ala Arg Asp Ser Phe Thr Ala Arg Pro
645 650 655 Ala Asn
Ser Thr Thr Ala Thr Leu Tyr Pro Ala Gly Arg Pro Leu Gln 660
665 670 Ile Leu Val Asp Gly Phe Arg
Tyr Gly Lys Pro Val Gly Ala Leu Gly 675 680
685 Ser Gly Ala Lys Ala Leu Asp Ala Ala Glu Ile Ser
Thr Thr Arg Ala 690 695 700
Gly Val Tyr Val Ala Asn Ser Thr Thr Asp Ser Phe Ile Asn Gly Val 705
710 715 720 Arg Asp Gly
Leu Arg Thr Phe Lys Phe Leu Asp Arg Phe Ala Ile Asp 725
730 735 Glu Asp Ala Glu 740
27 1452DNABacillus thermoglucosidasius 27atgagttcaa ataaactgac
aactagctgg ggcgctccgg ttggagataa tcaaaactca 60atgactgccg gttctcgcgg
accaacttta attcaagatg tacatttact cgaaaaattg 120gcccatttca accgagaacg
tgttcctgaa cgtgttgttc acgccaaagg agcaggcgca 180cacggatatt ttgaagtgac
aaacgacgta acaaaataca cgaaagccgc tttcctttct 240gaagtcggca aacgcacacc
gttgttcatc cgtttctcaa cagttgccgg tgaacttggc 300tctgctgaca cagttcgcga
cccgcgcgga tttgctgtta aattttatac tgaagaagga 360aactacgaca tcgtcggcaa
caatacgcct gtattcttta tccgcgatgc gattaagttc 420cctgatttca tccatacaca
aaaaagagat ccaaaaacac acctgaaaaa ccctacggct 480gtatgggatt tctggtcact
ttcaccagag tcattgcacc aagtgacaat cctgatgtct 540gaccgcggaa ttcctgcgac
acttcgccac atgcacggct tcggaagcca tacattcaaa 600tggacaaatg ccgaacccga
aggcgtatgg attaaatatc actttaaaac agaacaaggc 660gtgaaaaacc ttgatgtcaa
tacggcagca aaaattgccg gtgaaaaccc tgattaccat 720acagaagacc ttttcaacgc
aatcgaaaac ggtgattatc ctgcatggaa actatatgtg 780caaatcatgc ctttagaaga
tgcaaatacg taccgtttcg atccgtttga tgtcacaaaa 840gtttggtctc aaaaagacta
cccgttaatc gaggtcggac gcatggttct agacagaaat 900ccggaaaact actttgcaga
ggtagaacaa gcgacatttt cacctggaac cctcgtgcct 960ggtattgatg tttcaccgga
taaaatgctt caaggtcgac tttttgctta tcatgatgca 1020caccgctacc gtgtcggtgc
aaaccatcaa gcgctgccaa tcaaccgcgc acgcaacaaa 1080gtaaacaatt atcagcgtga
tgggcaaatg cgttttgatg ataacggcgg cggatctgtg 1140tattacgagc ctaacagctt
cggcggtcca aaagagtcac ctgaggataa gcaagcagca 1200tatccggtac aaggtatcgc
tgacagcgta agctacgatc actacgatca ctacactcaa 1260gccggcgatc tgtatcgttt
aatgagtgaa gatgaacgta cccgccttgt tgaaaatatc 1320gttaatgcca tgaagccggt
agaaaaagaa gaaatcaagc tgcgccaaat cgagcacttc 1380tacaaagcgg atcctgaata
cggaaaacgc gtggcagaag gccttggatt gccgattaaa 1440aaagattctt aa
145228483PRTBacillus
thermoglucosidasius 28Met Ser Ser Asn Lys Leu Thr Thr Ser Trp Gly Ala Pro
Val Gly Asp 1 5 10 15
Asn Gln Asn Ser Met Thr Ala Gly Ser Arg Gly Pro Thr Leu Ile Gln
20 25 30 Asp Val His Leu
Leu Glu Lys Leu Ala His Phe Asn Arg Glu Arg Val 35
40 45 Pro Glu Arg Val Val His Ala Lys Gly
Ala Gly Ala His Gly Tyr Phe 50 55
60 Glu Val Thr Asn Asp Val Thr Lys Tyr Thr Lys Ala Ala
Phe Leu Ser 65 70 75
80 Glu Val Gly Lys Arg Thr Pro Leu Phe Ile Arg Phe Ser Thr Val Ala
85 90 95 Gly Glu Leu Gly
Ser Ala Asp Thr Val Arg Asp Pro Arg Gly Phe Ala 100
105 110 Val Lys Phe Tyr Thr Glu Glu Gly Asn
Tyr Asp Ile Val Gly Asn Asn 115 120
125 Thr Pro Val Phe Phe Ile Arg Asp Ala Ile Lys Phe Pro Asp
Phe Ile 130 135 140
His Thr Gln Lys Arg Asp Pro Lys Thr His Leu Lys Asn Pro Thr Ala 145
150 155 160 Val Trp Asp Phe Trp
Ser Leu Ser Pro Glu Ser Leu His Gln Val Thr 165
170 175 Ile Leu Met Ser Asp Arg Gly Ile Pro Ala
Thr Leu Arg His Met His 180 185
190 Gly Phe Gly Ser His Thr Phe Lys Trp Thr Asn Ala Glu Pro Glu
Gly 195 200 205 Val
Trp Ile Lys Tyr His Phe Lys Thr Glu Gln Gly Val Lys Asn Leu 210
215 220 Asp Val Asn Thr Ala Ala
Lys Ile Ala Gly Glu Asn Pro Asp Tyr His 225 230
235 240 Thr Glu Asp Leu Phe Asn Ala Ile Glu Asn Gly
Asp Tyr Pro Ala Trp 245 250
255 Lys Leu Tyr Val Gln Ile Met Pro Leu Glu Asp Ala Asn Thr Tyr Arg
260 265 270 Phe Asp
Pro Phe Asp Val Thr Lys Val Trp Ser Gln Lys Asp Tyr Pro 275
280 285 Leu Ile Glu Val Gly Arg Met
Val Leu Asp Arg Asn Pro Glu Asn Tyr 290 295
300 Phe Ala Glu Val Glu Gln Ala Thr Phe Ser Pro Gly
Thr Leu Val Pro 305 310 315
320 Gly Ile Asp Val Ser Pro Asp Lys Met Leu Gln Gly Arg Leu Phe Ala
325 330 335 Tyr His Asp
Ala His Arg Tyr Arg Val Gly Ala Asn His Gln Ala Leu 340
345 350 Pro Ile Asn Arg Ala Arg Asn Lys
Val Asn Asn Tyr Gln Arg Asp Gly 355 360
365 Gln Met Arg Phe Asp Asp Asn Gly Gly Gly Ser Val Tyr
Tyr Glu Pro 370 375 380
Asn Ser Phe Gly Gly Pro Lys Glu Ser Pro Glu Asp Lys Gln Ala Ala 385
390 395 400 Tyr Pro Val Gln
Gly Ile Ala Asp Ser Val Ser Tyr Asp His Tyr Asp 405
410 415 His Tyr Thr Gln Ala Gly Asp Leu Tyr
Arg Leu Met Ser Glu Asp Glu 420 425
430 Arg Thr Arg Leu Val Glu Asn Ile Val Asn Ala Met Lys Pro
Val Glu 435 440 445
Lys Glu Glu Ile Lys Leu Arg Gln Ile Glu His Phe Tyr Lys Ala Asp 450
455 460 Pro Glu Tyr Gly Lys
Arg Val Ala Glu Gly Leu Gly Leu Pro Ile Lys 465 470
475 480 Lys Asp Ser 291449DNABacillus
thermoglucosidasius 29atgagttcaa ataaactgac aactagctgg ggagcacctg
ttggcgataa ccaaaactcg 60ataacggccg gcaatcctgg cccgacatta atccaagacg
tacatcttat cgaaaaatta 120gcacacttca atagagaacg tgtcccagaa cgtgttgtcc
atgcgaaagg cgctggtgcg 180cacggctatt tcgaagtaac aaacgatatg tcgaaataca
caaaagcgaa agtgtttaac 240ggtgttggca aacgcacgcc tgtattcgtc cgcttctcta
ctgtcgccgg tgaattggga 300tctgcggata cagtccgcga cccgcgcggt tttgccgtca
aattttatac cgaagaagga 360aactatgaca tcgttggcaa caacacaccg attttcttca
ttcgtgatgc gattaaattc 420tcggatttta tccatacaca aaaacgcgac ccgcgcaccc
atttgattta tccgacagca 480atgtgggatt tcttgtcttt atctccggaa tctttgcacc
aagtcactta tttattcggg 540gatcgcggca tcccattgac ataccgccat atgaacggat
acggaagcca tacattcaaa 600tgggtgaatg aaaaaggcga agcggtatgg gtaaaatacc
actttaaaac aaaccaaggc 660gtgaaaaaca tggatccgga actagcggtt aaaatcgccg
gagaaaatcc ggattaccat 720acggaagatt tatataacgc catcgaaaaa ggcgactatc
catcttggac attatatgtg 780caaattatgc cgttagaaga cgcaaaaaca taccgtttca
atccatttga tgtcacaaaa 840gtttggtcac ataaagatta tccgttaatt gaagtcggcc
gtatggtatt aaaccgcaat 900ccagaaaatt attttgccga agtcgaacaa gcgacattct
ctcctggaaa ccttgttcct 960ggcgttgaac catcgccgga taaaatcttg caagcccgtt
tgttcgctta tgcggatgcg 1020caccgttacc gcgtcggcgt gaaccataac ttgcttccga
tcaaccgccc gcgcgtggaa 1080gtaaacaatt atcaacgtga cggcttcatg cgctttgaca
ataatggcgg cggttcggtc 1140aactacgaac caaacagctt cggcggaccg acagaagtgc
cagaacataa aacgacccca 1200ttcccggtat ccggcgtggc agaaagcgtg ccatatgacg
acgatgatca ttatacgcaa 1260gcaggcgact tataccgtct catgagcgaa gaagaaaaag
cgcgccttgt gaaaaacatt 1320gtcgaatcat tgaaacaagt aacaaaagaa gaaattaaac
ttcgccaaat ccgccacttc 1380tacaaagcag accctgacta cggccgccgc gttgccgaag
gtcttggatt gccgattaaa 1440aaagattct
144930483PRTBacillus thermoglucosidasius 30Met Ser
Ser Asn Lys Leu Thr Thr Ser Trp Gly Ala Pro Val Gly Asp 1 5
10 15 Asn Gln Asn Ser Ile Thr Ala
Gly Asn Pro Gly Pro Thr Leu Ile Gln 20 25
30 Asp Val His Leu Ile Glu Lys Leu Ala His Phe Asn
Arg Glu Arg Val 35 40 45
Pro Glu Arg Val Val His Ala Lys Gly Ala Gly Ala His Gly Tyr Phe
50 55 60 Glu Val Thr
Asn Asp Met Ser Lys Tyr Thr Lys Ala Lys Val Phe Asn 65
70 75 80 Gly Val Gly Lys Arg Thr Pro
Val Phe Val Arg Phe Ser Thr Val Ala 85
90 95 Gly Glu Leu Gly Ser Ala Asp Thr Val Arg Asp
Pro Arg Gly Phe Ala 100 105
110 Val Lys Phe Tyr Thr Glu Glu Gly Asn Tyr Asp Ile Val Gly Asn
Asn 115 120 125 Thr
Pro Ile Phe Phe Ile Arg Asp Ala Ile Lys Phe Ser Asp Phe Ile 130
135 140 His Thr Gln Lys Arg Asp
Pro Arg Thr His Leu Ile Tyr Pro Thr Ala 145 150
155 160 Met Trp Asp Phe Leu Ser Leu Ser Pro Glu Ser
Leu His Gln Val Thr 165 170
175 Tyr Leu Phe Gly Asp Arg Gly Ile Pro Leu Thr Tyr Arg His Met Asn
180 185 190 Gly Tyr
Gly Ser His Thr Phe Lys Trp Val Asn Glu Lys Gly Glu Ala 195
200 205 Val Trp Val Lys Tyr His Phe
Lys Thr Asn Gln Gly Val Lys Asn Met 210 215
220 Asp Pro Glu Leu Ala Val Lys Ile Ala Gly Glu Asn
Pro Asp Tyr His 225 230 235
240 Thr Glu Asp Leu Tyr Asn Ala Ile Glu Lys Gly Asp Tyr Pro Ser Trp
245 250 255 Thr Leu Tyr
Val Gln Ile Met Pro Leu Glu Asp Ala Lys Thr Tyr Arg 260
265 270 Phe Asn Pro Phe Asp Val Thr Lys
Val Trp Ser His Lys Asp Tyr Pro 275 280
285 Leu Ile Glu Val Gly Arg Met Val Leu Asn Arg Asn Pro
Glu Asn Tyr 290 295 300
Phe Ala Glu Val Glu Gln Ala Thr Phe Ser Pro Gly Asn Leu Val Pro 305
310 315 320 Gly Val Glu Pro
Ser Pro Asp Lys Ile Leu Gln Ala Arg Leu Phe Ala 325
330 335 Tyr Ala Asp Ala His Arg Tyr Arg Val
Gly Val Asn His Asn Leu Leu 340 345
350 Pro Ile Asn Arg Pro Arg Val Glu Val Asn Asn Tyr Gln Arg
Asp Gly 355 360 365
Phe Met Arg Phe Asp Asn Asn Gly Gly Gly Ser Val Asn Tyr Glu Pro 370
375 380 Asn Ser Phe Gly Gly
Pro Thr Glu Val Pro Glu His Lys Thr Thr Pro 385 390
395 400 Phe Pro Val Ser Gly Val Ala Glu Ser Val
Pro Tyr Asp Asp Asp Asp 405 410
415 His Tyr Thr Gln Ala Gly Asp Leu Tyr Arg Leu Met Ser Glu Glu
Glu 420 425 430 Lys
Ala Arg Leu Val Lys Asn Ile Val Glu Ser Leu Lys Gln Val Thr 435
440 445 Lys Glu Glu Ile Lys Leu
Arg Gln Ile Arg His Phe Tyr Lys Ala Asp 450 455
460 Pro Asp Tyr Gly Arg Arg Val Ala Glu Gly Leu
Gly Leu Pro Ile Lys 465 470 475
480 Lys Asp Ser 312262DNAAlcaligenes aquamarinus 31atgaataacg
catccgctga cgatctacac agtagcttgc agcaaagatg cagagcattt 60gttcccttgg
tatcgccaag gcatagagca ataagggaga gagctatgag cggtaaatgt 120cctgtcatgc
acggtggtaa cacctcgacc ggtacttcca acaaagattg gtggccggaa 180gggttgaacc
tggatatttt gcatcagcaa gatcgcaaat cagacccgat ggatccggat 240ttcaactacc
gtgaagaagt acgcaagctc gatttcgacg cgctgaagaa agatgtccac 300gcgttgatga
ccgatagcca agagtggtgg cccgctgact gggggcacta cggcggtttg 360atgatccgta
tggcttggca ctccgctggc acctaccgta ttgctgatgg ccgtgggggc 420ggtggtaccg
gaagccagcg ctttgcaccg ctcaactcct ggccggacaa cgtcagcctg 480gataaagcgc
gccgtctgct gtggccgatc aagaagaagt acggcaacaa aatcagctgg 540gcagacctga
tgattctggc tggcaccgtg gcttatgagt ccatgggctt acctgcttac 600ggcttctctt
tcggccgcgt cgatatttgg gaacccgaaa aagatatcta ctggggtgac 660gaaaaagagt
ggctggcacc ttctgacgaa cgctacggcg acgtgaacaa gccagagacc 720atggaaaacc
cgctggcggc tgtccaaatg ggtctgatct atgtgaaccc ggaaggtgtt 780aacggccacc
ctgatccgct gagaaccgca cagcaggtac ttgaaacctt cgcccgtatg 840gcgatgaacg
acgaaaaaac cgcagccctc acagctggcg gccacaccgt cggtaattgt 900cacggtaatg
gcaatgcctc tgcgttagcc cctgacccaa aagcctctga cgttgaaaac 960cagggcttag
gttggggcaa ccccaacatg cagggcaagg caagcaacgc cgtgacctcg 1020ggtatcgaag
gtgcttggac caccaacccc acgaaattcg atatgggcta tttcgacctg 1080ctgttcggct
acaattggga actgaaaaag agtcctgccg gtgcccacca ttgggaaccg 1140attgacatca
aaaaggaaaa caagccggtt gacgccagcg acccctctat tcgccacaac 1200ccgatcatga
ccgatgcgga tatggcgata aaggtaaatc cgacctatcg cgctatctgc 1260gaaaaattca
tggccgatcc tgagtacttc aagaaaactt tcgcgaaggc gtggttcaag 1320ctgacgcacc
gtgacctggg cccgaaatca cgttacatcg gcccggaagt gccggcagaa 1380gacctgattt
ggcaagaccc gattccggca ggtaacaccg actactgcga agaagtggtc 1440aagcagaaaa
ttgcacaaag tggcctgagc attagtgaga tggtctccac cgcttgggac 1500agtgcccgta
cttatcgcgg ttccgatatg cgcggcggtg ctaacggtgc ccgcattcgc 1560ttggccccac
agaacgagtg gcagggcaac gagccggagc gcctggcgaa agtgctgagc 1620gtctacgagc
agatctctgc cgacaccggc gctagcatcg cggacgtgat cgttctggcc 1680ggtagcgtag
gcatcgagaa agccgcgaaa gcagcaggtt acgatgtgcg cgttcccttc 1740ctgaaaggcc
gtggcgatgc gaccgccgag atgaccgacg cagactcctt cgcaccgctg 1800gagccgctgg
ccgatggctt ccgcaactgg cagaagaaag agtatgtggt gaagccggaa 1860gagatgctgc
tggatcgtgc gcagctgatg ggcttaaccg gcccggaaat gaccgtgctg 1920ctgggcggta
tgcgcgtact gggcaccaac tatggtggca ccaaacacgg cgtattcacc 1980gattgtgaag
gccagttgac caacgacttt tttgtgaacc tgaccgatat ggggaacagc 2040tggaagccgg
taggtagcaa cgcctacgaa atccgcgacc gcaagaccgg tgccgtgaag 2100tggaccgcct
cgcgggtgga tctggtattt ggttccaact cgctactgcg ctcttacgca 2160gaagtgtacg
cccaggacga taacggcgag aagttcgtca gagacttcgt cgccgcctgg 2220accaaagtga
tgaacgccga ccgtttcgac gtcgcgtcgt aa
226232753PRTAlcaligenes aquamarinus 32Met Asn Asn Ala Ser Ala Asp Asp Leu
His Ser Ser Leu Gln Gln Arg 1 5 10
15 Cys Arg Ala Phe Val Pro Leu Val Ser Pro Arg His Arg Ala
Ile Arg 20 25 30
Glu Arg Ala Met Ser Gly Lys Cys Pro Val Met His Gly Gly Asn Thr
35 40 45 Ser Thr Gly Thr
Ser Asn Lys Asp Trp Trp Pro Glu Gly Leu Asn Leu 50
55 60 Asp Ile Leu His Gln Gln Asp Arg
Lys Ser Asp Pro Met Asp Pro Asp 65 70
75 80 Phe Asn Tyr Arg Glu Glu Val Arg Lys Leu Asp Phe
Asp Ala Leu Lys 85 90
95 Lys Asp Val His Ala Leu Met Thr Asp Ser Gln Glu Trp Trp Pro Ala
100 105 110 Asp Trp Gly
His Tyr Gly Gly Leu Met Ile Arg Met Ala Trp His Ser 115
120 125 Ala Gly Thr Tyr Arg Ile Ala Asp
Gly Arg Gly Gly Gly Gly Thr Gly 130 135
140 Ser Gln Arg Phe Ala Pro Leu Asn Ser Trp Pro Asp Asn
Val Ser Leu 145 150 155
160 Asp Lys Ala Arg Arg Leu Leu Trp Pro Ile Lys Lys Lys Tyr Gly Asn
165 170 175 Lys Ile Ser Trp
Ala Asp Leu Met Ile Leu Ala Gly Thr Val Ala Tyr 180
185 190 Glu Ser Met Gly Leu Pro Ala Tyr Gly
Phe Ser Phe Gly Arg Val Asp 195 200
205 Ile Trp Glu Pro Glu Lys Asp Ile Tyr Trp Gly Asp Glu Lys
Glu Trp 210 215 220
Leu Ala Pro Ser Asp Glu Arg Tyr Gly Asp Val Asn Lys Pro Glu Thr 225
230 235 240 Met Glu Asn Pro Leu
Ala Ala Val Gln Met Gly Leu Ile Tyr Val Asn 245
250 255 Pro Glu Gly Val Asn Gly His Pro Asp Pro
Leu Arg Thr Ala Gln Gln 260 265
270 Val Leu Glu Thr Phe Ala Arg Met Ala Met Asn Asp Glu Lys Thr
Ala 275 280 285 Ala
Leu Thr Ala Gly Gly His Thr Val Gly Asn Cys His Gly Asn Gly 290
295 300 Asn Ala Ser Ala Leu Ala
Pro Asp Pro Lys Ala Ser Asp Val Glu Asn 305 310
315 320 Gln Gly Leu Gly Trp Gly Asn Pro Asn Met Gln
Gly Lys Ala Ser Asn 325 330
335 Ala Val Thr Ser Gly Ile Glu Gly Ala Trp Thr Thr Asn Pro Thr Lys
340 345 350 Phe Asp
Met Gly Tyr Phe Asp Leu Leu Phe Gly Tyr Asn Trp Glu Leu 355
360 365 Lys Lys Ser Pro Ala Gly Ala
His His Trp Glu Pro Ile Asp Ile Lys 370 375
380 Lys Glu Asn Lys Pro Val Asp Ala Ser Asp Pro Ser
Ile Arg His Asn 385 390 395
400 Pro Ile Met Thr Asp Ala Asp Met Ala Ile Lys Val Asn Pro Thr Tyr
405 410 415 Arg Ala Ile
Cys Glu Lys Phe Met Ala Asp Pro Glu Tyr Phe Lys Lys 420
425 430 Thr Phe Ala Lys Ala Trp Phe Lys
Leu Thr His Arg Asp Leu Gly Pro 435 440
445 Lys Ser Arg Tyr Ile Gly Pro Glu Val Pro Ala Glu Asp
Leu Ile Trp 450 455 460
Gln Asp Pro Ile Pro Ala Gly Asn Thr Asp Tyr Cys Glu Glu Val Val 465
470 475 480 Lys Gln Lys Ile
Ala Gln Ser Gly Leu Ser Ile Ser Glu Met Val Ser 485
490 495 Thr Ala Trp Asp Ser Ala Arg Thr Tyr
Arg Gly Ser Asp Met Arg Gly 500 505
510 Gly Ala Asn Gly Ala Arg Ile Arg Leu Ala Pro Gln Asn Glu
Trp Gln 515 520 525
Gly Asn Glu Pro Glu Arg Leu Ala Lys Val Leu Ser Val Tyr Glu Gln 530
535 540 Ile Ser Ala Asp Thr
Gly Ala Ser Ile Ala Asp Val Ile Val Leu Ala 545 550
555 560 Gly Ser Val Gly Ile Glu Lys Ala Ala Lys
Ala Ala Gly Tyr Asp Val 565 570
575 Arg Val Pro Phe Leu Lys Gly Arg Gly Asp Ala Thr Ala Glu Met
Thr 580 585 590 Asp
Ala Asp Ser Phe Ala Pro Leu Glu Pro Leu Ala Asp Gly Phe Arg 595
600 605 Asn Trp Gln Lys Lys Glu
Tyr Val Val Lys Pro Glu Glu Met Leu Leu 610 615
620 Asp Arg Ala Gln Leu Met Gly Leu Thr Gly Pro
Glu Met Thr Val Leu 625 630 635
640 Leu Gly Gly Met Arg Val Leu Gly Thr Asn Tyr Gly Gly Thr Lys His
645 650 655 Gly Val
Phe Thr Asp Cys Glu Gly Gln Leu Thr Asn Asp Phe Phe Val 660
665 670 Asn Leu Thr Asp Met Gly Asn
Ser Trp Lys Pro Val Gly Ser Asn Ala 675 680
685 Tyr Glu Ile Arg Asp Arg Lys Thr Gly Ala Val Lys
Trp Thr Ala Ser 690 695 700
Arg Val Asp Leu Val Phe Gly Ser Asn Ser Leu Leu Arg Ser Tyr Ala 705
710 715 720 Glu Val Tyr
Ala Gln Asp Asp Asn Gly Glu Lys Phe Val Arg Asp Phe 725
730 735 Val Ala Ala Trp Thr Lys Val Met
Asn Ala Asp Arg Phe Asp Val Ala 740 745
750 Ser 332238DNAMicroscilla furvescens 33atggaaaatc
acaaacactc aggatcttct acgtataaca caaacactgg cggaaaatgc 60ccttttaccg
gaggttcgct taagcaaagt gcaggtggcg gcaccaaaaa cagggattgg 120tggcccaaca
tgctcaacct cggcatctta cgccaacatt catcgctatc ggacccaaac 180gacccggatt
ttgactatgc cgaagagttt aagaagctag atctggcagc ggttaaaaag 240gacctggcag
cgctaatgac agattcacag gactggtggc cagcagatta cggtcattat 300ggccccttct
ttatacgcat ggcgtggcac agcgccggca cctaccgtat cggtgatggc 360cgtggtggcg
gtggctccgg ctcacagcgc ttcgcgcctc tcaatagctg gccagacaat 420gccaatctgg
ataaagcacg cttgcttctt tggcccatca aacaaaaata cggtcgaaaa 480atctcctggg
cggatctaat gatactcaca ggaaacgtag ctctggaaac tatgggcttt 540aaaacttttg
gttttgcagg tggcagagca gatgtatggg agcctgaaga agatgtatac 600tggggagcag
aaaccgaatg gctgggagac aagcgctatg aaggtgaccg agagctcgaa 660aatcccctgg
gagccgtaca aatgggactc atctatgtaa accccgaagg acccaacggc 720aagccagacc
ctatcgctgc tgcgcgtgat attcgtgaga cttttggccg aatggcaatg 780aatgacgaag
aaaccgtggc tctcatagcg ggtggacaca ccttcggaaa aacccatggt 840gctgccgatg
cggagaaata tgtgggccga gagcctgccg ccgcaggtat tgaagaaatg 900agcctggggt
ggaaaaacac ctacggcacc ggacacggtg cggataccat caccagtgga 960ctagaaggcg
cctggaccaa gacccctact caatggagca ataacttttt tgaaaacctc 1020tttggttacg
agtgggagct taccaaaagt ccagctggag cttatcagtg gaaaccaaaa 1080gacggtgccg
gggctggcac cataccggat gcacatgatc ccagcaagtc gcacgctcca 1140tttatgctca
ctacggacct ggcgctgcgc atggaccctg attacgaaaa aatttctcga 1200cggtactatg
aaaaccctga tgagtttgca gatgctttcg cgaaagcatg gtacaaactg 1260acacacagag
atatgggacc aaaggtgcgc tacctgggac cagaagtgcc tcaggaagac 1320ctcatctggc
aagaccctat accagatgta agccatcctc ttgtagacga aaacgatatt 1380gaaggcctaa
aagccaaaat cctggaatcg ggactgacgg taagcgagct ggtaagcacg 1440gcatgggctt
ctgcatctac ttttagaaac tctgacaagc gcggcggtgc caacggtgca 1500cgtatacgac
tggccccaca aaaagactgg gaagtaaaca accctcagca acttgccagg 1560gtactcaaaa
cactagaagg tatccaggag gactttaacc aggcgcaatc agataacaaa 1620gcagtatcgt
tggccgacct gattgtgctg gccggctgtg cgggtgtaga aaaagctgca 1680aaagatgctg
gccatgaggt gcaggtgcct ttcaacccgg gacgagcgga tgccaccgct 1740gagcaaaccg
atgtggaagc tttcgaagca ctagagccag cggctgacgg ctttagaaac 1800tacattaaac
cggagcataa agtatccgct gaggaaatgc tcgtagaccg ggcgcagctt 1860ctgtcgcttt
cggcaccaga aatgactgct ttggtaggcg gtatgcgtgt actgggcacc 1920aactacgacg
gttcgcagca tggagtgttt acaaataagc cgggtcagct atccaatgac 1980ttctttgtaa
acctgctaga cctcaacact aaatggcgag ccagcgatga atcagacaaa 2040gtttttgaag
gcagagactt caaaactggc gaagtaaagt ggagtggcac ccgggtagac 2100ctgatcttcg
gatccaattc cgagctaaga gccctcgcag aagtgtacgg ctgtgcagat 2160tctgaagaaa
agtttgttaa agattttgtg aaggcctggg ccaaagtaat ggacctggac 2220cggtttgatc
tgaaataa
223834745PRTMicroscilla furvescens 34Met Glu Asn His Lys His Ser Gly Ser
Ser Thr Tyr Asn Thr Asn Thr 1 5 10
15 Gly Gly Lys Cys Pro Phe Thr Gly Gly Ser Leu Lys Gln Ser
Ala Gly 20 25 30
Gly Gly Thr Lys Asn Arg Asp Trp Trp Pro Asn Met Leu Asn Leu Gly
35 40 45 Ile Leu Arg Gln
His Ser Ser Leu Ser Asp Pro Asn Asp Pro Asp Phe 50
55 60 Asp Tyr Ala Glu Glu Phe Lys Lys
Leu Asp Leu Ala Ala Val Lys Lys 65 70
75 80 Asp Leu Ala Ala Leu Met Thr Asp Ser Gln Asp Trp
Trp Pro Ala Asp 85 90
95 Tyr Gly His Tyr Gly Pro Phe Phe Ile Arg Met Ala Trp His Ser Ala
100 105 110 Gly Thr Tyr
Arg Ile Gly Asp Gly Arg Gly Gly Gly Gly Ser Gly Ser 115
120 125 Gln Arg Phe Ala Pro Leu Asn Ser
Trp Pro Asp Asn Ala Asn Leu Asp 130 135
140 Lys Ala Arg Leu Leu Leu Trp Pro Ile Lys Gln Lys Tyr
Gly Arg Lys 145 150 155
160 Ile Ser Trp Ala Asp Leu Met Ile Leu Thr Gly Asn Val Ala Leu Glu
165 170 175 Thr Met Gly Phe
Lys Thr Phe Gly Phe Ala Gly Gly Arg Ala Asp Val 180
185 190 Trp Glu Pro Glu Glu Asp Val Tyr Trp
Gly Ala Glu Thr Glu Trp Leu 195 200
205 Gly Asp Lys Arg Tyr Glu Gly Asp Arg Glu Leu Glu Asn Pro
Leu Gly 210 215 220
Ala Val Gln Met Gly Leu Ile Tyr Val Asn Pro Glu Gly Pro Asn Gly 225
230 235 240 Lys Pro Asp Pro Ile
Ala Ala Ala Arg Asp Ile Arg Glu Thr Phe Gly 245
250 255 Arg Met Ala Met Asn Asp Glu Glu Thr Val
Ala Leu Ile Ala Gly Gly 260 265
270 His Thr Phe Gly Lys Thr His Gly Ala Ala Asp Ala Glu Lys Tyr
Val 275 280 285 Gly
Arg Glu Pro Ala Ala Ala Gly Ile Glu Glu Met Ser Leu Gly Trp 290
295 300 Lys Asn Thr Tyr Gly Thr
Gly His Gly Ala Asp Thr Ile Thr Ser Gly 305 310
315 320 Leu Glu Gly Ala Trp Thr Lys Thr Pro Thr Gln
Trp Ser Asn Asn Phe 325 330
335 Phe Glu Asn Leu Phe Gly Tyr Glu Trp Glu Leu Thr Lys Ser Pro Ala
340 345 350 Gly Ala
Tyr Gln Trp Lys Pro Lys Asp Gly Ala Gly Ala Gly Thr Ile 355
360 365 Pro Asp Ala His Asp Pro Ser
Lys Ser His Ala Pro Phe Met Leu Thr 370 375
380 Thr Asp Leu Ala Leu Arg Met Asp Pro Asp Tyr Glu
Lys Ile Ser Arg 385 390 395
400 Arg Tyr Tyr Glu Asn Pro Asp Glu Phe Ala Asp Ala Phe Ala Lys Ala
405 410 415 Trp Tyr Lys
Leu Thr His Arg Asp Met Gly Pro Lys Val Arg Tyr Leu 420
425 430 Gly Pro Glu Val Pro Gln Glu Asp
Leu Ile Trp Gln Asp Pro Ile Pro 435 440
445 Asp Val Ser His Pro Leu Val Asp Glu Asn Asp Ile Glu
Gly Leu Lys 450 455 460
Ala Lys Ile Leu Glu Ser Gly Leu Thr Val Ser Glu Leu Val Ser Thr 465
470 475 480 Ala Trp Ala Ser
Ala Ser Thr Phe Arg Asn Ser Asp Lys Arg Gly Gly 485
490 495 Ala Asn Gly Ala Arg Ile Arg Leu Ala
Pro Gln Lys Asp Trp Glu Val 500 505
510 Asn Asn Pro Gln Gln Leu Ala Arg Val Leu Lys Thr Leu Glu
Gly Ile 515 520 525
Gln Glu Asp Phe Asn Gln Ala Gln Ser Asp Asn Lys Ala Val Ser Leu 530
535 540 Ala Asp Leu Ile Val
Leu Ala Gly Cys Ala Gly Val Glu Lys Ala Ala 545 550
555 560 Lys Asp Ala Gly His Glu Val Gln Val Pro
Phe Asn Pro Gly Arg Ala 565 570
575 Asp Ala Thr Ala Glu Gln Thr Asp Val Glu Ala Phe Glu Ala Leu
Glu 580 585 590 Pro
Ala Ala Asp Gly Phe Arg Asn Tyr Ile Lys Pro Glu His Lys Val 595
600 605 Ser Ala Glu Glu Met Leu
Val Asp Arg Ala Gln Leu Leu Ser Leu Ser 610 615
620 Ala Pro Glu Met Thr Ala Leu Val Gly Gly Met
Arg Val Leu Gly Thr 625 630 635
640 Asn Tyr Asp Gly Ser Gln His Gly Val Phe Thr Asn Lys Pro Gly Gln
645 650 655 Leu Ser
Asn Asp Phe Phe Val Asn Leu Leu Asp Leu Asn Thr Lys Trp 660
665 670 Arg Ala Ser Asp Glu Ser Asp
Lys Val Phe Glu Gly Arg Asp Phe Lys 675 680
685 Thr Gly Glu Val Lys Trp Ser Gly Thr Arg Val Asp
Leu Ile Phe Gly 690 695 700
Ser Asn Ser Glu Leu Arg Ala Leu Ala Glu Val Tyr Gly Cys Ala Asp 705
710 715 720 Ser Glu Glu
Lys Phe Val Lys Asp Phe Val Lys Ala Trp Ala Lys Val 725
730 735 Met Asp Leu Asp Arg Phe Asp Leu
Lys 740 745 35 3108DNAAspergillus
niger 35cttgtcaccg agtgcccgtt tgtcacttgt tgtggtgatc ttgagcacat cgcgttcctc
60tcgtctcatc acatcgagtg atcaacattg catgacccta gtggagcccc ttcgtctccc
120aacaggaggg tccggattac caagtcccga caccgtttgg ctgtaattcg actcaaattc
180tggattcgta gcttaactaa gacgcgtggt ctgttaaccg gcctcgccat ggatgccgat
240ataaggaccc taggggactc ccccctggtg actctcgtcg gaagatcgca gcactctgaa
300ttctcctagt cttcgtttac tccgccatgc gtcatttctg ccttttgcca gctgttgctg
360gtatcgctgg ggctcaatgc ccctacctgt cgggtgaaat gagtttcacc caggagcagg
420acaatgctgg cgataccatt gaggtcacgg agcagcccat tgacaacacc ctgtatgtca
480atgacaccgg tagctacatg actaccgact ttggcactcc gatctccgac cagaccagtc
540tcaaggccgg gccccgtggt cctaccctgt tggaggactt tatcttccgt cagaagcttc
600agcggttcga ccatgagcgt gtaagtacag taactgctgc ggtgtgtagt aacaataaat
660tgacccagtg gttttcaatt aggtccccga gcgcgtcgtc cacgcccgtg gtgccggtgc
720atatggtact ttcaaatcct acgccgactg gtcgaacgtc acggctgccg atttcttgag
780tgccaacgat aaggagagcc cctatgttct gtcgcttctc tactgtggtc ggtttccgtg
840gtagtgttga cactgcgcgt gatgttcacg gtcacgcttg tcggttctac actgacgagg
900gtaactatgg tatcttgata tggtcaccca acaataattc aatacatgct aacagatatg
960tctctactag acatcgtcgg tatcaatttc gcccccttct tcatccagga cgccatccag
1020ttccccgatc ttgtccacgc catcaagccc atgcccaaca atgagatccc ccaggccgct
1080actgcacaca cttccgcttg ggacttcttc agccagcaga gcactgccct ccacagtgcc
1140ttgtggctga tgtctggtaa cggtattcct cgttctttcc gccacatgaa cggctacgga
1200gtccacagct tccgcttcgt cgctgccaat ggcacttcca aggtggtgcg aacaccttgg
1260aagtcccaac agggtgttgc cagtctggtg tgggatgaag ctcaggccgc tgctggtaag
1320aacagtgact accaccgcca ggatctgtac aatgcgatgc ccaatggcca ctacccgaaa
1380tacgaggtca gccaatccct tgatgtctat cgatagagcc ttttgctgac aatcccctag
1440gtccaagccc agatcatgga tgaggctgac atgcttcgtt tcggcttcga ccttctggat
1500cccaccaagt tggtccccga ggaggttgtc ccttacactc ctctcggaat gatggagctc
1560aatgccaacc ccaccaacta ctttgctgaa gttgaacagg ctggtgtatg tattccccat
1620tcatcaaatg ccagacataa tctaacttct gcagttccaa cccggtcacg tcgttcctgg
1680cattgacttc accgacgacc ccctgctgca aggccgtctc ttctcctacc tcgacactca
1740gttgacccgt cacggcggtc ccaacttcga gcaaatcccc gtcaaccgtc ctcgcaagcc
1800cgttcacaac aacaaccgtg acggcttcgg ccagcagcag atccccacca acaactgggc
1860ctacaccccc aacagcatga gcaacggtta ccccatgcaa gccaaccaga cccagggtca
1920tggtttcttc accgcgccct accgctacgc ttccggccat ctcgtccgcc agaccagccc
1980gaccttcaat gaccactggt cccagcccgc catgttctgg aactctctga tccccgctga
2040gcagcagatg gttgtcaacg ccattgtctt tgagaactcc aaggttaaca gcccccacgt
2100tcggaagaac gttgtcaacc agctgaacat ggtcaacaac aacctcgccg tccgtgtcgc
2160tcgtggtctt ggtctcgatg agccctcccc caacccgact tactacacct ccaacaagac
2220ctccaacgtc ggtaccttcg gcaagcccct cctcagcatc gagggtctgc aggtcggctt
2280cctggcctcg aactcccacc ccgaatccat caagcagggc caggccatgg ccgcgcagtt
2340ctctgccgct ggcgtcgacc tgaacattgt caccgaggcc tacgccgatg gtgtcaacac
2400cacctacgcc ctgtctgatg ccatcgactt tgacgccctc atcatcgccg atggtgtgca
2460gagcctcttc gcctcccccg ctctcgctaa ccagatgaac tctaccgcca cctctactct
2520ctaccctcct gccagacctt tccagatcct ggtcgattct ttcaggtacg gtaagcccgt
2580ggctgctgtc ggcagtggca gtgttgcgct caagaacgct ggtattgatt cctcccgctc
2640tggtgtgtac actggctcga gcgagacgac ggagaagatc gccaaggagg tcttggaggg
2700actctacact ttccgttttg tggaccggtt tgcgctggat gagtaagggt atcacgtttg
2760tacttgtact cacgttcatc gtttgtgatg atacattgat tgatcgatag atattttgtg
2820agatagatag agtatactag agwgkacata tctctactga tgaggtgttg tgctgctgca
2880acacatattt atgaatatat attctcttct ttgtgaaagc tagccttcta tataatcagc
2940aatggttaac tcttccaatt ctatagatac caatcaccta acccactcgg aatgacgaca
3000gaaaacatcg acatgttcgc ccaagtaaag ctacttgaac ttctacattt atgctatgct
3060ggagtcctct cataagtcca gaataaacaa agagatccga tcctgctc
310836729PRTAspergillus niger 36Met Arg His Phe Trp Leu Leu Pro Ala Val
Ala Gly Ile Ala Gly Ala 1 5 10
15 Gln Cys Pro Tyr Leu Ser Gly Glu Met Ser Phe Thr Gln Glu Gln
Asp 20 25 30 Asn
Ala Gly Asp Thr Ile Glu Val Thr Glu Gln Pro Ile Asp Asn Thr 35
40 45 Leu Tyr Val Asn Asp Thr
Gly Ser Tyr Met Thr Thr Asp Phe Gly Thr 50 55
60 Pro Ile Ser Asp Gln Thr Ser Leu Lys Ala Gly
Pro Arg Gly Pro Thr 65 70 75
80 Leu Leu Glu Asp Phe Ile Phe Arg Gln Lys Leu Gln Arg Phe Asp His
85 90 95 Glu Arg
Val Pro Glu Arg Val Val His Ala Arg Gly Gly Ala Tyr Gly 100
105 110 Thr Phe Lys Ser Tyr Ala Asp
Trp Ser Asn Val Thr Ala Ala Asp Phe 115 120
125 Leu Ser Ala Asn Asp Lys Glu Thr Pro Met Phe Cys
Arg Phe Ser Thr 130 135 140
Val Val Gly Phe Arg Gly Ser Val Asp Thr Ala Arg Asp Val His Gly 145
150 155 160 His Ala Cys
Arg Phe Tyr Thr Asp Glu Gly Asn Tyr Asp Ile Val Gly 165
170 175 Ile Asn Phe Ala Pro Phe Phe Ile
Gln Asp Ala Ile Gln Phe Pro Asp 180 185
190 Leu Val His Ala Ile Lys Pro Met Pro Asn Asn Glu Ile
Pro Gln Ala 195 200 205
Ala Thr Ala His Thr Ser Ala Trp Asp Phe Phe Ser Gln Gln Ser Thr 210
215 220 Ala Leu His Ser
Ala Leu Trp Leu Met Ser Gly Asn Gly Ile Pro Arg 225 230
235 240 Ser Phe Arg His Met Asn Gly Tyr Gly
Val His Ser Phe Arg Phe Val 245 250
255 Ala Ala Asn Gly Thr Ser Lys Val Val Arg Thr Pro Trp Lys
Ser Gln 260 265 270
Gln Gly Val Ala Ser Leu Val Trp Asp Glu Ala Gln Ala Ala Ala Gly
275 280 285 Lys Asn Ser Asp
Tyr His Arg Gln Asp Leu Tyr Asn Ala Met Pro Asn 290
295 300 Gly His Tyr Pro Lys Tyr Glu Leu
Gln Ala Gln Ile Met Asp Glu Ala 305 310
315 320 Asp Met Leu Arg Phe Gly Phe Asp Leu Leu Asp Pro
Thr Lys Leu Val 325 330
335 Pro Glu Glu Val Val Pro Tyr Thr Pro Leu Gly Met Met Glu Leu Asn
340 345 350 Ala Asn Pro
Thr Asn Tyr Phe Ala Glu Val Glu Gln Ala Gly Phe Gln 355
360 365 Pro Gly His Val Val Pro Gly Ile
Asp Phe Thr Asp Asp Pro Leu Leu 370 375
380 Gln Gly Arg Leu Phe Ser Tyr Leu Asp Thr Gln Leu Thr
Arg His Gly 385 390 395
400 Gly Pro Asn Phe Glu Gln Ile Pro Val Asn Arg Pro Arg Lys Pro Val
405 410 415 His Asn Asn Asn
Arg Asp Gly Phe Gly Gln Gln Gln Ile Pro Thr Asn 420
425 430 Asn Trp Ala Tyr Thr Pro Asn Ser Met
Ser Asn Gly Tyr Pro Met Gln 435 440
445 Ala Asn Gln Thr Gln Gly His Gly Phe Phe Thr Ala Pro Tyr
Arg Tyr 450 455 460
Ala Ser Gly His Leu Val Arg Gln Thr Ser Pro Thr Phe Asn Asp His 465
470 475 480 Trp Ser Gln Pro Ala
Met Phe Trp Asn Ser Leu Ile Pro Ala Glu Gln 485
490 495 Gln Met Val Val Asn Ala Ile Val Phe Glu
Asn Ser Lys Val Asn Ser 500 505
510 Pro His Val Arg Lys Asn Val Val Asn Gln Leu Asn Met Val Asn
Asn 515 520 525 Asn
Leu Ala Val Arg Val Ala Arg Gly Leu Gly Leu Asp Glu Pro Ser 530
535 540 Pro Asn Pro Thr Tyr Tyr
Thr Ser Asn Lys Thr Ser Asn Val Gly Thr 545 550
555 560 Phe Gly Lys Pro Leu Leu Ser Ile Glu Gly Leu
Gln Val Gly Phe Leu 565 570
575 Ala Ser Asn Ser His Pro Glu Ser Ile Lys Gln Gly Gln Ala Met Ala
580 585 590 Ala Gln
Phe Ser Ala Ala Gly Val Asp Leu Asn Ile Val Thr Glu Ala 595
600 605 Tyr Ala Asp Gly Val Asn Thr
Thr Tyr Ala Leu Ser Asp Ala Ile Asp 610 615
620 Phe Asp Ala Leu Ile Ile Ala Asp Gly Val Gln Ser
Leu Phe Ala Ser 625 630 635
640 Pro Ala Leu Ala Asn Gln Met Asn Ser Thr Ala Thr Ser Thr Leu Tyr
645 650 655 Pro Pro Ala
Arg Pro Phe Gln Ile Leu Val Asp Ser Phe Arg Tyr Gly 660
665 670 Lys Pro Val Ala Ala Val Gly Ser
Gly Ser Val Ala Leu Lys Asn Ala 675 680
685 Gly Ile Asp Ser Ser Arg Ser Gly Val Tyr Thr Gly Ser
Ser Glu Thr 690 695 700
Thr Glu Lys Ile Ala Lys Glu Val Leu Glu Gly Leu Tyr Thr Phe Arg 705
710 715 720 Phe Val Asp Arg
Phe Ala Leu Asp Glu 725 37684PRTHumicola
grisea 37Met Asn Arg Val Thr Asn Leu Leu Ala Trp Ala Gly Ala Ile Gly Leu
1 5 10 15 Ala Gln
Ala Thr Cys Pro Phe Ala Asp Pro Ala Ala Leu Tyr Arg Arg 20
25 30 Gln Asp Thr Thr Ser Gly Gln
Ser Pro Leu Ala Ala Tyr Glu Val Asp 35 40
45 Asp Ser Thr Gly Tyr Leu Thr Ser Asp Val Gly Gly
Pro Ile Gln Asp 50 55 60
Gln Thr Ser Leu Lys Ala Gly Ile Arg Gly Pro Thr Leu Leu Glu Asp 65
70 75 80 Phe Met Phe
Arg Gln Lys Ile Gln His Phe Asp His Glu Arg Val Pro 85
90 95 Glu Arg Ala Val His Ala Arg Gly
Ala Gly Ala His Gly Thr Phe Thr 100 105
110 Ser Tyr Ala Asp Trp Ser Asn Ile Thr Ala Ala Ser Phe
Leu Asn Ala 115 120 125
Thr Gly Lys Gln Thr Pro Val Phe Val Arg Phe Ser Thr Val Ala Gly 130
135 140 Ser Arg Gly Ser
Ala Asp Thr Ala Arg Asp Val His Gly Phe Ala Thr 145 150
155 160 Arg Phe Tyr Thr Asp Glu Gly Asn Phe
Asp Ile Val Gly Asn Asn Ile 165 170
175 Pro Val Phe Phe Ile Gln Asp Ala Ile Gln Phe Pro Asp Leu
Ile His 180 185 190
Ser Val Lys Pro Ser Pro Asp Asn Glu Ile Pro Gln Ala Ala Thr Ala
195 200 205 His Asp Ser Ala
Trp Asp Phe Phe Ser Gln Gln Pro Ser Ala Met His 210
215 220 Thr Leu Phe Trp Ala Met Ser Gly
His Gly Ile Pro Arg Ser Tyr Arg 225 230
235 240 His Met Asp Gly Phe Gly Val His Thr Phe Arg Phe
Val Lys Asp Asp 245 250
255 Gly Ser Ser Lys Leu Ile Lys Trp His Phe Lys Ser Arg Gln Gly Lys
260 265 270 Ala Ser Leu
Val Trp Glu Glu Ala Gln Val Leu Ser Gly Lys Asn Ala 275
280 285 Asp Phe His Arg Gln Asp Leu Trp
Asp Ala Ile Glu Ser Gly Asn Gly 290 295
300 Pro Glu Trp Asp Val Cys Val Gln Ile Val Asp Glu Ser
Gln Ala Gln 305 310 315
320 Ala Phe Gly Phe Asp Leu Leu Asp Pro Thr Lys Ile Ile Pro Glu Glu
325 330 335 Tyr Ala Pro Leu
Thr Lys Leu Gly Leu Leu Lys Leu Asp Arg Asn Pro 340
345 350 Thr Asn Tyr Phe Ala Glu Thr Glu Gln
Val Met Phe Gln Pro Gly His 355 360
365 Ile Val Arg Gly Val Asp Phe Thr Glu Asp Pro Leu Leu Gln
Gly Arg 370 375 380
Leu Phe Ser Tyr Leu Asp Thr Gln Leu Asn Arg Asn Gly Gly Pro Asn 385
390 395 400 Phe Glu Gln Leu Pro
Ile Asn Met Pro Arg Val Pro Ile His Asn Asn 405
410 415 Asn Arg Asp Gly Ala Gly Gln Met Phe Ile
His Arg Asn Lys Tyr Pro 420 425
430 Tyr Thr Pro Asn Thr Leu Asn Ser Gly Tyr Pro Arg Gln Ala Asn
Gln 435 440 445 Asn
Ala Gly Arg Gly Phe Phe Thr Ala Pro Gly Arg Thr Val Ser Gly 450
455 460 Ala Leu Val Arg Glu Val
Ser Pro Thr Phe Asn Asp His Trp Ser Gln 465 470
475 480 Pro Arg Leu Phe Phe Asn Ser Leu Thr Pro Val
Glu Gln Gln Phe Leu 485 490
495 Val Asn Ala Met Arg Phe Glu Ile Ser Leu Val Lys Ser Glu Glu Cys
500 505 510 Arg Lys
Asn Val Leu Thr Gln Leu Asn Arg Val Ser His Asp Val Ala 515
520 525 Val Arg Val Ala Ala Ala Ile
Gly Leu Ala Ala Pro Asp Ala Asp Asp 530 535
540 Thr Tyr Tyr His Asn Asn Lys Thr Ala Gly Val Ser
Ile Leu Gly Ser 545 550 555
560 Gly Pro Leu Pro Thr Ile Lys Thr Leu Arg Val Gly Ile Leu Ala Thr
565 570 575 Thr Ser Glu
Ser Ser Ala Leu Asp Gln Ala Ala Gln Leu Arg Thr Arg 580
585 590 Leu Glu Lys Asp Gly Leu Val Val
Thr Val Val Ala Glu Thr Leu Arg 595 600
605 Glu Gly Val Asp Gln Thr Tyr Ser Thr Ala Asp Ala Thr
Gly Phe Asp 610 615 620
Gly Val Val Val Val Asp Gly Ala Ala Ala Leu Phe Ala Ser Thr Ala 625
630 635 640 Ser Ser Pro Leu
Phe Pro Thr Gly Arg Pro Leu Gln Ile Phe Val Asp 645
650 655 Ala Tyr Arg Trp Gly Lys Pro Val Gly
Val Cys Gly Gly Lys Ser Ser 660 665
670 Glu Val Leu Asp Ala Ala Asp Val Pro Glu Asn Gly
675 680 3840DNAArtificial
SequenceSynthetic Construct 38acacaactgg ggatccacca tgcgaggggc atactctctc
403943DNAArtificial SequenceSynthetic Construct
39gtcaccctct agatctaaca agttactcgt gttaatcgtg gaa
434044DNAArtificial SequenceSynthetic Construct 40acacaactgg ggatccacca
tgaacagagt cacgaatctc ctcg 444143DNAArtificial
SequenceSynthetic Construct 41gtcaccctct agatctggta caactcccac cctattcctt
ctc 434236DNAArtificial SequenceSynthetic Construct
42acacaactgg ggatccacca tgcgcgcagt gcagct
364347DNAArtificial SequenceSynthetic Construct 43gtcaccctct agatctgtcg
actattccaa ccttcctata tggacac 47
User Contributions:
Comment about this patent or add new information about this topic: