Patent application title: NOVEL PLASMIDS AND UTILIZATION THEREOF

Inventors:  Eitora Yamamura (Takaoka-Shi, JP)  Noboru Fujimoto (Takaoka-Shi, JP)
Assignees:  DAIICHI FINE CHEMICAL CO., LTD.
IPC8 Class: AC12N121FI
USPC Class: 435/252.33
Class name: Bacteria or actinomycetales; media therefor transformants (e.g., recombinant dna or vector or foreign or exogenous gene containing, fused bacteria, etc.) escherichia (e.g., e. coli, etc.)
Publication date: 2009-02-12
Patent application number: 20090042275



ructed by preparing a DNA region replicable in bacteria belonging to the genus Rhodococcus from a Rhodococcus-derived plasmid having the nucleotide sequence set forth as SEQ ID NO: 73 and a plasmid or its DNA fragment having the nucleotide sequence set forth as SEQ ID NO: 74, and a DNA region replicable in E. coli from an E. coli-derived plasmid or its DNA fragment. An aminoketone asymmetric reductase gene is inserted into the shuttle vector, transformants containing the vector are created, and the aminoketone asymmetric reductase and optically active aminoalcohols are produced.

Claims:

1-24. (canceled)

25. An isolated DNA fragment having the nucleotide sequence set forth as SEQ ID NO: 77.

26. An isolated DNA fragment comprising a promoter region having the nucleotide sequence set forth as SEQ ID NO: 77.

27-28. (canceled)

29. An isolated vector comprising the isolated DNA fragment according to claim 25.

30. The isolated vector according to claim 29, having inserted therein an aminoketone asymmetric reductase gene.

31. The isolated vector according to claim 30, wherein the aminoketone asymmetric reductase gene is a nucleic acid coding for a protein comprising the amino acid sequence set forth as SEQ ID NO: 78, or a nucleic acid that codes for a protein having the amino acid sequence set forth as SEQ ID NO: 78 with a deletion, insertion, substitution or addition of one or a plurality of amino acids, and having aminoketone asymmetric reduction activity.

32. The isolated vector according to claim 30, wherein the aminoketone asymmetric reductase gene is a nucleic acid comprising the nucleotide sequence set forth as SEQ ID NO: 79, or a nucleic acid that hybridizes with nucleic acid having a nucleotide sequence complementary to the nucleotide sequence set forth as SEQ ID NO: 79 under stringent conditions, and that codes for a protein having aminoketone asymmetric reduction activity.

33. A transformant containing the isolated vector according to claim 29.

34. A transformant containing the isolated vector according to claim 30.

35-38. (canceled)

Description:

TECHNICAL FIELD

[0001]The present invention relates to novel plasmids derived from any of microorganisms belonging to the genus Rhodococcus (hereinafter referred to as "the genus Rhodococcus") and to utilization thereof. More specifically, the invention relates to plasmids or their partial DNA fragments (hereinafter also referred to simply as "DNA fragments"), and to shuttle vectors, vectors, transformants, aminoketone asymmetric reductase production methods and optically active aminoalcohol production methods which utilize them.

BACKGROUND ART

[0002]The genus Rhodococcus is known to produce enzymes involved in nitrile metabolism and to produce enzymes which asymmetrically reduce aminoketones. In particular, Rhodococcus erythropolis is known to have very high aminoketone asymmetric reduction activity. Such microorganisms and enzymes act on α-aminoketones to produce optically active β-aminoalcohols with high selectivity and at high yields (for example, Patent documents 1 and 5). Thus, it has long been desired to develop a host-vector system intended for mass production of useful enzymes and the like in the genus Rhodococcus. However, the development of vectors suitable for the genus Rhodococcus as hosts has lagged behind. Only a few strains of the genus Rhodococcus have been found with plasmids, namely Rhodococcus sp. H13-A (Non-patent document 1), Rhodococcus rhodochrous ATCC4276 (Patent document 2), Rhodococcus rhodochrous ATCC4001 (Patent document 3) and Rhodococcus erythropolis IFO12320 (Patent document 4).

[0003][Patent document 1] WO01/73100

[0004][Patent document 2] Japanese Unexamined Patent Publication HEI No. 4-148685

[0005][Patent document 3] Japanese Unexamined Patent Publication HEI No. 4-330287

[0006][Patent document 4] Japanese Unexamined Patent Publication HEI No. 9-28379

[0007][Patent document 5] WO02/070714

[0008][Non-patent document 1] J. Bacteriol., 170, 638, 1988

DISCLOSURE OF THE INVENTION

Problems to be Solved by the Invention

[0009]As mentioned above, it has been desired to develop new vectors for breeding and improving industrially useful strains (mutant strains) of the genus Rhodococcus. In particular, self-cloning systems are preferred from the standpoint of safety of the recombinant DNA microbes and their products, which may be used as foods and additives. It is an object of the present invention to provide novel plasmids that can be used as vectors for such a host-vector system.

[0010]It is desirable to create recombinant microbes suitable for industrial application from among Rhodococcus erythropolis which has aminoketone asymmetric reduction activity. In particular, it is a first object of the invention to provide novel plasmids or their partial DNA fragments which can be used to create such recombinant microbes.

[0011]If a plasmid such as described above can be obtained, it would become easy to construct a shuttle vector that is replicable even in other microbes. It is therefore a second object of the invention to provide nucleotide sequence data relating to DNA replication (replication region, etc.) necessary for construction of such a shuttle vector.

[0012]It is a third object of the invention to provide shuttle vectors that are replicable in both the genus Rhodococcus and E. coli.

[0013]It is a fourth object of the invention to apply the shuttle vectors to an aminoketone asymmetric reductase.

Means for Solving the Problems

[0014]The present inventors carefully screened plasmids for vector construction from among Rhodococcus strains, and as a result discovered several novel plasmids usable as vectors for host-vector systems.

[0015]Furthermore, the present inventors found that it is possible to construct shuttle vectors by transferring into the aforementioned plasmids a drug resistance gene and a gene region that is replicable in E. coli. As a result there were obtained nucleotide sequence data, plasmids and shuttle vectors that achieve the objects stated above, and the present invention has thereupon been completed.

[0016]Specifically, the present invention provides a DNA fragment, a DNA, a plasmid, a shuttle vector, a vector, a transformant, a method for production of an aminoketone asymmetric reductase, and a method for production of an optically active aminoalcohol, according to the following (1) to (39).

(1) A DNA fragment having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 35, SEQ ID NO: 36 and SEQ ID NO: 37.

(2) A plasmid or a partial DNA fragment thereof, characterized by comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 35, SEQ ID NO: 36 and SEQ ID NO: 37.

(3) A DNA fragment having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 22.

(4) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 22.

(5) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 22 and comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 35, SEQ ID NO: 36 and SEQ ID NO: 37.

(6) A DNA fragment having the nucleotide sequence set forth as SEQ ID NO: 76.

(7) A plasmid or a partial DNA fragment thereof, characterized by comprising a promoter region having the nucleotide sequence set forth as SEQ ID NO: 76.

(8) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 22, comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 35, SEQ ID NO: 36 and SEQ ID NO: 37, and comprising a promoter region having the nucleotide sequence set forth as SEQ ID NO: 76.

(9) A circular plasmid characterized by comprising a plasmid or a partial DNA fragment according to any one of (1) to (8), wherein the numbers of restriction endonuclease cleavage sites are BamH I: 2, EcoR I: 2, Kpn I: 1, Pvu II: 1, Sac I: 1 and Sma I: 1, and the size is approximately 5.4 kbp.

(10) A plasmid having the nucleotide sequence set forth as SEQ ID NO: 73.

(11) A plasmid or a DNA fragment according to any one of (1) to (10), characterized by being derived from a bacterium belonging to the genus Rhodococcus.

(12) A DNA fragment having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 70, SEQ ID NO: 71 and SEQ ID NO: 72.

(13) A plasmid or a partial DNA fragment thereof, characterized by comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 70, SEQ ID NO: 71 and SEQ ID NO: 72.

(14) A DNA fragment having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 61, SEQ ID NO: 62 and SEQ ID NO: 69.

(15) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 61, SEQ ID NO: 62 and SEQ ID NO: 69.

(16) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 61, SEQ ID NO: 62 and SEQ ID NO: 69 and comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 70, SEQ ID NO: 71 and SEQ ID NO: 72.

(17) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 61, SEQ ID NO: 62 and SEQ ID NO: 69, comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 70, SEQ ID NO: 71 and SEQ ID NO: 72, and comprising a promoter region having the nucleotide sequence set forth as SEQ ID NO: 76.

(18) A DNA fragment having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 67 and SEQ ID NO: 47.

(19) A plasmid or a partial DNA fragment thereof, characterized by comprising a mobilization protein region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 67 and SEQ ID NO: 47.

(20) A DNA fragment having the nucleotide sequence set forth as SEQ ID NO: 75.

(21) A plasmid or a partial DNA fragment thereof, characterized by comprising a mobilization-related region having the nucleotide sequence set forth as SEQ ID NO: 75.

(22) A circular plasmid characterized by comprising a plasmid or DNA fragment according to any one of (12) to (21), wherein the numbers of restriction endonuclease cleavage sites are BamH I: 2, Pvu II: 4, Sac I: 3 and Sma I: 4, and the size is approximately 5.8 kbp.

(23) A plasmid having the nucleotide sequence set forth as SEQ ID NO: 74.

(24) A plasmid or a DNA fragment according to any one of (12) to (23), characterized by being derived from a bacterium belonging to the genus Rhodococcus.

(25) A DNA fragment having the nucleotide sequence set forth as SEQ ID NO: 77.

(26) A DNA fragment characterized by comprising a promoter region having the nucleotide sequence set forth as SEQ ID NO: 77.

(27) A shuttle vector replicable in bacteria belonging to the genus Rhodococcus and E. coli, and comprising a plasmid or partial DNA fragment thereof according to any one of (1) to (26) and a DNA region replicable in E. coli.

(28) A vector characterized by being constructed using a shuttle vector according to (27).

(29) A vector characterized by comprising a plasmid or DNA fragment according to any one of (6), (7), (25) or (26).

(30) A vector according to (28) or (29), characterized by having inserted therein an aminoketone asymmetric reductase gene.

(31) A vector according to (30), characterized in that the aminoketone asymmetric reductase gene is a nucleic acid coding for a protein consisting of the amino acid sequence set forth as SEQ ID NO: 78, or a nucleic acid that codes for a protein having the amino acid sequence set forth as SEQ ID NO: 78 with a deletion, insertion, substitution or addition of one or a plurality of amino acids, and having aminoketone asymmetric reduction activity.

(32) A vector according to (30), characterized in that the aminoketone asymmetric reductase gene is a nucleic acid consisting of the nucleotide sequence set forth as SEQ ID NO: 79, or a nucleic acid that hybridizes with nucleic acid having a nucleotide sequence complementary to the nucleotide sequence set forth as SEQ ID NO: 79 under stringent conditions, and that codes for a protein having aminoketone asymmetric reduction activity.

(33) A transformant containing a vector according to (28) or (29).

(34) A transformant containing a vector according to any one of (30) to (32).

(35) A method for production of an aminoketone asymmetric reductase, which comprises a culturing step in which a transformant according to (34) is cultured in medium that allows growth of said transformant, and

[0017]a purification step in which the aminoketone asymmetric reductase is purified from said transformant obtained in said culturing step.

(36) A method for production of an optically active aminoalcohol, wherein an aminoketone asymmetric reductase obtained by the production method of (35) is reacted with an enantiomeric mixture of an α-aminoketone compound represented by the following general formula (1):

##STR00001##

wherein X may be the same or different and represents at least one species selected from the group consisting of halogen, lower alkyl, hydroxyl optionally protected with a protecting group, nitro and sulfonyl; n represents an integer of 0 to 3; R1 represents lower alkyl; R2 and R3 may be the same or different and represent at least one species selected from the group consisting of hydrogen and lower alkyl; and * represents asymmetric carbon, or a salt thereof, to produce an optically active aminoalcohol compound represented by the following general formula (2):

##STR00002##

wherein X, n, R1, R2, R3 and * have the same definitions as above, and having the desired optical activity.

(37) A method for production of an optically active aminoalcohol, wherein a transformant according to (34) is reacted with an enantiomeric mixture of an α-aminoketone compound represented by the following general formula (1):

##STR00003##

wherein X may be the same or different and represents at least one species selected from the group consisting of halogen, lower alkyl, hydroxyl optionally protected with a protecting group, nitro and sulfonyl; n represents an integer of 0 to 3; R1 represents lower alkyl; R2 and R3 may be the same or different and represent at least one species selected from the group consisting of hydrogen and lower alkyl; and * represents asymmetric carbon, or a salt thereof, to produce an optically active aminoalcohol compound represented by the following general formula (2):

##STR00004##

wherein X, n, R1, R2, R3 and * have the same definitions as above, and having the desired optical activity.

(38) A production method for an optically active aminoalcohol according to (37), wherein the production method for the optically active aminoalcohol is carried out with further addition of a compound represented by the following general formula (3):

##STR00005##

wherein A represents the following formula (Y) or (Z):

##STR00006##

wherein R4 represents hydrogen, optionally substituted C1-3 alkyl, a C5-10 hydrocarbon ring which is bonded to R8 or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R8,

##STR00007##

wherein R5 represents hydrogen, C1-3 alkyl or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R6 or R9; R6 represents hydrogen, optionally substituted C1-3 alkyl, a C5-10 hydrocarbon ring which is bonded to R8 or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R5 or R9; R7 represents hydrogen or optionally substituted C1-6 alkyl; R8 represents hydrogen, carboxyl, optionally substituted C1-6 alkyl, a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R4 or a C5-10 hydrocarbon ring which is bonded to R6; R9 represents hydrogen, optionally substituted C1-6 alkyl, optionally substituted C1-6 alkyloxycarbonyl, optionally substituted acyl or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R5 or R6; and R10 represents hydrogen or optionally substituted C1-6 alkyl, or a pharmaceutically acceptable salt or solvate thereof, for production of an optically active aminoalcohol.

(39) A shuttle vector according to (27), having a nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 89 to SEQ ID NO: 100.

EFFECT OF THE INVENTION

[0018]The plasmids of the invention are novel plasmids unknown to the prior art, and are valuable as vectors for host-vector systems for the industrially useful genus Rhodococcus. They are of particular utility in the creation of recombinant microbes capable of industrial asymmetric reduction of aminoketones. An example of asymmetric reduction of an aminoketone to which such microbes may contribute is a reaction for production of d-(1S,2S)-pseudoephedrine from 1-2-methylamino-1-phenyl-1-propanone.

[0019]The plasmids of the invention can coexist in a single Rhodococcus cell and therefore can be used not only alone for their replicating function, but also as compatible plasmids. That is, by inserting different protein (for example, enzyme) genes into the different plasmids, it is possible to express the proteins simultaneously in the same cell.

[0020]The shuttle vectors of the invention are useful for creation of industrially useful recombinant microbes of the genus Rhodococcus and Escherichia coli.

[0021]The nucleotide sequence data relating to DNA replication obtained from the plasmids of the invention may serve as the basis for construction of the aforementioned shuttle vectors, and specifically they provide DNA fragments as constituent elements of the vectors.

BRIEF DESCRIPTION OF THE DRAWINGS

[0022]FIG. 1 is a restriction enzyme cleavage map of plasmid pRET1100.

[0023]FIG. 2 is a restriction enzyme cleavage map of plasmid pRET1000.

[0024]FIG. 3 is a summary illustration for construction of shuttle vector pRET1101.

[0025]FIG. 4 is a summary illustration for construction of shuttle vector pRET1102.

[0026]FIG. 5 is a summary illustration for construction of shuttle vector pRET1103.

BEST MODE FOR CARRYING OUT THE INVENTION

[0027]Preferred embodiments of the invention will now be explained.

[0028]The first plasmid of the invention is a plasmid isolated from the genus Rhodococcus, or a derivative thereof. Specifically, it may be isolated from, for example, Rhodococcus erythropolis IAM1400, IAM1503, JCM2893 and JCM2894 strains, has a size of approximately 5.4 kbp and is a circular plasmid cleavable by the restriction enzymes shown in Table 1. The plasmids isolated from each of these strains are designated as pRET1100, pRET1300, pRET1500 and pRET1700, respectively. Plasmids of the invention may be prepared from these sample strains by publicly known methods (for example, boiling, alkali dissolution, cesium chloride density gradient ultracentrifugation: Lab Manual Idenshi Kogaku, 3rd Edition, Chapter 10, pp. 55-59, Maruzen).

TABLE 1

Restriction enzyme   Number of cleavage sites   Fragment sizes (kbp)
BamH I               2                          0.4, 5.0
EcoR I               2                          0.3, 5.1
Kpn I                1                          5.4
Pvu II               1                          5.4
Sac I                1                          5.4
Sma I                1                          5.4

[0029]FIG. 1 shows a restriction enzyme cleavage map for pRET1100. This plasmid was sequenced by a publicly known method (using a fluorescent automatic sequencer, for example) and its full nucleotide sequence was revealed to be 5444 bp set forth as SEQ ID NO: 73 of the Sequence Listing.

[0030]The second plasmid of the invention is also a plasmid isolated from the genus Rhodococcus, or its derivative. Specifically, it may be isolated from, for example, Rhodococcus rhodnii JCM3203, has a size of approximately 5.8 kbp and is a circular plasmid cleavable by the restriction enzymes shown in Table 2. This plasmid is designated as pRET1000.

TABLE 2

Restriction enzyme   Number of cleavage sites   Fragment sizes (kbp)
BamH I               2                          2.0, 3.8
Pvu II               4                          0.1, 1.4, 2.0, 2.3
Sac I                3                          0.9, 1.0, 3.9
Sma I                4                          0.1, 1.2, 1.6, 2.9

[0031]FIG. 2 shows a restriction enzyme cleavage map for pRET1000. This plasmid was also sequenced by a publicly known method and its full nucleotide sequence was revealed to be 5813 bp set forth as SEQ ID NO: 74 of the Sequence Listing.

[0032]The plasmids of the invention (natural- or wild-types) are circular plasmids that can also be defined by the restriction enzyme cleavage patterns shown in Tables 1 and 2. Thus, the present invention encompasses the following two types of circular plasmids.

[0033](1) A circular plasmid derived from a Rhodococcus strain, characterized by having a size of approximately 5.4 kbp and possessing the following restriction enzyme cleavage sites: BamH I:2, EcoR I:2, Kpn I:1, Pvu II:1, Sac I:1 and Sma I:1.

[0034](2) A circular plasmid derived from a Rhodococcus strain, characterized by having a size of approximately 5.8 kbp and possessing the following restriction enzyme cleavage sites: BamH I:2, Pvu II:4, Sac I:3 and Sma I:4.
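The restriction data in Tables 1 and 2 can be checked for internal consistency: for a circular molecule, an enzyme with n cleavage sites yields n fragments, and the fragment sizes should add up to roughly the plasmid size. The following Python sketch illustrates this under stated assumptions; the function names and the cut positions used in the test are hypothetical and are not taken from the Sequence Listing.

```python
# Fragment sizes (kbp) copied from Tables 1 and 2.
TABLE_1 = {  # plasmids of approximately 5.4 kbp (e.g. pRET1100)
    "BamH I": [0.4, 5.0], "EcoR I": [0.3, 5.1], "Kpn I": [5.4],
    "Pvu II": [5.4], "Sac I": [5.4], "Sma I": [5.4],
}
TABLE_2 = {  # plasmid of approximately 5.8 kbp (pRET1000)
    "BamH I": [2.0, 3.8], "Pvu II": [0.1, 1.4, 2.0, 2.3],
    "Sac I": [0.9, 1.0, 3.9], "Sma I": [0.1, 1.2, 1.6, 2.9],
}


def fragments_match_size(table, size_kbp, tol=0.1):
    """True if every enzyme's fragments sum to the plasmid size within tol."""
    return all(abs(sum(frags) - size_kbp) <= tol for frags in table.values())


def circular_fragments(length, cut_positions):
    """Fragment lengths (bp) after cutting a circular molecule at the given positions."""
    cuts = sorted(set(p % length for p in cut_positions))
    if not cuts:
        return [length]  # an uncut circle stays one molecule
    frags = []
    for i, pos in enumerate(cuts):
        nxt = cuts[(i + 1) % len(cuts)]
        # Wrap around the origin; a single cut linearizes the full circle.
        frags.append((nxt - pos) % length or length)
    return sorted(frags)
```

A single cut gives one linear fragment of full length, which is why Kpn I, Pvu II, Sac I and Sma I each show one 5.4 kbp fragment in Table 1.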

[0035]As a result of analysis of the nucleotide sequences of plasmids pRET1100 and pRET1000 (i.e., SEQ ID NO: 73 and SEQ ID NO: 74), there is predicted the existence of a group of nucleotide sequences (open reading frames, hereinafter "orf") coding for proteins for DNA replication or other functions.
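Open reading frames of this kind are typically predicted by scanning a sequence for a start codon followed in frame by a stop codon. The following Python sketch shows the idea only; it is not the analysis method used for SEQ ID NO: 73 and 74, and it makes simplifying assumptions (ATG as the only start codon, forward strand only, a hypothetical `min_codons` cutoff).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=2):
    """Return (start, end) index pairs of ORFs on the forward strand.

    start is the index of the A in ATG; end is one past the stop codon.
    min_codons counts codons before the stop codon, start codon included.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # open a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None  # close it and keep scanning this frame
    return sorted(orfs)
```

A fuller analysis would also scan the reverse complement and allow alternative bacterial start codons.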

[0036]In the relevant technical field, "DNA replication" refers to using DNA itself as template to form two double-stranded DNA molecules exactly identical to existing double-stranded DNA (parent DNA). The replication mechanism consists of three stages: initiation from the starting point of replication (replication origin), DNA chain elongation and termination. During replication, a portion of the DNA double strand is unraveled and new DNA strands are synthesized complementary to each single strand. The double strand is unraveled by DNA helicase and helix-destabilizing proteins (also known as single-strand DNA-binding proteins), and the unraveled portion is referred to as the replication fork. The template DNA in the direction from 3' to 5' toward the replication fork is the "leading strand", and the one in the direction from 5' to 3' is the "lagging strand". DNA polymerase extends the DNA strand in the direction from 5' to 3'. Therefore, when the leading strand is the template, DNA is synthesized in the direction of the replication fork. However, when the opposite lagging strand is the template, the DNA strand must be extended in the opposite direction from the replication fork. Consequently, replication of the lagging strand is accomplished in fragments of about 200 bases, known as Okazaki fragments. Every approximately 200 bases, an RNA primer of about 10 bases is synthesized in the direction from 5' to 3' using the DNA as template. Starting from this RNA primer, DNA polymerase synthesizes a DNA strand in the direction from 5' to 3' on the lagging strand as template. The replicated DNA fragment of approximately 200 bases is then joined to the adjacent DNA strand after the RNA is removed. In this replication mechanism, several proteins including DNA helicase and helix-destabilizing protein work together to form the replicating machinery. Other proteins involved include DNA topoisomerase (which prevents twisting during the DNA replication), replication initiation proteins and replication termination proteins. The DNA replication mechanism is described in detail in, for example, "Saibou no Bunshiseibutsugaku [Molecular Biology of the Cell]", 3rd Edition, translated by Keiko Nakamura et al., pp. 251-262, Kyoikusha, 1996.
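Two points of the description above lend themselves to a small illustration: a new strand is the reverse complement of its template when both are written 5' to 3', and a lagging-strand template is copied in pieces of about 200 bases. The Python sketch below is a toy model under those assumptions; the function names are hypothetical and primers, ligation and the enzymes themselves are not modeled.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def new_strand(template_5to3):
    """Return the strand synthesized on a template, written 5'->3'."""
    # Complement each base, then reverse so the result also reads 5'->3'.
    return template_5to3.upper().translate(COMPLEMENT)[::-1]


def okazaki_lengths(template_len, fragment_len=200):
    """Split a lagging-strand template length into Okazaki-sized pieces."""
    full, rest = divmod(template_len, fragment_len)
    return [fragment_len] * full + ([rest] if rest else [])
```

For example, a 450-base stretch of lagging-strand template would be covered by two 200-base fragments and one 50-base fragment in this model.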

[0037]Upon analysis of the nucleotide sequences of the plasmids pRET1100 and pRET1000, they were found to include sequences of AT-rich homologous or analogous repeats and a sequence thought to have a DNA secondary structure, i.e. a nucleotide sequence predicted to be a DNA replication region (a nucleotide sequence region recognized by proteins involved in DNA replication or a region including the DNA replication origin), in the vicinity of the aforementioned orf relating to DNA replication.

[0038]DNA replication requires a DNA replication region and a region coding for a protein involved in DNA replication (hereinafter referred to as "DNA replication-related protein"). According to the present invention it is possible to obtain data relating to the nucleotide sequences of these regions for both plasmids pRET1100 and pRET1000.

[0039]First, the nucleotide sequences set forth as SEQ ID NO: 35-37 were identified as DNA replication regions for plasmid pRET1100. As regions coding for proteins related to DNA replication there were identified the nucleotide sequences set forth as SEQ ID NO: 1-3 (orf1), the nucleotide sequence set forth as SEQ ID NO: 4 (orf2), the nucleotide sequences set forth as SEQ ID NO: 5-16 (orf3), the nucleotide sequences set forth as SEQ ID NO: 17-21 (orf4), the nucleotide sequences set forth as SEQ ID NO: 22-26 (orf5), the nucleotide sequence set forth as SEQ ID NO: 27 or 28 (orf6), the nucleotide sequence set forth as SEQ ID NO: 29 or 30 (orf7), the nucleotide sequence set forth as SEQ ID NO: 31 or 32 (orf8), and the nucleotide sequence set forth as SEQ ID NO: 33 or 34 (orf9).

[0040]Construction of a plasmid capable of DNA replication from pRET1100 requires that the recombinant plasmid have at least one DNA replication region and at least one coding region (orf) for a DNA replication-related protein. Thus, the (recombinant) plasmids of the invention are characterized by comprising at least one DNA replication region and at least one coding region for a DNA replication-related protein. The coding region for a DNA replication-related protein preferably has a nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 1, 4, 14, 17 and 22.

[0041]The region of the nucleotide sequence set forth as SEQ ID NO: 76 has been suggested as a promoter involved in expression of replication-related proteins, and the plasmids of the invention preferably comprise a promoter region having the nucleotide sequence set forth as SEQ ID NO: 76.

[0042]For plasmid construction, the DNA fragments are appropriately selected based on the aforementioned nucleotide sequence data. The present invention also encompasses derivatives or functional (DNA-replicating) fragments of the plasmids.

[0043]Next, the nucleotide sequences set forth as SEQ ID NO: 70-72 were identified as DNA replication regions for plasmid pRET1000. As regions coding for proteins related to DNA replication there were identified the nucleotide sequences set forth as SEQ ID NO: 38-41 (orf10), the nucleotide sequence set forth as SEQ ID NO: 42 or 43 (orf11), the nucleotide sequence set forth as SEQ ID NO: 44 (orf12), the nucleotide sequence set forth as SEQ ID NO: 45 or 46 (orf13), the nucleotide sequences set forth as SEQ ID NO: 48-56 (orf14), the nucleotide sequence set forth as SEQ ID NO: 51 or 52 (orf15), the nucleotide sequence set forth as SEQ ID NO: 53 or 54 (orf16), the nucleotide sequence set forth as SEQ ID NO: 55 (orf17), the nucleotide sequences set forth as SEQ ID NO: 56-60 (orf18), the nucleotide sequence set forth as SEQ ID NO: 61 (orf19), the nucleotide sequence set forth as SEQ ID NO: 62 (orf20), and the nucleotide sequences set forth as SEQ ID NO: 63-69 (orf21).

[0044]Construction of a plasmid capable of DNA replication from pRET1000 requires that the recombinant plasmid have at least one DNA replication region and at least one coding region (orf) for a DNA replication-related protein. Thus, the (recombinant) plasmids of the invention are characterized by comprising at least one DNA replication region and at least one coding region for a DNA replication-related protein. The coding region for a DNA replication-related protein preferably has a nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 40, 42, 44, 45, 53, 55, 56, 61, 62 and 69.
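The minimum requirement stated for both plasmids (at least one DNA replication region plus at least one coding region for a DNA replication-related protein) can be expressed as a simple set check. The Python sketch below is illustrative only: the names are hypothetical, and it treats the preferred coding-region SEQ ID NOs as the accepted set, which is a simplifying assumption rather than a limit stated in the text.

```python
# SEQ ID NOs named in the description for each source plasmid.
REQUIREMENTS = {
    "pRET1100": {
        "replication_regions": {35, 36, 37},
        "coding_regions": {1, 4, 14, 17, 22},
    },
    "pRET1000": {
        "replication_regions": {70, 71, 72},
        "coding_regions": {40, 42, 44, 45, 53, 55, 56, 61, 62, 69},
    },
}


def meets_minimum(seq_ids, source):
    """True if a construct has >=1 replication region and >=1 coding region."""
    req = REQUIREMENTS[source]
    ids = set(seq_ids)
    has_replication_region = bool(ids & req["replication_regions"])
    has_coding_region = bool(ids & req["coding_regions"])
    return has_replication_region and has_coding_region
```

A construct carrying only a replication region, for instance `meets_minimum([35], "pRET1100")`, fails the check because no coding region is present.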

[0045]The regions with the nucleotide sequences set forth as SEQ ID NO: 67 and 47 are homologous with mobilization proteins, and have been implicated in mobilization. The region with the nucleotide sequence set forth as SEQ ID NO: 75 has been implicated in gene expression of mobilization protein and suggested as a recognition site for an expressed protein. Thus, the plasmids of the invention preferably include mobilization protein regions having the nucleotide sequences set forth as SEQ ID NO: 67 and 47, or include a region involved in mobilization having the nucleotide sequence set forth as SEQ ID NO: 75.

[0046]For plasmid construction, the DNA fragments are appropriately selected based on the aforementioned nucleotide sequence data. The present invention also encompasses derivatives or functional (DNA-replicating) fragments of the plasmids.

[0047]The plasmids or DNA fragments of the invention may also contain nucleotide sequences with a substitution, deletion or insertion of one or a plurality of nucleotides in a DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, so long as the function of each region is not impaired.

[0048]The shuttle vectors of the invention may be any which comprise a plasmid or DNA fragment having a DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, and a DNA region that is replicable in E. coli, and which are replicable in the genus Rhodococcus and E. coli, such as those having the nucleotide sequences set forth as SEQ ID NO: 89 to 100. The shuttle vectors of the invention may also have nucleotide sequences with one or a plurality of nucleotide substitutions, deletions or insertions in the aforementioned nucleotide sequences, so long as they are replicable in the genus Rhodococcus and E. coli.

[0049]The "plurality" referred to above will differ depending on the type of region, and specifically may be 2-1100, preferably 2-800, more preferably 2-300, even more preferably 2-100, yet more preferably 2-20 and most preferably 2-10.

[0050]As a plasmid or DNA fragment having substantially the same nucleotide sequence as the aforementioned DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, there may be mentioned, specifically, a nucleotide sequence which hybridizes with a DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, under stringent conditions. Here "stringent conditions" are conditions under which specific hybrids are formed and non-specific hybrids are not formed. While it is difficult to precisely quantify the conditions, one example is a set of conditions that permit hybridization of DNA with high homology, such as 80% or greater, preferably 90% or greater or more preferably 95% or greater homology, while not permitting hybridization of DNA with lower homology. More specifically, there may be mentioned hybridization conditions corresponding to ordinary Southern hybridization washing at 60° C., 1×SSC, 0.1% SDS, or preferably washing at a salt concentration corresponding to 0.1×SSC, 0.1% SDS. When a DNA fragment with a length of approximately 300 bp is used as a portion of the DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, the hybridization washing conditions may be 50° C., 2×SSC, 0.1% SDS.
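
As a rough illustration of the homology levels mentioned above, the following sketch computes the percent identity of two sequences and maps it to the 80%, 90% and 95% levels. It assumes the two sequences are already aligned, of equal length and free of gaps, and it treats simple identity as a stand-in for homology. The function names are hypothetical. Actual hybridization behavior also depends on fragment length, salt concentration and temperature, which this sketch does not model.

```python
def percent_identity(seq_a, seq_b):
    """Percent of identical positions in two pre-aligned, equal-length sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for x, y in zip(seq_a.upper(), seq_b.upper()) if x == y)
    return 100.0 * matches / len(seq_a)


def homology_level(pct):
    """Map a percent identity to the example levels given in the text."""
    if pct >= 95:
        return "more preferred (95% or greater)"
    if pct >= 90:
        return "preferred (90% or greater)"
    if pct >= 80:
        return "high homology (80% or greater)"
    return "below the 80% example level"
```

For two 10-nucleotide sequences differing at one position, `percent_identity` gives 90.0, which `homology_level` reports as the "preferred" level.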

[0051]The aforementioned plasmid or DNA fragment having substantially the same nucleotide sequence as the aforementioned DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, may be obtained by, for example, modification of a DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, by site-directed mutagenesis so as to have a substitution, deletion or insertion of nucleotides at a specific site. Such modified DNA may also be obtained by mutation treatment known in the prior art. As mutation treatments there may be mentioned methods of in vitro treatment of DNA including a DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, with hydroxylamine or the like, and methods of treating a microbe possessing the DNA above, such as the genus Escherichia, with ultraviolet rays or with a mutagenic agent ordinarily used for mutagenesis such as N-methyl-N'-nitro-N-nitrosoguanidine (NTG) or EMS.

[0052]Nucleotide substitutions, deletions or insertions as mentioned above include those found in naturally occurring mutants or variants due to differences in Rhodococcus strains.

[0053]A shuttle vector of the invention includes a DNA fragment (A) as the aforementioned plasmid or portion thereof, and a DNA region (B) which is replicable in E. coli. In some cases it is preferred for the shuttle vector to comprise a DNA region including a drug resistance gene. In the relevant technical field, a "shuttle vector" is a vector which comprises the DNA replication mechanism for two different cell types, and preferably also a drug resistance gene or the like as a selective marker, allowing its auto-replication in the two different cell types. The DNA fragment (A) as the aforementioned plasmid or portion thereof is a DNA region that is replicable in the genus Rhodococcus. The DNA region (B) which is replicable in E. coli may be a full plasmid or a portion thereof, so long as it can be replicated and amplified in E. coli. As such DNA regions that are replicable in E. coli there may be used, for example, the plasmids pUC18, pHSG299 and pHSG398.

[0054]When the shuttle vector of the invention includes a drug resistance gene, the preferred ones are the ampicillin resistance gene, kanamycin resistance gene and chloramphenicol resistance gene, but there are no particular restrictions on the type of drug so long as the gene is expressed in the genus Rhodococcus and E. coli as hosts and confers drug resistance on the host cells, so that the presence of plasmids in the two genera can be verified based on resistance to the drug. Also, a plurality of such drug resistance genes may be used in combination.

[0055]The shuttle vector preferably contains multiple cloning sites (multicloning sites), and the cloning sites and drug resistance gene may be derived from, for example, an E. coli plasmid. That is, a publicly known E. coli plasmid such as one listed above may be cleaved with an appropriate restriction endonuclease, and a DNA region containing the cloning sites and drug resistance gene may be constructed and ligated with another DNA fragment (a DNA region which is replicable in the genus Rhodococcus).

[0056]As an illustration, an outline of shuttle vector construction is shown in FIGS. 3 to 5. The shuttle vectors may be constructed by treating the aforementioned plasmids and E. coli plasmids with suitable restriction endonucleases and then ligating them. In this manner, the present inventors constructed 18 shuttle vectors (Table 5) using the Rhodococcus plasmids pRET1000, pRET1100 or pRET1200, and the E. coli plasmids pUC18, pHSG299 or pHSG398.

[0057]The shuttle vectors of the invention are replicable in the genus Rhodococcus and E. coli as hosts, and are industrially useful. The Rhodococcus and E. coli strains transformed by the shuttle vectors of the invention, as well as other microbial transformants, are useful in this way and such transformants are also encompassed by the scope of the invention.

[0058]A vector of the invention is characterized by being constructed using a shuttle vector of the invention. Specifically, it is a vector in which the target DNA to be introduced has been inserted into the shuttle vector of the invention. The DNA to be introduced and the shuttle vector of the invention are treated with appropriate restriction endonucleases and then ligated to construct the vector. The vector may then be used to obtain transformants having the desired DNA transferred therein.

[0059]As examples of DNA to be inserted there may be mentioned aminoketone asymmetric reductase genes and coenzyme-regenerating system enzyme genes. Aminoketone asymmetric reductase genes are genes coding for aminoketone asymmetric reductases as described in WO02/070714, and more specifically, DNA coding for a protein comprising the amino acid sequence set forth as SEQ ID NO: 78 (aminoketone asymmetric reductase derived from R. erythropolis MAK-34), and particularly DNA comprising the nucleotide sequence set forth as SEQ ID NO: 79. The entirety of the content described in WO02/070714 is incorporated herein by reference.

[0060]An aminoketone asymmetric reductase is any having the properties described in WO02/070714, and includes a protein having the amino acid sequence set forth as SEQ ID NO: 78 of the Sequence Listing, as well as proteins having amino acid sequences obtained by deletion, insertion, substitution or addition of one or more amino acids in the aforementioned amino acid sequence, and exhibiting aminoketone asymmetric reduction activity. Aminoketone asymmetric reduction activity is activity of producing an optically active aminoalcohol represented by general formula (2) above using an α-aminoketone represented by general formula (1) above as the substrate.

[0061]There are no particular restrictions on the methods of deletion, insertion, substitution and addition, and any publicly known methods may be employed. For example, there may be mentioned the methods described in "Zoku Seikagaku Jikken Kouza 1, Idenshi Kenkyuhou II", edited by the Japanese Biochemical Society, p105 (Hirose, S.), Tokyo Kagaku Dojin (1986); "Shin Seikagaku Jikken Kouza 2, Kakusan III (Recombinant DNA Technology)", edited by the Japanese Biochemical Society, p. 233 (Hirose, S.), Tokyo Kagaku Dojin (1992); R. Wu, L. Grossman ed., "Methods in Enzymology", Vol. 154, p. 350 & p. 367, Academic Press, New York (1987); R. Wu, L. Grossman, ed., "Methods in Enzymology", Vol. 100, p. 457 & p. 468, Academic Press, New York (1983); J. A. Wells et al., "Gene", Vol. 34, p. 315 (1985); T. Grundstroem et al., "Nucleic Acids Res", Vol. 13, p. 3305 (1985); J. Taylor et al., "Nucleic Acids Res.", Vol. 13, p. 8765 (1985); R. Wu, ed., "Methods in Enzymology", Vol. 155, p. 568, Academic Press, New York (1987); and A. R. Oliphant et al., "Gene", Vol. 44, p. 177 (1986). As specific examples, there may be mentioned the site-directed mutagenesis method (site-specific mutagenesis method) utilizing synthetic oligonucleotides, the Kunkel method, the dNTP[αS] method (Eckstein method), and the region-directed mutagenesis method using sulfurous acid or nitrous acid.

[0062]Sugar chains are attached to the majority of proteins, and substitution of one or a plurality of amino acids can modify the attachment of sugar chains. Thus, the aminoketone asymmetric reductases of the invention also include proteins having the amino acid sequence set forth as SEQ ID NO: 78 of the Sequence Listing and having modifications of sugar chains, so long as they exhibit the aforementioned aminoketone asymmetric reduction activity.

[0063]The aminoketone asymmetric reductases of the invention may also have modifications of their amino acid residues by chemical methods, or their derivatives may be enhanced by modification or partial degradation using peptidase enzymes such as pepsin, chymotrypsin, papain, bromelain, endopeptidase and exopeptidase.

[0064]When the aminoketone asymmetric reductases of the invention are produced by a gene recombinant method, a fusion protein may be expressed and then converted or processed into a protein having biological activity which is substantially equivalent to a natural aminoketone asymmetric reductase either in vivo or ex vivo. In this case, a fusion production method ordinarily employed for genetic engineering may be used, and the fusion protein may be purified by affinity chromatography or the like, utilizing the fused portion thereof. Modification and enhancement of protein structures may be carried out with reference to "Shin Seikagaku Jikken Kouza 1, Tanpakushitsu VII, Tanpakushitsu Kogaku", edited by the Japanese Biochemical Society, Tokyo Kagaku Dojin (1993), by the methods described therein, the methods described in literature cited therein, or methods which are essentially equivalent thereto.

[0065]The aminoketone asymmetric reductase of the invention may also differ from naturally occurring forms in the identities of one or more of the amino acid residues or in the positions of one or more of the amino acid residues. The present invention also encompasses deletion analogues with deletion of one or more (for example, 1-80, preferably 1-60, more preferably 1-40, even more preferably 1-20 and especially 1-10) amino acid residues, substitution analogues with substitution of one or more (for example, 1-80, preferably 1-60, more preferably 1-40, even more preferably 1-20 and especially 1-10) amino acid residues or addition analogues with addition of one or more (for example, 1-80, preferably 1-60, more preferably 1-40, even more preferably 1-20 and especially 1-10) amino acid residues peculiar to natural aminoketone asymmetric reductases. Also encompassed are enzymes having the domain structure characteristic of natural aminoketone asymmetric reductases. There may also be mentioned isomers of the aminoketone asymmetric reductases.

[0066]So long as the domain structure characteristic of natural aminoketone asymmetric reductases is maintained, all mutants above are also encompassed among the aminoketone asymmetric reductases of the invention. In addition, it is assumed that enzymes having a primary structural conformation substantially equivalent to natural aminoketone asymmetric reductases of the invention, or a portion thereof, as well as enzymes having biological activity substantially equivalent to natural aminoketone asymmetric reductases, may also be included. Naturally occurring mutants may also be mentioned. The aminoketone asymmetric reductases of the invention may be separated and purified in the manner explained below. The present invention encompasses DNA fragments coding for the aforementioned polypeptides, polypeptides of aminoketone asymmetric reductases having all or some of the natural features, and DNA fragments coding for analogues or derivatives thereof. The nucleotides of the aminoketone asymmetric reductases may be modified (for example, with addition, deletions or substitutions), and such modified forms are also encompassed by the invention.

[0067]An aminoketone asymmetric reductase gene according to the invention is a nucleic acid coding for any of the aforementioned aminoketone asymmetric reductases. As representative examples there may be mentioned nucleic acid coding for a protein having the amino acid sequence set forth as SEQ ID NO: 78 of the Sequence Listing, and especially nucleic acid having the nucleotide sequence set forth as SEQ ID NO: 79, but since several nucleotide sequences (codons) can code for each amino acid, there exist numerous nucleic acids coding for a protein having the amino acid sequence set forth as SEQ ID NO: 78. Thus, all such nucleic acids are also encompassed among the aminoketone asymmetric reductase genes of the invention. Here, "coding for a protein" means that, when the DNA consists of two strands, one of the two complementary strands has a nucleotide sequence coding for the protein, and therefore the nucleic acids of the invention include nucleic acids comprising nucleotide sequences directly coding for the amino acid sequence set forth as SEQ ID NO: 78 and nucleic acids comprising nucleotide sequences which are complementary thereto. In addition, the aminoketone asymmetric reductase genes of the invention may be nucleic acids which hybridize with nucleic acid comprising a nucleotide sequence complementary to SEQ ID NO: 79 under stringent conditions, and which code for proteins with aminoketone asymmetric reduction activity. Here, "stringent conditions" has the same definition as explained above.
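
The two points made above, that many nucleotide sequences can code for one amino acid sequence and that the complementary strand is also encompassed, can be illustrated with a short sketch. It assumes the standard genetic code, and the function names are hypothetical. The peptides used in the examples are arbitrary and are not taken from SEQ ID NO: 78.

```python
from itertools import product
from math import prod

# Standard genetic code, codons enumerated in T, C, A, G order ("*" = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of synonymous codons for each amino acid (and for stop).
CODONS_PER_AMINO_ACID = {}
for aa in CODON_TABLE.values():
    CODONS_PER_AMINO_ACID[aa] = CODONS_PER_AMINO_ACID.get(aa, 0) + 1


def count_coding_sequences(protein):
    """Number of distinct nucleotide sequences that code for the given protein."""
    return prod(CODONS_PER_AMINO_ACID[aa] for aa in protein.upper())


def reverse_complement(dna):
    """Sequence of the complementary strand, read 5' to 3'."""
    pairs = {"A": "T", "T": "A", "C": "G", "G": "C"}
    return "".join(pairs[base] for base in reversed(dna.upper()))
```

For instance, the dipeptide Met-Leu ("ML") can be coded by 1 × 6 = 6 different sequences, and the count grows multiplicatively with protein length, which is why the number of nucleic acids coding for a full-length protein is very large.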

[0068]The coenzyme-regenerating system enzyme gene may be one for various dehydrogenases, specifically, glucose dehydrogenase, glucose-6-phosphate dehydrogenase, aldehyde dehydrogenases, alcohol dehydrogenases, organic acid dehydrogenases and amino acid dehydrogenases. More specifically, there may be suitably used acetaldehyde dehydrogenase, ethanol dehydrogenase, propanol dehydrogenase, glycerol dehydrogenase, formate dehydrogenase, acetate dehydrogenase, butyrate dehydrogenase, lactate dehydrogenase, malate dehydrogenase and glutamate dehydrogenase.

[0069]A transformant according to the invention is characterized by comprising the aforementioned vector. The transformant is obtained by introducing the vector into host cells. The vector introduction method may be a publicly known method, such as the calcium phosphate method, lipofection, electroporation, microinjection or the like.

[0070]For example, a transformant of the invention comprising a vector having an aminoketone asymmetric reductase gene inserted therein has aminoketone asymmetric reduction activity, and may be applied for an aminoketone asymmetric reductase production method or optically active aminoalcohol production method as described below.

[0071]The method for production of an aminoketone asymmetric reductase of the invention is characterized by comprising a culturing step in which transformants containing a vector having an aminoketone asymmetric reductase gene inserted therein are cultured in medium which allows growth of the transformants, and a purification step in which the aminoketone asymmetric reductase is purified from the transformants obtained in the culturing step.

[0072]The method for culturing may be a publicly known method with no particular restrictions so long as it permits growth of the cells used, and ordinarily a liquid medium containing a carbon source, nitrogen source and other nutrients is used. As carbon sources for the medium there may be used any of those that can be utilized by the cells. Specifically, there may be mentioned sugars such as glucose, fructose, sucrose, dextrin, starch and sorbitol, alcohols such as methanol, ethanol and glycerol, organic acids such as fumaric acid, citric acid, acetic acid and propionic acid, and their salts, hydrocarbons such as paraffin, and mixtures thereof. As nitrogen sources there may be used any of those that can be utilized by the cells. Specifically, there may be mentioned ammonium salts of inorganic acids such as ammonium chloride, ammonium sulfate and ammonium phosphate; ammonium salts of organic acids such as ammonium fumarate and ammonium citrate; nitric acid salts such as sodium nitrate and potassium nitrate; and inorganic or organic nitrogenous compounds such as meat extract, yeast extract, malt extract and peptone, as well as mixtures thereof. The medium may also contain appropriately added nutrient sources ordinarily used for culturing, such as inorganic salts, trace metal salts and vitamins. When necessary, there may also be added to the medium substances that promote cell growth and buffering substances effective for maintaining the pH of the medium.

[0073]The culturing of the cells may be carried out under conditions suitable for growth. Specifically, the medium pH may be 3-10, preferably 4-9, and the temperature may be 0-50° C., preferably 20-40° C. The cell culturing may be conducted either under aerobic or anaerobic conditions. The culturing time is preferably 10-150 hours, but should be appropriately determined for the type of cells used.

[0074]The culture solution of the cells cultured in the manner described above is filtered or centrifuged and the cells are rinsed with water or buffer solution. The rinsed cells are suspended in a suitable amount of buffer solution for disruption of the cells. The method of disruption is not particularly restricted but as examples there may be mentioned mechanical disruption with a mortar, Dynomill, French press, ultrasonic cell disrupter or the like. The aminoketone asymmetric reductase in the cell-free extract obtained by filtration or centrifugation of the solid matter from the cell disruptate is recovered by an ordinary enzyme isolating method.

[0075]There are no particular restrictions on the method for isolation of the enzyme and any publicly known method may be employed, but as examples there may be mentioned purification by salting out such as ammonium sulfate precipitation; gel filtration methods using Sephadex and the like; ion-exchange chromatography methods using carriers with diethylaminoethyl groups or carboxymethyl groups; hydrophobic chromatography using carriers with hydrophobic groups such as butyl, octyl and phenyl; dye gel chromatography methods; electrophoresis methods; dialysis; ultrafiltration methods; affinity chromatography methods; high performance liquid chromatography methods and the like.

[0076]The enzyme may also be used as an immobilized enzyme. There are no particular restrictions on the method and any publicly known method may be employed, among which there may be mentioned immobilization of the enzyme or the enzyme-producing cells, and the immobilization may be accomplished by a carrier bonding method such as a covalent bonding method or adsorption method, a crosslinking method, entrapment method or the like. A condensing agent such as glutaraldehyde, hexamethylene diisocyanate or hexamethylene diisothiocyanate may also be used if necessary. Other immobilizing methods include: a monomer method in which a monomer is gelled by polymerizing reaction; a prepolymer method in which molecules larger than monomers are polymerized; a polymer method in which a polymer is gelled; immobilization using polyacrylamide; immobilization using natural polymers such as alginic acid, collagen, gelatin, agar and κ-carrageenan; and immobilization using synthetic polymers such as photosetting resins and urethane polymers.

[0077]The enzyme purified in this manner is judged as having been adequately purified if a single band is confirmed in electrophoresis (SDS-PAGE, etc.).

[0078]A method for production of an optically active aminoalcohol according to the invention is characterized by producing an optically active aminoalcohol compound represented by the following general formula (2), which compound exhibits the desired optical activity, by reacting an aminoketone asymmetric reductase obtained by the production method of the invention with an enantiomeric mixture of an α-aminoketone compound represented by the following general formula (1) or a salt thereof.

##STR00008##

wherein X may be the same or different and represents at least one species selected from the group consisting of halogen, lower alkyl, hydroxyl optionally protected with a protecting group, nitro and sulfonyl; n represents an integer of 0 to 3; R1 represents lower alkyl; R2 and R3 may be the same or different and represent at least one species selected from the group consisting of hydrogen and lower alkyl; and * represents asymmetric carbon.

##STR00009##

wherein X, n, R1, R2, R3 and * have the same definitions as above.

[0079]First, the α-aminoketone compound represented by general formula (1) according to the invention will be explained.

[0080]The substituent X is as follows. As the aforementioned halogen there may be mentioned fluorine, chlorine, bromine and iodine.

[0081]As lower alkyl there are preferred C1-6 alkyl, among which there may be mentioned methyl, ethyl, propyl, isopropyl, butyl, isobutyl, s-butyl, t-butyl, pentyl, isopentyl, hexyl and the like. These may have straight-chain or branched structures. As substituents they may contain halogens such as fluorine or chlorine, or hydroxyl, alkyl, amino, alkoxy and the like.

[0082]As protecting groups for hydroxyl optionally protected with a protecting group there may be mentioned groups that can be removed by treatment with water, groups that can be removed by acid or weak base treatment, groups that can be removed by hydrogenation or groups that can be removed with Lewis acid catalysts and thiourea, and such protecting groups include optionally substituted acyl, optionally substituted silyl, alkoxyalkyl, optionally substituted lower alkyl, benzyl, p-methoxybenzyl, 2,2,2-trichloroethoxycarbonyl, allyloxycarbonyl, trityl and the like.

[0083]The aforementioned acyl groups include acetyl, chloroacetyl, dichloroacetyl, pivaloyl, benzoyl, p-nitrobenzoyl and the like. They may contain hydroxyl, alkyl, alkoxy, nitro, halogen and the like as substituents. The aforementioned silyl groups include trimethylsilyl, t-butyldimethylsilyl, triarylsilyl and the like. They may contain alkyl, aryl, hydroxyl, alkoxy, nitro, halogen and the like as substituents. The aforementioned alkoxyalkyl groups include methoxymethyl, 2-methoxyethoxymethyl and the like. The aforementioned lower alkyl include C1-6 alkyl, among which there may be mentioned methyl, ethyl, propyl, isopropyl, butyl, isobutyl, s-butyl, t-butyl, pentyl, isopentyl, hexyl and the like. These may have straight-chain or branched structures. As substituents they may contain halogen such as fluorine or chlorine, or hydroxyl, alkyl, amino, alkoxy and the like.

[0084]X may be nitro or sulfonyl, and specifically there may be mentioned methylsulfonyl and the like.

[0085]The number "n" for X is an integer of 0-3, and is preferably 0.

[0086]R1 in general formula (1) above represents lower alkyl. As lower alkyl there are preferred C1-6 alkyl, among which there may be mentioned methyl, ethyl, propyl, isopropyl, butyl, isobutyl, s-butyl, t-butyl, pentyl, isopentyl, hexyl and the like. These may have straight-chain or branched structures.

[0087]Each of R2 and R3 represents hydrogen or lower alkyl. The lower alkyl includes C1-6 alkyl, among which there may be mentioned methyl, ethyl, propyl, isopropyl, butyl, isobutyl, s-butyl, t-butyl, pentyl, isopentyl, hexyl and the like. These may have straight-chain or branched structures.

[0088]As salts of the aforementioned α-aminoketone compounds there may be mentioned salts of inorganic acids such as hydrochloride, sulfate, nitrate, phosphate and carbonate, and salts of organic acids such as acetic acid and citric acid.

[0089]The α-aminoketone can be easily synthesized by halogenation (for example, bromination) of the α-carbon of a corresponding 1-phenylketone derivative, followed by replacement of the halogen such as bromine with an amine (Ger. (East), 11, 332, Mar. 12, 1956).

[0090]The optically active aminoalcohol represented by general formula (2) above according to the invention will now be explained. In general formula (2), X, n, R1, R2, R3 and * have the same definitions as in general formula (1) above. As β-aminoalcohols having the desired optical activity there may be mentioned (1S,2S)aminoalcohols. As specific examples of (1S,2S)aminoalcohols there may be mentioned d-threo-2-methylamino-1-phenylpropanol (d-pseudoephedrine), d-threo-2-dimethylamino-1-phenylpropanol (d-methylpseudoephedrine), (1S,2S)-α-(1-aminoethyl)-benzyl alcohol (d-norpseudoephedrine), (1S,2S)-1-(p-hydroxyphenyl)-2-methylamino-1-propanol, (1S,2S)-α-(1-aminoethyl)-2,5-dimethoxy-benzyl alcohol, (1S,2S)-1-(m-hydroxyphenyl)-2-amino-1-propanol, (1S,2S)-1-(p-hydroxyphenyl)-2-amino-1-propanol, (1S,2S)-1-phenyl-2-ethylamino-1-propanol, (1S,2S)-1-phenyl-2-amino-1-butanol, (1S,2S)-1-phenyl-2-methylamino-1-butanol and the like.

[0091]The conditions for reaction of the aminoketone asymmetric reductase are not particularly restricted so long as an optically active aminoalcohol represented by general formula (2) having the desired optical activity is produced, but since the enzyme optimum pH is 8.1 and the optimum temperature is 55° C., the reaction is preferably carried out under conditions of pH 7-9 and 30-65° C. temperature.

[0092]A method for production of an optically active aminoalcohol according to the invention is also characterized by producing an optically active aminoalcohol compound represented by the following general formula (2), which compound exhibits the desired optical activity, by reacting a transformant of the invention with an enantiomeric mixture of an α-aminoketone compound represented by the following general formula (1) or a salt thereof.

##STR00010##

[0093]As the reaction conditions for the reaction described above, for example, transformants shake-cultured in liquid medium may be collected, an aqueous aminoketone solution (0.1-10% concentration) added to the obtained cells, and the reaction conducted at a temperature of 20-40° C. for a period of several hours to one day while regulating the pH to between 6 and 8. Upon completion of the reaction, the cells may be separated and the product in the reaction solution isolated to obtain an optically active aminoalcohol. The reaction may be conducted in the same manner for treated transformant cells (dry cells or immobilized cells) or the enzyme or immobilized enzyme obtained from the transformants.

[0094]In the production method for an optically active aminoalcohol of the invention, the reaction may be carried out with further addition of a compound represented by the following general formula (3) or a pharmaceutically acceptable salt or solvate thereof, for more efficient production of the optically active aminoalcohol.

##STR00011##

(wherein A represents the following formula (Y) or (Z))

##STR00012##

(wherein R4 represents hydrogen, optionally substituted C1-3 alkyl, a C5-10 hydrocarbon ring which is bonded to R8 or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R8)

##STR00013##

(wherein R5 represents hydrogen, C1-3 alkyl or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R6 or R9, R6 represents hydrogen, optionally substituted C1-3 alkyl, a C5-10 hydrocarbon ring which is bonded to R8 or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R5 or R9, and R7 represents hydrogen or optionally substituted C1-6 alkyl); R8 represents hydrogen, carboxyl, optionally substituted C1-6 alkyl, a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R4 or a C5-10 hydrocarbon ring which is bonded to R6; R9 represents hydrogen, optionally substituted C1-6 alkyl, optionally substituted C1-6 alkyloxycarbonyl, optionally substituted acyl or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R5 or R6; and R10 represents hydrogen or optionally substituted C1-6 alkyl)

[0095]In general formula (3) above, C1-3 alkyl may be straight-chain or branched, and specifically there may be mentioned methyl, ethyl, n-propyl, isopropyl and the like. C1-6 alkyl may be straight-chain or branched, and specifically there may be mentioned methyl, ethyl, n-propyl, isopropyl, n-butyl, i-butyl, s-butyl, t-butyl, pentyl, hexyl and the like. As C5-10 hydrocarbon rings there may be mentioned cyclopentyl, cyclohexyl, cycloheptyl, cyclooctyl, cyclononyl, cyclodecanyl and the like.

[0096]As heteroatoms for the 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms there may be mentioned nitrogen, oxygen, sulfur and the like, among which nitrogen and oxygen are particularly preferred, and as 5- to 8-membered heterocyclic skeletons there may be mentioned pyrrolidine, piperidine, imidazolidine, piperazine, tetrahydrofuran, tetrahydropyran, tetrahydrothiophene, morpholine and the like.

[0097]As C1-6 alkyloxycarbonyl there may be mentioned methyloxycarbonyl, ethyloxycarbonyl, isopropyloxycarbonyl, isobutyloxycarbonyl, t-butyloxycarbonyl and the like. As acyl there may be mentioned formyl, acetyl, propionyl, butyryl, isobutyryl, pivaloyl, benzoyl, valeryl and the like. When the aforementioned C1-3 or C1-6 alkyl, C1-6 alkyloxycarbonyl or acyl have substituents there are no particular restrictions on the types, positions and numbers of substituents, and as examples of substituents there may be mentioned halogen such as fluorine and chlorine, hydroxyl, alkyl, carboxyl, amino, alkoxy, nitro, aryl and the like. As pharmaceutically acceptable salts there may be mentioned salts of inorganic acids such as hydrochloric acid, sulfuric acid, nitric acid and phosphoric acid, salts of organic acids such as acetic acid and citric acid, salts of inorganic bases such as Na, K, Mg, Ca and ammonia, and salts of organic bases such as triethylamine and cyclohexylamine.

[0098]As examples of compounds represented by general formula (3) above there may be mentioned 1-acetylamino-2-hydroxypropane, 1-methylamino-2-hydroxypropane, 1-amino-2-oxopropane, 1-amino-2-hydroxycyclopentane, 1-amino-2,3-dihydroxypropane, L-threonine, 4-amino-3-hydroxybutanoic acid, 1-amino-2-oxocyclohexane, morpholine, 3-hydroxypyrrolidine, 3-hydroxypiperidine, 2-aminomethyl-tetrahydrofuran, 1-(2-hydroxypropyl)amino-2-hydroxypropane, 1-t-butyloxycarbonylamino-2-hydroxypropane, 2-amino-3-hydroxybutane, DL-serine, 1-amino-2-hydroxypropane, 1-amino-2-hydroxybutane and 1-amino-2-hydroxycyclohexane. Compounds among these having asymmetric carbons may be optically active forms or racemic forms, unless otherwise specified.

[0099]Addition of such activity inducers to the medium can induce cellular activity and thus more efficiently promote production of the optically active β-aminoalcohol than when no such activity inducers are added. The activity inducers may be used alone, or several such activity inducers may be used in admixture. The amount of such activity inducers is preferably 0.01-10 wt % with respect to the medium.

[0100]The reaction method for production of the β-aminoalcohol of the invention is not particularly restricted so long as it is a method in which the cells or the cell-produced enzyme is reacted with an enantiomeric mixture of an α-aminoketone compound represented by general formula (1) above or its salt, to produce the corresponding optically active β-aminoalcohol compound represented by general formula (2), and the reaction is initiated by mixing the cells rinsed with buffer solution or water with the α-aminoketone aqueous solution used as the starting material.

[0101]The reaction conditions may be selected within a range that does not impede production of the optically active β-aminoalcohol compound represented by general formula (2). The cell volume is preferably 1/100 to 1000-fold and more preferably 1/10 to 100-fold in terms of dry weight with respect to the racemic aminoketone. The concentration of the racemic aminoketone substrate is preferably 0.01-20% and more preferably 0.1-10%. The pH of the reaction solution is preferably 5-9 and more preferably 6-8, and the reaction temperature is preferably 10-50° C. and more preferably 20-40° C. The reaction time is preferably 5-150 hours, but this may be appropriately determined depending on the cell type.
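The preferred ranges in the preceding paragraph can be collected into a small check, for example when planning a reaction. The following is a minimal Python sketch; the names `PREFERRED`, `MORE_PREFERRED` and `check_conditions` are hypothetical, and only the numeric ranges come from the text.

```python
# Preferred and more preferred ranges taken from paragraph [0101].
# All names here are illustrative and are not part of the original disclosure.
PREFERRED = {
    "cell_to_substrate_ratio": (1 / 100, 1000),  # dry cell weight / racemic aminoketone
    "substrate_percent": (0.01, 20),
    "ph": (5, 9),
    "temperature_c": (10, 50),
    "time_h": (5, 150),
}
MORE_PREFERRED = {
    "cell_to_substrate_ratio": (1 / 10, 100),
    "substrate_percent": (0.1, 10),
    "ph": (6, 8),
    "temperature_c": (20, 40),
}


def check_conditions(conditions):
    """Classify each supplied parameter as 'more preferred', 'preferred' or 'outside'."""
    report = {}
    for name, value in conditions.items():
        low, high = PREFERRED[name]
        if not low <= value <= high:
            report[name] = "outside"
            continue
        narrow = MORE_PREFERRED.get(name)
        if narrow and narrow[0] <= value <= narrow[1]:
            report[name] = "more preferred"
        else:
            report[name] = "preferred"
    return report


example = check_conditions(
    {"cell_to_substrate_ratio": 5, "substrate_percent": 1, "ph": 7,
     "temperature_c": 30, "time_h": 48}
)
```

The reaction time has no narrower range in the text, so it can only be reported as "preferred" or "outside".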

[0102]In order to more efficiently promote the reaction, there may be added sugars such as glucose, organic acids such as acetic acid and energy sources such as glycerol. These may be used alone or as mixtures. The amount of addition is preferably 1/100 to 10-fold with respect to the substrate. Coenzymes and the like may also be added. As coenzymes there may be used nicotinamide adenine dinucleotide (NAD), reduced nicotinamide adenine dinucleotide (NADH), nicotinamide adenine dinucleotide phosphate (NADP), reduced nicotinamide adenine dinucleotide phosphate (NADPH) and the like, either alone or in mixtures, added in amounts of preferably 1/1000 to 1/5 with respect to the racemic aminoketone. In addition to such coenzymes, there may be added coenzyme-regenerating enzymes such as glucose dehydrogenase, in amounts of 1/1000 to 1/5 with respect to the racemic aminoketone. Also, substrates for coenzyme-regenerating enzymes, such as glucose, may be added, in amounts of 1/100 to 10-fold with respect to the racemic aminoketone. There may also be used combinations of sugars such as glucose, organic acids such as acetic acid, energy sources such as glycerol, coenzymes, coenzyme-regenerating enzymes and coenzyme-regenerating enzyme substrates. These usually accumulate in the cells but if necessary they may be added to increase the reaction speed or yield, and therefore may be added as appropriate.

[0103]If the reaction solution is reacted with addition of the specific salts described above under the aforementioned conditions, racemization of the unreacted α-aminoketone isomers will be aided, thus more efficiently promoting conversion to the enantiomer which will serve as the substrate of the cells or cell-produced enzyme. This will tend to yield the target aminoalcohol from the starting material at a high yield of 50% or greater.

[0104]As salts that promote racemization of unreacted α-aminoketones there may be used weak acid salts such as acetate, tartrate, benzoate, citrate, malonate, phosphate, carbonate, paranitrophenol salt, sulfite and borate, but there are preferably used phosphate (for example, sodium dihydrogen phosphate, potassium dihydrogen phosphate, ammonium dihydrogen phosphate), carbonate (for example, sodium carbonate, sodium hydrogen carbonate, potassium carbonate, ammonium carbonate) and citrate (for example, sodium citrate, potassium citrate, ammonium citrate). Mixtures thereof may also be used, with a buffer solution with a pH of 6.0-8.0 added to a final concentration of preferably 0.01-1 M. In the case of a phosphate, for example, sodium dihydrogen phosphate and sodium monohydrogen phosphate may be mixed in a proportion of between 9:1 and 5:95.
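As a rough illustration of how the phosphate mixing proportion relates to buffer pH, the following Python sketch applies the Henderson-Hasselbalch relation. The pKa value of 7.2 and the function name `estimate_ph` are assumptions introduced here, not values from the text; real buffers deviate from this ideal estimate with ionic strength and temperature, so the result is only a starting point for adjustment with a pH meter.

```python
import math

# Assumption: second dissociation constant of phosphoric acid, pKa2 of about 7.2
# at 25 degrees C in dilute solution. The effective value shifts with ionic
# strength and temperature.
PKA2_PHOSPHATE = 7.2


def estimate_ph(acid_parts, base_parts, pka=PKA2_PHOSPHATE):
    """Henderson-Hasselbalch estimate for a dihydrogen/monohydrogen phosphate mixture."""
    return pka + math.log10(base_parts / acid_parts)


# The two ends of the mixing range given in the text
# (sodium dihydrogen phosphate : sodium monohydrogen phosphate).
ph_acidic_end = estimate_ph(9, 1)
ph_basic_end = estimate_ph(5, 95)
```

Under these idealized assumptions the 9:1 mixture comes out near pH 6.2 and the 5:95 mixture near pH 8.5.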

[0105]The optically active β-aminoalcohol produced by the reaction may be purified by ordinary separation and purification means. For example, the optically active β-aminoalcohol may be obtained directly from the reaction solution or after separation of the cells, by being subjected to a common purification process such as membrane separation, extraction with an organic solvent (for example, toluene, chloroform, etc.), column chromatography, vacuum concentration, distillation, crystallization, recrystallization or the like. The optical purity of the produced optically active β-aminoalcohol can be measured by high performance liquid chromatography (HPLC).

EXAMPLES

[0106]The present invention will now be explained in greater detail through examples, with the understanding that these examples in no way limit the technical scope of the invention.

Example 1

Isolation and Purification of Plasmids

[0107](1) Method

[0108]Rhodococcus strains were inoculated to 5 mL of GPY medium (1% glucose, 0.5% bactopeptone, 0.3% yeast extract) and cultured with shaking at 25° C. After adding 250 μL of a 100 mg/mL ampicillin solution in the logarithmic growth phase, culturing was continued at 25° C. for 2 hours with shaking. The cells were harvested by centrifugation (12 krpm, 5 min), and after removing the supernatant, they were suspended in 1 mL of 50 mM Tris (pH 7.5); the cells were again harvested by centrifugation (12 krpm, 5 min) and the supernatant was removed. They were then suspended in 250 μL of a 10 mg/mL lysozyme solution dissolved in TE solution (10 mM Tris (pH 7.5), 1 mM EDTA), and the suspension was allowed to stand at 37° C. for 30 minutes. Next, 100 μL of 3 M sodium chloride and 25 μL of 10% SDS were added and the mixture was allowed to stand at -20° C. overnight. To the supernatant from centrifugation (12 krpm, 5 min) there were added 0.5 μL each of 50 μg/mL Proteinase K and 50 μg/mL RNase A, and the mixture was allowed to stand at 37° C. for 15 minutes. An equal volume of phenol/chloroform/isoamyl alcohol solution was added and centrifugation was performed (12 krpm, 5 min). A 2.5-fold volume of ethanol was added to the supernatant, the mixture was centrifuged (12 krpm, 5 min), and the precipitate was dissolved in 50 μL of sterilized water. Confirmation of plasmids was accomplished by electrophoresis with 0.8% agarose gel and staining with ethidium bromide, followed by UV irradiation.

[0109](2) Test Bacteria Strains and Results

[0110]Throughout the examples, available strains belonging to the genus Rhodococcus and its related genus Mycobacterium were screened for the presence or absence of plasmids, following the method described in (1) above.

[0111]Table 3 shows the screened strains confirmed to contain plasmids. Specifically, Rhodococcus erythropolis (IAM1400, IAM1503, JCM2893, JCM2894 and JCM2895) and Rhodococcus rhodnii (JCM3203) were confirmed to contain plasmids of approximately 5.4 kbp and 5.8 kbp, respectively. These plasmids were designated according to the names listed in Table 3: pRET1100, pRET1200, pRET1300, pRET1400, pRET1500, pRET1600, pRET1700, pRET1800, pRET0500 and pRET1000.

[0112]R. erythropolis IAM1400 and IAM1503 are described in "IAM Catalogue of Strains, Third Edition, 2004" published by the Institute of Molecular and Cellular Biosciences, The University of Tokyo, and are available from the institute. Also, R. erythropolis JCM2893, JCM2894 and JCM2895 and R. rhodnii JCM3203 are described in "JCM Catalogue of Strains, Eighth Edition 2002" published by RIKEN, Japan, and are available from the institute.

TABLE 3

Strain                     No.        Size (kbp)   Name
Rhodococcus erythropolis   IAM 1400   5.4          pRET1100
                                      5.4          pRET1200
Rhodococcus erythropolis   IAM 1503   5.4          pRET1300
                                      5.4          pRET1400
Rhodococcus erythropolis   JCM 2893   5.4          pRET1500
                                      5.4          pRET1600
Rhodococcus erythropolis   JCM 2894   5.4          pRET1700
                                      5.4          pRET1800
Rhodococcus erythropolis   JCM 2895   5.4          pRET0500
Rhodococcus rhodnii        JCM 3203   5.8          pRET1000

Example 2

Identification of Restriction Endonuclease Sites

[0113]Various restriction endonucleases were used to determine restriction endonuclease sites, for classification of the plasmids shown in Table 3. Each plasmid was isolated by the method described in Example 1, and then digested with EcoR I, Hind III, Pvu II, Sca I, Sph I, Sma I, Sac I, BamH I and Kpn I, and electrophoresed on 0.8% agarose gel for confirmation of the DNA fragments. The size marker used was Loading Quick DNA size Marker λ/EcoR I+Hind III double digest (Toyobo). The numbers of sites cleaved by the restriction endonucleases and the sizes of the fragments were determined based on the size marker. The results are shown in Table 4.

TABLE 4

Restriction      R. erythropolis IAM 1400           R. rhodnii JCM 3203
endonuclease     pRET1100         pRET1200          pRET1000
BamH I           2 (0.4, 5.0)     1 (5.4)           2 (2.0, 3.8)
EcoR I           2 (0.3, 5.1)     1 (5.4)           0
Hind III         0                0                 0
Kpn I            1 (5.4)          0                 0
Pvu II           1 (5.4)          2 (0.9, 4.5)      4 (0.1, 1.4, 2.0, 2.3)
Sac I            1 (5.4)          1 (5.4)           3 (0.9, 1.0, 3.9)
Sca I            0                0                 0
Sph I            0                0                 0
Sma I            1 (5.4)          2 (0.4, 0.5)      4 (0.1, 1.2, 1.6, 2.9)

Each entry gives the number of cleavage sites; values in parentheses indicate fragment sizes (kbp).

Plasmid     Strain                      Restriction pattern
pRET1300    R. erythropolis IAM 1503    same as pRET1100
pRET1400    R. erythropolis IAM 1503    same as pRET1200
pRET1500    R. erythropolis JCM 2893    same as pRET1100
pRET1600    R. erythropolis JCM 2893    same as pRET1200
pRET1700    R. erythropolis JCM 2894    same as pRET1100
pRET1800    R. erythropolis JCM 2894    same as pRET1200
pRET0500    R. erythropolis JCM 2895    same as pRET1200

[0114]Based on the analysis results shown above, the plasmids in Table 3 were classified into three types: plasmids possessing the same restriction endonuclease sites as pRET1100, plasmids possessing the same restriction endonuclease sites as pRET1200, and pRET1000.
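For a circular plasmid, the fragments produced by a complete digest should add up to the plasmid size, which gives a quick consistency check on the values in Table 4. The following Python sketch performs that check on the multi-fragment entries; the names `PLASMID_SIZE`, `DIGESTS` and `inconsistent_digests`, and the 0.2 kbp tolerance for gel-estimated sizes, are assumptions introduced for illustration.

```python
# Fragment sizes (kbp) as printed in Table 4; plasmid sizes (kbp) from Table 3.
PLASMID_SIZE = {"pRET1100": 5.4, "pRET1200": 5.4, "pRET1000": 5.8}
DIGESTS = {
    ("pRET1100", "BamH I"): [0.4, 5.0],
    ("pRET1100", "EcoR I"): [0.3, 5.1],
    ("pRET1200", "Pvu II"): [0.9, 4.5],
    ("pRET1200", "Sma I"): [0.4, 0.5],
    ("pRET1000", "BamH I"): [2.0, 3.8],
    ("pRET1000", "Pvu II"): [0.1, 1.4, 2.0, 2.3],
    ("pRET1000", "Sac I"): [0.9, 1.0, 3.9],
    ("pRET1000", "Sma I"): [0.1, 1.2, 1.6, 2.9],
}


def inconsistent_digests(tolerance_kbp=0.2):
    """Return the digests whose fragment sizes do not add up to the plasmid size."""
    flagged = []
    for (plasmid, enzyme), fragments in DIGESTS.items():
        if abs(sum(fragments) - PLASMID_SIZE[plasmid]) > tolerance_kbp:
            flagged.append((plasmid, enzyme))
    return flagged


flagged = inconsistent_digests()
```

With the values as printed, the only entry flagged is the Sma I digest of pRET1200 (0.4 and 0.5 kbp against a 5.4 kbp plasmid). The text does not indicate whether this reflects a misprint or an unresolved fragment.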

Example 3

Plasmid Sequencing and Homology Search

[0115]As the plasmids were classified into three types, i.e. pRET1000, pRET1100 and pRET1200 based on the results of Example 2, it was attempted to sequence each of the plasmids.

[0116]First, the DNA fragments of the plasmids were cloned for determination of the nucleotide sequences. For Rhodococcus erythropolis (IAM1400), the plasmids (pRET1100, pRET1200) were isolated and digested with Sma I and Sac I. Upon electrophoresis on 0.8% agarose gel, DNA fragments with sizes of approximately 0.5 kbp, approximately 1.7 kbp, approximately 3.7 kbp and approximately 4.9 kbp were confirmed. The respective DNA fragments were recovered from the agarose gel using a GFX® PCR DNA and Gel Band Purification Kit (Amersham Bioscience) and used as insert DNA. Separately, pBluescript II KS(-) was used after digesting with Sma I alone or with Sma I and Sac I, as vector DNA. The insert DNA and vector DNA were ligated with Ligation High (Toyobo) and used to transform E. coli JM109. The obtained transformants were screened using a GFX Micro Plasmid Prep Kit (Amersham Bioscience) to obtain different clones.

[0117]For Rhodococcus rhodnii (JCM3203), the plasmid (pRET1000) was isolated and then digested with BamH I. Upon electrophoresis on 0.8% agarose gel, DNA fragments with sizes of approximately 2.0 kbp and approximately 3.8 kbp were confirmed. The respective DNA fragments were recovered from the gel using the aforementioned Kit and used as insert DNA. The vector DNA used was pBluescript II KS(-) digested with BamH I.

[0118]Determination of the nucleotide sequences of the plasmid inserts was accomplished by the primer walking method. The apparatus used was an ABI PRISM® 310NT Genetic Analyzer, and the enzyme used was a BigDye Terminator v3.1 Cycle Sequencing Kit (ABI).

[0119]First, P7 (M13 forward, Toyobo) and P8 (M13 reverse, Toyobo) primers were used for partial decoding of the insert nucleotide sequences. Next, primers were designed within the decoded sequence (using the sequence analyzing software DNASIS Pro; Hitachi Software Corp.), and the designed primers (synthetic oligo DNA) were used for further decoding of the nucleotide sequence. This procedure was repeated until decoding of the entirety of each insert nucleotide sequence. Upon completion of the insert nucleotide sequence decoding, primers were designed for reaction from the ends of each insert to the vector direction in order to analyze how the inserts were linked, and PCR was conducted (using KOD-plus-), using the plasmid isolated from Rhodococcus erythropolis (IAM1400) as template. The PCR product was purified using a GFX® PCR DNA and Gel Band Purification Kit, and sequencing was carried out using the same primers used for PCR, to analyze the arrangement of the inserts.

[0120]The results of sequencing showed that pRET1100 consisted of 5444 bp, with a G+C content of 59%. The full determined nucleotide sequence of pRET1100 is set forth as SEQ ID NO: 73 of the Sequence Listing. Plasmid pRET1200 consisted of 5421 bp and had a G+C content of 62%. Plasmid pRET1000 consisted of 5813 bp and had a G+C content of 67%. The full determined nucleotide sequence of pRET1000 is set forth as SEQ ID NO: 74 of the Sequence Listing.
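The G+C content quoted for each plasmid is the fraction of G and C bases in the determined sequence. A minimal Python sketch of that calculation follows; the function name `gc_content` and the toy sequence are illustrative only and are not taken from the Sequence Listing.

```python
def gc_content(sequence):
    """Return the G+C fraction of a DNA sequence, ignoring case and non-ACGT characters."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Toy sequence for illustration only; it is not taken from SEQ ID NO: 73 or 74.
toy = "GGCCATGCGC"
toy_gc_percent = round(100 * gc_content(toy))
```

Applied to the full plasmid sequences, the same function would be expected to reproduce the percentages stated above, provided the sequences are supplied as plain strings.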

[0121]A homology search for the determined nucleotide sequences using DNASIS Pro revealed that pRET1000 and pRET1100 were novel plasmids. On the other hand, pRET1200 had approximately 99.6% homology with pN30 (GenBank accession no. AF312210) (calculated based on pRET1200).

[0122]For pRET1000 and pRET1100, comparison was made with publicly known plasmids based on the determined nucleotide sequences, using DNASIS Pro. As a result, neither plasmid was found to have restriction endonuclease sites completely matching those of any other plasmid.

Example 4

Nucleotide Sequence Analysis

[0123]The results of analysis of the nucleotide sequences of pRET1100 and pRET1000 are shown below.

[0124]The following orfs were found in pRET1100:

[0125]orf1 (SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3) consisting of the nucleotide sequence from bases 202, 238 or 337 to 480 of the nucleotide sequence set forth as SEQ ID NO: 73;

[0126]orf2 (SEQ ID NO: 4) consisting of the nucleotide sequence from bases 477 to 758 of the nucleotide sequence set forth as SEQ ID NO: 73;

[0127]orf3 (SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15 or SEQ ID NO: 16) consisting of the nucleotide sequence from bases 862, 1294, 1450, 1462, 1486, 1489, 1513, 1630, 1645, 1687, 2224 or 2227 to 2409 of the nucleotide sequence set forth as SEQ ID NO: 73;

[0128]orf4 (SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20 or SEQ ID NO: 21) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 1875, 1734, 1701, 1674 or 1581 to 1444 of the nucleotide sequence set forth as SEQ ID NO: 73;

[0129]orf5 (SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25 or SEQ ID NO: 26) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 2828, 2792, 2747, 2594 or 2540 to 2406 of the nucleotide sequence set forth as SEQ ID NO: 73;

[0130]orf6 (SEQ ID NO: 27 or SEQ ID NO: 28) consisting of the nucleotide sequence from bases 2971 or 3049 to 3306 of the nucleotide sequence set forth as SEQ ID NO: 73;

[0131]orf7 (SEQ ID NO: 29 or SEQ ID NO: 30) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 3577 or 3571 to 3053 of the nucleotide sequence set forth as SEQ ID NO: 73;

[0132]orf8 (SEQ ID NO: 31 or SEQ ID NO: 32) consisting of the nucleotide sequence from bases 3339 or 3648 to 3902 of the nucleotide sequence set forth as SEQ ID NO: 73; and

[0133]orf9 (SEQ ID NO: 33 or SEQ ID NO: 34) consisting of the nucleotide sequence from bases 4366 or 4477 to 5034 of the nucleotide sequence set forth as SEQ ID NO: 73.

[0134]The following orfs were found in pRET1000:

[0135]orf10 (SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40 or SEQ ID NO: 41) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 3350, 3251, 2945 or 2849 to 2412 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0136]orf11 (SEQ ID NO: 42 or SEQ ID NO: 43) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 2365 or 2332 to 2159 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0137]orf12 (SEQ ID NO: 44) consisting of the nucleotide sequence from bases 3197 to 3526 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0138]orf13 (SEQ ID NO: 45 or SEQ ID NO: 46) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 4035 or 3996 to 3679 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0139]orf14 (SEQ ID NO: 48, SEQ ID NO: 49 or SEQ ID NO: 50) consisting of the nucleotide sequence from bases 4621, 4654 or 4666 to 4830 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0140]orf15 (SEQ ID NO: 51 or SEQ ID NO: 52) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 5161 or 5062 to 4709 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0141]orf16 (SEQ ID NO: 53 or SEQ ID NO: 54) consisting of the nucleotide sequence from bases 2331 or 2334 to 2618 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0142]orf17 (SEQ ID NO: 55) consisting of the nucleotide sequence from bases 2907 to 3242 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0143]orf18 (SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59 or SEQ ID NO: 60) consisting of the nucleotide sequence from bases 1650, 1689, 1713, 1827 or 1875 to 2162 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0144]orf19 (SEQ ID NO: 61) consisting of the nucleotide sequence from bases 1906 to 2169 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0145]orf20 (SEQ ID NO: 62) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 810 to 553 of the nucleotide sequence set forth as SEQ ID NO: 74;

[0146]orf21 (SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, SEQ ID NO: 68 or SEQ ID NO: 69) consisting of the nucleotide sequence from bases 117, 147, 306, 456, 5144, 5276 or 5534 to 656 of the nucleotide sequence set forth as SEQ ID NO: 74.
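Because the plasmids are circular, some of the regions listed above run past the end of the numbered sequence (for example, bases 5144 to 656 of the 5813 bp pRET1000), and others are defined on the complementary strand. The following Python sketch shows one way such coordinates could be handled; the function names `region_length` and `extract_region` are hypothetical and the 1-based inclusive coordinate convention is an assumption.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def region_length(start, end, plasmid_length):
    """Length of the 1-based inclusive region start..end; wraps past the origin if end < start."""
    if start <= end:
        return end - start + 1
    return (plasmid_length - start + 1) + end


def extract_region(sequence, start, end, complementary=False):
    """Extract a 1-based inclusive region from a circular sequence.

    If end < start the region wraps past the origin. With complementary=True
    the reverse complement of the region is returned.
    """
    if start <= end:
        region = sequence[start - 1:end]
    else:
        region = sequence[start - 1:] + sequence[:end]
    if complementary:
        region = region.translate(COMPLEMENT)[::-1]
    return region


# Coordinates from the text: the orf21 variant running from base 5144 to base 656
# of the 5813 bp plasmid pRET1000.
orf21_length = region_length(5144, 656, 5813)
```

For an orf described as complementary to "bases 1875 to 1444", the lower coordinate would be passed as `start` together with `complementary=True`, for example `extract_region(seq, 1444, 1875, complementary=True)`; otherwise the call would be interpreted as a region wrapping past the origin.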

[0147]The DNA replication region of pRET1100 is the region represented by the nucleotide sequence set forth as SEQ ID NO: 35 (from bases 2410 to 3200), the nucleotide sequence set forth as SEQ ID NO: 36 (from bases 1000 to 1500) or the nucleotide sequence set forth as SEQ ID NO: 37 (from bases 5000 to 500). The DNA replication region of pRET1000 is the region represented by the nucleotide sequence set forth as SEQ ID NO: 70 (from bases 3355 to 3507), the nucleotide sequence set forth as SEQ ID NO: 71 (from bases 4290 to 4350) or the nucleotide sequence set forth as SEQ ID NO: 72 (from bases 3570 to 3894).

[0148]The region of the nucleotide sequence from bases 5144 to 656 (SEQ ID NO: 67) and the region of the nucleotide sequence from bases 4381 to 4830 (SEQ ID NO: 47) of the nucleotide sequence of pRET1000 (SEQ ID NO: 74) are homologous with mobilization proteins, suggesting that they are involved in mobilization.

[0149]A DNA secondary structure is predicted for the region of the nucleotide sequence from bases 4260 to 4339 (SEQ ID NO: 75) of the nucleotide sequence of pRET1000 (SEQ ID NO: 74), and it is presumably involved in expression of the mobilization protein gene or is the recognition site of the expressed protein.

[0150]On the other hand, it was suggested that the region of the nucleotide sequence from bases 761 to 868 (SEQ ID NO: 76) of the nucleotide sequence of pRET1100 (SEQ ID NO: 73) is a promoter involved in expression of a protein related to replication.

Example 5

Construction of Shuttle Vectors

[0151]For construction of a shuttle vector between Rhodococcus strains and E. coli, the Rhodococcus plasmids pRET1000, pRET1100 and pRET1200 and the E. coli plasmids pUC18, pHSG299 and pHSG398 were used for the following experiment.

[0152]First, DNA fragments were prepared from R. erythropolis plasmids. Specifically, plasmids pRET1100 and pRET1200 were obtained from R. erythropolis (IAM1400), and then Alw44 I was used for digestion of pRET1100 at 37° C. for 2 hours and Blunting High (Toyobo) was used for blunting of the ends, while BspLU11 I was used for digestion of pRET1200 at 48° C. for 2 hours and Blunting High (Toyobo) was used for blunting of the ends, to obtain DNA fragments of R. erythropolis plasmid. Each of the DNA fragments was dissolved in TE solution.

[0153]For pRET1000, plasmid pRET1000 was obtained from R. rhodnii (JCM3203), and then Drd I was used for digestion of pRET1000 at 37° C. for 2 hours and Blunting High was used for blunting of the ends, to obtain pRET1000 DNA fragments, which were dissolved in TE solution.

[0154]Next, DNA fragments were prepared from the E. coli plasmids. Specifically, pUC18 (containing the ampicillin-resistance gene (Ampr)) was digested with Sma I at 30° C. for 2 hours, and pHSG299 (containing the kanamycin-resistance gene (Kmr)) and pHSG398 (containing the chloramphenicol-resistance gene (Cmr)) were digested with Hinc II at 37° C. for 2 hours to obtain DNA fragments of E. coli plasmid, which were dissolved in TE.

[0155]After ligating the DNA fragments from the Rhodococcus and E. coli plasmids prepared in the manner described above, they were used for transformation of E. coli DH5α, which was plated on LB (1% tryptone, 0.5% yeast extract, 1% sodium chloride; pH 7.2) agar medium containing 100 μg/mL kanamycin, 100 μg/mL ampicillin or 30 μg/mL chloramphenicol, coated with 30 μL of 0.1 M IPTG (isopropyl-β-D-thiogalactopyranoside) and 4% X-gal (5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside), and allowed to stand at 30° C. for 60 hours. White colonies were selected from among the appearing colonies, and were cultured with shaking in LB liquid medium containing 100 μg/mL kanamycin, 100 μg/mL ampicillin or 30 μg/mL chloramphenicol, at 30° C. for 60 hours. The DNA was purified from the obtained culture solution using a GFX® Micro Plasmid Prep Kit (Amersham Bioscience, with purification under the manufacturer's specified conditions). The obtained DNA was confirmed by electrophoresis on 0.8% agarose gel. The obtained shuttle vectors are shown in Table 5, and the methods for constructing each of the shuttle vectors using pRET1100 are shown in FIGS. 3 to 5.

TABLE 5

Constructed shuttle vectors   Origin (Rhodococcus)   Origin (E. coli)
pRET1001, pRET1001Rv          pRET1000               pUC18
pRET1002, pRET1002Rv          pRET1000               pHSG299
pRET1003, pRET1003Rv          pRET1000               pHSG398
pRET1101, pRET1101Rv          pRET1100               pUC18
pRET1102, pRET1102Rv          pRET1100               pHSG299
pRET1103, pRET1103Rv          pRET1100               pHSG398
pRET1201, pRET1201Rv          pRET1200               pUC18
pRET1202, pRET1202Rv          pRET1200               pHSG299
pRET1203, pRET1203Rv          pRET1200               pHSG398

[0156]The shuttle vectors constructed with pRET1100 and pUC18, pHSG299 or pHSG398 were designated as pRET1101 (SEQ ID NO: 89), pRET1102 (SEQ ID NO: 90) and pRET1103 (SEQ ID NO: 91), respectively. Of these shuttle vectors, pRET1101 exhibits ampicillin resistance, pRET1102 exhibits kanamycin resistance and pRET1103 exhibits chloramphenicol resistance. Also, the shuttle vectors corresponding to pRET1101 to pRET1103 in which the E. coli-derived fragment and pRET1100 were linked in the reverse orientation (Rv) were designated as pRET1101Rv (SEQ ID NO: 92), pRET1102Rv (SEQ ID NO: 93) and pRET1103Rv (SEQ ID NO: 94), respectively.

[0157]Similarly, the shuttle vectors constructed using pRET1000 and pRET1200 were designated as pRET1001-pRET1003 (SEQ ID NO: 95-SEQ ID NO: 97) and pRET1001Rv-pRET1003Rv (SEQ ID NO: 98-SEQ ID NO: 100), and as pRET1201-pRET1203 and pRET1201Rv-pRET1203Rv (Table 5).
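The shuttle vector names in Table 5 follow a regular pattern: the last digit of the Rhodococcus plasmid name is replaced by 1, 2 or 3 according to the E. coli plasmid used, and "Rv" is appended for the reverse orientation. The following Python sketch reproduces that pattern; the names `ECOLI_BACKBONES`, `shuttle_vector_name` and `resistance_marker` are hypothetical helpers introduced here.

```python
# E. coli plasmid -> (final digit of the shuttle vector name, resistance marker),
# following Table 5 and the resistance genes named for each E. coli plasmid.
ECOLI_BACKBONES = {
    "pUC18": (1, "ampicillin"),
    "pHSG299": (2, "kanamycin"),
    "pHSG398": (3, "chloramphenicol"),
}


def shuttle_vector_name(rhodococcus_plasmid, ecoli_plasmid, reverse=False):
    """Build the Table 5 name, e.g. pRET1100 + pHSG299 -> pRET1102."""
    digit, _marker = ECOLI_BACKBONES[ecoli_plasmid]
    name = rhodococcus_plasmid[:-1] + str(digit)
    return name + "Rv" if reverse else name


def resistance_marker(ecoli_plasmid):
    """Return the antibiotic resistance carried by the E. coli plasmid."""
    return ECOLI_BACKBONES[ecoli_plasmid][1]


all_names = [
    shuttle_vector_name(rhodo, ecoli, reverse)
    for rhodo in ("pRET1000", "pRET1100", "pRET1200")
    for ecoli in ECOLI_BACKBONES
    for reverse in (False, True)
]
```

The list `all_names` contains the 18 vector names of Table 5, in the same row order.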

Example 6

Examining Method of Transformation to R. erythropolis

[0158]The Rhodococcus-E. coli shuttle vectors obtained in Example 5 were used for transformation of R. erythropolis MAK-34 strain (MAK-34; deposited at the National Institute of Bioscience and Human-Technology, National Institute of Advanced Industrial Science and Technology, Ministry of Economy, Trade and Industry, (currently: International Patent Organism Depositary, National Institute of Advanced Industrial Science and Technology) on Feb. 15, 2001 as FERM BP-7451). Electroporation was investigated as the method of gene transfer.

[0159]First, R. erythropolis MAK-34 strain was inoculated to 5 mL of GPY medium and cultured with shaking at 30° C. for 36 hours. After seeding 1 mL of culture solution in 100 mL of LB medium, culturing was continued at 200 rpm at 30° C. for 10 hours. The cultured cells were harvested by centrifugation (12 krpm, 5 min, 4° C.) and the harvested cells were rinsed twice with ultrapure water. The rinsed cells were harvested by centrifugation (12 krpm, 5 min, 4° C.) and suspended in 2.4 mL of a 10% glycerol solution. The suspension was dispensed into 300 μL portions and frozen at -80° C. as competent cells.

[0160]A 90 μL portion of the prepared competent cells and a 5 μL portion of the shuttle vector (pRET1001, pRET1002, pRET1003, pRET1101, pRET1102, pRET1103, pRET1201, pRET1202 or pRET1203) were mixed on ice. The mixed solution was gently poured into a 0.1 cm cuvette which had been cooled on ice, and was set in a Gene Pulser II Electroporation System (BIO-RAD). After pulsing at 20 kV/cm, 400 Ω, 25 μF, 300 μL of LB medium was immediately added to the mixed solution, which was then allowed to stand at 25° C. for 3 hours.
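The pulse settings can be converted into the quantities usually set on the instrument. The following Python sketch works through the arithmetic, taking the capacitance setting as 25 μF; the variable names are illustrative, and the time constant shown is the nominal RC value of the settings, which ignores the resistance of the sample itself.

```python
# Settings from paragraph [0160]; the capacitance is taken as 25 microfarads.
field_strength_kv_per_cm = 20.0
cuvette_gap_cm = 0.1
resistance_ohm = 400.0
capacitance_farad = 25e-6

# Voltage across the cuvette = field strength x electrode gap.
voltage_kv = field_strength_kv_per_cm * cuvette_gap_cm

# Nominal RC time constant of the discharge, in milliseconds.
time_constant_ms = resistance_ohm * capacitance_farad * 1e3
```

Under these assumptions the settings correspond to about 2.0 kV across the 0.1 cm gap and a nominal time constant of about 10 ms.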

[0161]A portion of the cell suspension was plated on an antibiotic-containing LB plate (100 μg/mL kanamycin, 100 μg/mL ampicillin or 30 μg/mL chloramphenicol). As a result, colonies were obtained when using pRET1002, pRET1102 and pRET1202 containing the kanamycin resistance gene. In order to confirm that the obtained colonies contained the plasmids, the plasmids were isolated and all were verified to contain the shuttle vector.

[0162]This suggested that R. erythropolis can be transformed by electroporation and that pRET1002, pRET1102 and pRET1202 function as shuttle vectors.

Example 7

Obtaining Aminoketone Asymmetric Reductase Gene (Mak Gene)

[0163]The mak gene was isolated from R. erythropolis MAK-34 strain for insertion of the mak gene into the shuttle vector shown in FIG. 5.

[0164]First, genomic DNA was obtained from R. erythropolis MAK-34 strain. After inoculating R. erythropolis MAK-34 strain to 5 mL of GPY medium, culturing with shaking was performed at 30° C. for 48 hours, and then the culture solution was seeded in 100 mL of GPY medium and subcultured at 200 rpm at 30° C. for 10 hours. The genomic DNA was obtained using a Genomic DNA Buffer set and Genomic-tip 500/G (QIAGEN).

[0165]The obtained genomic DNA was used as template for PCR using KOD-plus-. The primers used were MAKF1 (5'-GAATCTTCTCGTTGATGCAGATCAGGTC-3'; SEQ ID NO: 80) and MAKR2 (5'-CTGACTCCGTAGTGTTCTGCCAGTTC-3'; SEQ ID NO: 81), for PCR at an annealing temperature of 68° C. and extension reaction for 1 minute and 50 seconds. The obtained PCR product was subjected to phenol/chloroform treatment and ethanol precipitation, and then mixed with pUC18 that had been digested with Sma I for 2 hours at 30° C., and ligated therewith using Ligation High. Competent High (Toyobo) was used for transformation of E. coli DH5α, which was then plated on LB agar medium (containing 100 μg/mL ampicillin) that had been coated with 30 μL of 0.1 M IPTG and 4% X-gal, and was allowed to stand at 30° C. for 60 hours. White colonies were selected from among the appearing colonies, and were cultured with shaking in LB liquid medium containing 100 μg/mL ampicillin at 30° C. for 60 hours. The DNA was purified from the obtained culture solution using a GFX® Micro Plasmid Prep Kit. The obtained DNA was confirmed by electrophoresis on 0.8% agarose gel. The obtained clone was designated as pMAK-1.

Example 8

Construction of Expression Vector-1

[0166]A promoter and aminoketone asymmetric reductase gene (mak gene) were inserted into the shuttle vector shown in Table 5.

[0167]First, an expression vector (without exogenous promoter) containing approximately 400 bp upstream from the mak gene was constructed.

[0168]pMAK-1 was digested with Sma I at 30° C. for 2 hours, and then with Pst I at 37° C. for 2 hours. The solution was supplied for 0.8% agarose gel electrophoresis. The DNA size marker used was Loading Quick DNA size Marker λ/EcoR I+Hind III double digest. After electrophoresis, an approximately 1.4 kbp DNA fragment was purified using a GFX® PCR DNA and Gel Band Purification Kit, and used as the insert DNA. On the other hand, the vector used was pRET1102 digested with Hinc II and Pst I at 37° C. for 2 hours. The DNA fragments were ligated with Ligation High and Competent High was used for transformation of E. coli DH5α. The cells were plated on LB agar medium containing 100 μg/mL kanamycin and allowed to stand at 30° C. for 60 hours.

[0169]The appearing colonies were cultured with shaking in LB liquid medium containing 100 μg/mL kanamycin at 30° C. for 60 hours. The DNA was purified from the obtained culture solution using a GFX® Micro Plasmid Prep Kit. The obtained DNA was confirmed by 0.8% agarose gel electrophoresis.

[0170]For screening, the obtained DNA without restriction endonuclease treatment and the DNA after digestion with Pst I at 37° C. for 2 hours were subjected to 0.8% agarose gel electrophoresis, and the target plasmid was obtained based on the size of the DNA. The size markers used were Loading Quick DNA size Marker λ/EcoR I+Hind III double digest, undigested pRET1102, and pRET1102 that had been digested with Pst I at 37° C. for 2 hours. The plasmid obtained in this manner was designated as pRET1104.

Example 9

Construction of Expression Vector-2

[0171]The shuttle vectors were reduced in size, since size reduction of shuttle vectors is effective for expression vector enhancement, gene modification, transformation efficiency improvement and replication in cells.

[0172]First, shuttle vector pRET1102 was reduced. After digesting pRET1102 with BamH I and Hinc II for 2 hours, it was electrophoresed on 0.8% agarose gel and an approximately 2.7 kbp DNA fragment was recovered using a GFX® PCR DNA and Gel Band Purification Kit to prepare a pRET1102 DNA fragment. The size marker used was Loading Quick DNA size Marker λ/EcoR I+Hind III double digest.

[0173]Separately, a DNA fragment replicable in E. coli was prepared by digesting pHSG299 with BamH I and Hinc II for 2 hours, subjecting it to 0.8% agarose gel electrophoresis, and recovering an approximately 2.7 kbp DNA fragment using a GFX® PCR DNA and Gel Band Purification Kit.

[0174]The DNA fragments were ligated with Ligation High and Competent High was used for transformation of E. coli JM109 cells, which were then plated on LB agar medium, containing 100 μg/mL kanamycin, that had been coated with 30 μL of 0.1 M IPTG and 4% X-gal, and was allowed to stand at 30° C. for 48 hours.

[0175]White colonies were selected from among the appearing colonies, and were cultured with shaking in LB liquid medium containing 100 μg/mL kanamycin at 30° C. for 48 hours. The DNA was purified from the obtained culture solution using a GFX® Micro Plasmid Prep Kit. The reduced shuttle vector of pRET1102 obtained in this manner was designated as pRET1123 (approximately 5.3 kbp).

[0176]Next, shuttle vector pRET1202 was reduced. The Rhodococcus-derived DNA fragment was prepared by digesting pRET1202 with EcoR I for 2 hours and then with Dra III for 2 hours, using Blunting High for blunting of the ends, performing 0.8% agarose gel electrophoresis, and then recovering an approximately 3.7 kbp DNA fragment using a GFX® PCR DNA and Gel Band Purification Kit. The size marker used was Loading Quick DNA size Marker λ/EcoR I+Hind III double digest. The DNA fragment was inserted at the Hinc II site of pHSG299. After ligation, Competent High was used for transformation of E. coli DH5α, which was then plated on LB agar medium, containing 100 μg/mL kanamycin, that had been coated with 30 μL of 0.1 M IPTG and 4% X-gal, and was allowed to stand at 30° C. for 72 hours. White colonies were selected from among the appearing colonies, and were cultured with shaking in LB liquid medium containing 100 μg/mL kanamycin at 30° C. for 72 hours. The DNA was purified from the obtained culture solution using a GFX® Micro Plasmid Prep Kit. When the plasmid obtained by screening was digested with Sac I, BamH I, Pst I or EcoR I for 2 hours, all of the clones had approximately 500 bp clipped at the side of EcoR I site of the Rhodococcus-derived region. The plasmid was designated as pRET1204 (approximately 5.9 kbp). It was not possible to obtain a clone with no clipping of the genus Rhodococcus replication region.

[0177]The shuttle vector pRET1002 was reduced in a similar manner to obtain pRET1006 (approximately 4.9 kbp).

[0178]R. erythropolis was transformed with these three reduced plasmids, pRET1006, pRET1123 and pRET1204, and upon confirming the presence or absence of shuttle vector by the method described in Example 6, all the shuttle vectors were detected in the transformed cells. This suggested that the three reduced plasmids pRET1006, pRET1123 and pRET1204 are replicated in R. erythropolis.

Example 10

Construction of Expression Vector-3

[0179]An expression vector was constructed by inserting the mak gene into the shuttle vector constructed in Example 9.

[0180]The Pst I site of pRET1123 constructed in Example 9 was deleted to allow cloning of the promoter in a single step. After digesting pRET1123 with Pst I for 2 hours, Blunting High was used for blunting of the ends and Ligation High was used for ligation. The solution was used to transform E. coli JM109 using Competent High, and culturing was performed on an LB plate containing 100 μg/mL kanamycin at 30° C. for 36 hours. The formed colonies were inoculated into LB liquid medium containing 100 μg/mL kanamycin and cultured at 30° C. for 24 hours, and then the DNA was purified using a GFX® Micro Plasmid Prep Kit to obtain pRET1132.

[0181]The obtained pRET1132 was digested with Pst I for 1 hour and then electrophoresed on 0.8% agarose, which confirmed that pRET1132 was not cleaved by Pst I. The controls used were pRET1123 and pRET1132 not digested with Pst I, and pRET1123 digested with Pst I.
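Why digestion, blunting and re-ligation removes the Pst I site can be sketched on a toy sequence. This is a simplified model under stated assumptions: Pst I cuts CTGCA^G and leaves a 4-nt 3' overhang, the blunting step is assumed to trim that overhang completely, and the sequence contains exactly one site. The function name and the example sequence are hypothetical and are not taken from pRET1123.

```python
PST_I_SITE = "CTGCAG"  # Pst I cuts CTGCA^G, leaving a 4-nt 3' overhang (TGCA)


def delete_pst_i_site(seq):
    """Simulate Pst I digestion, removal of the 3' overhangs, and blunt re-ligation.

    Assumes exactly one Pst I site and that blunting trims the overhangs completely.
    """
    seq = seq.upper()
    if seq.count(PST_I_SITE) != 1:
        raise ValueError("expected exactly one Pst I site")
    start = seq.index(PST_I_SITE)
    left = seq[: start + 1]    # keeps the double-stranded 'C' of the site
    right = seq[start + 5:]    # keeps the double-stranded 'G' of the site
    return left + right


toy = "AAACTGCAGTTT"           # hypothetical sequence, not taken from pRET1123
repaired = delete_pst_i_site(toy)
print(repaired, PST_I_SITE in repaired)
```

In this model the six-base site collapses to CG, so the re-ligated sequence is four bases shorter and no longer contains CTGCAG, which is consistent with the lack of Pst I cleavage observed for pRET1132.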

Example 11

Construction of Expression Vector-4

[0182]A clone was constructed having a promoter and the mak gene inserted in the aforementioned shuttle vector.

[0183]A clone was constructed having a Pst I site upstream from the mak gene, for insertion of a promoter. The procedure was carried out in the following manner to obtain a clone having His-Tag added to the C-terminus of the aminoketone asymmetric reductase. PCR was conducted with KOD-plus- using the pMAK-1 obtained in Example 7 as template, MAKPstF (5'-GACCACTGCAGATCAATCAACTCTGATGAGGTCC-3'; SEQ ID NO: 82) and MAKHisBglIIR (5'-CGCTTAGATCTCAGTTCGCCGAGCGCCATCGCCG-3'; SEQ ID NO: 83) as primers, with an annealing temperature of 68° C. and extension reaction for 1 minute and 50 seconds. A PCR fragment (insert) produced by digesting the obtained PCR product with Bgl II at 37° C. for 2 hours was ligated with pQE70 (digested with Sph I at 37° C. for 2 hours, blunted with Blunting High and digested with Bgl II at 37° C. for 2 hours) using Ligation High, and then Competent High was used for transformation of E. coli DH5α cells, which were plated on LB agar medium containing 100 μg/mL ampicillin and allowed to stand at 30° C. for 60 hours. The appearing colonies were cultured with shaking on LB liquid medium containing 100 μg/mL ampicillin at 30° C. for 60 hours. The DNA was purified using a GFX® Micro Plasmid Prep Kit. The obtained DNA was confirmed by 0.8% agarose gel electrophoresis.
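The two primers above carry the restriction sites used for cloning. As a quick check, the following sketch searches each primer for the recognition sequence of the enzyme named in it (Pst I, CTGCAG; Bgl II, AGATCT). The primer sequences are copied from the paragraph above; the dictionary layout and function name are illustrative assumptions.

```python
# Recognition sequences of the two enzymes named in the primer names.
SITES = {"Pst I": "CTGCAG", "Bgl II": "AGATCT"}

# Primer sequences (5'->3') as given in the text, paired with the expected site.
PRIMERS = {
    "MAKPstF": ("GACCACTGCAGATCAATCAACTCTGATGAGGTCC", "Pst I"),
    "MAKHisBglIIR": ("CGCTTAGATCTCAGTTCGCCGAGCGCCATCGCCG", "Bgl II"),
}


def site_position(primer, enzyme):
    """Return the 0-based index of the enzyme's site in the primer, or -1 if absent."""
    return primer.upper().find(SITES[enzyme])


for name, (sequence, enzyme) in PRIMERS.items():
    pos = site_position(sequence, enzyme)
    print(f"{name}: {enzyme} site at index {pos}, {pos} bases of 5' flank")
```

Both sites sit five bases in from the 5' end, which gives each enzyme a short flanking stretch to bind when the PCR product is digested.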

[0184]For screening, the DNA without restriction endonuclease treatment and the DNA after digestion with Pst I and Bgl II at 37° C. for 2 hours were subjected to 0.8% agarose gel electrophoresis, and the target plasmid was identified based on the size of the DNA. The plasmid obtained in this manner was designated as pMAK-2. The size markers used were Loading Quick DNA size Marker λ/EcoR I+Hind III double digest, pQE70, and pQE70 that had been digested with Bgl II at 37° C. for 2 hours.

[0185]The pRET1200 repA promoter was obtained by PCR amplification using as template a clone of pRET1204 in which repA encoded by the Rhodococcus-derived DNA fragment was in the same orientation as the kanamycin resistance gene encoded by pHSG299, and using as primers P1200rep-Pst5195 (5'-AGCCGCTGCAGAAGCAACACCGCATCCGCCCATTG-3'; SEQ ID NO: 84) and P7 (5'-CGCCAGGGTTTTCCCAGTCACGAC-3'; SEQ ID NO: 85), with an annealing temperature of 60° C. and extension reaction for 1 minute, followed by digestion with EcoR I and Pst I at 37° C. for 2 hours. A clone was constructed by inserting this promoter at the EcoR I-Pst I site of pMAK-2, and was designated as pMAK-19.

[0186]Next, PCR was conducted with KOD-plus- using as template pMAK-19 and as primers pQE70F1 (5'-GGCGTATCACGAGGCCCTTTCGTCTTCACC-3'; SEQ ID NO: 86) and pQE70R1135Bm (5'-GGTTGGATCCGTCATCACCGAAACGCGCGAGGCAG-3'; SEQ ID NO: 87), with an annealing temperature of 60° C. and extension reaction for 3 minutes. The PCR product was purified from the reaction solution by using a GFX® PCR DNA and Gel Band Purification Kit and after digestion of the purified PCR product with EcoR I and BamH I for 2 hours, it was electrophoresed on 0.8% agarose gel and the DNA fragment was purified by using a GFX® PCR DNA and Gel Band Purification Kit. The DNA fragment was used as an insert DNA.

[0187]Separately, a vector to be used as the expression shuttle vector was obtained by digesting pRET1132 with EcoR I and BamH I for 2 hours, subjecting the DNA fragment to 0.8% agarose gel electrophoresis and purifying the DNA fragment by using a GFX® PCR DNA and Gel Band Purification Kit. After mixing the insert DNA and vector, Ligation High was used for ligating them and Competent High was used for transformation of E. coli JM109 cells, which were plated on an LB plate containing 100 μg/mL kanamycin. The obtained colonies were cultured on LB liquid medium containing 100 μg/mL kanamycin, and then the plasmid DNA was recovered by using a GFX® Micro Plasmid Prep Kit and subjected to 0.8% agarose gel electrophoresis for screening. The size markers used were Loading Quick DNA size Marker λ/EcoR I+Hind III double digest and pRET1132. The obtained expression vector was designated as pRET1133.

[0188]Also, pMAK-19 was digested with EcoR I and Hind III at 37° C. for 2 hours, blunted with Blunting High and subjected to 0.8% agarose gel electrophoresis, and the approximately 1.6 kbp DNA fragment was purified by using a GFX® PCR DNA and Gel Band Purification Kit. The clone having this fragment inserted at the Hinc II site of pRET1102 was designated as pRET1114.

[0189]The pRET1133 promoter was also modified. The mak gene-expressing promoter encoded in pRET1133 is the repA gene promoter of pRET1200 and has a length of approximately 800 bp, and a plasmid was constructed by having approximately 200 bp clipped off from this promoter. The promoter used for the cloning was prepared by PCR. Plasmid pRET1200 was used as template, P1204rep-Ec2958 (5'-CGCGGAATTCGACCACCACGCACGCACACCGCAC-3'; SEQ ID NO: 88) and P1200rep-Pst5195 (5'-AGCCGCTGCAGAAGCAACACCGCATCCGCCCATTG-3'; SEQ ID NO: 84) were used as primers, and KOD-plus- was used as the PCR enzyme for PCR at an annealing temperature of 60° C. and extension reaction for 50 seconds. The PCR product was purified by using a GFX® PCR DNA and Gel Band Purification Kit, digested with the restriction endonucleases EcoR I and Pst I for 2 hours, and subjected to 1.6% agarose gel electrophoresis, and the DNA fragment was purified by using a GFX® PCR DNA and Gel Band Purification Kit. The DNA fragment was used as the insert DNA. The nucleotide sequence of the promoter region in the DNA fragment is set forth as SEQ ID NO: 77.

[0190]Separately, for the vector, pRET1133 was digested with restriction endonucleases EcoR I and Pst I for 2 hours and subjected to 0.8% agarose gel electrophoresis, and an approximately 7.2 kbp DNA fragment was purified by using a GFX® PCR DNA and Gel Band Purification Kit. The size marker used was Loading Quick DNA size Marker λ/EcoR I+Hind III double digest.

[0191]The insert DNA and vector obtained in this manner were ligated by using Ligation High, and Competent High was used for transformation of E. coli JM109 cells, which were plated on an LB plate containing 100 μg/mL kanamycin. The obtained colonies were cultured on LB liquid medium containing 100 μg/mL kanamycin, and then the plasmid DNA was recovered by using a GFX® Micro Plasmid Prep Kit and subjected to 0.8% agarose gel electrophoresis for screening. The size markers used were Loading Quick DNA size Marker λ/EcoR I+Hind III double digest and pRET1133.

[0192]Also, after digesting the obtained DNA with restriction endonucleases EcoR I and Pst I for 2 hours, it was subjected to 1.6% agarose gel electrophoresis and a DNA fragment corresponding to the approximately 600 bp insert DNA was confirmed. The size marker used was a 100 bp DNA Ladder. The expression vector obtained in this manner was designated as pRET1138.
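The diagnostic digest described above can be summarized as expected band sizes. The sketch below uses only the approximate figures given in this example (a vector fragment of about 7.2 kbp, a promoter of about 800 bp, and about 200 bp removed). It assumes that each plasmid has a single EcoR I site and a single Pst I site flanking the promoter, which is an assumption made for illustration; the function name is hypothetical.

```python
VECTOR_BP = 7200          # source: ~7.2 kbp EcoR I / Pst I vector fragment
FULL_PROMOTER_BP = 800    # source: approximate repA promoter length in pRET1133
CLIPPED_BP = 200          # source: approximate length removed from the promoter


def expected_bands(promoter_bp, vector_bp=VECTOR_BP):
    """Return the two band sizes (bp) expected from an EcoR I / Pst I double digest."""
    return sorted([vector_bp, promoter_bp], reverse=True)


parent = expected_bands(FULL_PROMOTER_BP)                  # pRET1133
shortened = expected_bands(FULL_PROMOTER_BP - CLIPPED_BP)  # pRET1138
print("pRET1133:", parent)
print("pRET1138:", shortened)
```

The small fragments differ by only about 200 bp, which may be why a 1.6% gel and a 100 bp ladder were chosen for this confirmation rather than the 0.8% gel used for the vector fragment.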

Example 12

Preparation of Recombinant R. erythropolis and Measurement of Enzyme Activity

[0193]The aforementioned expression vectors pRET1102, pRET1104, pRET1114 and pRET1138 were used for transformation of R. erythropolis MAK-34 strain and R. erythropolis JCM2895 (provided by RIKEN Japan), and the enzyme activity was measured. The aminoketone asymmetric reductase purified from MAK-34 strain has the ability to act on 1-2-methylamino-1-phenyl-1-propanone, as described in International Patent Publication WO02/070714, and to produce d-(1S,2S)-pseudoephedrine. It was also reacted with 1-2-dimethylaminopropiophenone, 1-amino-2-butanone, etc. and production of each corresponding β-aminoalcohol was confirmed.

[0194]The activity assay was conducted by preparing a reaction solution with a cell density of O.D.=5, 2% glucose and 0.2 M sodium phosphate buffer (pH 6.0), containing 3% (1S,2S)-2-(N-ethylamino)-1-phenyl-1-propanol (EAM) as substrate. A synthesis method for EAM is described in J. Am. Chem. Soc., Vol. 50, pp. 2287-2292, 1928. The reaction solution was incubated with shaking at 30° C. for 16 hours. The presence of (1S,2S)-2-(N-ethylamino)-1-phenyl-1-propanol (EPE), the β-aminoalcohol reaction product, was confirmed by HPLC. The column used was an Inertsil Ph-3 3.0×75 mm, the eluent was aqueous 7% acetonitrile and 0.05 M sodium phosphate buffer (pH 6.0), and the detection was carried out with UV (220 nm).
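The activities in this example are reported in μg/h mL/O.D. One plausible reading, assumed here and not stated in the text, is the product concentration divided by the reaction time and by the cell density. The sketch below applies that reading to the assay conditions above (16 hours, O.D.=5); the function name is hypothetical and the product concentration is an invented illustrative value, not a measurement.

```python
REACTION_HOURS = 16.0   # source: incubation time of the assay
CELL_DENSITY_OD = 5.0   # source: cell density of the reaction solution


def specific_activity(product_ug_per_ml, hours=REACTION_HOURS, od=CELL_DENSITY_OD):
    """Assumed definition: product (ug/mL) per hour per O.D. unit of cells."""
    if hours <= 0 or od <= 0:
        raise ValueError("hours and od must be positive")
    return product_ug_per_ml / (hours * od)


# Hypothetical product concentration, chosen only to illustrate the arithmetic.
example_product = 800.0  # ug/mL of EPE measured by HPLC (illustrative value)
print(f"{specific_activity(example_product):.1f} ug/h/mL/O.D.")
```

Normalizing by O.D. in this way would let strains that grow to different densities be compared on a per-cell basis.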

[0195]The results of the activity assay carried out in this manner are shown in Table 6. The pRET1104-introduced recombinant cells lacking the exogenous promoter region exhibited about the same activity as the pRET1102-introduced recombinant cells lacking the mak gene used as the control, and no recombinant enzyme expression was found.

[0196]With transformation of pRET1114 into MAK-34 strain, high specific activity was found compared to pRET1104. This indicated that the pRET1200 repA promoter region inserted into the vector functions as a promoter.

[0197]With transformation of pRET1138, the specific activity of the recombinant R. erythropolis MAK-34 strain was 37.7 μg/h mL/O.D. while the specific activity of the recombinant R. erythropolis JCM2895 was 34.9 μg/h mL/O.D., and therefore expression of the enzyme in both R. erythropolis strains was confirmed.

TABLE 6
Vector      MAK-34    JCM2895
pRET1102    1.0       1.0
pRET1104    0.7       2.0
pRET1114    17.2      not tested
pRET1138    37.7      34.9
Specific activity (units: μg/h mL/O.D.)
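The comparisons drawn from Table 6 can be made explicit as ratios between vectors. The sketch below copies the table values as given and computes the ratio of one vector's specific activity to another's in the same host; the function name and data layout are illustrative assumptions, and the untested entry is represented as None.

```python
# Specific activities from Table 6; None marks the combination that was not tested.
TABLE_6 = {
    "pRET1102": {"MAK-34": 1.0, "JCM2895": 1.0},
    "pRET1104": {"MAK-34": 0.7, "JCM2895": 2.0},
    "pRET1114": {"MAK-34": 17.2, "JCM2895": None},
    "pRET1138": {"MAK-34": 37.7, "JCM2895": 34.9},
}


def fold_over(vector, reference, host):
    """Return the activity ratio vector/reference in one host, or None if untested."""
    value = TABLE_6[vector][host]
    baseline = TABLE_6[reference][host]
    if value is None or baseline is None:
        return None
    return round(value / baseline, 1)


for host in ("MAK-34", "JCM2895"):
    print(host, "pRET1138 vs pRET1102:", fold_over("pRET1138", "pRET1102", host))
print("MAK-34 pRET1138 vs pRET1114:", fold_over("pRET1138", "pRET1114", "MAK-34"))
```

On these figures, the shortened promoter in pRET1138 gives roughly twice the activity of pRET1114 in the MAK-34 strain.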

Example 13

Purification of Enzyme

[0198]The recombinant cells obtained in Example 12 were cultured at 30° C. for 4 days in 100 mL of LB medium containing 100 μg/mL kanamycin, the cells were harvested by centrifugation at 12,000 rpm for 5 minutes, and the His-tagged protein was purified with The QIAexpressionist Kit (Qiagen). Specifically, the cells were disrupted by ultrasonic treatment, the supernatant was obtained by centrifugation, and the protein was purified with a nickel chelate column. Upon applying the obtained protein to SDS-PAGE, a protein band with a molecular weight of approximately 28,000 was observed. This molecular weight is roughly equivalent to the molecular weight of the aminoketone asymmetric reductase described in International Patent Publication WO02/070714, thus indicating that the aminoketone asymmetric reductase was produced in the recombinant Rhodococcus strains.

Example 14

Enzymatic Production of β-aminoalcohol

[0199]A 0.5 mL portion of reaction solution containing the purified enzyme (0.5 μg/mL) obtained in Example 13, 5 mM NADPH, 120 mM Tris-HCl (pH 7.5) and 5 mM EAM was reacted at 37° C. for 16 hours. The substrate and product (EPE) were analyzed by HPLC. The column used was an Inertsil Ph-3 3.0×75 mm, the eluent was aqueous 7% acetonitrile and 0.05 M sodium phosphate buffer (pH 6.0), and the detection was carried out with UV (220 nm). The results confirmed production of EPE.

[0200]Similarly, the purified enzyme or the crude enzyme extract obtained from the recombinant cells cultured as described in Example 13 was reacted with 1-2-dimethylaminopropiophenone and 1-amino-2-butanone, etc. and production of the corresponding β-aminoalcohols was confirmed.
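The composition of the 0.5 mL reaction in paragraph [0199] can be converted into absolute amounts. The sketch below uses only the stated concentrations and volume; the variable and function names are illustrative assumptions.

```python
VOLUME_ML = 0.5          # source: reaction volume
ENZYME_UG_PER_ML = 0.5   # source: purified enzyme concentration
NADPH_MM = 5.0           # source: NADPH concentration
EAM_MM = 5.0             # source: substrate (EAM) concentration


def micromoles(conc_mm, volume_ml):
    """Convert a millimolar concentration and a volume in mL to micromoles."""
    return conc_mm * volume_ml


enzyme_ug = ENZYME_UG_PER_ML * VOLUME_ML
nadph_umol = micromoles(NADPH_MM, VOLUME_ML)
eam_umol = micromoles(EAM_MM, VOLUME_ML)
print(f"enzyme: {enzyme_ug} ug, NADPH: {nadph_umol} umol, EAM: {eam_umol} umol")
```

The cofactor and substrate are present in equal molar amounts, which would be enough NADPH for complete conversion if one NADPH is consumed per ketone reduced, as is usual for this class of reductase.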

INDUSTRIAL APPLICABILITY

[0201]As explained above, the plasmids and shuttle vectors of the invention are derived from Rhodococcus strains (especially Rhodococcus erythropolis and Rhodococcus rhodnii), and when they are utilized for modification of the same bacteria by recombination, they allow creation of bacterial strains that more efficiently produce aminoketone asymmetric reductases. They also permit mass production of useful enzymes, including aminoketone asymmetric reductases, in transformants.

Sequence CWU 1

1001279DNARhodococcus erythropolismisc_feature202bp to 480bp pRET1100 1atgactctga gggtggacga accggagtcg gtgagaatgc ttcatccgag cgcttccccg 60gaagactgtg ccctggtcga gaccttcaag cctggtacct gccttttcga gaagccagga 120gaaggccggc agattatgcg atgcgacttt gtcggcgagt acgggagata tgcgcgagcc 180atcgagtctt cggatctgcg ttttctcgcc accctccagc aagaccaggc ccaacgcgaa 240ttcttcgctg aggagttcgg tgtggtggat ccgtcatga 2792243DNARhodococcus erythropolismisc_feature238bp to 480bp pRET1100 2atgcttcatc cgagcgcttc cccggaagac tgtgccctgg tcgagacctt caagcctggt 60acctgccttt tcgagaagcc aggagaaggc cggcagatta tgcgatgcga ctttgtcggc 120gagtacggga gatatgcgcg agccatcgag tcttcggatc tgcgttttct cgccaccctc 180cagcaagacc aggcccaacg cgaattcttc gctgaggagt tcggtgtggt ggatccgtca 240tga 2433144DNARhodococcus erythropolismisc_feature337bp to 480bp pRET1100 3atgcgatgcg actttgtcgg cgagtacggg agatatgcgc gagccatcga gtcttcggat 60ctgcgttttc tcgccaccct ccagcaagac caggcccaac gcgaattctt cgctgaggag 120ttcggtgtgg tggatccgtc atga 1444282DNARhodococcus erythropolismisc_feature477bp to 758bp pRET1100 4atgactggac cacaggagag aaagcgcaag gcggcgaagc cgtcgcggga gcctcagttg 60aactgctgtg aagcggacgt gccgaaacga gcaaaacagc ccccggttcc ctctacgttc 120gacctgctca cggtgaagga gactgcgggg ctgctgagag tcagtcaggc aactctttac 180cggctgcttc ggagtgggga aggacccaca tacacacgga tcggtggaca gatacgcgtt 240caccgcgagt cgctgcgtcg gttcatcgaa ccgcgtggat aa 28251548DNARhodococcus erythropolismisc_feature862bp to 2409bp pRET1100 5atgcacttcc acgataacgc agaggtcgga caagagggaa gaactgccgt tctctcgccg 60ttgcgcggcg tagccgccaa gcgggacgtg tctgacgatg cagcgaagcg gagtcggcag 120gcgcggcacg cgcctgggct tgttacatct gccacaactg tccgtgaatc tctgccagct 180cctgaaaccg ctggtcaggg ccttgcggaa tccgtgaccg ctgatgattt ttggtctcat 240tcgttccccc gcgctgacga tgtacgcggc gcagctgctt ccttccagtc ggtggctaac 300tgggatgggc gtgagggtcc gaggccgcgt ttcgttgtcg cgcctggcgt tgtccgcttg 360gaggtttgtg atctcgcacg ccgcgaacga acggctgaac gtgcgtatct ggctgctcgg 420gctcgggtgg atatggcggc tgccaggcat aactcgccgt acgacttcga cgtggacgat 
480gaagagttgg cggaactggc ttctctgcaa ggcctcgagg acgacgacat tgggggctgg 540tctgcggaga gggaaatagt gggctggtct gctcgttctc ggtcacggat gatcttgcga 600atggcagaac tcgactgggc tcccatgatg gatttgccgg gcattcctgc gatggtgacc 660ctcacctatc cgggggactg gcttacggtt gcccccaccg gcgctgaggt caaaaaacat 720ctccagacgt tcttcaaacg gttccaacgg gcctggggca ttgcctggat gggtgcgtgg 780aaaatggagt tccaaagccg aggcgctccg cattttcacc tgtacatggt ccctcctcat 840gggaaggcag gagactcgcg gaagctgcgg catgatgctg agctcttgaa atgggagata 900gcacgtgcag agggtgaaga cccaggtcgc aggccgtatt tccgggaagc tccaagcgat 960ggattgaagt ttcgtccgtg gctttctgcg gtgtgggccg acgtcgtaga tcatccggac 1020cccaaggaaa aagaaaagca cgtcagtgcc ggcactggag tggactacgc ggagggcacg 1080cgagggtcag atccgaaaag gcttgcggtg tacttctcca agcatggaac ctttgccgac 1140aaggaatatc agcacgtagt tcctgctcaa tggcagaaaa cgggtgcggg acctggcagg 1200ttctggggct accgcggttt gtcgccggcc acggctgcca ccgagatttc ctgggatgag 1260tacctgcttt tatctcgcac gttgcgacga ttgtcagcgc gaacgaagat ctgggacccg 1320gctttacgag gcggtagcgg cggccacaga tggactaagg cgatgatgcg acgcacggtt 1380acccggcacc gcttggacct cgtgaccggt gagattctgg gcacgaagac gcggaaggtt 1440cgggcgccag tgaagaggtt tgtccggact tcgggatacc tgtgtgtcaa tgacgggccc 1500gcactggctc gaaccctcag ccgtcttcgt acaagctgcc tgagctag 154861116DNARhodococcus erythropolismisc_feature1294bp to 2409bp pRET1100 6atggcggctg ccaggcataa ctcgccgtac gacttcgacg tggacgatga agagttggcg 60gaactggctt ctctgcaagg cctcgaggac gacgacattg ggggctggtc tgcggagagg 120gaaatagtgg gctggtctgc tcgttctcgg tcacggatga tcttgcgaat ggcagaactc 180gactgggctc ccatgatgga tttgccgggc attcctgcga tggtgaccct cacctatccg 240ggggactggc ttacggttgc ccccaccggc gctgaggtca aaaaacatct ccagacgttc 300ttcaaacggt tccaacgggc ctggggcatt gcctggatgg gtgcgtggaa aatggagttc 360caaagccgag gcgctccgca ttttcacctg tacatggtcc ctcctcatgg gaaggcagga 420gactcgcgga agctgcggca tgatgctgag ctcttgaaat gggagatagc acgtgcagag 480ggtgaagacc caggtcgcag gccgtatttc cgggaagctc caagcgatgg attgaagttt 540cgtccgtggc tttctgcggt gtgggccgac gtcgtagatc atccggaccc caaggaaaaa 
600gaaaagcacg tcagtgccgg cactggagtg gactacgcgg agggcacgcg agggtcagat 660ccgaaaaggc ttgcggtgta cttctccaag catggaacct ttgccgacaa ggaatatcag 720cacgtagttc ctgctcaatg gcagaaaacg ggtgcgggac ctggcaggtt ctggggctac 780cgcggtttgt cgccggccac ggctgccacc gagatttcct gggatgagta cctgctttta 840tctcgcacgt tgcgacgatt gtcagcgcga acgaagatct gggacccggc tttacgaggc 900ggtagcggcg gccacagatg gactaaggcg atgatgcgac gcacggttac ccggcaccgc 960ttggacctcg tgaccggtga gattctgggc acgaagacgc ggaaggttcg ggcgccagtg 1020aagaggtttg tccggacttc gggatacctg tgtgtcaatg acgggcccgc actggctcga 1080accctcagcc gtcttcgtac aagctgcctg agctag 11167960DNARhodocuccus erythropolismisc_feature1450bp to 2409bp pRET1100 7atgatcttgc gaatggcaga actcgactgg gctcccatga tggatttgcc gggcattcct 60gcgatggtga ccctcaccta tccgggggac tggcttacgg ttgcccccac cggcgctgag 120gtcaaaaaac atctccagac gttcttcaaa cggttccaac gggcctgggg cattgcctgg 180atgggtgcgt ggaaaatgga gttccaaagc cgaggcgctc cgcattttca cctgtacatg 240gtccctcctc atgggaaggc aggagactcg cggaagctgc ggcatgatgc tgagctcttg 300aaatgggaga tagcacgtgc agagggtgaa gacccaggtc gcaggccgta tttccgggaa 360gctccaagcg atggattgaa gtttcgtccg tggctttctg cggtgtgggc cgacgtcgta 420gatcatccgg accccaagga aaaagaaaag cacgtcagtg ccggcactgg agtggactac 480gcggagggca cgcgagggtc agatccgaaa aggcttgcgg tgtacttctc caagcatgga 540acctttgccg acaaggaata tcagcacgta gttcctgctc aatggcagaa aacgggtgcg 600ggacctggca ggttctgggg ctaccgcggt ttgtcgccgg ccacggctgc caccgagatt 660tcctgggatg agtacctgct tttatctcgc acgttgcgac gattgtcagc gcgaacgaag 720atctgggacc cggctttacg aggcggtagc ggcggccaca gatggactaa ggcgatgatg 780cgacgcacgg ttacccggca ccgcttggac ctcgtgaccg gtgagattct gggcacgaag 840acgcggaagg ttcgggcgcc agtgaagagg tttgtccgga cttcgggata cctgtgtgtc 900aatgacgggc ccgcactggc tcgaaccctc agccgtcttc gtacaagctg cctgagctag 9608948DNARhodococcus erythropolismisc_feature1462bp to 2409bp pRET1100 8atggcagaac tcgactgggc tcccatgatg gatttgccgg gcattcctgc gatggtgacc 60ctcacctatc cgggggactg gcttacggtt gcccccaccg gcgctgaggt caaaaaacat 120ctccagacgt 
tcttcaaacg gttccaacgg gcctggggca ttgcctggat gggtgcgtgg 180aaaatggagt tccaaagccg aggcgctccg cattttcacc tgtacatggt ccctcctcat 240gggaaggcag gagactcgcg gaagctgcgg catgatgctg agctcttgaa atgggagata 300gcacgtgcag agggtgaaga cccaggtcgc aggccgtatt tccgggaagc tccaagcgat 360ggattgaagt ttcgtccgtg gctttctgcg gtgtgggccg acgtcgtaga tcatccggac 420cccaaggaaa aagaaaagca cgtcagtgcc ggcactggag tggactacgc ggagggcacg 480cgagggtcag atccgaaaag gcttgcggtg tacttctcca agcatggaac ctttgccgac 540aaggaatatc agcacgtagt tcctgctcaa tggcagaaaa cgggtgcggg acctggcagg 600ttctggggct accgcggttt gtcgccggcc acggctgcca ccgagatttc ctgggatgag 660tacctgcttt tatctcgcac gttgcgacga ttgtcagcgc gaacgaagat ctgggacccg 720gctttacgag gcggtagcgg cggccacaga tggactaagg cgatgatgcg acgcacggtt 780acccggcacc gcttggacct cgtgaccggt gagattctgg gcacgaagac gcggaaggtt 840cgggcgccag tgaagaggtt tgtccggact tcgggatacc tgtgtgtcaa tgacgggccc 900gcactggctc gaaccctcag ccgtcttcgt acaagctgcc tgagctag 9489924DNARhodoccus erythropolismisc_feature1486bp to 2409bp pRET1100 9atgatggatt tgccgggcat tcctgcgatg gtgaccctca cctatccggg ggactggctt 60acggttgccc ccaccggcgc tgaggtcaaa aaacatctcc agacgttctt caaacggttc 120caacgggcct ggggcattgc ctggatgggt gcgtggaaaa tggagttcca aagccgaggc 180gctccgcatt ttcacctgta catggtccct cctcatggga aggcaggaga ctcgcggaag 240ctgcggcatg atgctgagct cttgaaatgg gagatagcac gtgcagaggg tgaagaccca 300ggtcgcaggc cgtatttccg ggaagctcca agcgatggat tgaagtttcg tccgtggctt 360tctgcggtgt gggccgacgt cgtagatcat ccggacccca aggaaaaaga aaagcacgtc 420agtgccggca ctggagtgga ctacgcggag ggcacgcgag ggtcagatcc gaaaaggctt 480gcggtgtact tctccaagca tggaaccttt gccgacaagg aatatcagca cgtagttcct 540gctcaatggc agaaaacggg tgcgggacct ggcaggttct ggggctaccg cggtttgtcg 600ccggccacgg ctgccaccga gatttcctgg gatgagtacc tgcttttatc tcgcacgttg 660cgacgattgt cagcgcgaac gaagatctgg gacccggctt tacgaggcgg tagcggcggc 720cacagatgga ctaaggcgat gatgcgacgc acggttaccc ggcaccgctt ggacctcgtg 780accggtgaga ttctgggcac gaagacgcgg aaggttcggg cgccagtgaa gaggtttgtc 840cggacttcgg gatacctgtg 
tgtcaatgac gggcccgcac tggctcgaac cctcagccgt 900cttcgtacaa gctgcctgag ctag 92410921DNARhodococcus erythropolismisc_feature1489bp to 2409bp pRET1100 10atggatttgc cgggcattcc tgcgatggtg accctcacct atccggggga ctggcttacg 60gttgccccca ccggcgctga ggtcaaaaaa catctccaga cgttcttcaa acggttccaa 120cgggcctggg gcattgcctg gatgggtgcg tggaaaatgg agttccaaag ccgaggcgct 180ccgcattttc acctgtacat ggtccctcct catgggaagg caggagactc gcggaagctg 240cggcatgatg ctgagctctt gaaatgggag atagcacgtg cagagggtga agacccaggt 300cgcaggccgt atttccggga agctccaagc gatggattga agtttcgtcc gtggctttct 360gcggtgtggg ccgacgtcgt agatcatccg gaccccaagg aaaaagaaaa gcacgtcagt 420gccggcactg gagtggacta cgcggagggc acgcgagggt cagatccgaa aaggcttgcg 480gtgtacttct ccaagcatgg aacctttgcc gacaaggaat atcagcacgt agttcctgct 540caatggcaga aaacgggtgc gggacctggc aggttctggg gctaccgcgg tttgtcgccg 600gccacggctg ccaccgagat ttcctgggat gagtacctgc ttttatctcg cacgttgcga 660cgattgtcag cgcgaacgaa gatctgggac ccggctttac gaggcggtag cggcggccac 720agatggacta aggcgatgat gcgacgcacg gttacccggc accgcttgga cctcgtgacc 780ggtgagattc tgggcacgaa gacgcggaag gttcgggcgc cagtgaagag gtttgtccgg 840acttcgggat acctgtgtgt caatgacggg cccgcactgg ctcgaaccct cagccgtctt 900cgtacaagct gcctgagcta g 92111897DNARhodococcus erythropolismisc_feature1513bp to 2409bp pRET1100 11atggtgaccc tcacctatcc gggggactgg cttacggttg cccccaccgg cgctgaggtc 60aaaaaacatc tccagacgtt cttcaaacgg ttccaacggg cctggggcat tgcctggatg 120ggtgcgtgga aaatggagtt ccaaagccga ggcgctccgc attttcacct gtacatggtc 180cctcctcatg ggaaggcagg agactcgcgg aagctgcggc atgatgctga gctcttgaaa 240tgggagatag cacgtgcaga gggtgaagac ccaggtcgca ggccgtattt ccgggaagct 300ccaagcgatg gattgaagtt tcgtccgtgg ctttctgcgg tgtgggccga cgtcgtagat 360catccggacc ccaaggaaaa agaaaagcac gtcagtgccg gcactggagt ggactacgcg 420gagggcacgc gagggtcaga tccgaaaagg cttgcggtgt acttctccaa gcatggaacc 480tttgccgaca aggaatatca gcacgtagtt cctgctcaat ggcagaaaac gggtgcggga 540cctggcaggt tctggggcta ccgcggtttg tcgccggcca cggctgccac cgagatttcc 600tgggatgagt acctgctttt 
atctcgcacg ttgcgacgat tgtcagcgcg aacgaagatc 660tgggacccgg ctttacgagg cggtagcggc ggccacagat ggactaaggc gatgatgcga 720cgcacggtta cccggcaccg cttggacctc gtgaccggtg agattctggg cacgaagacg 780cggaaggttc gggcgccagt gaagaggttt gtccggactt cgggatacct gtgtgtcaat 840gacgggcccg cactggctcg aaccctcagc cgtcttcgta caagctgcct gagctag 89712780DNARhodococcus erythropolismisc_feature1630bp to 2409bp pRET1100 12atgggtgcgt ggaaaatgga gttccaaagc cgaggcgctc cgcattttca cctgtacatg 60gtccctcctc atgggaaggc aggagactcg cggaagctgc ggcatgatgc tgagctcttg 120aaatgggaga tagcacgtgc agagggtgaa gacccaggtc gcaggccgta tttccgggaa 180gctccaagcg atggattgaa gtttcgtccg tggctttctg cggtgtgggc cgacgtcgta 240gatcatccgg accccaagga aaaagaaaag cacgtcagtg ccggcactgg agtggactac 300gcggagggca cgcgagggtc agatccgaaa aggcttgcgg tgtacttctc caagcatgga 360acctttgccg acaaggaata tcagcacgta gttcctgctc aatggcagaa aacgggtgcg 420ggacctggca ggttctgggg ctaccgcggt ttgtcgccgg ccacggctgc caccgagatt 480tcctgggatg agtacctgct tttatctcgc acgttgcgac gattgtcagc gcgaacgaag 540atctgggacc cggctttacg aggcggtagc ggcggccaca gatggactaa ggcgatgatg 600cgacgcacgg ttacccggca ccgcttggac ctcgtgaccg gtgagattct gggcacgaag 660acgcggaagg ttcgggcgcc agtgaagagg tttgtccgga cttcgggata cctgtgtgtc 720aatgacgggc ccgcactggc tcgaaccctc agccgtcttc gtacaagctg cctgagctag 78013765DNARhodococcus erythropolismisc_feature1645bp to 2409bp pRET1100 13atggagttcc aaagccgagg cgctccgcat tttcacctgt acatggtccc tcctcatggg 60aaggcaggag actcgcggaa gctgcggcat gatgctgagc tcttgaaatg ggagatagca 120cgtgcagagg gtgaagaccc aggtcgcagg ccgtatttcc gggaagctcc aagcgatgga 180ttgaagtttc gtccgtggct ttctgcggtg tgggccgacg tcgtagatca tccggacccc 240aaggaaaaag aaaagcacgt cagtgccggc actggagtgg actacgcgga gggcacgcga 300gggtcagatc cgaaaaggct tgcggtgtac ttctccaagc atggaacctt tgccgacaag 360gaatatcagc acgtagttcc tgctcaatgg cagaaaacgg gtgcgggacc tggcaggttc 420tggggctacc gcggtttgtc gccggccacg gctgccaccg agatttcctg ggatgagtac 480ctgcttttat ctcgcacgtt gcgacgattg tcagcgcgaa cgaagatctg ggacccggct 540ttacgaggcg 
gtagcggcgg ccacagatgg actaaggcga tgatgcgacg cacggttacc 600cggcaccgct tggacctcgt gaccggtgag attctgggca cgaagacgcg gaaggttcgg 660gcgccagtga agaggtttgt ccggacttcg ggatacctgt gtgtcaatga cgggcccgca 720ctggctcgaa ccctcagccg tcttcgtaca agctgcctga gctag 76514723DNARhodococcus erythropolismisc_feature1687bp to 2409bp pRET1100 14atggtccctc ctcatgggaa ggcaggagac tcgcggaagc tgcggcatga tgctgagctc 60ttgaaatggg agatagcacg tgcagagggt gaagacccag gtcgcaggcc gtatttccgg 120gaagctccaa gcgatggatt gaagtttcgt ccgtggcttt ctgcggtgtg ggccgacgtc 180gtagatcatc cggaccccaa ggaaaaagaa aagcacgtca gtgccggcac tggagtggac 240tacgcggagg gcacgcgagg gtcagatccg aaaaggcttg cggtgtactt ctccaagcat 300ggaacctttg ccgacaagga atatcagcac gtagttcctg ctcaatggca gaaaacgggt 360gcgggacctg gcaggttctg gggctaccgc ggtttgtcgc cggccacggc tgccaccgag 420atttcctggg atgagtacct gcttttatct cgcacgttgc gacgattgtc agcgcgaacg 480aagatctggg acccggcttt acgaggcggt agcggcggcc acagatggac taaggcgatg 540atgcgacgca cggttacccg gcaccgcttg gacctcgtga ccggtgagat tctgggcacg 600aagacgcgga aggttcgggc gccagtgaag aggtttgtcc ggacttcggg atacctgtgt 660gtcaatgacg ggcccgcact ggctcgaacc ctcagccgtc ttcgtacaag ctgcctgagc 720tag 72315186DNARhodococcus erythropolismisc_feature2224bp to 2409bp pRET1100 15atgatgcgac gcacggttac ccggcaccgc ttggacctcg tgaccggtga gattctgggc 60acgaagacgc ggaaggttcg ggcgccagtg aagaggtttg tccggacttc gggatacctg 120tgtgtcaatg acgggcccgc actggctcga accctcagcc gtcttcgtac aagctgcctg 180agctag 18616183DNARhodococcus erythropolismisc_feature2227bp to 2409bp pRET1100 16atgcgacgca cggttacccg gcaccgcttg gacctcgtga ccggtgagat tctgggcacg 60aagacgcgga aggttcgggc gccagtgaag aggtttgtcc ggacttcggg atacctgtgt 120gtcaatgacg ggcccgcact ggctcgaacc ctcagccgtc ttcgtacaag ctgcctgagc 180tag 18317432DNARhodococcus erythropolismisc_feature1875bp to 1444bp pRET1100 17atgatctacg acgtcggccc acaccgcaga aagccacgga cgaaacttca atccatcgct 60tggagcttcc cggaaatacg gcctgcgacc tgggtcttca ccctctgcac gtgctatctc 120ccatttcaag agctcagcat catgccgcag cttccgcgag tctcctgcct 
tcccatgagg 180agggaccatg tacaggtgaa aatgcggagc gcctcggctt tggaactcca ttttccacgc 240acccatccag gcaatgcccc aggcccgttg gaaccgtttg aagaacgtct ggagatgttt 300tttgacctca gcgccggtgg gggcaaccgt aagccagtcc cccggatagg tgagggtcac 360catcgcagga atgcccggca aatccatcat gggagcccag tcgagttctg ccattcgcaa 420gatcatccgt ga 43218291DNARhodococcus erythropolismisc_feature1734bp to 1444bp pRET1100 18atgccgcagc ttccgcgagt ctcctgcctt cccatgagga gggaccatgt acaggtgaaa 60atgcggagcg cctcggcttt ggaactccat tttccacgca cccatccagg caatgcccca 120ggcccgttgg aaccgtttga agaacgtctg gagatgtttt ttgacctcag cgccggtggg 180ggcaaccgta agccagtccc ccggataggt gagggtcacc atcgcaggaa tgcccggcaa 240atccatcatg ggagcccagt cgagttctgc cattcgcaag atcatccgtg a 29119258DNARhodococcus erythropolismisc_feature1701bp to 1444bp pRET1100 19atgaggaggg accatgtaca ggtgaaaatg cggagcgcct cggctttgga actccatttt 60ccacgcaccc atccaggcaa tgccccaggc ccgttggaac cgtttgaaga acgtctggag 120atgttttttg acctcagcgc cggtgggggc aaccgtaagc cagtcccccg gataggtgag 180ggtcaccatc gcaggaatgc ccggcaaatc catcatggga gcccagtcga gttctgccat 240tcgcaagatc atccgtga 25820231DNARhodococcus erythropolismisc_feature1674bp to 1444bp pRET1100 20atgcggagcg cctcggcttt ggaactccat tttccacgca cccatccagg caatgcccca 60ggcccgttgg aaccgtttga agaacgtctg gagatgtttt ttgacctcag cgccggtggg 120ggcaaccgta agccagtccc ccggataggt gagggtcacc atcgcaggaa tgcccggcaa 180atccatcatg ggagcccagt cgagttctgc cattcgcaag atcatccgtg a 23121138DNARhodococcus erythropolismisc_feature1581bp to 1444bp pRET1100 21atgttttttg acctcagcgc cggtgggggc aaccgtaagc cagtcccccg gataggtgag 60ggtcaccatc gcaggaatgc ccggcaaatc catcatggga gcccagtcga gttctgccat 120tcgcaagatc atccgtga 13822423DNARhodococcus erythropolismisc_feature2828bp to 2406bp pRET1100 22atggtgggag ggcaacactc ccaatacgct tcagttatga atgaagacag agacaacatc 60atcgccaggt tccgcgtcga aatgctccgc tcaatcgagg atgcaattca tttagccgca 120ctctccgcga acgacgaaaa ccgttatgcc gcaacagaag acaatcgacc cgtgcggaca 180caactatcgc aacaacagca ggttgtcctg accgagctga cattggccga ccacatggaa 
240aagctcgcgc gggagcacct cgtttaccta gccgacagag cgcgggagat gaattgcacc 300tgggtagaga taggtcagtc gttgggtctc tctccccacg gagcgcagca gcgcatcacc 360agaagccgcc caaaacccgc catccagcaa aagacaaagc cgaaaggcgt tccgcgcgtc 420tag 42323387DNARhodococcus erythropolismisc_feature2792bp to 2406bp pRET1100 23atgaatgaag acagagacaa catcatcgcc aggttccgcg tcgaaatgct ccgctcaatc 60gaggatgcaa ttcatttagc cgcactctcc gcgaacgacg aaaaccgtta tgccgcaaca 120gaagacaatc gacccgtgcg gacacaacta tcgcaacaac agcaggttgt cctgaccgag 180ctgacattgg ccgaccacat ggaaaagctc gcgcgggagc acctcgttta cctagccgac 240agagcgcggg agatgaattg cacctgggta gagataggtc agtcgttggg tctctctccc 300cacggagcgc agcagcgcat caccagaagc cgcccaaaac ccgccatcca gcaaaagaca

360aagccgaaag gcgttccgcg cgtctag 38724342DNARhodococcus erythropolismisc_feature2747bp to 2406bp pRET1100 24atgctccgct caatcgagga tgcaattcat ttagccgcac tctccgcgaa cgacgaaaac 60cgttatgccg caacagaaga caatcgaccc gtgcggacac aactatcgca acaacagcag 120gttgtcctga ccgagctgac attggccgac cacatggaaa agctcgcgcg ggagcacctc 180gtttacctag ccgacagagc gcgggagatg aattgcacct gggtagagat aggtcagtcg 240ttgggtctct ctccccacgg agcgcagcag cgcatcacca gaagccgccc aaaacccgcc 300atccagcaaa agacaaagcc gaaaggcgtt ccgcgcgtct ag 34225189DNARhodococcus erythropolismisc_feature2594bp to 2406bp pRET1100 25atggaaaagc tcgcgcggga gcacctcgtt tacctagccg acagagcgcg ggagatgaat 60tgcacctggg tagagatagg tcagtcgttg ggtctctctc cccacggagc gcagcagcgc 120atcaccagaa gccgcccaaa acccgccatc cagcaaaaga caaagccgaa aggcgttccg 180cgcgtctag 18926135DNARhodococcus erythropolismisc_feature2540bp to 2406bp pRET1100 26atgaattgca cctgggtaga gataggtcag tcgttgggtc tctctcccca cggagcgcag 60cagcgcatca ccagaagccg cccaaaaccc gccatccagc aaaagacaaa gccgaaaggc 120gttccgcgcg tctag 13527336DNARhodococcus erythropolismisc_feature2971bp to 3306bp pRET1100 27atggctttga aagctgctgg caacgtgatt cctgattcct ccgcgtacga gtaccgggcg 60gttcaggtcg agccgaagat ggtcagaaaa gacccggaag acccgaactc tgagcagttc 120cagaagcaga aggacggcac gccggtgtgg tcgatcgact gcattcgggt cgaccgggca 180tcaggcaaca aggcaatcgt gaccgtgacg gttccggacg tgatggaacc ggatgttgcg 240gggccggtgg agttctccga gatgattgcc ggtttctggg tttcgcgcag tggttcgggc 300atgtggtttt cggcaagcgc cgtcgcttct ctctga 33628258DNARhodococcus erythropolismisc_feature3049bp to 3306bp pRET1100 28atggtcagaa aagacccgga agacccgaac tctgagcagt tccagaagca gaaggacggc 60acgccggtgt ggtcgatcga ctgcattcgg gtcgaccggg catcaggcaa caaggcaatc 120gtgaccgtga cggttccgga cgtgatggaa ccggatgttg cggggccggt ggagttctcc 180gagatgattg ccggtttctg ggtttcgcgc agtggttcgg gcatgtggtt ttcggcaagc 240gccgtcgctt ctctctga 25829525DNARhodococcus erythropolismisc_feature3577bp to 3053bp pRET1100 29atgtcgatgt actgccctcc gctgaacggc cccagctctt ccggagagag aacgaggcac 
60ccggcaacgt ccgagaacac cccgttttcc cacttcggat cggccggcac tctcagcggc 120acagcttcgg actgtgaacg atcactgaac acgttcgccg cttgccaacc tgccgcaacc 180agcacaaaca cgagcacgag ggcacccaca cccagcgcaa cgccttttcc tttggacatt 240tccgaacctt tcgaggggcg acgatcagcg atcagagaga agcgacggcg cttgccgaaa 300accacatgcc cgaaccactg cgcgaaaccc agaaaccggc aatcatctcg gagaactcca 360ccggccccgc aacatccggt tccatcacgt ccggaaccgt cacggtcacg attgccttgt 420tgcctgatgc ccggtcgacc cgaatgcagt cgatcgacca caccggcgtg ccgtccttct 480gcttctggaa ctgctcagag ttcgggtctt ccgggtcttt tctga 52530519DNARhodococcus erythropolismisc_feature3571bp to 3053bp pRET1100 30atgtactgcc ctccgctgaa cggccccagc tcttccggag agagaacgag gcacccggca 60acgtccgaga acaccccgtt ttcccacttc ggatcggccg gcactctcag cggcacagct 120tcggactgtg aacgatcact gaacacgttc gccgcttgcc aacctgccgc aaccagcaca 180aacacgagca cgagggcacc cacacccagc gcaacgcctt ttcctttgga catttccgaa 240cctttcgagg ggcgacgatc agcgatcaga gagaagcgac ggcgcttgcc gaaaaccaca 300tgcccgaacc actgcgcgaa acccagaaac cggcaatcat ctcggagaac tccaccggcc 360ccgcaacatc cggttccatc acgtccggaa ccgtcacggt cacgattgcc ttgttgcctg 420atgcccggtc gacccgaatg cagtcgatcg accacaccgg cgtgccgtcc ttctgcttct 480ggaactgctc agagttcggg tcttccgggt cttttctga 51931564DNARhodococcus erythropolismisc_feature3339bp to 3902bp pRET1100 31atgtccaaag gaaaaggcgt tgcgctgggt gtgggtgccc tcgtgctcgt gtttgtgctg 60gttgcggcag gttggcaagc ggcgaacgtg ttcagtgatc gttcacagtc cgaagctgtg 120ccgctgagag tgccggccga tccgaagtgg gaaaacgggg tgttctcgga cgttgccggg 180tgcctcgttc tctctccgga agagctgggg ccgttcagcg gagggcagta catcgacata 240gtgaggccag ttgagccgga gaggttggag cgcgactggg tgaggtcggc tgagtgcgtt 300tcggcgtcga tgaatgtctc tgacctgttg gtttctgctc ttccagagtc cacccgtccc 360cccggcgatt tcgttcgttc gtggaaagtg gcgagtgatg attactgcta tgagggtgat 420aacccgcaag gctgcacttc tcgtatgccg gtttgggtct ctgcaaaaaa ctggtggtgc 480acagaacccg tactcgatcc gctcgttcgt cgctgtgagg tctttcctgc aaggcaaatc 540gttgtgccgg aaggggtttc gtga 56432255DNARhodococcus erythropolismisc_feature3648bp to 3902bp pRET1100 
32atgaatgtct ctgacctgtt ggtttctgct cttccagagt ccacccgtcc ccccggcgat 60ttcgttcgtt cgtggaaagt ggcgagtgat gattactgct atgagggtga taacccgcaa 120ggctgcactt ctcgtatgcc ggtttgggtc tctgcaaaaa actggtggtg cacagaaccc 180gtactcgatc cgctcgttcg tcgctgtgag gtctttcctg caaggcaaat cgttgtgccg 240gaaggggttt cgtga 25533669DNARhodococcus erythropolismisc_feature4366bp to 5034bp pRET1100 33atgggcaccc cacgcccaag taaccgctgg tgcgctggat atttcggcgg tggtctcgtg 60agcggggaga agcggcacag cgaggccggc ccggtagaaa tcatcttttt gatgctggca 120gtcagggcgg gggactacat cgtcgccgtg actgcggttc tcgcggtcgg gttcttcgcg 180gtcgcggttg agggtttctg gttcctggtc gtcgcagtca tcgctgcacc ggcgtggtgg 240tttctgcgcg actgggaatc gaagcggagg gccgtacggg tctttgaacg ggcatggaag 300gggacacctg aatcccccgg tattgctctc tcccttggcc tgtcgaacgt ggcggggtct 360ctgccgaggt tgaggaagtt tgaaactggt tcggggatac gcacactcgt gttttctttg 420ccgcccggag tcactgccga gagctttgag aaagttcgcc ctgcgctggc agacgcgatg 480gggggtcacc gctgccaagt agagaaggtg gcccccggac aggtccgcgt cagagtgatt 540gatgaggatt cgatgaagac gccgcgtgat gcgggatggg cgaaagatgt tgtgctggaa 600gaggatacgt tcgacggtct tccgggcgag acgcgatcct ggttcgagca agaggggccg 660gcatcatga 66934558DNARhodococcus erythropolismisc_feature4477bp to 5034bp pRET1100 34atgctggcag tcagggcggg ggactacatc gtcgccgtga ctgcggttct cgcggtcggg 60ttcttcgcgg tcgcggttga gggtttctgg ttcctggtcg tcgcagtcat cgctgcaccg 120gcgtggtggt ttctgcgcga ctgggaatcg aagcggaggg ccgtacgggt ctttgaacgg 180gcatggaagg ggacacctga atcccccggt attgctctct cccttggcct gtcgaacgtg 240gcggggtctc tgccgaggtt gaggaagttt gaaactggtt cggggatacg cacactcgtg 300ttttctttgc cgcccggagt cactgccgag agctttgaga aagttcgccc tgcgctggca 360gacgcgatgg ggggtcaccg ctgccaagta gagaaggtgg cccccggaca ggtccgcgtc 420agagtgattg atgaggattc gatgaagacg ccgcgtgatg cgggatgggc gaaagatgtt 480gtgctggaag aggatacgtt cgacggtctt ccgggcgaga cgcgatcctg gttcgagcaa 540gaggggccgg catcatga 55835791DNARhodococcus erythropolismisc_feature2410bp to 3200bp pRET1100 35acgcgcggaa cgcctttcgg ctttgtcttt tgctggatgg cgggttttgg gcggcttctg 
60gtgatgcgct gctgcgctcc gtggggagag agacccaacg actgacctat ctctacccag 120gtgcaattca tctcccgcgc tctgtcggct aggtaaacga ggtgctcccg cgcgagcttt 180tccatgtggt cggccaatgt cagctcggtc aggacaacct gctgttgttg cgatagttgt 240gtccgcacgg gtcgattgtc ttctgttgcg gcataacggt tttcgtcgtt cgcggagagt 300gcggctaaat gaattgcatc ctcgattgag cggagcattt cgacgcggaa cctggcgatg 360atgttgtctc tgtcttcatt cataactgaa gcgtattggg agtgttgccc tcccaccatg 420tgtgccaatg caggtgtgaa ctgagtcaca gtttctcaat agactccaag tttgtgatcc 480ttttactccc aaaatggggc atgatgtgtg cgtgcctcgg ttcaggggcg aaagttcgac 540acctcgaaag aaggcctcga catggctttg aaagctgctg gcaacgtgat tcctgattcc 600tccgcgtacg agtaccgggc ggttcaggtc gagccgaaga tggtcagaaa agacccggaa 660gacccgaact ctgagcagtt ccagaagcag aaggacggca cgccggtgtg gtcgatcgac 720tgcattcggg tcgaccgggc atcaggcaac aaggcaatcg tgaccgtgac ggttccggac 780gtgatggaac c 79136501DNARhodococcus erythropolismisc_feature1000bp to 1500bp pRET1100 36cttgttacat ctgccacaac tgtccgtgaa tctctgccag ctcctgaaac cgctggtcag 60ggccttgcgg aatccgtgac cgctgatgat ttttggtctc attcgttccc ccgcgctgac 120gatgtacgcg gcgcagctgc ttccttccag tcggtggcta actgggatgg gcgtgagggt 180ccgaggccgc gtttcgttgt cgcgcctggc gttgtccgct tggaggtttg tgatctcgca 240cgccgcgaac gaacggctga acgtgcgtat ctggctgctc gggctcgggt ggatatggcg 300gctgccaggc ataactcgcc gtacgacttc gacgtggacg atgaagagtt ggcggaactg 360gcttctctgc aaggcctcga ggacgacgac attgggggct ggtctgcgga gagggaaata 420gtgggctggt ctgctcgttc tcggtcacgg atgatcttgc gaatggcaga actcgactgg 480gctcccatga tggatttgcc g 50137945DNARhodococcus erythropolismisc_feature5000bp to 500bp pRET1100 37gatcctggtt cgagcaagag gggccggcat catgagaaaa tcggcgggag tatctcggat 60tcctatccgt ctcgggcgct ctcagtacgg ggaagacgtt ggattcgatc tcgctgcgga 120cgccgctcac atcgccatgc agggcaaaac ccgatccggc aaaagtcagg cgacgtacaa 180cgtgttagct caggcagcag cgaacgcggc ggttcgagtc gtagggtccg acccgacaca 240cgtactcctg gagcccttca aacatcgagg ggtgtccgag ccttacgtgg tttcgggact 300gaatgcgcag gccacggtgg acatgctggg ctgggtcaag cgtgagtctg atcgtcgcat 360cgaccagatg 
tggcccctgc gtaccgacaa gttttccgag ttcggggctt cgttcccgct 420gatactcgtc gtgctcgaag agtttcccgg gatcctcgag ggggcagcgg acgaagacgc 480cgcgttaggc cgaaaacctg ccgagcgtct cgcaccccgc atttcggcct acgtgcgtca 540gatagcagcg cagtcggcaa aggctggaat tcgccttctc ctgctctcgc aacgagcgga 600ggcctcgatc attggcggca atgcgcgttc gaatttcggg gtcaagatga ctctgagggt 660ggacgaaccg gagtcggtga gaatgcttca tccgagcgct tccccggaag actgtgccct 720ggtcgagacc ttcaagcctg gtacctgcct tttcgagaag ccaggagaag gccggcagat 780tatgcgatgc gactttgtcg gcgagtacgg gagatatgcg cgagccatcg agtcttcgga 840tctgcgtttt ctcgccaccc tccagcaaga ccaggcccaa cgcgaattct tcgctgagga 900gttcggtgtg gtggatccgt catgactgga ccacaggaga gaaag 94538939DNARhodococcus rhodniimisc_feature3350bp to 2412bp pRET1000 38atggttgcgg tggaagagca cacaggcggc gcctgggaac agctgtggct accgctgtgg 60ccactggcaa ccgacgattt cctcgacggc gtctaccgga tgcggcgatc agacgcactg 120gatcgccgct acatcgagtc gaacccgcag gcattgagca acctgctcgt cgtggacgtt 180gaccacccgg acgccgcgct gcgggcgctg tcggcggccg ggaatcatcc tctgccgaac 240gcgatcgtgg agaacccccg taacgggcac gcacacgctg tgtgggcgct ggcagagccg 300ttcacccgca ccgagtacgc ccgtcgtaag ccgctcgcct atgcggccgc cgtcaccgaa 360ggcctccggc gcgccgtcca gggggacaag ggctattcgg gcctgatgac caagaacccg 420actcacggtg actgggacac ccattggctg cacaccgagc ggcgatccct cgccgagctc 480gaggcggaac tcggcatcca catgccgcca acgcgctggc ggcaaacccg atcgcgccgt 540gagaacccga tcggcctcgg ccgaaactgc gccctgttcg aaaccgcacg cacctgggcc 600taccgcgaaa tccgcttcca ctggggcgac ccgaccggcc tcggggccgc gatctatgcg 660gaagccgcac agatcaacgc cacgttcagg aacccggtca caggcaggcc cgatccactg 720ccagcaagcg agctacgcgc cgtcgcggcc tccattaccc gctggatcac tacaaagtcc 780cggatgtggg ccgacggccc tgctgtctac gaggccacat tcatcgccat acaagccgca 840cgcggtcgca agatgagtga gaagaagcgc gaggcaaacc ggaaacgagc gacgaaggtc 900gaccggaacg cattgtggga ggcagaccgt gggcgctga 93939840DNARhodococcus rhodniimisc_feature3251bp to 2412bp pRET1000 39atgcggcgat cagacgcact ggatcgccgc tacatcgagt cgaacccgca ggcattgagc 60aacctgctcg tcgtggacgt tgaccacccg gacgccgcgc tgcgggcgct 
gtcggcggcc 120gggaatcatc ctctgccgaa cgcgatcgtg gagaaccccc gtaacgggca cgcacacgct 180gtgtgggcgc tggcagagcc gttcacccgc accgagtacg cccgtcgtaa gccgctcgcc 240tatgcggccg ccgtcaccga aggcctccgg cgcgccgtcc agggggacaa gggctattcg 300ggcctgatga ccaagaaccc gactcacggt gactgggaca cccattggct gcacaccgag 360cggcgatccc tcgccgagct cgaggcggaa ctcggcatcc acatgccgcc aacgcgctgg 420cggcaaaccc gatcgcgccg tgagaacccg atcggcctcg gccgaaactg cgccctgttc 480gaaaccgcac gcacctgggc ctaccgcgaa atccgcttcc actggggcga cccgaccggc 540ctcggggccg cgatctatgc ggaagccgca cagatcaacg ccacgttcag gaacccggtc 600acaggcaggc ccgatccact gccagcaagc gagctacgcg ccgtcgcggc ctccattacc 660cgctggatca ctacaaagtc ccggatgtgg gccgacggcc ctgctgtcta cgaggccaca 720ttcatcgcca tacaagccgc acgcggtcgc aagatgagtg agaagaagcg cgaggcaaac 780cggaaacgag cgacgaaggt cgaccggaac gcattgtggg aggcagaccg tgggcgctga 84040534DNARhodococcus rhodniimisc_feature2945bp to 2412bp pRET1000 40atgaccaaga acccgactca cggtgactgg gacacccatt ggctgcacac cgagcggcga 60tccctcgccg agctcgaggc ggaactcggc atccacatgc cgccaacgcg ctggcggcaa 120acccgatcgc gccgtgagaa cccgatcggc ctcggccgaa actgcgccct gttcgaaacc 180gcacgcacct gggcctaccg cgaaatccgc ttccactggg gcgacccgac cggcctcggg 240gccgcgatct atgcggaagc cgcacagatc aacgccacgt tcaggaaccc ggtcacaggc 300aggcccgatc cactgccagc aagcgagcta cgcgccgtcg cggcctccat tacccgctgg 360atcactacaa agtcccggat gtgggccgac ggccctgctg tctacgaggc cacattcatc 420gccatacaag ccgcacgcgg tcgcaagatg agtgagaaga agcgcgaggc aaaccggaaa 480cgagcgacga aggtcgaccg gaacgcattg tgggaggcag accgtgggcg ctga 53441438DNARhodococcus rhodniimisc_feature2849bp to 2412bp pRET1000 41atgccgccaa cgcgctggcg gcaaacccga tcgcgccgtg agaacccgat cggcctcggc 60cgaaactgcg ccctgttcga aaccgcacgc acctgggcct accgcgaaat ccgcttccac 120tggggcgacc cgaccggcct cggggccgcg atctatgcgg aagccgcaca gatcaacgcc 180acgttcagga acccggtcac aggcaggccc gatccactgc cagcaagcga gctacgcgcc 240gtcgcggcct ccattacccg ctggatcact acaaagtccc ggatgtgggc cgacggccct 300gctgtctacg aggccacatt catcgccata caagccgcac gcggtcgcaa gatgagtgag 
360aagaagcgcg aggcaaaccg gaaacgagcg acgaaggtcg accggaacgc attgtgggag 420gcagaccgtg ggcgctga 43842207DNARhodococcus rhodniimisc_feature2365bp to 2159bp pRET1000 42atgggggcct ccacgcgcac gatccagcgc atcatggccg agccgcggga ccagttcctc 60gcacgggcag ccgagaaccg tcgccgggcc gtcgagctgc gcgagcaggg cctgaagtac 120cgcgagatcg ccgaggagat gggaatctcc accggaacgg tgggaaagct cctgcacgac 180gcacgcaagt acgcggtcag ctcctag 20743174DNARhodococcus rhodniimisc_feature2332bp to 2159bp pRET1000 43atggccgagc cgcgggacca gttcctcgca cgggcagccg agaaccgtcg ccgggccgtc 60gagctgcgcg agcagggcct gaagtaccgc gagatcgccg aggagatggg aatctccacc 120ggaacggtgg gaaagctcct gcacgacgca cgcaagtacg cggtcagctc ctag 17444330DNARhodococcus rhodniimisc_feature3197bp to 3526bp pRET1000 44atgcctgcgg gttcgactcg atgtagcggc gatccagtgc gtctgatcgc cgcatccggt 60agacgccgtc gaggaaatcg tcggttgcca gtggccacag cggtagccac agctgttccc 120aggcgccgcc tgtgtgctct tccaccgcaa ccatggggaa cacactcaca cacaagatcg 180atttattccg gtacgacacg ccagccaagt cagatgtttc ggtttctgga gcggtcctcc 240agacctttga gatccgctcc agaaacgtcc acaaattatt ggggtacgtc gaaccaagcc 300ttatcaggta tcccggggtt ccgggggtga 33045357DNARhodococcus rhodniimisc_feature4035bp to 3679bp pRET1000 45atggggtggt tattgcttgt tgcgtcgggg gccgtggcga tggtggccgg tgtggtctta 60ccgcgccggg atcgtctcgg gccggcacca ggatttccct ggttctgggt ggtgttccca 120tccacgtgca ttgccatcgc tgccgcggtg ggtgtcttcg cttggcccca agcggttacc 180ggcacgggga gctactggtg ggatccgccc agcgcgagct caccgaccct gcagttcctg 240tcaaacgagc agtaccggcg cctcgtgaca ctgcgccggt tgcagggggc gctaccggtg 300gtgtccctcg tgggaagcgg attgtgcgtg tgggcctggc gtcgacgccg cttctga 35746318DNARhodococcus rhodniimisc_feature3996bp to 3679bp pRET1000 46atggtggccg gtgtggtctt accgcgccgg gatcgtctcg ggccggcacc aggatttccc 60tggttctggg tggtgttccc atccacgtgc attgccatcg ctgccgcggt gggtgtcttc 120gcttggcccc aagcggttac cggcacgggg agctactggt gggatccgcc cagcgcgagc 180tcaccgaccc tgcagttcct gtcaaacgag cagtaccggc gcctcgtgac actgcgccgg 240ttgcaggggg cgctaccggt ggtgtccctc gtgggaagcg gattgtgcgt gtgggcctgg 
300cgtcgacgcc gcttctga 31847450DNARhodococcus rhodniimisc_feature4381bp to 4830bp pRET1000 47atggccgctg acgctgcatc tgacgaccgg cggaccgagg tccgcgccgc tgcttcgcgg 60gccgctgacg cggccccggc gaagcgcacc cgcaccgtgg cggtgcggct gaccgatggg 120gaggaggccg cgtggatcga cgccgcgctg gccgatggcc accggcagct cggggcgtgg 180gtgcgtgagc gggcggtggc cggctatctc gggaaggtcc gcccgaagac cggcagtgga 240atgtcggcgg aggcggccgc ggaggtcgcc gcgatgcggc agcagatgac gaaggtgggg 300aacaacctga accagatcgc gagggcgatc aacgccgggc aggtgccgtc gcagatggcc 360gagtccctgc agaaggggtg gctggagagg tgggggcagg agttggggcg gatggcggat 420cggctcgacg cgctcgacga ccagggctga 45048210DNARhodococcus rhodniimisc_feature4621bp to 4830bp pRET1000 48atgtcggcgg aggcggccgc ggaggtcgcc gcgatgcggc agcagatgac gaaggtgggg 60aacaacctga accagatcgc gagggcgatc aacgccgggc aggtgccgtc gcagatggcc 120gagtccctgc agaaggggtg gctggagagg tgggggcagg agttggggcg gatggcggat 180cggctcgacg cgctcgacga ccagggctga 21049177DNARhodococcus rhodniimisc_feature4654bp to 4830bp pRET1000 49atgcggcagc agatgacgaa ggtggggaac aacctgaacc agatcgcgag ggcgatcaac 60gccgggcagg tgccgtcgca gatggccgag tccctgcaga aggggtggct ggagaggtgg 120gggcaggagt tggggcggat ggcggatcgg ctcgacgcgc tcgacgacca gggctga 17750165DNARhodococcus rhodniimisc_feature4666bp to 4830bp pRET1000 50atgacgaagg tggggaacaa cctgaaccag atcgcgaggg cgatcaacgc cgggcaggtg 60ccgtcgcaga tggccgagtc cctgcagaag gggtggctgg agaggtgggg gcaggagttg 120gggcggatgg cggatcggct cgacgcgctc gacgaccagg gctga 16551453DNARhodococcus rhodniimisc_feature5161bp to 4709bp pRET1000 51atgactctcg aagcccatcc gctcggcgac cgtctgcgcg atgtccgcga actcggtatc 60ggtcagccgc cgatccccgg gcgcgcaccg cagcgagcaa tgccacaccg gcttacccac 120ccgcgcgttc gtcgcggcgg cccgctcgaa gtcccgcccc caccgggtcg ggtttttggc 180ggtgacctgc accgatcccg cgatcaccgt cccgccggca atcagccggc ccgcctcggt 240gcggtagctg tgcggggtgg ccttccccgg cccgtgcaga tacgccgcca accccttcgg 300gtcgctgccc gtgctgatct tcgcgatcac gtcagccctg gtcgtcgagc gcgtcgagcc 360gatccgccat ccgccccaac tcctgccccc acctctccag ccaccccttc tgcagggact 
420cggccatctg cgacggcacc tgcccggcgt tga 45352354DNARhodococcus rhodniimisc_feature5062bp to 4709bp pRET1000 52atgccacacc ggcttaccca cccgcgcgtt cgtcgcggcg gcccgctcga agtcccgccc 60ccaccgggtc gggtttttgg cggtgacctg caccgatccc gcgatcaccg tcccgccggc 120aatcagccgg cccgcctcgg tgcggtagct gtgcggggtg gccttccccg gcccgtgcag 180atacgccgcc aaccccttcg ggtcgctgcc cgtgctgatc ttcgcgatca cgtcagccct 240ggtcgtcgag cgcgtcgagc cgatccgcca tccgccccaa ctcctgcccc cacctctcca
300gccacccctt ctgcagggac tcggccatct gcgacggcac ctgcccggcg ttga 35453288DNARhodococcus rhodniimisc_feature2331bp to 2618bp pRET1000 53atgatgcgct ggatcgtgcg cgtggaggcc cccatcttct cggccagctc gcgagctgtc 60tgcttgcggc ggatcggtcg ttcagcgccc acggtctgcc tcccacaatg cgttccggtc 120gaccttcgtc gctcgtttcc ggtttgcctc gcgcttcttc tcactcatct tgcgaccgcg 180tgcggcttgt atggcgatga atgtggcctc gtagacagca gggccgtcgg cccacatccg 240ggactttgta gtgatccagc gggtaatgga ggccgcgacg gcgcgtag 28854285DNARhodococcus rhodniimisc_feature2334bp to 2618bp pRET1000 54atgcgctgga tcgtgcgcgt ggaggccccc atcttctcgg ccagctcgcg agctgtctgc 60ttgcggcgga tcggtcgttc agcgcccacg gtctgcctcc cacaatgcgt tccggtcgac 120cttcgtcgct cgtttccggt ttgcctcgcg cttcttctca ctcatcttgc gaccgcgtgc 180ggcttgtatg gcgatgaatg tggcctcgta gacagcaggg ccgtcggccc acatccggga 240ctttgtagtg atccagcggg taatggaggc cgcgacggcg cgtag 28555336DNARhodococcus rhodniimisc_feature2907bp to 3242bp pRET1000 55atgggtgtcc cagtcaccgt gagtcgggtt cttggtcatc aggcccgaat agcccttgtc 60cccctggacg gcgcgccgga ggccttcggt gacggcggcc gcataggcga gcggcttacg 120acgggcgtac tcggtgcggg tgaacggctc tgccagcgcc cacacagcgt gtgcgtgccc 180gttacggggg ttctccacga tcgcgttcgg cagaggatga ttcccggccg ccgacagcgc 240ccgcagcgcg gcgtccgggt ggtcaacgtc cacgacgagc aggttgctca atgcctgcgg 300gttcgactcg atgtagcggc gatccagtgc gtctga 33656513DNARhodococcus rhodniimisc_feature1650bp to 2162bp pRET1000 56atgcggattg aactagttca tttggggaac gatgacctga tgaccgggga tcgtgaccta 60cccatgctga ccatcgccga ggcggtggac gcgacgcaga ccagtgagag cacgatcaag 120cgccgcctgc ggtcgggcgc gttcccgaac gcggtccgca ctgccgacgg gaagtggatg 180attcccctcg gtgacctatc agcggcaggg ctgagaccag ggaaaatggc gaaacctgac 240ccggtgaccc cttcaaatga ccgggtccgt gacctggcag ctgagaacgc cgagctccgt 300cagcgcctgg ccgtggccga agccctggcc agcgaacgca atcggatcat cgacgtgcag 360caacagatgc tccggatgct cgaagcccgg ccggtgtcgg ccctggagcc cgcggcggtt 420ccagtggcgg gtccgccgcc gcccgtcccg gccgccgatg gtcgggcagc tacgggcgcc 480ctggcccgga tacgtcgacg gcttctcggc tag 51357474DNARhodococcus 
rhodniimisc_feature1689bp to 2162bp pRET1000 57atgaccgggg atcgtgacct acccatgctg accatcgccg aggcggtgga cgcgacgcag 60accagtgaga gcacgatcaa gcgccgcctg cggtcgggcg cgttcccgaa cgcggtccgc 120actgccgacg ggaagtggat gattcccctc ggtgacctat cagcggcagg gctgagacca 180gggaaaatgg cgaaacctga cccggtgacc ccttcaaatg accgggtccg tgacctggca 240gctgagaacg ccgagctccg tcagcgcctg gccgtggccg aagccctggc cagcgaacgc 300aatcggatca tcgacgtgca gcaacagatg ctccggatgc tcgaagcccg gccggtgtcg 360gccctggagc ccgcggcggt tccagtggcg ggtccgccgc cgcccgtccc ggccgccgat 420ggtcgggcag ctacgggcgc cctggcccgg atacgtcgac ggcttctcgg ctag 47458450DNARhodococcus rhodniimisc_feature1713bp to 2162bp pRET1000 58atgctgacca tcgccgaggc ggtggacgcg acgcagacca gtgagagcac gatcaagcgc 60cgcctgcggt cgggcgcgtt cccgaacgcg gtccgcactg ccgacgggaa gtggatgatt 120cccctcggtg acctatcagc ggcagggctg agaccaggga aaatggcgaa acctgacccg 180gtgacccctt caaatgaccg ggtccgtgac ctggcagctg agaacgccga gctccgtcag 240cgcctggccg tggccgaagc cctggccagc gaacgcaatc ggatcatcga cgtgcagcaa 300cagatgctcc ggatgctcga agcccggccg gtgtcggccc tggagcccgc ggcggttcca 360gtggcgggtc cgccgccgcc cgtcccggcc gccgatggtc gggcagctac gggcgccctg 420gcccggatac gtcgacggct tctcggctag 45059336DNARhodococcus rhodniimisc_feature1827bp to 2162bp pRET1000 59atgattcccc tcggtgacct atcagcggca gggctgagac cagggaaaat ggcgaaacct 60gacccggtga ccccttcaaa tgaccgggtc cgtgacctgg cagctgagaa cgccgagctc 120cgtcagcgcc tggccgtggc cgaagccctg gccagcgaac gcaatcggat catcgacgtg 180cagcaacaga tgctccggat gctcgaagcc cggccggtgt cggccctgga gcccgcggcg 240gttccagtgg cgggtccgcc gccgcccgtc ccggccgccg atggtcgggc agctacgggc 300gccctggccc ggatacgtcg acggcttctc ggctag 33660288DNARhodococcus rhodniimisc_feature1875bp to 2162bp pRET1000 60atggcgaaac ctgacccggt gaccccttca aatgaccggg tccgtgacct ggcagctgag 60aacgccgagc tccgtcagcg cctggccgtg gccgaagccc tggccagcga acgcaatcgg 120atcatcgacg tgcagcaaca gatgctccgg atgctcgaag cccggccggt gtcggccctg 180gagcccgcgg cggttccagt ggcgggtccg ccgccgcccg tcccggccgc cgatggtcgg 240gcagctacgg gcgccctggc 
ccggatacgt cgacggcttc tcggctag 28861264DNARhodococcus rhodniimisc_feature1906bp to 2169bp pRET1000 61atgaccgggt ccgtgacctg gcagctgaga acgccgagct ccgtcagcgc ctggccgtgg 60ccgaagccct ggccagcgaa cgcaatcgga tcatcgacgt gcagcaacag atgctccgga 120tgctcgaagc ccggccggtg tcggccctgg agcccgcggc ggttccagtg gcgggtccgc 180cgccgcccgt cccggccgcc gatggtcggg cagctacggg cgccctggcc cggatacgtc 240gacggcttct cggctaggag ctga 26462258DNARhodococcus rhodniimisc_feature810bp to 553bp pRET1000 62atgctatggg aggtatgcac ctttcgcgcg ttatgtacgc atcctgggca ccctgggcac 60gaccgacctt ctagcgatcg atggtgttct tggacatgct tcgccaggcc tgcgtctgtt 120ccctacgctc cacgaaagcc ttctcgctct ctgctcacag tcccattccg gattctcgac 180ctcggtcgcg gccgggtggc tgataccccg gggccgactg cggcatggtt ggtccctggc 240ggcgggccgg gggtttga 25863540DNARhodococcus rhodniimisc_feature117bp to 656bp pRET1000 63atgggaggcc acccgacacc gctacgggac atgctcgccg cccaggagca gcgccggaag 60ccgtggactc cggagcagaa acgccagtac gcgaccgcaa aagcccaagc agaacgcgcc 120gcgaaggcca aggacgccgc gaaatggacc gaggtcgccg gcggcggcta ccagcgggac 180gtgcgcggga tgaacctgcg actgtgggtg gctgaggacg gcgcctggtc gatcacctcg 240aagaaggacc ccgaccgcca gtacgccgca ggtcaggccg acaccgtcgc gcaggcccaa 300gccgcggcca cggccacagc gaaaacgcag gcccaggcga tgtggaagca ggtcccggcc 360gacaagcgca ccgagtcagc caccagagcg gtccggcgcg tgatcgcgga tctcaccccc 420accaaacccg ccgaggtcaa acccccggcc cgccgccagg gaccaaccat gccgcagtcg 480gccccggggt atcagccacc cggccgcgac cgaggtcgag aatccggaat gggactgtga 54064510DNARhodococcus rhodniimisc_feature147bp to 656bp pRET1000 64atgctcgccg cccaggagca gcgccggaag ccgtggactc cggagcagaa acgccagtac 60gcgaccgcaa aagcccaagc agaacgcgcc gcgaaggcca aggacgccgc gaaatggacc 120gaggtcgccg gcggcggcta ccagcgggac gtgcgcggga tgaacctgcg actgtgggtg 180gctgaggacg gcgcctggtc gatcacctcg aagaaggacc ccgaccgcca gtacgccgca 240ggtcaggccg acaccgtcgc gcaggcccaa gccgcggcca cggccacagc gaaaacgcag 300gcccaggcga tgtggaagca ggtcccggcc gacaagcgca ccgagtcagc caccagagcg 360gtccggcgcg tgatcgcgga tctcaccccc accaaacccg ccgaggtcaa acccccggcc 
420cgccgccagg gaccaaccat gccgcagtcg gccccggggt atcagccacc cggccgcgac 480cgaggtcgag aatccggaat gggactgtga 51065351DNARhodococcus rhodniimisc_feature306bp to 656bp pRET1000 65atgaacctgc gactgtgggt ggctgaggac ggcgcctggt cgatcacctc gaagaaggac 60cccgaccgcc agtacgccgc aggtcaggcc gacaccgtcg cgcaggccca agccgcggcc 120acggccacag cgaaaacgca ggcccaggcg atgtggaagc aggtcccggc cgacaagcgc 180accgagtcag ccaccagagc ggtccggcgc gtgatcgcgg atctcacccc caccaaaccc 240gccgaggtca aacccccggc ccgccgccag ggaccaacca tgccgcagtc ggccccgggg 300tatcagccac ccggccgcga ccgaggtcga gaatccggaa tgggactgtg a 35166201DNARhodococcus rhodniimisc_feature456bp to 656bp pRET1000 66atgtggaagc aggtcccggc cgacaagcgc accgagtcag ccaccagagc ggtccggcgc 60gtgatcgcgg atctcacccc caccaaaccc gccgaggtca aacccccggc ccgccgccag 120ggaccaacca tgccgcagtc ggccccgggg tatcagccac ccggccgcga ccgaggtcga 180gaatccggaa tgggactgtg a 201671326DNARhodococcus rhodniimisc_feature5144bp to 656bp pRET1000 67atgggcttcg agagtcatcc gtgggtggcg gtgcggcacg acgacgacca catccacctg 60gctgtctccc gggtcgattt tcagggcgtg acctggaaga acagcaacga ccggtggaag 120gtcgtcgagg tgatgcgcga ggtcgaacgc gcgcacggcc tgatcgaggt ggcgagcccg 180gagcgggccc gtggccggca agccagcagc ggcgagcaac gccgcgcggt gcggaccggc 240aaggtggcgc agcgggacgg tctgagggaa attgtgaccg ccgcccgcga catcgccgca 300ggccagggtg tgggggcgtt cgaagtggcg ctcgtacaga acccgattac ccgagtgcag 360gtgcggcgca acgtcgcgaa gacgggccgg atgaatggct acagcttcaa cctgcccggc 420tacgtcgacg ccgccgggga gccgatctgg ttgccggcct ccaaactcga ccggggtttg 480tcctggtcac agctggaaaa gacgctgacc agaccccgcc cggaccgcct cgccggcgag 540gagacggtgc cgcggaagcg gctcgagcgc gccgccgcgt gggagcagcg ccgccgcgag 600gtcggcggcg agcagttcgc agctgcccgc tgggagcagg cccgcgcgaa tgttggtgag 660acggccgggc ggatccgcgc cgaacagtcc gcggacacga agtggaagca ggtgaacgag 720gcgttgacca gccaagaccg ggccgaggag caggctgccg aggcagcgcg ggtcgcctcc 780gctgtcatgg gaggccaccc gacaccgcta cgggacatgc tcgccgccca ggagcagcgc 840cggaagccgt ggactccgga gcagaaacgc cagtacgcga ccgcaaaagc ccaagcagaa 900cgcgccgcga aggccaagga 
cgccgcgaaa tggaccgagg tcgccggcgg cggctaccag 960cgggacgtgc gcgggatgaa cctgcgactg tgggtggctg aggacggcgc ctggtcgatc 1020acctcgaaga aggaccccga ccgccagtac gccgcaggtc aggccgacac cgtcgcgcag 1080gcccaagccg cggccacggc cacagcgaaa acgcaggccc aggcgatgtg gaagcaggtc 1140ccggccgaca agcgcaccga gtcagccacc agagcggtcc ggcgcgtgat cgcggatctc 1200acccccacca aacccgccga ggtcaaaccc ccggcccgcc gccagggacc aaccatgccg 1260cagtcggccc cggggtatca gccacccggc cgcgaccgag gtcgagaatc cggaatggga 1320ctgtga 1326681194DNARhodococcus rhodniimisc_feature5276bp to 656bp pRET1000 68atgcgcgagg tcgaacgcgc gcacggcctg atcgaggtgg cgagcccgga gcgggcccgt 60ggccggcaag ccagcagcgg cgagcaacgc cgcgcggtgc ggaccggcaa ggtggcgcag 120cgggacggtc tgagggaaat tgtgaccgcc gcccgcgaca tcgccgcagg ccagggtgtg 180ggggcgttcg aagtggcgct cgtacagaac ccgattaccc gagtgcaggt gcggcgcaac 240gtcgcgaaga cgggccggat gaatggctac agcttcaacc tgcccggcta cgtcgacgcc 300gccggggagc cgatctggtt gccggcctcc aaactcgacc ggggtttgtc ctggtcacag 360ctggaaaaga cgctgaccag accccgcccg gaccgcctcg ccggcgagga gacggtgccg 420cggaagcggc tcgagcgcgc cgccgcgtgg gagcagcgcc gccgcgaggt cggcggcgag 480cagttcgcag ctgcccgctg ggagcaggcc cgcgcgaatg ttggtgagac ggccgggcgg 540atccgcgccg aacagtccgc ggacacgaag tggaagcagg tgaacgaggc gttgaccagc 600caagaccggg ccgaggagca ggctgccgag gcagcgcggg tcgcctccgc tgtcatggga 660ggccacccga caccgctacg ggacatgctc gccgcccagg agcagcgccg gaagccgtgg 720actccggagc agaaacgcca gtacgcgacc gcaaaagccc aagcagaacg cgccgcgaag 780gccaaggacg ccgcgaaatg gaccgaggtc gccggcggcg gctaccagcg ggacgtgcgc 840gggatgaacc tgcgactgtg ggtggctgag gacggcgcct ggtcgatcac ctcgaagaag 900gaccccgacc gccagtacgc cgcaggtcag gccgacaccg tcgcgcaggc ccaagccgcg 960gccacggcca cagcgaaaac gcaggcccag gcgatgtgga agcaggtccc ggccgacaag 1020cgcaccgagt cagccaccag agcggtccgg cgcgtgatcg cggatctcac ccccaccaaa 1080cccgccgagg tcaaaccccc ggcccgccgc cagggaccaa ccatgccgca gtcggccccg 1140gggtatcagc cacccggccg cgaccgaggt cgagaatccg gaatgggact gtga 119469936DNARhodococcus rhodniimisc_feature5534bp to 656bp pRET1000 69atgaatggct 
acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg 60ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc 120agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc 180gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc 240tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc 300gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag 360caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta 420cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc 480cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa 540tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg 600tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac 660gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa 720acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc 780agagcggtcc ggcgcgtgat cgcggatctc acccccacca aacccgccga ggtcaaaccc 840ccggcccgcc gccagggacc aaccatgccg cagtcggccc cggggtatca gccacccggc 900cgcgaccgag gtcgagaatc cggaatggga ctgtga 93670153DNARhodococcus rhodniimisc_feature3355bp to 3507bp pRET1000 70aacacactca cacacaagat cgatttattc cggtacgaca cgccagccaa gtcagatgtt 60tcggtttctg gagcggtcct ccagaccttt gagatccgct ccagaaacgt ccacaaatta 120ttggggtacg tcgaaccaag ccttatcagg tat 1537161DNARhodococcus rhodniimisc_feature4290bp to 4350bp pRET1000 71gagctatgcc cagggttgcg cagtgacttc gtcactgcgt aaccctgggc gctcgcctcc 60c 6172325DNARhodococcus rhodniimisc_feature3570bp to 3894bp pRET1000 72ccgctcgaag tccttgagtc agtgacagga ccactgctgg gctcccagcg cagaaggcaa 60gtgaaggcag acgactgcgg gaggtaagtc gggtacggca tgaggtcctt cagaagcggc 120gtcgacgcca ggcccacacg cacaatccgc ttcccacgag ggacaccacc ggtagcgccc 180cctgcaaccg gcgcagtgtc acgaggcgcc ggtactgctc gtttgacagg aactgcaggg 240tcggtgagct cgcgctgggc ggatcccacc agtagctccc cgtgccggta accgcttggg 300gccaagcgaa gacacccacc gcggc 325735444DNARhodococcus erythropolismisc_featurepRET1100 Full Length 73cccgggatcc tcgagggggc agcggacgaa 
gacgccgcgt taggccgaaa acctgccgag 60cgtctcgcac cccgcatttc ggcctacgtg cgtcagatag cagcgcagtc ggcaaaggct 120ggaattcgcc ttctcctgct ctcgcaacga gcggaggcct cgatcattgg cggcaatgcg 180cgttcgaatt tcggggtcaa gatgactctg agggtggacg aaccggagtc ggtgagaatg 240cttcatccga gcgcttcccc ggaagactgt gccctggtcg agaccttcaa gcctggtacc 300tgccttttcg agaagccagg agaaggccgg cagattatgc gatgcgactt tgtcggcgag 360tacgggagat atgcgcgagc catcgagtct tcggatctgc gttttctcgc caccctccag 420caagaccagg cccaacgcga attcttcgct gaggagttcg gtgtggtgga tccgtcatga 480ctggaccaca ggagagaaag cgcaaggcgg cgaagccgtc gcgggagcct cagttgaact 540gctgtgaagc ggacgtgccg aaacgagcaa aacagccccc ggttccctct acgttcgacc 600tgctcacggt gaaggagact gcggggctgc tgagagtcag tcaggcaact ctttaccggc 660tgcttcggag tggggaagga cccacataca cacggatcgg tggacagata cgcgttcacc 720gcgagtcgct gcgtcggttc atcgaaccgc gtggataacg tcacagagac agcgaaaacg 780cctcccctgg gtcaatccgg ttaccgccgg actgggggag gcgcttcgac acctacatcc 840gtcgcccctc gaaaggctca gatgcacttc cacgataacg cagaggtcgg acaagaggga 900agaactgccg ttctctcgcc gttgcgcggc gtagccgcca agcgggacgt gtctgacgat 960gcagcgaagc ggagtcggca ggcgcggcac gcgcctgggc ttgttacatc tgccacaact 1020gtccgtgaat ctctgccagc tcctgaaacc gctggtcagg gccttgcgga atccgtgacc 1080gctgatgatt tttggtctca ttcgttcccc cgcgctgacg atgtacgcgg cgcagctgct 1140tccttccagt cggtggctaa ctgggatggg cgtgagggtc cgaggccgcg tttcgttgtc 1200gcgcctggcg ttgtccgctt ggaggtttgt gatctcgcac gccgcgaacg aacggctgaa 1260cgtgcgtatc tggctgctcg ggctcgggtg gatatggcgg ctgccaggca taactcgccg 1320tacgacttcg acgtggacga tgaagagttg gcggaactgg cttctctgca aggcctcgag 1380gacgacgaca ttgggggctg gtctgcggag agggaaatag tgggctggtc tgctcgttct 1440cggtcacgga tgatcttgcg aatggcagaa ctcgactggg ctcccatgat ggatttgccg 1500ggcattcctg cgatggtgac cctcacctat ccgggggact ggcttacggt tgcccccacc 1560ggcgctgagg tcaaaaaaca tctccagacg ttcttcaaac ggttccaacg ggcctggggc 1620attgcctgga tgggtgcgtg gaaaatggag ttccaaagcc gaggcgctcc gcattttcac 1680ctgtacatgg tccctcctca tgggaaggca ggagactcgc ggaagctgcg gcatgatgct 1740gagctcttga 
aatgggagat agcacgtgca gagggtgaag acccaggtcg caggccgtat 1800ttccgggaag ctccaagcga tggattgaag tttcgtccgt ggctttctgc ggtgtgggcc 1860gacgtcgtag atcatccgga ccccaaggaa aaagaaaagc acgtcagtgc cggcactgga 1920gtggactacg cggagggcac gcgagggtca gatccgaaaa ggcttgcggt gtacttctcc 1980aagcatggaa cctttgccga caaggaatat cagcacgtag ttcctgctca atggcagaaa 2040acgggtgcgg gacctggcag gttctggggc taccgcggtt tgtcgccggc cacggctgcc 2100accgagattt cctgggatga gtacctgctt ttatctcgca cgttgcgacg attgtcagcg 2160cgaacgaaga tctgggaccc ggctttacga ggcggtagcg gcggccacag atggactaag 2220gcgatgatgc gacgcacggt tacccggcac cgcttggacc tcgtgaccgg tgagattctg 2280ggcacgaaga cgcggaaggt tcgggcgcca gtgaagaggt ttgtccggac ttcgggatac 2340ctgtgtgtca atgacgggcc cgcactggct cgaaccctca gccgtcttcg tacaagctgc 2400ctgagctaga cgcgcggaac gcctttcggc tttgtctttt gctggatggc gggttttggg 2460cggcttctgg tgatgcgctg ctgcgctccg tggggagaga gacccaacga ctgacctatc 2520tctacccagg tgcaattcat ctcccgcgct ctgtcggcta ggtaaacgag gtgctcccgc 2580gcgagctttt ccatgtggtc ggccaatgtc agctcggtca ggacaacctg ctgttgttgc 2640gatagttgtg tccgcacggg tcgattgtct tctgttgcgg cataacggtt ttcgtcgttc 2700gcggagagtg cggctaaatg aattgcatcc tcgattgagc ggagcatttc gacgcggaac 2760ctggcgatga tgttgtctct gtcttcattc ataactgaag cgtattggga gtgttgccct 2820cccaccatgt gtgccaatgc aggtgtgaac tgagtcacag tttctcaata gactccaagt 2880ttgtgatcct tttactccca aaatggggca tgatgtgtgc gtgcctcggt tcaggggcga 2940aagttcgaca cctcgaaaga aggcctcgac atggctttga aagctgctgg caacgtgatt 3000cctgattcct ccgcgtacga gtaccgggcg gttcaggtcg agccgaagat ggtcagaaaa 3060gacccggaag acccgaactc tgagcagttc cagaagcaga aggacggcac gccggtgtgg 3120tcgatcgact gcattcgggt cgaccgggca tcaggcaaca aggcaatcgt gaccgtgacg 3180gttccggacg tgatggaacc ggatgttgcg gggccggtgg agttctccga gatgattgcc 3240ggtttctggg tttcgcgcag tggttcgggc atgtggtttt cggcaagcgc cgtcgcttct 3300ctctgatcgc tgatcgtcgc ccctcgaaag gttcggaaat gtccaaagga aaaggcgttg 3360cgctgggtgt gggtgccctc gtgctcgtgt ttgtgctggt tgcggcaggt tggcaagcgg 3420cgaacgtgtt cagtgatcgt tcacagtccg aagctgtgcc 
gctgagagtg ccggccgatc 3480cgaagtggga aaacggggtg ttctcggacg ttgccgggtg cctcgttctc tctccggaag 3540agctggggcc gttcagcgga gggcagtaca tcgacatagt gaggccagtt gagccggaga 3600ggttggagcg cgactgggtg aggtcggctg agtgcgtttc ggcgtcgatg aatgtctctg 3660acctgttggt ttctgctctt ccagagtcca cccgtccccc cggcgatttc gttcgttcgt 3720ggaaagtggc gagtgatgat tactgctatg agggtgataa cccgcaaggc tgcacttctc 3780gtatgccggt ttgggtctct gcaaaaaact ggtggtgcac agaacccgta ctcgatccgc 3840tcgttcgtcg ctgtgaggtc tttcctgcaa ggcaaatcgt tgtgccggaa ggggtttcgt 3900gatgtttctc cgagcgtttt ttcgttccaa gttggtcatg gtggctcttg tcctggtcgc 3960tggcctgttt ctctacaacg cctgctcttc ttctgacgca aaggaagaga tcggcagcag 4020tctgaatctc tctcctgtca ctgctcgttc gaatccgtat gagggcgtcc agcccacgat 4080gagcgaaaaa agccctgttc ccgtccctgt cgtttccggc gacaggattt cgggggtggc
4140atcgtgcggg acggattacg ccgggaagcc tgcggtgacg ctggaagctg tgtggatttc 4200gtccgactcg gtgaactaca cactcgataa gaggcattgc ctggtgacga ccggcccgct 4260gtggaaacaa gcgatccgta aagcgtcagg gtcagagatt cggcctgagg gcgggagctg 4320gatacgggtg gtgcttgcca tgcctgacgg caatttcagg gcaggatggg caccccacgc 4380ccaagtaacc gctggtgcgc tggatatttc ggcggtggtc tcgtgagcgg ggagaagcgg 4440cacagcgagg ccggcccggt agaaatcatc tttttgatgc tggcagtcag ggcgggggac 4500tacatcgtcg ccgtgactgc ggttctcgcg gtcgggttct tcgcggtcgc ggttgagggt 4560ttctggttcc tggtcgtcgc agtcatcgct gcaccggcgt ggtggtttct gcgcgactgg 4620gaatcgaagc ggagggccgt acgggtcttt gaacgggcat ggaaggggac acctgaatcc 4680cccggtattg ctctctccct tggcctgtcg aacgtggcgg ggtctctgcc gaggttgagg 4740aagtttgaaa ctggttcggg gatacgcaca ctcgtgtttt ctttgccgcc cggagtcact 4800gccgagagct ttgagaaagt tcgccctgcg ctggcagacg cgatgggggg tcaccgctgc 4860caagtagaga aggtggcccc cggacaggtc cgcgtcagag tgattgatga ggattcgatg 4920aagacgccgc gtgatgcggg atgggcgaaa gatgttgtgc tggaagagga tacgttcgac 4980ggtcttccgg gcgagacgcg atcctggttc gagcaagagg ggccggcatc atgagaaaat 5040cggcgggagt atctcggatt cctatccgtc tcgggcgctc tcagtacggg gaagacgttg 5100gattcgatct cgctgcggac gccgctcaca tcgccatgca gggcaaaacc cgatccggca 5160aaagtcaggc gacgtacaac gtgttagctc aggcagcagc gaacgcggcg gttcgagtcg 5220tagggtccga cccgacacac gtactcctgg agcccttcaa acatcgaggg gtgtccgagc 5280cttacgtggt ttcgggactg aatgcgcagg ccacggtgga catgctgggc tgggtcaagc 5340gtgagtctga tcgtcgcatc gaccagatgt ggcccctgcg taccgacaag ttttccgagt 5400tcggggcttc gttcccgctg atactcgtcg tgctcgaaga gttt 5444745813DNARhodococcus rhodniimisc_featurepRET1000 Full Length 74ggatccgcgc cgaacagtcc gcggacacga agtggaagca ggtgaacgag gcgttgacca 60gccaagaccg ggccgaggag caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg 120gaggccaccc gacaccgcta cgggacatgc tcgccgccca ggagcagcgc cggaagccgt 180ggactccgga gcagaaacgc cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga 240aggccaagga cgccgcgaaa tggaccgagg tcgccggcgg cggctaccag cgggacgtgc 300gcgggatgaa cctgcgactg tgggtggctg aggacggcgc ctggtcgatc acctcgaaga 
360aggaccccga ccgccagtac gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg 420cggccacggc cacagcgaaa acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca 480agcgcaccga gtcagccacc agagcggtcc ggcgcgtgat cgcggatctc acccccacca 540aacccgccga ggtcaaaccc ccggcccgcc gccagggacc aaccatgccg cagtcggccc 600cggggtatca gccacccggc cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag 660agagcgagaa ggctttcgtg gagcgtaggg aacagacgca ggcctggcga agcatgtcca 720agaacaccat cgatcgctag aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa 780cgcgcgaaag gtgcatacct cccatagcat cggcgcgtat ggtagggaaa atgatcttca 840aacgtattgc tgtggtcgtg ctcgctggtg gggctttggt agtgggaggc agccaggttg 900ctggtgctac cacggtttca gctccacagc cgagtccttc agcagcggtg gtgccgacgg 960ttcttccacc agtcactttc accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg 1020attcccggcg atgccgtctg attccacttc cacagggccg agcgatctgc tgggcggcag 1080ccgctgcccg ttacgcagcg tgccgcgccg gaaactaggt agaacgtgag catggacgag 1140cttcccacct tcatcgccga cgacatcgtg atggccagaa cgttcgacag ccctaacggc 1200caggtggtgc tcgaggtgaa cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac 1260tactgctgca ccttccggat cagcgggaac atggatgccc cttacgacgg attcggtggc 1320ggcgtcgacg cagtgcaggc gctgctactc gcattggcca tggcacacga ggaacttcgt 1380caaacttcgc cagagttgac gtttctaggc gagacgaacc tcggtctacc ggtcttgaac 1440atcaagcccg acaacgcgat cgaagccgtg gtctcattcc ccgctccctg atgtgacgca 1500ctttcacccc tggcactcat gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg 1560cttcgcgttg acttgccact gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg 1620gttcatgaca ccgctaacac gctgcggaaa tgcggattga actagttcat ttggggaacg 1680atgacctgat gaccggggat cgtgacctac ccatgctgac catcgccgag gcggtggacg 1740cgacgcagac cagtgagagc acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg 1800cggtccgcac tgccgacggg aagtggatga ttcccctcgg tgacctatca gcggcagggc 1860tgagaccagg gaaaatggcg aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg 1920acctggcagc tgagaacgcc gagctccgtc agcgcctggc cgtggccgaa gccctggcca 1980gcgaacgcaa tcggatcatc gacgtgcagc aacagatgct ccggatgctc gaagcccggc 2040cggtgtcggc cctggagccc gcggcggttc cagtggcggg 
tccgccgccg cccgtcccgg 2100ccgccgatgg tcgggcagct acgggcgccc tggcccggat acgtcgacgg cttctcggct 2160aggagctgac cgcgtacttg cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga 2220ttcccatctc ctcggcgatc tcgcggtact tcaggccctg ctcgcgcagc tcgacggccc 2280ggcgacggtt ctcggctgcc cgtgcgagga actggtcccg cggctcggcc atgatgcgct 2340ggatcgtgcg cgtggaggcc cccatcttct cggccagctc gcgagctgtc tgcttgcggc 2400ggatcggtcg ttcagcgccc acggtctgcc tcccacaatg cgttccggtc gaccttcgtc 2460gctcgtttcc ggtttgcctc gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt 2520atggcgatga atgtggcctc gtagacagca gggccgtcgg cccacatccg ggactttgta 2580gtgatccagc gggtaatgga ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg 2640ggcctgcctg tgaccgggtt cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc 2700gcggccccga ggccggtcgg gtcgccccag tggaagcgga tttcgcggta ggcccaggtg 2760cgtgcggttt cgaacagggc gcagtttcgg ccgaggccga tcgggttctc acggcgcgat 2820cgggtttgcc gccagcgcgt tggcggcatg tggatgccga gttccgcctc gagctcggcg 2880agggatcgcc gctcggtgtg cagccaatgg gtgtcccagt caccgtgagt cgggttcttg 2940gtcatcaggc ccgaatagcc cttgtccccc tggacggcgc gccggaggcc ttcggtgacg 3000gcggccgcat aggcgagcgg cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc 3060agcgcccaca cagcgtgtgc gtgcccgtta cgggggttct ccacgatcgc gttcggcaga 3120ggatgattcc cggccgccga cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg 3180acgagcaggt tgctcaatgc ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct 3240gatcgccgca tccggtagac gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt 3300agccacagct gttcccaggc gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca 3360ctcacacaca agatcgattt attccggtac gacacgccag ccaagtcaga tgtttcggtt 3420tctggagcgg tcctccagac ctttgagatc cgctccagaa acgtccacaa attattgggg 3480tacgtcgaac caagccttat caggtatccc ggggttccgg gggtgaacac caccctccga 3540ccggtccaga atccgtcgat ctcacctatc cgctcgaagt ccttgagtca gtgacaggac 3600cactgctggg ctcccagcgc agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg 3660ggtacggcat gaggtccttc agaagcggcg tcgacgccag gcccacacgc acaatccgct 3720tcccacgagg gacaccaccg gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg 3780gtactgctcg 
tttgacagga actgcagggt cggtgagctc gcgctgggcg gatcccacca 3840gtagctcccc gtgccggtaa ccgcttgggg ccaagcgaag acacccaccg cggcagcgat 3900ggcaatgcac gtggatggga acaccaccca gaaccaggga aatcctggtg ccggcccgag 3960acgatcccgg cgcggtaaga ccacaccggc caccatcgcc acggcccccg acgcaacaag 4020caataaccac cccatgagcg gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc 4080gccagcccgt gaccggaccg gcgaagcgtg ccgctgggcg gcccgccgtg gcgcccgtcc 4140cgtgcccgtt ctgaccggtg gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca 4200gcccgtgacc gtgccgtcca ccacccggtg cctggtctgc gtctccctcg gctcgttcct 4260cgcctatcct ggtgaccaga caccggagcg agctatgccc agggttgcgc agtgacttcg 4320tcactgcgta accctgggcg ctcgcctccc attcgcttcg ctcacaggag ggggccgtcg 4380atggccgctg acgctgcatc tgacgaccgg cggaccgagg tccgcgccgc tgcttcgcgg 4440gccgctgacg cggccccggc gaagcgcacc cgcaccgtgg cggtgcggct gaccgatggg 4500gaggaggccg cgtggatcga cgccgcgctg gccgatggcc accggcagct cggggcgtgg 4560gtgcgtgagc gggcggtggc cggctatctc gggaaggtcc gcccgaagac cggcagtgga 4620atgtcggcgg aggcggccgc ggaggtcgcc gcgatgcggc agcagatgac gaaggtgggg 4680aacaacctga accagatcgc gagggcgatc aacgccgggc aggtgccgtc gcagatggcc 4740gagtccctgc agaaggggtg gctggagagg tgggggcagg agttggggcg gatggcggat 4800cggctcgacg cgctcgacga ccagggctga cgtgatcgcg aagatcagca cgggcagcga 4860cccgaagggg ttggcggcgt atctgcacgg gccggggaag gccaccccgc acagctaccg 4920caccgaggcg ggccggctga ttgccggcgg gacggtgatc gcgggatcgg tgcaggtcac 4980cgccaaaaac ccgacccggt gggggcggga cttcgagcgg gccgccgcga cgaacgcgcg 5040ggtgggtaag ccggtgtggc attgctcgct gcggtgcgcg cccggggatc ggcggctgac 5100cgataccgag ttcgcggaca tcgcgcagac ggtcgccgag cggatgggct tcgagagtca 5160tccgtgggtg gcggtgcggc acgacgacga ccacatccac ctggctgtct cccgggtcga 5220ttttcagggc gtgacctgga agaacagcaa cgaccggtgg aaggtcgtcg aggtgatgcg 5280cgaggtcgaa cgcgcgcacg gcctgatcga ggtggcgagc ccggagcggg cccgtggccg 5340gcaagccagc agcggcgagc aacgccgcgc ggtgcggacc ggcaaggtgg cgcagcggga 5400cggtctgagg gaaattgtga ccgccgcccg cgacatcgcc gcaggccagg gtgtgggggc 5460gttcgaagtg gcgctcgtac agaacccgat tacccgagtg 
caggtgcggc gcaacgtcgc 5520gaagacgggc cggatgaatg gctacagctt caacctgccc ggctacgtcg acgccgccgg 5580ggagccgatc tggttgccgg cctccaaact cgaccggggt ttgtcctggt cacagctgga 5640aaagacgctg accagacccc gcccggaccg cctcgccggc gaggagacgg tgccgcggaa 5700gcggctcgag cgcgccgccg cgtgggagca gcgccgccgc gaggtcggcg gcgagcagtt 5760cgcagctgcc cgctgggagc aggcccgcgc gaatgttggt gagacggccg ggc 58137580DNARhodococcus rhodniimisc_feature4260bp to 4339bp pRET1000 75tcgcctatcc tggtgaccag acaccggagc gagctatgcc cagggttgcg cagtgacttc 60gtcactgcgt aaccctgggc 8076108DNARhodococcus erythropolismisc_feature761bp to 868bp pRET1100 76tcacagagac agcgaaaacg cctcccctgg gtcaatccgg ttaccgccgg actgggggag 60gcgcttcgac acctacatcc gtcgcccctc gaaaggctca gatgcact 10877556DNARhodococcus erythropolis 77gaagcaacac cgcatccgcc cattgccgat cgctcagcac gccccccgtt gcggatttca 60tggggcaact gtgcccgccc acatcaacta ttcgagtccg acgcgccgag gctatatgga 120aaattattcg actacgcaaa acaaagccat atcaggtatc ccggcgacac cccccaaaac 180ctcctcccca ccaacccctg ctttttgaac cttgccgcgc tggatcgttc gatttcttct 240ggaaccctgc gagcggaaag ccacggtcgg caccttggtg caagaggtgt gctcgggttg 300ggctttgcgt cggtggatgg tgagcacagg cgggtgagta cggcggtact cccgggagct 360gcttcgagct gcgggaggta ggtcgggtac ggcgcgcaga gcggaagcgt ggtcggtggt 420tgttcactct tctgctcggc cgaatcgagc gccggccgaa tcgagcgccg gccgaatcga 480gcgccggccg aatcgagcgc cggccgaatc gagcgccggc cgaatcgtta gtgcggtgtg 540cgtgcgtggt ggtcga 55678259PRTRhodococcus erythropolis 78Met Phe Asn Ser Ile Glu Gly Arg Ser Val Val Val Thr Gly Gly Ser 1 5 10 15Lys Gly Ile Gly Leu Gly Met Val Arg Val Phe Ala Arg Ala Gly Ala 20 25 30Asn Val Leu Met Thr Ala Arg Asp Ala Leu Thr Leu Glu Arg Ala Ala 35 40 45Glu Gly Leu Asn Gly Leu Pro Gly Ala Val Ser Thr Leu Gln Val Asp 50 55 60Val Thr Asn Pro Asp Ser Leu Ala Gly Met Ala Glu Val Ala Ala Glu 65 70 75 80Arg His Gly Gly Ile Asp Val Leu Cys Ala Asn Ala Gly Ile Phe Pro 85 90 95Ser Lys Arg Leu Gly Glu Met Thr Ser Glu Asp Met Asp Ser Val Phe 100 105 110Gly Val Asn Val Lys Gly Thr Ile His Ala Val Gln Ala Cys 
Met Pro 115 120 125Trp Leu Glu Thr Ser Gly Arg Gly Arg Val Val Val Thr Ser Ser Ile 130 135 140Thr Gly Pro Val Thr Gly Tyr Pro Gly Trp Ser His Tyr Gly Ala Ser145 150 155 160Lys Ala Ala Gln Met Gly Phe Ile Arg Thr Ala Ala Ile Glu Leu Ala 165 170 175Pro Lys Arg Ile Thr Ile Asn Ala Val Leu Pro Gly Asn Val Ile Thr 180 185 190Glu Gly Leu Asp Gly Leu Gly Gln Glu Tyr Leu Asp Gln Met Ala Ser 195 200 205Ser Val Pro Ala Gly Ser Leu Gly Ser Val Glu Asp Ile Ala Asn Ala 210 215 220Ala Leu Phe Phe Ala Leu Asp Glu Ala Ala Tyr Ile Thr Gly Gln Ser225 230 235 240Leu Ile Val Asp Gly Gly Gln Val Leu Pro Glu Ser Ala Met Ala Leu 245 250 255Gly Glu Leu79780DNARhodococcus erythropolis 79atgttcaact ccattgaagg tcgttcggtc gtcgtcaccg gcggtagcaa gggcatcggc 60ttgggaatgg tccgggtatt cgcgcgcgca ggggccaatg tgctcatgac cgcgcgagac 120gctctgactc tcgaacgtgc cgcggagggt ttgaatggtc ttcctggcgc ggtctccaca 180cttcaagtcg acgtcacgaa tcctgactcc ttggccggta tggcagaagt tgcggccgag 240cgacacggag gaatcgacgt gttgtgcgcg aacgctggga tcttcccgtc gaagcggttg 300ggagagatga cctcggagga catggacagc gtattcggcg tcaacgtcaa ggggaccatc 360cacgccgtgc aagcgtgcat gccgtggctc gaaacttctg ggcgtggaag ggttgtcgtg 420acatcgtcga tcaccggacc cgtaaccggt tatccgggtt ggtcgcacta cggggcaagc 480aaggctgcgc agatgggctt catccgaact gctgccattg agttggcacc gaagaggatc 540acgatcaacg ccgtcttgcc cggcaacgtg atcaccgagg ggctcgacgg tttgggacag 600gaatatctcg accaaatggc gtccagcgtc ccggccggca gtctgggcag cgtcgaggat 660atcgccaatg ccgcactgtt ctttgcactg gacgaagccg cgtacatcac cggtcagtcg 720ttgatcgtag atggtggaca ggttcttccg gagtcggcga tggcgctcgg cgaactgtaa 7808028DNAArtificialprimer (MAK F1) 80gaatcttctc gttgatgcag atcaggtc 288126DNAArtificialprimer (MAK R2) 81ctgactccgt agtgttctgc cagttc 268234DNAArtificialprimer (MAK Pst F) 82gaccactgca gatcaatcaa ctctgatgag gtcc 348334DNAArtificialprimer (MAK His Bgl II R) 83cgcttagatc tcagttcgcc gagcgccatc gccg 348435DNAArtificialprimer (P1200rep-Pst5195) 84agccgctgca gaagcaacac cgcatccgcc cattg 358524DNAArtificialprimer (P7) 85cgccagggtt 
ttcccagtca cgac 248630DNAArtificialprimer (pQE70 F1) 86ggcgtatcac gaggcccttt cgtcttcacc 308735DNAArtificialprimer (pQE70 R1135Bm) 87ggttggatcc gtcatcaccg aaacgcgcga ggcag 358834DNAArtificialprimer ( P1204rep-Ec2958) 88cgcggaattc gaccaccacg cacgcacacc gcac 34898134DNAArtificialpRET1101 89gggtaccgag ctcgaattcg taatcatggt catagctgtt tcctgtgtga aattgttatc 60cgctcacaat tccacacaac atacgagccg gaagcataaa gtgtaaagcc tggggtgcct 120aatgagtgag ctaactcaca ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa 180acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta 240ttgggcgctc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc 300gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg 360caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt 420tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa 480gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct 540ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc 600cttcgggaag cgtggcgctt tctcatagct cacgctgtag gtatctcagt tcggtgtagg 660tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct 720tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag 780cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga 840agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga 900agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg 960gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag 1020aagatccttt gatcttttct acggggtctg acgctcagtg gaacgaaaac tcacgttaag 1080ggattttggt catgagatta tcaaaaagga tcttcaccta gatcctttta aattaaaaat 1140gaagttttaa atcaatctaa agtatatatg agtaaacttg gtctgacagt taccaatgct 1200taatcagtga ggcacctatc tcagcgatct gtctatttcg ttcatccata gttgcctgac 1260tccccgtcgt gtagataact acgatacggg agggcttacc atctggcccc agtgctgcaa 1320tgataccgcg agacccacgc tcaccggctc cagatttatc agcaataaac cagccagccg 1380gaagggccga gcgcagaagt ggtcctgcaa ctttatccgc ctccatccag tctattaatt 1440gttgccggga agctagagta agtagttcgc cagttaatag 
tttgcgcaac gttgttgcca 1500ttgctacagg catcgtggtg tcacgctcgt cgtttggtat ggcttcattc agctccggtt 1560cccaacgatc aaggcgagtt acatgatccc ccatgttgtg caaaaaagcg gttagctcct 1620tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt gttatcactc atggttatgg 1680cagcactgca taattctctt actgtcatgc catccgtaag atgcttttct gtgactggtg 1740agtactcaac caagtcattc tgagaatagt gtatgcggcg accgagttgc tcttgcccgg 1800cgtcaatacg ggataatacc gcgccacata gcagaacttt aaaagtgctc atcattggaa 1860aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct gttgagatcc agttcgatgt 1920aacccactcg tgcacccaac tgatcttcag catcttttac tttcaccagc gtttctgggt 1980gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt 2040gaatactcat actcttcctt tttcaatatt attgaagcat ttatcagggt tattgtctca 2100tgagcggata catatttgaa tgtatttaga aaaataaaca aataggggtt ccgcgcacat 2160ttccccgaaa agtgccacct gacgtctaag aaaccattat tatcatgaca ttaacctata 2220aaaataggcg tatcacgagg ccctttcgtc tcgcgcgttt cggtgatgac ggtgaaaacc 2280tctgacacat gcagctcccg gagacggtca cagcttgtct gtaagcggat gccgggagca 2340gacaagcccg tcagggcgcg tcagcgggtg ttggcgggtg tcggggctgg cttaactatg 2400cggcatcaga gcagattgta ctgagagtgc accatatgcg gtgtgaaata ccgcacagat 2460gcgtaaggag aaaataccgc atcaggcgcc attcgccatt caggctgcgc aactgttggg 2520aagggcgatc ggtgcgggcc tcttcgctat tacgccagct ggcgaaaggg ggatgtgctg 2580caaggcgatt aagttgggta acgccagggt tttcccagtc acgacgttgt aaaacgacgg 2640ccagtgccaa gcttgcatgc ctgcaggtcg actctagagg atcccctgca cagaacccgt 2700actcgatccg ctcgttcgtc gctgtgaggt ctttcctgca aggcaaatcg ttgtgccgga 2760aggggtttcg tgatgtttct ccgagcgttt tttcgttcca agttggtcat ggtggctctt 2820gtcctggtcg ctggcctgtt tctctacaac gcctgctctt cttctgacgc aaaggaagag 2880atcggcagca gtctgaatct ctctcctgtc actgctcgtt cgaatccgta tgagggcgtc 2940cagcccacga tgagcgaaaa aagccctgtt cccgtccctg tcgtttccgg cgacaggatt 3000tcgggggtgg catcgtgcgg gacggattac gccgggaagc ctgcggtgac gctggaagct 3060gtgtggattt cgtccgactc ggtgaactac acactcgata agaggcattg cctggtgacg 3120accggcccgc tgtggaaaca agcgatccgt aaagcgtcag ggtcagagat tcggcctgag 3180ggcgggagct 
ggatacgggt ggtgcttgcc atgcctgacg gcaatttcag ggcaggatgg 3240gcaccccacg cccaagtaac cgctggtgcg ctggatattt cggcggtggt ctcgtgagcg 3300gggagaagcg gcacagcgag gccggcccgg tagaaatcat ctttttgatg ctggcagtca 3360gggcggggga ctacatcgtc gccgtgactg cggttctcgc ggtcgggttc ttcgcggtcg 3420cggttgaggg tttctggttc ctggtcgtcg cagtcatcgc tgcaccggcg tggtggtttc 3480tgcgcgactg ggaatcgaag cggagggccg tacgggtctt tgaacgggca tggaagggga 3540cacctgaatc ccccggtatt gctctctccc
ttggcctgtc gaacgtggcg gggtctctgc 3600cgaggttgag gaagtttgaa actggttcgg ggatacgcac actcgtgttt tctttgccgc 3660ccggagtcac tgccgagagc tttgagaaag ttcgccctgc gctggcagac gcgatggggg 3720gtcaccgctg ccaagtagag aaggtggccc ccggacaggt ccgcgtcaga gtgattgatg 3780aggattcgat gaagacgccg cgtgatgcgg gatgggcgaa agatgttgtg ctggaagagg 3840atacgttcga cggtcttccg ggcgagacgc gatcctggtt cgagcaagag gggccggcat 3900catgagaaaa tcggcgggag tatctcggat tcctatccgt ctcgggcgct ctcagtacgg 3960ggaagacgtt ggattcgatc tcgctgcgga cgccgctcac atcgccatgc agggcaaaac 4020ccgatccggc aaaagtcagg cgacgtacaa cgtgttagct caggcagcag cgaacgcggc 4080ggttcgagtc gtagggtccg acccgacaca cgtactcctg gagcccttca aacatcgagg 4140ggtgtccgag ccttacgtgg tttcgggact gaatgcgcag gccacggtgg acatgctggg 4200ctgggtcaag cgtgagtctg atcgtcgcat cgaccagatg tggcccctgc gtaccgacaa 4260gttttccgag ttcggggctt cgttcccgct gatactcgtc gtgctcgaag agtttcccgg 4320gatcctcgag ggggcagcgg acgaagacgc cgcgttaggc cgaaaacctg ccgagcgtct 4380cgcaccccgc atttcggcct acgtgcgtca gatagcagcg cagtcggcaa aggctggaat 4440tcgccttctc ctgctctcgc aacgagcgga ggcctcgatc attggcggca atgcgcgttc 4500gaatttcggg gtcaagatga ctctgagggt ggacgaaccg gagtcggtga gaatgcttca 4560tccgagcgct tccccggaag actgtgccct ggtcgagacc ttcaagcctg gtacctgcct 4620tttcgagaag ccaggagaag gccggcagat tatgcgatgc gactttgtcg gcgagtacgg 4680gagatatgcg cgagccatcg agtcttcgga tctgcgtttt ctcgccaccc tccagcaaga 4740ccaggcccaa cgcgaattct tcgctgagga gttcggtgtg gtggatccgt catgactgga 4800ccacaggaga gaaagcgcaa ggcggcgaag ccgtcgcggg agcctcagtt gaactgctgt 4860gaagcggacg tgccgaaacg agcaaaacag cccccggttc cctctacgtt cgacctgctc 4920acggtgaagg agactgcggg gctgctgaga gtcagtcagg caactcttta ccggctgctt 4980cggagtgggg aaggacccac atacacacgg atcggtggac agatacgcgt tcaccgcgag 5040tcgctgcgtc ggttcatcga accgcgtgga taacgtcaca gagacagcga aaacgcctcc 5100cctgggtcaa tccggttacc gccggactgg gggaggcgct tcgacaccta catccgtcgc 5160ccctcgaaag gctcagatgc acttccacga taacgcagag gtcggacaag agggaagaac 5220tgccgttctc tcgccgttgc gcggcgtagc cgccaagcgg gacgtgtctg acgatgcagc 
5280gaagcggagt cggcaggcgc ggcacgcgcc tgggcttgtt acatctgcca caactgtccg 5340tgaatctctg ccagctcctg aaaccgctgg tcagggcctt gcggaatccg tgaccgctga 5400tgatttttgg tctcattcgt tcccccgcgc tgacgatgta cgcggcgcag ctgcttcctt 5460ccagtcggtg gctaactggg atgggcgtga gggtccgagg ccgcgtttcg ttgtcgcgcc 5520tggcgttgtc cgcttggagg tttgtgatct cgcacgccgc gaacgaacgg ctgaacgtgc 5580gtatctggct gctcgggctc gggtggatat ggcggctgcc aggcataact cgccgtacga 5640cttcgacgtg gacgatgaag agttggcgga actggcttct ctgcaaggcc tcgaggacga 5700cgacattggg ggctggtctg cggagaggga aatagtgggc tggtctgctc gttctcggtc 5760acggatgatc ttgcgaatgg cagaactcga ctgggctccc atgatggatt tgccgggcat 5820tcctgcgatg gtgaccctca cctatccggg ggactggctt acggttgccc ccaccggcgc 5880tgaggtcaaa aaacatctcc agacgttctt caaacggttc caacgggcct ggggcattgc 5940ctggatgggt gcgtggaaaa tggagttcca aagccgaggc gctccgcatt ttcacctgta 6000catggtccct cctcatggga aggcaggaga ctcgcggaag ctgcggcatg atgctgagct 6060cttgaaatgg gagatagcac gtgcagaggg tgaagaccca ggtcgcaggc cgtatttccg 6120ggaagctcca agcgatggat tgaagtttcg tccgtggctt tctgcggtgt gggccgacgt 6180cgtagatcat ccggacccca aggaaaaaga aaagcacgtc agtgccggca ctggagtgga 6240ctacgcggag ggcacgcgag ggtcagatcc gaaaaggctt gcggtgtact tctccaagca 6300tggaaccttt gccgacaagg aatatcagca cgtagttcct gctcaatggc agaaaacggg 6360tgcgggacct ggcaggttct ggggctaccg cggtttgtcg ccggccacgg ctgccaccga 6420gatttcctgg gatgagtacc tgcttttatc tcgcacgttg cgacgattgt cagcgcgaac 6480gaagatctgg gacccggctt tacgaggcgg tagcggcggc cacagatgga ctaaggcgat 6540gatgcgacgc acggttaccc ggcaccgctt ggacctcgtg accggtgaga ttctgggcac 6600gaagacgcgg aaggttcggg cgccagtgaa gaggtttgtc cggacttcgg gatacctgtg 6660tgtcaatgac gggcccgcac tggctcgaac cctcagccgt cttcgtacaa gctgcctgag 6720ctagacgcgc ggaacgcctt tcggctttgt cttttgctgg atggcgggtt ttgggcggct 6780tctggtgatg cgctgctgcg ctccgtgggg agagagaccc aacgactgac ctatctctac 6840ccaggtgcaa ttcatctccc gcgctctgtc ggctaggtaa acgaggtgct cccgcgcgag 6900cttttccatg tggtcggcca atgtcagctc ggtcaggaca acctgctgtt gttgcgatag 6960ttgtgtccgc acgggtcgat tgtcttctgt 
tgcggcataa cggttttcgt cgttcgcgga 7020gagtgcggct aaatgaattg catcctcgat tgagcggagc atttcgacgc ggaacctggc 7080gatgatgttg tctctgtctt cattcataac tgaagcgtat tgggagtgtt gccctcccac 7140catgtgtgcc aatgcaggtg tgaactgagt cacagtttct caatagactc caagtttgtg 7200atccttttac tcccaaaatg gggcatgatg tgtgcgtgcc tcggttcagg ggcgaaagtt 7260cgacacctcg aaagaaggcc tcgacatggc tttgaaagct gctggcaacg tgattcctga 7320ttcctccgcg tacgagtacc gggcggttca ggtcgagccg aagatggtca gaaaagaccc 7380ggaagacccg aactctgagc agttccagaa gcagaaggac ggcacgccgg tgtggtcgat 7440cgactgcatt cgggtcgacc gggcatcagg caacaaggca atcgtgaccg tgacggttcc 7500ggacgtgatg gaaccggatg ttgcggggcc ggtggagttc tccgagatga ttgccggttt 7560ctgggtttcg cgcagtggtt cgggcatgtg gttttcggca agcgccgtcg cttctctctg 7620atcgctgatc gtcgcccctc gaaaggttcg gaaatgtcca aaggaaaagg cgttgcgctg 7680ggtgtgggtg ccctcgtgct cgtgtttgtg ctggttgcgg caggttggca agcggcgaac 7740gtgttcagtg atcgttcaca gtccgaagct gtgccgctga gagtgccggc cgatccgaag 7800tgggaaaacg gggtgttctc ggacgttgcc gggtgcctcg ttctctctcc ggaagagctg 7860gggccgttca gcggagggca gtacatcgac atagtgaggc cagttgagcc ggagaggttg 7920gagcgcgact gggtgaggtc ggctgagtgc gtttcggcgt cgatgaatgt ctctgacctg 7980ttggtttctg ctcttccaga gtccacccgt ccccccggcg atttcgttcg ttcgtggaaa 8040gtggcgagtg atgattactg ctatgagggt gataacccgc aaggctgcac ttctcgtatg 8100ccggtttggg tctctgcaaa aaactggtgg tgca 8134908124DNAArtificialpRET1102 90gacctgcagg catgcaagct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg 60ttatccgctc acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg 120tgcctaatga gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc 180gggaaacctg tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt 240gcgtattggc gaacttttgc tgagttgaag gatcagatca cgcatcttcc cgacaacgca 300gaccgttccg tggcaaagca aaagttcaaa atcagtaacc gtcagtgccg ataagttcaa 360agttaaacct ggtgttgata ccaacattga aacgctgatc gaaaacgcgc tgaaaaacgc 420tgctgaatgt gcgagcttct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 480ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 540gggataacgc 
aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 600aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc 660gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 720ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 780cctttctccc ttcgggaagc gtggcgcttt ctcaatgctc acgctgtagg tatctcagtt 840cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 900gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 960cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag 1020agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg 1080ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 1140ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 1200gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgatccgt 1260cgagaggtct gcctcgtgaa gaaggtgttg ctgactcata ccaggcctga atcgccccat 1320catccagcca gaaagtgagg gagccacggt tgatgagagc tttgttgtag gtggaccagt 1380tggtgatttt gaacttttgc tttgccacgg aacggtctgc gttgtcggga agatgcgtga 1440tctgatcctt caactcagca aaagttcgat ttattcaaca aagccacgtt gtgtctcaaa 1500atctctgatg ttacattgca caagataaaa atatatcatc atgaacaata aaactgtctg 1560cttacataaa cagtaataca aggggtgtta tgagccatat tcaacgggaa acgtcttgct 1620cgaagccgcg attaaattcc aacatggatg ctgatttata tgggtataaa tgggctcgcg 1680ataatgtcgg gcaatcaggt gcgacaatct atcgattgta tgggaagccc gatgcgccag 1740agttgtttct gaaacatggc aaaggtagcg ttgccaatga tgttacagat gagatggtca 1800gactaaactg gctgacggaa tttatgcctc ttccgaccat caagcatttt atccgtactc 1860ctgatgatgc atggttactc accactgcga tccccgggaa aacagcattc caggtattag 1920aagaatatcc tgattcaggt gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt 1980tgcattcgat tcctgtttgt aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc 2040aggcgcaatc acgaatgaat aacggtttgg ttgatgcgag tgattttgat gacgagcgta 2100atggctggcc tgttgaacaa gtctggaaag aaatgcataa gcttttgcca ttctcaccgg 2160attcagtcgt cactcatggt gatttctcac ttgataacct tatttttgac gaggggaaat 2220taataggttg tattgatgtt ggacgagtcg gaatcgcaga ccgataccag 
gatcttgcca 2280tcctatggaa ctgcctcggt gagttttctc cttcattaca gaaacggctt tttcaaaaat 2340atggtattga taatcctgat atgaataaat tgcagtttca tttgatgctc gatgagtttt 2400tctaatcaga attggttaat tggttgtaac actggcagag cattacgctg acttgacggg 2460acggcggctt tgttgaataa atcgcattcg ccattcaggc tgcgcaactg ttgggaaggg 2520cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg 2580cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt 2640gaattcgagc tcggtacccg gggatcctct agagtctgca cagaacccgt actcgatccg 2700ctcgttcgtc gctgtgaggt ctttcctgca aggcaaatcg ttgtgccgga aggggtttcg 2760tgatgtttct ccgagcgttt tttcgttcca agttggtcat ggtggctctt gtcctggtcg 2820ctggcctgtt tctctacaac gcctgctctt cttctgacgc aaaggaagag atcggcagca 2880gtctgaatct ctctcctgtc actgctcgtt cgaatccgta tgagggcgtc cagcccacga 2940tgagcgaaaa aagccctgtt cccgtccctg tcgtttccgg cgacaggatt tcgggggtgg 3000catcgtgcgg gacggattac gccgggaagc ctgcggtgac gctggaagct gtgtggattt 3060cgtccgactc ggtgaactac acactcgata agaggcattg cctggtgacg accggcccgc 3120tgtggaaaca agcgatccgt aaagcgtcag ggtcagagat tcggcctgag ggcgggagct 3180ggatacgggt ggtgcttgcc atgcctgacg gcaatttcag ggcaggatgg gcaccccacg 3240cccaagtaac cgctggtgcg ctggatattt cggcggtggt ctcgtgagcg gggagaagcg 3300gcacagcgag gccggcccgg tagaaatcat ctttttgatg ctggcagtca gggcggggga 3360ctacatcgtc gccgtgactg cggttctcgc ggtcgggttc ttcgcggtcg cggttgaggg 3420tttctggttc ctggtcgtcg cagtcatcgc tgcaccggcg tggtggtttc tgcgcgactg 3480ggaatcgaag cggagggccg tacgggtctt tgaacgggca tggaagggga cacctgaatc 3540ccccggtatt gctctctccc ttggcctgtc gaacgtggcg gggtctctgc cgaggttgag 3600gaagtttgaa actggttcgg ggatacgcac actcgtgttt tctttgccgc ccggagtcac 3660tgccgagagc tttgagaaag ttcgccctgc gctggcagac gcgatggggg gtcaccgctg 3720ccaagtagag aaggtggccc ccggacaggt ccgcgtcaga gtgattgatg aggattcgat 3780gaagacgccg cgtgatgcgg gatgggcgaa agatgttgtg ctggaagagg atacgttcga 3840cggtcttccg ggcgagacgc gatcctggtt cgagcaagag gggccggcat catgagaaaa 3900tcggcgggag tatctcggat tcctatccgt ctcgggcgct ctcagtacgg ggaagacgtt 3960ggattcgatc tcgctgcgga 
cgccgctcac atcgccatgc agggcaaaac ccgatccggc 4020aaaagtcagg cgacgtacaa cgtgttagct caggcagcag cgaacgcggc ggttcgagtc 4080gtagggtccg acccgacaca cgtactcctg gagcccttca aacatcgagg ggtgtccgag 4140ccttacgtgg tttcgggact gaatgcgcag gccacggtgg acatgctggg ctgggtcaag 4200cgtgagtctg atcgtcgcat cgaccagatg tggcccctgc gtaccgacaa gttttccgag 4260ttcggggctt cgttcccgct gatactcgtc gtgctcgaag agtttcccgg gatcctcgag 4320ggggcagcgg acgaagacgc cgcgttaggc cgaaaacctg ccgagcgtct cgcaccccgc 4380atttcggcct acgtgcgtca gatagcagcg cagtcggcaa aggctggaat tcgccttctc 4440ctgctctcgc aacgagcgga ggcctcgatc attggcggca atgcgcgttc gaatttcggg 4500gtcaagatga ctctgagggt ggacgaaccg gagtcggtga gaatgcttca tccgagcgct 4560tccccggaag actgtgccct ggtcgagacc ttcaagcctg gtacctgcct tttcgagaag 4620ccaggagaag gccggcagat tatgcgatgc gactttgtcg gcgagtacgg gagatatgcg 4680cgagccatcg agtcttcgga tctgcgtttt ctcgccaccc tccagcaaga ccaggcccaa 4740cgcgaattct tcgctgagga gttcggtgtg gtggatccgt catgactgga ccacaggaga 4800gaaagcgcaa ggcggcgaag ccgtcgcggg agcctcagtt gaactgctgt gaagcggacg 4860tgccgaaacg agcaaaacag cccccggttc cctctacgtt cgacctgctc acggtgaagg 4920agactgcggg gctgctgaga gtcagtcagg caactcttta ccggctgctt cggagtgggg 4980aaggacccac atacacacgg atcggtggac agatacgcgt tcaccgcgag tcgctgcgtc 5040ggttcatcga accgcgtgga taacgtcaca gagacagcga aaacgcctcc cctgggtcaa 5100tccggttacc gccggactgg gggaggcgct tcgacaccta catccgtcgc ccctcgaaag 5160gctcagatgc acttccacga taacgcagag gtcggacaag agggaagaac tgccgttctc 5220tcgccgttgc gcggcgtagc cgccaagcgg gacgtgtctg acgatgcagc gaagcggagt 5280cggcaggcgc ggcacgcgcc tgggcttgtt acatctgcca caactgtccg tgaatctctg 5340ccagctcctg aaaccgctgg tcagggcctt gcggaatccg tgaccgctga tgatttttgg 5400tctcattcgt tcccccgcgc tgacgatgta cgcggcgcag ctgcttcctt ccagtcggtg 5460gctaactggg atgggcgtga gggtccgagg ccgcgtttcg ttgtcgcgcc tggcgttgtc 5520cgcttggagg tttgtgatct cgcacgccgc gaacgaacgg ctgaacgtgc gtatctggct 5580gctcgggctc gggtggatat ggcggctgcc aggcataact cgccgtacga cttcgacgtg 5640gacgatgaag agttggcgga actggcttct ctgcaaggcc tcgaggacga 
cgacattggg 5700ggctggtctg cggagaggga aatagtgggc tggtctgctc gttctcggtc acggatgatc 5760ttgcgaatgg cagaactcga ctgggctccc atgatggatt tgccgggcat tcctgcgatg 5820gtgaccctca cctatccggg ggactggctt acggttgccc ccaccggcgc tgaggtcaaa 5880aaacatctcc agacgttctt caaacggttc caacgggcct ggggcattgc ctggatgggt 5940gcgtggaaaa tggagttcca aagccgaggc gctccgcatt ttcacctgta catggtccct 6000cctcatggga aggcaggaga ctcgcggaag ctgcggcatg atgctgagct cttgaaatgg 6060gagatagcac gtgcagaggg tgaagaccca ggtcgcaggc cgtatttccg ggaagctcca 6120agcgatggat tgaagtttcg tccgtggctt tctgcggtgt gggccgacgt cgtagatcat 6180ccggacccca aggaaaaaga aaagcacgtc agtgccggca ctggagtgga ctacgcggag 6240ggcacgcgag ggtcagatcc gaaaaggctt gcggtgtact tctccaagca tggaaccttt 6300gccgacaagg aatatcagca cgtagttcct gctcaatggc agaaaacggg tgcgggacct 6360ggcaggttct ggggctaccg cggtttgtcg ccggccacgg ctgccaccga gatttcctgg 6420gatgagtacc tgcttttatc tcgcacgttg cgacgattgt cagcgcgaac gaagatctgg 6480gacccggctt tacgaggcgg tagcggcggc cacagatgga ctaaggcgat gatgcgacgc 6540acggttaccc ggcaccgctt ggacctcgtg accggtgaga ttctgggcac gaagacgcgg 6600aaggttcggg cgccagtgaa gaggtttgtc cggacttcgg gatacctgtg tgtcaatgac 6660gggcccgcac tggctcgaac cctcagccgt cttcgtacaa gctgcctgag ctagacgcgc 6720ggaacgcctt tcggctttgt cttttgctgg atggcgggtt ttgggcggct tctggtgatg 6780cgctgctgcg ctccgtgggg agagagaccc aacgactgac ctatctctac ccaggtgcaa 6840ttcatctccc gcgctctgtc ggctaggtaa acgaggtgct cccgcgcgag cttttccatg 6900tggtcggcca atgtcagctc ggtcaggaca acctgctgtt gttgcgatag ttgtgtccgc 6960acgggtcgat tgtcttctgt tgcggcataa cggttttcgt cgttcgcgga gagtgcggct 7020aaatgaattg catcctcgat tgagcggagc atttcgacgc ggaacctggc gatgatgttg 7080tctctgtctt cattcataac tgaagcgtat tgggagtgtt gccctcccac catgtgtgcc 7140aatgcaggtg tgaactgagt cacagtttct caatagactc caagtttgtg atccttttac 7200tcccaaaatg gggcatgatg tgtgcgtgcc tcggttcagg ggcgaaagtt cgacacctcg 7260aaagaaggcc tcgacatggc tttgaaagct gctggcaacg tgattcctga ttcctccgcg 7320tacgagtacc gggcggttca ggtcgagccg aagatggtca gaaaagaccc ggaagacccg 7380aactctgagc agttccagaa 
gcagaaggac ggcacgccgg tgtggtcgat cgactgcatt 7440cgggtcgacc gggcatcagg caacaaggca atcgtgaccg tgacggttcc ggacgtgatg 7500gaaccggatg ttgcggggcc ggtggagttc tccgagatga ttgccggttt ctgggtttcg 7560cgcagtggtt cgggcatgtg gttttcggca agcgccgtcg cttctctctg atcgctgatc 7620gtcgcccctc gaaaggttcg gaaatgtcca aaggaaaagg cgttgcgctg ggtgtgggtg 7680ccctcgtgct cgtgtttgtg ctggttgcgg caggttggca agcggcgaac gtgttcagtg 7740atcgttcaca gtccgaagct gtgccgctga gagtgccggc cgatccgaag tgggaaaacg 7800gggtgttctc ggacgttgcc gggtgcctcg ttctctctcc ggaagagctg gggccgttca 7860gcggagggca gtacatcgac atagtgaggc cagttgagcc ggagaggttg gagcgcgact 7920gggtgaggtc ggctgagtgc gtttcggcgt cgatgaatgt ctctgacctg ttggtttctg 7980ctcttccaga gtccacccgt ccccccggcg atttcgttcg ttcgtggaaa gtggcgagtg 8040atgattactg ctatgagggt gataacccgc aaggctgcac ttctcgtatg ccggtttggg 8100tctctgcaaa aaactggtgg tgca 8124917675DNAArtificialpRET1103 91gacctgcagg catgcaagct tggcactggc cgtcgtttta caacgtcgtg actgggaaaa 60ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca gctggcgtaa 120tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga atggcgaatg 180agcttcttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc 240ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg 300aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct 360ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca 420gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct 480cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc 540gggaagcgtg gcgctttctc aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt 600tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc 660cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc 720cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg 780gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc 840agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag 900cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga 960tcctttgatc 
ttttctacgg ggtctgacgc tcagtggaac tccgtcgaac ggaagatcac 1020ttcgcagaat aaataaatcc tggtgtccct gttgataccg ggaagccctg ggccaacttt 1080tggcgaaaat gagacgttga tcggcacgta agaggttcca actttcacca taatgaaata 1140agatcactac cgggcgtatt ttttgagtta tcgagatttt caggagctaa ggaagctaaa 1200atggagaaaa aaatcactgg atataccacc gttgatatat cccaatggca tcgtaaagaa 1260cattttgagg catttcagtc agttgctcaa tgtacctata accagaccgt tcagctggat 1320attacggcct ttttaaagac cgtaaagaaa aataagcaca agttttatcc ggcctttatt 1380cacattcttg cccgcctgat gaatgctcat ccggaatttc gtatggcaat gaaagacggt 1440gagctggtga tatgggatag tgttcaccct tgttacaccg ttttccatga gcaaactgaa 1500acgttttcat cgctctggag tgaataccac gacgatttcc ggcagtttct acacatatat 1560tcgcaagatg tggcgtgtta cggtgaaaac ctggcctatt tccctaaagg gtttattgag 1620aatatgtttt tcgtctcagc caatccctgg gtgagtttca ccagttttga tttaaacgtg 1680gccaatatgg acaacttctt cgcccccgtt ttcaccatgg gcaaatatta tacgcaaggc 1740gacaaggtgc tgatgccgct ggcgattcag gttcatcatg ccgtctgtga tggcttccat 1800gtcggcagaa tgcttaatga attacaacag tactgcgatg agtggcaggg cggggcgtaa 1860tttttttaag gcagttattg gtgcccttaa acgcctggtg ctacgcctga ataagtgata 1920ataagcggat gaatggcaga aattcagctt ggcccagtgc caagctccaa tacgcaaacc 1980gcctctcccc gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg 2040gaaagcgggc agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca 2100ggctttacac tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt 2160tcacacagga aacagctatg accatgatta cgaattcgag ctcggtaccc ggggatcctc 2220tagagtctgc acagaacccg tactcgatcc
gctcgttcgt cgctgtgagg tctttcctgc 2280aaggcaaatc gttgtgccgg aaggggtttc gtgatgtttc tccgagcgtt ttttcgttcc 2340aagttggtca tggtggctct tgtcctggtc gctggcctgt ttctctacaa cgcctgctct 2400tcttctgacg caaaggaaga gatcggcagc agtctgaatc tctctcctgt cactgctcgt 2460tcgaatccgt atgagggcgt ccagcccacg atgagcgaaa aaagccctgt tcccgtccct 2520gtcgtttccg gcgacaggat ttcgggggtg gcatcgtgcg ggacggatta cgccgggaag 2580cctgcggtga cgctggaagc tgtgtggatt tcgtccgact cggtgaacta cacactcgat 2640aagaggcatt gcctggtgac gaccggcccg ctgtggaaac aagcgatccg taaagcgtca 2700gggtcagaga ttcggcctga gggcgggagc tggatacggg tggtgcttgc catgcctgac 2760ggcaatttca gggcaggatg ggcaccccac gcccaagtaa ccgctggtgc gctggatatt 2820tcggcggtgg tctcgtgagc ggggagaagc ggcacagcga ggccggcccg gtagaaatca 2880tctttttgat gctggcagtc agggcggggg actacatcgt cgccgtgact gcggttctcg 2940cggtcgggtt cttcgcggtc gcggttgagg gtttctggtt cctggtcgtc gcagtcatcg 3000ctgcaccggc gtggtggttt ctgcgcgact gggaatcgaa gcggagggcc gtacgggtct 3060ttgaacgggc atggaagggg acacctgaat cccccggtat tgctctctcc cttggcctgt 3120cgaacgtggc ggggtctctg ccgaggttga ggaagtttga aactggttcg gggatacgca 3180cactcgtgtt ttctttgccg cccggagtca ctgccgagag ctttgagaaa gttcgccctg 3240cgctggcaga cgcgatgggg ggtcaccgct gccaagtaga gaaggtggcc cccggacagg 3300tccgcgtcag agtgattgat gaggattcga tgaagacgcc gcgtgatgcg ggatgggcga 3360aagatgttgt gctggaagag gatacgttcg acggtcttcc gggcgagacg cgatcctggt 3420tcgagcaaga ggggccggca tcatgagaaa atcggcggga gtatctcgga ttcctatccg 3480tctcgggcgc tctcagtacg gggaagacgt tggattcgat ctcgctgcgg acgccgctca 3540catcgccatg cagggcaaaa cccgatccgg caaaagtcag gcgacgtaca acgtgttagc 3600tcaggcagca gcgaacgcgg cggttcgagt cgtagggtcc gacccgacac acgtactcct 3660ggagcccttc aaacatcgag gggtgtccga gccttacgtg gtttcgggac tgaatgcgca 3720ggccacggtg gacatgctgg gctgggtcaa gcgtgagtct gatcgtcgca tcgaccagat 3780gtggcccctg cgtaccgaca agttttccga gttcggggct tcgttcccgc tgatactcgt 3840cgtgctcgaa gagtttcccg ggatcctcga gggggcagcg gacgaagacg ccgcgttagg 3900ccgaaaacct gccgagcgtc tcgcaccccg catttcggcc tacgtgcgtc agatagcagc 
3960gcagtcggca aaggctggaa ttcgccttct cctgctctcg caacgagcgg aggcctcgat 4020cattggcggc aatgcgcgtt cgaatttcgg ggtcaagatg actctgaggg tggacgaacc 4080ggagtcggtg agaatgcttc atccgagcgc ttccccggaa gactgtgccc tggtcgagac 4140cttcaagcct ggtacctgcc ttttcgagaa gccaggagaa ggccggcaga ttatgcgatg 4200cgactttgtc ggcgagtacg ggagatatgc gcgagccatc gagtcttcgg atctgcgttt 4260tctcgccacc ctccagcaag accaggccca acgcgaattc ttcgctgagg agttcggtgt 4320ggtggatccg tcatgactgg accacaggag agaaagcgca aggcggcgaa gccgtcgcgg 4380gagcctcagt tgaactgctg tgaagcggac gtgccgaaac gagcaaaaca gcccccggtt 4440ccctctacgt tcgacctgct cacggtgaag gagactgcgg ggctgctgag agtcagtcag 4500gcaactcttt accggctgct tcggagtggg gaaggaccca catacacacg gatcggtgga 4560cagatacgcg ttcaccgcga gtcgctgcgt cggttcatcg aaccgcgtgg ataacgtcac 4620agagacagcg aaaacgcctc ccctgggtca atccggttac cgccggactg ggggaggcgc 4680ttcgacacct acatccgtcg cccctcgaaa ggctcagatg cacttccacg ataacgcaga 4740ggtcggacaa gagggaagaa ctgccgttct ctcgccgttg cgcggcgtag ccgccaagcg 4800ggacgtgtct gacgatgcag cgaagcggag tcggcaggcg cggcacgcgc ctgggcttgt 4860tacatctgcc acaactgtcc gtgaatctct gccagctcct gaaaccgctg gtcagggcct 4920tgcggaatcc gtgaccgctg atgatttttg gtctcattcg ttcccccgcg ctgacgatgt 4980acgcggcgca gctgcttcct tccagtcggt ggctaactgg gatgggcgtg agggtccgag 5040gccgcgtttc gttgtcgcgc ctggcgttgt ccgcttggag gtttgtgatc tcgcacgccg 5100cgaacgaacg gctgaacgtg cgtatctggc tgctcgggct cgggtggata tggcggctgc 5160caggcataac tcgccgtacg acttcgacgt ggacgatgaa gagttggcgg aactggcttc 5220tctgcaaggc ctcgaggacg acgacattgg gggctggtct gcggagaggg aaatagtggg 5280ctggtctgct cgttctcggt cacggatgat cttgcgaatg gcagaactcg actgggctcc 5340catgatggat ttgccgggca ttcctgcgat ggtgaccctc acctatccgg gggactggct 5400tacggttgcc cccaccggcg ctgaggtcaa aaaacatctc cagacgttct tcaaacggtt 5460ccaacgggcc tggggcattg cctggatggg tgcgtggaaa atggagttcc aaagccgagg 5520cgctccgcat tttcacctgt acatggtccc tcctcatggg aaggcaggag actcgcggaa 5580gctgcggcat gatgctgagc tcttgaaatg ggagatagca cgtgcagagg gtgaagaccc 5640aggtcgcagg ccgtatttcc gggaagctcc 
aagcgatgga ttgaagtttc gtccgtggct 5700ttctgcggtg tgggccgacg tcgtagatca tccggacccc aaggaaaaag aaaagcacgt 5760cagtgccggc actggagtgg actacgcgga gggcacgcga gggtcagatc cgaaaaggct 5820tgcggtgtac ttctccaagc atggaacctt tgccgacaag gaatatcagc acgtagttcc 5880tgctcaatgg cagaaaacgg gtgcgggacc tggcaggttc tggggctacc gcggtttgtc 5940gccggccacg gctgccaccg agatttcctg ggatgagtac ctgcttttat ctcgcacgtt 6000gcgacgattg tcagcgcgaa cgaagatctg ggacccggct ttacgaggcg gtagcggcgg 6060ccacagatgg actaaggcga tgatgcgacg cacggttacc cggcaccgct tggacctcgt 6120gaccggtgag attctgggca cgaagacgcg gaaggttcgg gcgccagtga agaggtttgt 6180ccggacttcg ggatacctgt gtgtcaatga cgggcccgca ctggctcgaa ccctcagccg 6240tcttcgtaca agctgcctga gctagacgcg cggaacgcct ttcggctttg tcttttgctg 6300gatggcgggt tttgggcggc ttctggtgat gcgctgctgc gctccgtggg gagagagacc 6360caacgactga cctatctcta cccaggtgca attcatctcc cgcgctctgt cggctaggta 6420aacgaggtgc tcccgcgcga gcttttccat gtggtcggcc aatgtcagct cggtcaggac 6480aacctgctgt tgttgcgata gttgtgtccg cacgggtcga ttgtcttctg ttgcggcata 6540acggttttcg tcgttcgcgg agagtgcggc taaatgaatt gcatcctcga ttgagcggag 6600catttcgacg cggaacctgg cgatgatgtt gtctctgtct tcattcataa ctgaagcgta 6660ttgggagtgt tgccctccca ccatgtgtgc caatgcaggt gtgaactgag tcacagtttc 6720tcaatagact ccaagtttgt gatcctttta ctcccaaaat ggggcatgat gtgtgcgtgc 6780ctcggttcag gggcgaaagt tcgacacctc gaaagaaggc ctcgacatgg ctttgaaagc 6840tgctggcaac gtgattcctg attcctccgc gtacgagtac cgggcggttc aggtcgagcc 6900gaagatggtc agaaaagacc cggaagaccc gaactctgag cagttccaga agcagaagga 6960cggcacgccg gtgtggtcga tcgactgcat tcgggtcgac cgggcatcag gcaacaaggc 7020aatcgtgacc gtgacggttc cggacgtgat ggaaccggat gttgcggggc cggtggagtt 7080ctccgagatg attgccggtt tctgggtttc gcgcagtggt tcgggcatgt ggttttcggc 7140aagcgccgtc gcttctctct gatcgctgat cgtcgcccct cgaaaggttc ggaaatgtcc 7200aaaggaaaag gcgttgcgct gggtgtgggt gccctcgtgc tcgtgtttgt gctggttgcg 7260gcaggttggc aagcggcgaa cgtgttcagt gatcgttcac agtccgaagc tgtgccgctg 7320agagtgccgg ccgatccgaa gtgggaaaac ggggtgttct cggacgttgc cgggtgcctc 
7380gttctctctc cggaagagct ggggccgttc agcggagggc agtacatcga catagtgagg 7440ccagttgagc cggagaggtt ggagcgcgac tgggtgaggt cggctgagtg cgtttcggcg 7500tcgatgaatg tctctgacct gttggtttct gctcttccag agtccacccg tccccccggc 7560gatttcgttc gttcgtggaa agtggcgagt gatgattact gctatgaggg tgataacccg 7620caaggctgca cttctcgtat gccggtttgg gtctctgcaa aaaactggtg gtgca 7675

SEQ ID NO: 92 (length 8134, DNA, Artificial, pRET1101Rv)

ggggatcctc tagagtcgac ctgcaggcat gcaagcttgg cactggccgt cgttttacaa 60cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct 120ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc 180agcctgaatg gcgaatggcg cctgatgcgg tattttctcc ttacgcatct gtgcggtatt 240tcacaccgca tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag 300ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc 360gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca 420tcaccgaaac gcgcgagacg aaagggcctc gtgatacgcc tatttttata ggttaatgtc 480atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt gcgcggaacc 540cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc 600tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc 660gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc agaaacgctg 720gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat cgaactggat 780ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc aatgatgagc 840acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa 900ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc agtcacagaa 960aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat aaccatgagt 1020gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga gctaaccgct 1080tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc ggagctgaat 1140gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc aacaacgttg 1200cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt aatagactgg 1260atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc tggctggttt 1320attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc agcactgggg 1380ccagatggta agccctcccg 
tatcgtagtt atctacacga cggggagtca ggcaactatg 1440gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca ttggtaactg 1500tcagaccaag tttactcata tatactttag attgatttaa aacttcattt ttaatttaaa 1560aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta acgtgagttt 1620tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt 1680tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt 1740ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag 1800ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta 1860gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat 1920aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg 1980ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg 2040agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac 2100aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga 2160aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt 2220ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta 2280cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt atcccctgat 2340tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg cagccgaacg 2400accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg caaaccgcct 2460ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc cgactggaaa 2520gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc accccaggct 2580ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata acaatttcac 2640acaggaaaca gctatgacca tgattacgaa ttcgagctcg gtaccctgca cagaacccgt 2700actcgatccg ctcgttcgtc gctgtgaggt ctttcctgca aggcaaatcg ttgtgccgga 2760aggggtttcg tgatgtttct ccgagcgttt tttcgttcca agttggtcat ggtggctctt 2820gtcctggtcg ctggcctgtt tctctacaac gcctgctctt cttctgacgc aaaggaagag 2880atcggcagca gtctgaatct ctctcctgtc actgctcgtt cgaatccgta tgagggcgtc 2940cagcccacga tgagcgaaaa aagccctgtt cccgtccctg tcgtttccgg cgacaggatt 3000tcgggggtgg catcgtgcgg gacggattac gccgggaagc ctgcggtgac gctggaagct 3060gtgtggattt cgtccgactc ggtgaactac acactcgata agaggcattg 
cctggtgacg 3120accggcccgc tgtggaaaca agcgatccgt aaagcgtcag ggtcagagat tcggcctgag 3180ggcgggagct ggatacgggt ggtgcttgcc atgcctgacg gcaatttcag ggcaggatgg 3240gcaccccacg cccaagtaac cgctggtgcg ctggatattt cggcggtggt ctcgtgagcg 3300gggagaagcg gcacagcgag gccggcccgg tagaaatcat ctttttgatg ctggcagtca 3360gggcggggga ctacatcgtc gccgtgactg cggttctcgc ggtcgggttc ttcgcggtcg 3420cggttgaggg tttctggttc ctggtcgtcg cagtcatcgc tgcaccggcg tggtggtttc 3480tgcgcgactg ggaatcgaag cggagggccg tacgggtctt tgaacgggca tggaagggga 3540cacctgaatc ccccggtatt gctctctccc ttggcctgtc gaacgtggcg gggtctctgc 3600cgaggttgag gaagtttgaa actggttcgg ggatacgcac actcgtgttt tctttgccgc 3660ccggagtcac tgccgagagc tttgagaaag ttcgccctgc gctggcagac gcgatggggg 3720gtcaccgctg ccaagtagag aaggtggccc ccggacaggt ccgcgtcaga gtgattgatg 3780aggattcgat gaagacgccg cgtgatgcgg gatgggcgaa agatgttgtg ctggaagagg 3840atacgttcga cggtcttccg ggcgagacgc gatcctggtt cgagcaagag gggccggcat 3900catgagaaaa tcggcgggag tatctcggat tcctatccgt ctcgggcgct ctcagtacgg 3960ggaagacgtt ggattcgatc tcgctgcgga cgccgctcac atcgccatgc agggcaaaac 4020ccgatccggc aaaagtcagg cgacgtacaa cgtgttagct caggcagcag cgaacgcggc 4080ggttcgagtc gtagggtccg acccgacaca cgtactcctg gagcccttca aacatcgagg 4140ggtgtccgag ccttacgtgg tttcgggact gaatgcgcag gccacggtgg acatgctggg 4200ctgggtcaag cgtgagtctg atcgtcgcat cgaccagatg tggcccctgc gtaccgacaa 4260gttttccgag ttcggggctt cgttcccgct gatactcgtc gtgctcgaag agtttcccgg 4320gatcctcgag ggggcagcgg acgaagacgc cgcgttaggc cgaaaacctg ccgagcgtct 4380cgcaccccgc atttcggcct acgtgcgtca gatagcagcg cagtcggcaa aggctggaat 4440tcgccttctc ctgctctcgc aacgagcgga ggcctcgatc attggcggca atgcgcgttc 4500gaatttcggg gtcaagatga ctctgagggt ggacgaaccg gagtcggtga gaatgcttca 4560tccgagcgct tccccggaag actgtgccct ggtcgagacc ttcaagcctg gtacctgcct 4620tttcgagaag ccaggagaag gccggcagat tatgcgatgc gactttgtcg gcgagtacgg 4680gagatatgcg cgagccatcg agtcttcgga tctgcgtttt ctcgccaccc tccagcaaga 4740ccaggcccaa cgcgaattct tcgctgagga gttcggtgtg gtggatccgt catgactgga 4800ccacaggaga gaaagcgcaa 
ggcggcgaag ccgtcgcggg agcctcagtt gaactgctgt 4860gaagcggacg tgccgaaacg agcaaaacag cccccggttc cctctacgtt cgacctgctc 4920acggtgaagg agactgcggg gctgctgaga gtcagtcagg caactcttta ccggctgctt 4980cggagtgggg aaggacccac atacacacgg atcggtggac agatacgcgt tcaccgcgag 5040tcgctgcgtc ggttcatcga accgcgtgga taacgtcaca gagacagcga aaacgcctcc 5100cctgggtcaa tccggttacc gccggactgg gggaggcgct tcgacaccta catccgtcgc 5160ccctcgaaag gctcagatgc acttccacga taacgcagag gtcggacaag agggaagaac 5220tgccgttctc tcgccgttgc gcggcgtagc cgccaagcgg gacgtgtctg acgatgcagc 5280gaagcggagt cggcaggcgc ggcacgcgcc tgggcttgtt acatctgcca caactgtccg 5340tgaatctctg ccagctcctg aaaccgctgg tcagggcctt gcggaatccg tgaccgctga 5400tgatttttgg tctcattcgt tcccccgcgc tgacgatgta cgcggcgcag ctgcttcctt 5460ccagtcggtg gctaactggg atgggcgtga gggtccgagg ccgcgtttcg ttgtcgcgcc 5520tggcgttgtc cgcttggagg tttgtgatct cgcacgccgc gaacgaacgg ctgaacgtgc 5580gtatctggct gctcgggctc gggtggatat ggcggctgcc aggcataact cgccgtacga 5640cttcgacgtg gacgatgaag agttggcgga actggcttct ctgcaaggcc tcgaggacga 5700cgacattggg ggctggtctg cggagaggga aatagtgggc tggtctgctc gttctcggtc 5760acggatgatc ttgcgaatgg cagaactcga ctgggctccc atgatggatt tgccgggcat 5820tcctgcgatg gtgaccctca cctatccggg ggactggctt acggttgccc ccaccggcgc 5880tgaggtcaaa aaacatctcc agacgttctt caaacggttc caacgggcct ggggcattgc 5940ctggatgggt gcgtggaaaa tggagttcca aagccgaggc gctccgcatt ttcacctgta 6000catggtccct cctcatggga aggcaggaga ctcgcggaag ctgcggcatg atgctgagct 6060cttgaaatgg gagatagcac gtgcagaggg tgaagaccca ggtcgcaggc cgtatttccg 6120ggaagctcca agcgatggat tgaagtttcg tccgtggctt tctgcggtgt gggccgacgt 6180cgtagatcat ccggacccca aggaaaaaga aaagcacgtc agtgccggca ctggagtgga 6240ctacgcggag ggcacgcgag ggtcagatcc gaaaaggctt gcggtgtact tctccaagca 6300tggaaccttt gccgacaagg aatatcagca cgtagttcct gctcaatggc agaaaacggg 6360tgcgggacct ggcaggttct ggggctaccg cggtttgtcg ccggccacgg ctgccaccga 6420gatttcctgg gatgagtacc tgcttttatc tcgcacgttg cgacgattgt cagcgcgaac 6480gaagatctgg gacccggctt tacgaggcgg tagcggcggc cacagatgga 
ctaaggcgat 6540gatgcgacgc acggttaccc ggcaccgctt ggacctcgtg accggtgaga ttctgggcac 6600gaagacgcgg aaggttcggg cgccagtgaa gaggtttgtc cggacttcgg gatacctgtg 6660tgtcaatgac gggcccgcac tggctcgaac cctcagccgt cttcgtacaa gctgcctgag 6720ctagacgcgc ggaacgcctt tcggctttgt cttttgctgg atggcgggtt ttgggcggct 6780tctggtgatg cgctgctgcg ctccgtgggg agagagaccc aacgactgac ctatctctac 6840ccaggtgcaa ttcatctccc gcgctctgtc ggctaggtaa acgaggtgct cccgcgcgag 6900cttttccatg tggtcggcca atgtcagctc ggtcaggaca acctgctgtt gttgcgatag 6960ttgtgtccgc acgggtcgat tgtcttctgt tgcggcataa cggttttcgt cgttcgcgga 7020gagtgcggct aaatgaattg catcctcgat tgagcggagc atttcgacgc ggaacctggc 7080gatgatgttg tctctgtctt cattcataac tgaagcgtat tgggagtgtt gccctcccac 7140catgtgtgcc aatgcaggtg tgaactgagt cacagtttct caatagactc caagtttgtg 7200atccttttac tcccaaaatg gggcatgatg tgtgcgtgcc tcggttcagg ggcgaaagtt 7260cgacacctcg aaagaaggcc tcgacatggc tttgaaagct gctggcaacg tgattcctga 7320ttcctccgcg tacgagtacc gggcggttca ggtcgagccg aagatggtca gaaaagaccc 7380ggaagacccg aactctgagc agttccagaa gcagaaggac ggcacgccgg tgtggtcgat 7440cgactgcatt cgggtcgacc gggcatcagg caacaaggca atcgtgaccg tgacggttcc 7500ggacgtgatg gaaccggatg ttgcggggcc ggtggagttc tccgagatga ttgccggttt 7560ctgggtttcg cgcagtggtt cgggcatgtg gttttcggca agcgccgtcg cttctctctg 7620atcgctgatc gtcgcccctc gaaaggttcg gaaatgtcca aaggaaaagg cgttgcgctg 7680ggtgtgggtg ccctcgtgct cgtgtttgtg ctggttgcgg caggttggca agcggcgaac 7740gtgttcagtg atcgttcaca gtccgaagct gtgccgctga gagtgccggc cgatccgaag 7800tgggaaaacg gggtgttctc ggacgttgcc gggtgcctcg ttctctctcc ggaagagctg 7860gggccgttca gcggagggca gtacatcgac atagtgaggc cagttgagcc ggagaggttg 7920gagcgcgact gggtgaggtc ggctgagtgc gtttcggcgt cgatgaatgt ctctgacctg 7980ttggtttctg ctcttccaga gtccacccgt ccccccggcg atttcgttcg ttcgtggaaa 8040gtggcgagtg atgattactg ctatgagggt gataacccgc aaggctgcac ttctcgtatg 8100ccggtttggg tctctgcaaa aaactggtgg tgca 8134

SEQ ID NO: 93 (length 8124, DNA, Artificial, pRET1102Rv)

gactctagag gatccccggg taccgagctc gaattcactg gccgtcgttt tacaacgtcg 60tgactgggaa aaccctggcg 
ttacccaact taatcgcctt gcagcacatc cccctttcgc 120cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct 180gaatggcgaa tgcgatttat tcaacaaagc cgccgtcccg tcaagtcagc gtaatgctct 240gccagtgtta caaccaatta accaattctg attagaaaaa ctcatcgagc atcaaatgaa 300actgcaattt attcatatca ggattatcaa taccatattt ttgaaaaagc cgtttctgta 360atgaaggaga aaactcaccg aggcagttcc ataggatggc aagatcctgg tatcggtctg 420cgattccgac tcgtccaaca tcaatacaac ctattaattt cccctcgtca aaaataaggt 480tatcaagtga gaaatcacca tgagtgacga ctgaatccgg tgagaatggc aaaagcttat 540gcatttcttt ccagacttgt tcaacaggcc agccattacg ctcgtcatca aaatcactcg 600catcaaccaa accgttattc attcgtgatt gcgcctgagc gagacgaaat acgcgatcgc 660tgttaaaagg acaattacaa acaggaatcg aatgcaaccg gcgcaggaac actgccagcg 720catcaacaat attttcacct gaatcaggat attcttctaa tacctggaat gctgttttcc 780cggggatcgc agtggtgagt aaccatgcat catcaggagt acggataaaa tgcttgatgg 840tcggaagagg cataaattcc gtcagccagt ttagtctgac catctcatct gtaacatcat 900tggcaacgct acctttgcca tgtttcagaa acaactctgg cgcatcgggc ttcccataca 960atcgatagat tgtcgcacct gattgcccga cattatcgcg agcccattta tacccatata 1020aatcagcatc catgttggaa tttaatcgcg gcttcgagca agacgtttcc cgttgaatat 1080ggctcataac accccttgta ttactgttta tgtaagcaga cagttttatt gttcatgatg 1140atatattttt atcttgtgca atgtaacatc agagattttg agacacaacg tggctttgtt 1200gaataaatcg aacttttgct gagttgaagg atcagatcac gcatcttccc gacaacgcag 1260accgttccgt ggcaaagcaa aagttcaaaa tcaccaactg gtccacctac aacaaagctc 1320tcatcaaccg tggctccctc actttctggc tggatgatgg ggcgattcag gcctggtatg 1380agtcagcaac accttcttca cgaggcagac
ctctcgacgg atcgttccac tgagcgtcag 1440accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct 1500gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac 1560caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc 1620tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg 1680ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt 1740tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt 1800gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc 1860attgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca 1920gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata 1980gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg 2040ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct 2100ggccttttgc tcacatgttc tttcctgcgt tatcccctga ttctgtggat aaccgtatta 2160ccgcctttga gtgagctgat accgctcgcc gcagccgaac gaccgagcgc agcgagtcag 2220tgagcgagga agcggaagaa gctcgcacat tcagcagcgt ttttcagcgc gttttcgatc 2280agcgtttcaa tgttggtatc aacaccaggt ttaactttga acttatcggc actgacggtt 2340actgattttg aacttttgct ttgccacgga acggtctgcg ttgtcgggaa gatgcgtgat 2400ctgatccttc aactcagcaa aagttcgcca atacgcaaac cgcctctccc cgcgcgttgg 2460ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc 2520aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt 2580ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagctat 2640gaccatgatt acgccaagct tgcatgcctg caggtctgca cagaacccgt actcgatccg 2700ctcgttcgtc gctgtgaggt ctttcctgca aggcaaatcg ttgtgccgga aggggtttcg 2760tgatgtttct ccgagcgttt tttcgttcca agttggtcat ggtggctctt gtcctggtcg 2820ctggcctgtt tctctacaac gcctgctctt cttctgacgc aaaggaagag atcggcagca 2880gtctgaatct ctctcctgtc actgctcgtt cgaatccgta tgagggcgtc cagcccacga 2940tgagcgaaaa aagccctgtt cccgtccctg tcgtttccgg cgacaggatt tcgggggtgg 3000catcgtgcgg gacggattac gccgggaagc ctgcggtgac gctggaagct gtgtggattt 3060cgtccgactc ggtgaactac acactcgata agaggcattg cctggtgacg accggcccgc 
3120tgtggaaaca agcgatccgt aaagcgtcag ggtcagagat tcggcctgag ggcgggagct 3180ggatacgggt ggtgcttgcc atgcctgacg gcaatttcag ggcaggatgg gcaccccacg 3240cccaagtaac cgctggtgcg ctggatattt cggcggtggt ctcgtgagcg gggagaagcg 3300gcacagcgag gccggcccgg tagaaatcat ctttttgatg ctggcagtca gggcggggga 3360ctacatcgtc gccgtgactg cggttctcgc ggtcgggttc ttcgcggtcg cggttgaggg 3420tttctggttc ctggtcgtcg cagtcatcgc tgcaccggcg tggtggtttc tgcgcgactg 3480ggaatcgaag cggagggccg tacgggtctt tgaacgggca tggaagggga cacctgaatc 3540ccccggtatt gctctctccc ttggcctgtc gaacgtggcg gggtctctgc cgaggttgag 3600gaagtttgaa actggttcgg ggatacgcac actcgtgttt tctttgccgc ccggagtcac 3660tgccgagagc tttgagaaag ttcgccctgc gctggcagac gcgatggggg gtcaccgctg 3720ccaagtagag aaggtggccc ccggacaggt ccgcgtcaga gtgattgatg aggattcgat 3780gaagacgccg cgtgatgcgg gatgggcgaa agatgttgtg ctggaagagg atacgttcga 3840cggtcttccg ggcgagacgc gatcctggtt cgagcaagag gggccggcat catgagaaaa 3900tcggcgggag tatctcggat tcctatccgt ctcgggcgct ctcagtacgg ggaagacgtt 3960ggattcgatc tcgctgcgga cgccgctcac atcgccatgc agggcaaaac ccgatccggc 4020aaaagtcagg cgacgtacaa cgtgttagct caggcagcag cgaacgcggc ggttcgagtc 4080gtagggtccg acccgacaca cgtactcctg gagcccttca aacatcgagg ggtgtccgag 4140ccttacgtgg tttcgggact gaatgcgcag gccacggtgg acatgctggg ctgggtcaag 4200cgtgagtctg atcgtcgcat cgaccagatg tggcccctgc gtaccgacaa gttttccgag 4260ttcggggctt cgttcccgct gatactcgtc gtgctcgaag agtttcccgg gatcctcgag 4320ggggcagcgg acgaagacgc cgcgttaggc cgaaaacctg ccgagcgtct cgcaccccgc 4380atttcggcct acgtgcgtca gatagcagcg cagtcggcaa aggctggaat tcgccttctc 4440ctgctctcgc aacgagcgga ggcctcgatc attggcggca atgcgcgttc gaatttcggg 4500gtcaagatga ctctgagggt ggacgaaccg gagtcggtga gaatgcttca tccgagcgct 4560tccccggaag actgtgccct ggtcgagacc ttcaagcctg gtacctgcct tttcgagaag 4620ccaggagaag gccggcagat tatgcgatgc gactttgtcg gcgagtacgg gagatatgcg 4680cgagccatcg agtcttcgga tctgcgtttt ctcgccaccc tccagcaaga ccaggcccaa 4740cgcgaattct tcgctgagga gttcggtgtg gtggatccgt catgactgga ccacaggaga 4800gaaagcgcaa ggcggcgaag ccgtcgcggg 
agcctcagtt gaactgctgt gaagcggacg 4860tgccgaaacg agcaaaacag cccccggttc cctctacgtt cgacctgctc acggtgaagg 4920agactgcggg gctgctgaga gtcagtcagg caactcttta ccggctgctt cggagtgggg 4980aaggacccac atacacacgg atcggtggac agatacgcgt tcaccgcgag tcgctgcgtc 5040ggttcatcga accgcgtgga taacgtcaca gagacagcga aaacgcctcc cctgggtcaa 5100tccggttacc gccggactgg gggaggcgct tcgacaccta catccgtcgc ccctcgaaag 5160gctcagatgc acttccacga taacgcagag gtcggacaag agggaagaac tgccgttctc 5220tcgccgttgc gcggcgtagc cgccaagcgg gacgtgtctg acgatgcagc gaagcggagt 5280cggcaggcgc ggcacgcgcc tgggcttgtt acatctgcca caactgtccg tgaatctctg 5340ccagctcctg aaaccgctgg tcagggcctt gcggaatccg tgaccgctga tgatttttgg 5400tctcattcgt tcccccgcgc tgacgatgta cgcggcgcag ctgcttcctt ccagtcggtg 5460gctaactggg atgggcgtga gggtccgagg ccgcgtttcg ttgtcgcgcc tggcgttgtc 5520cgcttggagg tttgtgatct cgcacgccgc gaacgaacgg ctgaacgtgc gtatctggct 5580gctcgggctc gggtggatat ggcggctgcc aggcataact cgccgtacga cttcgacgtg 5640gacgatgaag agttggcgga actggcttct ctgcaaggcc tcgaggacga cgacattggg 5700ggctggtctg cggagaggga aatagtgggc tggtctgctc gttctcggtc acggatgatc 5760ttgcgaatgg cagaactcga ctgggctccc atgatggatt tgccgggcat tcctgcgatg 5820gtgaccctca cctatccggg ggactggctt acggttgccc ccaccggcgc tgaggtcaaa 5880aaacatctcc agacgttctt caaacggttc caacgggcct ggggcattgc ctggatgggt 5940gcgtggaaaa tggagttcca aagccgaggc gctccgcatt ttcacctgta catggtccct 6000cctcatggga aggcaggaga ctcgcggaag ctgcggcatg atgctgagct cttgaaatgg 6060gagatagcac gtgcagaggg tgaagaccca ggtcgcaggc cgtatttccg ggaagctcca 6120agcgatggat tgaagtttcg tccgtggctt tctgcggtgt gggccgacgt cgtagatcat 6180ccggacccca aggaaaaaga aaagcacgtc agtgccggca ctggagtgga ctacgcggag 6240ggcacgcgag ggtcagatcc gaaaaggctt gcggtgtact tctccaagca tggaaccttt 6300gccgacaagg aatatcagca cgtagttcct gctcaatggc agaaaacggg tgcgggacct 6360ggcaggttct ggggctaccg cggtttgtcg ccggccacgg ctgccaccga gatttcctgg 6420gatgagtacc tgcttttatc tcgcacgttg cgacgattgt cagcgcgaac gaagatctgg 6480gacccggctt tacgaggcgg tagcggcggc cacagatgga ctaaggcgat gatgcgacgc 
6540acggttaccc ggcaccgctt ggacctcgtg accggtgaga ttctgggcac gaagacgcgg 6600aaggttcggg cgccagtgaa gaggtttgtc cggacttcgg gatacctgtg tgtcaatgac 6660gggcccgcac tggctcgaac cctcagccgt cttcgtacaa gctgcctgag ctagacgcgc 6720ggaacgcctt tcggctttgt cttttgctgg atggcgggtt ttgggcggct tctggtgatg 6780cgctgctgcg ctccgtgggg agagagaccc aacgactgac ctatctctac ccaggtgcaa 6840ttcatctccc gcgctctgtc ggctaggtaa acgaggtgct cccgcgcgag cttttccatg 6900tggtcggcca atgtcagctc ggtcaggaca acctgctgtt gttgcgatag ttgtgtccgc 6960acgggtcgat tgtcttctgt tgcggcataa cggttttcgt cgttcgcgga gagtgcggct 7020aaatgaattg catcctcgat tgagcggagc atttcgacgc ggaacctggc gatgatgttg 7080tctctgtctt cattcataac tgaagcgtat tgggagtgtt gccctcccac catgtgtgcc 7140aatgcaggtg tgaactgagt cacagtttct caatagactc caagtttgtg atccttttac 7200tcccaaaatg gggcatgatg tgtgcgtgcc tcggttcagg ggcgaaagtt cgacacctcg 7260aaagaaggcc tcgacatggc tttgaaagct gctggcaacg tgattcctga ttcctccgcg 7320tacgagtacc gggcggttca ggtcgagccg aagatggtca gaaaagaccc ggaagacccg 7380aactctgagc agttccagaa gcagaaggac ggcacgccgg tgtggtcgat cgactgcatt 7440cgggtcgacc gggcatcagg caacaaggca atcgtgaccg tgacggttcc ggacgtgatg 7500gaaccggatg ttgcggggcc ggtggagttc tccgagatga ttgccggttt ctgggtttcg 7560cgcagtggtt cgggcatgtg gttttcggca agcgccgtcg cttctctctg atcgctgatc 7620gtcgcccctc gaaaggttcg gaaatgtcca aaggaaaagg cgttgcgctg ggtgtgggtg 7680ccctcgtgct cgtgtttgtg ctggttgcgg caggttggca agcggcgaac gtgttcagtg 7740atcgttcaca gtccgaagct gtgccgctga gagtgccggc cgatccgaag tgggaaaacg 7800gggtgttctc ggacgttgcc gggtgcctcg ttctctctcc ggaagagctg gggccgttca 7860gcggagggca gtacatcgac atagtgaggc cagttgagcc ggagaggttg gagcgcgact 7920gggtgaggtc ggctgagtgc gtttcggcgt cgatgaatgt ctctgacctg ttggtttctg 7980ctcttccaga gtccacccgt ccccccggcg atttcgttcg ttcgtggaaa gtggcgagtg 8040atgattactg ctatgagggt gataacccgc aaggctgcac ttctcgtatg ccggtttggg 8100tctctgcaaa aaactggtgg tgca 8124

SEQ ID NO: 94 (length 7675, DNA, Artificial, pRET1103Rv)

gactctagag gatccccggg taccgagctc gaattcgtaa tcatggtcat agctgtttcc 60tgtgtgaaat tgttatccgc tcacaattcc acacaacata 
cgagccggaa gcataaagtg 120taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 180cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg 240gagaggcggt ttgcgtattg gagcttggca ctgggccaag ctgaatttct gccattcatc 300cgcttattat cacttattca ggcgtagcac caggcgttta agggcaccaa taactgcctt 360aaaaaaatta cgccccgccc tgccactcat cgcagtactg ttgtaattca ttaagcattc 420tgccgacatg gaagccatca cagacggcat gatgaacctg aatcgccagc ggcatcagca 480ccttgtcgcc ttgcgtataa tatttgccca tggtgaaaac gggggcgaag aagttgtcca 540tattggccac gtttaaatca aaactggtga aactcaccca gggattggct gagacgaaaa 600acatattctc aataaaccct ttagggaaat aggccaggtt ttcaccgtaa cacgccacat 660cttgcgaata tatgtgtaga aactgccgga aatcgtcgtg gtattcactc cagagcgatg 720aaaacgtttc agtttgctca tggaaaacgg tgtaacaagg gtgaacacta tcccatatca 780ccagctcacc gtctttcatt gccatacgaa attccggatg agcattcatc aggcgggcaa 840gaatgtgaat aaaggccgga taaaacttgt gcttattttt ctttacggtc tttaaaaagg 900ccgtaatatc cagctgaacg gtctggttat aggtacattg agcaactgac tgaaatgcct 960caaaatgttc tttacgatgc cattgggata tatcaacggt ggtatatcca gtgatttttt 1020tctccatttt agcttcctta gctcctgaaa atctcgataa ctcaaaaaat acgcccggta 1080gtgatcttat ttcattatgg tgaaagttgg aacctcttac gtgccgatca acgtctcatt 1140ttcgccaaaa gttggcccag ggcttcccgg tatcaacagg gacaccagga tttatttatt 1200ctgcgaagtg atcttccgtt cgacggagtt ccactgagcg tcagaccccg tagaaaagat 1260caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa 1320accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa 1380ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt agccgtagtt 1440aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt 1500accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata 1560gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt 1620ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag aaagcgccac 1680gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga 1740gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg 1800ccacctctga cttgagcgtc 
gatttttgtg atgctcgtca ggggggcgga gcctatggaa 1860aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat 1920gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc 1980tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga 2040agaagctcat tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc 2100ttcgctatta cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac 2160gccagggttt tcccagtcac gacgttgtaa aacgacggcc agtgccaagc ttgcatgcct 2220gcaggtctgc acagaacccg tactcgatcc gctcgttcgt cgctgtgagg tctttcctgc 2280aaggcaaatc gttgtgccgg aaggggtttc gtgatgtttc tccgagcgtt ttttcgttcc 2340aagttggtca tggtggctct tgtcctggtc gctggcctgt ttctctacaa cgcctgctct 2400tcttctgacg caaaggaaga gatcggcagc agtctgaatc tctctcctgt cactgctcgt 2460tcgaatccgt atgagggcgt ccagcccacg atgagcgaaa aaagccctgt tcccgtccct 2520gtcgtttccg gcgacaggat ttcgggggtg gcatcgtgcg ggacggatta cgccgggaag 2580cctgcggtga cgctggaagc tgtgtggatt tcgtccgact cggtgaacta cacactcgat 2640aagaggcatt gcctggtgac gaccggcccg ctgtggaaac aagcgatccg taaagcgtca 2700gggtcagaga ttcggcctga gggcgggagc tggatacggg tggtgcttgc catgcctgac 2760ggcaatttca gggcaggatg ggcaccccac gcccaagtaa ccgctggtgc gctggatatt 2820tcggcggtgg tctcgtgagc ggggagaagc ggcacagcga ggccggcccg gtagaaatca 2880tctttttgat gctggcagtc agggcggggg actacatcgt cgccgtgact gcggttctcg 2940cggtcgggtt cttcgcggtc gcggttgagg gtttctggtt cctggtcgtc gcagtcatcg 3000ctgcaccggc gtggtggttt ctgcgcgact gggaatcgaa gcggagggcc gtacgggtct 3060ttgaacgggc atggaagggg acacctgaat cccccggtat tgctctctcc cttggcctgt 3120cgaacgtggc ggggtctctg ccgaggttga ggaagtttga aactggttcg gggatacgca 3180cactcgtgtt ttctttgccg cccggagtca ctgccgagag ctttgagaaa gttcgccctg 3240cgctggcaga cgcgatgggg ggtcaccgct gccaagtaga gaaggtggcc cccggacagg 3300tccgcgtcag agtgattgat gaggattcga tgaagacgcc gcgtgatgcg ggatgggcga 3360aagatgttgt gctggaagag gatacgttcg acggtcttcc gggcgagacg cgatcctggt 3420tcgagcaaga ggggccggca tcatgagaaa atcggcggga gtatctcgga ttcctatccg 3480tctcgggcgc tctcagtacg gggaagacgt tggattcgat ctcgctgcgg 
acgccgctca 3540catcgccatg cagggcaaaa cccgatccgg caaaagtcag gcgacgtaca acgtgttagc 3600tcaggcagca gcgaacgcgg cggttcgagt cgtagggtcc gacccgacac acgtactcct 3660ggagcccttc aaacatcgag gggtgtccga gccttacgtg gtttcgggac tgaatgcgca 3720ggccacggtg gacatgctgg gctgggtcaa gcgtgagtct gatcgtcgca tcgaccagat 3780gtggcccctg cgtaccgaca agttttccga gttcggggct tcgttcccgc tgatactcgt 3840cgtgctcgaa gagtttcccg ggatcctcga gggggcagcg gacgaagacg ccgcgttagg 3900ccgaaaacct gccgagcgtc tcgcaccccg catttcggcc tacgtgcgtc agatagcagc 3960gcagtcggca aaggctggaa ttcgccttct cctgctctcg caacgagcgg aggcctcgat 4020cattggcggc aatgcgcgtt cgaatttcgg ggtcaagatg actctgaggg tggacgaacc 4080ggagtcggtg agaatgcttc atccgagcgc ttccccggaa gactgtgccc tggtcgagac 4140cttcaagcct ggtacctgcc ttttcgagaa gccaggagaa ggccggcaga ttatgcgatg 4200cgactttgtc ggcgagtacg ggagatatgc gcgagccatc gagtcttcgg atctgcgttt 4260tctcgccacc ctccagcaag accaggccca acgcgaattc ttcgctgagg agttcggtgt 4320ggtggatccg tcatgactgg accacaggag agaaagcgca aggcggcgaa gccgtcgcgg 4380gagcctcagt tgaactgctg tgaagcggac gtgccgaaac gagcaaaaca gcccccggtt 4440ccctctacgt tcgacctgct cacggtgaag gagactgcgg ggctgctgag agtcagtcag 4500gcaactcttt accggctgct tcggagtggg gaaggaccca catacacacg gatcggtgga 4560cagatacgcg ttcaccgcga gtcgctgcgt cggttcatcg aaccgcgtgg ataacgtcac 4620agagacagcg aaaacgcctc ccctgggtca atccggttac cgccggactg ggggaggcgc 4680ttcgacacct acatccgtcg cccctcgaaa ggctcagatg cacttccacg ataacgcaga 4740ggtcggacaa gagggaagaa ctgccgttct ctcgccgttg cgcggcgtag ccgccaagcg 4800ggacgtgtct gacgatgcag cgaagcggag tcggcaggcg cggcacgcgc ctgggcttgt 4860tacatctgcc acaactgtcc gtgaatctct gccagctcct gaaaccgctg gtcagggcct 4920tgcggaatcc gtgaccgctg atgatttttg gtctcattcg ttcccccgcg ctgacgatgt 4980acgcggcgca gctgcttcct tccagtcggt ggctaactgg gatgggcgtg agggtccgag 5040gccgcgtttc gttgtcgcgc ctggcgttgt ccgcttggag gtttgtgatc tcgcacgccg 5100cgaacgaacg gctgaacgtg cgtatctggc tgctcgggct cgggtggata tggcggctgc 5160caggcataac tcgccgtacg acttcgacgt ggacgatgaa gagttggcgg aactggcttc 5220tctgcaaggc ctcgaggacg 
acgacattgg gggctggtct gcggagaggg aaatagtggg 5280ctggtctgct cgttctcggt cacggatgat cttgcgaatg gcagaactcg actgggctcc 5340catgatggat ttgccgggca ttcctgcgat ggtgaccctc acctatccgg gggactggct 5400tacggttgcc cccaccggcg ctgaggtcaa aaaacatctc cagacgttct tcaaacggtt 5460ccaacgggcc tggggcattg cctggatggg tgcgtggaaa atggagttcc aaagccgagg 5520cgctccgcat tttcacctgt acatggtccc tcctcatggg aaggcaggag actcgcggaa 5580gctgcggcat gatgctgagc tcttgaaatg ggagatagca cgtgcagagg gtgaagaccc 5640aggtcgcagg ccgtatttcc gggaagctcc aagcgatgga ttgaagtttc gtccgtggct 5700ttctgcggtg tgggccgacg tcgtagatca tccggacccc aaggaaaaag aaaagcacgt 5760cagtgccggc actggagtgg actacgcgga gggcacgcga gggtcagatc cgaaaaggct 5820tgcggtgtac ttctccaagc atggaacctt tgccgacaag gaatatcagc acgtagttcc 5880tgctcaatgg cagaaaacgg gtgcgggacc tggcaggttc tggggctacc gcggtttgtc 5940gccggccacg gctgccaccg agatttcctg ggatgagtac ctgcttttat ctcgcacgtt 6000gcgacgattg tcagcgcgaa cgaagatctg ggacccggct ttacgaggcg gtagcggcgg 6060ccacagatgg actaaggcga tgatgcgacg cacggttacc cggcaccgct tggacctcgt 6120gaccggtgag attctgggca cgaagacgcg gaaggttcgg gcgccagtga agaggtttgt 6180ccggacttcg ggatacctgt gtgtcaatga cgggcccgca ctggctcgaa ccctcagccg 6240tcttcgtaca agctgcctga gctagacgcg cggaacgcct ttcggctttg tcttttgctg 6300gatggcgggt tttgggcggc ttctggtgat gcgctgctgc gctccgtggg gagagagacc 6360caacgactga cctatctcta cccaggtgca attcatctcc cgcgctctgt cggctaggta 6420aacgaggtgc tcccgcgcga gcttttccat gtggtcggcc aatgtcagct cggtcaggac 6480aacctgctgt tgttgcgata gttgtgtccg cacgggtcga ttgtcttctg ttgcggcata 6540acggttttcg tcgttcgcgg agagtgcggc taaatgaatt gcatcctcga ttgagcggag 6600catttcgacg cggaacctgg cgatgatgtt gtctctgtct tcattcataa ctgaagcgta 6660ttgggagtgt tgccctccca ccatgtgtgc caatgcaggt gtgaactgag tcacagtttc 6720tcaatagact ccaagtttgt gatcctttta ctcccaaaat ggggcatgat gtgtgcgtgc 6780ctcggttcag gggcgaaagt tcgacacctc gaaagaaggc ctcgacatgg ctttgaaagc 6840tgctggcaac gtgattcctg attcctccgc gtacgagtac cgggcggttc aggtcgagcc 6900gaagatggtc agaaaagacc cggaagaccc gaactctgag cagttccaga 
agcagaagga 6960cggcacgccg gtgtggtcga tcgactgcat tcgggtcgac cgggcatcag gcaacaaggc 7020aatcgtgacc gtgacggttc cggacgtgat ggaaccggat gttgcggggc cggtggagtt 7080ctccgagatg attgccggtt tctgggtttc gcgcagtggt tcgggcatgt ggttttcggc 7140aagcgccgtc gcttctctct gatcgctgat cgtcgcccct cgaaaggttc ggaaatgtcc 7200aaaggaaaag gcgttgcgct gggtgtgggt gccctcgtgc tcgtgtttgt gctggttgcg 7260gcaggttggc aagcggcgaa cgtgttcagt gatcgttcac agtccgaagc tgtgccgctg 7320agagtgccgg ccgatccgaa gtgggaaaac ggggtgttct cggacgttgc cgggtgcctc 7380gttctctctc cggaagagct ggggccgttc agcggagggc agtacatcga catagtgagg 7440ccagttgagc cggagaggtt ggagcgcgac tgggtgaggt cggctgagtg cgtttcggcg 7500tcgatgaatg tctctgacct gttggtttct gctcttccag agtccacccg tccccccggc 7560gatttcgttc gttcgtggaa agtggcgagt gatgattact gctatgaggg tgataacccg 7620caaggctgca cttctcgtat gccggtttgg gtctctgcaa aaaactggtg gtgca 7675

SEQ ID NO: 95 (length 8497, DNA, Artificial, pRET1001)

ccgtccacca cccggtgcct ggtctgcgtc tccctcggct cgttcctcgc ctatcctggt 60gaccagacac cggagcgagc tatgcccagg gttgcgcagt gacttcgtca ctgcgtaacc 120ctgggcgctc gcctcccatt cgcttcgctc acaggagggg gccgtcgatg gccgctgacg 180ctgcatctga cgaccggcgg accgaggtcc gcgccgctgc ttcgcgggcc gctgacgcgg 240ccccggcgaa gcgcacccgc accgtggcgg tgcggctgac cgatggggag gaggccgcgt 300ggatcgacgc cgcgctggcc gatggccacc ggcagctcgg ggcgtgggtg cgtgagcggg 360cggtggccgg ctatctcggg aaggtccgcc cgaagaccgg cagtggaatg tcggcggagg 420cggccgcgga ggtcgccgcg atgcggcagc agatgacgaa ggtggggaac aacctgaacc 480agatcgcgag ggcgatcaac gccgggcagg tgccgtcgca gatggccgag tccctgcaga 540aggggtggct ggagaggtgg gggcaggagt
tggggcggat ggcggatcgg ctcgacgcgc 600tcgacgacca gggctgacgt gatcgcgaag atcagcacgg gcagcgaccc gaaggggttg 660gcggcgtatc tgcacgggcc ggggaaggcc accccgcaca gctaccgcac cgaggcgggc 720cggctgattg ccggcgggac ggtgatcgcg ggatcggtgc aggtcaccgc caaaaacccg 780acccggtggg ggcgggactt cgagcgggcc gccgcgacga acgcgcgggt gggtaagccg 840gtgtggcatt gctcgctgcg gtgcgcgccc ggggatcggc ggctgaccga taccgagttc 900gcggacatcg cgcagacggt cgccgagcgg atgggcttcg agagtcatcc gtgggtggcg 960gtgcggcacg acgacgacca catccacctg gctgtctccc gggtcgattt tcagggcgtg 1020acctggaaga acagcaacga ccggtggaag gtcgtcgagg tgatgcgcga ggtcgaacgc 1080gcgcacggcc tgatcgaggt ggcgagcccg gagcgggccc gtggccggca agccagcagc 1140ggcgagcaac gccgcgcggt gcggaccggc aaggtggcgc agcgggacgg tctgagggaa 1200attgtgaccg ccgcccgcga catcgccgca ggccagggtg tgggggcgtt cgaagtggcg 1260ctcgtacaga acccgattac ccgagtgcag gtgcggcgca acgtcgcgaa gacgggccgg 1320atgaatggct acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg 1380ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc 1440agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc 1500gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc 1560tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc 1620gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag 1680caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta 1740cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc 1800cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa 1860tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg 1920tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac 1980gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa 2040acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc 2100agagcggtcc ggcgcgtgat cgcggatctc acccccacca aacccgccga ggtcaaaccc 2160ccggcccgcc gccagggacc aaccatgccg cagtcggccc cggggtatca gccacccggc 2220cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag agagcgagaa ggctttcgtg 
2280gagcgtaggg aacagacgca ggcctggcga agcatgtcca agaacaccat cgatcgctag 2340aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa cgcgcgaaag gtgcatacct 2400cccatagcat cggcgcgtat ggtagggaaa atgatcttca aacgtattgc tgtggtcgtg 2460ctcgctggtg gggctttggt agtgggaggc agccaggttg ctggtgctac cacggtttca 2520gctccacagc cgagtccttc agcagcggtg gtgccgacgg ttcttccacc agtcactttc 2580accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg attcccggcg atgccgtctg 2640attccacttc cacagggccg agcgatctgc tgggcggcag ccgctgcccg ttacgcagcg 2700tgccgcgccg gaaactaggt agaacgtgag catggacgag cttcccacct tcatcgccga 2760cgacatcgtg atggccagaa cgttcgacag ccctaacggc caggtggtgc tcgaggtgaa 2820cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac tactgctgca ccttccggat 2880cagcgggaac atggatgccc cttacgacgg attcggtggc ggcgtcgacg cagtgcaggc 2940gctgctactc gcattggcca tggcacacga ggaacttcgt caaacttcgc cagagttgac 3000gtttctaggc gagacgaacc tcggtctacc ggtcttgaac atcaagcccg acaacgcgat 3060cgaagccgtg gtctcattcc ccgctccctg atgtgacgca ctttcacccc tggcactcat 3120gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg cttcgcgttg acttgccact 3180gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg gttcatgaca ccgctaacac 3240gctgcggaaa tgcggattga actagttcat ttggggaacg atgacctgat gaccggggat 3300cgtgacctac ccatgctgac catcgccgag gcggtggacg cgacgcagac cagtgagagc 3360acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg cggtccgcac tgccgacggg 3420aagtggatga ttcccctcgg tgacctatca gcggcagggc tgagaccagg gaaaatggcg 3480aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg acctggcagc tgagaacgcc 3540gagctccgtc agcgcctggc cgtggccgaa gccctggcca gcgaacgcaa tcggatcatc 3600gacgtgcagc aacagatgct ccggatgctc gaagcccggc cggtgtcggc cctggagccc 3660gcggcggttc cagtggcggg tccgccgccg cccgtcccgg ccgccgatgg tcgggcagct 3720acgggcgccc tggcccggat acgtcgacgg cttctcggct aggagctgac cgcgtacttg 3780cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga ttcccatctc ctcggcgatc 3840tcgcggtact tcaggccctg ctcgcgcagc tcgacggccc ggcgacggtt ctcggctgcc 3900cgtgcgagga actggtcccg cggctcggcc atgatgcgct ggatcgtgcg cgtggaggcc 3960cccatcttct cggccagctc gcgagctgtc 
tgcttgcggc ggatcggtcg ttcagcgccc 4020acggtctgcc tcccacaatg cgttccggtc gaccttcgtc gctcgtttcc ggtttgcctc 4080gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt atggcgatga atgtggcctc 4140gtagacagca gggccgtcgg cccacatccg ggactttgta gtgatccagc gggtaatgga 4200ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg ggcctgcctg tgaccgggtt 4260cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc gcggccccga ggccggtcgg 4320gtcgccccag tggaagcgga tttcgcggta ggcccaggtg cgtgcggttt cgaacagggc 4380gcagtttcgg ccgaggccga tcgggttctc acggcgcgat cgggtttgcc gccagcgcgt 4440tggcggcatg tggatgccga gttccgcctc gagctcggcg agggatcgcc gctcggtgtg 4500cagccaatgg gtgtcccagt caccgtgagt cgggttcttg gtcatcaggc ccgaatagcc 4560cttgtccccc tggacggcgc gccggaggcc ttcggtgacg gcggccgcat aggcgagcgg 4620cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc agcgcccaca cagcgtgtgc 4680gtgcccgtta cgggggttct ccacgatcgc gttcggcaga ggatgattcc cggccgccga 4740cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg acgagcaggt tgctcaatgc 4800ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct gatcgccgca tccggtagac 4860gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt agccacagct gttcccaggc 4920gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca ctcacacaca agatcgattt 4980attccggtac gacacgccag ccaagtcaga tgtttcggtt tctggagcgg tcctccagac 5040ctttgagatc cgctccagaa acgtccacaa attattgggg tacgtcgaac caagccttat 5100caggtatccc ggggttccgg gggtgaacac caccctccga ccggtccaga atccgtcgat 5160ctcacctatc cgctcgaagt ccttgagtca gtgacaggac cactgctggg ctcccagcgc 5220agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg ggtacggcat gaggtccttc 5280agaagcggcg tcgacgccag gcccacacgc acaatccgct tcccacgagg gacaccaccg 5340gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg gtactgctcg tttgacagga 5400actgcagggt cggtgagctc gcgctgggcg gatcccacca gtagctcccc gtgccggtaa 5460ccgcttgggg ccaagcgaag acacccaccg cggcagcgat ggcaatgcac gtggatggga 5520acaccaccca gaaccaggga aatcctggtg ccggcccgag acgatcccgg cgcggtaaga 5580ccacaccggc caccatcgcc acggcccccg acgcaacaag caataaccac cccatgagcg 5640gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc gccagcccgt gaccggaccg 
5700gcgaagcgtg ccgctgggcg gcccgccgtg gcgcccgtcc cgtgcccgtt ctgaccggtg 5760gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca gcccgtgacc ggacctgcag 5820gcatgcaagc ttggcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt 5880tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 5940ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat 6000gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag 6060tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga 6120cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc 6180cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg 6240cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc 6300aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 6360ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 6420aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 6480ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 6540gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 6600ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 6660ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 6720gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 6780aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 6840gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 6900aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 6960caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 7020tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 7080acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 7140gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 7200agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 7260gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 7320ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 7380taatctcatg accaaaatcc cttaacgtga 
gttttcgttc cactgagcgt cagaccccgt 7440agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 7500aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 7560ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 7620gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 7680aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 7740aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 7800gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 7860aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 7920aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 7980cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 8040cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 8100tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 8160tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 8220ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 8280atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 8340tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 8400gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 8460cgaattcgag ctcggtaccc ggggatcctc tagagtc 8497968487DNAArtificialpRET1002 96ccgtccacca cccggtgcct ggtctgcgtc tccctcggct cgttcctcgc ctatcctggt 60gaccagacac cggagcgagc tatgcccagg gttgcgcagt gacttcgtca ctgcgtaacc 120ctgggcgctc gcctcccatt cgcttcgctc acaggagggg gccgtcgatg gccgctgacg 180ctgcatctga cgaccggcgg accgaggtcc gcgccgctgc ttcgcgggcc gctgacgcgg 240ccccggcgaa gcgcacccgc accgtggcgg tgcggctgac cgatggggag gaggccgcgt 300ggatcgacgc cgcgctggcc gatggccacc ggcagctcgg ggcgtgggtg cgtgagcggg 360cggtggccgg ctatctcggg aaggtccgcc cgaagaccgg cagtggaatg tcggcggagg 420cggccgcgga ggtcgccgcg atgcggcagc agatgacgaa ggtggggaac aacctgaacc 480agatcgcgag ggcgatcaac gccgggcagg tgccgtcgca gatggccgag tccctgcaga 540aggggtggct ggagaggtgg gggcaggagt tggggcggat ggcggatcgg ctcgacgcgc 
600tcgacgacca gggctgacgt gatcgcgaag atcagcacgg gcagcgaccc gaaggggttg 660gcggcgtatc tgcacgggcc ggggaaggcc accccgcaca gctaccgcac cgaggcgggc 720cggctgattg ccggcgggac ggtgatcgcg ggatcggtgc aggtcaccgc caaaaacccg 780acccggtggg ggcgggactt cgagcgggcc gccgcgacga acgcgcgggt gggtaagccg 840gtgtggcatt gctcgctgcg gtgcgcgccc ggggatcggc ggctgaccga taccgagttc 900gcggacatcg cgcagacggt cgccgagcgg atgggcttcg agagtcatcc gtgggtggcg 960gtgcggcacg acgacgacca catccacctg gctgtctccc gggtcgattt tcagggcgtg 1020acctggaaga acagcaacga ccggtggaag gtcgtcgagg tgatgcgcga ggtcgaacgc 1080gcgcacggcc tgatcgaggt ggcgagcccg gagcgggccc gtggccggca agccagcagc 1140ggcgagcaac gccgcgcggt gcggaccggc aaggtggcgc agcgggacgg tctgagggaa 1200attgtgaccg ccgcccgcga catcgccgca ggccagggtg tgggggcgtt cgaagtggcg 1260ctcgtacaga acccgattac ccgagtgcag gtgcggcgca acgtcgcgaa gacgggccgg 1320atgaatggct acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg 1380ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc 1440agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc 1500gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc 1560tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc 1620gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag 1680caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta 1740cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc 1800cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa 1860tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg 1920tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac 1980gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa 2040acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc 2100agagcggtcc ggcgcgtgat cgcggatctc acccccacca aacccgccga ggtcaaaccc 2160ccggcccgcc gccagggacc aaccatgccg cagtcggccc cggggtatca gccacccggc 2220cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag agagcgagaa ggctttcgtg 2280gagcgtaggg aacagacgca ggcctggcga 
agcatgtcca agaacaccat cgatcgctag 2340aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa cgcgcgaaag gtgcatacct 2400cccatagcat cggcgcgtat ggtagggaaa atgatcttca aacgtattgc tgtggtcgtg 2460ctcgctggtg gggctttggt agtgggaggc agccaggttg ctggtgctac cacggtttca 2520gctccacagc cgagtccttc agcagcggtg gtgccgacgg ttcttccacc agtcactttc 2580accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg attcccggcg atgccgtctg 2640attccacttc cacagggccg agcgatctgc tgggcggcag ccgctgcccg ttacgcagcg 2700tgccgcgccg gaaactaggt agaacgtgag catggacgag cttcccacct tcatcgccga 2760cgacatcgtg atggccagaa cgttcgacag ccctaacggc caggtggtgc tcgaggtgaa 2820cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac tactgctgca ccttccggat 2880cagcgggaac atggatgccc cttacgacgg attcggtggc ggcgtcgacg cagtgcaggc 2940gctgctactc gcattggcca tggcacacga ggaacttcgt caaacttcgc cagagttgac 3000gtttctaggc gagacgaacc tcggtctacc ggtcttgaac atcaagcccg acaacgcgat 3060cgaagccgtg gtctcattcc ccgctccctg atgtgacgca ctttcacccc tggcactcat 3120gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg cttcgcgttg acttgccact 3180gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg gttcatgaca ccgctaacac 3240gctgcggaaa tgcggattga actagttcat ttggggaacg atgacctgat gaccggggat 3300cgtgacctac ccatgctgac catcgccgag gcggtggacg cgacgcagac cagtgagagc 3360acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg cggtccgcac tgccgacggg 3420aagtggatga ttcccctcgg tgacctatca gcggcagggc tgagaccagg gaaaatggcg 3480aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg acctggcagc tgagaacgcc 3540gagctccgtc agcgcctggc cgtggccgaa gccctggcca gcgaacgcaa tcggatcatc 3600gacgtgcagc aacagatgct ccggatgctc gaagcccggc cggtgtcggc cctggagccc 3660gcggcggttc cagtggcggg tccgccgccg cccgtcccgg ccgccgatgg tcgggcagct 3720acgggcgccc tggcccggat acgtcgacgg cttctcggct aggagctgac cgcgtacttg 3780cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga ttcccatctc ctcggcgatc 3840tcgcggtact tcaggccctg ctcgcgcagc tcgacggccc ggcgacggtt ctcggctgcc 3900cgtgcgagga actggtcccg cggctcggcc atgatgcgct ggatcgtgcg cgtggaggcc 3960cccatcttct cggccagctc gcgagctgtc tgcttgcggc ggatcggtcg ttcagcgccc 
4020acggtctgcc tcccacaatg cgttccggtc gaccttcgtc gctcgtttcc ggtttgcctc 4080gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt atggcgatga atgtggcctc 4140gtagacagca gggccgtcgg cccacatccg ggactttgta gtgatccagc gggtaatgga 4200ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg ggcctgcctg tgaccgggtt 4260cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc gcggccccga ggccggtcgg 4320gtcgccccag tggaagcgga tttcgcggta ggcccaggtg cgtgcggttt cgaacagggc 4380gcagtttcgg ccgaggccga tcgggttctc acggcgcgat cgggtttgcc gccagcgcgt 4440tggcggcatg tggatgccga gttccgcctc gagctcggcg agggatcgcc gctcggtgtg 4500cagccaatgg gtgtcccagt caccgtgagt cgggttcttg gtcatcaggc ccgaatagcc 4560cttgtccccc tggacggcgc gccggaggcc ttcggtgacg gcggccgcat aggcgagcgg 4620cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc agcgcccaca cagcgtgtgc 4680gtgcccgtta cgggggttct ccacgatcgc gttcggcaga ggatgattcc cggccgccga 4740cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg acgagcaggt tgctcaatgc 4800ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct gatcgccgca tccggtagac 4860gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt agccacagct gttcccaggc 4920gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca ctcacacaca agatcgattt 4980attccggtac gacacgccag ccaagtcaga tgtttcggtt tctggagcgg tcctccagac 5040ctttgagatc cgctccagaa acgtccacaa attattgggg tacgtcgaac caagccttat 5100caggtatccc ggggttccgg gggtgaacac caccctccga ccggtccaga atccgtcgat 5160ctcacctatc cgctcgaagt ccttgagtca gtgacaggac cactgctggg ctcccagcgc 5220agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg ggtacggcat gaggtccttc 5280agaagcggcg tcgacgccag gcccacacgc acaatccgct tcccacgagg gacaccaccg 5340gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg gtactgctcg tttgacagga 5400actgcagggt cggtgagctc gcgctgggcg gatcccacca gtagctcccc gtgccggtaa 5460ccgcttgggg ccaagcgaag acacccaccg cggcagcgat ggcaatgcac gtggatggga 5520acaccaccca gaaccaggga aatcctggtg ccggcccgag acgatcccgg cgcggtaaga 5580ccacaccggc caccatcgcc acggcccccg acgcaacaag caataaccac cccatgagcg 5640gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc gccagcccgt gaccggaccg 5700gcgaagcgtg ccgctgggcg gcccgccgtg 
gcgcccgtcc cgtgcccgtt ctgaccggtg 5760gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca gcccgtgacc ggacctgcag 5820gcatgcaagc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt gttatccgct 5880cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg gtgcctaatg 5940agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt cgggaaacct 6000gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg 6060cgaacttttg ctgagttgaa ggatcagatc acgcatcttc ccgacaacgc agaccgttcc 6120gtggcaaagc aaaagttcaa aatcagtaac cgtcagtgcc gataagttca aagttaaacc 6180tggtgttgat accaacattg aaacgctgat cgaaaacgcg ctgaaaaacg ctgctgaatg 6240tgcgagcttc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc 6300gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg 6360caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt 6420tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa 6480gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct 6540ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc 6600cttcgggaag cgtggcgctt tctcaatgct cacgctgtag gtatctcagt tcggtgtagg 6660tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct 6720tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag 6780cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga 6840agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga 6900agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg 6960gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag 7020aagatccttt gatcttttct acggggtctg acgctcagtg gaacgatccg tcgagaggtc
7080tgcctcgtga agaaggtgtt gctgactcat accaggcctg aatcgcccca tcatccagcc 7140agaaagtgag ggagccacgg ttgatgagag ctttgttgta ggtggaccag ttggtgattt 7200tgaacttttg ctttgccacg gaacggtctg cgttgtcggg aagatgcgtg atctgatcct 7260tcaactcagc aaaagttcga tttattcaac aaagccacgt tgtgtctcaa aatctctgat 7320gttacattgc acaagataaa aatatatcat catgaacaat aaaactgtct gcttacataa 7380acagtaatac aaggggtgtt atgagccata ttcaacggga aacgtcttgc tcgaagccgc 7440gattaaattc caacatggat gctgatttat atgggtataa atgggctcgc gataatgtcg 7500ggcaatcagg tgcgacaatc tatcgattgt atgggaagcc cgatgcgcca gagttgtttc 7560tgaaacatgg caaaggtagc gttgccaatg atgttacaga tgagatggtc agactaaact 7620ggctgacgga atttatgcct cttccgacca tcaagcattt tatccgtact cctgatgatg 7680catggttact caccactgcg atccccggga aaacagcatt ccaggtatta gaagaatatc 7740ctgattcagg tgaaaatatt gttgatgcgc tggcagtgtt cctgcgccgg ttgcattcga 7800ttcctgtttg taattgtcct tttaacagcg atcgcgtatt tcgtctcgct caggcgcaat 7860cacgaatgaa taacggtttg gttgatgcga gtgattttga tgacgagcgt aatggctggc 7920ctgttgaaca agtctggaaa gaaatgcata agcttttgcc attctcaccg gattcagtcg 7980tcactcatgg tgatttctca cttgataacc ttatttttga cgaggggaaa ttaataggtt 8040gtattgatgt tggacgagtc ggaatcgcag accgatacca ggatcttgcc atcctatgga 8100actgcctcgg tgagttttct ccttcattac agaaacggct ttttcaaaaa tatggtattg 8160ataatcctga tatgaataaa ttgcagtttc atttgatgct cgatgagttt ttctaatcag 8220aattggttaa ttggttgtaa cactggcaga gcattacgct gacttgacgg gacggcggct 8280ttgttgaata aatcgcattc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 8340cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 8400tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 8460ctcggtaccc ggggatcctc tagagtc 8487978038DNAArtificialpRET1003 97ccgtccacca cccggtgcct ggtctgcgtc tccctcggct cgttcctcgc ctatcctggt 60gaccagacac cggagcgagc tatgcccagg gttgcgcagt gacttcgtca ctgcgtaacc 120ctgggcgctc gcctcccatt cgcttcgctc acaggagggg gccgtcgatg gccgctgacg 180ctgcatctga cgaccggcgg accgaggtcc gcgccgctgc ttcgcgggcc gctgacgcgg 240ccccggcgaa gcgcacccgc accgtggcgg tgcggctgac 
cgatggggag gaggccgcgt 300ggatcgacgc cgcgctggcc gatggccacc ggcagctcgg ggcgtgggtg cgtgagcggg 360cggtggccgg ctatctcggg aaggtccgcc cgaagaccgg cagtggaatg tcggcggagg 420cggccgcgga ggtcgccgcg atgcggcagc agatgacgaa ggtggggaac aacctgaacc 480agatcgcgag ggcgatcaac gccgggcagg tgccgtcgca gatggccgag tccctgcaga 540aggggtggct ggagaggtgg gggcaggagt tggggcggat ggcggatcgg ctcgacgcgc 600tcgacgacca gggctgacgt gatcgcgaag atcagcacgg gcagcgaccc gaaggggttg 660gcggcgtatc tgcacgggcc ggggaaggcc accccgcaca gctaccgcac cgaggcgggc 720cggctgattg ccggcgggac ggtgatcgcg ggatcggtgc aggtcaccgc caaaaacccg 780acccggtggg ggcgggactt cgagcgggcc gccgcgacga acgcgcgggt gggtaagccg 840gtgtggcatt gctcgctgcg gtgcgcgccc ggggatcggc ggctgaccga taccgagttc 900gcggacatcg cgcagacggt cgccgagcgg atgggcttcg agagtcatcc gtgggtggcg 960gtgcggcacg acgacgacca catccacctg gctgtctccc gggtcgattt tcagggcgtg 1020acctggaaga acagcaacga ccggtggaag gtcgtcgagg tgatgcgcga ggtcgaacgc 1080gcgcacggcc tgatcgaggt ggcgagcccg gagcgggccc gtggccggca agccagcagc 1140ggcgagcaac gccgcgcggt gcggaccggc aaggtggcgc agcgggacgg tctgagggaa 1200attgtgaccg ccgcccgcga catcgccgca ggccagggtg tgggggcgtt cgaagtggcg 1260ctcgtacaga acccgattac ccgagtgcag gtgcggcgca acgtcgcgaa gacgggccgg 1320atgaatggct acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg 1380ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc 1440agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc 1500gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc 1560tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc 1620gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag 1680caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta 1740cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc 1800cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa 1860tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg 1920tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac 1980gccgcaggtc aggccgacac 
cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa 2040acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc 2100agagcggtcc ggcgcgtgat cgcggatctc acccccacca aacccgccga ggtcaaaccc 2160ccggcccgcc gccagggacc aaccatgccg cagtcggccc cggggtatca gccacccggc 2220cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag agagcgagaa ggctttcgtg 2280gagcgtaggg aacagacgca ggcctggcga agcatgtcca agaacaccat cgatcgctag 2340aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa cgcgcgaaag gtgcatacct 2400cccatagcat cggcgcgtat ggtagggaaa atgatcttca aacgtattgc tgtggtcgtg 2460ctcgctggtg gggctttggt agtgggaggc agccaggttg ctggtgctac cacggtttca 2520gctccacagc cgagtccttc agcagcggtg gtgccgacgg ttcttccacc agtcactttc 2580accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg attcccggcg atgccgtctg 2640attccacttc cacagggccg agcgatctgc tgggcggcag ccgctgcccg ttacgcagcg 2700tgccgcgccg gaaactaggt agaacgtgag catggacgag cttcccacct tcatcgccga 2760cgacatcgtg atggccagaa cgttcgacag ccctaacggc caggtggtgc tcgaggtgaa 2820cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac tactgctgca ccttccggat 2880cagcgggaac atggatgccc cttacgacgg attcggtggc ggcgtcgacg cagtgcaggc 2940gctgctactc gcattggcca tggcacacga ggaacttcgt caaacttcgc cagagttgac 3000gtttctaggc gagacgaacc tcggtctacc ggtcttgaac atcaagcccg acaacgcgat 3060cgaagccgtg gtctcattcc ccgctccctg atgtgacgca ctttcacccc tggcactcat 3120gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg cttcgcgttg acttgccact 3180gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg gttcatgaca ccgctaacac 3240gctgcggaaa tgcggattga actagttcat ttggggaacg atgacctgat gaccggggat 3300cgtgacctac ccatgctgac catcgccgag gcggtggacg cgacgcagac cagtgagagc 3360acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg cggtccgcac tgccgacggg 3420aagtggatga ttcccctcgg tgacctatca gcggcagggc tgagaccagg gaaaatggcg 3480aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg acctggcagc tgagaacgcc 3540gagctccgtc agcgcctggc cgtggccgaa gccctggcca gcgaacgcaa tcggatcatc 3600gacgtgcagc aacagatgct ccggatgctc gaagcccggc cggtgtcggc cctggagccc 3660gcggcggttc cagtggcggg tccgccgccg cccgtcccgg ccgccgatgg 
tcgggcagct 3720acgggcgccc tggcccggat acgtcgacgg cttctcggct aggagctgac cgcgtacttg 3780cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga ttcccatctc ctcggcgatc 3840tcgcggtact tcaggccctg ctcgcgcagc tcgacggccc ggcgacggtt ctcggctgcc 3900cgtgcgagga actggtcccg cggctcggcc atgatgcgct ggatcgtgcg cgtggaggcc 3960cccatcttct cggccagctc gcgagctgtc tgcttgcggc ggatcggtcg ttcagcgccc 4020acggtctgcc tcccacaatg cgttccggtc gaccttcgtc gctcgtttcc ggtttgcctc 4080gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt atggcgatga atgtggcctc 4140gtagacagca gggccgtcgg cccacatccg ggactttgta gtgatccagc gggtaatgga 4200ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg ggcctgcctg tgaccgggtt 4260cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc gcggccccga ggccggtcgg 4320gtcgccccag tggaagcgga tttcgcggta ggcccaggtg cgtgcggttt cgaacagggc 4380gcagtttcgg ccgaggccga tcgggttctc acggcgcgat cgggtttgcc gccagcgcgt 4440tggcggcatg tggatgccga gttccgcctc gagctcggcg agggatcgcc gctcggtgtg 4500cagccaatgg gtgtcccagt caccgtgagt cgggttcttg gtcatcaggc ccgaatagcc 4560cttgtccccc tggacggcgc gccggaggcc ttcggtgacg gcggccgcat aggcgagcgg 4620cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc agcgcccaca cagcgtgtgc 4680gtgcccgtta cgggggttct ccacgatcgc gttcggcaga ggatgattcc cggccgccga 4740cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg acgagcaggt tgctcaatgc 4800ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct gatcgccgca tccggtagac 4860gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt agccacagct gttcccaggc 4920gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca ctcacacaca agatcgattt 4980attccggtac gacacgccag ccaagtcaga tgtttcggtt tctggagcgg tcctccagac 5040ctttgagatc cgctccagaa acgtccacaa attattgggg tacgtcgaac caagccttat 5100caggtatccc ggggttccgg gggtgaacac caccctccga ccggtccaga atccgtcgat 5160ctcacctatc cgctcgaagt ccttgagtca gtgacaggac cactgctggg ctcccagcgc 5220agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg ggtacggcat gaggtccttc 5280agaagcggcg tcgacgccag gcccacacgc acaatccgct tcccacgagg gacaccaccg 5340gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg gtactgctcg tttgacagga 5400actgcagggt cggtgagctc 
gcgctgggcg gatcccacca gtagctcccc gtgccggtaa 5460ccgcttgggg ccaagcgaag acacccaccg cggcagcgat ggcaatgcac gtggatggga 5520acaccaccca gaaccaggga aatcctggtg ccggcccgag acgatcccgg cgcggtaaga 5580ccacaccggc caccatcgcc acggcccccg acgcaacaag caataaccac cccatgagcg 5640gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc gccagcccgt gaccggaccg 5700gcgaagcgtg ccgctgggcg gcccgccgtg gcgcccgtcc cgtgcccgtt ctgaccggtg 5760gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca gcccgtgacc ggacctgcag 5820gcatgcaagc ttggcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt 5880tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 5940ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat gagcttcttc 6000cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 6060tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 6120gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 6180ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 6240aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 6300tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 6360ggcgctttct caatgctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa 6420gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta 6480tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa 6540caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa 6600ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt 6660cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt 6720ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat 6780cttttctacg gggtctgacg ctcagtggaa ctccgtcgaa cggaagatca cttcgcagaa 6840taaataaatc ctggtgtccc tgttgatacc gggaagccct gggccaactt ttggcgaaaa 6900tgagacgttg atcggcacgt aagaggttcc aactttcacc ataatgaaat aagatcacta 6960ccgggcgtat tttttgagtt atcgagattt tcaggagcta aggaagctaa aatggagaaa 7020aaaatcactg gatataccac cgttgatata tcccaatggc atcgtaaaga acattttgag 7080gcatttcagt cagttgctca atgtacctat aaccagaccg ttcagctgga 
tattacggcc 7140tttttaaaga ccgtaaagaa aaataagcac aagttttatc cggcctttat tcacattctt 7200gcccgcctga tgaatgctca tccggaattt cgtatggcaa tgaaagacgg tgagctggtg 7260atatgggata gtgttcaccc ttgttacacc gttttccatg agcaaactga aacgttttca 7320tcgctctgga gtgaatacca cgacgatttc cggcagtttc tacacatata ttcgcaagat 7380gtggcgtgtt acggtgaaaa cctggcctat ttccctaaag ggtttattga gaatatgttt 7440ttcgtctcag ccaatccctg ggtgagtttc accagttttg atttaaacgt ggccaatatg 7500gacaacttct tcgcccccgt tttcaccatg ggcaaatatt atacgcaagg cgacaaggtg 7560ctgatgccgc tggcgattca ggttcatcat gccgtctgtg atggcttcca tgtcggcaga 7620atgcttaatg aattacaaca gtactgcgat gagtggcagg gcggggcgta atttttttaa 7680ggcagttatt ggtgccctta aacgcctggt gctacgcctg aataagtgat aataagcgga 7740tgaatggcag aaattcagct tggcccagtg ccaagctcca atacgcaaac cgcctctccc 7800cgcgcgttgg ccgattcatt aatgcagctg gcacgacagg tttcccgact ggaaagcggg 7860cagtgagcgc aacgcaatta atgtgagtta gctcactcat taggcacccc aggctttaca 7920ctttatgctt ccggctcgta tgttgtgtgg aattgtgagc ggataacaat ttcacacagg 7980aaacagctat gaccatgatt acgaattcga gctcggtacc cggggatcct ctagagtc 8038988497DNAArtificialpRET1001Rv 98ccgtccacca cccggtgcct ggtctgcgtc tccctcggct cgttcctcgc ctatcctggt 60gaccagacac cggagcgagc tatgcccagg gttgcgcagt gacttcgtca ctgcgtaacc 120ctgggcgctc gcctcccatt cgcttcgctc acaggagggg gccgtcgatg gccgctgacg 180ctgcatctga cgaccggcgg accgaggtcc gcgccgctgc ttcgcgggcc gctgacgcgg 240ccccggcgaa gcgcacccgc accgtggcgg tgcggctgac cgatggggag gaggccgcgt 300ggatcgacgc cgcgctggcc gatggccacc ggcagctcgg ggcgtgggtg cgtgagcggg 360cggtggccgg ctatctcggg aaggtccgcc cgaagaccgg cagtggaatg tcggcggagg 420cggccgcgga ggtcgccgcg atgcggcagc agatgacgaa ggtggggaac aacctgaacc 480agatcgcgag ggcgatcaac gccgggcagg tgccgtcgca gatggccgag tccctgcaga 540aggggtggct ggagaggtgg gggcaggagt tggggcggat ggcggatcgg ctcgacgcgc 600tcgacgacca gggctgacgt gatcgcgaag atcagcacgg gcagcgaccc gaaggggttg 660gcggcgtatc tgcacgggcc ggggaaggcc accccgcaca gctaccgcac cgaggcgggc 720cggctgattg ccggcgggac ggtgatcgcg ggatcggtgc aggtcaccgc caaaaacccg 
780acccggtggg ggcgggactt cgagcgggcc gccgcgacga acgcgcgggt gggtaagccg 840gtgtggcatt gctcgctgcg gtgcgcgccc ggggatcggc ggctgaccga taccgagttc 900gcggacatcg cgcagacggt cgccgagcgg atgggcttcg agagtcatcc gtgggtggcg 960gtgcggcacg acgacgacca catccacctg gctgtctccc gggtcgattt tcagggcgtg 1020acctggaaga acagcaacga ccggtggaag gtcgtcgagg tgatgcgcga ggtcgaacgc 1080gcgcacggcc tgatcgaggt ggcgagcccg gagcgggccc gtggccggca agccagcagc 1140ggcgagcaac gccgcgcggt gcggaccggc aaggtggcgc agcgggacgg tctgagggaa 1200attgtgaccg ccgcccgcga catcgccgca ggccagggtg tgggggcgtt cgaagtggcg 1260ctcgtacaga acccgattac ccgagtgcag gtgcggcgca acgtcgcgaa gacgggccgg 1320atgaatggct acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg 1380ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc 1440agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc 1500gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc 1560tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc 1620gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag 1680caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta 1740cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc 1800cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa 1860tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg 1920tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac 1980gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa 2040acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc 2100agagcggtcc ggcgcgtgat cgcggatctc acccccacca aacccgccga ggtcaaaccc 2160ccggcccgcc gccagggacc aaccatgccg cagtcggccc cggggtatca gccacccggc 2220cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag agagcgagaa ggctttcgtg 2280gagcgtaggg aacagacgca ggcctggcga agcatgtcca agaacaccat cgatcgctag 2340aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa cgcgcgaaag gtgcatacct 2400cccatagcat cggcgcgtat ggtagggaaa atgatcttca aacgtattgc tgtggtcgtg 2460ctcgctggtg gggctttggt agtgggaggc 
agccaggttg ctggtgctac cacggtttca 2520gctccacagc cgagtccttc agcagcggtg gtgccgacgg ttcttccacc agtcactttc 2580accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg attcccggcg atgccgtctg 2640attccacttc cacagggccg agcgatctgc tgggcggcag ccgctgcccg ttacgcagcg 2700tgccgcgccg gaaactaggt agaacgtgag catggacgag cttcccacct tcatcgccga 2760cgacatcgtg atggccagaa cgttcgacag ccctaacggc caggtggtgc tcgaggtgaa 2820cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac tactgctgca ccttccggat 2880cagcgggaac atggatgccc cttacgacgg attcggtggc ggcgtcgacg cagtgcaggc 2940gctgctactc gcattggcca tggcacacga ggaacttcgt caaacttcgc cagagttgac 3000gtttctaggc gagacgaacc tcggtctacc ggtcttgaac atcaagcccg acaacgcgat 3060cgaagccgtg gtctcattcc ccgctccctg atgtgacgca ctttcacccc tggcactcat 3120gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg cttcgcgttg acttgccact 3180gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg gttcatgaca ccgctaacac 3240gctgcggaaa tgcggattga actagttcat ttggggaacg atgacctgat gaccggggat 3300cgtgacctac ccatgctgac catcgccgag gcggtggacg cgacgcagac cagtgagagc 3360acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg cggtccgcac tgccgacggg 3420aagtggatga ttcccctcgg tgacctatca gcggcagggc tgagaccagg gaaaatggcg 3480aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg acctggcagc tgagaacgcc 3540gagctccgtc agcgcctggc cgtggccgaa gccctggcca gcgaacgcaa tcggatcatc 3600gacgtgcagc aacagatgct ccggatgctc gaagcccggc cggtgtcggc cctggagccc 3660gcggcggttc cagtggcggg tccgccgccg cccgtcccgg ccgccgatgg tcgggcagct 3720acgggcgccc tggcccggat acgtcgacgg cttctcggct aggagctgac cgcgtacttg 3780cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga ttcccatctc ctcggcgatc 3840tcgcggtact tcaggccctg ctcgcgcagc tcgacggccc ggcgacggtt ctcggctgcc 3900cgtgcgagga actggtcccg cggctcggcc atgatgcgct ggatcgtgcg cgtggaggcc 3960cccatcttct cggccagctc gcgagctgtc tgcttgcggc ggatcggtcg ttcagcgccc 4020acggtctgcc tcccacaatg cgttccggtc gaccttcgtc gctcgtttcc ggtttgcctc 4080gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt atggcgatga atgtggcctc 4140gtagacagca gggccgtcgg cccacatccg ggactttgta gtgatccagc gggtaatgga 
4200ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg ggcctgcctg tgaccgggtt 4260cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc gcggccccga ggccggtcgg 4320gtcgccccag tggaagcgga tttcgcggta ggcccaggtg cgtgcggttt cgaacagggc 4380gcagtttcgg ccgaggccga tcgggttctc acggcgcgat cgggtttgcc gccagcgcgt 4440tggcggcatg tggatgccga gttccgcctc gagctcggcg agggatcgcc gctcggtgtg 4500cagccaatgg gtgtcccagt caccgtgagt cgggttcttg gtcatcaggc ccgaatagcc 4560cttgtccccc tggacggcgc gccggaggcc ttcggtgacg gcggccgcat aggcgagcgg 4620cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc agcgcccaca cagcgtgtgc 4680gtgcccgtta cgggggttct ccacgatcgc gttcggcaga ggatgattcc cggccgccga 4740cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg acgagcaggt tgctcaatgc 4800ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct gatcgccgca tccggtagac 4860gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt agccacagct gttcccaggc 4920gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca ctcacacaca agatcgattt 4980attccggtac gacacgccag ccaagtcaga tgtttcggtt tctggagcgg tcctccagac 5040ctttgagatc cgctccagaa acgtccacaa attattgggg tacgtcgaac caagccttat 5100caggtatccc ggggttccgg gggtgaacac caccctccga ccggtccaga atccgtcgat 5160ctcacctatc cgctcgaagt ccttgagtca gtgacaggac cactgctggg ctcccagcgc 5220agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg ggtacggcat gaggtccttc 5280agaagcggcg tcgacgccag gcccacacgc acaatccgct tcccacgagg gacaccaccg 5340gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg gtactgctcg tttgacagga 5400actgcagggt cggtgagctc gcgctgggcg gatcccacca gtagctcccc gtgccggtaa 5460ccgcttgggg ccaagcgaag acacccaccg cggcagcgat ggcaatgcac gtggatggga

5520acaccaccca gaaccaggga aatcctggtg ccggcccgag acgatcccgg cgcggtaaga 5580ccacaccggc caccatcgcc acggcccccg acgcaacaag caataaccac cccatgagcg 5640gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc gccagcccgt gaccggaccg 5700gcgaagcgtg ccgctgggcg gcccgccgtg gcgcccgtcc cgtgcccgtt ctgaccggtg 5760gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca gcccgtgacc ggactctaga 5820ggatccccgg gtaccgagct cgaattcgta atcatggtca tagctgtttc ctgtgtgaaa 5880ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt gtaaagcctg 5940gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc ccgctttcca 6000gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg ggagaggcgg 6060tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg 6120gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg 6180ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa 6240ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg 6300acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc 6360tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc 6420ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc 6480ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg 6540ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc 6600actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga 6660gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc 6720tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac 6780caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg 6840atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc 6900acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa 6960ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta 7020ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt 7080tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag 7140tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag caataaacca 7200gccagccgga agggccgagc gcagaagtgg 
tcctgcaact ttatccgcct ccatccagtc 7260tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt 7320tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag 7380ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt 7440tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat 7500ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt 7560gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc 7620ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa aagtgctcat 7680cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag 7740ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt 7800ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg 7860gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt atcagggtta 7920ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc 7980gcgcacattt ccccgaaaag tgccacctga cgtctaagaa accattatta tcatgacatt 8040aacctataaa aataggcgta tcacgaggcc ctttcgtctc gcgcgtttcg gtgatgacgg 8100tgaaaacctc tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc 8160cgggagcaga caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct 8220taactatgcg gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc 8280gcacagatgc gtaaggagaa aataccgcat caggcgccat tcgccattca ggctgcgcaa 8340ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta cgccagctgg cgaaaggggg 8400atgtgctgca aggcgattaa gttgggtaac gccagggttt tcccagtcac gacgttgtaa 8460aacgacggcc agtgccaagc ttgcatgcct gcaggtc 8497

SEQ ID NO: 99 (length: 8487; type: DNA; organism: Artificial; other information: pRET1002Rv)

ccgtccacca cccggtgcct ggtctgcgtc tccctcggct cgttcctcgc ctatcctggt 60gaccagacac cggagcgagc tatgcccagg gttgcgcagt gacttcgtca ctgcgtaacc 120ctgggcgctc gcctcccatt cgcttcgctc acaggagggg gccgtcgatg gccgctgacg 180ctgcatctga cgaccggcgg accgaggtcc gcgccgctgc ttcgcgggcc gctgacgcgg 240ccccggcgaa gcgcacccgc accgtggcgg tgcggctgac cgatggggag gaggccgcgt 300ggatcgacgc cgcgctggcc gatggccacc ggcagctcgg ggcgtgggtg cgtgagcggg 360cggtggccgg ctatctcggg aaggtccgcc cgaagaccgg cagtggaatg tcggcggagg 
420cggccgcgga ggtcgccgcg atgcggcagc agatgacgaa ggtggggaac aacctgaacc 480agatcgcgag ggcgatcaac gccgggcagg tgccgtcgca gatggccgag tccctgcaga 540aggggtggct ggagaggtgg gggcaggagt tggggcggat ggcggatcgg ctcgacgcgc 600tcgacgacca gggctgacgt gatcgcgaag atcagcacgg gcagcgaccc gaaggggttg 660gcggcgtatc tgcacgggcc ggggaaggcc accccgcaca gctaccgcac cgaggcgggc 720cggctgattg ccggcgggac ggtgatcgcg ggatcggtgc aggtcaccgc caaaaacccg 780acccggtggg ggcgggactt cgagcgggcc gccgcgacga acgcgcgggt gggtaagccg 840gtgtggcatt gctcgctgcg gtgcgcgccc ggggatcggc ggctgaccga taccgagttc 900gcggacatcg cgcagacggt cgccgagcgg atgggcttcg agagtcatcc gtgggtggcg 960gtgcggcacg acgacgacca catccacctg gctgtctccc gggtcgattt tcagggcgtg 1020acctggaaga acagcaacga ccggtggaag gtcgtcgagg tgatgcgcga ggtcgaacgc 1080gcgcacggcc tgatcgaggt ggcgagcccg gagcgggccc gtggccggca agccagcagc 1140ggcgagcaac gccgcgcggt gcggaccggc aaggtggcgc agcgggacgg tctgagggaa 1200attgtgaccg ccgcccgcga catcgccgca ggccagggtg tgggggcgtt cgaagtggcg 1260ctcgtacaga acccgattac ccgagtgcag gtgcggcgca acgtcgcgaa gacgggccgg 1320atgaatggct acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg 1380ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc 1440agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc 1500gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc 1560tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc 1620gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag 1680caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta 1740cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc 1800cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa 1860tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg 1920tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac 1980gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa 2040acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc 2100agagcggtcc ggcgcgtgat cgcggatctc acccccacca 
aacccgccga ggtcaaaccc 2160ccggcccgcc gccagggacc aaccatgccg cagtcggccc cggggtatca gccacccggc 2220cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag agagcgagaa ggctttcgtg 2280gagcgtaggg aacagacgca ggcctggcga agcatgtcca agaacaccat cgatcgctag 2340aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa cgcgcgaaag gtgcatacct 2400cccatagcat cggcgcgtat ggtagggaaa atgatcttca aacgtattgc tgtggtcgtg 2460ctcgctggtg gggctttggt agtgggaggc agccaggttg ctggtgctac cacggtttca 2520gctccacagc cgagtccttc agcagcggtg gtgccgacgg ttcttccacc agtcactttc 2580accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg attcccggcg atgccgtctg 2640attccacttc cacagggccg agcgatctgc tgggcggcag ccgctgcccg ttacgcagcg 2700tgccgcgccg gaaactaggt agaacgtgag catggacgag cttcccacct tcatcgccga 2760cgacatcgtg atggccagaa cgttcgacag ccctaacggc caggtggtgc tcgaggtgaa 2820cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac tactgctgca ccttccggat 2880cagcgggaac atggatgccc cttacgacgg attcggtggc ggcgtcgacg cagtgcaggc 2940gctgctactc gcattggcca tggcacacga ggaacttcgt caaacttcgc cagagttgac 3000gtttctaggc gagacgaacc tcggtctacc ggtcttgaac atcaagcccg acaacgcgat 3060cgaagccgtg gtctcattcc ccgctccctg atgtgacgca ctttcacccc tggcactcat 3120gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg cttcgcgttg acttgccact 3180gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg gttcatgaca ccgctaacac 3240gctgcggaaa tgcggattga actagttcat ttggggaacg atgacctgat gaccggggat 3300cgtgacctac ccatgctgac catcgccgag gcggtggacg cgacgcagac cagtgagagc 3360acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg cggtccgcac tgccgacggg 3420aagtggatga ttcccctcgg tgacctatca gcggcagggc tgagaccagg gaaaatggcg 3480aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg acctggcagc tgagaacgcc 3540gagctccgtc agcgcctggc cgtggccgaa gccctggcca gcgaacgcaa tcggatcatc 3600gacgtgcagc aacagatgct ccggatgctc gaagcccggc cggtgtcggc cctggagccc 3660gcggcggttc cagtggcggg tccgccgccg cccgtcccgg ccgccgatgg tcgggcagct 3720acgggcgccc tggcccggat acgtcgacgg cttctcggct aggagctgac cgcgtacttg 3780cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga ttcccatctc ctcggcgatc 3840tcgcggtact 
tcaggccctg ctcgcgcagc tcgacggccc ggcgacggtt ctcggctgcc 3900cgtgcgagga actggtcccg cggctcggcc atgatgcgct ggatcgtgcg cgtggaggcc 3960cccatcttct cggccagctc gcgagctgtc tgcttgcggc ggatcggtcg ttcagcgccc 4020acggtctgcc tcccacaatg cgttccggtc gaccttcgtc gctcgtttcc ggtttgcctc 4080gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt atggcgatga atgtggcctc 4140gtagacagca gggccgtcgg cccacatccg ggactttgta gtgatccagc gggtaatgga 4200ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg ggcctgcctg tgaccgggtt 4260cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc gcggccccga ggccggtcgg 4320gtcgccccag tggaagcgga tttcgcggta ggcccaggtg cgtgcggttt cgaacagggc 4380gcagtttcgg ccgaggccga tcgggttctc acggcgcgat cgggtttgcc gccagcgcgt 4440tggcggcatg tggatgccga gttccgcctc gagctcggcg agggatcgcc gctcggtgtg 4500cagccaatgg gtgtcccagt caccgtgagt cgggttcttg gtcatcaggc ccgaatagcc 4560cttgtccccc tggacggcgc gccggaggcc ttcggtgacg gcggccgcat aggcgagcgg 4620cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc agcgcccaca cagcgtgtgc 4680gtgcccgtta cgggggttct ccacgatcgc gttcggcaga ggatgattcc cggccgccga 4740cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg acgagcaggt tgctcaatgc 4800ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct gatcgccgca tccggtagac 4860gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt agccacagct gttcccaggc 4920gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca ctcacacaca agatcgattt 4980attccggtac gacacgccag ccaagtcaga tgtttcggtt tctggagcgg tcctccagac 5040ctttgagatc cgctccagaa acgtccacaa attattgggg tacgtcgaac caagccttat 5100caggtatccc ggggttccgg gggtgaacac caccctccga ccggtccaga atccgtcgat 5160ctcacctatc cgctcgaagt ccttgagtca gtgacaggac cactgctggg ctcccagcgc 5220agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg ggtacggcat gaggtccttc 5280agaagcggcg tcgacgccag gcccacacgc acaatccgct tcccacgagg gacaccaccg 5340gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg gtactgctcg tttgacagga 5400actgcagggt cggtgagctc gcgctgggcg gatcccacca gtagctcccc gtgccggtaa 5460ccgcttgggg ccaagcgaag acacccaccg cggcagcgat ggcaatgcac gtggatggga 5520acaccaccca gaaccaggga aatcctggtg ccggcccgag 
acgatcccgg cgcggtaaga 5580ccacaccggc caccatcgcc acggcccccg acgcaacaag caataaccac cccatgagcg 5640gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc gccagcccgt gaccggaccg 5700gcgaagcgtg ccgctgggcg gcccgccgtg gcgcccgtcc cgtgcccgtt ctgaccggtg 5760gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca gcccgtgacc ggactctaga 5820ggatccccgg gtaccgagct cgaattcact ggccgtcgtt ttacaacgtc gtgactggga 5880aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg 5940taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga 6000atgcgattta ttcaacaaag ccgccgtccc gtcaagtcag cgtaatgctc tgccagtgtt 6060acaaccaatt aaccaattct gattagaaaa actcatcgag catcaaatga aactgcaatt 6120tattcatatc aggattatca ataccatatt tttgaaaaag ccgtttctgt aatgaaggag 6180aaaactcacc gaggcagttc cataggatgg caagatcctg gtatcggtct gcgattccga 6240ctcgtccaac atcaatacaa cctattaatt tcccctcgtc aaaaataagg ttatcaagtg 6300agaaatcacc atgagtgacg actgaatccg gtgagaatgg caaaagctta tgcatttctt 6360tccagacttg ttcaacaggc cagccattac gctcgtcatc aaaatcactc gcatcaacca 6420aaccgttatt cattcgtgat tgcgcctgag cgagacgaaa tacgcgatcg ctgttaaaag 6480gacaattaca aacaggaatc gaatgcaacc ggcgcaggaa cactgccagc gcatcaacaa 6540tattttcacc tgaatcagga tattcttcta atacctggaa tgctgttttc ccggggatcg 6600cagtggtgag taaccatgca tcatcaggag tacggataaa atgcttgatg gtcggaagag 6660gcataaattc cgtcagccag tttagtctga ccatctcatc tgtaacatca ttggcaacgc 6720tacctttgcc atgtttcaga aacaactctg gcgcatcggg cttcccatac aatcgataga 6780ttgtcgcacc tgattgcccg acattatcgc gagcccattt atacccatat aaatcagcat 6840ccatgttgga atttaatcgc ggcttcgagc aagacgtttc ccgttgaata tggctcataa 6900caccccttgt attactgttt atgtaagcag acagttttat tgttcatgat gatatatttt 6960tatcttgtgc aatgtaacat cagagatttt gagacacaac gtggctttgt tgaataaatc 7020gaacttttgc tgagttgaag gatcagatca cgcatcttcc cgacaacgca gaccgttccg 7080tggcaaagca aaagttcaaa atcaccaact ggtccaccta caacaaagct ctcatcaacc 7140gtggctccct cactttctgg ctggatgatg gggcgattca ggcctggtat gagtcagcaa 7200caccttcttc acgaggcaga cctctcgacg gatcgttcca ctgagcgtca gaccccgtag 7260aaaagatcaa 
aggatcttct tgagatcctt tttttctgcg cgtaatctgc tgcttgcaaa 7320caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt 7380ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt ctagtgtagc 7440cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa 7500tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa 7560gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc 7620ccagcttgga gcgaacgacc tacaccgaac tgagatacct acagcgtgag cattgagaaa 7680gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa 7740caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg 7800ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc 7860tatggaaaaa cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg 7920ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg 7980agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg 8040aagcggaaga agctcgcaca ttcagcagcg tttttcagcg cgttttcgat cagcgtttca 8100atgttggtat caacaccagg tttaactttg aacttatcgg cactgacggt tactgatttt 8160gaacttttgc tttgccacgg aacggtctgc gttgtcggga agatgcgtga tctgatcctt 8220caactcagca aaagttcgcc aatacgcaaa ccgcctctcc ccgcgcgttg gccgattcat 8280taatgcagct ggcacgacag gtttcccgac tggaaagcgg gcagtgagcg caacgcaatt 8340aatgtgagtt agctcactca ttaggcaccc caggctttac actttatgct tccggctcgt 8400atgttgtgtg gaattgtgag cggataacaa tttcacacag gaaacagcta tgaccatgat 8460tacgccaagc ttgcatgcct gcaggtc 8487

SEQ ID NO: 100 (length: 8038; type: DNA; organism: Artificial; other information: pRET1003Rv)

ccgtccacca cccggtgcct ggtctgcgtc tccctcggct cgttcctcgc ctatcctggt 60gaccagacac cggagcgagc tatgcccagg gttgcgcagt gacttcgtca ctgcgtaacc 120ctgggcgctc gcctcccatt cgcttcgctc acaggagggg gccgtcgatg gccgctgacg 180ctgcatctga cgaccggcgg accgaggtcc gcgccgctgc ttcgcgggcc gctgacgcgg 240ccccggcgaa gcgcacccgc accgtggcgg tgcggctgac cgatggggag gaggccgcgt 300ggatcgacgc cgcgctggcc gatggccacc ggcagctcgg ggcgtgggtg cgtgagcggg 360cggtggccgg ctatctcggg aaggtccgcc cgaagaccgg cagtggaatg tcggcggagg 420cggccgcgga ggtcgccgcg atgcggcagc agatgacgaa ggtggggaac 
aacctgaacc 480agatcgcgag ggcgatcaac gccgggcagg tgccgtcgca gatggccgag tccctgcaga 540aggggtggct ggagaggtgg gggcaggagt tggggcggat ggcggatcgg ctcgacgcgc 600tcgacgacca gggctgacgt gatcgcgaag atcagcacgg gcagcgaccc gaaggggttg 660gcggcgtatc tgcacgggcc ggggaaggcc accccgcaca gctaccgcac cgaggcgggc 720cggctgattg ccggcgggac ggtgatcgcg ggatcggtgc aggtcaccgc caaaaacccg 780acccggtggg ggcgggactt cgagcgggcc gccgcgacga acgcgcgggt gggtaagccg 840gtgtggcatt gctcgctgcg gtgcgcgccc ggggatcggc ggctgaccga taccgagttc 900gcggacatcg cgcagacggt cgccgagcgg atgggcttcg agagtcatcc gtgggtggcg 960gtgcggcacg acgacgacca catccacctg gctgtctccc gggtcgattt tcagggcgtg 1020acctggaaga acagcaacga ccggtggaag gtcgtcgagg tgatgcgcga ggtcgaacgc 1080gcgcacggcc tgatcgaggt ggcgagcccg gagcgggccc gtggccggca agccagcagc 1140ggcgagcaac gccgcgcggt gcggaccggc aaggtggcgc agcgggacgg tctgagggaa 1200attgtgaccg ccgcccgcga catcgccgca ggccagggtg tgggggcgtt cgaagtggcg 1260ctcgtacaga acccgattac ccgagtgcag gtgcggcgca acgtcgcgaa gacgggccgg 1320atgaatggct acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg 1380ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc 1440agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc 1500gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc 1560tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc 1620gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag 1680caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta 1740cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc 1800cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa 1860tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg 1920tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac 1980gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa 2040acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc 2100agagcggtcc ggcgcgtgat cgcggatctc acccccacca aacccgccga ggtcaaaccc 2160ccggcccgcc gccagggacc aaccatgccg 
cagtcggccc cggggtatca gccacccggc 2220cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag agagcgagaa ggctttcgtg 2280gagcgtaggg aacagacgca ggcctggcga agcatgtcca agaacaccat cgatcgctag 2340aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa cgcgcgaaag gtgcatacct 2400cccatagcat cggcgcgtat ggtagggaaa atgatcttca aacgtattgc tgtggtcgtg 2460ctcgctggtg gggctttggt agtgggaggc agccaggttg ctggtgctac cacggtttca 2520gctccacagc cgagtccttc agcagcggtg gtgccgacgg ttcttccacc agtcactttc 2580accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg attcccggcg atgccgtctg 2640attccacttc cacagggccg agcgatctgc tgggcggcag ccgctgcccg ttacgcagcg 2700tgccgcgccg gaaactaggt agaacgtgag catggacgag cttcccacct tcatcgccga 2760cgacatcgtg atggccagaa cgttcgacag ccctaacggc caggtggtgc tcgaggtgaa 2820cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac tactgctgca ccttccggat 2880cagcgggaac atggatgccc cttacgacgg attcggtggc ggcgtcgacg cagtgcaggc 2940gctgctactc gcattggcca tggcacacga ggaacttcgt caaacttcgc cagagttgac 3000gtttctaggc gagacgaacc tcggtctacc ggtcttgaac atcaagcccg acaacgcgat 3060cgaagccgtg gtctcattcc ccgctccctg atgtgacgca ctttcacccc tggcactcat 3120gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg cttcgcgttg acttgccact 3180gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg gttcatgaca ccgctaacac 3240gctgcggaaa tgcggattga actagttcat ttggggaacg atgacctgat gaccggggat 3300cgtgacctac ccatgctgac catcgccgag gcggtggacg cgacgcagac cagtgagagc 3360acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg cggtccgcac tgccgacggg 3420aagtggatga ttcccctcgg tgacctatca gcggcagggc tgagaccagg gaaaatggcg

3480aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg acctggcagc tgagaacgcc 3540gagctccgtc agcgcctggc cgtggccgaa gccctggcca gcgaacgcaa tcggatcatc 3600gacgtgcagc aacagatgct ccggatgctc gaagcccggc cggtgtcggc cctggagccc 3660gcggcggttc cagtggcggg tccgccgccg cccgtcccgg ccgccgatgg tcgggcagct 3720acgggcgccc tggcccggat acgtcgacgg cttctcggct aggagctgac cgcgtacttg 3780cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga ttcccatctc ctcggcgatc 3840tcgcggtact tcaggccctg ctcgcgcagc tcgacggccc ggcgacggtt ctcggctgcc 3900cgtgcgagga actggtcccg cggctcggcc atgatgcgct ggatcgtgcg cgtggaggcc 3960cccatcttct cggccagctc gcgagctgtc tgcttgcggc ggatcggtcg ttcagcgccc 4020acggtctgcc tcccacaatg cgttccggtc gaccttcgtc gctcgtttcc ggtttgcctc 4080gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt atggcgatga atgtggcctc 4140gtagacagca gggccgtcgg cccacatccg ggactttgta gtgatccagc gggtaatgga 4200ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg ggcctgcctg tgaccgggtt 4260cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc gcggccccga ggccggtcgg 4320gtcgccccag tggaagcgga tttcgcggta ggcccaggtg cgtgcggttt cgaacagggc 4380gcagtttcgg ccgaggccga tcgggttctc acggcgcgat cgggtttgcc gccagcgcgt 4440tggcggcatg tggatgccga gttccgcctc gagctcggcg agggatcgcc gctcggtgtg 4500cagccaatgg gtgtcccagt caccgtgagt cgggttcttg gtcatcaggc ccgaatagcc 4560cttgtccccc tggacggcgc gccggaggcc ttcggtgacg gcggccgcat aggcgagcgg 4620cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc agcgcccaca cagcgtgtgc 4680gtgcccgtta cgggggttct ccacgatcgc gttcggcaga ggatgattcc cggccgccga 4740cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg acgagcaggt tgctcaatgc 4800ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct gatcgccgca tccggtagac 4860gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt agccacagct gttcccaggc 4920gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca ctcacacaca agatcgattt 4980attccggtac gacacgccag ccaagtcaga tgtttcggtt tctggagcgg tcctccagac 5040ctttgagatc cgctccagaa acgtccacaa attattgggg tacgtcgaac caagccttat 5100caggtatccc ggggttccgg gggtgaacac caccctccga ccggtccaga atccgtcgat 5160ctcacctatc cgctcgaagt ccttgagtca 
gtgacaggac cactgctggg ctcccagcgc 5220agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg ggtacggcat gaggtccttc 5280agaagcggcg tcgacgccag gcccacacgc acaatccgct tcccacgagg gacaccaccg 5340gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg gtactgctcg tttgacagga 5400actgcagggt cggtgagctc gcgctgggcg gatcccacca gtagctcccc gtgccggtaa 5460ccgcttgggg ccaagcgaag acacccaccg cggcagcgat ggcaatgcac gtggatggga 5520acaccaccca gaaccaggga aatcctggtg ccggcccgag acgatcccgg cgcggtaaga 5580ccacaccggc caccatcgcc acggcccccg acgcaacaag caataaccac cccatgagcg 5640gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc gccagcccgt gaccggaccg 5700gcgaagcgtg ccgctgggcg gcccgccgtg gcgcccgtcc cgtgcccgtt ctgaccggtg 5760gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca gcccgtgacc ggactctaga 5820ggatccccgg gtaccgagct cgaattcgta atcatggtca tagctgtttc ctgtgtgaaa 5880ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt gtaaagcctg 5940gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc ccgctttcca 6000gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg ggagaggcgg 6060tttgcgtatt ggagcttggc actgggccaa gctgaatttc tgccattcat ccgcttatta 6120tcacttattc aggcgtagca ccaggcgttt aagggcacca ataactgcct taaaaaaatt 6180acgccccgcc ctgccactca tcgcagtact gttgtaattc attaagcatt ctgccgacat 6240ggaagccatc acagacggca tgatgaacct gaatcgccag cggcatcagc accttgtcgc 6300cttgcgtata atatttgccc atggtgaaaa cgggggcgaa gaagttgtcc atattggcca 6360cgtttaaatc aaaactggtg aaactcaccc agggattggc tgagacgaaa aacatattct 6420caataaaccc tttagggaaa taggccaggt tttcaccgta acacgccaca tcttgcgaat 6480atatgtgtag aaactgccgg aaatcgtcgt ggtattcact ccagagcgat gaaaacgttt 6540cagtttgctc atggaaaacg gtgtaacaag ggtgaacact atcccatatc accagctcac 6600cgtctttcat tgccatacga aattccggat gagcattcat caggcgggca agaatgtgaa 6660taaaggccgg ataaaacttg tgcttatttt tctttacggt ctttaaaaag gccgtaatat 6720ccagctgaac ggtctggtta taggtacatt gagcaactga ctgaaatgcc tcaaaatgtt 6780ctttacgatg ccattgggat atatcaacgg tggtatatcc agtgattttt ttctccattt 6840tagcttcctt agctcctgaa aatctcgata actcaaaaaa tacgcccggt agtgatctta 
6900tttcattatg gtgaaagttg gaacctctta cgtgccgatc aacgtctcat tttcgccaaa 6960agttggccca gggcttcccg gtatcaacag ggacaccagg atttatttat tctgcgaagt 7020gatcttccgt tcgacggagt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc 7080ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct 7140accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg 7200cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca 7260cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc 7320tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga 7380taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac 7440gacctacacc gaactgagat acctacagcg tgagcattga gaaagcgcca cgcttcccga 7500agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag 7560ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg 7620acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag 7680caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc 7740tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc 7800tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagaagctca 7860ttcgccattc aggctgcgca actgttggga agggcgatcg gtgcgggcct cttcgctatt 7920acgccagctg gcgaaagggg gatgtgctgc aaggcgatta agttgggtaa cgccagggtt 7980ttcccagtca cgacgttgta aaacgacggc cagtgccaag cttgcatgcc tgcaggtc 8038




