Patent application title: NOVEL PLASMIDS AND UTILIZATION THEREOF
Inventors:
Eitora Yamamura (Takaoka-Shi, JP)
Noboru Fujimoto (Takaoka-Shi, JP)
Assignees:
DAIICHI FINE CHEMICAL CO., LTD.
IPC8 Class: AC12N121FI
USPC Class:
43525233
Class name: Bacteria or actinomycetales; media therefor transformants (e.g., recombinant dna or vector or foreign or exogenous gene containing, fused bacteria, etc.) escherichia (e.g., e. coli, etc.)
Publication date: 2009-02-12
Patent application number: 20090042275
Claims:
1-24. (canceled)
25. An isolated DNA fragment having the nucleotide sequence set forth as SEQ ID NO: 77.
26. An isolated DNA fragment comprising a promoter region having the nucleotide sequence set forth as SEQ ID NO: 77.
27-28. (canceled)
29. An isolated vector comprising the isolated DNA fragment according to claim 25.
30. The isolated vector according to claim 29, having inserted therein an aminoketone asymmetric reductase gene.
31. The isolated vector according to claim 30, wherein the aminoketone asymmetric reductase gene is a nucleic acid coding for a protein comprising the amino acid sequence set forth as SEQ ID NO: 78, or a nucleic acid that codes for a protein having the amino acid sequence set forth as SEQ ID NO: 78 with a deletion, insertion, substitution or addition of one or a plurality of amino acids, and having aminoketone asymmetric reduction activity.
32. The isolated vector according to claim 30, wherein the aminoketone asymmetric reductase gene is a nucleic acid comprising the nucleotide sequence set forth as SEQ ID NO: 79, or a nucleic acid that hybridizes with nucleic acid having a nucleotide sequence complementary to the nucleotide sequence set forth as SEQ ID NO: 79 under stringent conditions, and that codes for a protein having aminoketone asymmetric reduction activity.
33. A transformant containing the isolated vector according to claim 29.
34. A transformant containing the isolated vector according to claim 30.
35-38. (canceled)
Description:
TECHNICAL FIELD
[0001]The present invention relates to novel plasmids derived from any of the microorganisms belonging to the genus Rhodococcus (hereinafter referred to as "the genus Rhodococcus") and to utilization thereof. More specifically, the invention relates to plasmids or their partial DNA fragments (hereinafter also referred to simply as "DNA fragments"), and to shuttle vectors, vectors, transformants, aminoketone asymmetric reductase production methods and optically active aminoalcohol production methods which utilize them.
BACKGROUND ART
[0002]The genus Rhodococcus is known to produce enzymes involved in nitrile metabolism and to produce enzymes which asymmetrically reduce aminoketones. In particular, Rhodococcus erythropolis is known to have very high aminoketone asymmetric reduction activity. Such microorganisms and enzymes act on α-aminoketones to produce optically active β-aminoalcohols with high selectivity and in high yield (for example, Patent documents 1 and 5). Thus, it has long been desired to develop a host-vector system intended for mass production of useful enzymes and the like in the genus Rhodococcus. However, the development of vectors suitable for the genus Rhodococcus as hosts has lagged behind. Only a few strains of the genus Rhodococcus have been found with plasmids, namely Rhodococcus sp. H13-A (Non-patent document 1), Rhodococcus rhodochrous ATCC4276 (Patent document 2), Rhodococcus rhodochrous ATCC4001 (Patent document 3) and Rhodococcus erythropolis IFO12320 (Patent document 4).
[0003][Patent document 1] WO01/73100
[0004][Patent document 2] Japanese Unexamined Patent Publication HEI No. 4-148685
[0005][Patent document 3] Japanese Unexamined Patent Publication HEI No. 4-330287
[0006][Patent document 4] Japanese Unexamined Patent Publication HEI No. 9-28379
[0007][Patent document 5] WO02/070714
[0008][Non-patent document 1] J. Bacteriol., 170, 638, 1988
DISCLOSURE OF THE INVENTION
Problems to be Solved by the Invention
[0009]As mentioned above, it has been desired to develop new vectors for breeding and improving industrially useful strains (mutant strains) of the genus Rhodococcus. In particular, self-cloning systems are preferred from the standpoint of safety of the recombinant DNA microbes and of their products, which may be used as foods and additives. It is an object of the present invention to provide novel plasmids that can be used as vectors for such a host-vector system.
[0010]It is desirable to create recombinant microbes suitable for industrial application from among Rhodococcus erythropolis which has aminoketone asymmetric reduction activity. In particular, it is a first object of the invention to provide novel plasmids or their partial DNA fragments which can be used to create such recombinant microbes.
[0011]If a plasmid such as described above can be obtained, it would become easy to construct a shuttle vector that is replicable even in other microbes. It is therefore a second object of the invention to provide nucleotide sequence data relating to DNA replication (replication region, etc.) necessary for construction of such a shuttle vector.
[0012]It is a third object of the invention to provide shuttle vectors that are replicable in both the genus Rhodococcus and E. coli.
[0013]It is a fourth object of the invention to apply the shuttle vectors to an aminoketone asymmetric reductase.
Means for Solving the Problems
[0014]The present inventors carefully screened plasmids for vector construction from among Rhodococcus strains, and as a result discovered several novel plasmids usable as vectors for host-vector systems.
[0015]Furthermore, the present inventors found that it is possible to construct shuttle vectors by transferring into the aforementioned plasmids a drug resistance gene and a gene region that is replicable in E. coli. As a result there were obtained nucleotide sequence data, plasmids and shuttle vectors that achieve the objects stated above, and the present invention has thereupon been completed.
[0016]Specifically, the present invention provides a DNA fragment, a DNA, a plasmid, a shuttle vector, a vector, a transformant, a method for production of an aminoketone asymmetric reductase, and a method for production of an optically active aminoalcohol, according to the following (1) to (39).
(1) A DNA fragment having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 35, SEQ ID NO: 36 and SEQ ID NO: 37.
(2) A plasmid or a partial DNA fragment thereof, characterized by comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 35, SEQ ID NO: 36 and SEQ ID NO: 37.
(3) A DNA fragment having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 22.
(4) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 22.
(5) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 22 and comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 35, SEQ ID NO: 36 and SEQ ID NO: 37.
(6) A DNA fragment having the nucleotide sequence set forth as SEQ ID NO: 76.
(7) A plasmid or a partial DNA fragment thereof, characterized by comprising a promoter region having the nucleotide sequence set forth as SEQ ID NO: 76.
(8) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 1, SEQ ID NO: 4, SEQ ID NO: 14, SEQ ID NO: 17 and SEQ ID NO: 22, comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 35, SEQ ID NO: 36 and SEQ ID NO: 37, and comprising a promoter region having the nucleotide sequence set forth as SEQ ID NO: 76.
(9) A circular plasmid characterized by comprising a plasmid or a partial DNA fragment according to any one of (1) to (8), wherein the numbers of restriction endonuclease cleavage sites are BamH I: 2, EcoR I: 2, Kpn I: 1, Pvu II: 1, Sac I: 1 and Sma I: 1, and the size is approximately 5.4 kbp.
(10) A plasmid having the nucleotide sequence set forth as SEQ ID NO: 73.
(11) A plasmid or a DNA fragment according to any one of (1) to (10), characterized by being derived from a bacterium belonging to the genus Rhodococcus.
(12) A DNA fragment having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 70, SEQ ID NO: 71 and SEQ ID NO: 72.
(13) A plasmid or a partial DNA fragment thereof, characterized by comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 70, SEQ ID NO: 71 and SEQ ID NO: 72.
(14) A DNA fragment having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 61, SEQ ID NO: 62 and SEQ ID NO: 69.
(15) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 61, SEQ ID NO: 62 and SEQ ID NO: 69.
(16) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 61, SEQ ID NO: 62 and SEQ ID NO: 69 and comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 70, SEQ ID NO: 71 and SEQ ID NO: 72.
(17) A plasmid or a partial DNA fragment thereof, characterized by comprising a coding region for a DNA replication-related protein having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 40, SEQ ID NO: 42, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO: 53, SEQ ID NO: 55, SEQ ID NO: 56, SEQ ID NO: 61, SEQ ID NO: 62 and SEQ ID NO: 69, comprising a DNA replication region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 70, SEQ ID NO: 71 and SEQ ID NO: 72, and comprising a promoter region having the nucleotide sequence set forth as SEQ ID NO: 76.
(18) A DNA fragment having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 67 and SEQ ID NO: 47.
(19) A plasmid or a partial DNA fragment thereof, characterized by comprising a mobilization protein region having at least one nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 67 and SEQ ID NO: 47.
(20) A DNA fragment having the nucleotide sequence set forth as SEQ ID NO: 75.
(21) A plasmid or a partial DNA fragment thereof characterized by comprising a mobilization-related region having the nucleotide sequence set forth as SEQ ID NO: 75.
(22) A circular plasmid characterized by comprising a plasmid or DNA fragment according to any one of (12) to (21), wherein the numbers of restriction endonuclease cleavage sites are BamH I: 2, Pvu II: 4, Sac I: 3 and Sma I: 4, and the size is approximately 5.8 kbp.
(23) A plasmid having the nucleotide sequence set forth as SEQ ID NO: 74.
(24) A plasmid or a DNA fragment according to any one of (12) to (23), characterized by being derived from a bacterium belonging to the genus Rhodococcus.
(25) A DNA fragment having the nucleotide sequence set forth as SEQ ID NO: 77.
(26) A DNA fragment characterized by comprising a promoter region having the nucleotide sequence set forth as SEQ ID NO: 77.
(27) A shuttle vector replicable in bacteria belonging to the genus Rhodococcus and E. coli, and comprising a plasmid or partial DNA fragment thereof according to any one of (1) to (26) and a DNA region replicable in E. coli.
(28) A vector characterized by being constructed using a shuttle vector according to (27).
(29) A vector characterized by comprising a plasmid or DNA fragment according to any one of (6), (7), (25) or (26).
(30) A vector according to (28) or (29), characterized by having inserted therein an aminoketone asymmetric reductase gene.
(31) A vector according to (30), characterized in that the aminoketone asymmetric reductase gene is a nucleic acid coding for a protein consisting of the amino acid sequence set forth as SEQ ID NO: 78, or a nucleic acid that codes for a protein having the amino acid sequence set forth as SEQ ID NO: 78 with a deletion, insertion, substitution or addition of one or a plurality of amino acids, and having aminoketone asymmetric reduction activity.
(32) A vector according to (30), characterized in that the aminoketone asymmetric reductase gene is a nucleic acid consisting of the nucleotide sequence set forth as SEQ ID NO: 79, or a nucleic acid that hybridizes with nucleic acid having a nucleotide sequence complementary to the nucleotide sequence set forth as SEQ ID NO: 79 under stringent conditions, and that codes for a protein having aminoketone asymmetric reduction activity.
(33) A transformant containing a vector according to (28) or (29).
(34) A transformant containing a vector according to any one of (30) to (32).
(35) A method for production of an aminoketone asymmetric reductase, which comprises a culturing step in which a transformant according to (34) is cultured in medium that allows growth of said transformant, and
[0017]a purification step in which the aminoketone asymmetric reductase is purified from said transformant obtained in said culturing step.
(36) A method for production of an optically active aminoalcohol, wherein an aminoketone asymmetric reductase obtained by the production method of (35) is reacted with an enantiomeric mixture of an α-aminoketone compound represented by the following general formula (1):
##STR00001##
wherein X may be the same or different and represents at least one species selected from the group consisting of halogen, lower alkyl, hydroxyl optionally protected with a protecting group, nitro and sulfonyl; n represents an integer of 0 to 3; R1 represents lower alkyl; R2 and R3 may be the same or different and represent at least one species selected from the group consisting of hydrogen and lower alkyl; and * represents asymmetric carbon, or a salt thereof, to produce an optically active aminoalcohol compound represented by the following general formula (2):
##STR00002##
wherein X, n, R1, R2, R3 and * have the same definitions as above, and having the desired optical activity.
(37) A method for production of an optically active aminoalcohol, wherein a transformant according to (34) is reacted with an enantiomeric mixture of an α-aminoketone compound represented by the following general formula (1):
##STR00003##
wherein X may be the same or different and represents at least one species selected from the group consisting of halogen, lower alkyl, hydroxyl optionally protected with a protecting group, nitro and sulfonyl; n represents an integer of 0 to 3; R1 represents lower alkyl; R2 and R3 may be the same or different and represent at least one species selected from the group consisting of hydrogen and lower alkyl; and * represents asymmetric carbon, or a salt thereof, to produce an optically active aminoalcohol compound represented by the following general formula (2):
##STR00004##
wherein X, n, R1, R2, R3 and * have the same definitions as above, and having the desired optical activity.
(38) A production method for an optically active aminoalcohol according to (37), wherein the production method for the optically active aminoalcohol is carried out with further addition of a compound represented by the following general formula (3):
##STR00005##
wherein A represents the following formula (Y) or (Z):
##STR00006##
wherein R4 represents hydrogen, optionally substituted C1-3 alkyl, a C5-10 hydrocarbon ring which is bonded to R8 or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R8,
##STR00007##
wherein R5 represents hydrogen, C1-3 alkyl or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R6 or R9; R6 represents hydrogen, optionally substituted C1-3 alkyl, a C5-10 hydrocarbon ring which is bonded to R8 or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R5 or R9; R7 represents hydrogen or optionally substituted C1-6 alkyl; R8 represents hydrogen, carboxyl, optionally substituted C1-6 alkyl, a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R4 or a C5-10 hydrocarbon ring which is bonded to R6; R9 represents hydrogen, optionally substituted C1-6 alkyl, optionally substituted C1-6 alkyloxycarbonyl, optionally substituted acyl or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R5 or R6; and R10 represents hydrogen or optionally substituted C1-6 alkyl, or a pharmaceutically acceptable salt or solvate thereof, for production of an optically active aminoalcohol.
(39) A shuttle vector according to (27), having a nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 89 to SEQ ID NO: 100.
EFFECT OF THE INVENTION
[0018]The plasmids of the invention are novel plasmids unknown to the prior art, and are valuable as vectors for host-vector systems in the industrially useful genus Rhodococcus. They are of particular utility in the creation of recombinant microbes capable of industrial asymmetric reduction of aminoketones. An example of asymmetric reduction of an aminoketone to which such microbes may contribute is a reaction for production of d-(1S,2S)-pseudoephedrine from 1-2-methylamino-1-phenyl-1-propanone.
[0019]The plasmids of the invention can coexist in a single Rhodococcus cell and therefore can be used not only alone for their replicating function, but also as compatible plasmids. That is, by inserting different protein (for example, enzyme) genes into the different plasmids, it is possible to express the proteins simultaneously in the same cell.
[0020]The shuttle vectors of the invention are useful for creation of industrially useful recombinant microbes of the genus Rhodococcus and Escherichia coli.
[0021]The nucleotide sequence data relating to DNA replication obtained from the plasmids of the invention may serve as the basis for construction of the aforementioned shuttle vectors, and specifically they provide DNA fragments as constituent elements of the vectors.
BRIEF DESCRIPTION OF THE DRAWINGS
[0022]FIG. 1 is a restriction enzyme cleavage map of plasmid pRET1100.
[0023]FIG. 2 is a restriction enzyme cleavage map of plasmid pRET1000.
[0024]FIG. 3 is a summary illustration for construction of shuttle vector pRET1101.
[0025]FIG. 4 is a summary illustration for construction of shuttle vector pRET1102.
[0026]FIG. 5 is a summary illustration for construction of shuttle vector pRET1103.
BEST MODE FOR CARRYING OUT THE INVENTION
[0027]Preferred embodiments of the invention will now be explained.
[0028]The first plasmid of the invention is a plasmid isolated from the genus Rhodococcus, or a derivative thereof. Specifically, it may be isolated from, for example, Rhodococcus erythropolis IAM1400, IAM1503, JCM2893 and JCM2894 strains, has a size of approximately 5.4 kbp and is a circular plasmid cleavable by the restriction enzymes shown in Table 1. The plasmids isolated from each of these strains are designated as pRET1100, pRET1300, pRET1500 and pRET1700, respectively. Plasmids of the invention may be prepared from these sample strains by publicly known methods (for example, boiling, alkali dissolution, cesium chloride density gradient ultracentrifugation: Lab Manual Idenshi Kogaku, 3rd Edition, Chapter 10, pp. 55-59, Maruzen).
TABLE 1
Restriction enzyme    Number of cleavage sites    Fragment sizes (kbp)
BamH I                2                           0.4, 5.0
EcoR I                2                           0.3, 5.1
Kpn I                 1                           5.4
Pvu II                1                           5.4
Sac I                 1                           5.4
Sma I                 1                           5.4
[0029]FIG. 1 shows a restriction enzyme cleavage map for pRET1100. This plasmid was sequenced by a publicly known method (using a fluorescent automatic sequencer, for example) and its full nucleotide sequence was revealed to be 5444 bp set forth as SEQ ID NO: 73 of the Sequence Listing.
[0030]The second plasmid of the invention is also a plasmid isolated from the genus Rhodococcus, or its derivative. Specifically, it may be isolated from, for example, Rhodococcus rhodnii JCM3203, has a size of approximately 5.8 kbp and is a circular plasmid cleavable by the restriction enzymes shown in Table 2. This plasmid is designated as pRET1000.
TABLE 2
Restriction enzyme    Number of cleavage sites    Fragment sizes (kbp)
BamH I                2                           2.0, 3.8
Pvu II                4                           0.1, 1.4, 2.0, 2.3
Sac I                 3                           0.9, 1.0, 3.9
Sma I                 4                           0.1, 1.2, 1.6, 2.9
[0031]FIG. 2 shows a restriction enzyme cleavage map for pRET1000. This plasmid was also sequenced by a publicly known method and its full nucleotide sequence was revealed to be 5813 bp set forth as SEQ ID NO: 74 of the Sequence Listing.
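As an illustrative cross-check of Tables 1 and 2 (not part of the original disclosure), the following Python sketch verifies that the fragment sizes listed for each enzyme add up to the stated plasmid size and that the number of fragments equals the number of cleavage sites, as expected for a circular molecule. The function names and the 0.05 kbp tolerance are assumptions made for this example.

```python
# Fragment sizes (kbp) as listed in Table 1 (approx. 5.4 kbp plasmid)
# and Table 2 (pRET1000, approx. 5.8 kbp).
TABLE_1 = {
    "BamH I": [0.4, 5.0],
    "EcoR I": [0.3, 5.1],
    "Kpn I": [5.4],
    "Pvu II": [5.4],
    "Sac I": [5.4],
    "Sma I": [5.4],
}
TABLE_2 = {
    "BamH I": [2.0, 3.8],
    "Pvu II": [0.1, 1.4, 2.0, 2.3],
    "Sac I": [0.9, 1.0, 3.9],
    "Sma I": [0.1, 1.2, 1.6, 2.9],
}


def cleavage_site_counts(table):
    """On a circular molecule, n cuts yield n fragments."""
    return {enzyme: len(fragments) for enzyme, fragments in table.items()}


def fragments_sum_to(table, plasmid_kbp, tolerance=0.05):
    """True if every enzyme's fragments add up to the plasmid size.

    The tolerance (hypothetical value) absorbs rounding in the tables.
    """
    return all(
        abs(sum(fragments) - plasmid_kbp) <= tolerance
        for fragments in table.values()
    )


if __name__ == "__main__":
    print(cleavage_site_counts(TABLE_1))
    print(fragments_sum_to(TABLE_1, 5.4), fragments_sum_to(TABLE_2, 5.8))
```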
[0032]The plasmids of the invention (natural- or wild-types) are circular plasmids that can also be defined by the restriction enzyme cleavage patterns shown in Tables 1 and 2. Thus, the present invention encompasses the following two types of circular plasmids.
[0033](1) A circular plasmid derived from a Rhodococcus strain, characterized by having a size of approximately 5.4 kbp and possessing the following restriction enzyme cleavage sites: BamH I:2, EcoR I:2, Kpn I:1, Pvu II:1, Sac I:1 and Sma I:1.
[0034](2) A circular plasmid derived from a Rhodococcus strain, characterized by having a size of approximately 5.8 kbp and possessing the following restriction enzyme cleavage sites: BamH I:2, Pvu II:4, Sac I:3 and Sma I:4.
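A cleavage pattern of this kind can in principle be predicted from a plasmid sequence. The sketch below is a minimal illustration only: it locates recognition sites on a circular sequence (including a site that spans the arbitrary start/end junction) and derives the resulting fragment lengths. The recognition sequences are the commonly cited ones for these enzymes and are not stated in this document; the toy sequence and function names are hypothetical, and the sequences of SEQ ID NO: 73 and 74 are not reproduced here.

```python
# Commonly cited recognition sequences (an assumption of this example;
# the document itself gives only site counts and fragment sizes).
RECOGNITION_SITES = {
    "BamH I": "GGATCC",
    "EcoR I": "GAATTC",
    "Kpn I": "GGTACC",
    "Pvu II": "CAGCTG",
    "Sac I": "GAGCTC",
    "Sma I": "CCCGGG",
}


def site_positions(circular_seq, site):
    """Start indices of every occurrence of `site` on a circular sequence."""
    seq = circular_seq.upper()
    # Append the first len(site)-1 bases so that sites spanning the
    # start/end junction of the circle are also found.
    extended = seq + seq[: len(site) - 1]
    return [i for i in range(len(seq)) if extended.startswith(site, i)]


def fragment_sizes(circular_seq, site):
    """Sorted fragment lengths after complete digestion; [] if uncut."""
    n = len(circular_seq)
    positions = site_positions(circular_seq, site)
    sizes = []
    for k, start in enumerate(positions):
        nxt = positions[(k + 1) % len(positions)]
        # Distance to the next site around the circle; a single site
        # gives 0, which corresponds to one full-length linear fragment.
        sizes.append((nxt - start) % n or n)
    return sorted(sizes)


if __name__ == "__main__":
    # Hypothetical 30-base circle: one EcoR I site, two BamH I sites
    # (one of which spans the junction between the last and first base).
    toy = "TCCAAAAGAATTCAAAAGGATCCTTTTGGA"
    for enzyme, site in RECOGNITION_SITES.items():
        print(enzyme, site_positions(toy, site), fragment_sizes(toy, site))
```

Because the same enzyme cuts at the same offset within every copy of its site, distances between site start positions equal distances between cut points, so the fragment lengths do not depend on where within the site the cut occurs.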
[0035]As a result of analysis of the nucleotide sequences of plasmids pRET1100 and pRET1000 (i.e., SEQ ID NO: 73 and SEQ ID NO: 74), there is predicted the existence of a group of nucleotide sequences (open reading frames, hereinafter "orf") coding for proteins for DNA replication or other functions.
[0036]In the relevant technical field, "DNA replication" refers to using DNA itself as template to form two double-stranded DNA molecules exactly identical to the existing double-stranded DNA (parent DNA). The replication mechanism consists of three stages: initiation from the starting point of replication (replication origin), DNA chain elongation and termination. During replication, a portion of the DNA double strand is unwound and new DNA strands are synthesized complementary to each single strand. The double strand is unwound by DNA helicase and helix-destabilizing proteins (also known as single-strand DNA-binding proteins), and the unwound portion is referred to as the replication fork. The template DNA in the direction from 3' to 5' toward the replication fork is the "leading strand", and the one in the direction from 5' to 3' is the "lagging strand". DNA polymerase extends the DNA strand in the direction from 5' to 3'. Therefore, when the leading strand is the template, DNA is synthesized in the direction of the replication fork. However, when the opposite lagging strand is the template, the DNA strand must be extended in the direction away from the replication fork. Consequently, replication of the lagging strand is accomplished in fragments of about 200 bases, known as Okazaki fragments. Approximately every 200 bases, RNA primase uses the DNA as template to synthesize 10 bases of RNA in the direction from 5' to 3'. With this RNA as primer, DNA polymerase synthesizes a DNA strand in the direction from 5' to 3' on the lagging strand as template. The replicated DNA fragment of approximately 200 bases is then joined to the adjacent DNA strand from which the RNA has been removed. In this replication mechanism, several proteins including DNA helicase and helix-destabilizing protein work together to form the replicating machinery. Other proteins involved include DNA topoisomerase (which prevents twisting during the DNA replication), replication initiation proteins and replication termination proteins. The DNA replication mechanism is described in detail in, for example, "Saibou no Bunshiseibutsugaku [Molecular Biology of the Cell]", 3rd Edition, translated by Keiko Nakamura et al., pp. 251-262, Kyoikusha, 1996.
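The directionality described above can be illustrated with a short Python sketch, offered only as a simplified model under stated assumptions. A new strand written 5' to 3' is the reverse complement of its template written 5' to 3', and discontinuous synthesis is modeled by splitting a template length into pieces of about 200 bases, the figure given in the paragraph above. The function names are hypothetical, and primers, ligation and the enzymes themselves are not modeled.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def new_strand(template_5to3):
    """Return the newly synthesized strand, written 5' to 3'.

    The polymerase reads the template 3' to 5', so the product written
    5' to 3' is the reverse complement of the template written 5' to 3'.
    """
    return template_5to3.upper().translate(COMPLEMENT)[::-1]


def lagging_strand_fragments(template_length, fragment_length=200):
    """Split a lagging-strand template into fragment lengths.

    Assumes a fixed fragment length (200 bases by default, following the
    figure quoted in the text); real fragment lengths vary.
    """
    full, rest = divmod(template_length, fragment_length)
    return [fragment_length] * full + ([rest] if rest else [])


if __name__ == "__main__":
    print(new_strand("ATGC"))
    print(lagging_strand_fragments(450))
```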
[0037]Upon analysis of the nucleotide sequences of the plasmids pRET1100 and pRET1000, they were found to include sequences of AT-rich homologous or analogous repeats and a sequence thought to have a DNA secondary structure, i.e. a nucleotide sequence predicted to be a DNA replication region (a nucleotide sequence region recognized by proteins involved in DNA replication or a region including the DNA replication origin), in the vicinity of the aforementioned orf relating to DNA replication.
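As a rough illustration of how AT-rich stretches might be located in a sequence, the following sketch scans fixed-length windows and reports those whose A+T fraction meets a threshold. The window length, the threshold and the toy sequence are all hypothetical choices for this example; the document does not state how the analysis was actually performed, and repeat detection and secondary-structure prediction are not covered.

```python
def at_fraction(seq):
    """Fraction of bases in `seq` that are A or T."""
    seq = seq.upper()
    return sum(base in "AT" for base in seq) / len(seq)


def at_rich_windows(seq, window=20, threshold=0.7):
    """Start indices of windows whose A+T fraction is >= threshold.

    The sequence is treated as linear for simplicity; window and
    threshold defaults are illustrative assumptions.
    """
    hits = []
    for start in range(len(seq) - window + 1):
        if at_fraction(seq[start:start + window]) >= threshold:
            hits.append(start)
    return hits


if __name__ == "__main__":
    # Hypothetical sequence: a 10-base AT-only stretch flanked by GC.
    toy = "GCGCGCGCGC" + "ATATATATAT" + "GCGCGCGCGC"
    print(at_rich_windows(toy, window=10, threshold=1.0))
```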
[0038]DNA replication requires a DNA replication region and a region coding for a protein involved in DNA replication (hereinafter referred to as "DNA replication-related protein"). According to the present invention it is possible to obtain data relating to the nucleotide sequences of these regions for both plasmids pRET1100 and pRET1000.
[0039]First, the nucleotide sequences set forth as SEQ ID NO: 35-37 were identified as DNA replication regions for plasmid pRET1100. As regions coding for proteins related to DNA replication there were identified the nucleotide sequences set forth as SEQ ID NO: 1-3 (orf1), the nucleotide sequence set forth as SEQ ID NO: 4 (orf2), the nucleotide sequences set forth as SEQ ID NO: 5-16 (orf3), the nucleotide sequences set forth as SEQ ID NO: 17-21 (orf4), the nucleotide sequences set forth as SEQ ID NO: 22-26 (orf5), the nucleotide sequence set forth as SEQ ID NO: 27 or 28 (orf6), the nucleotide sequence set forth as SEQ ID NO: 29 or 30 (orf7), the nucleotide sequence set forth as SEQ ID NO: 31 or 32 (orf8), and the nucleotide sequence set forth as SEQ ID NO: 33 or 34 (orf9).
[0040]Construction of a plasmid capable of DNA replication from pRET1100 requires that the recombinant plasmid have at least one DNA replication region and at least one coding region (orf) for a DNA replication-related protein. Thus, the (recombinant) plasmids of the invention are characterized by comprising at least one DNA replication region and at least one coding region for a DNA replication-related protein. The coding region for a DNA replication-related protein preferably has a nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 1, 4, 14, 17 and 22.
[0041]The region of the nucleotide sequence set forth as SEQ ID NO: 76 has been suggested as a promoter involved in expression of replication-related proteins, and the plasmids of the invention preferably comprise a promoter region having the nucleotide sequence set forth as SEQ ID NO: 76.
[0042]For plasmid construction, the DNA fragments are appropriately selected based on the aforementioned nucleotide sequence data. The present invention also encompasses derivatives or functional (DNA-replicating) fragments of the plasmids.
[0043]Next, the nucleotide sequences set forth as SEQ ID NO: 70-72 were identified as DNA replication regions for plasmid pRET1000. As regions coding for proteins related to DNA replication there were identified the nucleotide sequences set forth as SEQ ID NO: 38-41 (orf10), the nucleotide sequence set forth as SEQ ID NO: 42 or 43 (orf11), the nucleotide sequence set forth as SEQ ID NO: 44 (orf12), the nucleotide sequence set forth as SEQ ID NO: 45 or 46 (orf13), the nucleotide sequences set forth as SEQ ID NO: 48-56 (orf14), the nucleotide sequence set forth as SEQ ID NO: 51 or 52 (orf15), the nucleotide sequence set forth as SEQ ID NO: 53 or 54 (orf16), the nucleotide sequence set forth as SEQ ID NO: 55 (orf17), the nucleotide sequences set forth as SEQ ID NO: 56-60 (orf18), the nucleotide sequence set forth as SEQ ID NO: 61 (orf19), the nucleotide sequence set forth as SEQ ID NO: 62 (orf20), and the nucleotide sequences set forth as SEQ ID NO: 63-69 (orf21).
[0044]Construction of a plasmid capable of DNA replication from pRET1000 requires that the recombinant plasmid have at least one DNA replication region and at least one coding region (orf) for a DNA replication-related protein. Thus, the (recombinant) plasmids of the invention are characterized by comprising at least one DNA replication region and at least one coding region for a DNA replication-related protein. The coding region for a DNA replication-related protein preferably has a nucleotide sequence selected from the group consisting of the nucleotide sequences set forth as SEQ ID NO: 40, 42, 44, 45, 53, 55, 56, 61, 62 and 69.
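The minimum requirement stated for both plasmids (at least one DNA replication region plus at least one coding region for a DNA replication-related protein) can be expressed as a simple set check. The sketch below is illustrative only: it uses the SEQ ID NO groupings given in the text, takes the preferred coding-region sequences as the coding set, and uses hypothetical names throughout. Passing the check says nothing about whether a real construct would replicate.

```python
# SEQ ID NO groupings taken from the description above.
REGIONS = {
    "pRET1100": {
        "replication_region": {35, 36, 37},
        "coding_region": {1, 4, 14, 17, 22},
    },
    "pRET1000": {
        "replication_region": {70, 71, 72},
        "coding_region": {40, 42, 44, 45, 53, 55, 56, 61, 62, 69},
    },
}


def meets_minimum_requirement(seq_ids, source_plasmid):
    """True if the construct carries at least one replication region and
    at least one preferred replication-related coding region from the
    named source plasmid."""
    groups = REGIONS[source_plasmid]
    present = set(seq_ids)
    return bool(present & groups["replication_region"]) and bool(
        present & groups["coding_region"]
    )


if __name__ == "__main__":
    print(meets_minimum_requirement({35, 1, 76}, "pRET1100"))
    print(meets_minimum_requirement({70}, "pRET1000"))
```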
[0045]The regions with the nucleotide sequences set forth as SEQ ID NO: 67 and 47 are homologous with mobilization proteins, and have been implicated in mobilization. The region with the nucleotide sequence set forth as SEQ ID NO: 75 has been implicated in gene expression of mobilization protein and suggested as a recognition site for an expressed protein. Thus, the plasmids of the invention preferably include mobilization protein regions having the nucleotide sequences set forth as SEQ ID NO: 67 and 47, or include a region involved in mobilization having the nucleotide sequence set forth as SEQ ID NO: 75.
[0046]For plasmid construction, the DNA fragments are appropriately selected based on the aforementioned nucleotide sequence data. The present invention also encompasses derivatives or functional (DNA-replicating) fragments of the plasmids.
[0047]The plasmids or DNA fragments of the invention may also contain nucleotide sequences with a substitution, deletion or insertion of one or a plurality of nucleotides in a DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, so long as the function of each region is not impaired.
[0048]The shuttle vectors of the invention may be any which comprise a plasmid or DNA fragment having a DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, and a DNA region that is replicable in E. coli, and which are replicable in the genus Rhodococcus and E. coli, such as those having the nucleotide sequences set forth as SEQ ID NO: 89 to 100. The shuttle vectors of the invention may also have nucleotide sequences with one or a plurality of nucleotide substitutions, deletions or insertions in the aforementioned nucleotide sequences, so long as they are replicable in the genus Rhodococcus and E. coli.
[0049]The "plurality" referred to above will differ depending on the type of region, and specifically may be 2-1100, preferably 2-800, more preferably 2-300, even more preferably 2-100, yet more preferably 2-20 and most preferably 2-10.
[0050]As a plasmid or DNA fragment having substantially the same nucleotide sequence as the aforementioned DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, there may be mentioned specifically, a nucleotide sequence which hybridizes with a DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, under stringent conditions. Here "stringent conditions" are conditions under which specific hybrids are formed and non-specific hybrids are not formed. While it is difficult to precisely quantify the conditions, one example is a set of conditions that permit hybridization of DNA with high homology, such as 80% or greater, preferably 90% or greater or more preferably 95% or greater homology, while not permitting hybridization of DNA with lower homology. More specifically, there may be mentioned hybridization conditions in which washing is carried out at a salt concentration corresponding to ordinary Southern hybridization washing at 60° C., 1×SSC, 0.1% SDS, or preferably 0.1×SSC, 0.1% SDS. When a DNA fragment with a length of approximately 300 bp is used as a portion of the DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, the hybridization washing conditions may be 50° C., 2×SSC, 0.1% SDS.
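The homology thresholds given above (80%, 90% and 95%) can be illustrated with a minimal sketch. It assumes that percent identity over two pre-aligned sequences of equal length is an acceptable stand-in for "homology", which the text does not state; the function names and tier labels are hypothetical, and actual hybridization behavior must be determined experimentally.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Gap characters ('-') never count as matches. This is an illustrative
    proxy for homology, not a prediction of hybridization.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    if not a:
        raise ValueError("sequences must be non-empty")
    matches = sum(
        1 for x, y in zip(a.upper(), b.upper()) if x == y and x != "-"
    )
    return 100.0 * matches / len(a)


def homology_tier(identity: float) -> str:
    """Map a percent identity onto the example thresholds in the text."""
    if identity >= 95.0:
        return "more preferred"
    if identity >= 90.0:
        return "preferred"
    if identity >= 80.0:
        return "high homology"
    return "below example threshold"


if __name__ == "__main__":
    # Toy sequences chosen for illustration only; not taken from any SEQ ID NO.
    identity = percent_identity("ACGTACGTAC", "ACGTACGTAA")
    print(identity, homology_tier(identity))
```

In practice an alignment tool would be needed to produce the aligned inputs; the sketch only covers the thresholding step.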
[0051]The aforementioned plasmid or DNA fragment having substantially the same nucleotide sequence as the aforementioned DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, may be obtained by, for example, modification of a DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, by site-directed mutagenesis so as to have a substitution, deletion or insertion of nucleotides at a specific site. Such modified DNA may also be obtained by mutation treatment known in the prior art. As mutation treatments there may be mentioned methods of in vitro treatment of DNA including a DNA replication region, DNA replication-related protein coding region, promoter region, mobilization protein region or mobilization-related region, or a portion thereof, with hydroxylamine or the like, and methods of treating a microbe possessing the DNA above, such as the genus Escherichia, with ultraviolet rays or with a mutagenic agent ordinarily used for mutagenesis such as N-methyl-N'-nitro-N-nitrosoguanidine (NTG) or EMS.
[0052]Nucleotide substitutions, deletions or insertions as mentioned above include those found in naturally occurring mutants or variants due to differences in Rhodococcus strains.
[0053]A shuttle vector of the invention includes a DNA fragment (A) as the aforementioned plasmid or portion thereof, and a DNA region (B) which is replicable in E. coli. In some cases it is preferred for the shuttle vector to comprise a DNA region including a drug resistance gene. In the relevant technical field, a "shuttle vector" is a vector which comprises the DNA replication mechanism for two different cell types, and preferably also a drug resistance gene or the like as a selective marker, allowing its auto-replication in the two different cell types. The DNA fragment (A) as the aforementioned plasmid or portion thereof is a DNA region that is replicable in the genus Rhodococcus. The DNA region (B) which is replicable in E. coli may be a full plasmid or a portion thereof, so long as it can be replicated and amplified in E. coli. As such DNA regions that are replicable in E. coli there may be used, for example, the plasmids pUC18, pHSG299 and pHSG398.
[0054]When the shuttle vector of the invention includes a drug resistance gene, the preferred ones are the ampicillin resistance gene, kanamycin resistance gene and chloramphenicol resistance gene, but there are no particular restrictions on the type of drug so long as the gene is expressed in the genus Rhodococcus and E. coli as hosts and confers drug resistance to the host cells, in order to allow verification of the presence of plasmids in the two genera based on resistance to the drug. Also, a plurality of such drug resistance genes may be used in combination.
[0055]The shuttle vector preferably contains multiple cloning sites (multicloning sites), and the cloning sites and drug resistance gene may be derived from, for example, an E. coli plasmid. That is, a publicly known E. coli plasmid such as one listed above may be cleaved with an appropriate restriction endonuclease, and a DNA region containing the cloning sites and drug resistance gene may be constructed and ligated with another DNA fragment (a DNA region which is replicable in the genus Rhodococcus).
[0056]As an illustration, an outline of shuttle vector construction is shown in FIGS. 3 to 5. The shuttle vectors may be constructed by treating the aforementioned plasmids and E. coli plasmids with suitable restriction endonucleases and then ligating them. In this manner, the present inventors constructed 18 shuttle vectors (Table 5) using the Rhodococcus plasmids pRET1000, pRET1100 or pRET1200, and the E. coli plasmids pUC18, pHSG299 or pHSG398.
[0057]The shuttle vectors of the invention are replicable in the genus Rhodococcus and E. coli as hosts, and are industrially useful. The Rhodococcus and E. coli strains transformed by the shuttle vectors of the invention, as well as other microbial transformants, are useful in this way and such transformants are also encompassed by the scope of the invention.
[0058]A vector of the invention is characterized by being constructed using a shuttle vector of the invention. Specifically, it is a vector in which the target DNA to be introduced has been inserted into the shuttle vector of the invention. The DNA to be introduced and the shuttle vector of the invention are treated with appropriate restriction endonucleases and then ligated to construct the vector. The vector may then be used to obtain transformants having the desired DNA transferred therein.
[0059]As examples of DNA to be inserted there may be mentioned aminoketone asymmetric reductase genes and coenzyme-regenerating system enzyme genes. Aminoketone asymmetric reductase genes are genes coding for aminoketone asymmetric reductases as described in WO02/070714, and more specifically, DNA coding for a protein comprising the amino acid sequence set forth as SEQ ID NO: 78 (aminoketone asymmetric reductase derived from R. erythropolis MAK-34), and particularly DNA comprising the nucleotide sequence set forth as SEQ ID NO: 79. The entirety of the content described in WO02/070714 is incorporated herein by reference.
[0060]An aminoketone asymmetric reductase is any having the properties described in WO02/070714, and includes a protein having the amino acid sequence set forth as SEQ ID NO: 78 of the Sequence Listing, as well as proteins having amino acid sequences obtained by deletion, insertion, substitution or addition of one or more amino acids in the aforementioned amino acid sequence, and exhibiting aminoketone asymmetric reduction activity. Aminoketone asymmetric reduction activity is activity of producing an optically active aminoalcohol represented by general formula (2) above using an α-aminoketone represented by general formula (1) above as the substrate.
[0061]There are no particular restrictions on the methods of deletion, insertion, substitution and addition, and any publicly known methods may be employed. For example, there may be mentioned the methods described in "Zoku Seikagaku Jikken Kouza 1, Idenshi Kenkyuhou II", edited by the Japanese Biochemical Society, p105 (Hirose, S.), Tokyo Kagaku Dojin (1986); "Shin Seikagaku Jikken Kouza 2, Kakusan III (Recombinant DNA Technology)", edited by the Japanese Biochemical Society, p. 233 (Hirose, S.), Tokyo Kagaku Dojin (1992); R. Wu, L. Grossman ed., "Methods in Enzymology", Vol. 154, p. 350 & p. 367, Academic Press, New York (1987); R. Wu, L. Grossman, ed., "Methods in Enzymology", Vol. 100, p. 457 & p. 468, Academic Press, New York (1983); J. A. Wells et al., "Gene", Vol. 34, p. 315 (1985); T. Grundstroem et al., "Nucleic Acids Res", Vol. 13, p. 3305 (1985); J. Taylor et al., "Nucleic Acids Res.", Vol. 13, p. 8765 (1985); R. Wu, ed., "Methods in Enzymology", Vol. 155, p. 568, Academic Press, New York (1987); and A. R. Oliphant et al., "Gene", Vol. 44, p. 177 (1986). As specific examples, there may be mentioned the site-directed mutagenesis method (site-specific mutagenesis method) utilizing synthetic oligonucleotides, the Kunkel method, the dNTP[αS] method (Eckstein method), and the region-directed mutagenesis method using sulfurous acid or nitrous acid.
[0062]Sugar chains are attached to the majority of proteins, and substitution of one or a plurality of amino acids can modify the attachment of sugar chains. Thus, the aminoketone asymmetric reductases of the invention also include proteins having the amino acid sequence set forth as SEQ ID NO: 78 of the Sequence Listing and having modifications of sugar chains, so long as they exhibit the aforementioned aminoketone asymmetric reduction activity.
[0063]The aminoketone asymmetric reductases of the invention may also have modifications of their amino acid residues by chemical methods, or their derivatives may be enhanced by modification or partial degradation using peptidase enzymes such as pepsin, chymotrypsin, papain, bromelain, endopeptidase and exopeptidase.
[0064]When the aminoketone asymmetric reductases of the invention are produced by a gene recombinant method, a fusion protein may be expressed and then converted or processed into a protein having biological activity which is substantially equivalent to a natural aminoketone asymmetric reductase either in vivo or ex vivo. In this case, a fusion production method ordinarily employed for genetic engineering may be used, and the fusion protein may be purified by affinity chromatography or the like, utilizing the fused portion thereof. Modification and enhancement of protein structures may be carried out with reference to "Shin Seikagaku Jikken Kouza 1, Tanpakushitsu VII, Tanpakushitsu Kogaku", edited by the Japanese Biochemical Society, Tokyo Kagaku Dojin (1993), by the methods described therein, the methods described in literature cited therein, or methods which are essentially equivalent thereto.
[0065]The aminoketone asymmetric reductase of the invention may also differ from naturally occurring forms in the identities of one or more of the amino acid residues or in the positions of one or more of the amino acid residues. The present invention also encompasses deletion analogues with deletion of one or more (for example, 1-80, preferably 1-60, more preferably 1-40, even more preferably 1-20 and especially 1-10) amino acid residues, substitution analogues with substitution of one or more (for example, 1-80, preferably 1-60, more preferably 1-40, even more preferably 1-20 and especially 1-10) amino acid residues or addition analogues with addition of one or more (for example, 1-80, preferably 1-60, more preferably 1-40, even more preferably 1-20 and especially 1-10) amino acid residues peculiar to natural aminoketone asymmetric reductases. Also encompassed are enzymes having the domain structure characteristic of natural aminoketone asymmetric reductases. There may also be mentioned isomers of the aminoketone asymmetric reductases.
[0066]So long as the domain structure characteristic of natural aminoketone asymmetric reductases is maintained, all mutants above are also encompassed among the aminoketone asymmetric reductases of the invention. In addition, it is assumed that enzymes having a primary structural conformation substantially equivalent to natural aminoketone asymmetric reductases of the invention, or a portion thereof, as well as enzymes having biological activity substantially equivalent to natural aminoketone asymmetric reductases, may also be included. Naturally occurring mutants may also be mentioned. The aminoketone asymmetric reductases of the invention may be separated and purified in the manner explained below. The present invention encompasses DNA fragments coding for the aforementioned polypeptides, polypeptides of aminoketone asymmetric reductases having all or some of the natural features, and DNA fragments coding for analogues or derivatives thereof. The nucleotides of the aminoketone asymmetric reductases may be modified (for example, with addition, deletions or substitutions), and such modified forms are also encompassed by the invention.
[0067]An aminoketone asymmetric reductase gene according to the invention is a nucleic acid coding for any of the aforementioned aminoketone asymmetric reductases. As representative examples there may be mentioned nucleic acid coding for a protein having the amino acid sequence set forth as SEQ ID NO: 78 of the Sequence Listing, and especially nucleic acid having the nucleotide sequence set forth as SEQ ID NO: 79, but since several nucleotide sequences (codons) can code for each amino acid, there exist numerous nucleic acids coding for a protein having the amino acid sequence set forth as SEQ ID NO: 78. Thus, all such nucleic acids are also encompassed among the aminoketone asymmetric reductase genes of the invention. Here, "coding for a protein" means that, when the DNA consists of two strands, one of the two complementary strands has a nucleotide sequence coding for the protein, and therefore the nucleic acids of the invention include nucleic acids comprising nucleotide sequences directly coding for the amino acid sequence set forth as SEQ ID NO: 78 and nucleic acids comprising nucleotide sequences which are complementary thereto. In addition, the aminoketone asymmetric reductase genes of the invention may be nucleic acids which hybridize with nucleic acid comprising a nucleotide sequence complementary to SEQ ID NO: 79 under stringent conditions, and which code for proteins with aminoketone asymmetric reduction activity. Here, "stringent conditions" has the same definition as explained above.
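The statement that numerous nucleic acids can code for the same amino acid sequence, and that the complementary strand is also encompassed, can be made concrete with a short sketch. It assumes the standard genetic code (the sense-codon assignments of which are shared by the bacterial code); the peptides used are hypothetical toy examples and are not taken from SEQ ID NO: 78, and the function names are illustrative.

```python
import math

# Number of codons per amino acid in the standard genetic code
# (one-letter amino acid codes; the 61 sense codons in total).
CODON_DEGENERACY = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def count_coding_sequences(peptide: str) -> int:
    """Number of distinct nucleotide sequences encoding the peptide."""
    try:
        return math.prod(CODON_DEGENERACY[aa] for aa in peptide.upper())
    except KeyError as exc:
        raise ValueError(f"unknown amino acid code: {exc.args[0]}") from None


def reverse_complement(dna: str) -> str:
    """Sequence of the complementary strand, read 5' to 3'."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Hypothetical three-residue peptide Met-Leu-Ser: 1 * 6 * 6 codon choices.
    print(count_coding_sequences("MLS"))
    # Complementary strand of a toy coding sequence.
    print(reverse_complement("ATGGCC"))
```

Because the count is a product over residues, it grows exponentially with protein length, which is why the text refers to all such nucleic acids collectively rather than enumerating them.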
[0068]The coenzyme-regenerating system enzyme gene may be one for various dehydrogenases, specifically, glucose dehydrogenase, glucose-6-phosphate dehydrogenase, aldehyde dehydrogenases, alcohol dehydrogenases, organic acid dehydrogenases and amino acid dehydrogenases. More specifically, there may be suitably used acetaldehyde dehydrogenase, ethanol dehydrogenase, propanol dehydrogenase, glycerol dehydrogenase, formate dehydrogenase, acetate dehydrogenase, butyrate dehydrogenase, lactate dehydrogenase, maleate dehydrogenase and glutamate dehydrogenase.
[0069]A transformant according to the invention is characterized by comprising the aforementioned vector. The transformant is obtained by introducing the vector into host cells. The vector introduction method may be a publicly known method, such as the calcium phosphate method, lipofection, electroporation, microinjection or the like.
[0070]For example, a transformant of the invention comprising a vector having an aminoketone asymmetric reductase gene inserted therein has aminoketone asymmetric reduction activity, and may be applied for an aminoketone asymmetric reductase production method or optically active aminoalcohol production method as described below.
[0071]The method for production of an aminoketone asymmetric reductase of the invention is characterized by comprising a culturing step in which transformants containing a vector having an aminoketone asymmetric reductase gene inserted therein are cultured in medium which allows growth of the transformants, and a purification step in which the aminoketone asymmetric reductase is purified from the transformants obtained in the culturing step.
[0072]The method for culturing may be a publicly known method with no particular restrictions so long as it permits growth of the cells used, and ordinarily a liquid medium containing a carbon source, nitrogen source and other nutrients is used. As carbon sources for the medium there may be used any of those that can be utilized by the cells. Specifically, there may be mentioned sugars such as glucose, fructose, sucrose, dextrin, starch and sorbitol, alcohols such as methanol, ethanol and glycerol, organic acids such as fumaric acid, citric acid, acetic acid and propionic acid, and their salts, hydrocarbons such as paraffin, and mixtures thereof. As nitrogen sources there may be used any of those that can be utilized by the cells. Specifically, there may be mentioned ammonium salts of inorganic acids such as ammonium chloride, ammonium sulfate and ammonium phosphate; ammonium salts of organic acids such as ammonium fumarate and ammonium citrate; nitric acid salts such as sodium nitrate and potassium nitrate; and inorganic or organic nitrogenous compounds such as meat extract, yeast extract, malt extract and peptone, as well as mixtures thereof. The medium may also contain appropriately added nutrient sources ordinarily used for culturing, such as inorganic salts, trace metal salts and vitamins. When necessary, there may also be added to the medium substances that promote cell growth and buffering substances effective for maintaining the pH of the medium.
[0073]The culturing of the cells may be carried out under conditions suitable for growth. Specifically, the medium pH may be 3-10, preferably 4-9, and the temperature may be 0-50° C., preferably 20-40° C. The cell culturing may be conducted either under aerobic or anaerobic conditions. The culturing time is preferably 10-150 hours, but should be appropriately determined for the type of cells used.
[0074]The culture solution of the cells cultured in the manner described above is filtered or centrifuged and the cells are rinsed with water or buffer solution. The rinsed cells are suspended in a suitable amount of buffer solution for disruption of the cells. The method of disruption is not particularly restricted but as examples there may be mentioned mechanical disruption with a mortar, Dynomill, French press, ultrasonic cell disrupter or the like. The aminoketone asymmetric reductase in the cell-free extract obtained by filtration or centrifugation of the solid matter from the cell disruptate is recovered by an ordinary enzyme isolating method.
[0075]There are no particular restrictions on the method for isolation of the enzyme and any publicly known method may be employed, but as examples there may be mentioned purification by salting out such as ammonium sulfate precipitation; gel filtration methods using Sephadex and the like; ion-exchange chromatography methods using carriers with diethylaminoethyl groups or carboxymethyl groups; hydrophobic chromatography using carriers with hydrophobic groups such as butyl, octyl and phenyl; dye gel chromatography methods; electrophoresis methods; dialysis; ultrafiltration methods; affinity chromatography methods; high performance liquid chromatography methods and the like.
[0076]The enzyme may also be used as an immobilized enzyme. There are no particular restrictions on the method and any publicly known method may be employed, among which there may be mentioned immobilization of the enzyme or the enzyme-producing cells, and the immobilization may be accomplished by a carrier bonding method such as a covalent bonding method or adsorption method, a crosslinking method, entrapment method or the like. A condensing agent such as glutaraldehyde, hexamethylene diisocyanate or hexamethylene diisothiocyanate may also be used if necessary. Other immobilizing methods include: a monomer method in which a monomer is gelled by polymerizing reaction; a prepolymer method in which molecules larger than monomers are polymerized; a polymer method in which a polymer is gelled; immobilization using polyacrylamide; immobilization using natural polymers such as alginic acid, collagen, gelatin, agar and κ-carrageenan; and immobilization using synthetic polymers such as photosetting resins and urethane polymers.
[0077]The enzyme purified in this manner is judged as having been adequately purified if a single band is confirmed in electrophoresis (SDS-PAGE, etc.).
[0078]A method for production of an optically active aminoalcohol according to the invention is characterized by producing an optically active aminoalcohol compound represented by the following general formula (2), which compound exhibits the desired optical activity, by reacting an aminoketone asymmetric reductase obtained by the production method of the invention with an enantiomeric mixture of an α-aminoketone compound represented by the following general formula (1) or a salt thereof.
##STR00008##
wherein X may be the same or different and represents at least one species selected from the group consisting of halogen, lower alkyl, hydroxyl optionally protected with a protecting group, nitro and sulfonyl; n represents an integer of 0 to 3; R1 represents lower alkyl; R2 and R3 may be the same or different and represent at least one species selected from the group consisting of hydrogen and lower alkyl; and * represents asymmetric carbon.
##STR00009##
wherein X, n, R1, R2, R3 and * have the same definitions as above.
[0079]First, the α-aminoketone compound represented by general formula (1) according to the invention will be explained.
[0080]The substituent X is as follows. As the aforementioned halogen there may be mentioned fluorine, chlorine, bromine and iodine.
[0081]As lower alkyl there are preferred C1-6 alkyl, among which there may be mentioned methyl, ethyl, propyl, isopropyl, butyl, isobutyl, s-butyl, t-butyl, pentyl, isopentyl, hexyl and the like. These may have straight-chain or branched structures. As substituents they may contain halogens such as fluorine or chlorine, or hydroxyl, alkyl, amino, alkoxy and the like.
[0082]As protecting groups for hydroxyl optionally protected with a protecting group there may be mentioned groups that can be removed by treatment with water, groups that can be removed by acid or weak base treatment, groups that can be removed by hydrogenation or groups that can be removed with Lewis acid catalysts and thiourea, and such protecting groups include optionally substituted acyl, optionally substituted silyl, alkoxyalkyl, optionally substituted lower alkyl, benzyl, p-methoxybenzyl, 2,2,2-trichloroethoxycarbonyl, allyloxycarbonyl, trityl and the like.
[0083]The aforementioned acyl groups include acetyl, chloroacetyl, dichloroacetyl, pivaloyl, benzoyl, p-nitrobenzoyl and the like. They may contain hydroxyl, alkyl, alkoxy, nitro, halogen and the like as substituents. The aforementioned silyl groups include trimethylsilyl, t-butyldimethylsilyl, triarylsilyl and the like. They may contain alkyl, aryl, hydroxyl, alkoxy, nitro, halogen and the like as substituents. The aforementioned alkoxyalkyl groups include methoxymethyl, 2-methoxyethoxymethyl and the like. The aforementioned lower alkyl include C1-6 alkyl, among which there may be mentioned methyl, ethyl, propyl, isopropyl, butyl, isobutyl, s-butyl, t-butyl, pentyl, isopentyl, hexyl and the like. These may have straight-chain or branched structures. As substituents they may contain halogen such as fluorine or chlorine, or hydroxyl, alkyl, amino, alkoxy and the like.
[0084]X may be nitro or sulfonyl, and specifically there may be mentioned methylsulfonyl and the like.
[0085]The number "n" for X is an integer of 0-3, and is preferably 0.
[0086]R1 in general formula (1) above represents lower alkyl. As lower alkyl there are preferred C1-6 alkyl, among which there may be mentioned methyl, ethyl, propyl, isopropyl, butyl, isobutyl, s-butyl, t-butyl, pentyl, isopentyl, hexyl and the like. These may have straight-chain or branched structures.
[0087]Each of R2 and R3 represents hydrogen or lower alkyl. The lower alkyl includes C1-6 alkyl, among which there may be mentioned methyl, ethyl, propyl, isopropyl, butyl, isobutyl, s-butyl, t-butyl, pentyl, isopentyl, hexyl and the like. These may have straight-chain or branched structures.
[0088]As salts of the aforementioned α-aminoketone compounds there may be mentioned salts of inorganic acids such as hydrochloride, sulfate, nitrate, phosphate and carbonate, and salts of organic acids such as acetic acid and citric acid.
[0089]The α-aminoketone can be easily synthesized by halogenation (for example, bromination) of the α-carbon of a corresponding 1-phenylketone derivative, followed by replacement of the halogen such as bromine with an amine (Ger. (East), 11, 332, Mar. 12, 1956).
[0090]The optically active aminoalcohol represented by general formula (2) above according to the invention will now be explained. In general formula (2), X, n, R1, R2, R3 and * have the same definitions as in general formula (1) above. As β-aminoalcohols having the desired optical activity there may be mentioned (1S,2S)aminoalcohols. As specific examples of (1S,2S)aminoalcohols there may be mentioned d-threo-2-methylamino-1-phenylpropanol (d-pseudoephedrine), d-threo-2-dimethylamino-1-phenylpropanol (d-methylpseudoephedrine), (1S,2S)-α-(1-aminoethyl)-benzyl alcohol (d-norpseudoephedrine), (1S,2S)-1-(p-hydroxyphenyl)-2-methylamino-1-propanol, (1S,2S)-α-(1-aminoethyl)-2,5-dimethoxy-benzyl alcohol, (1S,2S)-1-(m-hydroxyphenyl)-2-amino-1-propanol, (1S,2S)-1-(p-hydroxyphenyl)-2-amino-1-propanol, (1S,2S)-1-phenyl-2-ethylamino-1-propanol, (1S,2S)-1-phenyl-2-amino-1-butanol, (1S,2S)-1-phenyl-2-methylamino-1-butanol and the like.
[0091]The conditions for reaction of the aminoketone asymmetric reductase are not particularly restricted so long as an optically active aminoalcohol represented by general formula (2) having the desired optical activity is produced, but since the enzyme optimum pH is 8.1 and the optimum temperature is 55° C., the reaction is preferably carried out under conditions of pH 7-9 and 30-65° C. temperature.
[0092]A method for production of an optically active aminoalcohol according to the invention is also characterized by producing an optically active aminoalcohol compound represented by the following general formula (2), which compound exhibits the desired optical activity, by reacting a transformant of the invention with an enantiomeric mixture of an α-aminoketone compound represented by the following general formula (1) or a salt thereof.
##STR00010##
[0093]As the reaction conditions for the reaction described above, for example, transformants shake-cultured in liquid medium may be collected, an aqueous aminoketone solution (0.1-10% concentration) added to the obtained cells, and the reaction conducted at a temperature of 20-40° C. for a period of several hours to one day while regulating the pH to between 6 and 8. Upon completion of the reaction, the cells may be separated and the product in the reaction solution isolated to obtain an optically active aminoalcohol. The reaction may be conducted in the same manner for treated transformant cells (dry cells or immobilized cells) or the enzyme or immobilized enzyme obtained from the transformants.
[0094]In the production method for an optically active aminoalcohol of the invention, the reaction may be carried out with further addition of a compound represented by the following general formula (3) or a pharmaceutically acceptable salt or solvate thereof, for more efficient production of the optically active aminoalcohol.
##STR00011##
(wherein A represents the following formula (Y) or (Z))
##STR00012##
(wherein R4 represents hydrogen, optionally substituted C1-3 alkyl, a C5-10 hydrocarbon ring which is bonded to R8 or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R8)
##STR00013##
(wherein R5 represents hydrogen, C1-3 alkyl or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R6 or R9, R6 represents hydrogen, optionally substituted C1-3 alkyl, a C5-10 hydrocarbon ring which is bonded to R8 or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R5 or R9, and R7 represents hydrogen or optionally substituted C1-6 alkyl); R8 represents hydrogen, carboxyl, optionally substituted C1-6 alkyl, a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R4 or a C5-10 hydrocarbon ring which is bonded to R6; R9 represents hydrogen, optionally substituted C1-6 alkyl, optionally substituted C1-6 alkyloxycarbonyl, optionally substituted acyl or a 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms which is bonded to R5 or R6; and R10 represents hydrogen or optionally substituted C1-6 alkyl)
[0095]In general formula (3) above, C1-3 alkyl may be straight-chain or branched, and specifically there may be mentioned methyl, ethyl, n-propyl, isopropyl and the like. C1-6 alkyl may be straight-chain or branched, and specifically there may be mentioned methyl, ethyl, n-propyl, isopropyl, n-butyl, i-butyl, s-butyl, t-butyl, pentyl, hexyl and the like. As C5-10 hydrocarbon rings there may be mentioned cyclopentyl, cyclohexyl, cycloheptyl, cyclooctyl, cyclononyl, cyclodecanyl and the like.
[0096]As heteroatoms for the 5- to 8-membered heterocyclic skeleton containing 1-3 heteroatoms there may be mentioned nitrogen, oxygen, sulfur and the like, among which nitrogen and oxygen are particularly preferred, and as 5- to 8-membered heterocyclic skeletons there may be mentioned pyrrolidine, piperidine, imidazolidine, piperazine, tetrahydrofuran, tetrahydropyran, tetrahydrothiophene, morpholine and the like.
[0097]As C1-6 alkyloxycarbonyl there may be mentioned methyloxycarbonyl, ethyloxycarbonyl, isopropyloxycarbonyl, isobutyloxycarbonyl, t-butyloxycarbonyl and the like. As acyl there may be mentioned formyl, acetyl, propionyl, butyryl, isobutyryl, pivaloyl, benzoyl, valeryl and the like. When the aforementioned C1-3 or C1-6 alkyl, C1-6 alkyloxycarbonyl or acyl have substituents there are no particular restrictions on the types, positions and numbers of substituents, and as examples of substituents there may be mentioned halogen such as fluorine and chlorine, hydroxyl, alkyl, carboxyl, amino, alkoxy, nitro, aryl and the like. As pharmaceutically acceptable salts there may be mentioned salts of inorganic acids such as hydrochloric acid, sulfuric acid, nitric acid and phosphoric acid, salts of organic acids such as acetic acid and citric acid, salts of inorganic bases such as Na, K, Mg, Ca and ammonia, and salts of organic bases such as triethylamine and cyclohexylamine.
[0098]As examples of compounds represented by general formula (3) above there may be mentioned 1-acetylamino-2-hydroxypropane, 1-methylamino-2-hydroxypropane, 1-amino-2-oxopropane, 1-amino-2-hydroxycyclopentane, 1-amino-2,3-dihydroxypropane, L-threonine, 4-amino-3-hydroxybutanoic acid, 1-amino-2-oxocyclohexane, morpholine, 3-hydroxypyrrolidine, 3-hydroxypiperidine, 2-aminomethyl-tetrahydrofuran, 1-(2-hydroxypropyl)amino-2-hydroxypropane, 1-t-butyloxycarbonylamino-2-hydroxypropane, 2-amino-3-hydroxybutane, DL-serine, 1-amino-2-hydroxypropane, 1-amino-2-hydroxybutane and 1-amino-2-hydroxycyclohexane. Compounds among these having asymmetric carbons may be optically active forms or racemic forms, unless otherwise specified.
[0099]Addition of such activity inducers to the medium can induce cellular activity and thus more efficiently promote production of the optically active β-aminoalcohol than when no such activity inducers are added. The activity inducers may be used alone, or several such activity inducers may be used in admixture. The amount of such activity inducers is preferably 0.01-10 wt % with respect to the medium.
[0100]The reaction method for production of the β-aminoalcohol of the invention is not particularly restricted so long as it is a method in which the cells or the cell-produced enzyme is reacted with an enantiomeric mixture of an α-aminoketone compound represented by general formula (1) above or its salt, to produce the corresponding optically active β-aminoalcohol compound represented by general formula (2), and the reaction is initiated by mixing the cells rinsed with buffer solution or water with the α-aminoketone aqueous solution used as the starting material.
[0101]The reaction conditions may be selected within a range that does not impede production of the optically active β-aminoalcohol compound represented by general formula (2). The cell volume is preferably 1/100 to 1000-fold and more preferably 1/10 to 100-fold in terms of dry weight with respect to the racemic aminoketone. The concentration of the racemic aminoketone substrate is preferably 0.01-20% and more preferably 0.1-10%. The pH of the reaction solution is preferably 5-9 and more preferably 6-8, and the reaction temperature is preferably 10-50° C. and more preferably 20-40° C. The reaction time is preferably 5-150 hours, but this may be appropriately determined depending on the cell type.
[0102]In order to more efficiently promote the reaction, there may be added sugars such as glucose, organic acids such as acetic acid and energy sources such as glycerol. These may be used alone or as mixtures. The amount of addition is preferably 1/100 to 10-fold with respect to the substrate. Coenzymes and the like may also be added. As coenzymes there may be used nicotinamide adenine dinucleotide (NAD), reduced nicotinamide adenine dinucleotide (NADH), nicotinamide adenine dinucleotide phosphate (NADP), reduced nicotinamide adenine dinucleotide phosphate (NADPH) and the like, either alone or in mixtures, added in amounts of preferably 1/1000 to 1/5 with respect to the racemic aminoketone. In addition to such coenzymes, there may be added coenzyme-regenerating enzymes such as glucose dehydrogenase, in amounts of 1/1000 to 1/5 with respect to the racemic aminoketone. Also, substrates for coenzyme-regenerating enzymes, such as glucose, may be added, in amounts of 1/100 to 10-fold with respect to the racemic aminoketone. There may also be used combinations of sugars such as glucose, organic acids such as acetic acid, energy sources such as glycerol, coenzymes, coenzyme-regenerating enzymes and coenzyme-regenerating enzyme substrates. These usually accumulate in the cells but if necessary they may be added to increase the reaction speed or yield, and therefore may be added as appropriate.
[0103]If the reaction solution is reacted with addition of the specific salts described above under the aforementioned conditions, racemization of the unreacted α-aminoketone isomers will be aided, thus more efficiently promoting conversion to the enantiomer which will serve as the substrate of the cells or cell-produced enzyme. This will tend to yield the target aminoalcohol from the starting material at a high yield of 50% or greater.
[0104]As salts that promote racemization of unreacted α-aminoketones there may be used weak acid salts such as acetate, tartrate, benzoate, citrate, malonate, phosphate, carbonate, paranitrophenol salt, sulfite and borate, but there are preferably used phosphate (for example, sodium dihydrogen phosphate, potassium dihydrogen phosphate, ammonium dihydrogen phosphate), carbonate (for example, sodium carbonate, sodium hydrogen carbonate, potassium carbonate, ammonium carbonate) and citrate (for example, sodium citrate, potassium citrate, ammonium citrate). Mixtures thereof may also be used, with a buffer solution with a pH of 6.0-8.0 added to a final concentration of preferably 0.01-1 M. In the case of a phosphate, for example, sodium dihydrogen phosphate and sodium monohydrogen phosphate may be mixed in a proportion of between 9:1 and 5:95.
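The relationship between the phosphate mixing proportion and the resulting pH can be sketched with the Henderson-Hasselbalch equation. The following is only an illustrative estimate: it assumes the stated proportions are molar ratios and uses an assumed ideal pKa of 7.21 for the dihydrogen/monohydrogen phosphate pair, whereas the apparent pKa in a real buffer is lower because of ionic strength. The function name and constant are hypothetical and do not come from the specification.

```python
import math

# Assumption: second dissociation constant of phosphoric acid
# (H2PO4- <-> HPO4^2-) at 25 C and infinite dilution.
# The apparent value in a working buffer is lower.
PKA2_PHOSPHATE = 7.21


def phosphate_buffer_ph(parts_dihydrogen, parts_monohydrogen, pka=PKA2_PHOSPHATE):
    """Estimate pH from the molar ratio of dihydrogen (acid) to
    monohydrogen (base) phosphate using Henderson-Hasselbalch."""
    if parts_dihydrogen <= 0 or parts_monohydrogen <= 0:
        raise ValueError("both phosphate species must be present")
    return pka + math.log10(parts_monohydrogen / parts_dihydrogen)


# The two end points of the stated range, plus an equimolar mixture.
for acid, base in [(9, 1), (1, 1), (5, 95)]:
    print(f"{acid}:{base} -> estimated pH {phosphate_buffer_ph(acid, base):.2f}")
```

Passing a lower `pka` value shifts every estimate down by the same amount, which is one way to approximate the behavior of a concentrated buffer.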
[0105]The optically active β-aminoalcohol produced by the reaction may be purified by ordinary separation and purification means. For example, the optically active β-aminoalcohol may be obtained directly from the reaction solution or after separation of the cells, by being subjected to a common purification process such as membrane separation, extraction with an organic solvent (for example, toluene, chloroform, etc.), column chromatography, vacuum concentration, distillation, crystallization, recrystallization or the like. The optical purity of the produced optically active β-aminoalcohol can be measured by high performance liquid chromatography (HPLC).
EXAMPLES
[0106]The present invention will now be explained in greater detail through examples, with the understanding that these examples in no way limit the technical scope of the invention.
Example 1
Isolation and Purification of Plasmids
[0107](1) Method
[0108]Rhodococcus strains were inoculated to 5 mL of GPY medium (1% glucose, 0.5% bactopeptone, 0.3% yeast extract) and cultured with shaking at 25° C. After adding 250 μL of a 100 mg/mL ampicillin solution in the logarithmic growth phase, culturing was continued at 25° C. for 2 hours with shaking. The cells were harvested by centrifugation (12 krpm, 5 min), and after removing the supernatant, they were suspended in 1 mL of 50 mM Tris (pH 7.5); the cells were again harvested by centrifugation (12 krpm, 5 min) and the supernatant was removed. They were then suspended in 250 μL of a 10 mg/mL lysozyme solution dissolved in TE solution (10 mM Tris (pH 7.5), 1 mM EDTA), and the suspension was allowed to stand at 37° C. for 30 minutes. Next, 100 μL of 3 M sodium chloride and 25 μL of 10% SDS were added and the mixture was allowed to stand at -20° C. overnight. To the supernatant from centrifugation (12 krpm, 5 min) there were added 0.5 μL each of 50 μg/mL Proteinase K and 50 μg/mL RNase A, and the mixture was allowed to stand at 37° C. for 15 minutes. An equal volume of phenol/chloroform/isoamyl alcohol solution was added and centrifugation was performed (12 krpm, 5 min). A 2.5-fold amount of ethanol was added to the supernatant, the mixture was centrifuged (12 krpm, 5 min), and the precipitate was dissolved in 50 μL of sterilized water. Confirmation of plasmids was accomplished by electrophoresis with 0.8% agarose gel and staining with ethidium bromide, followed by UV irradiation.
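As a small worked illustration of the ampicillin addition step, the final concentration in the culture can be estimated by simple dilution arithmetic. The sketch below assumes the culture volume is still 5 mL at the time of addition (evaporation and sampling losses are ignored); the function name is hypothetical.

```python
def final_concentration(stock_conc, stock_volume, culture_volume):
    """Concentration after adding a stock solution to a culture.

    Volumes must share one unit (here mL); the result has the unit
    of stock_conc (here mg/mL).
    """
    total_volume = stock_volume + culture_volume
    return stock_conc * stock_volume / total_volume


# 250 uL (0.25 mL) of 100 mg/mL ampicillin added to an assumed 5 mL culture.
amp = final_concentration(stock_conc=100.0, stock_volume=0.25, culture_volume=5.0)
print(f"final ampicillin concentration: about {amp:.2f} mg/mL")
```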
[0109](2) Test Bacteria Strains and Results
[0110]Throughout the examples, available strains belonging to the genus Rhodococcus and its related genus Mycobacterium were screened for the presence or absence of plasmids following the method described in (1) above.
[0111]Table 3 shows the screened strains confirmed to contain plasmids. Specifically, Rhodococcus erythropolis (IAM1400, IAM1503, JCM2893, JCM2894 and JCM2895) and Rhodococcus rhodnii (JCM3203) were confirmed to contain plasmids of approximately 5.4 kbp and 5.8 kbp, respectively. These plasmids were designated according to the names listed in Table 3: pRET1100, pRET1200, pRET1300, pRET1400, pRET1500, pRET1600, pRET1700, pRET1800, pRET0500, pRET1000 (see Table 3).
[0112]R. erythropolis IAM1400 and IAM1503 are described in "IAM Catalogue of Strains, Third Edition, 2004" published by the Institute of Molecular and Cellular Biosciences, The University of Tokyo, and are available from the institute. Also, R. erythropolis JCM2893, JCM2894 and JCM2895 and R. rhodnii JCM3203 are described in "JCM Catalogue of Strains, Eighth Edition 2002" published by RIKEN, Japan, and are available from the institute.
TABLE 3
Strain                     No.        Size (kbp)   Name
Rhodococcus erythropolis   IAM 1400   5.4          pRET1100
                                      5.4          pRET1200
Rhodococcus erythropolis   IAM 1503   5.4          pRET1300
                                      5.4          pRET1400
Rhodococcus erythropolis   JCM 2893   5.4          pRET1500
                                      5.4          pRET1600
Rhodococcus erythropolis   JCM 2894   5.4          pRET1700
                                      5.4          pRET1800
Rhodococcus erythropolis   JCM 2895   5.4          pRET0500
Rhodococcus rhodnii        JCM 3203   5.8          pRET1000
Example 2
Identification of Restriction Endonuclease Sites
[0113]Various restriction endonucleases were used to determine restriction endonuclease sites, for classification of the plasmids shown in Table 3. Each plasmid was isolated by the method described in Example 1, and then digested with EcoR I, Hind III, Pvu II, Sca I, Sph I, Sma I, Sac I, BamH I and Kpn I, and electrophoresed on 0.8% agarose gel for confirmation of the DNA fragments. The size marker used was Loading Quick DNA size Marker λ/EcoR I+Hind III double digest (Toyobo). The numbers of sites cleaved by the restriction endonucleases and the sizes of the fragments were determined based on the size marker. The results are shown in Table 4.
TABLE 4
             R. erythropolis IAM 1400       R. rhodnii JCM 3203
Enzyme       pRET1100       pRET1200        pRET1000
BamH I       2(0.4, 5.0)    1(5.4)          2(2.0, 3.8)
EcoR I       2(0.3, 5.1)    1(5.4)          0
Hind III     0              0               0
Kpn I        1(5.4)         0               0
Pvu II       1(5.4)         2(0.9, 4.5)     4(0.1, 1.4, 2.0, 2.3)
Sac I        1(5.4)         1(5.4)          3(0.9, 1.0, 3.9)
Sca I        0              0               0
Sph I        0              0               0
Sma I        1(5.4)         2(0.4, 0.5)     4(0.1, 1.2, 1.6, 2.9)
Values in parentheses indicate sizes (kbp)
Other R. erythropolis plasmids:
IAM 1503   pRET1300 same as pRET1100; pRET1400 same as pRET1200
JCM 2893   pRET1500 same as pRET1100; pRET1600 same as pRET1200
JCM 2894   pRET1700 same as pRET1100; pRET1800 same as pRET1200
JCM 2895   pRET0500 same as pRET1200
[0114]Based on the analysis results shown above, the plasmids in Table 3 were classified into three types: plasmids possessing the same restriction endonuclease sites as pRET1100, plasmids possessing the same restriction endonuclease sites as pRET1200, and pRET1000.
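A simple consistency check on digest data of this kind is that the fragments from a circular plasmid should sum to roughly the plasmid size. The sketch below encodes a selection of the Table 4 entries as printed and flags any digest whose fragment total deviates from the plasmid size in Table 3. The tolerance of 0.2 kbp is an assumption chosen to allow for gel-based size estimation, and the function and variable names are hypothetical.

```python
# Plasmid sizes (kbp) as listed in Table 3.
PLASMID_SIZE_KBP = {"pRET1100": 5.4, "pRET1200": 5.4, "pRET1000": 5.8}

# Multi-fragment digests as printed in Table 4 (sizes in kbp).
DIGESTS = {
    ("pRET1100", "BamH I"): [0.4, 5.0],
    ("pRET1100", "EcoR I"): [0.3, 5.1],
    ("pRET1200", "Pvu II"): [0.9, 4.5],
    ("pRET1200", "Sma I"): [0.4, 0.5],
    ("pRET1000", "BamH I"): [2.0, 3.8],
    ("pRET1000", "Pvu II"): [0.1, 1.4, 2.0, 2.3],
    ("pRET1000", "Sac I"): [0.9, 1.0, 3.9],
    ("pRET1000", "Sma I"): [0.1, 1.2, 1.6, 2.9],
}


def inconsistent_digests(digests, sizes, tolerance_kbp=0.2):
    """Return (plasmid, enzyme, fragment_total) for every digest whose
    fragments do not add up to the plasmid size within the tolerance."""
    flagged = []
    for (plasmid, enzyme), fragments in digests.items():
        total = sum(fragments)
        if abs(total - sizes[plasmid]) > tolerance_kbp:
            flagged.append((plasmid, enzyme, round(total, 1)))
    return flagged


for plasmid, enzyme, total in inconsistent_digests(DIGESTS, PLASMID_SIZE_KBP):
    print(f"{plasmid} / {enzyme}: fragments total {total} kbp")
```

With the values as printed, only the Sma I entry for pRET1200 is flagged, since its two listed fragments total 0.9 kbp; that entry may be worth checking against the original gel data.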
Example 3
Plasmid Sequencing and Homology Search
[0115]As the plasmids were classified into three types, i.e. pRET1000, pRET1100 and pRET1200, based on the results of Example 2, sequencing of each of the plasmids was attempted.
[0116]First, the DNA fragments of the plasmids were cloned for determination of the nucleotide sequences. For Rhodococcus erythropolis (IAM1400), the plasmids (pRET1100, pRET1200) were isolated and digested with Sma I and Sac I. Upon electrophoresis on 0.8% agarose gel, DNA fragments with sizes of approximately 0.5 kbp, approximately 1.7 kbp, approximately 3.7 kbp and approximately 4.9 kbp were confirmed. The respective DNA fragments were recovered from the agarose gel using a GFX® PCR DNA and Gel Band Purification Kit (Amersham Bioscience) and used as insert DNA. Separately, pBluescript II KS(-) was used after digesting with Sma I alone or with Sma I and Sac I, as vector DNA. The insert DNA and vector DNA were ligated with Ligation High (Toyobo) and used to transform E. coli JM109. The obtained transformants were screened using a GFX Micro Plasmid Prep Kit (Amersham Bioscience) to obtain different clones.
[0117]For Rhodococcus rhodnii (JCM3203), the plasmid (pRET1000) was isolated and then digested with BamH I. Upon electrophoresis on 0.8% agarose gel, DNA fragments with sizes of approximately 2.0 kbp and approximately 3.8 kbp were confirmed. The respective DNA fragments were recovered from the gel using the aforementioned Kit and used as insert DNA. The vector DNA used was pBluescript II KS(-) digested with BamH I.
[0118]Determination of the nucleotide sequences of the plasmid inserts was accomplished by the primer walking method. The apparatus used was an ABI PRISM® 310NT Genetic Analyzer, and the enzyme used was a BigDye Terminator v3.1 Cycle Sequencing Kit (ABI).
[0119]First, P7 (M13 forward, Toyobo) and P8 (M13 reverse, Toyobo) primers were used for partial decoding of the insert nucleotide sequences. Next, primers were designed within the decoded sequence (using the sequence analyzing software DNASIS Pro; Hitachi Software Corp.), and the designed primers (synthetic oligo DNA) were used for further decoding of the nucleotide sequence. This procedure was repeated until decoding of the entirety of each insert nucleotide sequence. Upon completion of the insert nucleotide sequence decoding, primers were designed for reaction from the ends of each insert to the vector direction in order to analyze how the inserts were linked, and PCR was conducted (using KOD-plus-), using the plasmid isolated from Rhodococcus erythropolis (IAM1400) as template. The PCR product was purified using a GFX® PCR DNA and Gel Band Purification Kit, and sequencing was carried out using the same primers used for PCR, to analyze the arrangement of the inserts.
[0120]The results of sequencing showed that pRET1100 consisted of 5444 bp, with a G+C content of 59%. The full determined nucleotide sequence of pRET1100 is set forth as SEQ ID NO: 73 of the Sequence Listing. Plasmid pRET1200 consisted of 5421 bp and had a G+C content of 62%. Plasmid pRET1000 consisted of 5813 bp and had a G+C content of 67%. The full determined nucleotide sequence of pRET1000 is set forth as SEQ ID NO: 74 of the Sequence Listing.
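For reference, G+C content is the fraction of bases in a sequence that are G or C. A minimal sketch of the calculation is given below; the short sequence used is a made-up placeholder, not part of any plasmid sequence in the Sequence Listing, and the function name is hypothetical.

```python
def gc_content(sequence):
    """Fraction of bases that are G or C (case-insensitive)."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Placeholder sequence for illustration only.
example = "GGCCAT"
print(f"G+C content: {gc_content(example):.0%}")
```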
[0121]A homology search for the determined nucleotide sequences using DNASIS Pro revealed that pRET1000 and pRET1100 were novel plasmids. On the other hand, pRET1200 had approximately 99.6% homology with pN30 (GenBank accession no. AF312210) (calculated based on pRET1200).
[0122]For pRET1000 and pRET1100, comparison was made with publicly known plasmids based on the determined nucleotide sequences, using DNASIS Pro. As a result, neither of the plasmids was found to have restriction endonuclease sites completely matching those of other plasmids.
Example 4
Nucleotide Sequence Analysis
[0123]The results of analysis of the nucleotide sequences of pRET1100 and pRET1000 are shown below.
[0124]The following orfs were found in pRET1100:
[0125]orf1 (SEQ ID NO: 1, SEQ ID NO: 2 or SEQ ID NO: 3) consisting of the nucleotide sequence from bases 202, 238 or 337 to 480 of the nucleotide sequence set forth as SEQ ID NO: 73;
[0126]orf2 (SEQ ID NO: 4) consisting of the nucleotide sequence from bases 477 to 758 of the nucleotide sequence set forth as SEQ ID NO: 73;
[0127]orf3 (SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15 or SEQ ID NO: 16) consisting of the nucleotide sequence from bases 862, 1294, 1450, 1462, 1486, 1489, 1513, 1630, 1645, 1687, 2224 or 2227 to 2409 of the nucleotide sequence set forth as SEQ ID NO: 73;
[0128]orf4 (SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20 or SEQ ID NO: 21) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 1875, 1734, 1701, 1674 or 1581 to 1444 of the nucleotide sequence set forth as SEQ ID NO: 73;
[0129]orf5 (SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25 or SEQ ID NO: 26) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 2828, 2792, 2747, 2594 or 2540 to 2406 of the nucleotide sequence set forth as SEQ ID NO: 73;
[0130]orf6 (SEQ ID NO: 27 or SEQ ID NO: 28) consisting of the nucleotide sequence from bases 2971 or 3049 to 3306 of the nucleotide sequence set forth as SEQ ID NO: 73;
[0131]orf7 (SEQ ID NO: 29 or SEQ ID NO: 30) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 3577 or 3571 to 3053 of the nucleotide sequence set forth as SEQ ID NO: 73;
[0132]orf8 (SEQ ID NO: 31 or SEQ ID NO: 32) consisting of the nucleotide sequence from bases 3339 or 3648 to 3902 of the nucleotide sequence set forth as SEQ ID NO: 73; and
[0133]orf9 (SEQ ID NO: 33 or SEQ ID NO: 34) consisting of the nucleotide sequence from bases 4366 or 4477 to 5034 of the nucleotide sequence set forth as SEQ ID NO: 73.
[0134]The following orfs were found in pRET1000:
[0135]orf10 (SEQ ID NO: 38, SEQ ID NO: 39, SEQ ID NO: 40 or SEQ ID NO: 41) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 3350, 3251, 2945 or 2849 to 2412 of the nucleotide sequence set forth as SEQ ID NO: 74;
[0136]orf11 (SEQ ID NO: 42 or SEQ ID NO: 43) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 2365 or 2332 to 2159 of the nucleotide sequence set forth as SEQ ID NO: 74;
[0137]orf12 (SEQ ID NO: 44) consisting of the nucleotide sequence from bases 3197 to 3526 of the nucleotide sequence set forth as SEQ ID NO: 74;
[0138]orf13 (SEQ ID NO: 45 or SEQ ID NO: 46) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 4035 or 3996 to 3679 of the nucleotide sequence set forth as SEQ ID NO: 74;
[0139]orf14 (SEQ ID NO: 48, SEQ ID NO: 49 or SEQ ID NO: 50) consisting of the nucleotide sequence from bases 4621, 4654 or 4666 to 4830 of the nucleotide sequence set forth as SEQ ID NO: 74;
[0140]orf15 (SEQ ID NO: 51 or SEQ ID NO: 52) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 5161 or 5062 to 4709 of the nucleotide sequence set forth as SEQ ID NO: 74;
[0141]orf16 (SEQ ID NO: 53 or SEQ ID NO: 54) consisting of the nucleotide sequence from bases 2331 or 2334 to 2618 of the nucleotide sequence set forth as SEQ ID NO: 74;
[0142]orf17 (SEQ ID NO: 55) consisting of the nucleotide sequence from bases 2907 to 3242 of the nucleotide sequence set forth as SEQ ID NO: 74;
[0143]orf18 (SEQ ID NO: 56, SEQ ID NO: 57, SEQ ID NO: 58, SEQ ID NO: 59 or SEQ ID NO: 60) consisting of the nucleotide sequence from bases 1650, 1689, 1713, 1827 or 1875 to 2162 of the nucleotide sequence set forth as SEQ ID NO: 74;
[0144]orf19 (SEQ ID NO: 61) consisting of the nucleotide sequence from bases 1906 to 2169 of the nucleotide sequence set forth as SEQ ID NO: 74;
[0145]orf20 (SEQ ID NO: 62) consisting of the nucleotide sequence complementary to the nucleotide sequence from bases 810 to 553 of the nucleotide sequence set forth as SEQ ID NO: 74; and
[0146]orf21 (SEQ ID NO: 63, SEQ ID NO: 64, SEQ ID NO: 65, SEQ ID NO: 66, SEQ ID NO: 67, SEQ ID NO: 68 or SEQ ID NO: 69) consisting of the nucleotide sequence from bases 117, 147, 306, 456, 5144, 5276 or 5534 to 656 of the nucleotide sequence set forth as SEQ ID NO: 74.
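Because the plasmids are circular, some of the coordinate ranges listed above run past the last base and continue from base 1 (for example, bases 5144 to 656 on the 5813 bp pRET1000), and ORFs on the complementary strand are listed from the higher to the lower coordinate. The sketch below shows one way such 1-based, inclusive coordinates could be handled; the function names are hypothetical and the short test sequence is a placeholder rather than real plasmid sequence.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def region_length(first, last, plasmid_len):
    """Length of the forward span first..last (1-based, inclusive) on a
    circular molecule; the span wraps through the origin if last < first."""
    if last >= first:
        return last - first + 1
    return plasmid_len - first + 1 + last


def extract_region(plasmid_seq, first, last, complementary=False):
    """Return bases first..last of a circular sequence.

    For a complementary-strand region written as 'from bases 810 to 553',
    the forward span 553..810 is taken and then reverse-complemented.
    """
    if complementary:
        first, last = last, first
    if last >= first:
        region = plasmid_seq[first - 1:last]
    else:
        region = plasmid_seq[first - 1:] + plasmid_seq[:last]
    return reverse_complement(region) if complementary else region


# orf21 variant starting at base 5144 on pRET1000 (5813 bp).
length = region_length(5144, 656, 5813)
print(length, "bases,", length // 3, "codons, remainder", length % 3)
```

For the range 5144 to 656 this gives 1326 bases, a multiple of three, as expected for an open reading frame that includes its stop codon.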
[0147]The DNA replication region of pRET1100 is the region represented by the nucleotide sequence set forth as SEQ ID NO: 35 (from bases 2410 to 3200), the nucleotide sequence set forth as SEQ ID NO: 36 (from bases 1000 to 1500) or the nucleotide sequence set forth as SEQ ID NO: 37 (from bases 5000 to 500). The DNA replication region of pRET1000 is the region represented by the nucleotide sequence set forth as SEQ ID NO: 70 (from bases 3355 to 3507), the nucleotide sequence set forth as SEQ ID NO: 71 (from bases 4290 to 4350) or the nucleotide sequence set forth as SEQ ID NO: 72 (from bases 3570 to 3894).
[0148]The region of the nucleotide sequence from bases 5144 to 656 (SEQ ID NO: 67) and the region of the nucleotide sequence from bases 4381 to 4830 (SEQ ID NO: 47) of the nucleotide sequence of pRET1000 (SEQ ID NO: 74) are homologous with mobilization proteins, suggesting that they are involved in mobilization.
[0149]A DNA secondary structure is predicted for the region of the nucleotide sequence from bases 4260 to 4339 (SEQ ID NO: 75) of the nucleotide sequence of pRET1000 (SEQ ID NO: 74), and it is presumably involved in expression of the mobilization protein gene or is the recognition site of the expressed protein.
[0150]On the other hand, it was suggested that the region of the nucleotide sequence from bases 761 to 868 (SEQ ID NO: 76) of the nucleotide sequence of pRET1100 (SEQ ID NO: 73) is a promoter involved in expression of a protein related to replication.
Example 5
Construction of Shuttle Vectors
[0151]For construction of a shuttle vector between Rhodococcus strains and E. coli, the Rhodococcus plasmids pRET1000, pRET1100 and pRET1200 and the E. coli plasmids pUC18, pHSG299 and pHSG398 were used for the following experiment.
[0152]First, DNA fragments were prepared from R. erythropolis plasmids. Specifically, plasmids pRET1100 and pRET1200 were obtained from R. erythropolis (IAM1400), and then Alw44 I was used for digestion of pRET1100 at 37° C. for 2 hours and Blunting High (Toyobo) was used for blunting of the ends, while BspLU11 I was used for digestion of pRET1200 at 48° C. for 2 hours and Blunting High (Toyobo) was used for blunting of the ends, to obtain DNA fragments of R. erythropolis plasmid. Each of the DNA fragments was dissolved in TE solution.
[0153]For pRET1000, plasmid pRET1000 was obtained from R. rhodnii (JCM3203), and then Drd I was used for digestion of pRET1000 at 37° C. for 2 hours and Blunting High was used for blunting of the ends, to obtain pRET1000 DNA fragments, which were dissolved in TE solution.
[0154]Next, DNA fragments were prepared from the E. coli plasmids. Specifically, pUC18 (containing the ampicillin-resistance gene (Ampr)) was digested with Sma I at 30° C. for 2 hours, and pHSG299 (containing the kanamycin-resistance gene (Kmr)) and pHSG398 (containing the chloramphenicol-resistance gene (Cmr)) were digested with Hinc II at 37° C. for 2 hours to obtain DNA fragments of E. coli plasmid, which were dissolved in TE.
[0155]After ligating the DNA fragments from the Rhodococcus and E. coli plasmids prepared in the manner described above, they were used for transformation of E. coli DH5α, which was plated on LB (1% tryptone, 0.5% yeast extract, 1% sodium chloride; pH 7.2) agar medium containing 100 μg/mL kanamycin, 100 μg/mL ampicillin or 30 μg/mL chloramphenicol, coated with 30 μL of 0.1 M IPTG (isopropyl-β-D-thiogalactopyranoside) and 4% X-gal (5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside) and allowed to stand at 30° C. for 60 hours. White colonies were selected from among the appearing colonies, and were cultured with shaking in LB liquid medium containing 100 μg/mL kanamycin, 100 μg/mL ampicillin or 30 μg/mL chloramphenicol, at 30° C. for 60 hours. The DNA was purified from the obtained culture solution using a GFX® Micro Plasmid Prep Kit (Amersham Bioscience, with purification under the manufacturer's specified conditions). The obtained DNA was confirmed by electrophoresis on 0.8% agarose gel. The obtained shuttle vectors are shown in Table 5, and the methods for constructing each of the shuttle vectors using pRET1100 are shown in FIGS. 3 to 5.
TABLE 5
Constructed shuttle vectors   Origin
                              Rhodococcus   E. coli
pRET1001, pRET1001Rv          pRET1000      pUC18
pRET1002, pRET1002Rv          pRET1000      pHSG299
pRET1003, pRET1003Rv          pRET1000      pHSG398
pRET1101, pRET1101Rv          pRET1100      pUC18
pRET1102, pRET1102Rv          pRET1100      pHSG299
pRET1103, pRET1103Rv          pRET1100      pHSG398
pRET1201, pRET1201Rv          pRET1200      pUC18
pRET1202, pRET1202Rv          pRET1200      pHSG299
pRET1203, pRET1203Rv          pRET1200      pHSG398
[0156]The shuttle vectors constructed with pRET1100 and pUC18, pHSG299 or pHSG398 were designated as pRET1101 (SEQ ID NO: 89), pRET1102 (SEQ ID NO: 90) and pRET1103 (SEQ ID NO: 91), respectively. Of these shuttle vectors, pRET1101 exhibits ampicillin resistance, pRET1102 exhibits kanamycin resistance and pRET1103 exhibits chloramphenicol resistance. Also, the shuttle vectors pRET1101 to 1103 wherein the E. coli gene and pRET1100 were linked in reverse (Rv) were designated respectively as pRET1101Rv (SEQ ID NO: 92), pRET1102Rv (SEQ ID NO: 93) and pRET1103Rv (SEQ ID NO: 94).
[0157]Similarly, the shuttle vectors constructed using pRET1000 and pRET1200 were designated as pRET1001-pRET1003 (SEQ ID NO: 95-SEQ ID NO: 97) and pRET1001Rv-pRET1003Rv (SEQ ID NO: 98-SEQ ID NO: 100), and as pRET1201-pRET1203 and pRET1201Rv-pRET1203Rv (Table 5).
Example 6
Examining Method of Transformation to R. erythropolis
[0158]The Rhodococcus-E. coli shuttle vectors obtained in Example 5 were used for transformation of R. erythropolis MAK-34 strain (MAK-34; deposited at the National Institute of Bioscience and Human-Technology, National Institute of Advanced Industrial Science and Technology, Ministry of Economy, Trade and Industry, (currently: International Patent Organism Depositary, National Institute of Advanced Industrial Science and Technology) on Feb. 15, 2001 as FERM BP-7451). Electroporation was investigated as the method of gene transfer.
[0159]First, R. erythropolis MAK-34 strain was inoculated to 5 mL of GPY medium and cultured with shaking at 30° C. for 36 hours. After seeding 1 mL of culture solution in 100 mL of LB medium, culturing was continued at 200 rpm at 30° C. for 10 hours. The cultured cells were harvested by centrifugation (12 krpm, 5 min, 4° C.) and the harvested cells were rinsed twice with ultrapurified water. The rinsed cells were harvested by centrifugation (12 krpm, 5 min, 4° C.) and suspended in 2.4 mL of a 10% glycerol solution. The suspension was dispensed into 300 μL portions and frozen at -80° C. as competent cells.
[0160]A 90 μL portion of the prepared competent cells and a 5 μL portion of the shuttle vector (pRET1001, pRET1002, pRET1003, pRET1101, pRET1102, pRET1103, pRET1201, pRET1202 or pRET1203) were mixed on ice. The mixed solution was gently poured into a 0.1 cm cuvette which had been cooled on ice, and was set in a Gene Pulser II Electroporation System (BIO-RAD). After pulsing at 20 kV/cm, 400 Ω and 25 μF, 300 μL of LB medium was immediately added to the mixed solution, which was then allowed to stand at 25° C. for 3 hours.
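The pulse settings can be related to instrument values by simple arithmetic: the applied voltage is the field strength multiplied by the cuvette gap, and the nominal time constant of an exponential-decay pulse is the product of resistance and capacitance. The sketch below reads the capacitance setting as 25 μF, which is an assumption, and it ignores the resistance of the sample itself, so the time constant is only a nominal figure. The function names are hypothetical.

```python
def pulse_voltage_kv(field_kv_per_cm, gap_cm):
    """Voltage across the cuvette for a given field strength and gap."""
    return field_kv_per_cm * gap_cm


def nominal_time_constant_ms(resistance_ohm, capacitance_uf):
    """Nominal RC time constant; ohm * microfarad gives microseconds."""
    return resistance_ohm * capacitance_uf / 1000.0


# 20 kV/cm in a 0.1 cm cuvette, 400 ohm, capacitance assumed to be 25 uF.
voltage = pulse_voltage_kv(20.0, 0.1)
tau = nominal_time_constant_ms(400.0, 25.0)
print(f"voltage setting: about {voltage:.1f} kV, nominal time constant: about {tau:.0f} ms")
```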
[0161]A portion of the cell suspension was plated on an antibiotic-containing LB plate (100 μg/mL kanamycin, 100 μg/mL ampicillin or 30 μg/mL chloramphenicol). As a result, colonies were obtained when using pRET1002, pRET1102 and pRET1202 containing the kanamycin resistance gene. In order to confirm that the obtained colonies contained the plasmids, the plasmids were isolated and all were verified to contain the shuttle vector.
[0162]This suggested that R. erythropolis can be transformed by electroporation and that pRET1002, pRET1102 and pRET1202 function as shuttle vectors.
Example 7
Obtaining Aminoketone Asymmetric Reductase Gene (Mak Gene)
[0163]The mak gene was isolated from R. erythropolis MAK-34 strain for insertion of the mak gene into the shuttle vector shown in FIG. 5.
[0164]First, genomic DNA was obtained from R. erythropolis MAK-34 strain. After inoculating R. erythropolis MAK-34 strain to 5 mL of GPY medium, culturing with shaking was performed at 30° C. for 48 hours, and then the culture solution was seeded in 100 mL of GPY medium and subcultured at 200 rpm at 30° C. for 10 hours. The genomic DNA was obtained using a Genomic DNA Buffer set and Genomic-tip 500/G (QIAGEN).
[0165]The obtained genomic DNA was used as template for PCR using KOD-plus-. The primers used were MAKF1 (5'-GAATCTTCTCGTTGATGCAGATCAGGTC-3'; SEQ ID NO: 80) and MAKR2 (5'-CTGACTCCGTAGTGTTCTGCCAGTTC-3'; SEQ ID NO: 81), for PCR at an annealing temperature of 68° C. and extension reaction for 1 minute and 50 seconds. The obtained PCR product was subjected to phenol/chloroform treatment and ethanol precipitation, and then mixed with pUC18 that had been digested with Sma I for 2 hours at 30° C., and ligated therewith using Ligation High. Competent High (Toyobo) was used for transformation of E. coli DH5α, which was then plated on LB agar medium (containing 100 μg/mL ampicillin) that had been coated with 30 μL of 0.1 M IPTG and 4% X-gal, and was allowed to stand at 30° C. for 60 hours. White colonies were selected from among the appearing colonies, and were cultured with shaking in LB liquid medium containing 100 μg/mL ampicillin at 30° C. for 60 hours. The DNA was purified from the obtained culture solution using a GFX® Micro Plasmid Prep Kit. The obtained DNA was confirmed by electrophoresis on 0.8% agarose gel. The obtained clone was designated as pMAK-1.
Example 8
Construction of Expression Vector-1
[0166]A promoter and aminoketone asymmetric reductase gene (mak gene) were inserted into the shuttle vector shown in Table 5.
[0167]First, an expression vector (without exogenous promoter) containing approximately 400 bp upstream from the mak gene was constructed.
[0168]pMAK-1 was digested with Sma I at 30° C. for 2 hours, and then with Pst I at 37° C. for 2 hours. The solution was supplied for 0.8% agarose gel electrophoresis. The DNA size marker used was Loading Quick DNA size Marker λ/EcoR I+Hind III double digest. After electrophoresis, an approximately 1.4 kbp DNA fragment was purified using a GFX® PCR DNA and Gel Band Purification Kit, and used as the insert DNA. On the other hand, the vector used was pRET1102 digested with Hinc II and Pst I at 37° C. for 2 hours. The DNA fragments were ligated with Ligation High and Competent High was used for transformation of E. coli DH5α. The cells were plated on LB agar medium containing 100 μg/mL kanamycin and allowed to stand at 30° C. for 60 hours.
[0169]The appearing colonies were cultured with shaking on LB liquid medium containing 100 μg/mL kanamycin at 30° C. for 60 hours. The DNA was purified from the obtained cultured medium using a GFX® Micro Plasmid Prep Kit. The obtained DNA was confirmed by 0.8% agarose gel electrophoresis.
[0170]For screening, the obtained DNA without restriction endonuclease treatment and the DNA after digestion with Pst I at 37° C. for 2 hours were subjected to 0.8% agarose gel electrophoresis, and the target plasmid was obtained based on the size of the DNA. The size markers used were Loading Quick DNA size Marker λ/EcoR I+Hind III double digest, undigested pRET1102, and pRET1102 that had been digested with Pst I at 37° C. for 2 hours. The plasmid obtained in this manner was designated as pRET1104.
Example 9
Construction of Expression Vector-2
[0171]The shuttle vectors were reduced, since reduction of shuttle vectors is effective for expression vector enhancement, gene modification, transformation efficiency improvement and replication in cells.
[0172]First, shuttle vector pRET1102 was reduced. After digesting pRET1102 with BamH I and Hinc II for 2 hours, it was electrophoresed on 0.8% agarose gel and an approximately 2.7 kbp DNA fragment was recovered using a GFX® PCR DNA and Gel Band Purification Kit to prepare a pRET1102 DNA fragment. The size marker used was Loading Quick DNA size Marker λ/EcoR I+Hind III double digest.
[0173]Separately, a DNA fragment replicable in E. coli was prepared by digesting pHSG299 with BamH I and Hinc II for 2 hours, subjecting it to 0.8% agarose gel electrophoresis, and recovering an approximately 2.7 kbp DNA fragment using a GFX® PCR DNA and Gel Band Purification Kit.
[0174]The DNA fragments were ligated with Ligation High and Competent High was used for transformation of E. coli JM109 cells, which were then plated on LB agar medium, containing 100 μg/mL kanamycin, that had been coated with 30 μL of 0.1 M IPTG and 4% X-gal, and was allowed to stand at 30° C. for 48 hours.
[0175]White colonies were selected from among the appearing colonies, and were cultured with shaking in LB liquid medium containing 100 μg/mL kanamycin at 30° C. for 48 hours. The DNA was purified from the obtained culture solution using a GFX® Micro Plasmid Prep Kit. The reduced shuttle vector of pRET1102 obtained in this manner was designated as pRET1123 (approximately 5.3 kbp).
[0176]Next, shuttle vector pRET1202 was reduced. The Rhodococcus-derived DNA fragment was prepared by digesting pRET1202 with EcoR I for 2 hours and then with Dra III for 2 hours, using Blunting High for blunting of the ends, performing 0.8% agarose gel electrophoresis, and then recovering an approximately 3.7 kbp DNA fragment using a GFX® PCR DNA and Gel Band Purification Kit. The size marker used was Loading Quick DNA size Marker λ/EcoR I+Hind III double digest. The DNA fragment was inserted at the Hinc II site of pHSG299. After ligation, Competent High was used for transformation of E. coli DH5α, which was then plated on LB agar medium, containing 100 μg/mL kanamycin, that had been coated with 30 μL of 0.1 M IPTG and 4% X-gal, and was allowed to stand at 30° C. for 72 hours. White colonies were selected from among the appearing colonies, and were cultured with shaking in LB liquid medium containing 100 μg/mL kanamycin at 30° C. for 72 hours. The DNA was purified from the obtained culture solution using a GFX® Micro Plasmid Prep Kit. When the plasmid obtained by screening was digested with Sac I, BamH I, Pst I or EcoR I for 2 hours, all of the clones had approximately 500 bp clipped at the side of EcoR I site of the Rhodococcus-derived region. The plasmid was designated as pRET1204 (approximately 5.9 kbp). It was not possible to obtain a clone with no clipping of the genus Rhodococcus replication region.
[0177]The shuttle vector pRET1002 was reduced in a similar manner to obtain pRET1006 (approximately 4.9 kbp).
[0178]R. erythropolis was transformed with these three reduced plasmids, pRET1006, pRET1123 and pRET1204, and upon confirming the presence or absence of shuttle vector by the method described in Example 6, all the shuttle vectors were detected in the transformed cells. This suggested that the three reduced plasmids pRET1006, pRET1123 and pRET1204 are replicated in R. erythropolis.
Example 10
Construction of Expression Vector-3
[0179]An expression vector was constructed by inserting the mak gene into the shuttle vector constructed in Example 9.
[0180]The Pst I site of pRET1123 constructed in Example 9 was deleted so that the promoter could be cloned in a single step. After digesting pRET1123 with Pst I for 2 hours, Blunting High was used for blunting of the ends and Ligation High was used for ligation. The ligation solution was used to transform E. coli JM109 using Competent High, and culturing was performed on an LB plate containing 100 μg/mL kanamycin at 30° C. for 36 hours. The formed colonies were inoculated into LB liquid medium containing 100 μg/mL kanamycin and cultured at 30° C. for 24 hours, and then the DNA was purified using a GFX® Micro Plasmid Prep Kit to obtain pRET1132.
[0181]The obtained pRET1132 was digested with Pst I for 1 hour and then electrophoresed on 0.8% agarose, which confirmed that pRET1132 was not cleaved by Pst I. The controls used were pRET1123 and pRET1132 not digested with Pst I, and pRET1123 digested with Pst I.
Example 11
Construction of Expression Vector-4
[0182]A clone was constructed having a promoter and the mak gene inserted in the aforementioned shuttle vector.
[0183]A clone was constructed having a Pst I site upstream from the mak gene, for insertion of a promoter. The procedure was carried out in the following manner to obtain a clone having His-Tag added to the C-terminus of the aminoketone asymmetric reductase. PCR was conducted with KOD-plus- using the pMAK-1 obtained in Example 7 as template, MAKPstF (5'-GACCACTGCAGATCAATCAACTCTGATGAGGTCC-3'; SEQ ID NO: 82) and MAKHisBglIIR (5'-CGCTTAGATCTCAGTTCGCCGAGCGCCATCGCCG-3'; SEQ ID NO: 83) as primers, with an annealing temperature of 68° C. and extension reaction for 1 minute and 50 seconds. A PCR fragment (insert) produced by digesting the obtained PCR product with Bgl II at 37° C. for 2 hours was ligated with pQE70 (digested with Sph I at 37° C. for 2 hours, blunted with Blunting High and digested with Bgl II at 37° C. for 2 hours) using Ligation High, and then Competent High was used for transformation of E. coli DH5α cells, which were plated on LB agar medium containing 100 μg/mL ampicillin and allowed to stand at 30° C. for 60 hours. The appearing colonies were cultured with shaking on LB liquid medium containing 100 μg/mL ampicillin at 30° C. for 60 hours. The DNA was purified using a GFX® Micro Plasmid Prep Kit. The obtained DNA was confirmed by 0.8% agarose gel electrophoresis.
[0184]For screening, the DNA without restriction endonuclease treatment and the DNA after digestion with Pst I and Bgl II at 37° C. for 2 hours were subjected to 0.8% agarose gel electrophoresis, and the target plasmid was obtained based on the size of the DNA. The plasmid obtained in this manner was designated as pMAK-2. The size markers used were Loading Quick DNA size Marker λ/EcoR I+Hind III double digest, pQE70, and pQE70 that had been digested with Bgl II at 37° C. for 2 hours.
[0185]The pRET1200 repA promoter was then inserted at the EcoR I-Pst I site of pMAK-2, and the resulting clone was designated as pMAK-19. The promoter was obtained by PCR amplification using as template a clone of pRET1204 in which repA encoded by the Rhodococcus-derived DNA fragment was in the same orientation as the kanamycin resistance gene encoded by pHSG299, and using as primers P1200rep-Pst5195 (5'-AGCCGCTGCAGAAGCAACACCGCATCCGCCCATTG-3'; SEQ ID NO: 84) and P7 (5'-CGCCAGGGTTTTCCCAGTCACGAC-3'; SEQ ID NO: 85), with an annealing temperature of 60° C. and extension reaction for 1 minute, followed by digestion with EcoR I and Pst I at 37° C. for 2 hours.
[0186]Next, PCR was conducted with KOD-plus- using as template pMAK-19 and as primers pQE70F1 (5'-GGCGTATCACGAGGCCCTTTCGTCTTCACC-3'; SEQ ID NO: 86) and pQE70R1135Bm (5'-GGTTGGATCCGTCATCACCGAAACGCGCGAGGCAG-3'; SEQ ID NO: 87), with an annealing temperature of 60° C. and extension reaction for 3 minutes. The PCR product was purified from the reaction solution by using a GFX® PCR DNA and Gel Band Purification Kit and after digestion of the purified PCR product with EcoR I and BamH I for 2 hours, it was electrophoresed on 0.8% agarose gel and the DNA fragment was purified by using a GFX® PCR DNA and Gel Band Purification Kit. The DNA fragment was used as an insert DNA.
[0187]Separately, a vector to be used as the expression shuttle vector was obtained by digesting pRET1132 with EcoR I and BamH I for 2 hours, subjecting the DNA fragment to 0.8% agarose gel electrophoresis and purifying the DNA fragment by using a GFX® PCR DNA and Gel Band Purification Kit. After mixing the insert DNA and vector, Ligation High was used for ligating them and Competent High was used for transformation of E. coli JM109 cells, which were plated on an LB plate containing 100 μg/mL kanamycin. The obtained colonies were cultured on LB liquid medium containing 100 μg/mL kanamycin, and then the plasmid DNA was recovered by using a GFX® Micro Plasmid Prep Kit and subjected to 0.8% agarose gel electrophoresis for screening. The size markers used were Loading Quick DNA size Marker λ/EcoR I+Hind III double digest and pRET1132. The obtained expression vector was designated as pRET1133.
[0188]Also, pMAK-19 was digested with EcoR I and Hind III at 37° C. for 2 hours, blunted with Blunting High and subjected to 0.8% agarose gel electrophoresis, and the approximately 1.6 kbp DNA fragment was purified by using a GFX® PCR DNA and Gel Band Purification Kit. The clone having this fragment inserted at the Hinc II site of pRET1102 was designated as pRET1114.
[0189]The pRET1133 promoter was also modified. The mak gene-expressing promoter encoded in pRET1133 is the repA gene promoter of pRET1200 and has a length of approximately 800 bp, and a plasmid was constructed by having approximately 200 bp clipped off from this promoter. The promoter used for the cloning was prepared by PCR. Plasmid pRET1200 was used as template, P1204rep-Ec2958 (5'-CGCGGAATTCGACCACCACGCACGCACACCGCAC-3'; SEQ ID NO: 88) and P1200rep-Pst5195 (5'-AGCCGCTGCAGAAGCAACACCGCATCCGCCCATTG-3'; SEQ ID NO: 84) were used as primers, and KOD-plus- was used as the PCR enzyme for PCR at an annealing temperature of 60° C. and extension reaction for 50 seconds. The PCR product was purified by using a GFX® PCR DNA and Gel Band Purification Kit, digested with the restriction endonucleases EcoR I and Pst I for 2 hours, and subjected to 1.6% agarose gel electrophoresis, and the DNA fragment was purified by using a GFX® PCR DNA and Gel Band Purification Kit. The DNA fragment was used as the insert DNA. The nucleotide sequence of the promoter region in the DNA fragment is set forth as SEQ ID NO: 77.
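Several primers in this Example carry the recognition site of the restriction endonuclease used in the following cloning step. As an illustrative check only, the sketch below searches the primer sequences given in the text for the standard recognition sequences of Pst I, Bgl II, EcoR I and BamH I. The dictionary and function names are hypothetical and are not part of the described procedure.

```python
# Standard recognition sequences of the enzymes named in this Example.
SITES = {
    "Pst I": "CTGCAG",
    "Bgl II": "AGATCT",
    "EcoR I": "GAATTC",
    "BamH I": "GGATCC",
}

# Primer sequences copied from the text (5' to 3').
PRIMERS = {
    "MAKPstF": "GACCACTGCAGATCAATCAACTCTGATGAGGTCC",
    "MAKHisBglIIR": "CGCTTAGATCTCAGTTCGCCGAGCGCCATCGCCG",
    "P1200rep-Pst5195": "AGCCGCTGCAGAAGCAACACCGCATCCGCCCATTG",
    "pQE70R1135Bm": "GGTTGGATCCGTCATCACCGAAACGCGCGAGGCAG",
    "P1204rep-Ec2958": "CGCGGAATTCGACCACCACGCACGCACACCGCAC",
}


def find_sites(primer):
    """Return {enzyme: 0-based position} for each recognition site found in the primer."""
    primer = primer.upper()
    return {name: primer.find(site) for name, site in SITES.items() if site in primer}


for name, seq in PRIMERS.items():
    print(name, find_sites(seq))
```

Each primer contains exactly one of the four sites, located a few bases from its 5' end, which is consistent with the digestions described for the corresponding PCR products.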
[0190]Separately, for the vector, pRET1133 was digested with restriction endonucleases EcoR I and Pst I for 2 hours and subjected to 0.8% agarose gel electrophoresis, and an approximately 7.2 kbp DNA fragment was purified by using a GFX® PCR DNA and Gel Band Purification Kit. The size marker used was Loading Quick DNA size Marker λ/EcoR I+Hind III double digest.
[0191]The insert DNA and vector obtained in this manner were ligated by using Ligation High, and Competent High was used for transformation of E. coli JM109 cells, which were plated on an LB plate containing 100 μg/mL kanamycin. The obtained colonies were cultured on LB liquid medium containing 100 μg/mL kanamycin, and then the plasmid DNA was recovered by using a GFX® Micro Plasmid Prep Kit and subjected to 0.8% agarose gel electrophoresis for screening. The size markers used were Loading Quick DNA size Marker λ/EcoR I+Hind III double digest and pRET1133.
[0192]Also, after digesting the obtained DNA with restriction endonucleases EcoR I and Pst I for 2 hours, it was subjected to 1.6% agarose gel electrophoresis and a DNA fragment corresponding to the approximately 600 bp insert DNA was confirmed. The size marker used was a 100 bp DNA Ladder. The expression vector obtained in this manner was designated as pRET1138.
Example 12
Preparation of Recombinant R. erythropolis and Measurement of Enzyme Activity
[0193]The aforementioned expression vectors pRET1102, pRET1104, pRET1114 and pRET1138 were used for transformation of R. erythropolis MAK-34 strain and R. erythropolis JCM2895 (provided by RIKEN Japan), and the enzyme activity was measured. The aminoketone asymmetric reductase purified from MAK-34 strain has the abilities to react with 1-2-methylamino-1-phenyl-1-propanone as described in International Patent Publication WO02/070714, and to produce d-(1S,2S)-pseudoephedrine. It was also reacted with 1-2-dimethylaminopropiophenone, 1-amino-2-butanone, etc. and production of each corresponding β-aminoalcohol was confirmed.
[0194]The activity assay was conducted by preparing a reaction solution with a cell density of O.D.=5, 2% glucose and 0.2 M sodium phosphate buffer (pH 6.0), containing 3% 2-(N-ethylamino)-1-phenyl-1-propanone (EAM) as substrate. A synthesis method for EAM is described in J. Am. Chem. Soc., Vol. 50, pp. 2287-2292, 1928. The reaction solution was incubated with shaking at 30° C. for 16 hours. The reaction product, the β-amino alcohol (1S,2S)-2-(N-ethylamino)-1-phenyl-1-propanol (EPE), was confirmed by HPLC. The column used was an Inertsil Ph-3 3.0×75 mm, the eluent was aqueous 7% acetonitrile and 0.05 M sodium phosphate buffer (pH 6.0), and detection was carried out with UV (220 nm).
[0195]The results of the activity assay carried out in this manner are shown in Table 6. The pRET1104-introduced recombinant cells lacking the exogenous promoter region exhibited about the same activity as the pRET1102-introduced recombinant cells lacking the mak gene used as the control, and no recombinant enzyme expression was found.
[0196]With transformation of pRET1114 into MAK-34 strain, high specific activity was found compared to pRET1104. This indicated that the pRET1200 repA promoter region inserted into the vector functions as a promoter.
[0197]With transformation of pRET1138, the specific activity of the recombinant R. erythropolis MAK-34 strain was 37.7 μg/h mL/O.D. while the specific activity of the recombinant R. erythropolis JCM2895 was 34.9 μg/h mL/O.D., and therefore expression of the enzyme in both R. erythropolis strains was confirmed.
TABLE 6
Vector      MAK-34    JCM2895
pRET1102    1.0       1.0
pRET1104    0.7       2.0
pRET1114    17.2      not tested
pRET1138    37.7      34.9
Specific activity (units: μg/h mL/O.D.)
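To make the comparison in Table 6 explicit, the specific activities can be expressed relative to the pRET1102 control, which lacks the mak gene. The sketch below only restates the values of Table 6; the function name and the choice of pRET1102 as the default reference are illustrative.

```python
# Specific activities from Table 6 (None = not tested).
TABLE6 = {
    "pRET1102": {"MAK-34": 1.0, "JCM2895": 1.0},
    "pRET1104": {"MAK-34": 0.7, "JCM2895": 2.0},
    "pRET1114": {"MAK-34": 17.2, "JCM2895": None},
    "pRET1138": {"MAK-34": 37.7, "JCM2895": 34.9},
}


def fold_over_control(vector, strain, control="pRET1102"):
    """Return activity of `vector` divided by that of `control`, or None if untested."""
    value = TABLE6[vector][strain]
    base = TABLE6[control][strain]
    if value is None or base is None:
        return None
    return value / base


for vector in TABLE6:
    for strain in ("MAK-34", "JCM2895"):
        fold = fold_over_control(vector, strain)
        label = "not tested" if fold is None else f"{fold:.1f}-fold"
        print(f"{vector} in {strain}: {label}")
```

Because the control value is 1.0 in both strains, the fold values coincide numerically with the tabulated activities. Passing control="pRET1104" instead gives the comparison between vectors with and without the exogenous promoter.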
Example 13
Purification of Enzyme
[0198]The recombinant cells obtained in Example 12 were cultured at 30° C. for 4 days in 100 mL of LB medium containing 100 μg/mL kanamycin, the cells were harvested by centrifugation at 12,000 rpm for 5 minutes, and the His-tagged protein was purified with The QIAexpressionist Kit (Qiagen). Specifically, the cells were disrupted by ultrasonic treatment, the supernatant was obtained by centrifugation, and the protein was purified with a nickel chelate column. Upon applying the obtained protein to SDS-PAGE, a protein band with a molecular weight of approximately 28,000 was observed. This molecular weight is roughly equivalent to the molecular weight of the aminoketone asymmetric reductase described in International Patent Publication WO02/070714, thus indicating that the aminoketone asymmetric reductase was produced in the recombinant Rhodococcus strains.
Example 14
Enzymatic Production of β-aminoalcohol
[0199]A 0.5 mL portion of reaction solution containing the purified enzyme (0.5 μg/mL) obtained in Example 13, 5 mM NADPH, 120 mM Tris-HCl (pH 7.5) and 5 mM EAM was reacted at 37° C. for 16 hours. The substrate and product (EPE) were analyzed by HPLC. The column used was an Inertsil Ph-3 3.0×75 mm, the eluent was aqueous 7% acetonitrile and 0.05 M sodium phosphate buffer (pH 6.0), and the detection was carried out with UV (220 nm). The results confirmed production of EPE.
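The absolute amounts present in the 0.5 mL reaction follow directly from the stated concentrations. The sketch below is a simple unit conversion; the helper name is hypothetical.

```python
def amount_umol(conc_mM, volume_mL):
    """Convert concentration and volume to micromoles (mM x mL = umol)."""
    return conc_mM * volume_mL


VOLUME_ML = 0.5  # reaction volume stated in the text

# Concentrations stated in the text, in mM.
components_mM = {"NADPH": 5.0, "Tris-HCl": 120.0, "EAM": 5.0}

amounts = {name: amount_umol(conc, VOLUME_ML) for name, conc in components_mM.items()}
enzyme_ug = 0.5 * VOLUME_ML  # purified enzyme at 0.5 ug/mL

for name, umol in amounts.items():
    print(f"{name}: {umol} umol")
print(f"Enzyme: {enzyme_ug} ug")
```

NADPH and EAM are present in equimolar amounts. Assuming one NADPH is consumed per ketone reduced, the cofactor supply would be sufficient for complete conversion of the substrate.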
[0200]Similarly, the purified enzyme or the crude enzyme extract obtained from the recombinant cells cultured as described in Example 13 was reacted with 1-2-dimethylaminopropiophenone and 1-amino-2-butanone, etc. and production of the corresponding β-aminoalcohols was confirmed.
INDUSTRIAL APPLICABILITY
[0201]As explained above, the plasmids and shuttle vectors of the invention are derived from Rhodococcus strains (especially Rhodococcus erythropolis and Rhodococcus rhodnii), and when they are utilized for modification of the same bacteria by recombination, they allow creation of bacterial strains that more efficiently produce aminoketone asymmetric reductases. They also permit mass production of useful enzymes including aminoketone asymmetric reductases in transformants.
Sequence CWU
1
Number of SEQ ID NOs: 100
SEQ ID NO: 1; length: 279; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 202bp to 480bp pRET1100
1atgactctga gggtggacga accggagtcg gtgagaatgc ttcatccgag cgcttccccg
60gaagactgtg ccctggtcga gaccttcaag cctggtacct gccttttcga gaagccagga
120gaaggccggc agattatgcg atgcgacttt gtcggcgagt acgggagata tgcgcgagcc
180atcgagtctt cggatctgcg ttttctcgcc accctccagc aagaccaggc ccaacgcgaa
240ttcttcgctg aggagttcgg tgtggtggat ccgtcatga
SEQ ID NO: 2; length: 243; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 238bp to 480bp pRET1100
2atgcttcatc cgagcgcttc cccggaagac tgtgccctgg tcgagacctt caagcctggt
60acctgccttt tcgagaagcc aggagaaggc cggcagatta tgcgatgcga ctttgtcggc
120gagtacggga gatatgcgcg agccatcgag tcttcggatc tgcgttttct cgccaccctc
180cagcaagacc aggcccaacg cgaattcttc gctgaggagt tcggtgtggt ggatccgtca
240tga
SEQ ID NO: 3; length: 144; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 337bp to 480bp pRET1100
3atgcgatgcg actttgtcgg cgagtacggg agatatgcgc gagccatcga gtcttcggat
60ctgcgttttc tcgccaccct ccagcaagac caggcccaac gcgaattctt cgctgaggag
120ttcggtgtgg tggatccgtc atga
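Each entry in this listing pairs a coordinate range on pRET1100 with a sequence that appears to be an open reading frame (ATG start, in-frame stop codon). As a hedged illustration, the sketch below checks that a coordinate range matches the stated length, reports the strand implied by the coordinate order, and applies a basic reading-frame test to SEQ ID NO: 3. The function names are hypothetical, and the interpretation of descending coordinates as the complementary strand is an assumption.

```python
# SEQ ID NO: 3 as printed above, with position counts removed.
SEQ3 = "".join("""
atgcgatgcg actttgtcgg cgagtacggg agatatgcgc gagccatcga gtcttcggat
ctgcgttttc tcgccaccct ccagcaagac caggcccaac gcgaattctt cgctgaggag
ttcggtgtgg tggatccgtc atga
""".split())

STOP_CODONS = {"taa", "tag", "tga"}


def feature_length(start, end):
    """Length in bases of a feature given inclusive coordinates in either order."""
    return abs(end - start) + 1


def strand(start, end):
    """ASSUMPTION: descending coordinates denote the complementary strand."""
    return "+" if start <= end else "-"


def looks_like_orf(seq):
    """True if seq starts with atg, is a multiple of 3, and its only in-frame stop is the last codon."""
    seq = seq.lower()
    if len(seq) % 3 != 0 or not seq.startswith("atg"):
        return False
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    return codons[-1] in STOP_CODONS and not any(c in STOP_CODONS for c in codons[:-1])


print(len(SEQ3), feature_length(337, 480), strand(337, 480), looks_like_orf(SEQ3))
print(feature_length(1875, 1444), strand(1875, 1444))  # coordinates of SEQ ID NO: 17
```

The same checks can be applied to any other entry by substituting its sequence and coordinates.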
SEQ ID NO: 4; length: 282; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 477bp to 758bp pRET1100
4atgactggac cacaggagag aaagcgcaag gcggcgaagc cgtcgcggga gcctcagttg
60aactgctgtg aagcggacgt gccgaaacga gcaaaacagc ccccggttcc ctctacgttc
120gacctgctca cggtgaagga gactgcgggg ctgctgagag tcagtcaggc aactctttac
180cggctgcttc ggagtgggga aggacccaca tacacacgga tcggtggaca gatacgcgtt
240caccgcgagt cgctgcgtcg gttcatcgaa ccgcgtggat aa
SEQ ID NO: 5; length: 1548; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 862bp to 2409bp pRET1100
5atgcacttcc acgataacgc agaggtcgga caagagggaa gaactgccgt tctctcgccg
60ttgcgcggcg tagccgccaa gcgggacgtg tctgacgatg cagcgaagcg gagtcggcag
120gcgcggcacg cgcctgggct tgttacatct gccacaactg tccgtgaatc tctgccagct
180cctgaaaccg ctggtcaggg ccttgcggaa tccgtgaccg ctgatgattt ttggtctcat
240tcgttccccc gcgctgacga tgtacgcggc gcagctgctt ccttccagtc ggtggctaac
300tgggatgggc gtgagggtcc gaggccgcgt ttcgttgtcg cgcctggcgt tgtccgcttg
360gaggtttgtg atctcgcacg ccgcgaacga acggctgaac gtgcgtatct ggctgctcgg
420gctcgggtgg atatggcggc tgccaggcat aactcgccgt acgacttcga cgtggacgat
480gaagagttgg cggaactggc ttctctgcaa ggcctcgagg acgacgacat tgggggctgg
540tctgcggaga gggaaatagt gggctggtct gctcgttctc ggtcacggat gatcttgcga
600atggcagaac tcgactgggc tcccatgatg gatttgccgg gcattcctgc gatggtgacc
660ctcacctatc cgggggactg gcttacggtt gcccccaccg gcgctgaggt caaaaaacat
720ctccagacgt tcttcaaacg gttccaacgg gcctggggca ttgcctggat gggtgcgtgg
780aaaatggagt tccaaagccg aggcgctccg cattttcacc tgtacatggt ccctcctcat
840gggaaggcag gagactcgcg gaagctgcgg catgatgctg agctcttgaa atgggagata
900gcacgtgcag agggtgaaga cccaggtcgc aggccgtatt tccgggaagc tccaagcgat
960ggattgaagt ttcgtccgtg gctttctgcg gtgtgggccg acgtcgtaga tcatccggac
1020cccaaggaaa aagaaaagca cgtcagtgcc ggcactggag tggactacgc ggagggcacg
1080cgagggtcag atccgaaaag gcttgcggtg tacttctcca agcatggaac ctttgccgac
1140aaggaatatc agcacgtagt tcctgctcaa tggcagaaaa cgggtgcggg acctggcagg
1200ttctggggct accgcggttt gtcgccggcc acggctgcca ccgagatttc ctgggatgag
1260tacctgcttt tatctcgcac gttgcgacga ttgtcagcgc gaacgaagat ctgggacccg
1320gctttacgag gcggtagcgg cggccacaga tggactaagg cgatgatgcg acgcacggtt
1380acccggcacc gcttggacct cgtgaccggt gagattctgg gcacgaagac gcggaaggtt
1440cgggcgccag tgaagaggtt tgtccggact tcgggatacc tgtgtgtcaa tgacgggccc
1500gcactggctc gaaccctcag ccgtcttcgt acaagctgcc tgagctag
SEQ ID NO: 6; length: 1116; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1294bp to 2409bp pRET1100
6atggcggctg ccaggcataa ctcgccgtac gacttcgacg tggacgatga agagttggcg
60gaactggctt ctctgcaagg cctcgaggac gacgacattg ggggctggtc tgcggagagg
120gaaatagtgg gctggtctgc tcgttctcgg tcacggatga tcttgcgaat ggcagaactc
180gactgggctc ccatgatgga tttgccgggc attcctgcga tggtgaccct cacctatccg
240ggggactggc ttacggttgc ccccaccggc gctgaggtca aaaaacatct ccagacgttc
300ttcaaacggt tccaacgggc ctggggcatt gcctggatgg gtgcgtggaa aatggagttc
360caaagccgag gcgctccgca ttttcacctg tacatggtcc ctcctcatgg gaaggcagga
420gactcgcgga agctgcggca tgatgctgag ctcttgaaat gggagatagc acgtgcagag
480ggtgaagacc caggtcgcag gccgtatttc cgggaagctc caagcgatgg attgaagttt
540cgtccgtggc tttctgcggt gtgggccgac gtcgtagatc atccggaccc caaggaaaaa
600gaaaagcacg tcagtgccgg cactggagtg gactacgcgg agggcacgcg agggtcagat
660ccgaaaaggc ttgcggtgta cttctccaag catggaacct ttgccgacaa ggaatatcag
720cacgtagttc ctgctcaatg gcagaaaacg ggtgcgggac ctggcaggtt ctggggctac
780cgcggtttgt cgccggccac ggctgccacc gagatttcct gggatgagta cctgctttta
840tctcgcacgt tgcgacgatt gtcagcgcga acgaagatct gggacccggc tttacgaggc
900ggtagcggcg gccacagatg gactaaggcg atgatgcgac gcacggttac ccggcaccgc
960ttggacctcg tgaccggtga gattctgggc acgaagacgc ggaaggttcg ggcgccagtg
1020aagaggtttg tccggacttc gggatacctg tgtgtcaatg acgggcccgc actggctcga
1080accctcagcc gtcttcgtac aagctgcctg agctag
SEQ ID NO: 7; length: 960; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1450bp to 2409bp pRET1100
7atgatcttgc gaatggcaga actcgactgg gctcccatga tggatttgcc gggcattcct
60gcgatggtga ccctcaccta tccgggggac tggcttacgg ttgcccccac cggcgctgag
120gtcaaaaaac atctccagac gttcttcaaa cggttccaac gggcctgggg cattgcctgg
180atgggtgcgt ggaaaatgga gttccaaagc cgaggcgctc cgcattttca cctgtacatg
240gtccctcctc atgggaaggc aggagactcg cggaagctgc ggcatgatgc tgagctcttg
300aaatgggaga tagcacgtgc agagggtgaa gacccaggtc gcaggccgta tttccgggaa
360gctccaagcg atggattgaa gtttcgtccg tggctttctg cggtgtgggc cgacgtcgta
420gatcatccgg accccaagga aaaagaaaag cacgtcagtg ccggcactgg agtggactac
480gcggagggca cgcgagggtc agatccgaaa aggcttgcgg tgtacttctc caagcatgga
540acctttgccg acaaggaata tcagcacgta gttcctgctc aatggcagaa aacgggtgcg
600ggacctggca ggttctgggg ctaccgcggt ttgtcgccgg ccacggctgc caccgagatt
660tcctgggatg agtacctgct tttatctcgc acgttgcgac gattgtcagc gcgaacgaag
720atctgggacc cggctttacg aggcggtagc ggcggccaca gatggactaa ggcgatgatg
780cgacgcacgg ttacccggca ccgcttggac ctcgtgaccg gtgagattct gggcacgaag
840acgcggaagg ttcgggcgcc agtgaagagg tttgtccgga cttcgggata cctgtgtgtc
900aatgacgggc ccgcactggc tcgaaccctc agccgtcttc gtacaagctg cctgagctag
SEQ ID NO: 8; length: 948; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1462bp to 2409bp pRET1100
8atggcagaac tcgactgggc tcccatgatg gatttgccgg gcattcctgc gatggtgacc
60ctcacctatc cgggggactg gcttacggtt gcccccaccg gcgctgaggt caaaaaacat
120ctccagacgt tcttcaaacg gttccaacgg gcctggggca ttgcctggat gggtgcgtgg
180aaaatggagt tccaaagccg aggcgctccg cattttcacc tgtacatggt ccctcctcat
240gggaaggcag gagactcgcg gaagctgcgg catgatgctg agctcttgaa atgggagata
300gcacgtgcag agggtgaaga cccaggtcgc aggccgtatt tccgggaagc tccaagcgat
360ggattgaagt ttcgtccgtg gctttctgcg gtgtgggccg acgtcgtaga tcatccggac
420cccaaggaaa aagaaaagca cgtcagtgcc ggcactggag tggactacgc ggagggcacg
480cgagggtcag atccgaaaag gcttgcggtg tacttctcca agcatggaac ctttgccgac
540aaggaatatc agcacgtagt tcctgctcaa tggcagaaaa cgggtgcggg acctggcagg
600ttctggggct accgcggttt gtcgccggcc acggctgcca ccgagatttc ctgggatgag
660tacctgcttt tatctcgcac gttgcgacga ttgtcagcgc gaacgaagat ctgggacccg
720gctttacgag gcggtagcgg cggccacaga tggactaagg cgatgatgcg acgcacggtt
780acccggcacc gcttggacct cgtgaccggt gagattctgg gcacgaagac gcggaaggtt
840cgggcgccag tgaagaggtt tgtccggact tcgggatacc tgtgtgtcaa tgacgggccc
900gcactggctc gaaccctcag ccgtcttcgt acaagctgcc tgagctag
SEQ ID NO: 9; length: 924; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1486bp to 2409bp pRET1100
9atgatggatt tgccgggcat tcctgcgatg gtgaccctca cctatccggg ggactggctt
60acggttgccc ccaccggcgc tgaggtcaaa aaacatctcc agacgttctt caaacggttc
120caacgggcct ggggcattgc ctggatgggt gcgtggaaaa tggagttcca aagccgaggc
180gctccgcatt ttcacctgta catggtccct cctcatggga aggcaggaga ctcgcggaag
240ctgcggcatg atgctgagct cttgaaatgg gagatagcac gtgcagaggg tgaagaccca
300ggtcgcaggc cgtatttccg ggaagctcca agcgatggat tgaagtttcg tccgtggctt
360tctgcggtgt gggccgacgt cgtagatcat ccggacccca aggaaaaaga aaagcacgtc
420agtgccggca ctggagtgga ctacgcggag ggcacgcgag ggtcagatcc gaaaaggctt
480gcggtgtact tctccaagca tggaaccttt gccgacaagg aatatcagca cgtagttcct
540gctcaatggc agaaaacggg tgcgggacct ggcaggttct ggggctaccg cggtttgtcg
600ccggccacgg ctgccaccga gatttcctgg gatgagtacc tgcttttatc tcgcacgttg
660cgacgattgt cagcgcgaac gaagatctgg gacccggctt tacgaggcgg tagcggcggc
720cacagatgga ctaaggcgat gatgcgacgc acggttaccc ggcaccgctt ggacctcgtg
780accggtgaga ttctgggcac gaagacgcgg aaggttcggg cgccagtgaa gaggtttgtc
840cggacttcgg gatacctgtg tgtcaatgac gggcccgcac tggctcgaac cctcagccgt
900cttcgtacaa gctgcctgag ctag
SEQ ID NO: 10; length: 921; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1489bp to 2409bp pRET1100
10atggatttgc cgggcattcc tgcgatggtg accctcacct atccggggga ctggcttacg
60gttgccccca ccggcgctga ggtcaaaaaa catctccaga cgttcttcaa acggttccaa
120cgggcctggg gcattgcctg gatgggtgcg tggaaaatgg agttccaaag ccgaggcgct
180ccgcattttc acctgtacat ggtccctcct catgggaagg caggagactc gcggaagctg
240cggcatgatg ctgagctctt gaaatgggag atagcacgtg cagagggtga agacccaggt
300cgcaggccgt atttccggga agctccaagc gatggattga agtttcgtcc gtggctttct
360gcggtgtggg ccgacgtcgt agatcatccg gaccccaagg aaaaagaaaa gcacgtcagt
420gccggcactg gagtggacta cgcggagggc acgcgagggt cagatccgaa aaggcttgcg
480gtgtacttct ccaagcatgg aacctttgcc gacaaggaat atcagcacgt agttcctgct
540caatggcaga aaacgggtgc gggacctggc aggttctggg gctaccgcgg tttgtcgccg
600gccacggctg ccaccgagat ttcctgggat gagtacctgc ttttatctcg cacgttgcga
660cgattgtcag cgcgaacgaa gatctgggac ccggctttac gaggcggtag cggcggccac
720agatggacta aggcgatgat gcgacgcacg gttacccggc accgcttgga cctcgtgacc
780ggtgagattc tgggcacgaa gacgcggaag gttcgggcgc cagtgaagag gtttgtccgg
840acttcgggat acctgtgtgt caatgacggg cccgcactgg ctcgaaccct cagccgtctt
900cgtacaagct gcctgagcta g
SEQ ID NO: 11; length: 897; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1513bp to 2409bp pRET1100
11atggtgaccc tcacctatcc gggggactgg cttacggttg cccccaccgg cgctgaggtc
60aaaaaacatc tccagacgtt cttcaaacgg ttccaacggg cctggggcat tgcctggatg
120ggtgcgtgga aaatggagtt ccaaagccga ggcgctccgc attttcacct gtacatggtc
180cctcctcatg ggaaggcagg agactcgcgg aagctgcggc atgatgctga gctcttgaaa
240tgggagatag cacgtgcaga gggtgaagac ccaggtcgca ggccgtattt ccgggaagct
300ccaagcgatg gattgaagtt tcgtccgtgg ctttctgcgg tgtgggccga cgtcgtagat
360catccggacc ccaaggaaaa agaaaagcac gtcagtgccg gcactggagt ggactacgcg
420gagggcacgc gagggtcaga tccgaaaagg cttgcggtgt acttctccaa gcatggaacc
480tttgccgaca aggaatatca gcacgtagtt cctgctcaat ggcagaaaac gggtgcggga
540cctggcaggt tctggggcta ccgcggtttg tcgccggcca cggctgccac cgagatttcc
600tgggatgagt acctgctttt atctcgcacg ttgcgacgat tgtcagcgcg aacgaagatc
660tgggacccgg ctttacgagg cggtagcggc ggccacagat ggactaaggc gatgatgcga
720cgcacggtta cccggcaccg cttggacctc gtgaccggtg agattctggg cacgaagacg
780cggaaggttc gggcgccagt gaagaggttt gtccggactt cgggatacct gtgtgtcaat
840gacgggcccg cactggctcg aaccctcagc cgtcttcgta caagctgcct gagctag
SEQ ID NO: 12; length: 780; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1630bp to 2409bp pRET1100
12atgggtgcgt ggaaaatgga gttccaaagc cgaggcgctc cgcattttca cctgtacatg
60gtccctcctc atgggaaggc aggagactcg cggaagctgc ggcatgatgc tgagctcttg
120aaatgggaga tagcacgtgc agagggtgaa gacccaggtc gcaggccgta tttccgggaa
180gctccaagcg atggattgaa gtttcgtccg tggctttctg cggtgtgggc cgacgtcgta
240gatcatccgg accccaagga aaaagaaaag cacgtcagtg ccggcactgg agtggactac
300gcggagggca cgcgagggtc agatccgaaa aggcttgcgg tgtacttctc caagcatgga
360acctttgccg acaaggaata tcagcacgta gttcctgctc aatggcagaa aacgggtgcg
420ggacctggca ggttctgggg ctaccgcggt ttgtcgccgg ccacggctgc caccgagatt
480tcctgggatg agtacctgct tttatctcgc acgttgcgac gattgtcagc gcgaacgaag
540atctgggacc cggctttacg aggcggtagc ggcggccaca gatggactaa ggcgatgatg
600cgacgcacgg ttacccggca ccgcttggac ctcgtgaccg gtgagattct gggcacgaag
660acgcggaagg ttcgggcgcc agtgaagagg tttgtccgga cttcgggata cctgtgtgtc
720aatgacgggc ccgcactggc tcgaaccctc agccgtcttc gtacaagctg cctgagctag
SEQ ID NO: 13; length: 765; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1645bp to 2409bp pRET1100
13atggagttcc aaagccgagg cgctccgcat tttcacctgt acatggtccc tcctcatggg
60aaggcaggag actcgcggaa gctgcggcat gatgctgagc tcttgaaatg ggagatagca
120cgtgcagagg gtgaagaccc aggtcgcagg ccgtatttcc gggaagctcc aagcgatgga
180ttgaagtttc gtccgtggct ttctgcggtg tgggccgacg tcgtagatca tccggacccc
240aaggaaaaag aaaagcacgt cagtgccggc actggagtgg actacgcgga gggcacgcga
300gggtcagatc cgaaaaggct tgcggtgtac ttctccaagc atggaacctt tgccgacaag
360gaatatcagc acgtagttcc tgctcaatgg cagaaaacgg gtgcgggacc tggcaggttc
420tggggctacc gcggtttgtc gccggccacg gctgccaccg agatttcctg ggatgagtac
480ctgcttttat ctcgcacgtt gcgacgattg tcagcgcgaa cgaagatctg ggacccggct
540ttacgaggcg gtagcggcgg ccacagatgg actaaggcga tgatgcgacg cacggttacc
600cggcaccgct tggacctcgt gaccggtgag attctgggca cgaagacgcg gaaggttcgg
660gcgccagtga agaggtttgt ccggacttcg ggatacctgt gtgtcaatga cgggcccgca
720ctggctcgaa ccctcagccg tcttcgtaca agctgcctga gctag
SEQ ID NO: 14; length: 723; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1687bp to 2409bp pRET1100
14atggtccctc ctcatgggaa ggcaggagac tcgcggaagc tgcggcatga tgctgagctc
60ttgaaatggg agatagcacg tgcagagggt gaagacccag gtcgcaggcc gtatttccgg
120gaagctccaa gcgatggatt gaagtttcgt ccgtggcttt ctgcggtgtg ggccgacgtc
180gtagatcatc cggaccccaa ggaaaaagaa aagcacgtca gtgccggcac tggagtggac
240tacgcggagg gcacgcgagg gtcagatccg aaaaggcttg cggtgtactt ctccaagcat
300ggaacctttg ccgacaagga atatcagcac gtagttcctg ctcaatggca gaaaacgggt
360gcgggacctg gcaggttctg gggctaccgc ggtttgtcgc cggccacggc tgccaccgag
420atttcctggg atgagtacct gcttttatct cgcacgttgc gacgattgtc agcgcgaacg
480aagatctggg acccggcttt acgaggcggt agcggcggcc acagatggac taaggcgatg
540atgcgacgca cggttacccg gcaccgcttg gacctcgtga ccggtgagat tctgggcacg
600aagacgcgga aggttcgggc gccagtgaag aggtttgtcc ggacttcggg atacctgtgt
660gtcaatgacg ggcccgcact ggctcgaacc ctcagccgtc ttcgtacaag ctgcctgagc
720tag
SEQ ID NO: 15; length: 186; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 2224bp to 2409bp pRET1100
15atgatgcgac gcacggttac ccggcaccgc ttggacctcg tgaccggtga gattctgggc
60acgaagacgc ggaaggttcg ggcgccagtg aagaggtttg tccggacttc gggatacctg
120tgtgtcaatg acgggcccgc actggctcga accctcagcc gtcttcgtac aagctgcctg
180agctag
SEQ ID NO: 16; length: 183; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 2227bp to 2409bp pRET1100
16atgcgacgca cggttacccg gcaccgcttg gacctcgtga ccggtgagat tctgggcacg
60aagacgcgga aggttcgggc gccagtgaag aggtttgtcc ggacttcggg atacctgtgt
120gtcaatgacg ggcccgcact ggctcgaacc ctcagccgtc ttcgtacaag ctgcctgagc
180tag
SEQ ID NO: 17; length: 432; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1875bp to 1444bp pRET1100
17atgatctacg acgtcggccc acaccgcaga aagccacgga cgaaacttca atccatcgct
60tggagcttcc cggaaatacg gcctgcgacc tgggtcttca ccctctgcac gtgctatctc
120ccatttcaag agctcagcat catgccgcag cttccgcgag tctcctgcct tcccatgagg
180agggaccatg tacaggtgaa aatgcggagc gcctcggctt tggaactcca ttttccacgc
240acccatccag gcaatgcccc aggcccgttg gaaccgtttg aagaacgtct ggagatgttt
300tttgacctca gcgccggtgg gggcaaccgt aagccagtcc cccggatagg tgagggtcac
360catcgcagga atgcccggca aatccatcat gggagcccag tcgagttctg ccattcgcaa
420gatcatccgt ga
SEQ ID NO: 18; length: 291; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1734bp to 1444bp pRET1100
18atgccgcagc ttccgcgagt ctcctgcctt cccatgagga gggaccatgt acaggtgaaa
60atgcggagcg cctcggcttt ggaactccat tttccacgca cccatccagg caatgcccca
120ggcccgttgg aaccgtttga agaacgtctg gagatgtttt ttgacctcag cgccggtggg
180ggcaaccgta agccagtccc ccggataggt gagggtcacc atcgcaggaa tgcccggcaa
240atccatcatg ggagcccagt cgagttctgc cattcgcaag atcatccgtg a
SEQ ID NO: 19; length: 258; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1701bp to 1444bp pRET1100
19atgaggaggg accatgtaca ggtgaaaatg cggagcgcct cggctttgga actccatttt
60ccacgcaccc atccaggcaa tgccccaggc ccgttggaac cgtttgaaga acgtctggag
120atgttttttg acctcagcgc cggtgggggc aaccgtaagc cagtcccccg gataggtgag
180ggtcaccatc gcaggaatgc ccggcaaatc catcatggga gcccagtcga gttctgccat
240tcgcaagatc atccgtga
SEQ ID NO: 20; length: 231; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1674bp to 1444bp pRET1100
20atgcggagcg cctcggcttt ggaactccat tttccacgca cccatccagg caatgcccca
60ggcccgttgg aaccgtttga agaacgtctg gagatgtttt ttgacctcag cgccggtggg
120ggcaaccgta agccagtccc ccggataggt gagggtcacc atcgcaggaa tgcccggcaa
180atccatcatg ggagcccagt cgagttctgc cattcgcaag atcatccgtg a
SEQ ID NO: 21; length: 138; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 1581bp to 1444bp pRET1100
21atgttttttg acctcagcgc cggtgggggc aaccgtaagc cagtcccccg gataggtgag
60ggtcaccatc gcaggaatgc ccggcaaatc catcatggga gcccagtcga gttctgccat
120tcgcaagatc atccgtga
SEQ ID NO: 22; length: 423; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 2828bp to 2406bp pRET1100
22atggtgggag ggcaacactc ccaatacgct tcagttatga atgaagacag agacaacatc
60atcgccaggt tccgcgtcga aatgctccgc tcaatcgagg atgcaattca tttagccgca
120ctctccgcga acgacgaaaa ccgttatgcc gcaacagaag acaatcgacc cgtgcggaca
180caactatcgc aacaacagca ggttgtcctg accgagctga cattggccga ccacatggaa
240aagctcgcgc gggagcacct cgtttaccta gccgacagag cgcgggagat gaattgcacc
300tgggtagaga taggtcagtc gttgggtctc tctccccacg gagcgcagca gcgcatcacc
360agaagccgcc caaaacccgc catccagcaa aagacaaagc cgaaaggcgt tccgcgcgtc
420tag
SEQ ID NO: 23; length: 387; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 2792bp to 2406bp pRET1100
23atgaatgaag acagagacaa catcatcgcc aggttccgcg tcgaaatgct ccgctcaatc
60gaggatgcaa ttcatttagc cgcactctcc gcgaacgacg aaaaccgtta tgccgcaaca
120gaagacaatc gacccgtgcg gacacaacta tcgcaacaac agcaggttgt cctgaccgag
180ctgacattgg ccgaccacat ggaaaagctc gcgcgggagc acctcgttta cctagccgac
240agagcgcggg agatgaattg cacctgggta gagataggtc agtcgttggg tctctctccc
300cacggagcgc agcagcgcat caccagaagc cgcccaaaac ccgccatcca gcaaaagaca
360aagccgaaag gcgttccgcg cgtctag
SEQ ID NO: 24; length: 342; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 2747bp to 2406bp pRET1100
24atgctccgct caatcgagga tgcaattcat ttagccgcac tctccgcgaa cgacgaaaac
60cgttatgccg caacagaaga caatcgaccc gtgcggacac aactatcgca acaacagcag
120gttgtcctga ccgagctgac attggccgac cacatggaaa agctcgcgcg ggagcacctc
180gtttacctag ccgacagagc gcgggagatg aattgcacct gggtagagat aggtcagtcg
240ttgggtctct ctccccacgg agcgcagcag cgcatcacca gaagccgccc aaaacccgcc
300atccagcaaa agacaaagcc gaaaggcgtt ccgcgcgtct ag
SEQ ID NO: 25; length: 189; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 2594bp to 2406bp pRET1100
25atggaaaagc tcgcgcggga gcacctcgtt tacctagccg acagagcgcg ggagatgaat
60tgcacctggg tagagatagg tcagtcgttg ggtctctctc cccacggagc gcagcagcgc
120atcaccagaa gccgcccaaa acccgccatc cagcaaaaga caaagccgaa aggcgttccg
180cgcgtctag
SEQ ID NO: 26; length: 135; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 2540bp to 2406bp pRET1100
26atgaattgca cctgggtaga gataggtcag tcgttgggtc tctctcccca cggagcgcag
60cagcgcatca ccagaagccg cccaaaaccc gccatccagc aaaagacaaa gccgaaaggc
120gttccgcgcg tctag
SEQ ID NO: 27; length: 336; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 2971bp to 3306bp pRET1100
27atggctttga aagctgctgg caacgtgatt cctgattcct ccgcgtacga gtaccgggcg
60gttcaggtcg agccgaagat ggtcagaaaa gacccggaag acccgaactc tgagcagttc
120cagaagcaga aggacggcac gccggtgtgg tcgatcgact gcattcgggt cgaccgggca
180tcaggcaaca aggcaatcgt gaccgtgacg gttccggacg tgatggaacc ggatgttgcg
240gggccggtgg agttctccga gatgattgcc ggtttctggg tttcgcgcag tggttcgggc
300atgtggtttt cggcaagcgc cgtcgcttct ctctga
SEQ ID NO: 28; length: 258; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 3049bp to 3306bp pRET1100
28atggtcagaa aagacccgga agacccgaac tctgagcagt tccagaagca gaaggacggc
60acgccggtgt ggtcgatcga ctgcattcgg gtcgaccggg catcaggcaa caaggcaatc
120gtgaccgtga cggttccgga cgtgatggaa ccggatgttg cggggccggt ggagttctcc
180gagatgattg ccggtttctg ggtttcgcgc agtggttcgg gcatgtggtt ttcggcaagc
240gccgtcgctt ctctctga
SEQ ID NO: 29; length: 525; type: DNA; organism: Rhodococcus erythropolis; misc_feature: 3577bp to 3053bp pRET1100
29atgtcgatgt actgccctcc gctgaacggc cccagctctt ccggagagag aacgaggcac
60ccggcaacgt ccgagaacac cccgttttcc cacttcggat cggccggcac tctcagcggc
120acagcttcgg actgtgaacg atcactgaac acgttcgccg cttgccaacc tgccgcaacc
180agcacaaaca cgagcacgag ggcacccaca cccagcgcaa cgccttttcc tttggacatt
240tccgaacctt tcgaggggcg acgatcagcg atcagagaga agcgacggcg cttgccgaaa
300accacatgcc cgaaccactg cgcgaaaccc agaaaccggc aatcatctcg gagaactcca
360ccggccccgc aacatccggt tccatcacgt ccggaaccgt cacggtcacg attgccttgt
420tgcctgatgc ccggtcgacc cgaatgcagt cgatcgacca caccggcgtg ccgtccttct
480gcttctggaa ctgctcagag ttcgggtctt ccgggtcttt tctga
525
30 519 DNA Rhodococcus erythropolis misc_feature 3571bp to 3053bp pRET1100
30atgtactgcc ctccgctgaa cggccccagc tcttccggag agagaacgag gcacccggca
60acgtccgaga acaccccgtt ttcccacttc ggatcggccg gcactctcag cggcacagct
120tcggactgtg aacgatcact gaacacgttc gccgcttgcc aacctgccgc aaccagcaca
180aacacgagca cgagggcacc cacacccagc gcaacgcctt ttcctttgga catttccgaa
240cctttcgagg ggcgacgatc agcgatcaga gagaagcgac ggcgcttgcc gaaaaccaca
300tgcccgaacc actgcgcgaa acccagaaac cggcaatcat ctcggagaac tccaccggcc
360ccgcaacatc cggttccatc acgtccggaa ccgtcacggt cacgattgcc ttgttgcctg
420atgcccggtc gacccgaatg cagtcgatcg accacaccgg cgtgccgtcc ttctgcttct
480ggaactgctc agagttcggg tcttccgggt cttttctga
519
31 564 DNA Rhodococcus erythropolis misc_feature 3339bp to 3902bp pRET1100
31atgtccaaag gaaaaggcgt tgcgctgggt gtgggtgccc tcgtgctcgt gtttgtgctg
60gttgcggcag gttggcaagc ggcgaacgtg ttcagtgatc gttcacagtc cgaagctgtg
120ccgctgagag tgccggccga tccgaagtgg gaaaacgggg tgttctcgga cgttgccggg
180tgcctcgttc tctctccgga agagctgggg ccgttcagcg gagggcagta catcgacata
240gtgaggccag ttgagccgga gaggttggag cgcgactggg tgaggtcggc tgagtgcgtt
300tcggcgtcga tgaatgtctc tgacctgttg gtttctgctc ttccagagtc cacccgtccc
360cccggcgatt tcgttcgttc gtggaaagtg gcgagtgatg attactgcta tgagggtgat
420aacccgcaag gctgcacttc tcgtatgccg gtttgggtct ctgcaaaaaa ctggtggtgc
480acagaacccg tactcgatcc gctcgttcgt cgctgtgagg tctttcctgc aaggcaaatc
540gttgtgccgg aaggggtttc gtga
564
32 255 DNA Rhodococcus erythropolis misc_feature 3648bp to 3902bp pRET1100
32atgaatgtct ctgacctgtt ggtttctgct cttccagagt ccacccgtcc ccccggcgat
60ttcgttcgtt cgtggaaagt ggcgagtgat gattactgct atgagggtga taacccgcaa
120ggctgcactt ctcgtatgcc ggtttgggtc tctgcaaaaa actggtggtg cacagaaccc
180gtactcgatc cgctcgttcg tcgctgtgag gtctttcctg caaggcaaat cgttgtgccg
240gaaggggttt cgtga
255
33 669 DNA Rhodococcus erythropolis misc_feature 4366bp to 5034bp pRET1100
33atgggcaccc cacgcccaag taaccgctgg tgcgctggat atttcggcgg tggtctcgtg
60agcggggaga agcggcacag cgaggccggc ccggtagaaa tcatcttttt gatgctggca
120gtcagggcgg gggactacat cgtcgccgtg actgcggttc tcgcggtcgg gttcttcgcg
180gtcgcggttg agggtttctg gttcctggtc gtcgcagtca tcgctgcacc ggcgtggtgg
240tttctgcgcg actgggaatc gaagcggagg gccgtacggg tctttgaacg ggcatggaag
300gggacacctg aatcccccgg tattgctctc tcccttggcc tgtcgaacgt ggcggggtct
360ctgccgaggt tgaggaagtt tgaaactggt tcggggatac gcacactcgt gttttctttg
420ccgcccggag tcactgccga gagctttgag aaagttcgcc ctgcgctggc agacgcgatg
480gggggtcacc gctgccaagt agagaaggtg gcccccggac aggtccgcgt cagagtgatt
540gatgaggatt cgatgaagac gccgcgtgat gcgggatggg cgaaagatgt tgtgctggaa
600gaggatacgt tcgacggtct tccgggcgag acgcgatcct ggttcgagca agaggggccg
660gcatcatga
669
34 558 DNA Rhodococcus erythropolis misc_feature 4477bp to 5034bp pRET1100
34atgctggcag tcagggcggg ggactacatc gtcgccgtga ctgcggttct cgcggtcggg
60ttcttcgcgg tcgcggttga gggtttctgg ttcctggtcg tcgcagtcat cgctgcaccg
120gcgtggtggt ttctgcgcga ctgggaatcg aagcggaggg ccgtacgggt ctttgaacgg
180gcatggaagg ggacacctga atcccccggt attgctctct cccttggcct gtcgaacgtg
240gcggggtctc tgccgaggtt gaggaagttt gaaactggtt cggggatacg cacactcgtg
300ttttctttgc cgcccggagt cactgccgag agctttgaga aagttcgccc tgcgctggca
360gacgcgatgg ggggtcaccg ctgccaagta gagaaggtgg cccccggaca ggtccgcgtc
420agagtgattg atgaggattc gatgaagacg ccgcgtgatg cgggatgggc gaaagatgtt
480gtgctggaag aggatacgtt cgacggtctt ccgggcgaga cgcgatcctg gttcgagcaa
540gaggggccgg catcatga
558
35 791 DNA Rhodococcus erythropolis misc_feature 2410bp to 3200bp pRET1100
35acgcgcggaa cgcctttcgg ctttgtcttt tgctggatgg cgggttttgg gcggcttctg
60gtgatgcgct gctgcgctcc gtggggagag agacccaacg actgacctat ctctacccag
120gtgcaattca tctcccgcgc tctgtcggct aggtaaacga ggtgctcccg cgcgagcttt
180tccatgtggt cggccaatgt cagctcggtc aggacaacct gctgttgttg cgatagttgt
240gtccgcacgg gtcgattgtc ttctgttgcg gcataacggt tttcgtcgtt cgcggagagt
300gcggctaaat gaattgcatc ctcgattgag cggagcattt cgacgcggaa cctggcgatg
360atgttgtctc tgtcttcatt cataactgaa gcgtattggg agtgttgccc tcccaccatg
420tgtgccaatg caggtgtgaa ctgagtcaca gtttctcaat agactccaag tttgtgatcc
480ttttactccc aaaatggggc atgatgtgtg cgtgcctcgg ttcaggggcg aaagttcgac
540acctcgaaag aaggcctcga catggctttg aaagctgctg gcaacgtgat tcctgattcc
600tccgcgtacg agtaccgggc ggttcaggtc gagccgaaga tggtcagaaa agacccggaa
660gacccgaact ctgagcagtt ccagaagcag aaggacggca cgccggtgtg gtcgatcgac
720tgcattcggg tcgaccgggc atcaggcaac aaggcaatcg tgaccgtgac ggttccggac
780gtgatggaac c
791
36 501 DNA Rhodococcus erythropolis misc_feature 1000bp to 1500bp pRET1100
36cttgttacat ctgccacaac tgtccgtgaa tctctgccag ctcctgaaac cgctggtcag
60ggccttgcgg aatccgtgac cgctgatgat ttttggtctc attcgttccc ccgcgctgac
120gatgtacgcg gcgcagctgc ttccttccag tcggtggcta actgggatgg gcgtgagggt
180ccgaggccgc gtttcgttgt cgcgcctggc gttgtccgct tggaggtttg tgatctcgca
240cgccgcgaac gaacggctga acgtgcgtat ctggctgctc gggctcgggt ggatatggcg
300gctgccaggc ataactcgcc gtacgacttc gacgtggacg atgaagagtt ggcggaactg
360gcttctctgc aaggcctcga ggacgacgac attgggggct ggtctgcgga gagggaaata
420gtgggctggt ctgctcgttc tcggtcacgg atgatcttgc gaatggcaga actcgactgg
480gctcccatga tggatttgcc g
501
37 945 DNA Rhodococcus erythropolis misc_feature 5000bp to 500bp pRET1100
37gatcctggtt cgagcaagag gggccggcat catgagaaaa tcggcgggag tatctcggat
60tcctatccgt ctcgggcgct ctcagtacgg ggaagacgtt ggattcgatc tcgctgcgga
120cgccgctcac atcgccatgc agggcaaaac ccgatccggc aaaagtcagg cgacgtacaa
180cgtgttagct caggcagcag cgaacgcggc ggttcgagtc gtagggtccg acccgacaca
240cgtactcctg gagcccttca aacatcgagg ggtgtccgag ccttacgtgg tttcgggact
300gaatgcgcag gccacggtgg acatgctggg ctgggtcaag cgtgagtctg atcgtcgcat
360cgaccagatg tggcccctgc gtaccgacaa gttttccgag ttcggggctt cgttcccgct
420gatactcgtc gtgctcgaag agtttcccgg gatcctcgag ggggcagcgg acgaagacgc
480cgcgttaggc cgaaaacctg ccgagcgtct cgcaccccgc atttcggcct acgtgcgtca
540gatagcagcg cagtcggcaa aggctggaat tcgccttctc ctgctctcgc aacgagcgga
600ggcctcgatc attggcggca atgcgcgttc gaatttcggg gtcaagatga ctctgagggt
660ggacgaaccg gagtcggtga gaatgcttca tccgagcgct tccccggaag actgtgccct
720ggtcgagacc ttcaagcctg gtacctgcct tttcgagaag ccaggagaag gccggcagat
780tatgcgatgc gactttgtcg gcgagtacgg gagatatgcg cgagccatcg agtcttcgga
840tctgcgtttt ctcgccaccc tccagcaaga ccaggcccaa cgcgaattct tcgctgagga
900gttcggtgtg gtggatccgt catgactgga ccacaggaga gaaag
945
38 939 DNA Rhodococcus rhodnii misc_feature 3350bp to 2412bp pRET1000
38atggttgcgg tggaagagca cacaggcggc gcctgggaac agctgtggct accgctgtgg
60ccactggcaa ccgacgattt cctcgacggc gtctaccgga tgcggcgatc agacgcactg
120gatcgccgct acatcgagtc gaacccgcag gcattgagca acctgctcgt cgtggacgtt
180gaccacccgg acgccgcgct gcgggcgctg tcggcggccg ggaatcatcc tctgccgaac
240gcgatcgtgg agaacccccg taacgggcac gcacacgctg tgtgggcgct ggcagagccg
300ttcacccgca ccgagtacgc ccgtcgtaag ccgctcgcct atgcggccgc cgtcaccgaa
360ggcctccggc gcgccgtcca gggggacaag ggctattcgg gcctgatgac caagaacccg
420actcacggtg actgggacac ccattggctg cacaccgagc ggcgatccct cgccgagctc
480gaggcggaac tcggcatcca catgccgcca acgcgctggc ggcaaacccg atcgcgccgt
540gagaacccga tcggcctcgg ccgaaactgc gccctgttcg aaaccgcacg cacctgggcc
600taccgcgaaa tccgcttcca ctggggcgac ccgaccggcc tcggggccgc gatctatgcg
660gaagccgcac agatcaacgc cacgttcagg aacccggtca caggcaggcc cgatccactg
720ccagcaagcg agctacgcgc cgtcgcggcc tccattaccc gctggatcac tacaaagtcc
780cggatgtggg ccgacggccc tgctgtctac gaggccacat tcatcgccat acaagccgca
840cgcggtcgca agatgagtga gaagaagcgc gaggcaaacc ggaaacgagc gacgaaggtc
900gaccggaacg cattgtggga ggcagaccgt gggcgctga
939
39 840 DNA Rhodococcus rhodnii misc_feature 3251bp to 2412bp pRET1000
39atgcggcgat cagacgcact ggatcgccgc tacatcgagt cgaacccgca ggcattgagc
60aacctgctcg tcgtggacgt tgaccacccg gacgccgcgc tgcgggcgct gtcggcggcc
120gggaatcatc ctctgccgaa cgcgatcgtg gagaaccccc gtaacgggca cgcacacgct
180gtgtgggcgc tggcagagcc gttcacccgc accgagtacg cccgtcgtaa gccgctcgcc
240tatgcggccg ccgtcaccga aggcctccgg cgcgccgtcc agggggacaa gggctattcg
300ggcctgatga ccaagaaccc gactcacggt gactgggaca cccattggct gcacaccgag
360cggcgatccc tcgccgagct cgaggcggaa ctcggcatcc acatgccgcc aacgcgctgg
420cggcaaaccc gatcgcgccg tgagaacccg atcggcctcg gccgaaactg cgccctgttc
480gaaaccgcac gcacctgggc ctaccgcgaa atccgcttcc actggggcga cccgaccggc
540ctcggggccg cgatctatgc ggaagccgca cagatcaacg ccacgttcag gaacccggtc
600acaggcaggc ccgatccact gccagcaagc gagctacgcg ccgtcgcggc ctccattacc
660cgctggatca ctacaaagtc ccggatgtgg gccgacggcc ctgctgtcta cgaggccaca
720ttcatcgcca tacaagccgc acgcggtcgc aagatgagtg agaagaagcg cgaggcaaac
780cggaaacgag cgacgaaggt cgaccggaac gcattgtggg aggcagaccg tgggcgctga
840
40 534 DNA Rhodococcus rhodnii misc_feature 2945bp to 2412bp pRET1000
40atgaccaaga acccgactca cggtgactgg gacacccatt ggctgcacac cgagcggcga
60tccctcgccg agctcgaggc ggaactcggc atccacatgc cgccaacgcg ctggcggcaa
120acccgatcgc gccgtgagaa cccgatcggc ctcggccgaa actgcgccct gttcgaaacc
180gcacgcacct gggcctaccg cgaaatccgc ttccactggg gcgacccgac cggcctcggg
240gccgcgatct atgcggaagc cgcacagatc aacgccacgt tcaggaaccc ggtcacaggc
300aggcccgatc cactgccagc aagcgagcta cgcgccgtcg cggcctccat tacccgctgg
360atcactacaa agtcccggat gtgggccgac ggccctgctg tctacgaggc cacattcatc
420gccatacaag ccgcacgcgg tcgcaagatg agtgagaaga agcgcgaggc aaaccggaaa
480cgagcgacga aggtcgaccg gaacgcattg tgggaggcag accgtgggcg ctga
534
41 438 DNA Rhodococcus rhodnii misc_feature 2849bp to 2412bp pRET1000
41atgccgccaa cgcgctggcg gcaaacccga tcgcgccgtg agaacccgat cggcctcggc
60cgaaactgcg ccctgttcga aaccgcacgc acctgggcct accgcgaaat ccgcttccac
120tggggcgacc cgaccggcct cggggccgcg atctatgcgg aagccgcaca gatcaacgcc
180acgttcagga acccggtcac aggcaggccc gatccactgc cagcaagcga gctacgcgcc
240gtcgcggcct ccattacccg ctggatcact acaaagtccc ggatgtgggc cgacggccct
300gctgtctacg aggccacatt catcgccata caagccgcac gcggtcgcaa gatgagtgag
360aagaagcgcg aggcaaaccg gaaacgagcg acgaaggtcg accggaacgc attgtgggag
420gcagaccgtg ggcgctga
438
42 207 DNA Rhodococcus rhodnii misc_feature 2365bp to 2159bp pRET1000
42atgggggcct ccacgcgcac gatccagcgc atcatggccg agccgcggga ccagttcctc
60gcacgggcag ccgagaaccg tcgccgggcc gtcgagctgc gcgagcaggg cctgaagtac
120cgcgagatcg ccgaggagat gggaatctcc accggaacgg tgggaaagct cctgcacgac
180gcacgcaagt acgcggtcag ctcctag
207
43 174 DNA Rhodococcus rhodnii misc_feature 2332bp to 2159bp pRET1000
43atggccgagc cgcgggacca gttcctcgca cgggcagccg agaaccgtcg ccgggccgtc
60gagctgcgcg agcagggcct gaagtaccgc gagatcgccg aggagatggg aatctccacc
120ggaacggtgg gaaagctcct gcacgacgca cgcaagtacg cggtcagctc ctag
174
44 330 DNA Rhodococcus rhodnii misc_feature 3197bp to 3526bp pRET1000
44atgcctgcgg gttcgactcg atgtagcggc gatccagtgc gtctgatcgc cgcatccggt
60agacgccgtc gaggaaatcg tcggttgcca gtggccacag cggtagccac agctgttccc
120aggcgccgcc tgtgtgctct tccaccgcaa ccatggggaa cacactcaca cacaagatcg
180atttattccg gtacgacacg ccagccaagt cagatgtttc ggtttctgga gcggtcctcc
240agacctttga gatccgctcc agaaacgtcc acaaattatt ggggtacgtc gaaccaagcc
300ttatcaggta tcccggggtt ccgggggtga
330
45 357 DNA Rhodococcus rhodnii misc_feature 4035bp to 3679bp pRET1000
45atggggtggt tattgcttgt tgcgtcgggg gccgtggcga tggtggccgg tgtggtctta
60ccgcgccggg atcgtctcgg gccggcacca ggatttccct ggttctgggt ggtgttccca
120tccacgtgca ttgccatcgc tgccgcggtg ggtgtcttcg cttggcccca agcggttacc
180ggcacgggga gctactggtg ggatccgccc agcgcgagct caccgaccct gcagttcctg
240tcaaacgagc agtaccggcg cctcgtgaca ctgcgccggt tgcagggggc gctaccggtg
300gtgtccctcg tgggaagcgg attgtgcgtg tgggcctggc gtcgacgccg cttctga
357
46 318 DNA Rhodococcus rhodnii misc_feature 3996bp to 3679bp pRET1000
46atggtggccg gtgtggtctt accgcgccgg gatcgtctcg ggccggcacc aggatttccc
60tggttctggg tggtgttccc atccacgtgc attgccatcg ctgccgcggt gggtgtcttc
120gcttggcccc aagcggttac cggcacgggg agctactggt gggatccgcc cagcgcgagc
180tcaccgaccc tgcagttcct gtcaaacgag cagtaccggc gcctcgtgac actgcgccgg
240ttgcaggggg cgctaccggt ggtgtccctc gtgggaagcg gattgtgcgt gtgggcctgg
300cgtcgacgcc gcttctga
318
47 450 DNA Rhodococcus rhodnii misc_feature 4381bp to 4830bp pRET1000
47atggccgctg acgctgcatc tgacgaccgg cggaccgagg tccgcgccgc tgcttcgcgg
60gccgctgacg cggccccggc gaagcgcacc cgcaccgtgg cggtgcggct gaccgatggg
120gaggaggccg cgtggatcga cgccgcgctg gccgatggcc accggcagct cggggcgtgg
180gtgcgtgagc gggcggtggc cggctatctc gggaaggtcc gcccgaagac cggcagtgga
240atgtcggcgg aggcggccgc ggaggtcgcc gcgatgcggc agcagatgac gaaggtgggg
300aacaacctga accagatcgc gagggcgatc aacgccgggc aggtgccgtc gcagatggcc
360gagtccctgc agaaggggtg gctggagagg tgggggcagg agttggggcg gatggcggat
420cggctcgacg cgctcgacga ccagggctga
450
48 210 DNA Rhodococcus rhodnii misc_feature 4621bp to 4830bp pRET1000
48atgtcggcgg aggcggccgc ggaggtcgcc gcgatgcggc agcagatgac gaaggtgggg
60aacaacctga accagatcgc gagggcgatc aacgccgggc aggtgccgtc gcagatggcc
120gagtccctgc agaaggggtg gctggagagg tgggggcagg agttggggcg gatggcggat
180cggctcgacg cgctcgacga ccagggctga
210
49 177 DNA Rhodococcus rhodnii misc_feature 4654bp to 4830bp pRET1000
49atgcggcagc agatgacgaa ggtggggaac aacctgaacc agatcgcgag ggcgatcaac
60gccgggcagg tgccgtcgca gatggccgag tccctgcaga aggggtggct ggagaggtgg
120gggcaggagt tggggcggat ggcggatcgg ctcgacgcgc tcgacgacca gggctga
177
50 165 DNA Rhodococcus rhodnii misc_feature 4666bp to 4830bp pRET1000
50atgacgaagg tggggaacaa cctgaaccag atcgcgaggg cgatcaacgc cgggcaggtg
60ccgtcgcaga tggccgagtc cctgcagaag gggtggctgg agaggtgggg gcaggagttg
120gggcggatgg cggatcggct cgacgcgctc gacgaccagg gctga
165
51 453 DNA Rhodococcus rhodnii misc_feature 5161bp to 4709bp pRET1000
51atgactctcg aagcccatcc gctcggcgac cgtctgcgcg atgtccgcga actcggtatc
60ggtcagccgc cgatccccgg gcgcgcaccg cagcgagcaa tgccacaccg gcttacccac
120ccgcgcgttc gtcgcggcgg cccgctcgaa gtcccgcccc caccgggtcg ggtttttggc
180ggtgacctgc accgatcccg cgatcaccgt cccgccggca atcagccggc ccgcctcggt
240gcggtagctg tgcggggtgg ccttccccgg cccgtgcaga tacgccgcca accccttcgg
300gtcgctgccc gtgctgatct tcgcgatcac gtcagccctg gtcgtcgagc gcgtcgagcc
360gatccgccat ccgccccaac tcctgccccc acctctccag ccaccccttc tgcagggact
420cggccatctg cgacggcacc tgcccggcgt tga
453
52 354 DNA Rhodococcus rhodnii misc_feature 5062bp to 4709bp pRET1000
52atgccacacc ggcttaccca cccgcgcgtt cgtcgcggcg gcccgctcga agtcccgccc
60ccaccgggtc gggtttttgg cggtgacctg caccgatccc gcgatcaccg tcccgccggc
120aatcagccgg cccgcctcgg tgcggtagct gtgcggggtg gccttccccg gcccgtgcag
180atacgccgcc aaccccttcg ggtcgctgcc cgtgctgatc ttcgcgatca cgtcagccct
240ggtcgtcgag cgcgtcgagc cgatccgcca tccgccccaa ctcctgcccc cacctctcca
300gccacccctt ctgcagggac tcggccatct gcgacggcac ctgcccggcg ttga
354
53 288 DNA Rhodococcus rhodnii misc_feature 2331bp to 2618bp pRET1000
53atgatgcgct ggatcgtgcg cgtggaggcc cccatcttct cggccagctc gcgagctgtc
60tgcttgcggc ggatcggtcg ttcagcgccc acggtctgcc tcccacaatg cgttccggtc
120gaccttcgtc gctcgtttcc ggtttgcctc gcgcttcttc tcactcatct tgcgaccgcg
180tgcggcttgt atggcgatga atgtggcctc gtagacagca gggccgtcgg cccacatccg
240ggactttgta gtgatccagc gggtaatgga ggccgcgacg gcgcgtag
288
54 285 DNA Rhodococcus rhodnii misc_feature 2334bp to 2618bp pRET1000
54atgcgctgga tcgtgcgcgt ggaggccccc atcttctcgg ccagctcgcg agctgtctgc
60ttgcggcgga tcggtcgttc agcgcccacg gtctgcctcc cacaatgcgt tccggtcgac
120cttcgtcgct cgtttccggt ttgcctcgcg cttcttctca ctcatcttgc gaccgcgtgc
180ggcttgtatg gcgatgaatg tggcctcgta gacagcaggg ccgtcggccc acatccggga
240ctttgtagtg atccagcggg taatggaggc cgcgacggcg cgtag
285
55 336 DNA Rhodococcus rhodnii misc_feature 2907bp to 3242bp pRET1000
55atgggtgtcc cagtcaccgt gagtcgggtt cttggtcatc aggcccgaat agcccttgtc
60cccctggacg gcgcgccgga ggccttcggt gacggcggcc gcataggcga gcggcttacg
120acgggcgtac tcggtgcggg tgaacggctc tgccagcgcc cacacagcgt gtgcgtgccc
180gttacggggg ttctccacga tcgcgttcgg cagaggatga ttcccggccg ccgacagcgc
240ccgcagcgcg gcgtccgggt ggtcaacgtc cacgacgagc aggttgctca atgcctgcgg
300gttcgactcg atgtagcggc gatccagtgc gtctga
336
56 513 DNA Rhodococcus rhodnii misc_feature 1650bp to 2162bp pRET1000
56atgcggattg aactagttca tttggggaac gatgacctga tgaccgggga tcgtgaccta
60cccatgctga ccatcgccga ggcggtggac gcgacgcaga ccagtgagag cacgatcaag
120cgccgcctgc ggtcgggcgc gttcccgaac gcggtccgca ctgccgacgg gaagtggatg
180attcccctcg gtgacctatc agcggcaggg ctgagaccag ggaaaatggc gaaacctgac
240ccggtgaccc cttcaaatga ccgggtccgt gacctggcag ctgagaacgc cgagctccgt
300cagcgcctgg ccgtggccga agccctggcc agcgaacgca atcggatcat cgacgtgcag
360caacagatgc tccggatgct cgaagcccgg ccggtgtcgg ccctggagcc cgcggcggtt
420ccagtggcgg gtccgccgcc gcccgtcccg gccgccgatg gtcgggcagc tacgggcgcc
480ctggcccgga tacgtcgacg gcttctcggc tag
513
57 474 DNA Rhodococcus rhodnii misc_feature 1689bp to 2162bp pRET1000
57atgaccgggg atcgtgacct acccatgctg accatcgccg aggcggtgga cgcgacgcag
60accagtgaga gcacgatcaa gcgccgcctg cggtcgggcg cgttcccgaa cgcggtccgc
120actgccgacg ggaagtggat gattcccctc ggtgacctat cagcggcagg gctgagacca
180gggaaaatgg cgaaacctga cccggtgacc ccttcaaatg accgggtccg tgacctggca
240gctgagaacg ccgagctccg tcagcgcctg gccgtggccg aagccctggc cagcgaacgc
300aatcggatca tcgacgtgca gcaacagatg ctccggatgc tcgaagcccg gccggtgtcg
360gccctggagc ccgcggcggt tccagtggcg ggtccgccgc cgcccgtccc ggccgccgat
420ggtcgggcag ctacgggcgc cctggcccgg atacgtcgac ggcttctcgg ctag
474
58 450 DNA Rhodococcus rhodnii misc_feature 1713bp to 2162bp pRET1000
58atgctgacca tcgccgaggc ggtggacgcg acgcagacca gtgagagcac gatcaagcgc
60cgcctgcggt cgggcgcgtt cccgaacgcg gtccgcactg ccgacgggaa gtggatgatt
120cccctcggtg acctatcagc ggcagggctg agaccaggga aaatggcgaa acctgacccg
180gtgacccctt caaatgaccg ggtccgtgac ctggcagctg agaacgccga gctccgtcag
240cgcctggccg tggccgaagc cctggccagc gaacgcaatc ggatcatcga cgtgcagcaa
300cagatgctcc ggatgctcga agcccggccg gtgtcggccc tggagcccgc ggcggttcca
360gtggcgggtc cgccgccgcc cgtcccggcc gccgatggtc gggcagctac gggcgccctg
420gcccggatac gtcgacggct tctcggctag
450
59 336 DNA Rhodococcus rhodnii misc_feature 1827bp to 2162bp pRET1000
59atgattcccc tcggtgacct atcagcggca gggctgagac cagggaaaat ggcgaaacct
60gacccggtga ccccttcaaa tgaccgggtc cgtgacctgg cagctgagaa cgccgagctc
120cgtcagcgcc tggccgtggc cgaagccctg gccagcgaac gcaatcggat catcgacgtg
180cagcaacaga tgctccggat gctcgaagcc cggccggtgt cggccctgga gcccgcggcg
240gttccagtgg cgggtccgcc gccgcccgtc ccggccgccg atggtcgggc agctacgggc
300gccctggccc ggatacgtcg acggcttctc ggctag
336
60 288 DNA Rhodococcus rhodnii misc_feature 1875bp to 2162bp pRET1000
60atggcgaaac ctgacccggt gaccccttca aatgaccggg tccgtgacct ggcagctgag
60aacgccgagc tccgtcagcg cctggccgtg gccgaagccc tggccagcga acgcaatcgg
120atcatcgacg tgcagcaaca gatgctccgg atgctcgaag cccggccggt gtcggccctg
180gagcccgcgg cggttccagt ggcgggtccg ccgccgcccg tcccggccgc cgatggtcgg
240gcagctacgg gcgccctggc ccggatacgt cgacggcttc tcggctag
288
61 264 DNA Rhodococcus rhodnii misc_feature 1906bp to 2169bp pRET1000
61atgaccgggt ccgtgacctg gcagctgaga acgccgagct ccgtcagcgc ctggccgtgg
60ccgaagccct ggccagcgaa cgcaatcgga tcatcgacgt gcagcaacag atgctccgga
120tgctcgaagc ccggccggtg tcggccctgg agcccgcggc ggttccagtg gcgggtccgc
180cgccgcccgt cccggccgcc gatggtcggg cagctacggg cgccctggcc cggatacgtc
240gacggcttct cggctaggag ctga
264
62 258 DNA Rhodococcus rhodnii misc_feature 810bp to 553bp pRET1000
62atgctatggg aggtatgcac ctttcgcgcg ttatgtacgc atcctgggca ccctgggcac
60gaccgacctt ctagcgatcg atggtgttct tggacatgct tcgccaggcc tgcgtctgtt
120ccctacgctc cacgaaagcc ttctcgctct ctgctcacag tcccattccg gattctcgac
180ctcggtcgcg gccgggtggc tgataccccg gggccgactg cggcatggtt ggtccctggc
240ggcgggccgg gggtttga
258
63 540 DNA Rhodococcus rhodnii misc_feature 117bp to 656bp pRET1000
63atgggaggcc acccgacacc gctacgggac atgctcgccg cccaggagca gcgccggaag
60ccgtggactc cggagcagaa acgccagtac gcgaccgcaa aagcccaagc agaacgcgcc
120gcgaaggcca aggacgccgc gaaatggacc gaggtcgccg gcggcggcta ccagcgggac
180gtgcgcggga tgaacctgcg actgtgggtg gctgaggacg gcgcctggtc gatcacctcg
240aagaaggacc ccgaccgcca gtacgccgca ggtcaggccg acaccgtcgc gcaggcccaa
300gccgcggcca cggccacagc gaaaacgcag gcccaggcga tgtggaagca ggtcccggcc
360gacaagcgca ccgagtcagc caccagagcg gtccggcgcg tgatcgcgga tctcaccccc
420accaaacccg ccgaggtcaa acccccggcc cgccgccagg gaccaaccat gccgcagtcg
480gccccggggt atcagccacc cggccgcgac cgaggtcgag aatccggaat gggactgtga
540
64 510 DNA Rhodococcus rhodnii misc_feature 147bp to 656bp pRET1000
64atgctcgccg cccaggagca gcgccggaag ccgtggactc cggagcagaa acgccagtac
60gcgaccgcaa aagcccaagc agaacgcgcc gcgaaggcca aggacgccgc gaaatggacc
120gaggtcgccg gcggcggcta ccagcgggac gtgcgcggga tgaacctgcg actgtgggtg
180gctgaggacg gcgcctggtc gatcacctcg aagaaggacc ccgaccgcca gtacgccgca
240ggtcaggccg acaccgtcgc gcaggcccaa gccgcggcca cggccacagc gaaaacgcag
300gcccaggcga tgtggaagca ggtcccggcc gacaagcgca ccgagtcagc caccagagcg
360gtccggcgcg tgatcgcgga tctcaccccc accaaacccg ccgaggtcaa acccccggcc
420cgccgccagg gaccaaccat gccgcagtcg gccccggggt atcagccacc cggccgcgac
480cgaggtcgag aatccggaat gggactgtga
510
65 351 DNA Rhodococcus rhodnii misc_feature 306bp to 656bp pRET1000
65atgaacctgc gactgtgggt ggctgaggac ggcgcctggt cgatcacctc gaagaaggac
60cccgaccgcc agtacgccgc aggtcaggcc gacaccgtcg cgcaggccca agccgcggcc
120acggccacag cgaaaacgca ggcccaggcg atgtggaagc aggtcccggc cgacaagcgc
180accgagtcag ccaccagagc ggtccggcgc gtgatcgcgg atctcacccc caccaaaccc
240gccgaggtca aacccccggc ccgccgccag ggaccaacca tgccgcagtc ggccccgggg
300tatcagccac ccggccgcga ccgaggtcga gaatccggaa tgggactgtg a
351
66 201 DNA Rhodococcus rhodnii misc_feature 456bp to 656bp pRET1000
66atgtggaagc aggtcccggc cgacaagcgc accgagtcag ccaccagagc ggtccggcgc
60gtgatcgcgg atctcacccc caccaaaccc gccgaggtca aacccccggc ccgccgccag
120ggaccaacca tgccgcagtc ggccccgggg tatcagccac ccggccgcga ccgaggtcga
180gaatccggaa tgggactgtg a
201
67 1326 DNA Rhodococcus rhodnii misc_feature 5144bp to 656bp pRET1000
67atgggcttcg agagtcatcc gtgggtggcg gtgcggcacg acgacgacca catccacctg
60gctgtctccc gggtcgattt tcagggcgtg acctggaaga acagcaacga ccggtggaag
120gtcgtcgagg tgatgcgcga ggtcgaacgc gcgcacggcc tgatcgaggt ggcgagcccg
180gagcgggccc gtggccggca agccagcagc ggcgagcaac gccgcgcggt gcggaccggc
240aaggtggcgc agcgggacgg tctgagggaa attgtgaccg ccgcccgcga catcgccgca
300ggccagggtg tgggggcgtt cgaagtggcg ctcgtacaga acccgattac ccgagtgcag
360gtgcggcgca acgtcgcgaa gacgggccgg atgaatggct acagcttcaa cctgcccggc
420tacgtcgacg ccgccgggga gccgatctgg ttgccggcct ccaaactcga ccggggtttg
480tcctggtcac agctggaaaa gacgctgacc agaccccgcc cggaccgcct cgccggcgag
540gagacggtgc cgcggaagcg gctcgagcgc gccgccgcgt gggagcagcg ccgccgcgag
600gtcggcggcg agcagttcgc agctgcccgc tgggagcagg cccgcgcgaa tgttggtgag
660acggccgggc ggatccgcgc cgaacagtcc gcggacacga agtggaagca ggtgaacgag
720gcgttgacca gccaagaccg ggccgaggag caggctgccg aggcagcgcg ggtcgcctcc
780gctgtcatgg gaggccaccc gacaccgcta cgggacatgc tcgccgccca ggagcagcgc
840cggaagccgt ggactccgga gcagaaacgc cagtacgcga ccgcaaaagc ccaagcagaa
900cgcgccgcga aggccaagga cgccgcgaaa tggaccgagg tcgccggcgg cggctaccag
960cgggacgtgc gcgggatgaa cctgcgactg tgggtggctg aggacggcgc ctggtcgatc
1020acctcgaaga aggaccccga ccgccagtac gccgcaggtc aggccgacac cgtcgcgcag
1080gcccaagccg cggccacggc cacagcgaaa acgcaggccc aggcgatgtg gaagcaggtc
1140ccggccgaca agcgcaccga gtcagccacc agagcggtcc ggcgcgtgat cgcggatctc
1200acccccacca aacccgccga ggtcaaaccc ccggcccgcc gccagggacc aaccatgccg
1260cagtcggccc cggggtatca gccacccggc cgcgaccgag gtcgagaatc cggaatggga
1320ctgtga
1326
68 1194 DNA Rhodococcus rhodnii misc_feature 5276bp to 656bp pRET1000
68atgcgcgagg tcgaacgcgc gcacggcctg atcgaggtgg cgagcccgga gcgggcccgt
60ggccggcaag ccagcagcgg cgagcaacgc cgcgcggtgc ggaccggcaa ggtggcgcag
120cgggacggtc tgagggaaat tgtgaccgcc gcccgcgaca tcgccgcagg ccagggtgtg
180ggggcgttcg aagtggcgct cgtacagaac ccgattaccc gagtgcaggt gcggcgcaac
240gtcgcgaaga cgggccggat gaatggctac agcttcaacc tgcccggcta cgtcgacgcc
300gccggggagc cgatctggtt gccggcctcc aaactcgacc ggggtttgtc ctggtcacag
360ctggaaaaga cgctgaccag accccgcccg gaccgcctcg ccggcgagga gacggtgccg
420cggaagcggc tcgagcgcgc cgccgcgtgg gagcagcgcc gccgcgaggt cggcggcgag
480cagttcgcag ctgcccgctg ggagcaggcc cgcgcgaatg ttggtgagac ggccgggcgg
540atccgcgccg aacagtccgc ggacacgaag tggaagcagg tgaacgaggc gttgaccagc
600caagaccggg ccgaggagca ggctgccgag gcagcgcggg tcgcctccgc tgtcatggga
660ggccacccga caccgctacg ggacatgctc gccgcccagg agcagcgccg gaagccgtgg
720actccggagc agaaacgcca gtacgcgacc gcaaaagccc aagcagaacg cgccgcgaag
780gccaaggacg ccgcgaaatg gaccgaggtc gccggcggcg gctaccagcg ggacgtgcgc
840gggatgaacc tgcgactgtg ggtggctgag gacggcgcct ggtcgatcac ctcgaagaag
900gaccccgacc gccagtacgc cgcaggtcag gccgacaccg tcgcgcaggc ccaagccgcg
960gccacggcca cagcgaaaac gcaggcccag gcgatgtgga agcaggtccc ggccgacaag
1020cgcaccgagt cagccaccag agcggtccgg cgcgtgatcg cggatctcac ccccaccaaa
1080cccgccgagg tcaaaccccc ggcccgccgc cagggaccaa ccatgccgca gtcggccccg
1140gggtatcagc cacccggccg cgaccgaggt cgagaatccg gaatgggact gtga
1194
69 936 DNA Rhodococcus rhodnii misc_feature 5534bp to 656bp pRET1000
69atgaatggct acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg
60ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc
120agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc
180gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc
240tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc
300gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag
360caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta
420cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc
480cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa
540tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg
600tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac
660gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa
720acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc
780agagcggtcc ggcgcgtgat cgcggatctc acccccacca aacccgccga ggtcaaaccc
840ccggcccgcc gccagggacc aaccatgccg cagtcggccc cggggtatca gccacccggc
900cgcgaccgag gtcgagaatc cggaatggga ctgtga
936
70 153 DNA Rhodococcus rhodnii misc_feature 3355bp to 3507bp pRET1000
70aacacactca cacacaagat cgatttattc cggtacgaca cgccagccaa gtcagatgtt
60tcggtttctg gagcggtcct ccagaccttt gagatccgct ccagaaacgt ccacaaatta
120ttggggtacg tcgaaccaag ccttatcagg tat
153
71 61 DNA Rhodococcus rhodnii misc_feature 4290bp to 4350bp pRET1000
71gagctatgcc cagggttgcg cagtgacttc gtcactgcgt aaccctgggc gctcgcctcc
60c
61
72 325 DNA Rhodococcus rhodnii misc_feature 3570bp to 3894bp pRET1000
72ccgctcgaag tccttgagtc agtgacagga ccactgctgg gctcccagcg cagaaggcaa
60gtgaaggcag acgactgcgg gaggtaagtc gggtacggca tgaggtcctt cagaagcggc
120gtcgacgcca ggcccacacg cacaatccgc ttcccacgag ggacaccacc ggtagcgccc
180cctgcaaccg gcgcagtgtc acgaggcgcc ggtactgctc gtttgacagg aactgcaggg
240tcggtgagct cgcgctgggc ggatcccacc agtagctccc cgtgccggta accgcttggg
300gccaagcgaa gacacccacc gcggc
325
73 5444 DNA Rhodococcus erythropolis misc_feature pRET1100 Full Length
73cccgggatcc tcgagggggc agcggacgaa gacgccgcgt taggccgaaa acctgccgag
60cgtctcgcac cccgcatttc ggcctacgtg cgtcagatag cagcgcagtc ggcaaaggct
120ggaattcgcc ttctcctgct ctcgcaacga gcggaggcct cgatcattgg cggcaatgcg
180cgttcgaatt tcggggtcaa gatgactctg agggtggacg aaccggagtc ggtgagaatg
240cttcatccga gcgcttcccc ggaagactgt gccctggtcg agaccttcaa gcctggtacc
300tgccttttcg agaagccagg agaaggccgg cagattatgc gatgcgactt tgtcggcgag
360tacgggagat atgcgcgagc catcgagtct tcggatctgc gttttctcgc caccctccag
420caagaccagg cccaacgcga attcttcgct gaggagttcg gtgtggtgga tccgtcatga
480ctggaccaca ggagagaaag cgcaaggcgg cgaagccgtc gcgggagcct cagttgaact
540gctgtgaagc ggacgtgccg aaacgagcaa aacagccccc ggttccctct acgttcgacc
600tgctcacggt gaaggagact gcggggctgc tgagagtcag tcaggcaact ctttaccggc
660tgcttcggag tggggaagga cccacataca cacggatcgg tggacagata cgcgttcacc
720gcgagtcgct gcgtcggttc atcgaaccgc gtggataacg tcacagagac agcgaaaacg
780cctcccctgg gtcaatccgg ttaccgccgg actgggggag gcgcttcgac acctacatcc
840gtcgcccctc gaaaggctca gatgcacttc cacgataacg cagaggtcgg acaagaggga
900agaactgccg ttctctcgcc gttgcgcggc gtagccgcca agcgggacgt gtctgacgat
960gcagcgaagc ggagtcggca ggcgcggcac gcgcctgggc ttgttacatc tgccacaact
1020gtccgtgaat ctctgccagc tcctgaaacc gctggtcagg gccttgcgga atccgtgacc
1080gctgatgatt tttggtctca ttcgttcccc cgcgctgacg atgtacgcgg cgcagctgct
1140tccttccagt cggtggctaa ctgggatggg cgtgagggtc cgaggccgcg tttcgttgtc
1200gcgcctggcg ttgtccgctt ggaggtttgt gatctcgcac gccgcgaacg aacggctgaa
1260cgtgcgtatc tggctgctcg ggctcgggtg gatatggcgg ctgccaggca taactcgccg
1320tacgacttcg acgtggacga tgaagagttg gcggaactgg cttctctgca aggcctcgag
1380gacgacgaca ttgggggctg gtctgcggag agggaaatag tgggctggtc tgctcgttct
1440cggtcacgga tgatcttgcg aatggcagaa ctcgactggg ctcccatgat ggatttgccg
1500ggcattcctg cgatggtgac cctcacctat ccgggggact ggcttacggt tgcccccacc
1560ggcgctgagg tcaaaaaaca tctccagacg ttcttcaaac ggttccaacg ggcctggggc
1620attgcctgga tgggtgcgtg gaaaatggag ttccaaagcc gaggcgctcc gcattttcac
1680ctgtacatgg tccctcctca tgggaaggca ggagactcgc ggaagctgcg gcatgatgct
1740gagctcttga aatgggagat agcacgtgca gagggtgaag acccaggtcg caggccgtat
1800ttccgggaag ctccaagcga tggattgaag tttcgtccgt ggctttctgc ggtgtgggcc
1860gacgtcgtag atcatccgga ccccaaggaa aaagaaaagc acgtcagtgc cggcactgga
1920gtggactacg cggagggcac gcgagggtca gatccgaaaa ggcttgcggt gtacttctcc
1980aagcatggaa cctttgccga caaggaatat cagcacgtag ttcctgctca atggcagaaa
2040acgggtgcgg gacctggcag gttctggggc taccgcggtt tgtcgccggc cacggctgcc
2100accgagattt cctgggatga gtacctgctt ttatctcgca cgttgcgacg attgtcagcg
2160cgaacgaaga tctgggaccc ggctttacga ggcggtagcg gcggccacag atggactaag
2220gcgatgatgc gacgcacggt tacccggcac cgcttggacc tcgtgaccgg tgagattctg
2280ggcacgaaga cgcggaaggt tcgggcgcca gtgaagaggt ttgtccggac ttcgggatac
2340ctgtgtgtca atgacgggcc cgcactggct cgaaccctca gccgtcttcg tacaagctgc
2400ctgagctaga cgcgcggaac gcctttcggc tttgtctttt gctggatggc gggttttggg
2460cggcttctgg tgatgcgctg ctgcgctccg tggggagaga gacccaacga ctgacctatc
2520tctacccagg tgcaattcat ctcccgcgct ctgtcggcta ggtaaacgag gtgctcccgc
2580gcgagctttt ccatgtggtc ggccaatgtc agctcggtca ggacaacctg ctgttgttgc
2640gatagttgtg tccgcacggg tcgattgtct tctgttgcgg cataacggtt ttcgtcgttc
2700gcggagagtg cggctaaatg aattgcatcc tcgattgagc ggagcatttc gacgcggaac
2760ctggcgatga tgttgtctct gtcttcattc ataactgaag cgtattggga gtgttgccct
2820cccaccatgt gtgccaatgc aggtgtgaac tgagtcacag tttctcaata gactccaagt
2880ttgtgatcct tttactccca aaatggggca tgatgtgtgc gtgcctcggt tcaggggcga
2940aagttcgaca cctcgaaaga aggcctcgac atggctttga aagctgctgg caacgtgatt
3000cctgattcct ccgcgtacga gtaccgggcg gttcaggtcg agccgaagat ggtcagaaaa
3060gacccggaag acccgaactc tgagcagttc cagaagcaga aggacggcac gccggtgtgg
3120tcgatcgact gcattcgggt cgaccgggca tcaggcaaca aggcaatcgt gaccgtgacg
3180gttccggacg tgatggaacc ggatgttgcg gggccggtgg agttctccga gatgattgcc
3240ggtttctggg tttcgcgcag tggttcgggc atgtggtttt cggcaagcgc cgtcgcttct
3300ctctgatcgc tgatcgtcgc ccctcgaaag gttcggaaat gtccaaagga aaaggcgttg
3360cgctgggtgt gggtgccctc gtgctcgtgt ttgtgctggt tgcggcaggt tggcaagcgg
3420cgaacgtgtt cagtgatcgt tcacagtccg aagctgtgcc gctgagagtg ccggccgatc
3480cgaagtggga aaacggggtg ttctcggacg ttgccgggtg cctcgttctc tctccggaag
3540agctggggcc gttcagcgga gggcagtaca tcgacatagt gaggccagtt gagccggaga
3600ggttggagcg cgactgggtg aggtcggctg agtgcgtttc ggcgtcgatg aatgtctctg
3660acctgttggt ttctgctctt ccagagtcca cccgtccccc cggcgatttc gttcgttcgt
3720ggaaagtggc gagtgatgat tactgctatg agggtgataa cccgcaaggc tgcacttctc
3780gtatgccggt ttgggtctct gcaaaaaact ggtggtgcac agaacccgta ctcgatccgc
3840tcgttcgtcg ctgtgaggtc tttcctgcaa ggcaaatcgt tgtgccggaa ggggtttcgt
3900gatgtttctc cgagcgtttt ttcgttccaa gttggtcatg gtggctcttg tcctggtcgc
3960tggcctgttt ctctacaacg cctgctcttc ttctgacgca aaggaagaga tcggcagcag
4020tctgaatctc tctcctgtca ctgctcgttc gaatccgtat gagggcgtcc agcccacgat
4080gagcgaaaaa agccctgttc ccgtccctgt cgtttccggc gacaggattt cgggggtggc
4140atcgtgcggg acggattacg ccgggaagcc tgcggtgacg ctggaagctg tgtggatttc
4200gtccgactcg gtgaactaca cactcgataa gaggcattgc ctggtgacga ccggcccgct
4260gtggaaacaa gcgatccgta aagcgtcagg gtcagagatt cggcctgagg gcgggagctg
4320gatacgggtg gtgcttgcca tgcctgacgg caatttcagg gcaggatggg caccccacgc
4380ccaagtaacc gctggtgcgc tggatatttc ggcggtggtc tcgtgagcgg ggagaagcgg
4440cacagcgagg ccggcccggt agaaatcatc tttttgatgc tggcagtcag ggcgggggac
4500tacatcgtcg ccgtgactgc ggttctcgcg gtcgggttct tcgcggtcgc ggttgagggt
4560ttctggttcc tggtcgtcgc agtcatcgct gcaccggcgt ggtggtttct gcgcgactgg
4620gaatcgaagc ggagggccgt acgggtcttt gaacgggcat ggaaggggac acctgaatcc
4680cccggtattg ctctctccct tggcctgtcg aacgtggcgg ggtctctgcc gaggttgagg
4740aagtttgaaa ctggttcggg gatacgcaca ctcgtgtttt ctttgccgcc cggagtcact
4800gccgagagct ttgagaaagt tcgccctgcg ctggcagacg cgatgggggg tcaccgctgc
4860caagtagaga aggtggcccc cggacaggtc cgcgtcagag tgattgatga ggattcgatg
4920aagacgccgc gtgatgcggg atgggcgaaa gatgttgtgc tggaagagga tacgttcgac
4980ggtcttccgg gcgagacgcg atcctggttc gagcaagagg ggccggcatc atgagaaaat
5040cggcgggagt atctcggatt cctatccgtc tcgggcgctc tcagtacggg gaagacgttg
5100gattcgatct cgctgcggac gccgctcaca tcgccatgca gggcaaaacc cgatccggca
5160aaagtcaggc gacgtacaac gtgttagctc aggcagcagc gaacgcggcg gttcgagtcg
5220tagggtccga cccgacacac gtactcctgg agcccttcaa acatcgaggg gtgtccgagc
5280cttacgtggt ttcgggactg aatgcgcagg ccacggtgga catgctgggc tgggtcaagc
5340gtgagtctga tcgtcgcatc gaccagatgt ggcccctgcg taccgacaag ttttccgagt
5400tcggggcttc gttcccgctg atactcgtcg tgctcgaaga gttt
5444
74 5813 DNA Rhodococcus rhodnii misc_feature pRET1000 Full Length
74 ggatccgcgc cgaacagtcc gcggacacga agtggaagca ggtgaacgag gcgttgacca
60gccaagaccg ggccgaggag caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg
120gaggccaccc gacaccgcta cgggacatgc tcgccgccca ggagcagcgc cggaagccgt
180ggactccgga gcagaaacgc cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga
240aggccaagga cgccgcgaaa tggaccgagg tcgccggcgg cggctaccag cgggacgtgc
300gcgggatgaa cctgcgactg tgggtggctg aggacggcgc ctggtcgatc acctcgaaga
360aggaccccga ccgccagtac gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg
420cggccacggc cacagcgaaa acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca
480agcgcaccga gtcagccacc agagcggtcc ggcgcgtgat cgcggatctc acccccacca
540aacccgccga ggtcaaaccc ccggcccgcc gccagggacc aaccatgccg cagtcggccc
600cggggtatca gccacccggc cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag
660agagcgagaa ggctttcgtg gagcgtaggg aacagacgca ggcctggcga agcatgtcca
720agaacaccat cgatcgctag aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa
780cgcgcgaaag gtgcatacct cccatagcat cggcgcgtat ggtagggaaa atgatcttca
840aacgtattgc tgtggtcgtg ctcgctggtg gggctttggt agtgggaggc agccaggttg
900ctggtgctac cacggtttca gctccacagc cgagtccttc agcagcggtg gtgccgacgg
960ttcttccacc agtcactttc accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg
1020attcccggcg atgccgtctg attccacttc cacagggccg agcgatctgc tgggcggcag
1080ccgctgcccg ttacgcagcg tgccgcgccg gaaactaggt agaacgtgag catggacgag
1140cttcccacct tcatcgccga cgacatcgtg atggccagaa cgttcgacag ccctaacggc
1200caggtggtgc tcgaggtgaa cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac
1260tactgctgca ccttccggat cagcgggaac atggatgccc cttacgacgg attcggtggc
1320ggcgtcgacg cagtgcaggc gctgctactc gcattggcca tggcacacga ggaacttcgt
1380caaacttcgc cagagttgac gtttctaggc gagacgaacc tcggtctacc ggtcttgaac
1440atcaagcccg acaacgcgat cgaagccgtg gtctcattcc ccgctccctg atgtgacgca
1500ctttcacccc tggcactcat gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg
1560cttcgcgttg acttgccact gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg
1620gttcatgaca ccgctaacac gctgcggaaa tgcggattga actagttcat ttggggaacg
1680atgacctgat gaccggggat cgtgacctac ccatgctgac catcgccgag gcggtggacg
1740cgacgcagac cagtgagagc acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg
1800cggtccgcac tgccgacggg aagtggatga ttcccctcgg tgacctatca gcggcagggc
1860tgagaccagg gaaaatggcg aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg
1920acctggcagc tgagaacgcc gagctccgtc agcgcctggc cgtggccgaa gccctggcca
1980gcgaacgcaa tcggatcatc gacgtgcagc aacagatgct ccggatgctc gaagcccggc
2040cggtgtcggc cctggagccc gcggcggttc cagtggcggg tccgccgccg cccgtcccgg
2100ccgccgatgg tcgggcagct acgggcgccc tggcccggat acgtcgacgg cttctcggct
2160aggagctgac cgcgtacttg cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga
2220ttcccatctc ctcggcgatc tcgcggtact tcaggccctg ctcgcgcagc tcgacggccc
2280ggcgacggtt ctcggctgcc cgtgcgagga actggtcccg cggctcggcc atgatgcgct
2340ggatcgtgcg cgtggaggcc cccatcttct cggccagctc gcgagctgtc tgcttgcggc
2400ggatcggtcg ttcagcgccc acggtctgcc tcccacaatg cgttccggtc gaccttcgtc
2460gctcgtttcc ggtttgcctc gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt
2520atggcgatga atgtggcctc gtagacagca gggccgtcgg cccacatccg ggactttgta
2580gtgatccagc gggtaatgga ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg
2640ggcctgcctg tgaccgggtt cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc
2700gcggccccga ggccggtcgg gtcgccccag tggaagcgga tttcgcggta ggcccaggtg
2760cgtgcggttt cgaacagggc gcagtttcgg ccgaggccga tcgggttctc acggcgcgat
2820cgggtttgcc gccagcgcgt tggcggcatg tggatgccga gttccgcctc gagctcggcg
2880agggatcgcc gctcggtgtg cagccaatgg gtgtcccagt caccgtgagt cgggttcttg
2940gtcatcaggc ccgaatagcc cttgtccccc tggacggcgc gccggaggcc ttcggtgacg
3000gcggccgcat aggcgagcgg cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc
3060agcgcccaca cagcgtgtgc gtgcccgtta cgggggttct ccacgatcgc gttcggcaga
3120ggatgattcc cggccgccga cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg
3180acgagcaggt tgctcaatgc ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct
3240gatcgccgca tccggtagac gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt
3300agccacagct gttcccaggc gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca
3360ctcacacaca agatcgattt attccggtac gacacgccag ccaagtcaga tgtttcggtt
3420tctggagcgg tcctccagac ctttgagatc cgctccagaa acgtccacaa attattgggg
3480tacgtcgaac caagccttat caggtatccc ggggttccgg gggtgaacac caccctccga
3540ccggtccaga atccgtcgat ctcacctatc cgctcgaagt ccttgagtca gtgacaggac
3600cactgctggg ctcccagcgc agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg
3660ggtacggcat gaggtccttc agaagcggcg tcgacgccag gcccacacgc acaatccgct
3720tcccacgagg gacaccaccg gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg
3780gtactgctcg tttgacagga actgcagggt cggtgagctc gcgctgggcg gatcccacca
3840gtagctcccc gtgccggtaa ccgcttgggg ccaagcgaag acacccaccg cggcagcgat
3900ggcaatgcac gtggatggga acaccaccca gaaccaggga aatcctggtg ccggcccgag
3960acgatcccgg cgcggtaaga ccacaccggc caccatcgcc acggcccccg acgcaacaag
4020caataaccac cccatgagcg gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc
4080gccagcccgt gaccggaccg gcgaagcgtg ccgctgggcg gcccgccgtg gcgcccgtcc
4140cgtgcccgtt ctgaccggtg gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca
4200gcccgtgacc gtgccgtcca ccacccggtg cctggtctgc gtctccctcg gctcgttcct
4260cgcctatcct ggtgaccaga caccggagcg agctatgccc agggttgcgc agtgacttcg
4320tcactgcgta accctgggcg ctcgcctccc attcgcttcg ctcacaggag ggggccgtcg
4380atggccgctg acgctgcatc tgacgaccgg cggaccgagg tccgcgccgc tgcttcgcgg
4440gccgctgacg cggccccggc gaagcgcacc cgcaccgtgg cggtgcggct gaccgatggg
4500gaggaggccg cgtggatcga cgccgcgctg gccgatggcc accggcagct cggggcgtgg
4560gtgcgtgagc gggcggtggc cggctatctc gggaaggtcc gcccgaagac cggcagtgga
4620atgtcggcgg aggcggccgc ggaggtcgcc gcgatgcggc agcagatgac gaaggtgggg
4680aacaacctga accagatcgc gagggcgatc aacgccgggc aggtgccgtc gcagatggcc
4740gagtccctgc agaaggggtg gctggagagg tgggggcagg agttggggcg gatggcggat
4800cggctcgacg cgctcgacga ccagggctga cgtgatcgcg aagatcagca cgggcagcga
4860cccgaagggg ttggcggcgt atctgcacgg gccggggaag gccaccccgc acagctaccg
4920caccgaggcg ggccggctga ttgccggcgg gacggtgatc gcgggatcgg tgcaggtcac
4980cgccaaaaac ccgacccggt gggggcggga cttcgagcgg gccgccgcga cgaacgcgcg
5040ggtgggtaag ccggtgtggc attgctcgct gcggtgcgcg cccggggatc ggcggctgac
5100cgataccgag ttcgcggaca tcgcgcagac ggtcgccgag cggatgggct tcgagagtca
5160tccgtgggtg gcggtgcggc acgacgacga ccacatccac ctggctgtct cccgggtcga
5220ttttcagggc gtgacctgga agaacagcaa cgaccggtgg aaggtcgtcg aggtgatgcg
5280cgaggtcgaa cgcgcgcacg gcctgatcga ggtggcgagc ccggagcggg cccgtggccg
5340gcaagccagc agcggcgagc aacgccgcgc ggtgcggacc ggcaaggtgg cgcagcggga
5400cggtctgagg gaaattgtga ccgccgcccg cgacatcgcc gcaggccagg gtgtgggggc
5460gttcgaagtg gcgctcgtac agaacccgat tacccgagtg caggtgcggc gcaacgtcgc
5520gaagacgggc cggatgaatg gctacagctt caacctgccc ggctacgtcg acgccgccgg
5580ggagccgatc tggttgccgg cctccaaact cgaccggggt ttgtcctggt cacagctgga
5640aaagacgctg accagacccc gcccggaccg cctcgccggc gaggagacgg tgccgcggaa
5700gcggctcgag cgcgccgccg cgtgggagca gcgccgccgc gaggtcggcg gcgagcagtt
5760cgcagctgcc cgctgggagc aggcccgcgc gaatgttggt gagacggccg ggc
5813
75 80 DNA Rhodococcus rhodnii misc_feature 4260bp to 4339bp pRET1000
75 tcgcctatcc tggtgaccag acaccggagc gagctatgcc cagggttgcg cagtgacttc 60
gtcactgcgt aaccctgggc 80
76 108 DNA Rhodococcus erythropolis misc_feature 761bp to 868bp pRET1100
76 tcacagagac agcgaaaacg cctcccctgg gtcaatccgg ttaccgccgg actgggggag 60
gcgcttcgac acctacatcc gtcgcccctc gaaaggctca gatgcact 108
77 556 DNA Rhodococcus erythropolis
77 gaagcaacac cgcatccgcc cattgccgat
cgctcagcac gccccccgtt gcggatttca 60tggggcaact gtgcccgccc acatcaacta
ttcgagtccg acgcgccgag gctatatgga 120aaattattcg actacgcaaa acaaagccat
atcaggtatc ccggcgacac cccccaaaac 180ctcctcccca ccaacccctg ctttttgaac
cttgccgcgc tggatcgttc gatttcttct 240ggaaccctgc gagcggaaag ccacggtcgg
caccttggtg caagaggtgt gctcgggttg 300ggctttgcgt cggtggatgg tgagcacagg
cgggtgagta cggcggtact cccgggagct 360gcttcgagct gcgggaggta ggtcgggtac
ggcgcgcaga gcggaagcgt ggtcggtggt 420tgttcactct tctgctcggc cgaatcgagc
gccggccgaa tcgagcgccg gccgaatcga 480gcgccggccg aatcgagcgc cggccgaatc
gagcgccggc cgaatcgtta gtgcggtgtg 540cgtgcgtggt ggtcga
556
78 259 PRT Rhodococcus erythropolis
78
Met Phe Asn Ser Ile Glu Gly Arg Ser Val Val Val Thr Gly Gly Ser
1               5                   10                  15
Lys Gly Ile Gly Leu Gly Met Val Arg Val Phe Ala Arg Ala Gly Ala
            20                  25                  30
Asn Val Leu Met Thr Ala Arg Asp Ala Leu Thr Leu Glu Arg Ala Ala
        35                  40                  45
Glu Gly Leu Asn Gly Leu Pro Gly Ala Val Ser Thr Leu Gln Val Asp
    50                  55                  60
Val Thr Asn Pro Asp Ser Leu Ala Gly Met Ala Glu Val Ala Ala Glu
65                  70                  75                  80
Arg His Gly Gly Ile Asp Val Leu Cys Ala Asn Ala Gly Ile Phe Pro
                85                  90                  95
Ser Lys Arg Leu Gly Glu Met Thr Ser Glu Asp Met Asp Ser Val Phe
            100                 105                 110
Gly Val Asn Val Lys Gly Thr Ile His Ala Val Gln Ala Cys Met Pro
        115                 120                 125
Trp Leu Glu Thr Ser Gly Arg Gly Arg Val Val Val Thr Ser Ser Ile
    130                 135                 140
Thr Gly Pro Val Thr Gly Tyr Pro Gly Trp Ser His Tyr Gly Ala Ser
145                 150                 155                 160
Lys Ala Ala Gln Met Gly Phe Ile Arg Thr Ala Ala Ile Glu Leu Ala
                165                 170                 175
Pro Lys Arg Ile Thr Ile Asn Ala Val Leu Pro Gly Asn Val Ile Thr
            180                 185                 190
Glu Gly Leu Asp Gly Leu Gly Gln Glu Tyr Leu Asp Gln Met Ala Ser
        195                 200                 205
Ser Val Pro Ala Gly Ser Leu Gly Ser Val Glu Asp Ile Ala Asn Ala
    210                 215                 220
Ala Leu Phe Phe Ala Leu Asp Glu Ala Ala Tyr Ile Thr Gly Gln Ser
225                 230                 235                 240
Leu Ile Val Asp Gly Gly Gln Val Leu Pro Glu Ser Ala Met Ala Leu
                245                 250                 255
Gly Glu Leu
79 780 DNA Rhodococcus erythropolis
79 atgttcaact ccattgaagg tcgttcggtc
gtcgtcaccg gcggtagcaa gggcatcggc 60ttgggaatgg tccgggtatt cgcgcgcgca
ggggccaatg tgctcatgac cgcgcgagac 120gctctgactc tcgaacgtgc cgcggagggt
ttgaatggtc ttcctggcgc ggtctccaca 180cttcaagtcg acgtcacgaa tcctgactcc
ttggccggta tggcagaagt tgcggccgag 240cgacacggag gaatcgacgt gttgtgcgcg
aacgctggga tcttcccgtc gaagcggttg 300ggagagatga cctcggagga catggacagc
gtattcggcg tcaacgtcaa ggggaccatc 360cacgccgtgc aagcgtgcat gccgtggctc
gaaacttctg ggcgtggaag ggttgtcgtg 420acatcgtcga tcaccggacc cgtaaccggt
tatccgggtt ggtcgcacta cggggcaagc 480aaggctgcgc agatgggctt catccgaact
gctgccattg agttggcacc gaagaggatc 540acgatcaacg ccgtcttgcc cggcaacgtg
atcaccgagg ggctcgacgg tttgggacag 600gaatatctcg accaaatggc gtccagcgtc
ccggccggca gtctgggcag cgtcgaggat 660atcgccaatg ccgcactgtt ctttgcactg
gacgaagccg cgtacatcac cggtcagtcg 720ttgatcgtag atggtggaca ggttcttccg
gagtcggcga tggcgctcgg cgaactgtaa 780
80 28 DNA Artificial primer (MAK F1)
80 gaatcttctc gttgatgcag atcaggtc 28
81 26 DNA Artificial primer (MAK R2)
81 ctgactccgt agtgttctgc cagttc 26
82 34 DNA Artificial primer (MAK Pst F)
82 gaccactgca gatcaatcaa ctctgatgag gtcc 34
83 34 DNA Artificial primer (MAK His Bgl II R)
83 cgcttagatc tcagttcgcc gagcgccatc gccg 34
84 35 DNA Artificial primer (P1200rep-Pst5195)
84 agccgctgca gaagcaacac cgcatccgcc cattg 35
85 24 DNA Artificial primer (P7)
85 cgccagggtt ttcccagtca cgac 24
86 30 DNA Artificial primer (pQE70 F1)
86 ggcgtatcac gaggcccttt cgtcttcacc 30
87 35 DNA Artificial primer (pQE70 R1135Bm)
87 ggttggatcc gtcatcaccg aaacgcgcga ggcag 35
88 34 DNA Artificial primer (P1204rep-Ec2958)
88 cgcggaattc gaccaccacg cacgcacacc gcac 34
89 8134 DNA Artificial pRET1101
89 gggtaccgag ctcgaattcg taatcatggt
catagctgtt tcctgtgtga aattgttatc 60cgctcacaat tccacacaac atacgagccg
gaagcataaa gtgtaaagcc tggggtgcct 120aatgagtgag ctaactcaca ttaattgcgt
tgcgctcact gcccgctttc cagtcgggaa 180acctgtcgtg ccagctgcat taatgaatcg
gccaacgcgc ggggagaggc ggtttgcgta 240ttgggcgctc ttccgcttcc tcgctcactg
actcgctgcg ctcggtcgtt cggctgcggc 300gagcggtatc agctcactca aaggcggtaa
tacggttatc cacagaatca ggggataacg 360caggaaagaa catgtgagca aaaggccagc
aaaaggccag gaaccgtaaa aaggccgcgt 420tgctggcgtt tttccatagg ctccgccccc
ctgacgagca tcacaaaaat cgacgctcaa 480gtcagaggtg gcgaaacccg acaggactat
aaagatacca ggcgtttccc cctggaagct 540ccctcgtgcg ctctcctgtt ccgaccctgc
cgcttaccgg atacctgtcc gcctttctcc 600cttcgggaag cgtggcgctt tctcatagct
cacgctgtag gtatctcagt tcggtgtagg 660tcgttcgctc caagctgggc tgtgtgcacg
aaccccccgt tcagcccgac cgctgcgcct 720tatccggtaa ctatcgtctt gagtccaacc
cggtaagaca cgacttatcg ccactggcag 780cagccactgg taacaggatt agcagagcga
ggtatgtagg cggtgctaca gagttcttga 840agtggtggcc taactacggc tacactagaa
ggacagtatt tggtatctgc gctctgctga 900agccagttac cttcggaaaa agagttggta
gctcttgatc cggcaaacaa accaccgctg 960gtagcggtgg tttttttgtt tgcaagcagc
agattacgcg cagaaaaaaa ggatctcaag 1020aagatccttt gatcttttct acggggtctg
acgctcagtg gaacgaaaac tcacgttaag 1080ggattttggt catgagatta tcaaaaagga
tcttcaccta gatcctttta aattaaaaat 1140gaagttttaa atcaatctaa agtatatatg
agtaaacttg gtctgacagt taccaatgct 1200taatcagtga ggcacctatc tcagcgatct
gtctatttcg ttcatccata gttgcctgac 1260tccccgtcgt gtagataact acgatacggg
agggcttacc atctggcccc agtgctgcaa 1320tgataccgcg agacccacgc tcaccggctc
cagatttatc agcaataaac cagccagccg 1380gaagggccga gcgcagaagt ggtcctgcaa
ctttatccgc ctccatccag tctattaatt 1440gttgccggga agctagagta agtagttcgc
cagttaatag tttgcgcaac gttgttgcca 1500ttgctacagg catcgtggtg tcacgctcgt
cgtttggtat ggcttcattc agctccggtt 1560cccaacgatc aaggcgagtt acatgatccc
ccatgttgtg caaaaaagcg gttagctcct 1620tcggtcctcc gatcgttgtc agaagtaagt
tggccgcagt gttatcactc atggttatgg 1680cagcactgca taattctctt actgtcatgc
catccgtaag atgcttttct gtgactggtg 1740agtactcaac caagtcattc tgagaatagt
gtatgcggcg accgagttgc tcttgcccgg 1800cgtcaatacg ggataatacc gcgccacata
gcagaacttt aaaagtgctc atcattggaa 1860aacgttcttc ggggcgaaaa ctctcaagga
tcttaccgct gttgagatcc agttcgatgt 1920aacccactcg tgcacccaac tgatcttcag
catcttttac tttcaccagc gtttctgggt 1980gagcaaaaac aggaaggcaa aatgccgcaa
aaaagggaat aagggcgaca cggaaatgtt 2040gaatactcat actcttcctt tttcaatatt
attgaagcat ttatcagggt tattgtctca 2100tgagcggata catatttgaa tgtatttaga
aaaataaaca aataggggtt ccgcgcacat 2160ttccccgaaa agtgccacct gacgtctaag
aaaccattat tatcatgaca ttaacctata 2220aaaataggcg tatcacgagg ccctttcgtc
tcgcgcgttt cggtgatgac ggtgaaaacc 2280tctgacacat gcagctcccg gagacggtca
cagcttgtct gtaagcggat gccgggagca 2340gacaagcccg tcagggcgcg tcagcgggtg
ttggcgggtg tcggggctgg cttaactatg 2400cggcatcaga gcagattgta ctgagagtgc
accatatgcg gtgtgaaata ccgcacagat 2460gcgtaaggag aaaataccgc atcaggcgcc
attcgccatt caggctgcgc aactgttggg 2520aagggcgatc ggtgcgggcc tcttcgctat
tacgccagct ggcgaaaggg ggatgtgctg 2580caaggcgatt aagttgggta acgccagggt
tttcccagtc acgacgttgt aaaacgacgg 2640ccagtgccaa gcttgcatgc ctgcaggtcg
actctagagg atcccctgca cagaacccgt 2700actcgatccg ctcgttcgtc gctgtgaggt
ctttcctgca aggcaaatcg ttgtgccgga 2760aggggtttcg tgatgtttct ccgagcgttt
tttcgttcca agttggtcat ggtggctctt 2820gtcctggtcg ctggcctgtt tctctacaac
gcctgctctt cttctgacgc aaaggaagag 2880atcggcagca gtctgaatct ctctcctgtc
actgctcgtt cgaatccgta tgagggcgtc 2940cagcccacga tgagcgaaaa aagccctgtt
cccgtccctg tcgtttccgg cgacaggatt 3000tcgggggtgg catcgtgcgg gacggattac
gccgggaagc ctgcggtgac gctggaagct 3060gtgtggattt cgtccgactc ggtgaactac
acactcgata agaggcattg cctggtgacg 3120accggcccgc tgtggaaaca agcgatccgt
aaagcgtcag ggtcagagat tcggcctgag 3180ggcgggagct ggatacgggt ggtgcttgcc
atgcctgacg gcaatttcag ggcaggatgg 3240gcaccccacg cccaagtaac cgctggtgcg
ctggatattt cggcggtggt ctcgtgagcg 3300gggagaagcg gcacagcgag gccggcccgg
tagaaatcat ctttttgatg ctggcagtca 3360gggcggggga ctacatcgtc gccgtgactg
cggttctcgc ggtcgggttc ttcgcggtcg 3420cggttgaggg tttctggttc ctggtcgtcg
cagtcatcgc tgcaccggcg tggtggtttc 3480tgcgcgactg ggaatcgaag cggagggccg
tacgggtctt tgaacgggca tggaagggga 3540cacctgaatc ccccggtatt gctctctccc
ttggcctgtc gaacgtggcg gggtctctgc 3600cgaggttgag gaagtttgaa actggttcgg
ggatacgcac actcgtgttt tctttgccgc 3660ccggagtcac tgccgagagc tttgagaaag
ttcgccctgc gctggcagac gcgatggggg 3720gtcaccgctg ccaagtagag aaggtggccc
ccggacaggt ccgcgtcaga gtgattgatg 3780aggattcgat gaagacgccg cgtgatgcgg
gatgggcgaa agatgttgtg ctggaagagg 3840atacgttcga cggtcttccg ggcgagacgc
gatcctggtt cgagcaagag gggccggcat 3900catgagaaaa tcggcgggag tatctcggat
tcctatccgt ctcgggcgct ctcagtacgg 3960ggaagacgtt ggattcgatc tcgctgcgga
cgccgctcac atcgccatgc agggcaaaac 4020ccgatccggc aaaagtcagg cgacgtacaa
cgtgttagct caggcagcag cgaacgcggc 4080ggttcgagtc gtagggtccg acccgacaca
cgtactcctg gagcccttca aacatcgagg 4140ggtgtccgag ccttacgtgg tttcgggact
gaatgcgcag gccacggtgg acatgctggg 4200ctgggtcaag cgtgagtctg atcgtcgcat
cgaccagatg tggcccctgc gtaccgacaa 4260gttttccgag ttcggggctt cgttcccgct
gatactcgtc gtgctcgaag agtttcccgg 4320gatcctcgag ggggcagcgg acgaagacgc
cgcgttaggc cgaaaacctg ccgagcgtct 4380cgcaccccgc atttcggcct acgtgcgtca
gatagcagcg cagtcggcaa aggctggaat 4440tcgccttctc ctgctctcgc aacgagcgga
ggcctcgatc attggcggca atgcgcgttc 4500gaatttcggg gtcaagatga ctctgagggt
ggacgaaccg gagtcggtga gaatgcttca 4560tccgagcgct tccccggaag actgtgccct
ggtcgagacc ttcaagcctg gtacctgcct 4620tttcgagaag ccaggagaag gccggcagat
tatgcgatgc gactttgtcg gcgagtacgg 4680gagatatgcg cgagccatcg agtcttcgga
tctgcgtttt ctcgccaccc tccagcaaga 4740ccaggcccaa cgcgaattct tcgctgagga
gttcggtgtg gtggatccgt catgactgga 4800ccacaggaga gaaagcgcaa ggcggcgaag
ccgtcgcggg agcctcagtt gaactgctgt 4860gaagcggacg tgccgaaacg agcaaaacag
cccccggttc cctctacgtt cgacctgctc 4920acggtgaagg agactgcggg gctgctgaga
gtcagtcagg caactcttta ccggctgctt 4980cggagtgggg aaggacccac atacacacgg
atcggtggac agatacgcgt tcaccgcgag 5040tcgctgcgtc ggttcatcga accgcgtgga
taacgtcaca gagacagcga aaacgcctcc 5100cctgggtcaa tccggttacc gccggactgg
gggaggcgct tcgacaccta catccgtcgc 5160ccctcgaaag gctcagatgc acttccacga
taacgcagag gtcggacaag agggaagaac 5220tgccgttctc tcgccgttgc gcggcgtagc
cgccaagcgg gacgtgtctg acgatgcagc 5280gaagcggagt cggcaggcgc ggcacgcgcc
tgggcttgtt acatctgcca caactgtccg 5340tgaatctctg ccagctcctg aaaccgctgg
tcagggcctt gcggaatccg tgaccgctga 5400tgatttttgg tctcattcgt tcccccgcgc
tgacgatgta cgcggcgcag ctgcttcctt 5460ccagtcggtg gctaactggg atgggcgtga
gggtccgagg ccgcgtttcg ttgtcgcgcc 5520tggcgttgtc cgcttggagg tttgtgatct
cgcacgccgc gaacgaacgg ctgaacgtgc 5580gtatctggct gctcgggctc gggtggatat
ggcggctgcc aggcataact cgccgtacga 5640cttcgacgtg gacgatgaag agttggcgga
actggcttct ctgcaaggcc tcgaggacga 5700cgacattggg ggctggtctg cggagaggga
aatagtgggc tggtctgctc gttctcggtc 5760acggatgatc ttgcgaatgg cagaactcga
ctgggctccc atgatggatt tgccgggcat 5820tcctgcgatg gtgaccctca cctatccggg
ggactggctt acggttgccc ccaccggcgc 5880tgaggtcaaa aaacatctcc agacgttctt
caaacggttc caacgggcct ggggcattgc 5940ctggatgggt gcgtggaaaa tggagttcca
aagccgaggc gctccgcatt ttcacctgta 6000catggtccct cctcatggga aggcaggaga
ctcgcggaag ctgcggcatg atgctgagct 6060cttgaaatgg gagatagcac gtgcagaggg
tgaagaccca ggtcgcaggc cgtatttccg 6120ggaagctcca agcgatggat tgaagtttcg
tccgtggctt tctgcggtgt gggccgacgt 6180cgtagatcat ccggacccca aggaaaaaga
aaagcacgtc agtgccggca ctggagtgga 6240ctacgcggag ggcacgcgag ggtcagatcc
gaaaaggctt gcggtgtact tctccaagca 6300tggaaccttt gccgacaagg aatatcagca
cgtagttcct gctcaatggc agaaaacggg 6360tgcgggacct ggcaggttct ggggctaccg
cggtttgtcg ccggccacgg ctgccaccga 6420gatttcctgg gatgagtacc tgcttttatc
tcgcacgttg cgacgattgt cagcgcgaac 6480gaagatctgg gacccggctt tacgaggcgg
tagcggcggc cacagatgga ctaaggcgat 6540gatgcgacgc acggttaccc ggcaccgctt
ggacctcgtg accggtgaga ttctgggcac 6600gaagacgcgg aaggttcggg cgccagtgaa
gaggtttgtc cggacttcgg gatacctgtg 6660tgtcaatgac gggcccgcac tggctcgaac
cctcagccgt cttcgtacaa gctgcctgag 6720ctagacgcgc ggaacgcctt tcggctttgt
cttttgctgg atggcgggtt ttgggcggct 6780tctggtgatg cgctgctgcg ctccgtgggg
agagagaccc aacgactgac ctatctctac 6840ccaggtgcaa ttcatctccc gcgctctgtc
ggctaggtaa acgaggtgct cccgcgcgag 6900cttttccatg tggtcggcca atgtcagctc
ggtcaggaca acctgctgtt gttgcgatag 6960ttgtgtccgc acgggtcgat tgtcttctgt
tgcggcataa cggttttcgt cgttcgcgga 7020gagtgcggct aaatgaattg catcctcgat
tgagcggagc atttcgacgc ggaacctggc 7080gatgatgttg tctctgtctt cattcataac
tgaagcgtat tgggagtgtt gccctcccac 7140catgtgtgcc aatgcaggtg tgaactgagt
cacagtttct caatagactc caagtttgtg 7200atccttttac tcccaaaatg gggcatgatg
tgtgcgtgcc tcggttcagg ggcgaaagtt 7260cgacacctcg aaagaaggcc tcgacatggc
tttgaaagct gctggcaacg tgattcctga 7320ttcctccgcg tacgagtacc gggcggttca
ggtcgagccg aagatggtca gaaaagaccc 7380ggaagacccg aactctgagc agttccagaa
gcagaaggac ggcacgccgg tgtggtcgat 7440cgactgcatt cgggtcgacc gggcatcagg
caacaaggca atcgtgaccg tgacggttcc 7500ggacgtgatg gaaccggatg ttgcggggcc
ggtggagttc tccgagatga ttgccggttt 7560ctgggtttcg cgcagtggtt cgggcatgtg
gttttcggca agcgccgtcg cttctctctg 7620atcgctgatc gtcgcccctc gaaaggttcg
gaaatgtcca aaggaaaagg cgttgcgctg 7680ggtgtgggtg ccctcgtgct cgtgtttgtg
ctggttgcgg caggttggca agcggcgaac 7740gtgttcagtg atcgttcaca gtccgaagct
gtgccgctga gagtgccggc cgatccgaag 7800tgggaaaacg gggtgttctc ggacgttgcc
gggtgcctcg ttctctctcc ggaagagctg 7860gggccgttca gcggagggca gtacatcgac
atagtgaggc cagttgagcc ggagaggttg 7920gagcgcgact gggtgaggtc ggctgagtgc
gtttcggcgt cgatgaatgt ctctgacctg 7980ttggtttctg ctcttccaga gtccacccgt
ccccccggcg atttcgttcg ttcgtggaaa 8040gtggcgagtg atgattactg ctatgagggt
gataacccgc aaggctgcac ttctcgtatg 8100ccggtttggg tctctgcaaa aaactggtgg
tgca 8134
90 8124 DNA Artificial pRET1102
90 gacctgcagg catgcaagct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg
60ttatccgctc acaattccac acaacatacg agccggaagc ataaagtgta aagcctgggg
120tgcctaatga gtgagctaac tcacattaat tgcgttgcgc tcactgcccg ctttccagtc
180gggaaacctg tcgtgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt
240gcgtattggc gaacttttgc tgagttgaag gatcagatca cgcatcttcc cgacaacgca
300gaccgttccg tggcaaagca aaagttcaaa atcagtaacc gtcagtgccg ataagttcaa
360agttaaacct ggtgttgata ccaacattga aacgctgatc gaaaacgcgc tgaaaaacgc
420tgctgaatgt gcgagcttct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc
480ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag
540gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa
600aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc
660gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc
720ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg
780cctttctccc ttcgggaagc gtggcgcttt ctcaatgctc acgctgtagg tatctcagtt
840cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc
900gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc
960cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag
1020agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg
1080ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa
1140ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag
1200gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg aacgatccgt
1260cgagaggtct gcctcgtgaa gaaggtgttg ctgactcata ccaggcctga atcgccccat
1320catccagcca gaaagtgagg gagccacggt tgatgagagc tttgttgtag gtggaccagt
1380tggtgatttt gaacttttgc tttgccacgg aacggtctgc gttgtcggga agatgcgtga
1440tctgatcctt caactcagca aaagttcgat ttattcaaca aagccacgtt gtgtctcaaa
1500atctctgatg ttacattgca caagataaaa atatatcatc atgaacaata aaactgtctg
1560cttacataaa cagtaataca aggggtgtta tgagccatat tcaacgggaa acgtcttgct
1620cgaagccgcg attaaattcc aacatggatg ctgatttata tgggtataaa tgggctcgcg
1680ataatgtcgg gcaatcaggt gcgacaatct atcgattgta tgggaagccc gatgcgccag
1740agttgtttct gaaacatggc aaaggtagcg ttgccaatga tgttacagat gagatggtca
1800gactaaactg gctgacggaa tttatgcctc ttccgaccat caagcatttt atccgtactc
1860ctgatgatgc atggttactc accactgcga tccccgggaa aacagcattc caggtattag
1920aagaatatcc tgattcaggt gaaaatattg ttgatgcgct ggcagtgttc ctgcgccggt
1980tgcattcgat tcctgtttgt aattgtcctt ttaacagcga tcgcgtattt cgtctcgctc
2040aggcgcaatc acgaatgaat aacggtttgg ttgatgcgag tgattttgat gacgagcgta
2100atggctggcc tgttgaacaa gtctggaaag aaatgcataa gcttttgcca ttctcaccgg
2160attcagtcgt cactcatggt gatttctcac ttgataacct tatttttgac gaggggaaat
2220taataggttg tattgatgtt ggacgagtcg gaatcgcaga ccgataccag gatcttgcca
2280tcctatggaa ctgcctcggt gagttttctc cttcattaca gaaacggctt tttcaaaaat
2340atggtattga taatcctgat atgaataaat tgcagtttca tttgatgctc gatgagtttt
2400tctaatcaga attggttaat tggttgtaac actggcagag cattacgctg acttgacggg
2460acggcggctt tgttgaataa atcgcattcg ccattcaggc tgcgcaactg ttgggaaggg
2520cgatcggtgc gggcctcttc gctattacgc cagctggcga aagggggatg tgctgcaagg
2580cgattaagtt gggtaacgcc agggttttcc cagtcacgac gttgtaaaac gacggccagt
2640gaattcgagc tcggtacccg gggatcctct agagtctgca cagaacccgt actcgatccg
2700ctcgttcgtc gctgtgaggt ctttcctgca aggcaaatcg ttgtgccgga aggggtttcg
2760tgatgtttct ccgagcgttt tttcgttcca agttggtcat ggtggctctt gtcctggtcg
2820ctggcctgtt tctctacaac gcctgctctt cttctgacgc aaaggaagag atcggcagca
2880gtctgaatct ctctcctgtc actgctcgtt cgaatccgta tgagggcgtc cagcccacga
2940tgagcgaaaa aagccctgtt cccgtccctg tcgtttccgg cgacaggatt tcgggggtgg
3000catcgtgcgg gacggattac gccgggaagc ctgcggtgac gctggaagct gtgtggattt
3060cgtccgactc ggtgaactac acactcgata agaggcattg cctggtgacg accggcccgc
3120tgtggaaaca agcgatccgt aaagcgtcag ggtcagagat tcggcctgag ggcgggagct
3180ggatacgggt ggtgcttgcc atgcctgacg gcaatttcag ggcaggatgg gcaccccacg
3240cccaagtaac cgctggtgcg ctggatattt cggcggtggt ctcgtgagcg gggagaagcg
3300gcacagcgag gccggcccgg tagaaatcat ctttttgatg ctggcagtca gggcggggga
3360ctacatcgtc gccgtgactg cggttctcgc ggtcgggttc ttcgcggtcg cggttgaggg
3420tttctggttc ctggtcgtcg cagtcatcgc tgcaccggcg tggtggtttc tgcgcgactg
3480ggaatcgaag cggagggccg tacgggtctt tgaacgggca tggaagggga cacctgaatc
3540ccccggtatt gctctctccc ttggcctgtc gaacgtggcg gggtctctgc cgaggttgag
3600gaagtttgaa actggttcgg ggatacgcac actcgtgttt tctttgccgc ccggagtcac
3660tgccgagagc tttgagaaag ttcgccctgc gctggcagac gcgatggggg gtcaccgctg
3720ccaagtagag aaggtggccc ccggacaggt ccgcgtcaga gtgattgatg aggattcgat
3780gaagacgccg cgtgatgcgg gatgggcgaa agatgttgtg ctggaagagg atacgttcga
3840cggtcttccg ggcgagacgc gatcctggtt cgagcaagag gggccggcat catgagaaaa
3900tcggcgggag tatctcggat tcctatccgt ctcgggcgct ctcagtacgg ggaagacgtt
3960ggattcgatc tcgctgcgga cgccgctcac atcgccatgc agggcaaaac ccgatccggc
4020aaaagtcagg cgacgtacaa cgtgttagct caggcagcag cgaacgcggc ggttcgagtc
4080gtagggtccg acccgacaca cgtactcctg gagcccttca aacatcgagg ggtgtccgag
4140ccttacgtgg tttcgggact gaatgcgcag gccacggtgg acatgctggg ctgggtcaag
4200cgtgagtctg atcgtcgcat cgaccagatg tggcccctgc gtaccgacaa gttttccgag
4260ttcggggctt cgttcccgct gatactcgtc gtgctcgaag agtttcccgg gatcctcgag
4320ggggcagcgg acgaagacgc cgcgttaggc cgaaaacctg ccgagcgtct cgcaccccgc
4380atttcggcct acgtgcgtca gatagcagcg cagtcggcaa aggctggaat tcgccttctc
4440ctgctctcgc aacgagcgga ggcctcgatc attggcggca atgcgcgttc gaatttcggg
4500gtcaagatga ctctgagggt ggacgaaccg gagtcggtga gaatgcttca tccgagcgct
4560tccccggaag actgtgccct ggtcgagacc ttcaagcctg gtacctgcct tttcgagaag
4620ccaggagaag gccggcagat tatgcgatgc gactttgtcg gcgagtacgg gagatatgcg
4680cgagccatcg agtcttcgga tctgcgtttt ctcgccaccc tccagcaaga ccaggcccaa
4740cgcgaattct tcgctgagga gttcggtgtg gtggatccgt catgactgga ccacaggaga
4800gaaagcgcaa ggcggcgaag ccgtcgcggg agcctcagtt gaactgctgt gaagcggacg
4860tgccgaaacg agcaaaacag cccccggttc cctctacgtt cgacctgctc acggtgaagg
4920agactgcggg gctgctgaga gtcagtcagg caactcttta ccggctgctt cggagtgggg
4980aaggacccac atacacacgg atcggtggac agatacgcgt tcaccgcgag tcgctgcgtc
5040ggttcatcga accgcgtgga taacgtcaca gagacagcga aaacgcctcc cctgggtcaa
5100tccggttacc gccggactgg gggaggcgct tcgacaccta catccgtcgc ccctcgaaag
5160gctcagatgc acttccacga taacgcagag gtcggacaag agggaagaac tgccgttctc
5220tcgccgttgc gcggcgtagc cgccaagcgg gacgtgtctg acgatgcagc gaagcggagt
5280cggcaggcgc ggcacgcgcc tgggcttgtt acatctgcca caactgtccg tgaatctctg
5340ccagctcctg aaaccgctgg tcagggcctt gcggaatccg tgaccgctga tgatttttgg
5400tctcattcgt tcccccgcgc tgacgatgta cgcggcgcag ctgcttcctt ccagtcggtg
5460gctaactggg atgggcgtga gggtccgagg ccgcgtttcg ttgtcgcgcc tggcgttgtc
5520cgcttggagg tttgtgatct cgcacgccgc gaacgaacgg ctgaacgtgc gtatctggct
5580gctcgggctc gggtggatat ggcggctgcc aggcataact cgccgtacga cttcgacgtg
5640gacgatgaag agttggcgga actggcttct ctgcaaggcc tcgaggacga cgacattggg
5700ggctggtctg cggagaggga aatagtgggc tggtctgctc gttctcggtc acggatgatc
5760ttgcgaatgg cagaactcga ctgggctccc atgatggatt tgccgggcat tcctgcgatg
5820gtgaccctca cctatccggg ggactggctt acggttgccc ccaccggcgc tgaggtcaaa
5880aaacatctcc agacgttctt caaacggttc caacgggcct ggggcattgc ctggatgggt
5940gcgtggaaaa tggagttcca aagccgaggc gctccgcatt ttcacctgta catggtccct
6000cctcatggga aggcaggaga ctcgcggaag ctgcggcatg atgctgagct cttgaaatgg
6060gagatagcac gtgcagaggg tgaagaccca ggtcgcaggc cgtatttccg ggaagctcca
6120agcgatggat tgaagtttcg tccgtggctt tctgcggtgt gggccgacgt cgtagatcat
6180ccggacccca aggaaaaaga aaagcacgtc agtgccggca ctggagtgga ctacgcggag
6240ggcacgcgag ggtcagatcc gaaaaggctt gcggtgtact tctccaagca tggaaccttt
6300gccgacaagg aatatcagca cgtagttcct gctcaatggc agaaaacggg tgcgggacct
6360ggcaggttct ggggctaccg cggtttgtcg ccggccacgg ctgccaccga gatttcctgg
6420gatgagtacc tgcttttatc tcgcacgttg cgacgattgt cagcgcgaac gaagatctgg
6480gacccggctt tacgaggcgg tagcggcggc cacagatgga ctaaggcgat gatgcgacgc
6540acggttaccc ggcaccgctt ggacctcgtg accggtgaga ttctgggcac gaagacgcgg
6600aaggttcggg cgccagtgaa gaggtttgtc cggacttcgg gatacctgtg tgtcaatgac
6660gggcccgcac tggctcgaac cctcagccgt cttcgtacaa gctgcctgag ctagacgcgc
6720ggaacgcctt tcggctttgt cttttgctgg atggcgggtt ttgggcggct tctggtgatg
6780cgctgctgcg ctccgtgggg agagagaccc aacgactgac ctatctctac ccaggtgcaa
6840ttcatctccc gcgctctgtc ggctaggtaa acgaggtgct cccgcgcgag cttttccatg
6900tggtcggcca atgtcagctc ggtcaggaca acctgctgtt gttgcgatag ttgtgtccgc
6960acgggtcgat tgtcttctgt tgcggcataa cggttttcgt cgttcgcgga gagtgcggct
7020aaatgaattg catcctcgat tgagcggagc atttcgacgc ggaacctggc gatgatgttg
7080tctctgtctt cattcataac tgaagcgtat tgggagtgtt gccctcccac catgtgtgcc
7140aatgcaggtg tgaactgagt cacagtttct caatagactc caagtttgtg atccttttac
7200tcccaaaatg gggcatgatg tgtgcgtgcc tcggttcagg ggcgaaagtt cgacacctcg
7260aaagaaggcc tcgacatggc tttgaaagct gctggcaacg tgattcctga ttcctccgcg
7320tacgagtacc gggcggttca ggtcgagccg aagatggtca gaaaagaccc ggaagacccg
7380aactctgagc agttccagaa gcagaaggac ggcacgccgg tgtggtcgat cgactgcatt
7440cgggtcgacc gggcatcagg caacaaggca atcgtgaccg tgacggttcc ggacgtgatg
7500gaaccggatg ttgcggggcc ggtggagttc tccgagatga ttgccggttt ctgggtttcg
7560cgcagtggtt cgggcatgtg gttttcggca agcgccgtcg cttctctctg atcgctgatc
7620gtcgcccctc gaaaggttcg gaaatgtcca aaggaaaagg cgttgcgctg ggtgtgggtg
7680ccctcgtgct cgtgtttgtg ctggttgcgg caggttggca agcggcgaac gtgttcagtg
7740atcgttcaca gtccgaagct gtgccgctga gagtgccggc cgatccgaag tgggaaaacg
7800gggtgttctc ggacgttgcc gggtgcctcg ttctctctcc ggaagagctg gggccgttca
7860gcggagggca gtacatcgac atagtgaggc cagttgagcc ggagaggttg gagcgcgact
7920gggtgaggtc ggctgagtgc gtttcggcgt cgatgaatgt ctctgacctg ttggtttctg
7980ctcttccaga gtccacccgt ccccccggcg atttcgttcg ttcgtggaaa gtggcgagtg
8040atgattactg ctatgagggt gataacccgc aaggctgcac ttctcgtatg ccggtttggg
8100tctctgcaaa aaactggtgg tgca
8124
91 7675 DNA Artificial pRET1103
91 gacctgcagg catgcaagct tggcactggc
cgtcgtttta caacgtcgtg actgggaaaa 60ccctggcgtt acccaactta atcgccttgc
agcacatccc cctttcgcca gctggcgtaa 120tagcgaagag gcccgcaccg atcgcccttc
ccaacagttg cgcagcctga atggcgaatg 180agcttcttcc gcttcctcgc tcactgactc
gctgcgctcg gtcgttcggc tgcggcgagc 240ggtatcagct cactcaaagg cggtaatacg
gttatccaca gaatcagggg ataacgcagg 300aaagaacatg tgagcaaaag gccagcaaaa
ggccaggaac cgtaaaaagg ccgcgttgct 360ggcgtttttc cataggctcc gcccccctga
cgagcatcac aaaaatcgac gctcaagtca 420gaggtggcga aacccgacag gactataaag
ataccaggcg tttccccctg gaagctccct 480cgtgcgctct cctgttccga ccctgccgct
taccggatac ctgtccgcct ttctcccttc 540gggaagcgtg gcgctttctc aatgctcacg
ctgtaggtat ctcagttcgg tgtaggtcgt 600tcgctccaag ctgggctgtg tgcacgaacc
ccccgttcag cccgaccgct gcgccttatc 660cggtaactat cgtcttgagt ccaacccggt
aagacacgac ttatcgccac tggcagcagc 720cactggtaac aggattagca gagcgaggta
tgtaggcggt gctacagagt tcttgaagtg 780gtggcctaac tacggctaca ctagaaggac
agtatttggt atctgcgctc tgctgaagcc 840agttaccttc ggaaaaagag ttggtagctc
ttgatccggc aaacaaacca ccgctggtag 900cggtggtttt tttgtttgca agcagcagat
tacgcgcaga aaaaaaggat ctcaagaaga 960tcctttgatc ttttctacgg ggtctgacgc
tcagtggaac tccgtcgaac ggaagatcac 1020ttcgcagaat aaataaatcc tggtgtccct
gttgataccg ggaagccctg ggccaacttt 1080tggcgaaaat gagacgttga tcggcacgta
agaggttcca actttcacca taatgaaata 1140agatcactac cgggcgtatt ttttgagtta
tcgagatttt caggagctaa ggaagctaaa 1200atggagaaaa aaatcactgg atataccacc
gttgatatat cccaatggca tcgtaaagaa 1260cattttgagg catttcagtc agttgctcaa
tgtacctata accagaccgt tcagctggat 1320attacggcct ttttaaagac cgtaaagaaa
aataagcaca agttttatcc ggcctttatt 1380cacattcttg cccgcctgat gaatgctcat
ccggaatttc gtatggcaat gaaagacggt 1440gagctggtga tatgggatag tgttcaccct
tgttacaccg ttttccatga gcaaactgaa 1500acgttttcat cgctctggag tgaataccac
gacgatttcc ggcagtttct acacatatat 1560tcgcaagatg tggcgtgtta cggtgaaaac
ctggcctatt tccctaaagg gtttattgag 1620aatatgtttt tcgtctcagc caatccctgg
gtgagtttca ccagttttga tttaaacgtg 1680gccaatatgg acaacttctt cgcccccgtt
ttcaccatgg gcaaatatta tacgcaaggc 1740gacaaggtgc tgatgccgct ggcgattcag
gttcatcatg ccgtctgtga tggcttccat 1800gtcggcagaa tgcttaatga attacaacag
tactgcgatg agtggcaggg cggggcgtaa 1860tttttttaag gcagttattg gtgcccttaa
acgcctggtg ctacgcctga ataagtgata 1920ataagcggat gaatggcaga aattcagctt
ggcccagtgc caagctccaa tacgcaaacc 1980gcctctcccc gcgcgttggc cgattcatta
atgcagctgg cacgacaggt ttcccgactg 2040gaaagcgggc agtgagcgca acgcaattaa
tgtgagttag ctcactcatt aggcacccca 2100ggctttacac tttatgcttc cggctcgtat
gttgtgtgga attgtgagcg gataacaatt 2160tcacacagga aacagctatg accatgatta
cgaattcgag ctcggtaccc ggggatcctc 2220tagagtctgc acagaacccg tactcgatcc
gctcgttcgt cgctgtgagg tctttcctgc 2280aaggcaaatc gttgtgccgg aaggggtttc
gtgatgtttc tccgagcgtt ttttcgttcc 2340aagttggtca tggtggctct tgtcctggtc
gctggcctgt ttctctacaa cgcctgctct 2400tcttctgacg caaaggaaga gatcggcagc
agtctgaatc tctctcctgt cactgctcgt 2460tcgaatccgt atgagggcgt ccagcccacg
atgagcgaaa aaagccctgt tcccgtccct 2520gtcgtttccg gcgacaggat ttcgggggtg
gcatcgtgcg ggacggatta cgccgggaag 2580cctgcggtga cgctggaagc tgtgtggatt
tcgtccgact cggtgaacta cacactcgat 2640aagaggcatt gcctggtgac gaccggcccg
ctgtggaaac aagcgatccg taaagcgtca 2700gggtcagaga ttcggcctga gggcgggagc
tggatacggg tggtgcttgc catgcctgac 2760ggcaatttca gggcaggatg ggcaccccac
gcccaagtaa ccgctggtgc gctggatatt 2820tcggcggtgg tctcgtgagc ggggagaagc
ggcacagcga ggccggcccg gtagaaatca 2880tctttttgat gctggcagtc agggcggggg
actacatcgt cgccgtgact gcggttctcg 2940cggtcgggtt cttcgcggtc gcggttgagg
gtttctggtt cctggtcgtc gcagtcatcg 3000ctgcaccggc gtggtggttt ctgcgcgact
gggaatcgaa gcggagggcc gtacgggtct 3060ttgaacgggc atggaagggg acacctgaat
cccccggtat tgctctctcc cttggcctgt 3120cgaacgtggc ggggtctctg ccgaggttga
ggaagtttga aactggttcg gggatacgca 3180cactcgtgtt ttctttgccg cccggagtca
ctgccgagag ctttgagaaa gttcgccctg 3240cgctggcaga cgcgatgggg ggtcaccgct
gccaagtaga gaaggtggcc cccggacagg 3300tccgcgtcag agtgattgat gaggattcga
tgaagacgcc gcgtgatgcg ggatgggcga 3360aagatgttgt gctggaagag gatacgttcg
acggtcttcc gggcgagacg cgatcctggt 3420tcgagcaaga ggggccggca tcatgagaaa
atcggcggga gtatctcgga ttcctatccg 3480tctcgggcgc tctcagtacg gggaagacgt
tggattcgat ctcgctgcgg acgccgctca 3540catcgccatg cagggcaaaa cccgatccgg
caaaagtcag gcgacgtaca acgtgttagc 3600tcaggcagca gcgaacgcgg cggttcgagt
cgtagggtcc gacccgacac acgtactcct 3660ggagcccttc aaacatcgag gggtgtccga
gccttacgtg gtttcgggac tgaatgcgca 3720ggccacggtg gacatgctgg gctgggtcaa
gcgtgagtct gatcgtcgca tcgaccagat 3780gtggcccctg cgtaccgaca agttttccga
gttcggggct tcgttcccgc tgatactcgt 3840cgtgctcgaa gagtttcccg ggatcctcga
gggggcagcg gacgaagacg ccgcgttagg 3900ccgaaaacct gccgagcgtc tcgcaccccg
catttcggcc tacgtgcgtc agatagcagc 3960gcagtcggca aaggctggaa ttcgccttct
cctgctctcg caacgagcgg aggcctcgat 4020cattggcggc aatgcgcgtt cgaatttcgg
ggtcaagatg actctgaggg tggacgaacc 4080ggagtcggtg agaatgcttc atccgagcgc
ttccccggaa gactgtgccc tggtcgagac 4140cttcaagcct ggtacctgcc ttttcgagaa
gccaggagaa ggccggcaga ttatgcgatg 4200cgactttgtc ggcgagtacg ggagatatgc
gcgagccatc gagtcttcgg atctgcgttt 4260tctcgccacc ctccagcaag accaggccca
acgcgaattc ttcgctgagg agttcggtgt 4320ggtggatccg tcatgactgg accacaggag
agaaagcgca aggcggcgaa gccgtcgcgg 4380gagcctcagt tgaactgctg tgaagcggac
gtgccgaaac gagcaaaaca gcccccggtt 4440ccctctacgt tcgacctgct cacggtgaag
gagactgcgg ggctgctgag agtcagtcag 4500gcaactcttt accggctgct tcggagtggg
gaaggaccca catacacacg gatcggtgga 4560cagatacgcg ttcaccgcga gtcgctgcgt
cggttcatcg aaccgcgtgg ataacgtcac 4620agagacagcg aaaacgcctc ccctgggtca
atccggttac cgccggactg ggggaggcgc 4680ttcgacacct acatccgtcg cccctcgaaa
ggctcagatg cacttccacg ataacgcaga 4740ggtcggacaa gagggaagaa ctgccgttct
ctcgccgttg cgcggcgtag ccgccaagcg 4800ggacgtgtct gacgatgcag cgaagcggag
tcggcaggcg cggcacgcgc ctgggcttgt 4860tacatctgcc acaactgtcc gtgaatctct
gccagctcct gaaaccgctg gtcagggcct 4920tgcggaatcc gtgaccgctg atgatttttg
gtctcattcg ttcccccgcg ctgacgatgt 4980acgcggcgca gctgcttcct tccagtcggt
ggctaactgg gatgggcgtg agggtccgag 5040gccgcgtttc gttgtcgcgc ctggcgttgt
ccgcttggag gtttgtgatc tcgcacgccg 5100cgaacgaacg gctgaacgtg cgtatctggc
tgctcgggct cgggtggata tggcggctgc 5160caggcataac tcgccgtacg acttcgacgt
ggacgatgaa gagttggcgg aactggcttc 5220tctgcaaggc ctcgaggacg acgacattgg
gggctggtct gcggagaggg aaatagtggg 5280ctggtctgct cgttctcggt cacggatgat
cttgcgaatg gcagaactcg actgggctcc 5340catgatggat ttgccgggca ttcctgcgat
ggtgaccctc acctatccgg gggactggct 5400tacggttgcc cccaccggcg ctgaggtcaa
aaaacatctc cagacgttct tcaaacggtt 5460ccaacgggcc tggggcattg cctggatggg
tgcgtggaaa atggagttcc aaagccgagg 5520cgctccgcat tttcacctgt acatggtccc
tcctcatggg aaggcaggag actcgcggaa 5580gctgcggcat gatgctgagc tcttgaaatg
ggagatagca cgtgcagagg gtgaagaccc 5640aggtcgcagg ccgtatttcc gggaagctcc
aagcgatgga ttgaagtttc gtccgtggct 5700ttctgcggtg tgggccgacg tcgtagatca
tccggacccc aaggaaaaag aaaagcacgt 5760cagtgccggc actggagtgg actacgcgga
gggcacgcga gggtcagatc cgaaaaggct 5820tgcggtgtac ttctccaagc atggaacctt
tgccgacaag gaatatcagc acgtagttcc 5880tgctcaatgg cagaaaacgg gtgcgggacc
tggcaggttc tggggctacc gcggtttgtc 5940gccggccacg gctgccaccg agatttcctg
ggatgagtac ctgcttttat ctcgcacgtt 6000gcgacgattg tcagcgcgaa cgaagatctg
ggacccggct ttacgaggcg gtagcggcgg 6060ccacagatgg actaaggcga tgatgcgacg
cacggttacc cggcaccgct tggacctcgt 6120gaccggtgag attctgggca cgaagacgcg
gaaggttcgg gcgccagtga agaggtttgt 6180ccggacttcg ggatacctgt gtgtcaatga
cgggcccgca ctggctcgaa ccctcagccg 6240tcttcgtaca agctgcctga gctagacgcg
cggaacgcct ttcggctttg tcttttgctg 6300gatggcgggt tttgggcggc ttctggtgat
gcgctgctgc gctccgtggg gagagagacc 6360caacgactga cctatctcta cccaggtgca
attcatctcc cgcgctctgt cggctaggta 6420aacgaggtgc tcccgcgcga gcttttccat
gtggtcggcc aatgtcagct cggtcaggac 6480aacctgctgt tgttgcgata gttgtgtccg
cacgggtcga ttgtcttctg ttgcggcata 6540acggttttcg tcgttcgcgg agagtgcggc
taaatgaatt gcatcctcga ttgagcggag 6600catttcgacg cggaacctgg cgatgatgtt
gtctctgtct tcattcataa ctgaagcgta 6660ttgggagtgt tgccctccca ccatgtgtgc
caatgcaggt gtgaactgag tcacagtttc 6720tcaatagact ccaagtttgt gatcctttta
ctcccaaaat ggggcatgat gtgtgcgtgc 6780ctcggttcag gggcgaaagt tcgacacctc
gaaagaaggc ctcgacatgg ctttgaaagc 6840tgctggcaac gtgattcctg attcctccgc
gtacgagtac cgggcggttc aggtcgagcc 6900gaagatggtc agaaaagacc cggaagaccc
gaactctgag cagttccaga agcagaagga 6960cggcacgccg gtgtggtcga tcgactgcat
tcgggtcgac cgggcatcag gcaacaaggc 7020aatcgtgacc gtgacggttc cggacgtgat
ggaaccggat gttgcggggc cggtggagtt 7080ctccgagatg attgccggtt tctgggtttc
gcgcagtggt tcgggcatgt ggttttcggc 7140aagcgccgtc gcttctctct gatcgctgat
cgtcgcccct cgaaaggttc ggaaatgtcc 7200aaaggaaaag gcgttgcgct gggtgtgggt
gccctcgtgc tcgtgtttgt gctggttgcg 7260gcaggttggc aagcggcgaa cgtgttcagt
gatcgttcac agtccgaagc tgtgccgctg 7320agagtgccgg ccgatccgaa gtgggaaaac
ggggtgttct cggacgttgc cgggtgcctc 7380gttctctctc cggaagagct ggggccgttc
agcggagggc agtacatcga catagtgagg 7440ccagttgagc cggagaggtt ggagcgcgac
tgggtgaggt cggctgagtg cgtttcggcg 7500tcgatgaatg tctctgacct gttggtttct
gctcttccag agtccacccg tccccccggc 7560gatttcgttc gttcgtggaa agtggcgagt
gatgattact gctatgaggg tgataacccg 7620caaggctgca cttctcgtat gccggtttgg
gtctctgcaa aaaactggtg gtgca 7675
SEQ ID NO: 92 (8134 bp, DNA, Artificial, pRET1101Rv)
ggggatcctc tagagtcgac ctgcaggcat gcaagcttgg cactggccgt cgttttacaa
60cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct
120ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc
180agcctgaatg gcgaatggcg cctgatgcgg tattttctcc ttacgcatct gtgcggtatt
240tcacaccgca tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag
300ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct cccggcatcc
360gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca
420tcaccgaaac gcgcgagacg aaagggcctc gtgatacgcc tatttttata ggttaatgtc
480atgataataa tggtttctta gacgtcaggt ggcacttttc ggggaaatgt gcgcggaacc
540cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag acaataaccc
600tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca tttccgtgtc
660gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc agaaacgctg
720gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat cgaactggat
780ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc aatgatgagc
840acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg gcaagagcaa
900ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc agtcacagaa
960aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat aaccatgagt
1020gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga gctaaccgct
1080tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc ggagctgaat
1140gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc aacaacgttg
1200cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt aatagactgg
1260atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc tggctggttt
1320attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc agcactgggg
1380ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca ggcaactatg
1440gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca ttggtaactg
1500tcagaccaag tttactcata tatactttag attgatttaa aacttcattt ttaatttaaa
1560aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta acgtgagttt
1620tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt
1680tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt
1740ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag
1800ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta
1860gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat
1920aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg
1980ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg
2040agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac
2100aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga
2160aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt
2220ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta
2280cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt atcccctgat
2340tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg cagccgaacg
2400accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg caaaccgcct
2460ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc cgactggaaa
2520gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc accccaggct
2580ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata acaatttcac
2640acaggaaaca gctatgacca tgattacgaa ttcgagctcg gtaccctgca cagaacccgt
2700actcgatccg ctcgttcgtc gctgtgaggt ctttcctgca aggcaaatcg ttgtgccgga
2760aggggtttcg tgatgtttct ccgagcgttt tttcgttcca agttggtcat ggtggctctt
2820gtcctggtcg ctggcctgtt tctctacaac gcctgctctt cttctgacgc aaaggaagag
2880atcggcagca gtctgaatct ctctcctgtc actgctcgtt cgaatccgta tgagggcgtc
2940cagcccacga tgagcgaaaa aagccctgtt cccgtccctg tcgtttccgg cgacaggatt
3000tcgggggtgg catcgtgcgg gacggattac gccgggaagc ctgcggtgac gctggaagct
3060gtgtggattt cgtccgactc ggtgaactac acactcgata agaggcattg cctggtgacg
3120accggcccgc tgtggaaaca agcgatccgt aaagcgtcag ggtcagagat tcggcctgag
3180ggcgggagct ggatacgggt ggtgcttgcc atgcctgacg gcaatttcag ggcaggatgg
3240gcaccccacg cccaagtaac cgctggtgcg ctggatattt cggcggtggt ctcgtgagcg
3300gggagaagcg gcacagcgag gccggcccgg tagaaatcat ctttttgatg ctggcagtca
3360gggcggggga ctacatcgtc gccgtgactg cggttctcgc ggtcgggttc ttcgcggtcg
3420cggttgaggg tttctggttc ctggtcgtcg cagtcatcgc tgcaccggcg tggtggtttc
3480tgcgcgactg ggaatcgaag cggagggccg tacgggtctt tgaacgggca tggaagggga
3540cacctgaatc ccccggtatt gctctctccc ttggcctgtc gaacgtggcg gggtctctgc
3600cgaggttgag gaagtttgaa actggttcgg ggatacgcac actcgtgttt tctttgccgc
3660ccggagtcac tgccgagagc tttgagaaag ttcgccctgc gctggcagac gcgatggggg
3720gtcaccgctg ccaagtagag aaggtggccc ccggacaggt ccgcgtcaga gtgattgatg
3780aggattcgat gaagacgccg cgtgatgcgg gatgggcgaa agatgttgtg ctggaagagg
3840atacgttcga cggtcttccg ggcgagacgc gatcctggtt cgagcaagag gggccggcat
3900catgagaaaa tcggcgggag tatctcggat tcctatccgt ctcgggcgct ctcagtacgg
3960ggaagacgtt ggattcgatc tcgctgcgga cgccgctcac atcgccatgc agggcaaaac
4020ccgatccggc aaaagtcagg cgacgtacaa cgtgttagct caggcagcag cgaacgcggc
4080ggttcgagtc gtagggtccg acccgacaca cgtactcctg gagcccttca aacatcgagg
4140ggtgtccgag ccttacgtgg tttcgggact gaatgcgcag gccacggtgg acatgctggg
4200ctgggtcaag cgtgagtctg atcgtcgcat cgaccagatg tggcccctgc gtaccgacaa
4260gttttccgag ttcggggctt cgttcccgct gatactcgtc gtgctcgaag agtttcccgg
4320gatcctcgag ggggcagcgg acgaagacgc cgcgttaggc cgaaaacctg ccgagcgtct
4380cgcaccccgc atttcggcct acgtgcgtca gatagcagcg cagtcggcaa aggctggaat
4440tcgccttctc ctgctctcgc aacgagcgga ggcctcgatc attggcggca atgcgcgttc
4500gaatttcggg gtcaagatga ctctgagggt ggacgaaccg gagtcggtga gaatgcttca
4560tccgagcgct tccccggaag actgtgccct ggtcgagacc ttcaagcctg gtacctgcct
4620tttcgagaag ccaggagaag gccggcagat tatgcgatgc gactttgtcg gcgagtacgg
4680gagatatgcg cgagccatcg agtcttcgga tctgcgtttt ctcgccaccc tccagcaaga
4740ccaggcccaa cgcgaattct tcgctgagga gttcggtgtg gtggatccgt catgactgga
4800ccacaggaga gaaagcgcaa ggcggcgaag ccgtcgcggg agcctcagtt gaactgctgt
4860gaagcggacg tgccgaaacg agcaaaacag cccccggttc cctctacgtt cgacctgctc
4920acggtgaagg agactgcggg gctgctgaga gtcagtcagg caactcttta ccggctgctt
4980cggagtgggg aaggacccac atacacacgg atcggtggac agatacgcgt tcaccgcgag
5040tcgctgcgtc ggttcatcga accgcgtgga taacgtcaca gagacagcga aaacgcctcc
5100cctgggtcaa tccggttacc gccggactgg gggaggcgct tcgacaccta catccgtcgc
5160ccctcgaaag gctcagatgc acttccacga taacgcagag gtcggacaag agggaagaac
5220tgccgttctc tcgccgttgc gcggcgtagc cgccaagcgg gacgtgtctg acgatgcagc
5280gaagcggagt cggcaggcgc ggcacgcgcc tgggcttgtt acatctgcca caactgtccg
5340tgaatctctg ccagctcctg aaaccgctgg tcagggcctt gcggaatccg tgaccgctga
5400tgatttttgg tctcattcgt tcccccgcgc tgacgatgta cgcggcgcag ctgcttcctt
5460ccagtcggtg gctaactggg atgggcgtga gggtccgagg ccgcgtttcg ttgtcgcgcc
5520tggcgttgtc cgcttggagg tttgtgatct cgcacgccgc gaacgaacgg ctgaacgtgc
5580gtatctggct gctcgggctc gggtggatat ggcggctgcc aggcataact cgccgtacga
5640cttcgacgtg gacgatgaag agttggcgga actggcttct ctgcaaggcc tcgaggacga
5700cgacattggg ggctggtctg cggagaggga aatagtgggc tggtctgctc gttctcggtc
5760acggatgatc ttgcgaatgg cagaactcga ctgggctccc atgatggatt tgccgggcat
5820tcctgcgatg gtgaccctca cctatccggg ggactggctt acggttgccc ccaccggcgc
5880tgaggtcaaa aaacatctcc agacgttctt caaacggttc caacgggcct ggggcattgc
5940ctggatgggt gcgtggaaaa tggagttcca aagccgaggc gctccgcatt ttcacctgta
6000catggtccct cctcatggga aggcaggaga ctcgcggaag ctgcggcatg atgctgagct
6060cttgaaatgg gagatagcac gtgcagaggg tgaagaccca ggtcgcaggc cgtatttccg
6120ggaagctcca agcgatggat tgaagtttcg tccgtggctt tctgcggtgt gggccgacgt
6180cgtagatcat ccggacccca aggaaaaaga aaagcacgtc agtgccggca ctggagtgga
6240ctacgcggag ggcacgcgag ggtcagatcc gaaaaggctt gcggtgtact tctccaagca
6300tggaaccttt gccgacaagg aatatcagca cgtagttcct gctcaatggc agaaaacggg
6360tgcgggacct ggcaggttct ggggctaccg cggtttgtcg ccggccacgg ctgccaccga
6420gatttcctgg gatgagtacc tgcttttatc tcgcacgttg cgacgattgt cagcgcgaac
6480gaagatctgg gacccggctt tacgaggcgg tagcggcggc cacagatgga ctaaggcgat
6540gatgcgacgc acggttaccc ggcaccgctt ggacctcgtg accggtgaga ttctgggcac
6600gaagacgcgg aaggttcggg cgccagtgaa gaggtttgtc cggacttcgg gatacctgtg
6660tgtcaatgac gggcccgcac tggctcgaac cctcagccgt cttcgtacaa gctgcctgag
6720ctagacgcgc ggaacgcctt tcggctttgt cttttgctgg atggcgggtt ttgggcggct
6780tctggtgatg cgctgctgcg ctccgtgggg agagagaccc aacgactgac ctatctctac
6840ccaggtgcaa ttcatctccc gcgctctgtc ggctaggtaa acgaggtgct cccgcgcgag
6900cttttccatg tggtcggcca atgtcagctc ggtcaggaca acctgctgtt gttgcgatag
6960ttgtgtccgc acgggtcgat tgtcttctgt tgcggcataa cggttttcgt cgttcgcgga
7020gagtgcggct aaatgaattg catcctcgat tgagcggagc atttcgacgc ggaacctggc
7080gatgatgttg tctctgtctt cattcataac tgaagcgtat tgggagtgtt gccctcccac
7140catgtgtgcc aatgcaggtg tgaactgagt cacagtttct caatagactc caagtttgtg
7200atccttttac tcccaaaatg gggcatgatg tgtgcgtgcc tcggttcagg ggcgaaagtt
7260cgacacctcg aaagaaggcc tcgacatggc tttgaaagct gctggcaacg tgattcctga
7320ttcctccgcg tacgagtacc gggcggttca ggtcgagccg aagatggtca gaaaagaccc
7380ggaagacccg aactctgagc agttccagaa gcagaaggac ggcacgccgg tgtggtcgat
7440cgactgcatt cgggtcgacc gggcatcagg caacaaggca atcgtgaccg tgacggttcc
7500ggacgtgatg gaaccggatg ttgcggggcc ggtggagttc tccgagatga ttgccggttt
7560ctgggtttcg cgcagtggtt cgggcatgtg gttttcggca agcgccgtcg cttctctctg
7620atcgctgatc gtcgcccctc gaaaggttcg gaaatgtcca aaggaaaagg cgttgcgctg
7680ggtgtgggtg ccctcgtgct cgtgtttgtg ctggttgcgg caggttggca agcggcgaac
7740gtgttcagtg atcgttcaca gtccgaagct gtgccgctga gagtgccggc cgatccgaag
7800tgggaaaacg gggtgttctc ggacgttgcc gggtgcctcg ttctctctcc ggaagagctg
7860gggccgttca gcggagggca gtacatcgac atagtgaggc cagttgagcc ggagaggttg
7920gagcgcgact gggtgaggtc ggctgagtgc gtttcggcgt cgatgaatgt ctctgacctg
7980ttggtttctg ctcttccaga gtccacccgt ccccccggcg atttcgttcg ttcgtggaaa
8040gtggcgagtg atgattactg ctatgagggt gataacccgc aaggctgcac ttctcgtatg
8100ccggtttggg tctctgcaaa aaactggtgg tgca
8134
SEQ ID NO: 93 (8124 bp, DNA, Artificial, pRET1102Rv)
gactctagag gatccccggg taccgagctc
gaattcactg gccgtcgttt tacaacgtcg 60tgactgggaa aaccctggcg ttacccaact
taatcgcctt gcagcacatc cccctttcgc 120cagctggcgt aatagcgaag aggcccgcac
cgatcgccct tcccaacagt tgcgcagcct 180gaatggcgaa tgcgatttat tcaacaaagc
cgccgtcccg tcaagtcagc gtaatgctct 240gccagtgtta caaccaatta accaattctg
attagaaaaa ctcatcgagc atcaaatgaa 300actgcaattt attcatatca ggattatcaa
taccatattt ttgaaaaagc cgtttctgta 360atgaaggaga aaactcaccg aggcagttcc
ataggatggc aagatcctgg tatcggtctg 420cgattccgac tcgtccaaca tcaatacaac
ctattaattt cccctcgtca aaaataaggt 480tatcaagtga gaaatcacca tgagtgacga
ctgaatccgg tgagaatggc aaaagcttat 540gcatttcttt ccagacttgt tcaacaggcc
agccattacg ctcgtcatca aaatcactcg 600catcaaccaa accgttattc attcgtgatt
gcgcctgagc gagacgaaat acgcgatcgc 660tgttaaaagg acaattacaa acaggaatcg
aatgcaaccg gcgcaggaac actgccagcg 720catcaacaat attttcacct gaatcaggat
attcttctaa tacctggaat gctgttttcc 780cggggatcgc agtggtgagt aaccatgcat
catcaggagt acggataaaa tgcttgatgg 840tcggaagagg cataaattcc gtcagccagt
ttagtctgac catctcatct gtaacatcat 900tggcaacgct acctttgcca tgtttcagaa
acaactctgg cgcatcgggc ttcccataca 960atcgatagat tgtcgcacct gattgcccga
cattatcgcg agcccattta tacccatata 1020aatcagcatc catgttggaa tttaatcgcg
gcttcgagca agacgtttcc cgttgaatat 1080ggctcataac accccttgta ttactgttta
tgtaagcaga cagttttatt gttcatgatg 1140atatattttt atcttgtgca atgtaacatc
agagattttg agacacaacg tggctttgtt 1200gaataaatcg aacttttgct gagttgaagg
atcagatcac gcatcttccc gacaacgcag 1260accgttccgt ggcaaagcaa aagttcaaaa
tcaccaactg gtccacctac aacaaagctc 1320tcatcaaccg tggctccctc actttctggc
tggatgatgg ggcgattcag gcctggtatg 1380agtcagcaac accttcttca cgaggcagac
ctctcgacgg atcgttccac tgagcgtcag 1440accccgtaga aaagatcaaa ggatcttctt
gagatccttt ttttctgcgc gtaatctgct 1500gcttgcaaac aaaaaaacca ccgctaccag
cggtggtttg tttgccggat caagagctac 1560caactctttt tccgaaggta actggcttca
gcagagcgca gataccaaat actgtccttc 1620tagtgtagcc gtagttaggc caccacttca
agaactctgt agcaccgcct acatacctcg 1680ctctgctaat cctgttacca gtggctgctg
ccagtggcga taagtcgtgt cttaccgggt 1740tggactcaag acgatagtta ccggataagg
cgcagcggtc gggctgaacg gggggttcgt 1800gcacacagcc cagcttggag cgaacgacct
acaccgaact gagataccta cagcgtgagc 1860attgagaaag cgccacgctt cccgaaggga
gaaaggcgga caggtatccg gtaagcggca 1920gggtcggaac aggagagcgc acgagggagc
ttccaggggg aaacgcctgg tatctttata 1980gtcctgtcgg gtttcgccac ctctgacttg
agcgtcgatt tttgtgatgc tcgtcagggg 2040ggcggagcct atggaaaaac gccagcaacg
cggccttttt acggttcctg gccttttgct 2100ggccttttgc tcacatgttc tttcctgcgt
tatcccctga ttctgtggat aaccgtatta 2160ccgcctttga gtgagctgat accgctcgcc
gcagccgaac gaccgagcgc agcgagtcag 2220tgagcgagga agcggaagaa gctcgcacat
tcagcagcgt ttttcagcgc gttttcgatc 2280agcgtttcaa tgttggtatc aacaccaggt
ttaactttga acttatcggc actgacggtt 2340actgattttg aacttttgct ttgccacgga
acggtctgcg ttgtcgggaa gatgcgtgat 2400ctgatccttc aactcagcaa aagttcgcca
atacgcaaac cgcctctccc cgcgcgttgg 2460ccgattcatt aatgcagctg gcacgacagg
tttcccgact ggaaagcggg cagtgagcgc 2520aacgcaatta atgtgagtta gctcactcat
taggcacccc aggctttaca ctttatgctt 2580ccggctcgta tgttgtgtgg aattgtgagc
ggataacaat ttcacacagg aaacagctat 2640gaccatgatt acgccaagct tgcatgcctg
caggtctgca cagaacccgt actcgatccg 2700ctcgttcgtc gctgtgaggt ctttcctgca
aggcaaatcg ttgtgccgga aggggtttcg 2760tgatgtttct ccgagcgttt tttcgttcca
agttggtcat ggtggctctt gtcctggtcg 2820ctggcctgtt tctctacaac gcctgctctt
cttctgacgc aaaggaagag atcggcagca 2880gtctgaatct ctctcctgtc actgctcgtt
cgaatccgta tgagggcgtc cagcccacga 2940tgagcgaaaa aagccctgtt cccgtccctg
tcgtttccgg cgacaggatt tcgggggtgg 3000catcgtgcgg gacggattac gccgggaagc
ctgcggtgac gctggaagct gtgtggattt 3060cgtccgactc ggtgaactac acactcgata
agaggcattg cctggtgacg accggcccgc 3120tgtggaaaca agcgatccgt aaagcgtcag
ggtcagagat tcggcctgag ggcgggagct 3180ggatacgggt ggtgcttgcc atgcctgacg
gcaatttcag ggcaggatgg gcaccccacg 3240cccaagtaac cgctggtgcg ctggatattt
cggcggtggt ctcgtgagcg gggagaagcg 3300gcacagcgag gccggcccgg tagaaatcat
ctttttgatg ctggcagtca gggcggggga 3360ctacatcgtc gccgtgactg cggttctcgc
ggtcgggttc ttcgcggtcg cggttgaggg 3420tttctggttc ctggtcgtcg cagtcatcgc
tgcaccggcg tggtggtttc tgcgcgactg 3480ggaatcgaag cggagggccg tacgggtctt
tgaacgggca tggaagggga cacctgaatc 3540ccccggtatt gctctctccc ttggcctgtc
gaacgtggcg gggtctctgc cgaggttgag 3600gaagtttgaa actggttcgg ggatacgcac
actcgtgttt tctttgccgc ccggagtcac 3660tgccgagagc tttgagaaag ttcgccctgc
gctggcagac gcgatggggg gtcaccgctg 3720ccaagtagag aaggtggccc ccggacaggt
ccgcgtcaga gtgattgatg aggattcgat 3780gaagacgccg cgtgatgcgg gatgggcgaa
agatgttgtg ctggaagagg atacgttcga 3840cggtcttccg ggcgagacgc gatcctggtt
cgagcaagag gggccggcat catgagaaaa 3900tcggcgggag tatctcggat tcctatccgt
ctcgggcgct ctcagtacgg ggaagacgtt 3960ggattcgatc tcgctgcgga cgccgctcac
atcgccatgc agggcaaaac ccgatccggc 4020aaaagtcagg cgacgtacaa cgtgttagct
caggcagcag cgaacgcggc ggttcgagtc 4080gtagggtccg acccgacaca cgtactcctg
gagcccttca aacatcgagg ggtgtccgag 4140ccttacgtgg tttcgggact gaatgcgcag
gccacggtgg acatgctggg ctgggtcaag 4200cgtgagtctg atcgtcgcat cgaccagatg
tggcccctgc gtaccgacaa gttttccgag 4260ttcggggctt cgttcccgct gatactcgtc
gtgctcgaag agtttcccgg gatcctcgag 4320ggggcagcgg acgaagacgc cgcgttaggc
cgaaaacctg ccgagcgtct cgcaccccgc 4380atttcggcct acgtgcgtca gatagcagcg
cagtcggcaa aggctggaat tcgccttctc 4440ctgctctcgc aacgagcgga ggcctcgatc
attggcggca atgcgcgttc gaatttcggg 4500gtcaagatga ctctgagggt ggacgaaccg
gagtcggtga gaatgcttca tccgagcgct 4560tccccggaag actgtgccct ggtcgagacc
ttcaagcctg gtacctgcct tttcgagaag 4620ccaggagaag gccggcagat tatgcgatgc
gactttgtcg gcgagtacgg gagatatgcg 4680cgagccatcg agtcttcgga tctgcgtttt
ctcgccaccc tccagcaaga ccaggcccaa 4740cgcgaattct tcgctgagga gttcggtgtg
gtggatccgt catgactgga ccacaggaga 4800gaaagcgcaa ggcggcgaag ccgtcgcggg
agcctcagtt gaactgctgt gaagcggacg 4860tgccgaaacg agcaaaacag cccccggttc
cctctacgtt cgacctgctc acggtgaagg 4920agactgcggg gctgctgaga gtcagtcagg
caactcttta ccggctgctt cggagtgggg 4980aaggacccac atacacacgg atcggtggac
agatacgcgt tcaccgcgag tcgctgcgtc 5040ggttcatcga accgcgtgga taacgtcaca
gagacagcga aaacgcctcc cctgggtcaa 5100tccggttacc gccggactgg gggaggcgct
tcgacaccta catccgtcgc ccctcgaaag 5160gctcagatgc acttccacga taacgcagag
gtcggacaag agggaagaac tgccgttctc 5220tcgccgttgc gcggcgtagc cgccaagcgg
gacgtgtctg acgatgcagc gaagcggagt 5280cggcaggcgc ggcacgcgcc tgggcttgtt
acatctgcca caactgtccg tgaatctctg 5340ccagctcctg aaaccgctgg tcagggcctt
gcggaatccg tgaccgctga tgatttttgg 5400tctcattcgt tcccccgcgc tgacgatgta
cgcggcgcag ctgcttcctt ccagtcggtg 5460gctaactggg atgggcgtga gggtccgagg
ccgcgtttcg ttgtcgcgcc tggcgttgtc 5520cgcttggagg tttgtgatct cgcacgccgc
gaacgaacgg ctgaacgtgc gtatctggct 5580gctcgggctc gggtggatat ggcggctgcc
aggcataact cgccgtacga cttcgacgtg 5640gacgatgaag agttggcgga actggcttct
ctgcaaggcc tcgaggacga cgacattggg 5700ggctggtctg cggagaggga aatagtgggc
tggtctgctc gttctcggtc acggatgatc 5760ttgcgaatgg cagaactcga ctgggctccc
atgatggatt tgccgggcat tcctgcgatg 5820gtgaccctca cctatccggg ggactggctt
acggttgccc ccaccggcgc tgaggtcaaa 5880aaacatctcc agacgttctt caaacggttc
caacgggcct ggggcattgc ctggatgggt 5940gcgtggaaaa tggagttcca aagccgaggc
gctccgcatt ttcacctgta catggtccct 6000cctcatggga aggcaggaga ctcgcggaag
ctgcggcatg atgctgagct cttgaaatgg 6060gagatagcac gtgcagaggg tgaagaccca
ggtcgcaggc cgtatttccg ggaagctcca 6120agcgatggat tgaagtttcg tccgtggctt
tctgcggtgt gggccgacgt cgtagatcat 6180ccggacccca aggaaaaaga aaagcacgtc
agtgccggca ctggagtgga ctacgcggag 6240ggcacgcgag ggtcagatcc gaaaaggctt
gcggtgtact tctccaagca tggaaccttt 6300gccgacaagg aatatcagca cgtagttcct
gctcaatggc agaaaacggg tgcgggacct 6360ggcaggttct ggggctaccg cggtttgtcg
ccggccacgg ctgccaccga gatttcctgg 6420gatgagtacc tgcttttatc tcgcacgttg
cgacgattgt cagcgcgaac gaagatctgg 6480gacccggctt tacgaggcgg tagcggcggc
cacagatgga ctaaggcgat gatgcgacgc 6540acggttaccc ggcaccgctt ggacctcgtg
accggtgaga ttctgggcac gaagacgcgg 6600aaggttcggg cgccagtgaa gaggtttgtc
cggacttcgg gatacctgtg tgtcaatgac 6660gggcccgcac tggctcgaac cctcagccgt
cttcgtacaa gctgcctgag ctagacgcgc 6720ggaacgcctt tcggctttgt cttttgctgg
atggcgggtt ttgggcggct tctggtgatg 6780cgctgctgcg ctccgtgggg agagagaccc
aacgactgac ctatctctac ccaggtgcaa 6840ttcatctccc gcgctctgtc ggctaggtaa
acgaggtgct cccgcgcgag cttttccatg 6900tggtcggcca atgtcagctc ggtcaggaca
acctgctgtt gttgcgatag ttgtgtccgc 6960acgggtcgat tgtcttctgt tgcggcataa
cggttttcgt cgttcgcgga gagtgcggct 7020aaatgaattg catcctcgat tgagcggagc
atttcgacgc ggaacctggc gatgatgttg 7080tctctgtctt cattcataac tgaagcgtat
tgggagtgtt gccctcccac catgtgtgcc 7140aatgcaggtg tgaactgagt cacagtttct
caatagactc caagtttgtg atccttttac 7200tcccaaaatg gggcatgatg tgtgcgtgcc
tcggttcagg ggcgaaagtt cgacacctcg 7260aaagaaggcc tcgacatggc tttgaaagct
gctggcaacg tgattcctga ttcctccgcg 7320tacgagtacc gggcggttca ggtcgagccg
aagatggtca gaaaagaccc ggaagacccg 7380aactctgagc agttccagaa gcagaaggac
ggcacgccgg tgtggtcgat cgactgcatt 7440cgggtcgacc gggcatcagg caacaaggca
atcgtgaccg tgacggttcc ggacgtgatg 7500gaaccggatg ttgcggggcc ggtggagttc
tccgagatga ttgccggttt ctgggtttcg 7560cgcagtggtt cgggcatgtg gttttcggca
agcgccgtcg cttctctctg atcgctgatc 7620gtcgcccctc gaaaggttcg gaaatgtcca
aaggaaaagg cgttgcgctg ggtgtgggtg 7680ccctcgtgct cgtgtttgtg ctggttgcgg
caggttggca agcggcgaac gtgttcagtg 7740atcgttcaca gtccgaagct gtgccgctga
gagtgccggc cgatccgaag tgggaaaacg 7800gggtgttctc ggacgttgcc gggtgcctcg
ttctctctcc ggaagagctg gggccgttca 7860gcggagggca gtacatcgac atagtgaggc
cagttgagcc ggagaggttg gagcgcgact 7920gggtgaggtc ggctgagtgc gtttcggcgt
cgatgaatgt ctctgacctg ttggtttctg 7980ctcttccaga gtccacccgt ccccccggcg
atttcgttcg ttcgtggaaa gtggcgagtg 8040atgattactg ctatgagggt gataacccgc
aaggctgcac ttctcgtatg ccggtttggg 8100tctctgcaaa aaactggtgg tgca
8124
SEQ ID NO: 94 (7675 bp, DNA, Artificial, pRET1103Rv)
gactctagag gatccccggg taccgagctc gaattcgtaa tcatggtcat agctgtttcc
60tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg
120taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc
180cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg
240gagaggcggt ttgcgtattg gagcttggca ctgggccaag ctgaatttct gccattcatc
300cgcttattat cacttattca ggcgtagcac caggcgttta agggcaccaa taactgcctt
360aaaaaaatta cgccccgccc tgccactcat cgcagtactg ttgtaattca ttaagcattc
420tgccgacatg gaagccatca cagacggcat gatgaacctg aatcgccagc ggcatcagca
480ccttgtcgcc ttgcgtataa tatttgccca tggtgaaaac gggggcgaag aagttgtcca
540tattggccac gtttaaatca aaactggtga aactcaccca gggattggct gagacgaaaa
600acatattctc aataaaccct ttagggaaat aggccaggtt ttcaccgtaa cacgccacat
660cttgcgaata tatgtgtaga aactgccgga aatcgtcgtg gtattcactc cagagcgatg
720aaaacgtttc agtttgctca tggaaaacgg tgtaacaagg gtgaacacta tcccatatca
780ccagctcacc gtctttcatt gccatacgaa attccggatg agcattcatc aggcgggcaa
840gaatgtgaat aaaggccgga taaaacttgt gcttattttt ctttacggtc tttaaaaagg
900ccgtaatatc cagctgaacg gtctggttat aggtacattg agcaactgac tgaaatgcct
960caaaatgttc tttacgatgc cattgggata tatcaacggt ggtatatcca gtgatttttt
1020tctccatttt agcttcctta gctcctgaaa atctcgataa ctcaaaaaat acgcccggta
1080gtgatcttat ttcattatgg tgaaagttgg aacctcttac gtgccgatca acgtctcatt
1140ttcgccaaaa gttggcccag ggcttcccgg tatcaacagg gacaccagga tttatttatt
1200ctgcgaagtg atcttccgtt cgacggagtt ccactgagcg tcagaccccg tagaaaagat
1260caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa
1320accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa
1380ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt agccgtagtt
1440aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt
1500accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata
1560gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt
1620ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag aaagcgccac
1680gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga
1740gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg tcgggtttcg
1800ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga gcctatggaa
1860aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat
1920gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc
1980tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga
2040agaagctcat tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc
2100ttcgctatta cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac
2160gccagggttt tcccagtcac gacgttgtaa aacgacggcc agtgccaagc ttgcatgcct
2220gcaggtctgc acagaacccg tactcgatcc gctcgttcgt cgctgtgagg tctttcctgc
2280aaggcaaatc gttgtgccgg aaggggtttc gtgatgtttc tccgagcgtt ttttcgttcc
2340aagttggtca tggtggctct tgtcctggtc gctggcctgt ttctctacaa cgcctgctct
2400tcttctgacg caaaggaaga gatcggcagc agtctgaatc tctctcctgt cactgctcgt
2460tcgaatccgt atgagggcgt ccagcccacg atgagcgaaa aaagccctgt tcccgtccct
2520gtcgtttccg gcgacaggat ttcgggggtg gcatcgtgcg ggacggatta cgccgggaag
2580cctgcggtga cgctggaagc tgtgtggatt tcgtccgact cggtgaacta cacactcgat
2640aagaggcatt gcctggtgac gaccggcccg ctgtggaaac aagcgatccg taaagcgtca
2700gggtcagaga ttcggcctga gggcgggagc tggatacggg tggtgcttgc catgcctgac
2760ggcaatttca gggcaggatg ggcaccccac gcccaagtaa ccgctggtgc gctggatatt
2820tcggcggtgg tctcgtgagc ggggagaagc ggcacagcga ggccggcccg gtagaaatca
2880tctttttgat gctggcagtc agggcggggg actacatcgt cgccgtgact gcggttctcg
2940cggtcgggtt cttcgcggtc gcggttgagg gtttctggtt cctggtcgtc gcagtcatcg
3000ctgcaccggc gtggtggttt ctgcgcgact gggaatcgaa gcggagggcc gtacgggtct
3060ttgaacgggc atggaagggg acacctgaat cccccggtat tgctctctcc cttggcctgt
3120cgaacgtggc ggggtctctg ccgaggttga ggaagtttga aactggttcg gggatacgca
3180cactcgtgtt ttctttgccg cccggagtca ctgccgagag ctttgagaaa gttcgccctg
3240cgctggcaga cgcgatgggg ggtcaccgct gccaagtaga gaaggtggcc cccggacagg
3300tccgcgtcag agtgattgat gaggattcga tgaagacgcc gcgtgatgcg ggatgggcga
3360aagatgttgt gctggaagag gatacgttcg acggtcttcc gggcgagacg cgatcctggt
3420tcgagcaaga ggggccggca tcatgagaaa atcggcggga gtatctcgga ttcctatccg
3480tctcgggcgc tctcagtacg gggaagacgt tggattcgat ctcgctgcgg acgccgctca
3540catcgccatg cagggcaaaa cccgatccgg caaaagtcag gcgacgtaca acgtgttagc
3600tcaggcagca gcgaacgcgg cggttcgagt cgtagggtcc gacccgacac acgtactcct
3660ggagcccttc aaacatcgag gggtgtccga gccttacgtg gtttcgggac tgaatgcgca
3720ggccacggtg gacatgctgg gctgggtcaa gcgtgagtct gatcgtcgca tcgaccagat
3780gtggcccctg cgtaccgaca agttttccga gttcggggct tcgttcccgc tgatactcgt
3840cgtgctcgaa gagtttcccg ggatcctcga gggggcagcg gacgaagacg ccgcgttagg
3900ccgaaaacct gccgagcgtc tcgcaccccg catttcggcc tacgtgcgtc agatagcagc
3960gcagtcggca aaggctggaa ttcgccttct cctgctctcg caacgagcgg aggcctcgat
4020cattggcggc aatgcgcgtt cgaatttcgg ggtcaagatg actctgaggg tggacgaacc
4080ggagtcggtg agaatgcttc atccgagcgc ttccccggaa gactgtgccc tggtcgagac
4140cttcaagcct ggtacctgcc ttttcgagaa gccaggagaa ggccggcaga ttatgcgatg
4200cgactttgtc ggcgagtacg ggagatatgc gcgagccatc gagtcttcgg atctgcgttt
4260tctcgccacc ctccagcaag accaggccca acgcgaattc ttcgctgagg agttcggtgt
4320ggtggatccg tcatgactgg accacaggag agaaagcgca aggcggcgaa gccgtcgcgg
4380gagcctcagt tgaactgctg tgaagcggac gtgccgaaac gagcaaaaca gcccccggtt
4440ccctctacgt tcgacctgct cacggtgaag gagactgcgg ggctgctgag agtcagtcag
4500gcaactcttt accggctgct tcggagtggg gaaggaccca catacacacg gatcggtgga
4560cagatacgcg ttcaccgcga gtcgctgcgt cggttcatcg aaccgcgtgg ataacgtcac
4620agagacagcg aaaacgcctc ccctgggtca atccggttac cgccggactg ggggaggcgc
4680ttcgacacct acatccgtcg cccctcgaaa ggctcagatg cacttccacg ataacgcaga
4740ggtcggacaa gagggaagaa ctgccgttct ctcgccgttg cgcggcgtag ccgccaagcg
4800ggacgtgtct gacgatgcag cgaagcggag tcggcaggcg cggcacgcgc ctgggcttgt
4860tacatctgcc acaactgtcc gtgaatctct gccagctcct gaaaccgctg gtcagggcct
4920tgcggaatcc gtgaccgctg atgatttttg gtctcattcg ttcccccgcg ctgacgatgt
4980acgcggcgca gctgcttcct tccagtcggt ggctaactgg gatgggcgtg agggtccgag
5040gccgcgtttc gttgtcgcgc ctggcgttgt ccgcttggag gtttgtgatc tcgcacgccg
5100cgaacgaacg gctgaacgtg cgtatctggc tgctcgggct cgggtggata tggcggctgc
5160caggcataac tcgccgtacg acttcgacgt ggacgatgaa gagttggcgg aactggcttc
5220tctgcaaggc ctcgaggacg acgacattgg gggctggtct gcggagaggg aaatagtggg
5280ctggtctgct cgttctcggt cacggatgat cttgcgaatg gcagaactcg actgggctcc
5340catgatggat ttgccgggca ttcctgcgat ggtgaccctc acctatccgg gggactggct
5400tacggttgcc cccaccggcg ctgaggtcaa aaaacatctc cagacgttct tcaaacggtt
5460ccaacgggcc tggggcattg cctggatggg tgcgtggaaa atggagttcc aaagccgagg
5520cgctccgcat tttcacctgt acatggtccc tcctcatggg aaggcaggag actcgcggaa
5580gctgcggcat gatgctgagc tcttgaaatg ggagatagca cgtgcagagg gtgaagaccc
5640aggtcgcagg ccgtatttcc gggaagctcc aagcgatgga ttgaagtttc gtccgtggct
5700ttctgcggtg tgggccgacg tcgtagatca tccggacccc aaggaaaaag aaaagcacgt
5760cagtgccggc actggagtgg actacgcgga gggcacgcga gggtcagatc cgaaaaggct
5820tgcggtgtac ttctccaagc atggaacctt tgccgacaag gaatatcagc acgtagttcc
5880tgctcaatgg cagaaaacgg gtgcgggacc tggcaggttc tggggctacc gcggtttgtc
5940gccggccacg gctgccaccg agatttcctg ggatgagtac ctgcttttat ctcgcacgtt
6000gcgacgattg tcagcgcgaa cgaagatctg ggacccggct ttacgaggcg gtagcggcgg
6060ccacagatgg actaaggcga tgatgcgacg cacggttacc cggcaccgct tggacctcgt
6120gaccggtgag attctgggca cgaagacgcg gaaggttcgg gcgccagtga agaggtttgt
6180ccggacttcg ggatacctgt gtgtcaatga cgggcccgca ctggctcgaa ccctcagccg
6240tcttcgtaca agctgcctga gctagacgcg cggaacgcct ttcggctttg tcttttgctg
6300gatggcgggt tttgggcggc ttctggtgat gcgctgctgc gctccgtggg gagagagacc
6360caacgactga cctatctcta cccaggtgca attcatctcc cgcgctctgt cggctaggta
6420aacgaggtgc tcccgcgcga gcttttccat gtggtcggcc aatgtcagct cggtcaggac
6480aacctgctgt tgttgcgata gttgtgtccg cacgggtcga ttgtcttctg ttgcggcata
6540acggttttcg tcgttcgcgg agagtgcggc taaatgaatt gcatcctcga ttgagcggag
6600catttcgacg cggaacctgg cgatgatgtt gtctctgtct tcattcataa ctgaagcgta
6660ttgggagtgt tgccctccca ccatgtgtgc caatgcaggt gtgaactgag tcacagtttc
6720tcaatagact ccaagtttgt gatcctttta ctcccaaaat ggggcatgat gtgtgcgtgc
6780ctcggttcag gggcgaaagt tcgacacctc gaaagaaggc ctcgacatgg ctttgaaagc
6840tgctggcaac gtgattcctg attcctccgc gtacgagtac cgggcggttc aggtcgagcc
6900gaagatggtc agaaaagacc cggaagaccc gaactctgag cagttccaga agcagaagga
6960cggcacgccg gtgtggtcga tcgactgcat tcgggtcgac cgggcatcag gcaacaaggc
7020aatcgtgacc gtgacggttc cggacgtgat ggaaccggat gttgcggggc cggtggagtt
7080ctccgagatg attgccggtt tctgggtttc gcgcagtggt tcgggcatgt ggttttcggc
7140aagcgccgtc gcttctctct gatcgctgat cgtcgcccct cgaaaggttc ggaaatgtcc
7200aaaggaaaag gcgttgcgct gggtgtgggt gccctcgtgc tcgtgtttgt gctggttgcg
7260gcaggttggc aagcggcgaa cgtgttcagt gatcgttcac agtccgaagc tgtgccgctg
7320agagtgccgg ccgatccgaa gtgggaaaac ggggtgttct cggacgttgc cgggtgcctc
7380gttctctctc cggaagagct ggggccgttc agcggagggc agtacatcga catagtgagg
7440ccagttgagc cggagaggtt ggagcgcgac tgggtgaggt cggctgagtg cgtttcggcg
7500tcgatgaatg tctctgacct gttggtttct gctcttccag agtccacccg tccccccggc
7560gatttcgttc gttcgtggaa agtggcgagt gatgattact gctatgaggg tgataacccg
7620caaggctgca cttctcgtat gccggtttgg gtctctgcaa aaaactggtg gtgca 7675
SEQ ID NO: 95; length 8497; DNA; Artificial; pRET1001
ccgtccacca cccggtgcct ggtctgcgtc
tccctcggct cgttcctcgc ctatcctggt 60gaccagacac cggagcgagc tatgcccagg
gttgcgcagt gacttcgtca ctgcgtaacc 120ctgggcgctc gcctcccatt cgcttcgctc
acaggagggg gccgtcgatg gccgctgacg 180ctgcatctga cgaccggcgg accgaggtcc
gcgccgctgc ttcgcgggcc gctgacgcgg 240ccccggcgaa gcgcacccgc accgtggcgg
tgcggctgac cgatggggag gaggccgcgt 300ggatcgacgc cgcgctggcc gatggccacc
ggcagctcgg ggcgtgggtg cgtgagcggg 360cggtggccgg ctatctcggg aaggtccgcc
cgaagaccgg cagtggaatg tcggcggagg 420cggccgcgga ggtcgccgcg atgcggcagc
agatgacgaa ggtggggaac aacctgaacc 480agatcgcgag ggcgatcaac gccgggcagg
tgccgtcgca gatggccgag tccctgcaga 540aggggtggct ggagaggtgg gggcaggagt
tggggcggat ggcggatcgg ctcgacgcgc 600tcgacgacca gggctgacgt gatcgcgaag
atcagcacgg gcagcgaccc gaaggggttg 660gcggcgtatc tgcacgggcc ggggaaggcc
accccgcaca gctaccgcac cgaggcgggc 720cggctgattg ccggcgggac ggtgatcgcg
ggatcggtgc aggtcaccgc caaaaacccg 780acccggtggg ggcgggactt cgagcgggcc
gccgcgacga acgcgcgggt gggtaagccg 840gtgtggcatt gctcgctgcg gtgcgcgccc
ggggatcggc ggctgaccga taccgagttc 900gcggacatcg cgcagacggt cgccgagcgg
atgggcttcg agagtcatcc gtgggtggcg 960gtgcggcacg acgacgacca catccacctg
gctgtctccc gggtcgattt tcagggcgtg 1020acctggaaga acagcaacga ccggtggaag
gtcgtcgagg tgatgcgcga ggtcgaacgc 1080gcgcacggcc tgatcgaggt ggcgagcccg
gagcgggccc gtggccggca agccagcagc 1140ggcgagcaac gccgcgcggt gcggaccggc
aaggtggcgc agcgggacgg tctgagggaa 1200attgtgaccg ccgcccgcga catcgccgca
ggccagggtg tgggggcgtt cgaagtggcg 1260ctcgtacaga acccgattac ccgagtgcag
gtgcggcgca acgtcgcgaa gacgggccgg 1320atgaatggct acagcttcaa cctgcccggc
tacgtcgacg ccgccgggga gccgatctgg 1380ttgccggcct ccaaactcga ccggggtttg
tcctggtcac agctggaaaa gacgctgacc 1440agaccccgcc cggaccgcct cgccggcgag
gagacggtgc cgcggaagcg gctcgagcgc 1500gccgccgcgt gggagcagcg ccgccgcgag
gtcggcggcg agcagttcgc agctgcccgc 1560tgggagcagg cccgcgcgaa tgttggtgag
acggccgggc ggatccgcgc cgaacagtcc 1620gcggacacga agtggaagca ggtgaacgag
gcgttgacca gccaagaccg ggccgaggag 1680caggctgccg aggcagcgcg ggtcgcctcc
gctgtcatgg gaggccaccc gacaccgcta 1740cgggacatgc tcgccgccca ggagcagcgc
cggaagccgt ggactccgga gcagaaacgc 1800cagtacgcga ccgcaaaagc ccaagcagaa
cgcgccgcga aggccaagga cgccgcgaaa 1860tggaccgagg tcgccggcgg cggctaccag
cgggacgtgc gcgggatgaa cctgcgactg 1920tgggtggctg aggacggcgc ctggtcgatc
acctcgaaga aggaccccga ccgccagtac 1980gccgcaggtc aggccgacac cgtcgcgcag
gcccaagccg cggccacggc cacagcgaaa 2040acgcaggccc aggcgatgtg gaagcaggtc
ccggccgaca agcgcaccga gtcagccacc 2100agagcggtcc ggcgcgtgat cgcggatctc
acccccacca aacccgccga ggtcaaaccc 2160ccggcccgcc gccagggacc aaccatgccg
cagtcggccc cggggtatca gccacccggc 2220cgcgaccgag gtcgagaatc cggaatggga
ctgtgagcag agagcgagaa ggctttcgtg 2280gagcgtaggg aacagacgca ggcctggcga
agcatgtcca agaacaccat cgatcgctag 2340aaggtcggtc gtgcccaggg tgcccaggat
gcgtacataa cgcgcgaaag gtgcatacct 2400cccatagcat cggcgcgtat ggtagggaaa
atgatcttca aacgtattgc tgtggtcgtg 2460ctcgctggtg gggctttggt agtgggaggc
agccaggttg ctggtgctac cacggtttca 2520gctccacagc cgagtccttc agcagcggtg
gtgccgacgg ttcttccacc agtcactttc 2580accgccgctt ctgcgcactg cgaggcccag
tacgcgtcgg attcccggcg atgccgtctg 2640attccacttc cacagggccg agcgatctgc
tgggcggcag ccgctgcccg ttacgcagcg 2700tgccgcgccg gaaactaggt agaacgtgag
catggacgag cttcccacct tcatcgccga 2760cgacatcgtg atggccagaa cgttcgacag
ccctaacggc caggtggtgc tcgaggtgaa 2820cactccgcgg ccgttcgatg ctgcggcccc
ggagggtgac tactgctgca ccttccggat 2880cagcgggaac atggatgccc cttacgacgg
attcggtggc ggcgtcgacg cagtgcaggc 2940gctgctactc gcattggcca tggcacacga
ggaacttcgt caaacttcgc cagagttgac 3000gtttctaggc gagacgaacc tcggtctacc
ggtcttgaac atcaagcccg acaacgcgat 3060cgaagccgtg gtctcattcc ccgctccctg
atgtgacgca ctttcacccc tggcactcat 3120gtaccgaagc tgggactgag aaagggctgc
cgcgtcaccg cttcgcgttg acttgccact 3180gaacgggggc gtgtcccggt cagggcgggg
tgtgacctgg gttcatgaca ccgctaacac 3240gctgcggaaa tgcggattga actagttcat
ttggggaacg atgacctgat gaccggggat 3300cgtgacctac ccatgctgac catcgccgag
gcggtggacg cgacgcagac cagtgagagc 3360acgatcaagc gccgcctgcg gtcgggcgcg
ttcccgaacg cggtccgcac tgccgacggg 3420aagtggatga ttcccctcgg tgacctatca
gcggcagggc tgagaccagg gaaaatggcg 3480aaacctgacc cggtgacccc ttcaaatgac
cgggtccgtg acctggcagc tgagaacgcc 3540gagctccgtc agcgcctggc cgtggccgaa
gccctggcca gcgaacgcaa tcggatcatc 3600gacgtgcagc aacagatgct ccggatgctc
gaagcccggc cggtgtcggc cctggagccc 3660gcggcggttc cagtggcggg tccgccgccg
cccgtcccgg ccgccgatgg tcgggcagct 3720acgggcgccc tggcccggat acgtcgacgg
cttctcggct aggagctgac cgcgtacttg 3780cgtgcgtcgt gcaggagctt tcccaccgtt
ccggtggaga ttcccatctc ctcggcgatc 3840tcgcggtact tcaggccctg ctcgcgcagc
tcgacggccc ggcgacggtt ctcggctgcc 3900cgtgcgagga actggtcccg cggctcggcc
atgatgcgct ggatcgtgcg cgtggaggcc 3960cccatcttct cggccagctc gcgagctgtc
tgcttgcggc ggatcggtcg ttcagcgccc 4020acggtctgcc tcccacaatg cgttccggtc
gaccttcgtc gctcgtttcc ggtttgcctc 4080gcgcttcttc tcactcatct tgcgaccgcg
tgcggcttgt atggcgatga atgtggcctc 4140gtagacagca gggccgtcgg cccacatccg
ggactttgta gtgatccagc gggtaatgga 4200ggccgcgacg gcgcgtagct cgcttgctgg
cagtggatcg ggcctgcctg tgaccgggtt 4260cctgaacgtg gcgttgatct gtgcggcttc
cgcatagatc gcggccccga ggccggtcgg 4320gtcgccccag tggaagcgga tttcgcggta
ggcccaggtg cgtgcggttt cgaacagggc 4380gcagtttcgg ccgaggccga tcgggttctc
acggcgcgat cgggtttgcc gccagcgcgt 4440tggcggcatg tggatgccga gttccgcctc
gagctcggcg agggatcgcc gctcggtgtg 4500cagccaatgg gtgtcccagt caccgtgagt
cgggttcttg gtcatcaggc ccgaatagcc 4560cttgtccccc tggacggcgc gccggaggcc
ttcggtgacg gcggccgcat aggcgagcgg 4620cttacgacgg gcgtactcgg tgcgggtgaa
cggctctgcc agcgcccaca cagcgtgtgc 4680gtgcccgtta cgggggttct ccacgatcgc
gttcggcaga ggatgattcc cggccgccga 4740cagcgcccgc agcgcggcgt ccgggtggtc
aacgtccacg acgagcaggt tgctcaatgc 4800ctgcgggttc gactcgatgt agcggcgatc
cagtgcgtct gatcgccgca tccggtagac 4860gccgtcgagg aaatcgtcgg ttgccagtgg
ccacagcggt agccacagct gttcccaggc 4920gccgcctgtg tgctcttcca ccgcaaccat
ggggaacaca ctcacacaca agatcgattt 4980attccggtac gacacgccag ccaagtcaga
tgtttcggtt tctggagcgg tcctccagac 5040ctttgagatc cgctccagaa acgtccacaa
attattgggg tacgtcgaac caagccttat 5100caggtatccc ggggttccgg gggtgaacac
caccctccga ccggtccaga atccgtcgat 5160ctcacctatc cgctcgaagt ccttgagtca
gtgacaggac cactgctggg ctcccagcgc 5220agaaggcaag tgaaggcaga cgactgcggg
aggtaagtcg ggtacggcat gaggtccttc 5280agaagcggcg tcgacgccag gcccacacgc
acaatccgct tcccacgagg gacaccaccg 5340gtagcgcccc ctgcaaccgg cgcagtgtca
cgaggcgccg gtactgctcg tttgacagga 5400actgcagggt cggtgagctc gcgctgggcg
gatcccacca gtagctcccc gtgccggtaa 5460ccgcttgggg ccaagcgaag acacccaccg
cggcagcgat ggcaatgcac gtggatggga 5520acaccaccca gaaccaggga aatcctggtg
ccggcccgag acgatcccgg cgcggtaaga 5580ccacaccggc caccatcgcc acggcccccg
acgcaacaag caataaccac cccatgagcg 5640gacggtacaa gcgccgacgc cgggtggccg
ttaggtgcgc gccagcccgt gaccggaccg 5700gcgaagcgtg ccgctgggcg gcccgccgtg
gcgcccgtcc cgtgcccgtt ctgaccggtg 5760gtctcggtcg ctcgttcctc gcgtcctcac
ctgccggtca gcccgtgacc ggacctgcag 5820gcatgcaagc ttggcactgg ccgtcgtttt
acaacgtcgt gactgggaaa accctggcgt 5880tacccaactt aatcgccttg cagcacatcc
ccctttcgcc agctggcgta atagcgaaga 5940ggcccgcacc gatcgccctt cccaacagtt
gcgcagcctg aatggcgaat ggcgcctgat 6000gcggtatttt ctccttacgc atctgtgcgg
tatttcacac cgcatatggt gcactctcag 6060tacaatctgc tctgatgccg catagttaag
ccagccccga cacccgccaa cacccgctga 6120cgcgccctga cgggcttgtc tgctcccggc
atccgcttac agacaagctg tgaccgtctc 6180cgggagctgc atgtgtcaga ggttttcacc
gtcatcaccg aaacgcgcga gacgaaaggg 6240cctcgtgata cgcctatttt tataggttaa
tgtcatgata ataatggttt cttagacgtc 6300aggtggcact tttcggggaa atgtgcgcgg
aacccctatt tgtttatttt tctaaataca 6360ttcaaatatg tatccgctca tgagacaata
accctgataa atgcttcaat aatattgaaa 6420aaggaagagt atgagtattc aacatttccg
tgtcgccctt attccctttt ttgcggcatt 6480ttgccttcct gtttttgctc acccagaaac
gctggtgaaa gtaaaagatg ctgaagatca 6540gttgggtgca cgagtgggtt acatcgaact
ggatctcaac agcggtaaga tccttgagag 6600ttttcgcccc gaagaacgtt ttccaatgat
gagcactttt aaagttctgc tatgtggcgc 6660ggtattatcc cgtattgacg ccgggcaaga
gcaactcggt cgccgcatac actattctca 6720gaatgacttg gttgagtact caccagtcac
agaaaagcat cttacggatg gcatgacagt 6780aagagaatta tgcagtgctg ccataaccat
gagtgataac actgcggcca acttacttct 6840gacaacgatc ggaggaccga aggagctaac
cgcttttttg cacaacatgg gggatcatgt 6900aactcgcctt gatcgttggg aaccggagct
gaatgaagcc ataccaaacg acgagcgtga 6960caccacgatg cctgtagcaa tggcaacaac
gttgcgcaaa ctattaactg gcgaactact 7020tactctagct tcccggcaac aattaataga
ctggatggag gcggataaag ttgcaggacc 7080acttctgcgc tcggcccttc cggctggctg
gtttattgct gataaatctg gagccggtga 7140gcgtgggtct cgcggtatca ttgcagcact
ggggccagat ggtaagccct cccgtatcgt 7200agttatctac acgacgggga gtcaggcaac
tatggatgaa cgaaatagac agatcgctga 7260gataggtgcc tcactgatta agcattggta
actgtcagac caagtttact catatatact 7320ttagattgat ttaaaacttc atttttaatt
taaaaggatc taggtgaaga tcctttttga 7380taatctcatg accaaaatcc cttaacgtga
gttttcgttc cactgagcgt cagaccccgt 7440agaaaagatc aaaggatctt cttgagatcc
tttttttctg cgcgtaatct gctgcttgca 7500aacaaaaaaa ccaccgctac cagcggtggt
ttgtttgccg gatcaagagc taccaactct 7560ttttccgaag gtaactggct tcagcagagc
gcagatacca aatactgtcc ttctagtgta 7620gccgtagtta ggccaccact tcaagaactc
tgtagcaccg cctacatacc tcgctctgct 7680aatcctgtta ccagtggctg ctgccagtgg
cgataagtcg tgtcttaccg ggttggactc 7740aagacgatag ttaccggata aggcgcagcg
gtcgggctga acggggggtt cgtgcacaca 7800gcccagcttg gagcgaacga cctacaccga
actgagatac ctacagcgtg agctatgaga 7860aagcgccacg cttcccgaag ggagaaaggc
ggacaggtat ccggtaagcg gcagggtcgg 7920aacaggagag cgcacgaggg agcttccagg
gggaaacgcc tggtatcttt atagtcctgt 7980cgggtttcgc cacctctgac ttgagcgtcg
atttttgtga tgctcgtcag gggggcggag 8040cctatggaaa aacgccagca acgcggcctt
tttacggttc ctggcctttt gctggccttt 8100tgctcacatg ttctttcctg cgttatcccc
tgattctgtg gataaccgta ttaccgcctt 8160tgagtgagct gataccgctc gccgcagccg
aacgaccgag cgcagcgagt cagtgagcga 8220ggaagcggaa gagcgcccaa tacgcaaacc
gcctctcccc gcgcgttggc cgattcatta 8280atgcagctgg cacgacaggt ttcccgactg
gaaagcgggc agtgagcgca acgcaattaa 8340tgtgagttag ctcactcatt aggcacccca
ggctttacac tttatgcttc cggctcgtat 8400gttgtgtgga attgtgagcg gataacaatt
tcacacagga aacagctatg accatgatta 8460cgaattcgag ctcggtaccc ggggatcctc
tagagtc 8497
SEQ ID NO: 96; length 8487; DNA; Artificial; pRET1002
ccgtccacca cccggtgcct ggtctgcgtc tccctcggct cgttcctcgc ctatcctggt
60gaccagacac cggagcgagc tatgcccagg gttgcgcagt gacttcgtca ctgcgtaacc
120ctgggcgctc gcctcccatt cgcttcgctc acaggagggg gccgtcgatg gccgctgacg
180ctgcatctga cgaccggcgg accgaggtcc gcgccgctgc ttcgcgggcc gctgacgcgg
240ccccggcgaa gcgcacccgc accgtggcgg tgcggctgac cgatggggag gaggccgcgt
300ggatcgacgc cgcgctggcc gatggccacc ggcagctcgg ggcgtgggtg cgtgagcggg
360cggtggccgg ctatctcggg aaggtccgcc cgaagaccgg cagtggaatg tcggcggagg
420cggccgcgga ggtcgccgcg atgcggcagc agatgacgaa ggtggggaac aacctgaacc
480agatcgcgag ggcgatcaac gccgggcagg tgccgtcgca gatggccgag tccctgcaga
540aggggtggct ggagaggtgg gggcaggagt tggggcggat ggcggatcgg ctcgacgcgc
600tcgacgacca gggctgacgt gatcgcgaag atcagcacgg gcagcgaccc gaaggggttg
660gcggcgtatc tgcacgggcc ggggaaggcc accccgcaca gctaccgcac cgaggcgggc
720cggctgattg ccggcgggac ggtgatcgcg ggatcggtgc aggtcaccgc caaaaacccg
780acccggtggg ggcgggactt cgagcgggcc gccgcgacga acgcgcgggt gggtaagccg
840gtgtggcatt gctcgctgcg gtgcgcgccc ggggatcggc ggctgaccga taccgagttc
900gcggacatcg cgcagacggt cgccgagcgg atgggcttcg agagtcatcc gtgggtggcg
960gtgcggcacg acgacgacca catccacctg gctgtctccc gggtcgattt tcagggcgtg
1020acctggaaga acagcaacga ccggtggaag gtcgtcgagg tgatgcgcga ggtcgaacgc
1080gcgcacggcc tgatcgaggt ggcgagcccg gagcgggccc gtggccggca agccagcagc
1140ggcgagcaac gccgcgcggt gcggaccggc aaggtggcgc agcgggacgg tctgagggaa
1200attgtgaccg ccgcccgcga catcgccgca ggccagggtg tgggggcgtt cgaagtggcg
1260ctcgtacaga acccgattac ccgagtgcag gtgcggcgca acgtcgcgaa gacgggccgg
1320atgaatggct acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg
1380ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc
1440agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc
1500gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc
1560tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc
1620gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag
1680caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta
1740cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc
1800cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa
1860tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg
1920tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac
1980gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa
2040acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc
2100agagcggtcc ggcgcgtgat cgcggatctc acccccacca aacccgccga ggtcaaaccc
2160ccggcccgcc gccagggacc aaccatgccg cagtcggccc cggggtatca gccacccggc
2220cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag agagcgagaa ggctttcgtg
2280gagcgtaggg aacagacgca ggcctggcga agcatgtcca agaacaccat cgatcgctag
2340aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa cgcgcgaaag gtgcatacct
2400cccatagcat cggcgcgtat ggtagggaaa atgatcttca aacgtattgc tgtggtcgtg
2460ctcgctggtg gggctttggt agtgggaggc agccaggttg ctggtgctac cacggtttca
2520gctccacagc cgagtccttc agcagcggtg gtgccgacgg ttcttccacc agtcactttc
2580accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg attcccggcg atgccgtctg
2640attccacttc cacagggccg agcgatctgc tgggcggcag ccgctgcccg ttacgcagcg
2700tgccgcgccg gaaactaggt agaacgtgag catggacgag cttcccacct tcatcgccga
2760cgacatcgtg atggccagaa cgttcgacag ccctaacggc caggtggtgc tcgaggtgaa
2820cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac tactgctgca ccttccggat
2880cagcgggaac atggatgccc cttacgacgg attcggtggc ggcgtcgacg cagtgcaggc
2940gctgctactc gcattggcca tggcacacga ggaacttcgt caaacttcgc cagagttgac
3000gtttctaggc gagacgaacc tcggtctacc ggtcttgaac atcaagcccg acaacgcgat
3060cgaagccgtg gtctcattcc ccgctccctg atgtgacgca ctttcacccc tggcactcat
3120gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg cttcgcgttg acttgccact
3180gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg gttcatgaca ccgctaacac
3240gctgcggaaa tgcggattga actagttcat ttggggaacg atgacctgat gaccggggat
3300cgtgacctac ccatgctgac catcgccgag gcggtggacg cgacgcagac cagtgagagc
3360acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg cggtccgcac tgccgacggg
3420aagtggatga ttcccctcgg tgacctatca gcggcagggc tgagaccagg gaaaatggcg
3480aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg acctggcagc tgagaacgcc
3540gagctccgtc agcgcctggc cgtggccgaa gccctggcca gcgaacgcaa tcggatcatc
3600gacgtgcagc aacagatgct ccggatgctc gaagcccggc cggtgtcggc cctggagccc
3660gcggcggttc cagtggcggg tccgccgccg cccgtcccgg ccgccgatgg tcgggcagct
3720acgggcgccc tggcccggat acgtcgacgg cttctcggct aggagctgac cgcgtacttg
3780cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga ttcccatctc ctcggcgatc
3840tcgcggtact tcaggccctg ctcgcgcagc tcgacggccc ggcgacggtt ctcggctgcc
3900cgtgcgagga actggtcccg cggctcggcc atgatgcgct ggatcgtgcg cgtggaggcc
3960cccatcttct cggccagctc gcgagctgtc tgcttgcggc ggatcggtcg ttcagcgccc
4020acggtctgcc tcccacaatg cgttccggtc gaccttcgtc gctcgtttcc ggtttgcctc
4080gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt atggcgatga atgtggcctc
4140gtagacagca gggccgtcgg cccacatccg ggactttgta gtgatccagc gggtaatgga
4200ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg ggcctgcctg tgaccgggtt
4260cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc gcggccccga ggccggtcgg
4320gtcgccccag tggaagcgga tttcgcggta ggcccaggtg cgtgcggttt cgaacagggc
4380gcagtttcgg ccgaggccga tcgggttctc acggcgcgat cgggtttgcc gccagcgcgt
4440tggcggcatg tggatgccga gttccgcctc gagctcggcg agggatcgcc gctcggtgtg
4500cagccaatgg gtgtcccagt caccgtgagt cgggttcttg gtcatcaggc ccgaatagcc
4560cttgtccccc tggacggcgc gccggaggcc ttcggtgacg gcggccgcat aggcgagcgg
4620cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc agcgcccaca cagcgtgtgc
4680gtgcccgtta cgggggttct ccacgatcgc gttcggcaga ggatgattcc cggccgccga
4740cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg acgagcaggt tgctcaatgc
4800ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct gatcgccgca tccggtagac
4860gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt agccacagct gttcccaggc
4920gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca ctcacacaca agatcgattt
4980attccggtac gacacgccag ccaagtcaga tgtttcggtt tctggagcgg tcctccagac
5040ctttgagatc cgctccagaa acgtccacaa attattgggg tacgtcgaac caagccttat
5100caggtatccc ggggttccgg gggtgaacac caccctccga ccggtccaga atccgtcgat
5160ctcacctatc cgctcgaagt ccttgagtca gtgacaggac cactgctggg ctcccagcgc
5220agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg ggtacggcat gaggtccttc
5280agaagcggcg tcgacgccag gcccacacgc acaatccgct tcccacgagg gacaccaccg
5340gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg gtactgctcg tttgacagga
5400actgcagggt cggtgagctc gcgctgggcg gatcccacca gtagctcccc gtgccggtaa
5460ccgcttgggg ccaagcgaag acacccaccg cggcagcgat ggcaatgcac gtggatggga
5520acaccaccca gaaccaggga aatcctggtg ccggcccgag acgatcccgg cgcggtaaga
5580ccacaccggc caccatcgcc acggcccccg acgcaacaag caataaccac cccatgagcg
5640gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc gccagcccgt gaccggaccg
5700gcgaagcgtg ccgctgggcg gcccgccgtg gcgcccgtcc cgtgcccgtt ctgaccggtg
5760gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca gcccgtgacc ggacctgcag
5820gcatgcaagc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt gttatccgct
5880cacaattcca cacaacatac gagccggaag cataaagtgt aaagcctggg gtgcctaatg
5940agtgagctaa ctcacattaa ttgcgttgcg ctcactgccc gctttccagt cgggaaacct
6000gtcgtgccag ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg
6060cgaacttttg ctgagttgaa ggatcagatc acgcatcttc ccgacaacgc agaccgttcc
6120gtggcaaagc aaaagttcaa aatcagtaac cgtcagtgcc gataagttca aagttaaacc
6180tggtgttgat accaacattg aaacgctgat cgaaaacgcg ctgaaaaacg ctgctgaatg
6240tgcgagcttc ttccgcttcc tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc
6300gagcggtatc agctcactca aaggcggtaa tacggttatc cacagaatca ggggataacg
6360caggaaagaa catgtgagca aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt
6420tgctggcgtt tttccatagg ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa
6480gtcagaggtg gcgaaacccg acaggactat aaagatacca ggcgtttccc cctggaagct
6540ccctcgtgcg ctctcctgtt ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc
6600cttcgggaag cgtggcgctt tctcaatgct cacgctgtag gtatctcagt tcggtgtagg
6660tcgttcgctc caagctgggc tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct
6720tatccggtaa ctatcgtctt gagtccaacc cggtaagaca cgacttatcg ccactggcag
6780cagccactgg taacaggatt agcagagcga ggtatgtagg cggtgctaca gagttcttga
6840agtggtggcc taactacggc tacactagaa ggacagtatt tggtatctgc gctctgctga
6900agccagttac cttcggaaaa agagttggta gctcttgatc cggcaaacaa accaccgctg
6960gtagcggtgg tttttttgtt tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag
7020aagatccttt gatcttttct acggggtctg acgctcagtg gaacgatccg tcgagaggtc
7080tgcctcgtga agaaggtgtt gctgactcat accaggcctg aatcgcccca tcatccagcc
7140agaaagtgag ggagccacgg ttgatgagag ctttgttgta ggtggaccag ttggtgattt
7200tgaacttttg ctttgccacg gaacggtctg cgttgtcggg aagatgcgtg atctgatcct
7260tcaactcagc aaaagttcga tttattcaac aaagccacgt tgtgtctcaa aatctctgat
7320gttacattgc acaagataaa aatatatcat catgaacaat aaaactgtct gcttacataa
7380acagtaatac aaggggtgtt atgagccata ttcaacggga aacgtcttgc tcgaagccgc
7440gattaaattc caacatggat gctgatttat atgggtataa atgggctcgc gataatgtcg
7500ggcaatcagg tgcgacaatc tatcgattgt atgggaagcc cgatgcgcca gagttgtttc
7560tgaaacatgg caaaggtagc gttgccaatg atgttacaga tgagatggtc agactaaact
7620ggctgacgga atttatgcct cttccgacca tcaagcattt tatccgtact cctgatgatg
7680catggttact caccactgcg atccccggga aaacagcatt ccaggtatta gaagaatatc
7740ctgattcagg tgaaaatatt gttgatgcgc tggcagtgtt cctgcgccgg ttgcattcga
7800ttcctgtttg taattgtcct tttaacagcg atcgcgtatt tcgtctcgct caggcgcaat
7860cacgaatgaa taacggtttg gttgatgcga gtgattttga tgacgagcgt aatggctggc
7920ctgttgaaca agtctggaaa gaaatgcata agcttttgcc attctcaccg gattcagtcg
7980tcactcatgg tgatttctca cttgataacc ttatttttga cgaggggaaa ttaataggtt
8040gtattgatgt tggacgagtc ggaatcgcag accgatacca ggatcttgcc atcctatgga
8100actgcctcgg tgagttttct ccttcattac agaaacggct ttttcaaaaa tatggtattg
8160ataatcctga tatgaataaa ttgcagtttc atttgatgct cgatgagttt ttctaatcag
8220aattggttaa ttggttgtaa cactggcaga gcattacgct gacttgacgg gacggcggct
8280ttgttgaata aatcgcattc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg
8340cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt
8400tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag
8460ctcggtaccc ggggatcctc tagagtc 8487
SEQ ID NO: 97; length 8038; DNA; Artificial; pRET1003
ccgtccacca cccggtgcct ggtctgcgtc
tccctcggct cgttcctcgc ctatcctggt 60gaccagacac cggagcgagc tatgcccagg
gttgcgcagt gacttcgtca ctgcgtaacc 120ctgggcgctc gcctcccatt cgcttcgctc
acaggagggg gccgtcgatg gccgctgacg 180ctgcatctga cgaccggcgg accgaggtcc
gcgccgctgc ttcgcgggcc gctgacgcgg 240ccccggcgaa gcgcacccgc accgtggcgg
tgcggctgac cgatggggag gaggccgcgt 300ggatcgacgc cgcgctggcc gatggccacc
ggcagctcgg ggcgtgggtg cgtgagcggg 360cggtggccgg ctatctcggg aaggtccgcc
cgaagaccgg cagtggaatg tcggcggagg 420cggccgcgga ggtcgccgcg atgcggcagc
agatgacgaa ggtggggaac aacctgaacc 480agatcgcgag ggcgatcaac gccgggcagg
tgccgtcgca gatggccgag tccctgcaga 540aggggtggct ggagaggtgg gggcaggagt
tggggcggat ggcggatcgg ctcgacgcgc 600tcgacgacca gggctgacgt gatcgcgaag
atcagcacgg gcagcgaccc gaaggggttg 660gcggcgtatc tgcacgggcc ggggaaggcc
accccgcaca gctaccgcac cgaggcgggc 720cggctgattg ccggcgggac ggtgatcgcg
ggatcggtgc aggtcaccgc caaaaacccg 780acccggtggg ggcgggactt cgagcgggcc
gccgcgacga acgcgcgggt gggtaagccg 840gtgtggcatt gctcgctgcg gtgcgcgccc
ggggatcggc ggctgaccga taccgagttc 900gcggacatcg cgcagacggt cgccgagcgg
atgggcttcg agagtcatcc gtgggtggcg 960gtgcggcacg acgacgacca catccacctg
gctgtctccc gggtcgattt tcagggcgtg 1020acctggaaga acagcaacga ccggtggaag
gtcgtcgagg tgatgcgcga ggtcgaacgc 1080gcgcacggcc tgatcgaggt ggcgagcccg
gagcgggccc gtggccggca agccagcagc 1140ggcgagcaac gccgcgcggt gcggaccggc
aaggtggcgc agcgggacgg tctgagggaa 1200attgtgaccg ccgcccgcga catcgccgca
ggccagggtg tgggggcgtt cgaagtggcg 1260ctcgtacaga acccgattac ccgagtgcag
gtgcggcgca acgtcgcgaa gacgggccgg 1320atgaatggct acagcttcaa cctgcccggc
tacgtcgacg ccgccgggga gccgatctgg 1380ttgccggcct ccaaactcga ccggggtttg
tcctggtcac agctggaaaa gacgctgacc 1440agaccccgcc cggaccgcct cgccggcgag
gagacggtgc cgcggaagcg gctcgagcgc 1500gccgccgcgt gggagcagcg ccgccgcgag
gtcggcggcg agcagttcgc agctgcccgc 1560tgggagcagg cccgcgcgaa tgttggtgag
acggccgggc ggatccgcgc cgaacagtcc 1620gcggacacga agtggaagca ggtgaacgag
gcgttgacca gccaagaccg ggccgaggag 1680caggctgccg aggcagcgcg ggtcgcctcc
gctgtcatgg gaggccaccc gacaccgcta 1740cgggacatgc tcgccgccca ggagcagcgc
cggaagccgt ggactccgga gcagaaacgc 1800cagtacgcga ccgcaaaagc ccaagcagaa
cgcgccgcga aggccaagga cgccgcgaaa 1860tggaccgagg tcgccggcgg cggctaccag
cgggacgtgc gcgggatgaa cctgcgactg 1920tgggtggctg aggacggcgc ctggtcgatc
acctcgaaga aggaccccga ccgccagtac 1980gccgcaggtc aggccgacac cgtcgcgcag
gcccaagccg cggccacggc cacagcgaaa 2040acgcaggccc aggcgatgtg gaagcaggtc
ccggccgaca agcgcaccga gtcagccacc 2100agagcggtcc ggcgcgtgat cgcggatctc
acccccacca aacccgccga ggtcaaaccc 2160ccggcccgcc gccagggacc aaccatgccg
cagtcggccc cggggtatca gccacccggc 2220cgcgaccgag gtcgagaatc cggaatggga
ctgtgagcag agagcgagaa ggctttcgtg 2280gagcgtaggg aacagacgca ggcctggcga
agcatgtcca agaacaccat cgatcgctag 2340aaggtcggtc gtgcccaggg tgcccaggat
gcgtacataa cgcgcgaaag gtgcatacct 2400cccatagcat cggcgcgtat ggtagggaaa
atgatcttca aacgtattgc tgtggtcgtg 2460ctcgctggtg gggctttggt agtgggaggc
agccaggttg ctggtgctac cacggtttca 2520gctccacagc cgagtccttc agcagcggtg
gtgccgacgg ttcttccacc agtcactttc 2580accgccgctt ctgcgcactg cgaggcccag
tacgcgtcgg attcccggcg atgccgtctg 2640attccacttc cacagggccg agcgatctgc
tgggcggcag ccgctgcccg ttacgcagcg 2700tgccgcgccg gaaactaggt agaacgtgag
catggacgag cttcccacct tcatcgccga 2760cgacatcgtg atggccagaa cgttcgacag
ccctaacggc caggtggtgc tcgaggtgaa 2820cactccgcgg ccgttcgatg ctgcggcccc
ggagggtgac tactgctgca ccttccggat 2880cagcgggaac atggatgccc cttacgacgg
attcggtggc ggcgtcgacg cagtgcaggc 2940gctgctactc gcattggcca tggcacacga
ggaacttcgt caaacttcgc cagagttgac 3000gtttctaggc gagacgaacc tcggtctacc
ggtcttgaac atcaagcccg acaacgcgat 3060cgaagccgtg gtctcattcc ccgctccctg
atgtgacgca ctttcacccc tggcactcat 3120gtaccgaagc tgggactgag aaagggctgc
cgcgtcaccg cttcgcgttg acttgccact 3180gaacgggggc gtgtcccggt cagggcgggg
tgtgacctgg gttcatgaca ccgctaacac 3240gctgcggaaa tgcggattga actagttcat
ttggggaacg atgacctgat gaccggggat 3300cgtgacctac ccatgctgac catcgccgag
gcggtggacg cgacgcagac cagtgagagc 3360acgatcaagc gccgcctgcg gtcgggcgcg
ttcccgaacg cggtccgcac tgccgacggg 3420aagtggatga ttcccctcgg tgacctatca
gcggcagggc tgagaccagg gaaaatggcg 3480aaacctgacc cggtgacccc ttcaaatgac
cgggtccgtg acctggcagc tgagaacgcc 3540gagctccgtc agcgcctggc cgtggccgaa
gccctggcca gcgaacgcaa tcggatcatc 3600gacgtgcagc aacagatgct ccggatgctc
gaagcccggc cggtgtcggc cctggagccc 3660gcggcggttc cagtggcggg tccgccgccg
cccgtcccgg ccgccgatgg tcgggcagct 3720acgggcgccc tggcccggat acgtcgacgg
cttctcggct aggagctgac cgcgtacttg 3780cgtgcgtcgt gcaggagctt tcccaccgtt
ccggtggaga ttcccatctc ctcggcgatc 3840tcgcggtact tcaggccctg ctcgcgcagc
tcgacggccc ggcgacggtt ctcggctgcc 3900cgtgcgagga actggtcccg cggctcggcc
atgatgcgct ggatcgtgcg cgtggaggcc 3960cccatcttct cggccagctc gcgagctgtc
tgcttgcggc ggatcggtcg ttcagcgccc 4020acggtctgcc tcccacaatg cgttccggtc
gaccttcgtc gctcgtttcc ggtttgcctc 4080gcgcttcttc tcactcatct tgcgaccgcg
tgcggcttgt atggcgatga atgtggcctc 4140gtagacagca gggccgtcgg cccacatccg
ggactttgta gtgatccagc gggtaatgga 4200ggccgcgacg gcgcgtagct cgcttgctgg
cagtggatcg ggcctgcctg tgaccgggtt 4260cctgaacgtg gcgttgatct gtgcggcttc
cgcatagatc gcggccccga ggccggtcgg 4320gtcgccccag tggaagcgga tttcgcggta
ggcccaggtg cgtgcggttt cgaacagggc 4380gcagtttcgg ccgaggccga tcgggttctc
acggcgcgat cgggtttgcc gccagcgcgt 4440tggcggcatg tggatgccga gttccgcctc
gagctcggcg agggatcgcc gctcggtgtg 4500cagccaatgg gtgtcccagt caccgtgagt
cgggttcttg gtcatcaggc ccgaatagcc 4560cttgtccccc tggacggcgc gccggaggcc
ttcggtgacg gcggccgcat aggcgagcgg 4620cttacgacgg gcgtactcgg tgcgggtgaa
cggctctgcc agcgcccaca cagcgtgtgc 4680gtgcccgtta cgggggttct ccacgatcgc
gttcggcaga ggatgattcc cggccgccga 4740cagcgcccgc agcgcggcgt ccgggtggtc
aacgtccacg acgagcaggt tgctcaatgc 4800ctgcgggttc gactcgatgt agcggcgatc
cagtgcgtct gatcgccgca tccggtagac 4860gccgtcgagg aaatcgtcgg ttgccagtgg
ccacagcggt agccacagct gttcccaggc 4920gccgcctgtg tgctcttcca ccgcaaccat
ggggaacaca ctcacacaca agatcgattt 4980attccggtac gacacgccag ccaagtcaga
tgtttcggtt tctggagcgg tcctccagac 5040ctttgagatc cgctccagaa acgtccacaa
attattgggg tacgtcgaac caagccttat 5100caggtatccc ggggttccgg gggtgaacac
caccctccga ccggtccaga atccgtcgat 5160ctcacctatc cgctcgaagt ccttgagtca
gtgacaggac cactgctggg ctcccagcgc 5220agaaggcaag tgaaggcaga cgactgcggg
aggtaagtcg ggtacggcat gaggtccttc 5280agaagcggcg tcgacgccag gcccacacgc
acaatccgct tcccacgagg gacaccaccg 5340gtagcgcccc ctgcaaccgg cgcagtgtca
cgaggcgccg gtactgctcg tttgacagga 5400actgcagggt cggtgagctc gcgctgggcg
gatcccacca gtagctcccc gtgccggtaa 5460ccgcttgggg ccaagcgaag acacccaccg
cggcagcgat ggcaatgcac gtggatggga 5520acaccaccca gaaccaggga aatcctggtg
ccggcccgag acgatcccgg cgcggtaaga 5580ccacaccggc caccatcgcc acggcccccg
acgcaacaag caataaccac cccatgagcg 5640gacggtacaa gcgccgacgc cgggtggccg
ttaggtgcgc gccagcccgt gaccggaccg 5700gcgaagcgtg ccgctgggcg gcccgccgtg
gcgcccgtcc cgtgcccgtt ctgaccggtg 5760gtctcggtcg ctcgttcctc gcgtcctcac
ctgccggtca gcccgtgacc ggacctgcag 5820gcatgcaagc ttggcactgg ccgtcgtttt
acaacgtcgt gactgggaaa accctggcgt 5880tacccaactt aatcgccttg cagcacatcc
ccctttcgcc agctggcgta atagcgaaga 5940ggcccgcacc gatcgccctt cccaacagtt
gcgcagcctg aatggcgaat gagcttcttc 6000cgcttcctcg ctcactgact cgctgcgctc
ggtcgttcgg ctgcggcgag cggtatcagc 6060tcactcaaag gcggtaatac ggttatccac
agaatcaggg gataacgcag gaaagaacat 6120gtgagcaaaa ggccagcaaa aggccaggaa
ccgtaaaaag gccgcgttgc tggcgttttt 6180ccataggctc cgcccccctg acgagcatca
caaaaatcga cgctcaagtc agaggtggcg 6240aaacccgaca ggactataaa gataccaggc
gtttccccct ggaagctccc tcgtgcgctc 6300tcctgttccg accctgccgc ttaccggata
cctgtccgcc tttctccctt cgggaagcgt 6360ggcgctttct caatgctcac gctgtaggta
tctcagttcg gtgtaggtcg ttcgctccaa 6420gctgggctgt gtgcacgaac cccccgttca
gcccgaccgc tgcgccttat ccggtaacta 6480tcgtcttgag tccaacccgg taagacacga
cttatcgcca ctggcagcag ccactggtaa 6540caggattagc agagcgaggt atgtaggcgg
tgctacagag ttcttgaagt ggtggcctaa 6600ctacggctac actagaagga cagtatttgg
tatctgcgct ctgctgaagc cagttacctt 6660cggaaaaaga gttggtagct cttgatccgg
caaacaaacc accgctggta gcggtggttt 6720ttttgtttgc aagcagcaga ttacgcgcag
aaaaaaagga tctcaagaag atcctttgat 6780cttttctacg gggtctgacg ctcagtggaa
ctccgtcgaa cggaagatca cttcgcagaa 6840taaataaatc ctggtgtccc tgttgatacc
gggaagccct gggccaactt ttggcgaaaa 6900tgagacgttg atcggcacgt aagaggttcc
aactttcacc ataatgaaat aagatcacta 6960ccgggcgtat tttttgagtt atcgagattt
tcaggagcta aggaagctaa aatggagaaa 7020aaaatcactg gatataccac cgttgatata
tcccaatggc atcgtaaaga acattttgag 7080gcatttcagt cagttgctca atgtacctat
aaccagaccg ttcagctgga tattacggcc 7140tttttaaaga ccgtaaagaa aaataagcac
aagttttatc cggcctttat tcacattctt 7200gcccgcctga tgaatgctca tccggaattt
cgtatggcaa tgaaagacgg tgagctggtg 7260atatgggata gtgttcaccc ttgttacacc
gttttccatg agcaaactga aacgttttca 7320tcgctctgga gtgaatacca cgacgatttc
cggcagtttc tacacatata ttcgcaagat 7380gtggcgtgtt acggtgaaaa cctggcctat
ttccctaaag ggtttattga gaatatgttt 7440ttcgtctcag ccaatccctg ggtgagtttc
accagttttg atttaaacgt ggccaatatg 7500gacaacttct tcgcccccgt tttcaccatg
ggcaaatatt atacgcaagg cgacaaggtg 7560ctgatgccgc tggcgattca ggttcatcat
gccgtctgtg atggcttcca tgtcggcaga 7620atgcttaatg aattacaaca gtactgcgat
gagtggcagg gcggggcgta atttttttaa 7680ggcagttatt ggtgccctta aacgcctggt
gctacgcctg aataagtgat aataagcgga 7740tgaatggcag aaattcagct tggcccagtg
ccaagctcca atacgcaaac cgcctctccc 7800cgcgcgttgg ccgattcatt aatgcagctg
gcacgacagg tttcccgact ggaaagcggg 7860cagtgagcgc aacgcaatta atgtgagtta
gctcactcat taggcacccc aggctttaca 7920ctttatgctt ccggctcgta tgttgtgtgg
aattgtgagc ggataacaat ttcacacagg 7980aaacagctat gaccatgatt acgaattcga
gctcggtacc cggggatcct ctagagtc 8038
98 8497 DNA Artificial pRET1001Rv
98 ccgtccacca cccggtgcct ggtctgcgtc tccctcggct cgttcctcgc ctatcctggt
60gaccagacac cggagcgagc tatgcccagg gttgcgcagt gacttcgtca ctgcgtaacc
120ctgggcgctc gcctcccatt cgcttcgctc acaggagggg gccgtcgatg gccgctgacg
180ctgcatctga cgaccggcgg accgaggtcc gcgccgctgc ttcgcgggcc gctgacgcgg
240ccccggcgaa gcgcacccgc accgtggcgg tgcggctgac cgatggggag gaggccgcgt
300ggatcgacgc cgcgctggcc gatggccacc ggcagctcgg ggcgtgggtg cgtgagcggg
360cggtggccgg ctatctcggg aaggtccgcc cgaagaccgg cagtggaatg tcggcggagg
420cggccgcgga ggtcgccgcg atgcggcagc agatgacgaa ggtggggaac aacctgaacc
480agatcgcgag ggcgatcaac gccgggcagg tgccgtcgca gatggccgag tccctgcaga
540aggggtggct ggagaggtgg gggcaggagt tggggcggat ggcggatcgg ctcgacgcgc
600tcgacgacca gggctgacgt gatcgcgaag atcagcacgg gcagcgaccc gaaggggttg
660gcggcgtatc tgcacgggcc ggggaaggcc accccgcaca gctaccgcac cgaggcgggc
720cggctgattg ccggcgggac ggtgatcgcg ggatcggtgc aggtcaccgc caaaaacccg
780acccggtggg ggcgggactt cgagcgggcc gccgcgacga acgcgcgggt gggtaagccg
840gtgtggcatt gctcgctgcg gtgcgcgccc ggggatcggc ggctgaccga taccgagttc
900gcggacatcg cgcagacggt cgccgagcgg atgggcttcg agagtcatcc gtgggtggcg
960gtgcggcacg acgacgacca catccacctg gctgtctccc gggtcgattt tcagggcgtg
1020acctggaaga acagcaacga ccggtggaag gtcgtcgagg tgatgcgcga ggtcgaacgc
1080gcgcacggcc tgatcgaggt ggcgagcccg gagcgggccc gtggccggca agccagcagc
1140ggcgagcaac gccgcgcggt gcggaccggc aaggtggcgc agcgggacgg tctgagggaa
1200attgtgaccg ccgcccgcga catcgccgca ggccagggtg tgggggcgtt cgaagtggcg
1260ctcgtacaga acccgattac ccgagtgcag gtgcggcgca acgtcgcgaa gacgggccgg
1320atgaatggct acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg
1380ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc
1440agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc
1500gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc
1560tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc
1620gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag
1680caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta
1740cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc
1800cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa
1860tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg
1920tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac
1980gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa
2040acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc
2100agagcggtcc ggcgcgtgat cgcggatctc acccccacca aacccgccga ggtcaaaccc
2160ccggcccgcc gccagggacc aaccatgccg cagtcggccc cggggtatca gccacccggc
2220cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag agagcgagaa ggctttcgtg
2280gagcgtaggg aacagacgca ggcctggcga agcatgtcca agaacaccat cgatcgctag
2340aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa cgcgcgaaag gtgcatacct
2400cccatagcat cggcgcgtat ggtagggaaa atgatcttca aacgtattgc tgtggtcgtg
2460ctcgctggtg gggctttggt agtgggaggc agccaggttg ctggtgctac cacggtttca
2520gctccacagc cgagtccttc agcagcggtg gtgccgacgg ttcttccacc agtcactttc
2580accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg attcccggcg atgccgtctg
2640attccacttc cacagggccg agcgatctgc tgggcggcag ccgctgcccg ttacgcagcg
2700tgccgcgccg gaaactaggt agaacgtgag catggacgag cttcccacct tcatcgccga
2760cgacatcgtg atggccagaa cgttcgacag ccctaacggc caggtggtgc tcgaggtgaa
2820cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac tactgctgca ccttccggat
2880cagcgggaac atggatgccc cttacgacgg attcggtggc ggcgtcgacg cagtgcaggc
2940gctgctactc gcattggcca tggcacacga ggaacttcgt caaacttcgc cagagttgac
3000gtttctaggc gagacgaacc tcggtctacc ggtcttgaac atcaagcccg acaacgcgat
3060cgaagccgtg gtctcattcc ccgctccctg atgtgacgca ctttcacccc tggcactcat
3120gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg cttcgcgttg acttgccact
3180gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg gttcatgaca ccgctaacac
3240gctgcggaaa tgcggattga actagttcat ttggggaacg atgacctgat gaccggggat
3300cgtgacctac ccatgctgac catcgccgag gcggtggacg cgacgcagac cagtgagagc
3360acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg cggtccgcac tgccgacggg
3420aagtggatga ttcccctcgg tgacctatca gcggcagggc tgagaccagg gaaaatggcg
3480aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg acctggcagc tgagaacgcc
3540gagctccgtc agcgcctggc cgtggccgaa gccctggcca gcgaacgcaa tcggatcatc
3600gacgtgcagc aacagatgct ccggatgctc gaagcccggc cggtgtcggc cctggagccc
3660gcggcggttc cagtggcggg tccgccgccg cccgtcccgg ccgccgatgg tcgggcagct
3720acgggcgccc tggcccggat acgtcgacgg cttctcggct aggagctgac cgcgtacttg
3780cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga ttcccatctc ctcggcgatc
3840tcgcggtact tcaggccctg ctcgcgcagc tcgacggccc ggcgacggtt ctcggctgcc
3900cgtgcgagga actggtcccg cggctcggcc atgatgcgct ggatcgtgcg cgtggaggcc
3960cccatcttct cggccagctc gcgagctgtc tgcttgcggc ggatcggtcg ttcagcgccc
4020acggtctgcc tcccacaatg cgttccggtc gaccttcgtc gctcgtttcc ggtttgcctc
4080gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt atggcgatga atgtggcctc
4140gtagacagca gggccgtcgg cccacatccg ggactttgta gtgatccagc gggtaatgga
4200ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg ggcctgcctg tgaccgggtt
4260cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc gcggccccga ggccggtcgg
4320gtcgccccag tggaagcgga tttcgcggta ggcccaggtg cgtgcggttt cgaacagggc
4380gcagtttcgg ccgaggccga tcgggttctc acggcgcgat cgggtttgcc gccagcgcgt
4440tggcggcatg tggatgccga gttccgcctc gagctcggcg agggatcgcc gctcggtgtg
4500cagccaatgg gtgtcccagt caccgtgagt cgggttcttg gtcatcaggc ccgaatagcc
4560cttgtccccc tggacggcgc gccggaggcc ttcggtgacg gcggccgcat aggcgagcgg
4620cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc agcgcccaca cagcgtgtgc
4680gtgcccgtta cgggggttct ccacgatcgc gttcggcaga ggatgattcc cggccgccga
4740cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg acgagcaggt tgctcaatgc
4800ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct gatcgccgca tccggtagac
4860gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt agccacagct gttcccaggc
4920gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca ctcacacaca agatcgattt
4980attccggtac gacacgccag ccaagtcaga tgtttcggtt tctggagcgg tcctccagac
5040ctttgagatc cgctccagaa acgtccacaa attattgggg tacgtcgaac caagccttat
5100caggtatccc ggggttccgg gggtgaacac caccctccga ccggtccaga atccgtcgat
5160ctcacctatc cgctcgaagt ccttgagtca gtgacaggac cactgctggg ctcccagcgc
5220agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg ggtacggcat gaggtccttc
5280agaagcggcg tcgacgccag gcccacacgc acaatccgct tcccacgagg gacaccaccg
5340gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg gtactgctcg tttgacagga
5400actgcagggt cggtgagctc gcgctgggcg gatcccacca gtagctcccc gtgccggtaa
5460ccgcttgggg ccaagcgaag acacccaccg cggcagcgat ggcaatgcac gtggatggga
5520acaccaccca gaaccaggga aatcctggtg ccggcccgag acgatcccgg cgcggtaaga
5580ccacaccggc caccatcgcc acggcccccg acgcaacaag caataaccac cccatgagcg
5640gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc gccagcccgt gaccggaccg
5700gcgaagcgtg ccgctgggcg gcccgccgtg gcgcccgtcc cgtgcccgtt ctgaccggtg
5760gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca gcccgtgacc ggactctaga
5820ggatccccgg gtaccgagct cgaattcgta atcatggtca tagctgtttc ctgtgtgaaa
5880ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt gtaaagcctg
5940gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc ccgctttcca
6000gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg ggagaggcgg
6060tttgcgtatt gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg
6120gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg
6180ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa
6240ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc acaaaaatcg
6300acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc
6360tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc
6420ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc
6480ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg
6540ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc
6600actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga
6660gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc
6720tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac
6780caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg
6840atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc
6900acgttaaggg attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa
6960ttaaaaatga agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta
7020ccaatgctta atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt
7080tgcctgactc cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag
7140tgctgcaatg ataccgcgag acccacgctc accggctcca gatttatcag caataaacca
7200gccagccgga agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc
7260tattaattgt tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt
7320tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag
7380ctccggttcc caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt
7440tagctccttc ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat
7500ggttatggca gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt
7560gactggtgag tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc
7620ttgcccggcg tcaatacggg ataataccgc gccacatagc agaactttaa aagtgctcat
7680cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag
7740ttcgatgtaa cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt
7800ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg
7860gaaatgttga atactcatac tcttcctttt tcaatattat tgaagcattt atcagggtta
7920ttgtctcatg agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc
7980gcgcacattt ccccgaaaag tgccacctga cgtctaagaa accattatta tcatgacatt
8040aacctataaa aataggcgta tcacgaggcc ctttcgtctc gcgcgtttcg gtgatgacgg
8100tgaaaacctc tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc
8160cgggagcaga caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct
8220taactatgcg gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc
8280gcacagatgc gtaaggagaa aataccgcat caggcgccat tcgccattca ggctgcgcaa
8340ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta cgccagctgg cgaaaggggg
8400atgtgctgca aggcgattaa gttgggtaac gccagggttt tcccagtcac gacgttgtaa
8460aacgacggcc agtgccaagc ttgcatgcct gcaggtc
8497
99 8487 DNA Artificial pRET1002Rv
99 ccgtccacca cccggtgcct ggtctgcgtc
tccctcggct cgttcctcgc ctatcctggt 60gaccagacac cggagcgagc tatgcccagg
gttgcgcagt gacttcgtca ctgcgtaacc 120ctgggcgctc gcctcccatt cgcttcgctc
acaggagggg gccgtcgatg gccgctgacg 180ctgcatctga cgaccggcgg accgaggtcc
gcgccgctgc ttcgcgggcc gctgacgcgg 240ccccggcgaa gcgcacccgc accgtggcgg
tgcggctgac cgatggggag gaggccgcgt 300ggatcgacgc cgcgctggcc gatggccacc
ggcagctcgg ggcgtgggtg cgtgagcggg 360cggtggccgg ctatctcggg aaggtccgcc
cgaagaccgg cagtggaatg tcggcggagg 420cggccgcgga ggtcgccgcg atgcggcagc
agatgacgaa ggtggggaac aacctgaacc 480agatcgcgag ggcgatcaac gccgggcagg
tgccgtcgca gatggccgag tccctgcaga 540aggggtggct ggagaggtgg gggcaggagt
tggggcggat ggcggatcgg ctcgacgcgc 600tcgacgacca gggctgacgt gatcgcgaag
atcagcacgg gcagcgaccc gaaggggttg 660gcggcgtatc tgcacgggcc ggggaaggcc
accccgcaca gctaccgcac cgaggcgggc 720cggctgattg ccggcgggac ggtgatcgcg
ggatcggtgc aggtcaccgc caaaaacccg 780acccggtggg ggcgggactt cgagcgggcc
gccgcgacga acgcgcgggt gggtaagccg 840gtgtggcatt gctcgctgcg gtgcgcgccc
ggggatcggc ggctgaccga taccgagttc 900gcggacatcg cgcagacggt cgccgagcgg
atgggcttcg agagtcatcc gtgggtggcg 960gtgcggcacg acgacgacca catccacctg
gctgtctccc gggtcgattt tcagggcgtg 1020acctggaaga acagcaacga ccggtggaag
gtcgtcgagg tgatgcgcga ggtcgaacgc 1080gcgcacggcc tgatcgaggt ggcgagcccg
gagcgggccc gtggccggca agccagcagc 1140ggcgagcaac gccgcgcggt gcggaccggc
aaggtggcgc agcgggacgg tctgagggaa 1200attgtgaccg ccgcccgcga catcgccgca
ggccagggtg tgggggcgtt cgaagtggcg 1260ctcgtacaga acccgattac ccgagtgcag
gtgcggcgca acgtcgcgaa gacgggccgg 1320atgaatggct acagcttcaa cctgcccggc
tacgtcgacg ccgccgggga gccgatctgg 1380ttgccggcct ccaaactcga ccggggtttg
tcctggtcac agctggaaaa gacgctgacc 1440agaccccgcc cggaccgcct cgccggcgag
gagacggtgc cgcggaagcg gctcgagcgc 1500gccgccgcgt gggagcagcg ccgccgcgag
gtcggcggcg agcagttcgc agctgcccgc 1560tgggagcagg cccgcgcgaa tgttggtgag
acggccgggc ggatccgcgc cgaacagtcc 1620gcggacacga agtggaagca ggtgaacgag
gcgttgacca gccaagaccg ggccgaggag 1680caggctgccg aggcagcgcg ggtcgcctcc
gctgtcatgg gaggccaccc gacaccgcta 1740cgggacatgc tcgccgccca ggagcagcgc
cggaagccgt ggactccgga gcagaaacgc 1800cagtacgcga ccgcaaaagc ccaagcagaa
cgcgccgcga aggccaagga cgccgcgaaa 1860tggaccgagg tcgccggcgg cggctaccag
cgggacgtgc gcgggatgaa cctgcgactg 1920tgggtggctg aggacggcgc ctggtcgatc
acctcgaaga aggaccccga ccgccagtac 1980gccgcaggtc aggccgacac cgtcgcgcag
gcccaagccg cggccacggc cacagcgaaa 2040acgcaggccc aggcgatgtg gaagcaggtc
ccggccgaca agcgcaccga gtcagccacc 2100agagcggtcc ggcgcgtgat cgcggatctc
acccccacca aacccgccga ggtcaaaccc 2160ccggcccgcc gccagggacc aaccatgccg
cagtcggccc cggggtatca gccacccggc 2220cgcgaccgag gtcgagaatc cggaatggga
ctgtgagcag agagcgagaa ggctttcgtg 2280gagcgtaggg aacagacgca ggcctggcga
agcatgtcca agaacaccat cgatcgctag 2340aaggtcggtc gtgcccaggg tgcccaggat
gcgtacataa cgcgcgaaag gtgcatacct 2400cccatagcat cggcgcgtat ggtagggaaa
atgatcttca aacgtattgc tgtggtcgtg 2460ctcgctggtg gggctttggt agtgggaggc
agccaggttg ctggtgctac cacggtttca 2520gctccacagc cgagtccttc agcagcggtg
gtgccgacgg ttcttccacc agtcactttc 2580accgccgctt ctgcgcactg cgaggcccag
tacgcgtcgg attcccggcg atgccgtctg 2640attccacttc cacagggccg agcgatctgc
tgggcggcag ccgctgcccg ttacgcagcg 2700tgccgcgccg gaaactaggt agaacgtgag
catggacgag cttcccacct tcatcgccga 2760cgacatcgtg atggccagaa cgttcgacag
ccctaacggc caggtggtgc tcgaggtgaa 2820cactccgcgg ccgttcgatg ctgcggcccc
ggagggtgac tactgctgca ccttccggat 2880cagcgggaac atggatgccc cttacgacgg
attcggtggc ggcgtcgacg cagtgcaggc 2940gctgctactc gcattggcca tggcacacga
ggaacttcgt caaacttcgc cagagttgac 3000gtttctaggc gagacgaacc tcggtctacc
ggtcttgaac atcaagcccg acaacgcgat 3060cgaagccgtg gtctcattcc ccgctccctg
atgtgacgca ctttcacccc tggcactcat 3120gtaccgaagc tgggactgag aaagggctgc
cgcgtcaccg cttcgcgttg acttgccact 3180gaacgggggc gtgtcccggt cagggcgggg
tgtgacctgg gttcatgaca ccgctaacac 3240gctgcggaaa tgcggattga actagttcat
ttggggaacg atgacctgat gaccggggat 3300cgtgacctac ccatgctgac catcgccgag
gcggtggacg cgacgcagac cagtgagagc 3360acgatcaagc gccgcctgcg gtcgggcgcg
ttcccgaacg cggtccgcac tgccgacggg 3420aagtggatga ttcccctcgg tgacctatca
gcggcagggc tgagaccagg gaaaatggcg 3480aaacctgacc cggtgacccc ttcaaatgac
cgggtccgtg acctggcagc tgagaacgcc 3540gagctccgtc agcgcctggc cgtggccgaa
gccctggcca gcgaacgcaa tcggatcatc 3600gacgtgcagc aacagatgct ccggatgctc
gaagcccggc cggtgtcggc cctggagccc 3660gcggcggttc cagtggcggg tccgccgccg
cccgtcccgg ccgccgatgg tcgggcagct 3720acgggcgccc tggcccggat acgtcgacgg
cttctcggct aggagctgac cgcgtacttg 3780cgtgcgtcgt gcaggagctt tcccaccgtt
ccggtggaga ttcccatctc ctcggcgatc 3840tcgcggtact tcaggccctg ctcgcgcagc
tcgacggccc ggcgacggtt ctcggctgcc 3900cgtgcgagga actggtcccg cggctcggcc
atgatgcgct ggatcgtgcg cgtggaggcc 3960cccatcttct cggccagctc gcgagctgtc
tgcttgcggc ggatcggtcg ttcagcgccc 4020acggtctgcc tcccacaatg cgttccggtc
gaccttcgtc gctcgtttcc ggtttgcctc 4080gcgcttcttc tcactcatct tgcgaccgcg
tgcggcttgt atggcgatga atgtggcctc 4140gtagacagca gggccgtcgg cccacatccg
ggactttgta gtgatccagc gggtaatgga 4200ggccgcgacg gcgcgtagct cgcttgctgg
cagtggatcg ggcctgcctg tgaccgggtt 4260cctgaacgtg gcgttgatct gtgcggcttc
cgcatagatc gcggccccga ggccggtcgg 4320gtcgccccag tggaagcgga tttcgcggta
ggcccaggtg cgtgcggttt cgaacagggc 4380gcagtttcgg ccgaggccga tcgggttctc
acggcgcgat cgggtttgcc gccagcgcgt 4440tggcggcatg tggatgccga gttccgcctc
gagctcggcg agggatcgcc gctcggtgtg 4500cagccaatgg gtgtcccagt caccgtgagt
cgggttcttg gtcatcaggc ccgaatagcc 4560cttgtccccc tggacggcgc gccggaggcc
ttcggtgacg gcggccgcat aggcgagcgg 4620cttacgacgg gcgtactcgg tgcgggtgaa
cggctctgcc agcgcccaca cagcgtgtgc 4680gtgcccgtta cgggggttct ccacgatcgc
gttcggcaga ggatgattcc cggccgccga 4740cagcgcccgc agcgcggcgt ccgggtggtc
aacgtccacg acgagcaggt tgctcaatgc 4800ctgcgggttc gactcgatgt agcggcgatc
cagtgcgtct gatcgccgca tccggtagac 4860gccgtcgagg aaatcgtcgg ttgccagtgg
ccacagcggt agccacagct gttcccaggc 4920gccgcctgtg tgctcttcca ccgcaaccat
ggggaacaca ctcacacaca agatcgattt 4980attccggtac gacacgccag ccaagtcaga
tgtttcggtt tctggagcgg tcctccagac 5040ctttgagatc cgctccagaa acgtccacaa
attattgggg tacgtcgaac caagccttat 5100caggtatccc ggggttccgg gggtgaacac
caccctccga ccggtccaga atccgtcgat 5160ctcacctatc cgctcgaagt ccttgagtca
gtgacaggac cactgctggg ctcccagcgc 5220agaaggcaag tgaaggcaga cgactgcggg
aggtaagtcg ggtacggcat gaggtccttc 5280agaagcggcg tcgacgccag gcccacacgc
acaatccgct tcccacgagg gacaccaccg 5340gtagcgcccc ctgcaaccgg cgcagtgtca
cgaggcgccg gtactgctcg tttgacagga 5400actgcagggt cggtgagctc gcgctgggcg
gatcccacca gtagctcccc gtgccggtaa 5460ccgcttgggg ccaagcgaag acacccaccg
cggcagcgat ggcaatgcac gtggatggga 5520acaccaccca gaaccaggga aatcctggtg
ccggcccgag acgatcccgg cgcggtaaga 5580ccacaccggc caccatcgcc acggcccccg
acgcaacaag caataaccac cccatgagcg 5640gacggtacaa gcgccgacgc cgggtggccg
ttaggtgcgc gccagcccgt gaccggaccg 5700gcgaagcgtg ccgctgggcg gcccgccgtg
gcgcccgtcc cgtgcccgtt ctgaccggtg 5760gtctcggtcg ctcgttcctc gcgtcctcac
ctgccggtca gcccgtgacc ggactctaga 5820ggatccccgg gtaccgagct cgaattcact
ggccgtcgtt ttacaacgtc gtgactggga 5880aaaccctggc gttacccaac ttaatcgcct
tgcagcacat ccccctttcg ccagctggcg 5940taatagcgaa gaggcccgca ccgatcgccc
ttcccaacag ttgcgcagcc tgaatggcga 6000atgcgattta ttcaacaaag ccgccgtccc
gtcaagtcag cgtaatgctc tgccagtgtt 6060acaaccaatt aaccaattct gattagaaaa
actcatcgag catcaaatga aactgcaatt 6120tattcatatc aggattatca ataccatatt
tttgaaaaag ccgtttctgt aatgaaggag 6180aaaactcacc gaggcagttc cataggatgg
caagatcctg gtatcggtct gcgattccga 6240ctcgtccaac atcaatacaa cctattaatt
tcccctcgtc aaaaataagg ttatcaagtg 6300agaaatcacc atgagtgacg actgaatccg
gtgagaatgg caaaagctta tgcatttctt 6360tccagacttg ttcaacaggc cagccattac
gctcgtcatc aaaatcactc gcatcaacca 6420aaccgttatt cattcgtgat tgcgcctgag
cgagacgaaa tacgcgatcg ctgttaaaag 6480gacaattaca aacaggaatc gaatgcaacc
ggcgcaggaa cactgccagc gcatcaacaa 6540tattttcacc tgaatcagga tattcttcta
atacctggaa tgctgttttc ccggggatcg 6600cagtggtgag taaccatgca tcatcaggag
tacggataaa atgcttgatg gtcggaagag 6660gcataaattc cgtcagccag tttagtctga
ccatctcatc tgtaacatca ttggcaacgc 6720tacctttgcc atgtttcaga aacaactctg
gcgcatcggg cttcccatac aatcgataga 6780ttgtcgcacc tgattgcccg acattatcgc
gagcccattt atacccatat aaatcagcat 6840ccatgttgga atttaatcgc ggcttcgagc
aagacgtttc ccgttgaata tggctcataa 6900caccccttgt attactgttt atgtaagcag
acagttttat tgttcatgat gatatatttt 6960tatcttgtgc aatgtaacat cagagatttt
gagacacaac gtggctttgt tgaataaatc 7020gaacttttgc tgagttgaag gatcagatca
cgcatcttcc cgacaacgca gaccgttccg 7080tggcaaagca aaagttcaaa atcaccaact
ggtccaccta caacaaagct ctcatcaacc 7140gtggctccct cactttctgg ctggatgatg
gggcgattca ggcctggtat gagtcagcaa 7200caccttcttc acgaggcaga cctctcgacg
gatcgttcca ctgagcgtca gaccccgtag 7260aaaagatcaa aggatcttct tgagatcctt
tttttctgcg cgtaatctgc tgcttgcaaa 7320caaaaaaacc accgctacca gcggtggttt
gtttgccgga tcaagagcta ccaactcttt 7380ttccgaaggt aactggcttc agcagagcgc
agataccaaa tactgtcctt ctagtgtagc 7440cgtagttagg ccaccacttc aagaactctg
tagcaccgcc tacatacctc gctctgctaa 7500tcctgttacc agtggctgct gccagtggcg
ataagtcgtg tcttaccggg ttggactcaa 7560gacgatagtt accggataag gcgcagcggt
cgggctgaac ggggggttcg tgcacacagc 7620ccagcttgga gcgaacgacc tacaccgaac
tgagatacct acagcgtgag cattgagaaa 7680gcgccacgct tcccgaaggg agaaaggcgg
acaggtatcc ggtaagcggc agggtcggaa 7740caggagagcg cacgagggag cttccagggg
gaaacgcctg gtatctttat agtcctgtcg 7800ggtttcgcca cctctgactt gagcgtcgat
ttttgtgatg ctcgtcaggg gggcggagcc 7860tatggaaaaa cgccagcaac gcggcctttt
tacggttcct ggccttttgc tggccttttg 7920ctcacatgtt ctttcctgcg ttatcccctg
attctgtgga taaccgtatt accgcctttg 7980agtgagctga taccgctcgc cgcagccgaa
cgaccgagcg cagcgagtca gtgagcgagg 8040aagcggaaga agctcgcaca ttcagcagcg
tttttcagcg cgttttcgat cagcgtttca 8100atgttggtat caacaccagg tttaactttg
aacttatcgg cactgacggt tactgatttt 8160gaacttttgc tttgccacgg aacggtctgc
gttgtcggga agatgcgtga tctgatcctt 8220caactcagca aaagttcgcc aatacgcaaa
ccgcctctcc ccgcgcgttg gccgattcat 8280taatgcagct ggcacgacag gtttcccgac
tggaaagcgg gcagtgagcg caacgcaatt 8340aatgtgagtt agctcactca ttaggcaccc
caggctttac actttatgct tccggctcgt 8400atgttgtgtg gaattgtgag cggataacaa
tttcacacag gaaacagcta tgaccatgat 8460tacgccaagc ttgcatgcct gcaggtc
8487
100 8038 DNA Artificial pRET1003Rv
100 ccgtccacca cccggtgcct ggtctgcgtc tccctcggct cgttcctcgc ctatcctggt
60gaccagacac cggagcgagc tatgcccagg gttgcgcagt gacttcgtca ctgcgtaacc
120ctgggcgctc gcctcccatt cgcttcgctc acaggagggg gccgtcgatg gccgctgacg
180ctgcatctga cgaccggcgg accgaggtcc gcgccgctgc ttcgcgggcc gctgacgcgg
240ccccggcgaa gcgcacccgc accgtggcgg tgcggctgac cgatggggag gaggccgcgt
300ggatcgacgc cgcgctggcc gatggccacc ggcagctcgg ggcgtgggtg cgtgagcggg
360cggtggccgg ctatctcggg aaggtccgcc cgaagaccgg cagtggaatg tcggcggagg
420cggccgcgga ggtcgccgcg atgcggcagc agatgacgaa ggtggggaac aacctgaacc
480agatcgcgag ggcgatcaac gccgggcagg tgccgtcgca gatggccgag tccctgcaga
540aggggtggct ggagaggtgg gggcaggagt tggggcggat ggcggatcgg ctcgacgcgc
600tcgacgacca gggctgacgt gatcgcgaag atcagcacgg gcagcgaccc gaaggggttg
660gcggcgtatc tgcacgggcc ggggaaggcc accccgcaca gctaccgcac cgaggcgggc
720cggctgattg ccggcgggac ggtgatcgcg ggatcggtgc aggtcaccgc caaaaacccg
780acccggtggg ggcgggactt cgagcgggcc gccgcgacga acgcgcgggt gggtaagccg
840gtgtggcatt gctcgctgcg gtgcgcgccc ggggatcggc ggctgaccga taccgagttc
900gcggacatcg cgcagacggt cgccgagcgg atgggcttcg agagtcatcc gtgggtggcg
960gtgcggcacg acgacgacca catccacctg gctgtctccc gggtcgattt tcagggcgtg
1020acctggaaga acagcaacga ccggtggaag gtcgtcgagg tgatgcgcga ggtcgaacgc
1080gcgcacggcc tgatcgaggt ggcgagcccg gagcgggccc gtggccggca agccagcagc
1140ggcgagcaac gccgcgcggt gcggaccggc aaggtggcgc agcgggacgg tctgagggaa
1200attgtgaccg ccgcccgcga catcgccgca ggccagggtg tgggggcgtt cgaagtggcg
1260ctcgtacaga acccgattac ccgagtgcag gtgcggcgca acgtcgcgaa gacgggccgg
1320atgaatggct acagcttcaa cctgcccggc tacgtcgacg ccgccgggga gccgatctgg
1380ttgccggcct ccaaactcga ccggggtttg tcctggtcac agctggaaaa gacgctgacc
1440agaccccgcc cggaccgcct cgccggcgag gagacggtgc cgcggaagcg gctcgagcgc
1500gccgccgcgt gggagcagcg ccgccgcgag gtcggcggcg agcagttcgc agctgcccgc
1560tgggagcagg cccgcgcgaa tgttggtgag acggccgggc ggatccgcgc cgaacagtcc
1620gcggacacga agtggaagca ggtgaacgag gcgttgacca gccaagaccg ggccgaggag
1680caggctgccg aggcagcgcg ggtcgcctcc gctgtcatgg gaggccaccc gacaccgcta
1740cgggacatgc tcgccgccca ggagcagcgc cggaagccgt ggactccgga gcagaaacgc
1800cagtacgcga ccgcaaaagc ccaagcagaa cgcgccgcga aggccaagga cgccgcgaaa
1860tggaccgagg tcgccggcgg cggctaccag cgggacgtgc gcgggatgaa cctgcgactg
1920tgggtggctg aggacggcgc ctggtcgatc acctcgaaga aggaccccga ccgccagtac
1980gccgcaggtc aggccgacac cgtcgcgcag gcccaagccg cggccacggc cacagcgaaa
2040acgcaggccc aggcgatgtg gaagcaggtc ccggccgaca agcgcaccga gtcagccacc
2100agagcggtcc ggcgcgtgat cgcggatctc acccccacca aacccgccga ggtcaaaccc
2160ccggcccgcc gccagggacc aaccatgccg cagtcggccc cggggtatca gccacccggc
2220cgcgaccgag gtcgagaatc cggaatggga ctgtgagcag agagcgagaa ggctttcgtg
2280gagcgtaggg aacagacgca ggcctggcga agcatgtcca agaacaccat cgatcgctag
2340aaggtcggtc gtgcccaggg tgcccaggat gcgtacataa cgcgcgaaag gtgcatacct
2400cccatagcat cggcgcgtat ggtagggaaa atgatcttca aacgtattgc tgtggtcgtg
2460ctcgctggtg gggctttggt agtgggaggc agccaggttg ctggtgctac cacggtttca
2520gctccacagc cgagtccttc agcagcggtg gtgccgacgg ttcttccacc agtcactttc
2580accgccgctt ctgcgcactg cgaggcccag tacgcgtcgg attcccggcg atgccgtctg
2640attccacttc cacagggccg agcgatctgc tgggcggcag ccgctgcccg ttacgcagcg
2700tgccgcgccg gaaactaggt agaacgtgag catggacgag cttcccacct tcatcgccga
2760cgacatcgtg atggccagaa cgttcgacag ccctaacggc caggtggtgc tcgaggtgaa
2820cactccgcgg ccgttcgatg ctgcggcccc ggagggtgac tactgctgca ccttccggat
2880cagcgggaac atggatgccc cttacgacgg attcggtggc ggcgtcgacg cagtgcaggc
2940gctgctactc gcattggcca tggcacacga ggaacttcgt caaacttcgc cagagttgac
3000gtttctaggc gagacgaacc tcggtctacc ggtcttgaac atcaagcccg acaacgcgat
3060cgaagccgtg gtctcattcc ccgctccctg atgtgacgca ctttcacccc tggcactcat
3120gtaccgaagc tgggactgag aaagggctgc cgcgtcaccg cttcgcgttg acttgccact
3180gaacgggggc gtgtcccggt cagggcgggg tgtgacctgg gttcatgaca ccgctaacac
3240gctgcggaaa tgcggattga actagttcat ttggggaacg atgacctgat gaccggggat
3300cgtgacctac ccatgctgac catcgccgag gcggtggacg cgacgcagac cagtgagagc
3360acgatcaagc gccgcctgcg gtcgggcgcg ttcccgaacg cggtccgcac tgccgacggg
3420aagtggatga ttcccctcgg tgacctatca gcggcagggc tgagaccagg gaaaatggcg
3480aaacctgacc cggtgacccc ttcaaatgac cgggtccgtg acctggcagc tgagaacgcc
3540gagctccgtc agcgcctggc cgtggccgaa gccctggcca gcgaacgcaa tcggatcatc
3600gacgtgcagc aacagatgct ccggatgctc gaagcccggc cggtgtcggc cctggagccc
3660gcggcggttc cagtggcggg tccgccgccg cccgtcccgg ccgccgatgg tcgggcagct
3720acgggcgccc tggcccggat acgtcgacgg cttctcggct aggagctgac cgcgtacttg
3780cgtgcgtcgt gcaggagctt tcccaccgtt ccggtggaga ttcccatctc ctcggcgatc
3840tcgcggtact tcaggccctg ctcgcgcagc tcgacggccc ggcgacggtt ctcggctgcc
3900cgtgcgagga actggtcccg cggctcggcc atgatgcgct ggatcgtgcg cgtggaggcc
3960cccatcttct cggccagctc gcgagctgtc tgcttgcggc ggatcggtcg ttcagcgccc
4020acggtctgcc tcccacaatg cgttccggtc gaccttcgtc gctcgtttcc ggtttgcctc
4080gcgcttcttc tcactcatct tgcgaccgcg tgcggcttgt atggcgatga atgtggcctc
4140gtagacagca gggccgtcgg cccacatccg ggactttgta gtgatccagc gggtaatgga
4200ggccgcgacg gcgcgtagct cgcttgctgg cagtggatcg ggcctgcctg tgaccgggtt
4260cctgaacgtg gcgttgatct gtgcggcttc cgcatagatc gcggccccga ggccggtcgg
4320gtcgccccag tggaagcgga tttcgcggta ggcccaggtg cgtgcggttt cgaacagggc
4380gcagtttcgg ccgaggccga tcgggttctc acggcgcgat cgggtttgcc gccagcgcgt
4440tggcggcatg tggatgccga gttccgcctc gagctcggcg agggatcgcc gctcggtgtg
4500cagccaatgg gtgtcccagt caccgtgagt cgggttcttg gtcatcaggc ccgaatagcc
4560cttgtccccc tggacggcgc gccggaggcc ttcggtgacg gcggccgcat aggcgagcgg
4620cttacgacgg gcgtactcgg tgcgggtgaa cggctctgcc agcgcccaca cagcgtgtgc
4680gtgcccgtta cgggggttct ccacgatcgc gttcggcaga ggatgattcc cggccgccga
4740cagcgcccgc agcgcggcgt ccgggtggtc aacgtccacg acgagcaggt tgctcaatgc
4800ctgcgggttc gactcgatgt agcggcgatc cagtgcgtct gatcgccgca tccggtagac
4860gccgtcgagg aaatcgtcgg ttgccagtgg ccacagcggt agccacagct gttcccaggc
4920gccgcctgtg tgctcttcca ccgcaaccat ggggaacaca ctcacacaca agatcgattt
4980attccggtac gacacgccag ccaagtcaga tgtttcggtt tctggagcgg tcctccagac
5040ctttgagatc cgctccagaa acgtccacaa attattgggg tacgtcgaac caagccttat
5100caggtatccc ggggttccgg gggtgaacac caccctccga ccggtccaga atccgtcgat
5160ctcacctatc cgctcgaagt ccttgagtca gtgacaggac cactgctggg ctcccagcgc
5220agaaggcaag tgaaggcaga cgactgcggg aggtaagtcg ggtacggcat gaggtccttc
5280agaagcggcg tcgacgccag gcccacacgc acaatccgct tcccacgagg gacaccaccg
5340gtagcgcccc ctgcaaccgg cgcagtgtca cgaggcgccg gtactgctcg tttgacagga
5400actgcagggt cggtgagctc gcgctgggcg gatcccacca gtagctcccc gtgccggtaa
5460ccgcttgggg ccaagcgaag acacccaccg cggcagcgat ggcaatgcac gtggatggga
5520acaccaccca gaaccaggga aatcctggtg ccggcccgag acgatcccgg cgcggtaaga
5580ccacaccggc caccatcgcc acggcccccg acgcaacaag caataaccac cccatgagcg
5640gacggtacaa gcgccgacgc cgggtggccg ttaggtgcgc gccagcccgt gaccggaccg
5700gcgaagcgtg ccgctgggcg gcccgccgtg gcgcccgtcc cgtgcccgtt ctgaccggtg
5760gtctcggtcg ctcgttcctc gcgtcctcac ctgccggtca gcccgtgacc ggactctaga
5820ggatccccgg gtaccgagct cgaattcgta atcatggtca tagctgtttc ctgtgtgaaa
5880ttgttatccg ctcacaattc cacacaacat acgagccgga agcataaagt gtaaagcctg
5940gggtgcctaa tgagtgagct aactcacatt aattgcgttg cgctcactgc ccgctttcca
6000gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg ggagaggcgg
6060tttgcgtatt ggagcttggc actgggccaa gctgaatttc tgccattcat ccgcttatta
6120tcacttattc aggcgtagca ccaggcgttt aagggcacca ataactgcct taaaaaaatt
6180acgccccgcc ctgccactca tcgcagtact gttgtaattc attaagcatt ctgccgacat
6240ggaagccatc acagacggca tgatgaacct gaatcgccag cggcatcagc accttgtcgc
6300cttgcgtata atatttgccc atggtgaaaa cgggggcgaa gaagttgtcc atattggcca
6360cgtttaaatc aaaactggtg aaactcaccc agggattggc tgagacgaaa aacatattct
6420caataaaccc tttagggaaa taggccaggt tttcaccgta acacgccaca tcttgcgaat
6480atatgtgtag aaactgccgg aaatcgtcgt ggtattcact ccagagcgat gaaaacgttt
6540cagtttgctc atggaaaacg gtgtaacaag ggtgaacact atcccatatc accagctcac
6600cgtctttcat tgccatacga aattccggat gagcattcat caggcgggca agaatgtgaa
6660taaaggccgg ataaaacttg tgcttatttt tctttacggt ctttaaaaag gccgtaatat
6720ccagctgaac ggtctggtta taggtacatt gagcaactga ctgaaatgcc tcaaaatgtt
6780ctttacgatg ccattgggat atatcaacgg tggtatatcc agtgattttt ttctccattt
6840tagcttcctt agctcctgaa aatctcgata actcaaaaaa tacgcccggt agtgatctta
6900tttcattatg gtgaaagttg gaacctctta cgtgccgatc aacgtctcat tttcgccaaa
6960agttggccca gggcttcccg gtatcaacag ggacaccagg atttatttat tctgcgaagt
7020gatcttccgt tcgacggagt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc
7080ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct
7140accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg
7200cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca
7260cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc
7320tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga
7380taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac
7440gacctacacc gaactgagat acctacagcg tgagcattga gaaagcgcca cgcttcccga
7500agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag
7560ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg
7620acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag
7680caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc
7740tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc
7800tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagaagctca
7860ttcgccattc aggctgcgca actgttggga agggcgatcg gtgcgggcct cttcgctatt
7920acgccagctg gcgaaagggg gatgtgctgc aaggcgatta agttgggtaa cgccagggtt
7980ttcccagtca cgacgttgta aaacgacggc cagtgccaag cttgcatgcc tgcaggtc
8038