Patent application title: Recombinant Polypeptides for Diagnosing Infection with Trypanosoma Cruzi
Inventors:
Louis V. Kirchhoff (Iowa City, IA, US)
Keiko Otsu (Iowa City, IA, US)
IPC8 Class: AG01N3353FI
USPC Class:
435 792
Class name: Involving antigen-antibody binding, specific binding protein assay or specific ligand-receptor binding assay assay in which an enzyme present is a label heterogeneous or solid phase assay system (e.g., elisa, etc.)
Publication date: 2010-08-05
Patent application number: 20100196933
Claims:
1. A recombinant plasmid vector comprising a DNA sequence which codes for
a recombinant polypeptide corresponding to a polypeptide selected from
the group consisting of FP3 [SEQ ID NO 22], FP4 [SEQ ID NO 8], FP5 [SEQ
ID NO 10], FP6 [SEQ ID NO 14], FP7 [SEQ ID NO 12], FP8 [SEQ ID NO 16],
FP9 [SEQ ID NO 18] and FP 10 [SEQ ID NO 20].
2. The plasmid vector of claim 1, wherein the plasmid vector is selected from the group consisting of pGEX and pET plasmid vectors.
3. The plasmid vector of claim 2, wherein the plasmid vector is pET-32a.
4. The plasmid vector of claim 1, wherein the DNA sequence comprises one selected from the group consisting of SEQ lD NO 21, SEQ ID NO 7, SEQ ID NO 9, SEQ ID NO 13, SEQ ID NO 11, SEQ ID NO 15, SEQ ID NO 17, and SEQ ID NO 19.
4. An organism transfected with the plasmid vector of claim 1.
5. The organism of claim 4, wherein the organism is Escherichia coli.
6. A recombinant polypeptide comprising a sequence corresponding to one of FP3, FP4, FP6, FP7, FP8 and FP10.
7. A kit comprising:a first recombinant polypeptide wherein the first recombinant polypeptide is the recombinant polypeptide of claim 6, anda second recombinant polypeptide.
8. The kit of claim 7, wherein the second polypeptide comprises a sequence corresponding to one selected from the group consisting of Ag15 [SEQ ID NO 2], FP3, FP4, FP5, FP6, FP7 FP8, FP9 and FP10, wherein the first recombinant polypeptide is different from the second recombinant polypeptide.
9. The kit of claim 7, wherein the first recombinant polypeptide is FP4 and the second recombinant polypeptide is FP6.
10. The kit of claim 7, further comprising a third recombinant polypeptide selected from the group consisting of Ag15, FP3, FP4, FP5, FP6, FP7 FP8, FP9 and FP10, wherein the first recombinant polypeptide, the second recombinant polypeptide and the third recombinant polypeptide are different.
11. The kit of claim 10, wherein the first recombinant polypeptide corresponds to FP4, the second recombinant polypeptide corresponds to FP6 and the third polypeptide corresponds to FP10.
12. A method of detecting the presence of anti-Trypanosoma cruzi antibodies in a sample from a subject, comprising:(A) contacting the sample with a polypeptide comprising an amino acid sequence selected from the group consisting of FP3, FP4, FP6, FP7, FP8 and FP10 or an immunoreactive fragment thereof, and(B) detecting a specific binding interaction with an antibody in said sample, wherein the binding interaction comprises a specific binding between antibody in the sample and an epitope contained within the amino acid sequence set forth in FP3, FP4, FP6, FP7, FP8 and FP10 and wherein said specific binding interaction indicates past or present infection with Trypanosoma cruzi.
13. The method of claim 12, wherein the polypeptide of step A is immobilized on a carrier molecule or a solid phase.
14. The method of claim 12, wherein the polypeptide of step A has a sequence obtained from a strain or clone of Trypanosoma cruzi.
15. The method of claim 12, wherein the polypeptide has had one or more amino acids truncated.
16. The method of claim 12, wherein the step of detecting anti-Trypanosoma cruzi antibodies bound to the immobilized polypeptide is carried out by adding at least one compound that detects the antibodies.
17. The method of claim 16, wherein the at least one compound that enables detection of the anti-Trypanosoma cruzi antibodies is selected from the group consisting of a colorimetric agent, a fluorescent agent, a chemiluminescent agent and a radionucleotide.
Description:
[0001]This application claims priority from U.S. Provisional Application
No. 60/430,654, filed Dec. 4, 2002, hereby incorporated by reference in
its entirety.
BACKGROUND OF THE INVENTION
[0002]1. Field of the Invention
[0003]The present invention relates to recombinant polypeptides that are useful for diagnosing American trypanosomiasis, or Chagas disease. Chagas disease is caused by the infectious agent Trypanosoma cruzi. More particularly, the invention relates to specific combinations of recombinant T. cruzi polypeptides, synthesized using genetic engineering techniques, and to constructs and processes for producing the recombinant polypeptides, and to an assay and kit for detecting T. cruzi infection which employs the recombinant polypeptides.
[0004]2. Background
[0005]Chagas disease is a zoonosis caused by the protozoan parasite, Trypanosoma cruzi. This organism is primarily transmitted through contact with its triatomine insect vectors, but transmission by transfusion of contaminated blood and congenital transmission also are important. Historically Chagas disease has been a public health problem in all of Latin America, with the exception of the Caribbean nations. The World Health Organization estimates that 16-18 million persons are chronically infected with T. cruzi, and that 45,000 deaths occur each year due to the illness. Infection with T. cruzi is life-long and specific drug treatment lacks efficacy and often causes serious side effects. Ten to thirty percent of T. cruzi-infected persons develop chronic symptomatic Chagas disease, and the burden of disability and mortality in the endemic countries is enormous.
[0006]An estimated 80,000 to 100,000 T. cruzi-infected persons now live in the United States. These immigrants pose a risk for transfusion-associated transmission of the parasite here and in other countries to which Latin Americans have emigrated. Eight such cases have been reported in the United States, Canada, and Europe, all of which occurred in immunosuppressed patients in whom acute T. cruzi infection was diagnosed because of the fulminant course of the illness. Most transfusions are given to immunocompetent patients in whom acute Chagas disease would be a mild illness, and thus it is reasonable to assume that many other undetected instances of transfusion-associated transmission of T. cruzi have occurred in the United States and other industrialized nations. The question of whether blood donated in the United States should be screened serologically for antibodies to T. cruzi has been considered for at least a decade by both public and private entities involved in blood banking. A panel of experts convened in early 2000 by the American Red Cross to consider this issue recommended unanimously that our blood supply be screened serologically. Implementation of such a recommendation, however, is not an option currently because no test for T. cruzi infection has been cleared by the FDA for screening donated blood.
[0007]Diagnosis of T. cruzi infection presents problems. Demographic and clinical data are suggestive at best. Parasitologic tests, e.g., xenodianosis, hemoculture and PCR are insensitive. Other serologic tests are generally insensitive and lack specificity, as false positive reactions often occur with specimens from patients having infectious diseases, such as leishmaniasis, syphilis, or malaria; autoimmune diseases; and other parasitic and non-parasitic illinesses.
[0008]Such conventional tests include indirect immunofluorescence (DF), indirect hemagglutination (IHA), and complement fixation (CF) tests, as well as enzyme-linked immunosorbent assays (ELISA or EIA). Due to the lack of sensitivity and specify of the three commonly used assays, when a sample has a positive result from any, the blood must be discarded. Table I shows that in a major Brazilian blood bank (Hemocentro, Sao Paulo, Brazil), up to 3.43% of blood donations fall into this category.
TABLE-US-00001 TABLE I IIF IHA CF % w/ Results + + + 0.68% + - + 0.71% + + - - + + + - - 2.04% - + - - - + TOTAL: 3.43%
[0009]Commercially available ELISAs include lysate-based tests such as the Chagas Enzyme Immunoassay (EIA), available from Abbott Laboratories of Abbott Park, Ill. (the subject of FDA 510(k) Premarket Notification No. K933716, herein incorporated by reference in its entirety); the Chagas' IgG ELISA, available from Meridian Bioscience, Inc. of Cincinnati, Ohio, and its predecessor, Gull Laboratories (the subject of FDA 510(k) Premarket Notification No. K911233, herein incorporated by reference in its entirety); and the Chagas' kit (EIA method), available from Hemagen Diagnostics, Inc., of Waltham, Mass. (the subject of FDA 510(k) Premarket Notification No. K930272, herein incorporated by reference in its entirely). However, because these tests have less than optimal sensitivities and specificities, their use for screening donated blood would fail to detect some T. cruzi-infected units and also would cause substantial numbers of otherwise usable units to be discarded needlessly.
[0010]One of the present inventors has previously developed a radioimmune precipitation assay (RIPA), described in Kirchhoff L V, Gam A A, Gusmao R D, Goldsmith R S, Rezende J M, Rassi A. "Increased specificity of serodiagnosis of Chagas' disease by detection of antibody to the 72 and 90 kDa glycoproteins of Trypanosoma cruzi." J Infect Dis 1987;155:561-564, herein incorporated by reference in its entirety. This test is considered the benchmark against which other tests are measured, and it is the only current option for confirmatory testing in the United States. Unfortunately, the RIPA costs $175 per assay, and at that price, screening the approximately 13 million units of blood donated each year would cost over $2 billion.
[0011]Therefore, the present inventors have further developed recombinant assays for detection of T. cruzi infection. A typical recombinant polypeptide and method for assaying is described by them in U.S. Pat. No. 5,876,734, No. 6,228,601, and PCT Publication No. WO 95/25797, each of which is herein incorporated by reference in its entirety. Such assays for T. cruzi infection based on recombinant antigens, in contrast to those utilizing native antigens (e.g., the conventional lysate-based assays), as discussed above, will be more accurate, i.e., the sensitivity and specificity will be higher.
[0012]Furthermore, the recombinant assays of the invention present manufacturing advantages over the materials for the RIPA and conventional tests. Once the molecular biology has been completed, the recombinant antigens are produced in Escherichia coli, thus eliminating completely any biohazard associated with growing the parasites in liquid culture. This is a substantive advantage, as many cases of laboratory-acquired T. cruzi infection have been reported. Additionally, recombinant antigens produced in E. coli are much easier to purify, quantitate, and standardize than antigen lysates produced in liquid cultures of parasites, thus facilitating the manufacture of a consistent product and simplifying compliance with governmental regulations. A final advantage lies in the fact that several of the recombinant proteins presented in this application are comprised of two to four distinct protein segments derived from separate T. cruzi genes. This use of hybrid recombinant proteins also facilitates manufacture of an assay in that several antigenically distinct proteins are obtained in a single purification, quantitation, and standardization run.
SUMMARY OF THE INVENTION
[0013]The present invention utilizes recombinant proteins for detecting T. cruzi infected blood. The invention utilizes specific polypeptide sequences that correspond to fusion proteins FP3, FP4, FP5, FP6, FP7, FP8, FP9 and FP 10 as described below. Isolated polynucleotides that encode the inventive polypeptides according to the present invention are also utilized, as are cells transformed with a recombinant plasmid that expresses a polypeptide according to the invention. The present invention is similar to that which is described in U.S. Pat. No. 5,876,734, herein incorporated by reference in its entirety. However, the present invention replaces the proteins in the process with the recombinant proteins of this invention to achieve similar or superior results.
[0014]The present invention also provides a method for detecting the presence of antibodies to T. cruzi in an individual, comprising the steps of contacting a putative anti-T. cruzi antibody-containing sample from an individual with a polypeptide according to the invention that is typically attached or conjugated to a carrier molecule or attached or conjugated to a solid phase; allowing anti-T. cruzi and other antibodies in said sample to bind to said polypeptide; washing away unbound anti-T. cruzi antibodies; and adding a compound that enables detection of the anti-T. cruzi antibodies which are specifically bound to the polypeptide. The compound that enables detection of the anti-T. cruzi antibodies may be selected from the group consisting of a colorometric agent, a fluorescent agent, a chemiluminescent agent and a radionucleotide.
[0015]Also provided in accordance with the present invention is a kit for diagnosing the presence of anti-T. cruzi antibodies in a sample, comprising a container in which a polypeptide according to the invention is attached or conjugated to a carrier molecule or attached or conjugated to a solid phase; and directions for carrying out the method according to the invention. The kit additionally may comprise a container of a compound that binds to anti-T. cruzi antibodies and that renders said antibodies detectable.
[0016]Other objects, features and advantages of the present invention will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.
BRIEF DESCRIPTION OF THE DRAWINGS
[0017]FIG. 1 is a description of the prior art.
[0018]FIG. 1a-1h are schematic representations of the recombinant proteins utilized in the invention.
[0019]FIG. 2 is a bar graph showing reactivity of various blood specimens with recombinant proteins used alone or in combination as target antigens in ELISAs.
DETAILED DESCRIPTION OF THE INVENTION
[0020]FIGS. 1a-1h represent the recombinant proteins of the invention, with the various letters indicating known protein sequences, as follows. The Figs. are schematic diagrams of the recombinant T. cruzi proteins, comprised of segments A through L. Solid segments (A, C, D, F, H, I, and K) represent nonrepetitive proteins having amino acid sequences that are unrelated to each other. Saw-tooth segments (B, E, G, J, and L) represent repetitive proteins having amino acid sequences that are unrelated to each other and unrelated to those of the nonrepetitive proteins. The relative sizes and numbers of repeats in the repetitive proteins are roughly represented in the Figs. The sizes and shapes of the nonrepetitive segments bear no relation to the actual proteins.
[0021]The following information refers to FIGS. 1 and 1a-1h in which the recombinant proteins Ag15, FP3, FP4, FP5, FP6, FP7, FP8, FP9 and FP10 are depicted schematically. These proteins are derived from T. cruzi, the protozoan parasite that causes Chagas disease, and are formed from of proteins A through L as indicated, and defined herein. There are no substantive amino acid similarities among proteins A through L. Similarly there are no substantive DNA sequence similarities among the segments that encode proteins A through L. The T. cruzi DNA sequences that encode proteins A through L were cloned in combination into pGEX and pET plasmid vectors, such as pET-32a. Strains of Escherichia coli were transfected with the recombinant vectors bearing the T. cruzi DNA sequences, and the bacteria were incubated in liquid culture under conditions favoring synthesis of the recombinant proteins. The latter proteins were subsequently affinity-purified and then used as target antigens in ELISAs. ELISAs in which proteins Ag15, FP3, FP4, FP5, FP6, FP7, FP8, FP9, and FP10, alone or in combination are employed as target antigens are useful as sensitive and specific detectors of anti-T. cruzi antibodies in blood specimens obtained from persons who are chronically infected with this parasite. The detection of such antibodies is the primary means of identifying persons who are chronically infected with T. cruzi.
[0022]The following paragraphs contain information relating to the naming, localization, and function of proteins A through L, as well as the corresponding GenBank accession numbers of the sequences to which they are related and relevant publications.
[0023]It should be noted that the T. cruzi gene segments that encode protein segments A through L generally are shortened versions of the native coding regions. In this context, the constructs that encode single segments (i.e., FP5 and FP9), as well as all the others that encode more than one segment, are all unique, because, even if the individual components from which the various recombinant proteins of this invention are known, the segments of the invention have not been combined previously as described herein.
[0024]Protein AB. This hybrid recombinant protein, also designated Ag15 [SEQ ID NO. 2] in FIG. 1, is derived from the TCR27 gene of T. cruzi [SEQ ID NO. 1]. Protein A is the amino terminal nonrepetitive portion of the TCR27 protein, and Protein B is comprised of approximately 18 of the 14 amino acid repeats that make up the central portion of the TCR27 protein. The two native TCR27 genes sequenced contained approximately 69 and 105 of the 14-amino acid repeats.
[0025]Nucleotide sequence data that include the Ag15 DNA sequence were deposited with GenBank and EMBL databases by Keiko Otsu, John E. Donelson, and Louis V. Kirchhoff with the accession number L04603 and are described in U.S. Pat. No. 5,876,734 and No. 6,228,601, issued to Louis V. Kirchhoff and Keiko Otsu (each of which is herein incorporated by reference in its entirety). These references also present DNA and inferred protein sequences that include the Ag15 DNA and inferred protein sequences. The Ag15 DNA and inferred protein sequences are additionally presented in Otsu K, Donelson J E, Kirchhoff L V. "Interruption of a Trypanosoma cruzi gene encoding a protein containing 14-amino acid repeats by targeted insertion of the neomycin phosphotransferase gene." Mol Biochem Parasitol 1993;57:317-330, herein incorporated by reference in its entirety.
[0026]Protein C. This is a calcium binding protein of T. cruzi, initially called 1F8 and later designated the flagellar calcium binding protein (FCaBP) [SEQ ID NO 4]. The accession number of the original 1F8 DNA sequence [SEQ ID NO 3] deposited in GenBank is K03278. The Protein C DNA and inferred protein sequences are presented in Gonzalez A, Lerner T J, Huecas M, Sosa-Pineda B, Nogueira N, Lizardi P M. "Apparent generation of a segmented mRNA from two separate tandem gene families in Trypanosoma cruzi." Nucleic Acids Res 1985;13(16):5789-804, herein incorporated by reference in its entirety.
[0027]FIG. 1a shows a first protein (FP3) [SEQ ID NO. 22] in accordance with the invention. Specifically, FP3 corresponds essentially to the combination of Ag15 (FIG. 1), and by Protein C. The DNA sequence encoding FP3 [SEQ ID NO 21], also essentially corresponds to the sequences coding for Ag15 and Protein C.
[0028]Protein D. This is the protein core of a surface glycoprotein of T. cruzi that is referred to as GP72 [SEQ ID NO 6]. The accession number of the original gp72 DNA sequence [SEQ ID NO 5] deposited in GenBank is M65021. The Protein D DNA and inferred protein sequences are presented in Cooper R, Inverso J A, Espinosa M, Nogueira N, Cross G A. "Characterization of a candidate gene for GP72, an insect stage-specific antigen of Trypanosoma cruzi." Mol Biochem Parasitol 1991;49(1):45-59, herein incorporated by reference in its entirety.
[0029]FIG. 1b shows a second protein (FP4) [SEQ ID NO 8] in accordance with the invention. The DNA sequence [SEQ ID NO 7] that encodes Protein DABC which is a single continuous coding region, essentially corresponds to the DNA sequences from which it was constructed.
[0030]Protein E. This is a segment of the flagellar repetitive protein (FRA) [SEQ ID NO 10] of T. cruzi comprised of approximately nine repeats consisting of 68 amino acids each, shown as FIG. 1c (FP5). The accession number of the original Protein E DNA sequence [SEQ ID NO 9] deposited in GenBank is J04015. The Protein E DNA and inferred protein sequences are presented in Lafaille J J, Linss J, Krieger M A, Souto-Padron T, de Souza W, Goldenberg S. "Structure and expression of two Trypanosoma cruzi genes encoding antigenic proteins bearing repetitive epitopes." Mol Biochem Parasitol 1989; 35(2):127-136, herein incorporated by reference in its entirety.
[0031]Protein FGH. This is a protein [SEQ ID NO 12] encoded by a modified version of the T. cruzi TCR39 gene that was artificially constructed [SEQ ID NO 11], shown as FIG. 1e (FP7). The modification entailed reducing the length of the central portion of the TCR39 gene that encodes the 12-amino acid repeats. Protein F is the amino terminal nonrepetitive segment of the TCR39 protein. Protein G is comprised of approximately 13 of the 12-amino acid repeats that make up the central portion of the TCR39 protein. Protein H is the carboxy terminal nonrepetitive segment of the TCR39 protein. The accession number of the original, i.e., the unmodified, Protein FGH DNA sequence deposited in GenBank is U15616. The TCR39 DNA and inferred protein sequences, which include the entire Protein FGH sequences, are presented in Gruber A, Zingales B. "Trypanosoma cruzi: characterization of two recombinant antigens with potential application in the diagnosis of Chagas' disease." Exp Parasitol 1993;76(1):1-12, herein incorporated by reference in its entirety.
[0032]FIG. 1d shows another hybrid recombinant protein (FP6, Protein FGHE) [SEQ ID NO 14] in accordance with the invention. The DNA sequence that encodes Protein FGHE [SEQ ID NO 13], which is a single continuous coding region, essentially corresponds to the DNA sequences from which it was constructed.
[0033]Protein IJK. This is a protein [SEQ ID NO 16] encoded by a modified version of the T. cruzi shed acute phase antigen (SAPA) gene that was artificially constructed [SEQ ID NO 15], as shown in FIG. 1f (FP8). The modification entailed reducing the length of the central portion of the SAPA gene that consists of 12-amino acid repeats. Protein I is the amino terminal nonrepetitive segment of the SAPA protein. Protein J is comprised of approximately nine of the 12-amino acid repeats that make up the central portion of the SAPA protein. Protein K is the carboxy terminal nonrepetitive segment of the SAPA protein. The accession number of the original, i.e., the unmodified, Protein IJK DNA sequence deposited in Gen Bank is J03985. The SAPA DNA and protein sequences, which include the entire Protein IJK sequences, are presented in Affranchino J L, Pollevick G D, Frasch A C C. "The expression of the major shed Trypanosoma cruzi antigen results from the developmentally-regulated transcription of a small gene family." FEBS Lett 1991;280:316-320, herein incorporated by reference in its entirety.
[0034]Protein L. This is a microtubule-associated repetitive protein (MAP) [SEQ ID NO 18] of T. cruzi that is comprised of approximately five repeats consisting of 38 amino acids each, as depicted in FIG. 1g (FP9). The accession number of the original Protein L DNA sequence [SEQ ID NO 17] deposited in GenBank is S68286. The Protein L DNA and inferred protein sequences are presented in Kerner N, Liegeard P, Levin M J, Hontebeyrie-Joskowicz M. "Trypanosoma cruzi: antibodies to a MAP-like protein in chronic Chagas' disease cross-react with mammalian cytoskeleton." Experimental Parasitology 1991;73(4):451-459, herein incorporated by reference in its entirety.
[0035]FIG. 1h shows another hybrid recombinant protein (FP10, Protein IJKL) [SEQ ID NO 20] in accordance with the invention. The DNA sequence that encodes Protein IJKL [SEQ ID NO 19], which is a single continuous coding region, essentially corresponds to the DNA sequences from which it Was constructed.
[0036]Additionally, combinations of the various recombinant proteins depicted in the Figs. may be used. While it is possible to combine one or more of the recombinant proteins to form longer recombinant proteins, typically more than one recombinant protein is used simultaneously. For example, simultaneous uses of FP4 and FP5, FP5 and FP6, as well as FP4 and FP6, and combinations using more than two recombinant proteins (e.g., FP4, FP6 and FP10) are considered within the scope of the present invention. It is believed that the sensitivity and specificity of the assays according to the invention are sufficient to meet FDA standards for screening the blood supply of the United States.
[0037]Additionally, as described in U.S. Pat. No. 6,228,601 (herein incorporated by reference in its entirety), polypeptides need not correspond exactly over their entire lengths to be considered within the scope of the invention. For example, a wide variety of polypeptides which contain at least one epitope embodied in the polypeptides of the invention can be used in accordance with the present invention. Based on the nucleotide sequences, polypeptide molecules also can be produced (1) that include sequence variations, relative to the naturally-occurring sequences, (2) that have one or more amino acids truncated from the naturally-occurring sequences and variations thereof, or (3) that contain the naturally-occurring sequences and variations thereof as part of a longer sequence.
[0038]In this description, polypeptide molecules in categories (1), (2) and (3) are said to "correspond" to the amino acid sequences of the recombinant proteins of the invention. Such polypeptides also are referred to as "variants." The category of variants within the present invention includes, for example, fragments and muteins of proteins A though L, as well as larger molecules that consist essentially at least one protein sequence A through L, alone or in combination with other proteins A to L.
[0039]In this regard, a molecule that "consists essentially of" protein A to L, alone or in combination with any other proteins A to L, is one that is immunoreactive with samples from persons infected with T. cruzi, but that does not react with samples from patients with leishmaniasis, schistosomiasis, and other parasitic and infectious diseases, with samples from patients with autoimmune disorders and other illnesses, and with specimens from normal persons.
[0040]A "mutein" is a polypeptide that is homologous to the protein to which it corresponds, and that retains the basic functional attribute--the ability to react selectively with samples from persons infected with T. cruzi--of the corresponding region. For purposes of this description, "homology" between two sequences connotes a likeness short of identity indicative of a derivation of the first sequence from the second. In particular, a polypeptide is "homologous" to the corresponding protein if a comparison of amino acid sequences between the polypeptide and the corresponding region reveals an identity of greater than 40%, preferably greater than 50% and more preferably 70%. Such sequence comparisons can be performed via known algorithms, such as those described in Pearson W R, Lipman D J. "Improved tools for biological sequence comparison." Proc Natl Acad Sci USA 1988;85(8):2444-2448, herein incorporated by reference in its entirety, which are readily implemented by computer.
[0041]A fragment of a protein of the invention is a molecule in which one or more amino acids are truncated from that protein. Muteins and fragments can be produced, in accordance with the present invention, by known de novo synthesis techniques.
[0042]Also exemplary of variants within the present invention are molecules that are longer than a protein of the invention, but that contain the region or a mutein thereof within the longer sequence. For example, a variant may include a father fusion partner in addition to the protein of the invention. Such a fusion partner may allow easier purification of recombinantly-produced polypeptides. For example, use of a glutathione-S-transferase (26 kilodaltons, GST) fusion partner allows purification of recombinant polypeptides on glutathione agarose beads.
[0043]The portion of the sequence of a such molecule other than that portion of the sequence corresponding to the region may or may not be homologous to the sequence of a protein of the invention.
[0044]It will be appreciated that polypeptides shorter than the corresponding protein of the invention but that retain the ability to react selectively with samples from persons infected with T. cruzi are suitable for use in the present invention. Thus, variants may be of the same length, longer than or shorter than the protein of the invention, and also include sequences in which there are amino acid substitutions of the parent sequence. These variants must retain the ability to react selectively with samples from persons infected with T. cruzi.
[0045]In one embodiment, the assay of the invention uses FP4 as target antigen. Table II compares the results obtained by testing 45 pre-screened Argentinean specimens in an
TABLE-US-00002 TABLE II RIPA + - FP4 ELISA + 9 0 - 0 36
FP4 ELISA with those obtained by RIPA testing.
[0046]The data in Table II show that in this group of specimens, the sensitivity and specificity of the FP4 ELISA were both 100%
[0047]Similarly, the performance of an FP4+FP6 ELISA in comparison to RDA was
TABLE-US-00003 TABLE III RIPA + - FP4 + FP6 ELISA + 10 1 - 0 78
assessed by testing 89 pre-selected Guatemalan specimens.
[0048]The data shown in Table III indicate that in this group of samples, the sensitivity of the FP4+FP6 ELISA was 100% and the specificity was 98.7%.
[0049]As shown in FIG. 2, in a FP4+FP6 ELISA, performed using standard procedures, a group of previously characterized RIPA-positive samples from several Chagas-endemic countries gave a mean reactivity (absorbance) of 2.99. Thus FP4+FP6 is the preferred embodiment among the recombinant proteins tested alone and in combination in that experiment.
[0050]It should be apparent that embodiments other than those specifically described above may come within the spirit and scope of the present invention, such as recombinant proteins comprised of different combinations and/or spatial arrangements of proteins A to L. Hence, the present invention is not limited by the above description.
Sequence CWU
1
5711521DNATrypanosoma
cruziCDS(1)..(21)CDS(25)..(162)CDS(166)..(273)CDS(277)..(330)CDS(334)..(4-
29)CDS(433)..(573)CDS(628)..(678)CDS(691)..(759)CDS(763)..(834)CDS(838)..(-
861)CDS(865)..(876)CDS(880)..(897)CDS(901)..(918)CDS(922)..(933)CDS(937)..-
(948)CDS(952)..(975)CDS(979)..(1017)CDS(1021)..(1059)CDS(1063)..(1101)CDS(-
1105)..(1143)CDS(1147)..(1185)CDS(1189)..(1227)CDS(1231)..(1269)CDS(1273).-
.(1311)CDS(1315)..(1353)CDS(1357)..(1395)CDS(1399)..(1437)CDS(1441)..(1479-
)CDS(1483)..(1521)CDS(577)..(624)CDS(682)..(687) 1tat ggc ccg agc tgt ggt
gct tga gga tgg agc gct tta cgt ggc gga 48Tyr Gly Pro Ser Cys Gly
Ala Gly Trp Ser Ala Leu Arg Gly Gly1 5
10 15caa tgc caa caa cct cgt tcg aga aat ctc caa
tgg cgt tgt cac ttc 96Gln Cys Gln Gln Pro Arg Ser Arg Asn Leu Gln
Trp Arg Cys His Phe 20 25
30gtt tat tac gga agg act gct ggg ccc atc gta cat caa acc gta cag
144Val Tyr Tyr Gly Arg Thr Ala Gly Pro Ile Val His Gln Thr Val Gln
35 40 45ccg tac aaa tgg cgc tca tga
ctt gtt tgt gtc gga cac ggg caa atc 192Pro Tyr Lys Trp Arg Ser
Leu Val Cys Val Gly His Gly Gln Ile 50 55
60acg cat cat ttt tgc ccc acc tca gaa aaa aac gtt cat cac
agt gtt 240Thr His His Phe Cys Pro Thr Ser Glu Lys Asn Val His His
Ser Val 65 70 75tat aac agg att
cca gcc gga tgt tct tca aat tag cga gaa gag tcg 288Tyr Asn Arg Ile
Pro Ala Gly Cys Ser Ser Asn Arg Glu Glu Ser 80 85
90ttt gat gtt tgc cat ctg caa ttc cac gaa aat tct
tgc gat taa tat 336Phe Asp Val Cys His Leu Gln Phe His Glu Asn Ser
Cys Asp Tyr 95 100 105gca ggg agc
cac aac ccc gaa gga gta ctg gca agt tgg aaa tgc gga 384Ala Gly Ser
His Asn Pro Glu Gly Val Leu Ala Ser Trp Lys Cys Gly 110
115 120ctg cat ggg cta tca gag ttc cct cat gct cac gac
cga gga gga taa 432Leu His Gly Leu Ser Glu Phe Pro His Ala His Asp
Arg Gly Gly125 130 135act cct cta cta cgg
cat att aaa tgg aac ccc atc cat cat gtc ttt 480Thr Pro Leu Leu Arg
His Ile Lys Trp Asn Pro Ile His His Val Phe140 145
150 155acc cgc cac caa aac gaa gac gga agc acc
cag aat ttg ccc gga tgt 528Thr Arg His Gln Asn Glu Asp Gly Ser Thr
Gln Asn Leu Pro Gly Cys 160 165
170gtt gtt gca gtg gcc aca tgg gcc cat tgt ttc gct tgt gaa tat taa
576Val Val Ala Val Ala Thr Trp Ala His Cys Phe Ala Cys Glu Tyr
175 180 185caa aca tgc att tta cgt tgt
tac cgc ctc caa tgt ata cat tgt aca 624Gln Thr Cys Ile Leu Arg Cys
Tyr Arg Leu Gln Cys Ile His Cys Thr 190 195
200tga tgg ctc gta tca tcc gac tgg atc cat ggc cca gct cca
aca ggc 672 Trp Leu Val Ser Ser Asp Trp Ile His Gly Pro Ala Pro
Thr Gly 205 210 215aga aaa taa
tat cac taa ttc caa aaa aga aat gac aaa gct acg aga 720Arg Lys
Tyr His Phe Gln Lys Arg Asn Asp Lys Ala Thr Arg 220
225 230aaa agt gaa aaa ggc cga gaa aga aaa
att gga cgc cat taa ccg ggc 768Lys Ser Glu Lys Gly Arg Glu Arg Lys
Ile Gly Arg His Pro Gly 235 240
245aac caa gct gga aga gga acg aaa cca agc gta caa agc agc aca
caa 816Asn Gln Ala Gly Arg Gly Thr Lys Pro Ser Val Gln Ser Ser Thr
Gln 250 255 260ggc aga gga gga
aaa ggc taa aac att tca acg cct tat aac att tga 864Gly Arg Gly Gly
Lys Gly Asn Ile Ser Thr Pro Tyr Asn Ile 265
270 275gtc gga aaa tat taa ctt aaa gaa aag gcc aaa tga
cgc agt ttc aaa 912Val Gly Lys Tyr Leu Lys Glu Lys Ala Lys
Arg Ser Phe Lys 280 285
290tcg gga taa gaa aaa aaa ttc tga aac cgc aaa aac tga cga agt aga
960Ser Gly Glu Lys Lys Phe Asn Arg Lys Asn Arg Ser Arg
295 300gaa aca gag ggc ggc tga ggc tgc
caa ggc cgt gga gac gga gaa gca 1008Glu Thr Glu Gly Gly Gly Cys
Gln Gly Arg Gly Asp Gly Glu Ala 305 310
315gag ggc agc tga ggc cac gaa ggt tgc cga agc gga gaa gcg gaa ggc
1056Glu Gly Ser Gly His Glu Gly Cys Arg Ser Gly Glu Ala Glu Gly
320 325 330agc tga ggc cgc caa ggc
cgt gga gac gga gaa gca gag ggc agc tga 1104Ser Gly Arg Gln Gly
Arg Gly Asp Gly Glu Ala Glu Gly Ser 335 340
345agc cac gaa ggt tgc cga agc gga gaa gca gaa ggc agc tga ggc
cgc 1152Ser His Glu Gly Cys Arg Ser Gly Glu Ala Glu Gly Ser Gly
Arg 350 355 360caa ggc cgt gga gac
gga gaa gca gag ggc agc tga agc cac gaa ggt 1200Gln Gly Arg Gly Asp
Gly Glu Ala Glu Gly Ser Ser His Glu Gly 365
370 375tgc cga agc gga gaa gca gag ggc agc tga agc
cat gaa ggt tgc cga 1248Cys Arg Ser Gly Glu Ala Glu Gly Ser Ser
His Glu Gly Cys Arg 380 385
390agc gga gaa gca gaa ggc agc tga ggc cgc caa ggc cgt gga gac gga
1296Ser Gly Glu Ala Glu Gly Ser Gly Arg Gln Gly Arg Gly Asp Gly
395 400 405gaa gca gag ggc agc tga
agc cac gaa ggt tgc cga agc gga gaa gca 1344Glu Ala Glu Gly Ser
Ser His Glu Gly Cys Arg Ser Gly Glu Ala 410
415 420gaa ggc agc tga ggc cgc caa ggc cgt gga gac gga
gaa gca gag ggc 1392Glu Gly Ser Gly Arg Gln Gly Arg Gly Asp Gly
Glu Ala Glu Gly 425 430 435agc
tga agc cac gaa ggt tgc cga agc gga gaa gca gaa ggc agc tga 1440Ser
Ser His Glu Gly Cys Arg Ser Gly Glu Ala Glu Gly Ser 440
445 450ggc cgc caa ggc cgt gga gac gga gaa gca
gag ggc agc tga agc cac 1488Gly Arg Gln Gly Arg Gly Asp Gly Glu Ala
Glu Gly Ser Ser His 455 460
465gaa ggt tgc cga agc gga gaa gga tat cga tcc
1521Glu Gly Cys Arg Ser Gly Glu Gly Tyr Arg Ser 470
47527PRTTrypanosoma cruzi 2Tyr Gly Pro Ser Cys Gly Ala1
5346PRTTrypanosoma cruzi 3Gly Trp Ser Ala Leu Arg Gly Gly Gln Cys Gln
Gln Pro Arg Ser Arg1 5 10
15Asn Leu Gln Trp Arg Cys His Phe Val Tyr Tyr Gly Arg Thr Ala Gly
20 25 30Pro Ile Val His Gln Thr Val
Gln Pro Tyr Lys Trp Arg Ser 35 40
45436PRTTrypanosoma cruzi 4Leu Val Cys Val Gly His Gly Gln Ile Thr His
His Phe Cys Pro Thr1 5 10
15Ser Glu Lys Asn Val His His Ser Val Tyr Asn Arg Ile Pro Ala Gly
20 25 30Cys Ser Ser Asn
35518PRTTrypanosoma cruzi 5Arg Glu Glu Ser Phe Asp Val Cys His Leu Gln
Phe His Glu Asn Ser1 5 10
15Cys Asp632PRTTrypanosoma cruzi 6Tyr Ala Gly Ser His Asn Pro Glu Gly
Val Leu Ala Ser Trp Lys Cys1 5 10
15Gly Leu His Gly Leu Ser Glu Phe Pro His Ala His Asp Arg Gly
Gly 20 25
30747PRTTrypanosoma cruzi 7Thr Pro Leu Leu Arg His Ile Lys Trp Asn Pro
Ile His His Val Phe1 5 10
15Thr Arg His Gln Asn Glu Asp Gly Ser Thr Gln Asn Leu Pro Gly Cys
20 25 30Val Val Ala Val Ala Thr Trp
Ala His Cys Phe Ala Cys Glu Tyr 35 40
45816PRTTrypanosoma cruzi 8Gln Thr Cys Ile Leu Arg Cys Tyr Arg Leu
Gln Cys Ile His Cys Thr1 5 10
15917PRTTrypanosoma cruzi 9Trp Leu Val Ser Ser Asp Trp Ile His Gly
Pro Ala Pro Thr Gly Arg1 5 10
15Lys1023PRTTrypanosoma cruzi 10Phe Gln Lys Arg Asn Asp Lys Ala Thr
Arg Lys Ser Glu Lys Gly Arg1 5 10
15Glu Arg Lys Ile Gly Arg His 201124PRTTrypanosoma
cruzi 11Pro Gly Asn Gln Ala Gly Arg Gly Thr Lys Pro Ser Val Gln Ser Ser1
5 10 15Thr Gln Gly Arg
Gly Gly Lys Gly 20128PRTTrypanosoma cruzi 12Asn Ile Ser Thr
Pro Tyr Asn Ile1 5134PRTTrypanosoma cruzi 13Val Gly Lys
Tyr1146PRTTrypanosoma cruzi 14Leu Lys Glu Lys Ala Lys1
5156PRTTrypanosoma cruzi 15Arg Ser Phe Lys Ser Gly1
5164PRTTrypanosoma cruzi 16Glu Lys Lys Phe1174PRTTrypanosoma cruzi 17Asn
Arg Lys Asn1188PRTTrypanosoma cruzi 18Arg Ser Arg Glu Thr Glu Gly Gly1
51913PRTTrypanosoma cruzi 19Gly Cys Gln Gly Arg Gly Asp Gly
Glu Ala Glu Gly Ser1 5
102013PRTTrypanosoma cruzi 20Gly His Glu Gly Cys Arg Ser Gly Glu Ala Glu
Gly Ser1 5 102113PRTTrypanosoma cruzi
21Gly Arg Gln Gly Arg Gly Asp Gly Glu Ala Glu Gly Ser1 5
102213PRTTrypanosoma cruzi 22Ser His Glu Gly Cys Arg Ser
Gly Glu Ala Glu Gly Ser1 5
102313PRTTrypanosoma cruzi 23Gly Arg Gln Gly Arg Gly Asp Gly Glu Ala Glu
Gly Ser1 5 102413PRTTrypanosoma cruzi
24Ser His Glu Gly Cys Arg Ser Gly Glu Ala Glu Gly Ser1 5
102513PRTTrypanosoma cruzi 25Ser His Glu Gly Cys Arg Ser
Gly Glu Ala Glu Gly Ser1 5
102613PRTTrypanosoma cruzi 26Gly Arg Gln Gly Arg Gly Asp Gly Glu Ala Glu
Gly Ser1 5 102713PRTTrypanosoma cruzi
27Ser His Glu Gly Cys Arg Ser Gly Glu Ala Glu Gly Ser1 5
102813PRTTrypanosoma cruzi 28Gly Arg Gln Gly Arg Gly Asp
Gly Glu Ala Glu Gly Ser1 5
102913PRTTrypanosoma cruzi 29Ser His Glu Gly Cys Arg Ser Gly Glu Ala Glu
Gly Ser1 5 103013PRTTrypanosoma cruzi
30Gly Arg Gln Gly Arg Gly Asp Gly Glu Ala Glu Gly Ser1 5
103113PRTTrypanosoma cruzi 31Ser His Glu Gly Cys Arg Ser
Gly Glu Gly Tyr Arg Ser1 5
103242DNATrypanosoma cruzimodified_base(1)..(4)a, t, c, g 32nnnnctatta
ttgatacagt ttctgtacta tattggttgt gc
42333749DNATrypanosoma
cruziCDS(833)..(2575)sig_peptide(822)..(937)mat_peptide(938)..(2575)
33ccccctcgag gtcgacctgc aggtcaacgg atcttacctg agtacaaaag gtcaagtgag
60cggtcaaaag gatgtatata tacatatata accataaggg aaacatttgg gcatttaact
120gcctttacat ttcccttttc cttcaatatc ttgtttgttt gtttttggtt tctataggaa
180attttaggat ccggccagcg gcataggaga ttattctctt ttttattaat tgcttaatgc
240gttggtctgt gtgtgtgttg gttcccttgt gcgagctcac ggggcctaat tatgattgtt
300gcgcatatgc atatatatat atatatatat acatgtgtgt gtgtgtgtat atgtacgttt
360gttggtttgc cgctgtactc ccgcctgcgt gtgtctgtct ctctctctgt gtgtgtgatg
420ggctgcttct ctttcttttg ttgcgtccct ttattattat tatttttttt tcttctctcc
480cacttctctc cccgtgtggt gcacgcacag taaagataga gggagaaata gagcgagtgt
540ttgtatcagt gtctccgttg cggctggtac tggtagaagg agaagaatag aagaaggaga
600aaaaaaaaaa aaaaaaaaaa aaaagagaga gagagagaga agggcgaacg agaaaaaaga
660agaagaaaca tttgagaagg aattggaacg aaaattgtaa gaggaagcaa aaaaaaaaaa
720aaaaagtgtg tgtgtgtgag agagagagag agaggaagcc aataataata aaaagcaaac
780aaaaaagcaa aaacaaaaat atttgtagac cggacgtccc gtcttggacg tg atg ttt
838 Met Phe
-35tca aaa agg acg
tcg cca gca ccc ttc cgt gcg ctc ctg ctg ccg gtc 886Ser Lys Arg Thr
Ser Pro Ala Pro Phe Arg Ala Leu Leu Leu Pro Val -30
-25 -20gtg gtg gtg gtg gtg gtg gtg gtg gca tct gtg
gcc ctc cct gca gga 934Val Val Val Val Val Val Val Val Ala Ser Val
Ala Leu Pro Ala Gly -15 -10 -5gcg
cag ttt gat tta agg cag cag cag ctg gtt ata cag gat ttc ttc 982Ala
Gln Phe Asp Leu Arg Gln Gln Gln Leu Val Ile Gln Asp Phe Phe-1 1
5 10 15atc agt cgc tcc tgc gca
gga tgt tca cag ggg caa acc gat ggc cca 1030Ile Ser Arg Ser Cys Ala
Gly Cys Ser Gln Gly Gln Thr Asp Gly Pro 20
25 30agc ggt gcc ggc aca ctc ttc act gcc gcc ggt ggt
tcg ctt ggc aaa 1078Ser Gly Ala Gly Thr Leu Phe Thr Ala Ala Gly Gly
Ser Leu Gly Lys 35 40 45gat
gct tcc acg ctg ctg ttg tgt gac caa ggt ggt ggt ggc tcc agc 1126Asp
Ala Ser Thr Leu Leu Leu Cys Asp Gln Gly Gly Gly Gly Ser Ser 50
55 60gtg cgt ttg gtg aac aaa tcc ggc att
ttc acc ctt gcc ggt agt aaa 1174Val Arg Leu Val Asn Lys Ser Gly Ile
Phe Thr Leu Ala Gly Ser Lys 65 70
75acg acg cgt ggc aat caa aat ggt ccg gcg gcg acg gca ctc ttc aac
1222Thr Thr Arg Gly Asn Gln Asn Gly Pro Ala Ala Thr Ala Leu Phe Asn80
85 90 95atg ccc cga gct
gtg gtg ctt gag gat gga gcg ctt tac gtg gcg gac 1270Met Pro Arg Ala
Val Val Leu Glu Asp Gly Ala Leu Tyr Val Ala Asp 100
105 110agt gcc aac aac ctc gtt cga gaa atc tcc
aat ggc att gtc act tcg 1318Ser Ala Asn Asn Leu Val Arg Glu Ile Ser
Asn Gly Ile Val Thr Ser 115 120
125ttt att acg gag gga ctg ctg ggc cca tcg tac atc aaa ccg tac agc
1366Phe Ile Thr Glu Gly Leu Leu Gly Pro Ser Tyr Ile Lys Pro Tyr Ser
130 135 140cgt cca aat ggc gcc cat gac
ttg ttt gtg tcg gac acg ggc aaa tct 1414Arg Pro Asn Gly Ala His Asp
Leu Phe Val Ser Asp Thr Gly Lys Ser 145 150
155cgc atc att ttt gcc cca ctt cag aaa caa acg ttc atc aca gtg ttt
1462Arg Ile Ile Phe Ala Pro Leu Gln Lys Gln Thr Phe Ile Thr Val Phe160
165 170 175ata aca gga ttc
cag ccg gat gtt ctt caa att agc gag aag agt cgt 1510Ile Thr Gly Phe
Gln Pro Asp Val Leu Gln Ile Ser Glu Lys Ser Arg 180
185 190ttg atg ttt gcc atc tgc aat tcc acg aaa
att ctt tcg att aat atg 1558Leu Met Phe Ala Ile Cys Asn Ser Thr Lys
Ile Leu Ser Ile Asn Met 195 200
205cag gga gcc aca acc ccg aag gat tac tgg caa gtt gga aat gcg gac
1606Gln Gly Ala Thr Thr Pro Lys Asp Tyr Trp Gln Val Gly Asn Ala Asp
210 215 220tgc atg ggc tat cag agt tct
ctc atg ctc acg acc gag gag gat aaa 1654Cys Met Gly Tyr Gln Ser Ser
Leu Met Leu Thr Thr Glu Glu Asp Lys 225 230
235ctc ctc tac tac ggc ata tta aat gga acc cca tcc atc atg tct tta
1702Leu Leu Tyr Tyr Gly Ile Leu Asn Gly Thr Pro Ser Ile Met Ser Leu240
245 250 255ccc gcc acc aaa
acg aag acg gaa gca ccc aga att tgc ccg gat gtg 1750Pro Ala Thr Lys
Thr Lys Thr Glu Ala Pro Arg Ile Cys Pro Asp Val 260
265 270ttg ttg cgg tgg cca cat ggg ccc att gtt
tcg ctt gtg aat att aac 1798Leu Leu Arg Trp Pro His Gly Pro Ile Val
Ser Leu Val Asn Ile Asn 275 280
285aaa cat gca ttt tac gtt gtt acc gcc tcc aat gta tac att gta cat
1846Lys His Ala Phe Tyr Val Val Thr Ala Ser Asn Val Tyr Ile Val His
290 295 300gat ggc tct tat cat ccg act
gtg acg ccg aca cct cct ctg aca ccg 1894Asp Gly Ser Tyr His Pro Thr
Val Thr Pro Thr Pro Pro Leu Thr Pro 305 310
315acg cct aca cca gaa gtg aca ccc aca cct act gtg acc ccg acg cct
1942Thr Pro Thr Pro Glu Val Thr Pro Thr Pro Thr Val Thr Pro Thr Pro320
325 330 335aca ccg gaa gtg
aca ccg aca ccg cca gtg act ccg agc ccc acc atc 1990Thr Pro Glu Val
Thr Pro Thr Pro Pro Val Thr Pro Ser Pro Thr Ile 340
345 350aca atc cac cgg ggt ttt gct gtg gca gcc
ttt cct gcc caa agt ctt 2038Thr Ile His Arg Gly Phe Ala Val Ala Ala
Phe Pro Ala Gln Ser Leu 355 360
365cca atc gaa gac ccg cgg ctt atg cat gaa ctg ctt tct tgg tta atg
2086Pro Ile Glu Asp Pro Arg Leu Met His Glu Leu Leu Ser Trp Leu Met
370 375 380aag gat gta ggg att gcg ttc
gaa tcc acg gac ttt ttt gcc gta ttt 2134Lys Asp Val Gly Ile Ala Phe
Glu Ser Thr Asp Phe Phe Ala Val Phe 385 390
395cct cca gat aga gag gtt ttg gtg ccc ggt tat gta aat gtc tcc acc
2182Pro Pro Asp Arg Glu Val Leu Val Pro Gly Tyr Val Asn Val Ser Thr400
405 410 415tgg aat aac ttg
acg gtg cta ttc aac ttt gac cgc acc att gtc atc 2230Trp Asn Asn Leu
Thr Val Leu Phe Asn Phe Asp Arg Thr Ile Val Ile 420
425 430acg gaa tat ttc act cca gag ggc atg tct
tca gag gag gga cag gcc 2278Thr Glu Tyr Phe Thr Pro Glu Gly Met Ser
Ser Glu Glu Gly Gln Ala 435 440
445cga ctc ttc gct tcg ccg tgg tac tgg acg aga aat ttc ctt gat tca
2326Arg Leu Phe Ala Ser Pro Trp Tyr Trp Thr Arg Asn Phe Leu Asp Ser
450 455 460tta aag aaa aca gta gct tgg
aag gac ttg gag gcg ttt tgc atg gtc 2374Leu Lys Lys Thr Val Ala Trp
Lys Asp Leu Glu Ala Phe Cys Met Val 465 470
475aac tgt gtt gaa cac tgt gag aca atg aca ttc cat aag tca gaa tgt
2422Asn Cys Val Glu His Cys Glu Thr Met Thr Phe His Lys Ser Glu Cys480
485 490 495gta ggc tac gtc
cgg ccc cca gta tgc aac gac gtc tgt gtg ggg gcg 2470Val Gly Tyr Val
Arg Pro Pro Val Cys Asn Asp Val Cys Val Gly Ala 500
505 510gta gtg tcc tcc gtg gtg ctt ggc gcc aca
ggt atc gca ctc att gca 2518Val Val Ser Ser Val Val Leu Gly Ala Thr
Gly Ile Ala Leu Ile Ala 515 520
525ctg atg gtt gga agt tcg gcg aac tta cgg agc gct gtg att ctt gtt
2566Leu Met Val Gly Ser Ser Ala Asn Leu Arg Ser Ala Val Ile Leu Val
530 535 540cca ccc atg tagattttgt
ccccacactt tggagaaagg tgggaaatga 2615Pro Pro Met
545cttcagaaat tgaaattaga aggaaccaac aacacaagaa gcaagcgaag gtgaaaacaa
2675cgggaagaag aagaagaaga aaaaaaaaaa aagaaaagaa aaaaatgggg ggctgagtgg
2735ggaaaagaga aagaaaagaa gtgtgcgtgt aaccgtgtgt gtgtgtgccg gggaaaaaga
2795agaaacacaa aagatttctt ttttgttttt tgttttaatg gtgcaaagag ggaaacaaga
2855aagcgaaggg tgcatgtgtg tctgtagata tataaaaata aacatatgcc cccgcatgta
2915ttttaccgtt ggcagttccg tggcttcttt tttttttttt tttttgtatt tttgttattt
2975tttcctctta tttcttcgtg tgtgtgtgta tgtattatta ttcttttttg ttttttgttt
3035gtttgtttgt ttttacctac tcatctgcct tcattttttt ttttgtgtgt tttcactcag
3095cccctctctc tttctctctt cttcttctct cttcatgcgt gtatttccgc atggagtgga
3155aaaggaacgg ctgggagcga ttgtgatggt gcttgtgttg gaggtgtggc tatgcgagta
3215gtggagatgc atgtatgtat gtatatatgt ggtttggtgt atatatttaa atattatatg
3275ttgttgttgt tgctgtccga ctctcggggg acgtacaccg acctacttac ttacagagag
3335agagagagag aggaagagaa tgagagaaaa ggggggcgtg tggtgtgttc tgtattcatt
3395gaagagcgca aaaataaata aaaataaaat aataaaatga gggagagaga agggaggagg
3455aaacagcaga ggaatttgta tgccatcgtt gtgactaatt tttcataagg actctgtgat
3515ggccctgtta accacgtcca ctgcagtaga cgagtcaaaa ttgactgcga gtgttacgcc
3575aactgtacgt ctgtctccct cgtgctgtac gtgtgcaagt aagtacgtgt gtgcactgtg
3635cgtgtgcgtg tgtgtgtgtg tcaagggcgc cttttacgtg tctgtgcgct tgagtgggga
3695ggggagaaga ggaggagaga cgaagaaaga aagaaagaaa aaagcgggcg gcgc
374934581PRTTrypanosoma cruzi 34Met Phe Ser Lys Arg Thr Ser Pro Ala Pro
Phe Arg Ala Leu Leu Leu-35 -30 -25
-20Pro Val Val Val Val Val Val Val Val Val Ala Ser Val Ala Leu
Pro -15 -10 -5Ala Gly Ala
Gln Phe Asp Leu Arg Gln Gln Gln Leu Val Ile Gln Asp -1 1
5 10Phe Phe Ile Ser Arg Ser Cys Ala Gly Cys Ser
Gln Gly Gln Thr Asp 15 20 25Gly Pro
Ser Gly Ala Gly Thr Leu Phe Thr Ala Ala Gly Gly Ser Leu30
35 40 45Gly Lys Asp Ala Ser Thr Leu
Leu Leu Cys Asp Gln Gly Gly Gly Gly 50 55
60Ser Ser Val Arg Leu Val Asn Lys Ser Gly Ile Phe Thr
Leu Ala Gly 65 70 75Ser Lys
Thr Thr Arg Gly Asn Gln Asn Gly Pro Ala Ala Thr Ala Leu 80
85 90Phe Asn Met Pro Arg Ala Val Val Leu Glu
Asp Gly Ala Leu Tyr Val 95 100 105Ala
Asp Ser Ala Asn Asn Leu Val Arg Glu Ile Ser Asn Gly Ile Val110
115 120 125Thr Ser Phe Ile Thr Glu
Gly Leu Leu Gly Pro Ser Tyr Ile Lys Pro 130
135 140Tyr Ser Arg Pro Asn Gly Ala His Asp Leu Phe Val
Ser Asp Thr Gly 145 150 155Lys
Ser Arg Ile Ile Phe Ala Pro Leu Gln Lys Gln Thr Phe Ile Thr 160
165 170Val Phe Ile Thr Gly Phe Gln Pro Asp
Val Leu Gln Ile Ser Glu Lys 175 180
185Ser Arg Leu Met Phe Ala Ile Cys Asn Ser Thr Lys Ile Leu Ser Ile190
195 200 205Asn Met Gln Gly
Ala Thr Thr Pro Lys Asp Tyr Trp Gln Val Gly Asn 210
215 220Ala Asp Cys Met Gly Tyr Gln Ser Ser Leu
Met Leu Thr Thr Glu Glu 225 230
235Asp Lys Leu Leu Tyr Tyr Gly Ile Leu Asn Gly Thr Pro Ser Ile Met
240 245 250Ser Leu Pro Ala Thr Lys Thr
Lys Thr Glu Ala Pro Arg Ile Cys Pro 255 260
265Asp Val Leu Leu Arg Trp Pro His Gly Pro Ile Val Ser Leu Val
Asn270 275 280 285Ile Asn
Lys His Ala Phe Tyr Val Val Thr Ala Ser Asn Val Tyr Ile
290 295 300Val His Asp Gly Ser Tyr His
Pro Thr Val Thr Pro Thr Pro Pro Leu 305 310
315Thr Pro Thr Pro Thr Pro Glu Val Thr Pro Thr Pro Thr Val
Thr Pro 320 325 330Thr Pro Thr Pro
Glu Val Thr Pro Thr Pro Pro Val Thr Pro Ser Pro 335
340 345Thr Ile Thr Ile His Arg Gly Phe Ala Val Ala Ala
Phe Pro Ala Gln350 355 360
365Ser Leu Pro Ile Glu Asp Pro Arg Leu Met His Glu Leu Leu Ser Trp
370 375 380Leu Met Lys Asp Val
Gly Ile Ala Phe Glu Ser Thr Asp Phe Phe Ala 385
390 395Val Phe Pro Pro Asp Arg Glu Val Leu Val Pro Gly
Tyr Val Asn Val 400 405 410Ser Thr
Trp Asn Asn Leu Thr Val Leu Phe Asn Phe Asp Arg Thr Ile 415
420 425Val Ile Thr Glu Tyr Phe Thr Pro Glu Gly Met
Ser Ser Glu Glu Gly430 435 440
445Gln Ala Arg Leu Phe Ala Ser Pro Trp Tyr Trp Thr Arg Asn Phe Leu
450 455 460Asp Ser Leu Lys
Lys Thr Val Ala Trp Lys Asp Leu Glu Ala Phe Cys 465
470 475Met Val Asn Cys Val Glu His Cys Glu Thr Met
Thr Phe His Lys Ser 480 485 490Glu
Cys Val Gly Tyr Val Arg Pro Pro Val Cys Asn Asp Val Cys Val 495
500 505Gly Ala Val Val Ser Ser Val Val Leu Gly
Ala Thr Gly Ile Ala Leu510 515 520
525Ile Ala Leu Met Val Gly Ser Ser Ala Asn Leu Arg Ser Ala Val
Ile 530 535 540Leu Val Pro
Pro Met 545352151DNATrypanosoma cruziCDS(1)..(2151) 35atg gcc
cga gct gtg gtg ctt gag gat gga gcg ctt tac gtg gcg gac 48Met Ala
Arg Ala Val Val Leu Glu Asp Gly Ala Leu Tyr Val Ala Asp1 5
10 15aat gcc aac aac ctc gtt cga gaa
atc tcc aat ggc gtt gtc act tcg 96Asn Ala Asn Asn Leu Val Arg Glu
Ile Ser Asn Gly Val Val Thr Ser 20 25
30ttt att acg gaa gga ctg ctg ggc cca tcg tac atc aaa ccg tac
agc 144Phe Ile Thr Glu Gly Leu Leu Gly Pro Ser Tyr Ile Lys Pro Tyr
Ser 35 40 45cgt aca aat ggc gct
cat gac ttg ttt gtg tcg gac acg ggc aaa tca 192Arg Thr Asn Gly Ala
His Asp Leu Phe Val Ser Asp Thr Gly Lys Ser 50 55
60cgc atc att ttt gcc cca cct cag aaa aaa acg ttc atc aca
gtg ttt 240Arg Ile Ile Phe Ala Pro Pro Gln Lys Lys Thr Phe Ile Thr
Val Phe65 70 75 80ata
aca gga ttc cag ccg gat gtt ctt caa att agc gag aag agt cgt 288Ile
Thr Gly Phe Gln Pro Asp Val Leu Gln Ile Ser Glu Lys Ser Arg
85 90 95ttg atg ttt gcc atc tgc aat
tcc acg aaa att ctt gcg att aat atg 336Leu Met Phe Ala Ile Cys Asn
Ser Thr Lys Ile Leu Ala Ile Asn Met 100 105
110cag gga gcc aca acc ccg aag gag tac tgg caa gtt gga aat
gcg gac 384Gln Gly Ala Thr Thr Pro Lys Glu Tyr Trp Gln Val Gly Asn
Ala Asp 115 120 125tgc atg ggc tat
cag agt tcc ctc atg ctc acg acc gag gag gat aaa 432Cys Met Gly Tyr
Gln Ser Ser Leu Met Leu Thr Thr Glu Glu Asp Lys 130
135 140ctc ctc tac tac ggc ata tta aat gga acc cca tcc
atc atg tct tta 480Leu Leu Tyr Tyr Gly Ile Leu Asn Gly Thr Pro Ser
Ile Met Ser Leu145 150 155
160ccc gcc acc aaa acg aag acg gaa gca ccc aga att tgc ccg gat gtg
528Pro Ala Thr Lys Thr Lys Thr Glu Ala Pro Arg Ile Cys Pro Asp Val
165 170 175ttg ttg cag tgg cca
cat ggg ccc att gtt tcg ctt gtg aat att aac 576Leu Leu Gln Trp Pro
His Gly Pro Ile Val Ser Leu Val Asn Ile Asn 180
185 190aaa cat gca ttt tac gtt gtt acc gcc tcc aat gta
tac att gta cat 624Lys His Ala Phe Tyr Val Val Thr Ala Ser Asn Val
Tyr Ile Val His 195 200 205gat ggc
tcg tat cat ccg act gga tcc atg gcc cag ctc caa cag gca 672Asp Gly
Ser Tyr His Pro Thr Gly Ser Met Ala Gln Leu Gln Gln Ala 210
215 220gaa aat aat atc act aat tcc aaa aaa gaa atg
aca aag cta cga gaa 720Glu Asn Asn Ile Thr Asn Ser Lys Lys Glu Met
Thr Lys Leu Arg Glu225 230 235
240aaa gtg aaa aag gcc gag aaa gaa aaa ttg gac gcc att aac cgg gca
768Lys Val Lys Lys Ala Glu Lys Glu Lys Leu Asp Ala Ile Asn Arg Ala
245 250 255acc aag ctg gaa gag
gaa cga aac caa gcg tac aaa gca gca cac aag 816Thr Lys Leu Glu Glu
Glu Arg Asn Gln Ala Tyr Lys Ala Ala His Lys 260
265 270gca gag gag gaa aag gct aaa aca ttt caa cgc ctt
ata aca ttt gag 864Ala Glu Glu Glu Lys Ala Lys Thr Phe Gln Arg Leu
Ile Thr Phe Glu 275 280 285tcg gaa
aat att aac tta aag aaa agg cca aat gac gca gtt tca aat 912Ser Glu
Asn Ile Asn Leu Lys Lys Arg Pro Asn Asp Ala Val Ser Asn 290
295 300cgg gat aag aaa aaa aat tct gaa acc gca aaa
act gac gaa gta gag 960Arg Asp Lys Lys Lys Asn Ser Glu Thr Ala Lys
Thr Asp Glu Val Glu305 310 315
320aaa cag agg gcg gct gag gct gcc aag gcc gtg gag acg gag aag cag
1008Lys Gln Arg Ala Ala Glu Ala Ala Lys Ala Val Glu Thr Glu Lys Gln
325 330 335agg gca gct gag gcc
acg aag gtt gcc gaa gcg gag aag cgg aag gca 1056Arg Ala Ala Glu Ala
Thr Lys Val Ala Glu Ala Glu Lys Arg Lys Ala 340
345 350gct gag gcc gcc aag gcc gtg gag acg gag aag cag
agg gca gct gaa 1104Ala Glu Ala Ala Lys Ala Val Glu Thr Glu Lys Gln
Arg Ala Ala Glu 355 360 365gcc acg
aag gtt gcc gaa gcg gag aag cag aag gca gct gag gcc gcc 1152Ala Thr
Lys Val Ala Glu Ala Glu Lys Gln Lys Ala Ala Glu Ala Ala 370
375 380aag gcc gtg gag acg gag aag cag agg gca gct
gaa gcc acg aag gtt 1200Lys Ala Val Glu Thr Glu Lys Gln Arg Ala Ala
Glu Ala Thr Lys Val385 390 395
400gcc gaa gcg gag aag cag agg gca gct gaa gcc atg aag gtt gcc gaa
1248Ala Glu Ala Glu Lys Gln Arg Ala Ala Glu Ala Met Lys Val Ala Glu
405 410 415gcg gag aag cag aag
gca gct gag gcc gcc aag gcc gtg gag acg gag 1296Ala Glu Lys Gln Lys
Ala Ala Glu Ala Ala Lys Ala Val Glu Thr Glu 420
425 430aag cag agg gca gct gaa gcc acg aag gtt gcc gaa
gcg gag aag cag 1344Lys Gln Arg Ala Ala Glu Ala Thr Lys Val Ala Glu
Ala Glu Lys Gln 435 440 445aag gca
gct gag gcc gcc aag gcc gtg gag acg gag aag cag agg gca 1392Lys Ala
Ala Glu Ala Ala Lys Ala Val Glu Thr Glu Lys Gln Arg Ala 450
455 460gct gaa gcc acg aag gtt gcc gaa gcg gag aag
cag aag gca gct gag 1440Ala Glu Ala Thr Lys Val Ala Glu Ala Glu Lys
Gln Lys Ala Ala Glu465 470 475
480gcc gcc aag gcc gtg gag acg gag aag cag agg gca gct gaa gcc acg
1488Ala Ala Lys Ala Val Glu Thr Glu Lys Gln Arg Ala Ala Glu Ala Thr
485 490 495aag gtt gcc gaa gcg
gag aag gat atc gat ccc atg ggt gct tgt ggg 1536Lys Val Ala Glu Ala
Glu Lys Asp Ile Asp Pro Met Gly Ala Cys Gly 500
505 510tcg aag gac tcg acg agc gac aag ggg ttg gcg agc
gat aag gac ggc 1584Ser Lys Asp Ser Thr Ser Asp Lys Gly Leu Ala Ser
Asp Lys Asp Gly 515 520 525aag aac
gcc aag gac cgc aag gaa gcg tgg gag cgc att cgc cag gcg 1632Lys Asn
Ala Lys Asp Arg Lys Glu Ala Trp Glu Arg Ile Arg Gln Ala 530
535 540 att cct cgt gag aag acc gcc gag gca aaa cag
cgc cgc atc gag ctc 1680Ile Pro Arg Glu Lys Thr Ala Glu Ala Lys Gln
Arg Arg Ile Glu Leu545 550 555
560ttc aag aag ttc gac aag aac gag acc ggg aag ctg tgc tac gat gag
1728Phe Lys Lys Phe Asp Lys Asn Glu Thr Gly Lys Leu Cys Tyr Asp Glu
565 570 575gtg cac agc ggc tgc
ctc gag gtg ctg aag ttg gac gag ttc acg ccg 1776Val His Ser Gly Cys
Leu Glu Val Leu Lys Leu Asp Glu Phe Thr Pro 580
585 590cga gtg cgc gac atc acg aag cgt gca ttc gac aag
gcg agg gcc ctg 1824Arg Val Arg Asp Ile Thr Lys Arg Ala Phe Asp Lys
Ala Arg Ala Leu 595 600 605ggc agc
aag ctg gag aac aag ggc tcc gag gac ttt gtt gaa ttt ctg 1872Gly Ser
Lys Leu Glu Asn Lys Gly Ser Glu Asp Phe Val Glu Phe Leu 610
615 620gag ttc cgt ctg atg ctg tgc tac atc tac gac
ttc ttc gag ctg acg 1920Glu Phe Arg Leu Met Leu Cys Tyr Ile Tyr Asp
Phe Phe Glu Leu Thr625 630 635
640gtg atg ttc gac gag att gac gcc tcc ggc aac atg ctg gtt gac gag
1968Val Met Phe Asp Glu Ile Asp Ala Ser Gly Asn Met Leu Val Asp Glu
645 650 655gag gag ttc aag cgc
gcc gtg ccc aag ctt gag gcg tgg ggc gcc aag 2016Glu Glu Phe Lys Arg
Ala Val Pro Lys Leu Glu Ala Trp Gly Ala Lys 660
665 670gtc gag gat ccc gcg gcg ctg ttc aag gag ctc gat
aag aac ggc act 2064Val Glu Asp Pro Ala Ala Leu Phe Lys Glu Leu Asp
Lys Asn Gly Thr 675 680 685ggg tcc
gtg acg ttc gac gag ttt gct gcg tgg gct tct gca gtc aaa 2112Gly Ser
Val Thr Phe Asp Glu Phe Ala Ala Trp Ala Ser Ala Val Lys 690
695 700ctg gac gcc gac ggc gac ccg gac aac gtg ccg
gat atc 2151Leu Asp Ala Asp Gly Asp Pro Asp Asn Val Pro
Asp Ile705 710 71536717PRTTrypanosoma
cruzi 36Met Ala Arg Ala Val Val Leu Glu Asp Gly Ala Leu Tyr Val Ala Asp1
5 10 15Asn Ala Asn Asn
Leu Val Arg Glu Ile Ser Asn Gly Val Val Thr Ser 20
25 30Phe Ile Thr Glu Gly Leu Leu Gly Pro Ser Tyr
Ile Lys Pro Tyr Ser 35 40 45Arg
Thr Asn Gly Ala His Asp Leu Phe Val Ser Asp Thr Gly Lys Ser 50
55 60Arg Ile Ile Phe Ala Pro Pro Gln Lys Lys
Thr Phe Ile Thr Val Phe65 70 75
80Ile Thr Gly Phe Gln Pro Asp Val Leu Gln Ile Ser Glu Lys Ser
Arg 85 90 95Leu Met Phe
Ala Ile Cys Asn Ser Thr Lys Ile Leu Ala Ile Asn Met 100
105 110Gln Gly Ala Thr Thr Pro Lys Glu Tyr Trp
Gln Val Gly Asn Ala Asp 115 120
125Cys Met Gly Tyr Gln Ser Ser Leu Met Leu Thr Thr Glu Glu Asp Lys 130
135 140Leu Leu Tyr Tyr Gly Ile Leu Asn
Gly Thr Pro Ser Ile Met Ser Leu145 150
155 160Pro Ala Thr Lys Thr Lys Thr Glu Ala Pro Arg Ile
Cys Pro Asp Val 165 170
175Leu Leu Gln Trp Pro His Gly Pro Ile Val Ser Leu Val Asn Ile Asn
180 185 190Lys His Ala Phe Tyr Val
Val Thr Ala Ser Asn Val Tyr Ile Val His 195 200
205Asp Gly Ser Tyr His Pro Thr Gly Ser Met Ala Gln Leu Gln
Gln Ala 210 215 220Glu Asn Asn Ile Thr
Asn Ser Lys Lys Glu Met Thr Lys Leu Arg Glu225 230
235 240Lys Val Lys Lys Ala Glu Lys Glu Lys Leu
Asp Ala Ile Asn Arg Ala 245 250
255Thr Lys Leu Glu Glu Glu Arg Asn Gln Ala Tyr Lys Ala Ala His Lys
260 265 270Ala Glu Glu Glu Lys
Ala Lys Thr Phe Gln Arg Leu Ile Thr Phe Glu 275
280 285Ser Glu Asn Ile Asn Leu Lys Lys Arg Pro Asn Asp
Ala Val Ser Asn 290 295 300Arg Asp Lys
Lys Lys Asn Ser Glu Thr Ala Lys Thr Asp Glu Val Glu305
310 315 320Lys Gln Arg Ala Ala Glu Ala
Ala Lys Ala Val Glu Thr Glu Lys Gln 325
330 335Arg Ala Ala Glu Ala Thr Lys Val Ala Glu Ala Glu
Lys Arg Lys Ala 340 345 350Ala
Glu Ala Ala Lys Ala Val Glu Thr Glu Lys Gln Arg Ala Ala Glu 355
360 365Ala Thr Lys Val Ala Glu Ala Glu Lys
Gln Lys Ala Ala Glu Ala Ala 370 375
380Lys Ala Val Glu Thr Glu Lys Gln Arg Ala Ala Glu Ala Thr Lys Val385
390 395 400Ala Glu Ala Glu
Lys Gln Arg Ala Ala Glu Ala Met Lys Val Ala Glu 405
410 415Ala Glu Lys Gln Lys Ala Ala Glu Ala Ala
Lys Ala Val Glu Thr Glu 420 425
430Lys Gln Arg Ala Ala Glu Ala Thr Lys Val Ala Glu Ala Glu Lys Gln
435 440 445Lys Ala Ala Glu Ala Ala Lys
Ala Val Glu Thr Glu Lys Gln Arg Ala 450 455
460Ala Glu Ala Thr Lys Val Ala Glu Ala Glu Lys Gln Lys Ala Ala
Glu465 470 475 480Ala Ala
Lys Ala Val Glu Thr Glu Lys Gln Arg Ala Ala Glu Ala Thr
485 490 495Lys Val Ala Glu Ala Glu Lys
Asp Ile Asp Pro Met Gly Ala Cys Gly 500 505
510Ser Lys Asp Ser Thr Ser Asp Lys Gly Leu Ala Ser Asp Lys
Asp Gly 515 520 525Lys Asn Ala Lys
Asp Arg Lys Glu Ala Trp Glu Arg Ile Arg Gln Ala 530
535 540Ile Pro Arg Glu Lys Thr Ala Glu Ala Lys Gln Arg
Arg Ile Glu Leu545 550 555
560Phe Lys Lys Phe Asp Lys Asn Glu Thr Gly Lys Leu Cys Tyr Asp Glu
565 570 575Val His Ser Gly Cys
Leu Glu Val Leu Lys Leu Asp Glu Phe Thr Pro 580
585 590Arg Val Arg Asp Ile Thr Lys Arg Ala Phe Asp Lys
Ala Arg Ala Leu 595 600 605Gly Ser
Lys Leu Glu Asn Lys Gly Ser Glu Asp Phe Val Glu Phe Leu 610
615 620Glu Phe Arg Leu Met Leu Cys Tyr Ile Tyr Asp
Phe Phe Glu Leu Thr625 630 635
640Val Met Phe Asp Glu Ile Asp Ala Ser Gly Asn Met Leu Val Asp Glu
645 650 655Glu Glu Phe Lys
Arg Ala Val Pro Lys Leu Glu Ala Trp Gly Ala Lys 660
665 670Val Glu Asp Pro Ala Ala Leu Phe Lys Glu Leu
Asp Lys Asn Gly Thr 675 680 685Gly
Ser Val Thr Phe Asp Glu Phe Ala Ala Trp Ala Ser Ala Val Lys 690
695 700Leu Asp Ala Asp Gly Asp Pro Asp Asn Val
Pro Asp Ile705 710
715371836DNATrypanosoma cruziCDS(1)..(1836) 37atg gag cag gag cgc agg cag
ctg ctc gag aag gac ccg cgc agg aac 48Met Glu Gln Glu Arg Arg Gln
Leu Leu Glu Lys Asp Pro Arg Arg Asn1 5 10
15gcg aag gag atc gct gcg ctt gag gag agc atg aat gcc
cgc gca cag 96Ala Lys Glu Ile Ala Ala Leu Glu Glu Ser Met Asn Ala
Arg Ala Gln 20 25 30gag ctg
gca cgc gag aag aag ctt gct gac cgc gcg ttc ctc gac cag 144Glu Leu
Ala Arg Glu Lys Lys Leu Ala Asp Arg Ala Phe Leu Asp Gln 35
40 45aag ccg gag ggc gtg ccg ctg cga gag ctg
ccg ctc gac gac gac agc 192Lys Pro Glu Gly Val Pro Leu Arg Glu Leu
Pro Leu Asp Asp Asp Ser 50 55 60gac
ttt gtt gct atg gag cag gag cgc agg cag ctg ctc gag aag gac 240Asp
Phe Val Ala Met Glu Gln Glu Arg Arg Gln Leu Leu Glu Lys Asp65
70 75 80ccg cgc agg aac gcg aag
gag atc gct gcg ctt gag gag agc atg aat 288Pro Arg Arg Asn Ala Lys
Glu Ile Ala Ala Leu Glu Glu Ser Met Asn 85
90 95gcc cgc gca cag gag ctg gca cgc gag aag aag ctt
gct gac cgc gcg 336Ala Arg Ala Gln Glu Leu Ala Arg Glu Lys Lys Leu
Ala Asp Arg Ala 100 105 110ttc
ctc gac cag aag ccg gag ggc gtg ccg ctg cga gag ctg ccg ctc 384Phe
Leu Asp Gln Lys Pro Glu Gly Val Pro Leu Arg Glu Leu Pro Leu 115
120 125gac gac gac agc gac ttt gtt gct atg
gag cag gag cgc agg cag ctg 432Asp Asp Asp Ser Asp Phe Val Ala Met
Glu Gln Glu Arg Arg Gln Leu 130 135
140ctc gag aag gac ccg cgc agg aac gcg aag gag atc gct gcg ctt gag
480Leu Glu Lys Asp Pro Arg Arg Asn Ala Lys Glu Ile Ala Ala Leu Glu145
150 155 160gag agc atg aat
gcc cgc gca cag gag ctg gca cgc gag aag aag ctt 528Glu Ser Met Asn
Ala Arg Ala Gln Glu Leu Ala Arg Glu Lys Lys Leu 165
170 175gct gac cgc gcg ttc ctc gac cag aag ccg
gag ggc gtg ccg ctg cga 576Ala Asp Arg Ala Phe Leu Asp Gln Lys Pro
Glu Gly Val Pro Leu Arg 180 185
190gag ctg ccg ctc gac gac gac agc gac ttt gtt gct atg gag cag gag
624Glu Leu Pro Leu Asp Asp Asp Ser Asp Phe Val Ala Met Glu Gln Glu
195 200 205cgc agg cag ctg ctc gag aag
gac ccg cgc agg aac gcg aag gag atc 672Arg Arg Gln Leu Leu Glu Lys
Asp Pro Arg Arg Asn Ala Lys Glu Ile 210 215
220gct gcg ctt gag gag agc atg aat gcc cgc gca cag gag ctg gca cgc
720Ala Ala Leu Glu Glu Ser Met Asn Ala Arg Ala Gln Glu Leu Ala Arg225
230 235 240gag aag aag ctt
gct gac cgc gcg ttc ctc gac cag aag ccg gag ggc 768Glu Lys Lys Leu
Ala Asp Arg Ala Phe Leu Asp Gln Lys Pro Glu Gly 245
250 255gtg ccg ctg cga gag ctg ccg ctc gac gac
gac agc gac ttt gtt gct 816Val Pro Leu Arg Glu Leu Pro Leu Asp Asp
Asp Ser Asp Phe Val Ala 260 265
270atg gag cag gag cgc agg cag ctg ctc gag aag gac ccg cgc agg aac
864Met Glu Gln Glu Arg Arg Gln Leu Leu Glu Lys Asp Pro Arg Arg Asn
275 280 285gcg aag gag atc gct gcg ctt
gag gag agc atg aat gcc cgc gca cag 912Ala Lys Glu Ile Ala Ala Leu
Glu Glu Ser Met Asn Ala Arg Ala Gln 290 295
300gag ctg gca cgc gag aag aag ctt gct gac cgc gcg ttc ctc gac cag
960Glu Leu Ala Arg Glu Lys Lys Leu Ala Asp Arg Ala Phe Leu Asp Gln305
310 315 320aag ccg gag ggc
gtg ccg ctg cga gag ctg ccg ctc gac gac gac agc 1008Lys Pro Glu Gly
Val Pro Leu Arg Glu Leu Pro Leu Asp Asp Asp Ser 325
330 335gac ttt gtt gct atg gag cag gag cgc agg
cag ctg ctc gag aag gac 1056Asp Phe Val Ala Met Glu Gln Glu Arg Arg
Gln Leu Leu Glu Lys Asp 340 345
350ccg cgc agg aac gcg aag gag atc gct gcg ctt gag gag agc atg aat
1104Pro Arg Arg Asn Ala Lys Glu Ile Ala Ala Leu Glu Glu Ser Met Asn
355 360 365gcc cgc gca cag gag ctg gca
cgc gag aag aag ctt gct gac cgc gcg 1152Ala Arg Ala Gln Glu Leu Ala
Arg Glu Lys Lys Leu Ala Asp Arg Ala 370 375
380ttc ctc gac cag aag ccg gag ggc gtg ccg ctg cga gag ctg ccg ctc
1200Phe Leu Asp Gln Lys Pro Glu Gly Val Pro Leu Arg Glu Leu Pro Leu385
390 395 400gac gac gac agc
gac ttt gtt gct atg gag cag gag cgc agg cag ctg 1248Asp Asp Asp Ser
Asp Phe Val Ala Met Glu Gln Glu Arg Arg Gln Leu 405
410 415ctc gag aag gac ccg cgc agg aac gcg aag
gag atc gct gcg ctt gag 1296Leu Glu Lys Asp Pro Arg Arg Asn Ala Lys
Glu Ile Ala Ala Leu Glu 420 425
430gag agc atg aat gcc cgc gca cag gag ctg gca cgc gag aag aag ctt
1344Glu Ser Met Asn Ala Arg Ala Gln Glu Leu Ala Arg Glu Lys Lys Leu
435 440 445gct gac cgc gcg ttc ctc gac
cag aag ccg gag ggc gtg ccg ctg cga 1392Ala Asp Arg Ala Phe Leu Asp
Gln Lys Pro Glu Gly Val Pro Leu Arg 450 455
460gag ctg ccg ctc gac gac gac agc gac ttt gtt gct atg gag cag gag
1440Glu Leu Pro Leu Asp Asp Asp Ser Asp Phe Val Ala Met Glu Gln Glu465
470 475 480cgc agg cag ctg
ctc gag aag gac ccg cgc agg aac gcg aag gag atc 1488Arg Arg Gln Leu
Leu Glu Lys Asp Pro Arg Arg Asn Ala Lys Glu Ile 485
490 495gct gcg ctt gag gag agc atg aat gcc cgc
gca cag gag ctg gca cgc 1536Ala Ala Leu Glu Glu Ser Met Asn Ala Arg
Ala Gln Glu Leu Ala Arg 500 505
510gag aag aag ctt gct gac cgc gcg ttc ctc gac cag aag ccg gag ggc
1584Glu Lys Lys Leu Ala Asp Arg Ala Phe Leu Asp Gln Lys Pro Glu Gly
515 520 525gtg ccg ctg cga gag ctg ccg
ctc gac gac gac agc gac ttt gtt gct 1632Val Pro Leu Arg Glu Leu Pro
Leu Asp Asp Asp Ser Asp Phe Val Ala 530 535
540atg gag cag gag cgc agg cag ctg ctc gag aag gac ccg cgc agg aac
1680Met Glu Gln Glu Arg Arg Gln Leu Leu Glu Lys Asp Pro Arg Arg Asn545
550 555 560gcg aag gag atc
gct gcg ctt gag gag agc atg aat gcc cgc gca cag 1728Ala Lys Glu Ile
Ala Ala Leu Glu Glu Ser Met Asn Ala Arg Ala Gln 565
570 575gag ctg gca cgc gag aag aag ctt gct gac
cgc gcg ttc ctc gac cag 1776Glu Leu Ala Arg Glu Lys Lys Leu Ala Asp
Arg Ala Phe Leu Asp Gln 580 585
590aag ccg gag ggc gtg ccg ctg cga gag ctg ccg ctc gac gac gac agc
1824Lys Pro Glu Gly Val Pro Leu Arg Glu Leu Pro Leu Asp Asp Asp Ser
595 600 605gac ttt gtt gct
1836Asp Phe Val Ala
61038612PRTTrypanosoma cruzi 38Met Glu Gln Glu Arg Arg Gln Leu Leu Glu
Lys Asp Pro Arg Arg Asn1 5 10
15Ala Lys Glu Ile Ala Ala Leu Glu Glu Ser Met Asn Ala Arg Ala Gln
20 25 30Glu Leu Ala Arg Glu Lys
Lys Leu Ala Asp Arg Ala Phe Leu Asp Gln 35 40
45Lys Pro Glu Gly Val Pro Leu Arg Glu Leu Pro Leu Asp Asp
Asp Ser 50 55 60Asp Phe Val Ala Met
Glu Gln Glu Arg Arg Gln Leu Leu Glu Lys Asp65 70
75 80Pro Arg Arg Asn Ala Lys Glu Ile Ala Ala
Leu Glu Glu Ser Met Asn 85 90
95Ala Arg Ala Gln Glu Leu Ala Arg Glu Lys Lys Leu Ala Asp Arg Ala
100 105 110Phe Leu Asp Gln Lys
Pro Glu Gly Val Pro Leu Arg Glu Leu Pro Leu 115
120 125Asp Asp Asp Ser Asp Phe Val Ala Met Glu Gln Glu
Arg Arg Gln Leu 130 135 140Leu Glu Lys
Asp Pro Arg Arg Asn Ala Lys Glu Ile Ala Ala Leu Glu145
150 155 160Glu Ser Met Asn Ala Arg Ala
Gln Glu Leu Ala Arg Glu Lys Lys Leu 165
170 175Ala Asp Arg Ala Phe Leu Asp Gln Lys Pro Glu Gly
Val Pro Leu Arg 180 185 190Glu
Leu Pro Leu Asp Asp Asp Ser Asp Phe Val Ala Met Glu Gln Glu 195
200 205Arg Arg Gln Leu Leu Glu Lys Asp Pro
Arg Arg Asn Ala Lys Glu Ile 210 215
220Ala Ala Leu Glu Glu Ser Met Asn Ala Arg Ala Gln Glu Leu Ala Arg225
230 235 240Glu Lys Lys Leu
Ala Asp Arg Ala Phe Leu Asp Gln Lys Pro Glu Gly 245
250 255Val Pro Leu Arg Glu Leu Pro Leu Asp Asp
Asp Ser Asp Phe Val Ala 260 265
270Met Glu Gln Glu Arg Arg Gln Leu Leu Glu Lys Asp Pro Arg Arg Asn
275 280 285Ala Lys Glu Ile Ala Ala Leu
Glu Glu Ser Met Asn Ala Arg Ala Gln 290 295
300Glu Leu Ala Arg Glu Lys Lys Leu Ala Asp Arg Ala Phe Leu Asp
Gln305 310 315 320Lys Pro
Glu Gly Val Pro Leu Arg Glu Leu Pro Leu Asp Asp Asp Ser
325 330 335Asp Phe Val Ala Met Glu Gln
Glu Arg Arg Gln Leu Leu Glu Lys Asp 340 345
350Pro Arg Arg Asn Ala Lys Glu Ile Ala Ala Leu Glu Glu Ser
Met Asn 355 360 365Ala Arg Ala Gln
Glu Leu Ala Arg Glu Lys Lys Leu Ala Asp Arg Ala 370
375 380Phe Leu Asp Gln Lys Pro Glu Gly Val Pro Leu Arg
Glu Leu Pro Leu385 390 395
400Asp Asp Asp Ser Asp Phe Val Ala Met Glu Gln Glu Arg Arg Gln Leu
405 410 415Leu Glu Lys Asp Pro
Arg Arg Asn Ala Lys Glu Ile Ala Ala Leu Glu 420
425 430Glu Ser Met Asn Ala Arg Ala Gln Glu Leu Ala Arg
Glu Lys Lys Leu 435 440 445Ala Asp
Arg Ala Phe Leu Asp Gln Lys Pro Glu Gly Val Pro Leu Arg 450
455 460Glu Leu Pro Leu Asp Asp Asp Ser Asp Phe Val
Ala Met Glu Gln Glu465 470 475
480Arg Arg Gln Leu Leu Glu Lys Asp Pro Arg Arg Asn Ala Lys Glu Ile
485 490 495Ala Ala Leu Glu
Glu Ser Met Asn Ala Arg Ala Gln Glu Leu Ala Arg 500
505 510Glu Lys Lys Leu Ala Asp Arg Ala Phe Leu Asp
Gln Lys Pro Glu Gly 515 520 525Val
Pro Leu Arg Glu Leu Pro Leu Asp Asp Asp Ser Asp Phe Val Ala 530
535 540Met Glu Gln Glu Arg Arg Gln Leu Leu Glu
Lys Asp Pro Arg Arg Asn545 550 555
560Ala Lys Glu Ile Ala Ala Leu Glu Glu Ser Met Asn Ala Arg Ala
Gln 565 570 575Glu Leu Ala
Arg Glu Lys Lys Leu Ala Asp Arg Ala Phe Leu Asp Gln 580
585 590Lys Pro Glu Gly Val Pro Leu Arg Glu Leu
Pro Leu Asp Asp Asp Ser 595 600
605Asp Phe Val Ala 61039621DNATrypanosoma cruziCDS(1)..(621) 39ttt aat
cct tct acg gac aaa ttg aag cta aac caa caa aat aag cct 48Phe Asn
Pro Ser Thr Asp Lys Leu Lys Leu Asn Gln Gln Asn Lys Pro1 5
10 15cat att gca aat aat aaa caa aaa
aca aca ctc gaa aaa act caa aca 96His Ile Ala Asn Asn Lys Gln Lys
Thr Thr Leu Glu Lys Thr Gln Thr 20 25
30gaa caa aaa aca gcg cca ttt gga cag ggc gca gca ggg tgg aca
aaa 144Glu Gln Lys Thr Ala Pro Phe Gly Gln Gly Ala Ala Gly Trp Thr
Lys 35 40 45cca tca cca ttt gga
cag gcc gca gca ggt gac aaa cca cca cca ttt 192Pro Ser Pro Phe Gly
Gln Ala Ala Ala Gly Asp Lys Pro Pro Pro Phe 50 55
60gga cag gcc gca gca ggt gac aaa cca cca cca ttt gga cag
gcc gca 240Gly Gln Ala Ala Ala Gly Asp Lys Pro Pro Pro Phe Gly Gln
Ala Ala65 70 75 80gca
ggt gac aaa cca tca cta ttt gga cag gcc gca gca ggt gac aaa 288Ala
Gly Asp Lys Pro Ser Leu Phe Gly Gln Ala Ala Ala Gly Asp Lys
85 90 95cca tca cca ttt gga cag gcc
gca gca ggt gac aaa cca cca cca ttt 336Pro Ser Pro Phe Gly Gln Ala
Ala Ala Gly Asp Lys Pro Pro Pro Phe 100 105
110gga cag gcc gca gca ggt gac aaa cca tca cta ttt gga cag
gcc gca 384Gly Gln Ala Ala Ala Gly Asp Lys Pro Ser Leu Phe Gly Gln
Ala Ala 115 120 125gca ggt gac aaa
cca tca cca ttt gga cag gcc gca gca ggt gac aaa 432Ala Gly Asp Lys
Pro Ser Pro Phe Gly Gln Ala Ala Ala Gly Asp Lys 130
135 140cca cca cca ttt gga cag gcc gca gca ggt gac aaa
cca cca cca ttt 480Pro Pro Pro Phe Gly Gln Ala Ala Ala Gly Asp Lys
Pro Pro Pro Phe145 150 155
160gga cag gcc gca gca ggt gac aaa cca tca cta ttt gga cag gcc gca
528Gly Gln Ala Ala Ala Gly Asp Lys Pro Ser Leu Phe Gly Gln Ala Ala
165 170 175gca ggt gac aaa cca
tca cca ttt gga cag gga act gcg ttt gat gcc 576Ala Gly Asp Lys Pro
Ser Pro Phe Gly Gln Gly Thr Ala Phe Asp Ala 180
185 190tct cga agc act gtg ttt gcg aat gcg cct ggt gtt
gcc cag gtg 621Ser Arg Ser Thr Val Phe Ala Asn Ala Pro Gly Val
Ala Gln Val 195 200
20540207PRTTrypanosoma cruzi 40Phe Asn Pro Ser Thr Asp Lys Leu Lys Leu
Asn Gln Gln Asn Lys Pro1 5 10
15His Ile Ala Asn Asn Lys Gln Lys Thr Thr Leu Glu Lys Thr Gln Thr
20 25 30Glu Gln Lys Thr Ala Pro
Phe Gly Gln Gly Ala Ala Gly Trp Thr Lys 35 40
45Pro Ser Pro Phe Gly Gln Ala Ala Ala Gly Asp Lys Pro Pro
Pro Phe 50 55 60Gly Gln Ala Ala Ala
Gly Asp Lys Pro Pro Pro Phe Gly Gln Ala Ala65 70
75 80Ala Gly Asp Lys Pro Ser Leu Phe Gly Gln
Ala Ala Ala Gly Asp Lys 85 90
95Pro Ser Pro Phe Gly Gln Ala Ala Ala Gly Asp Lys Pro Pro Pro Phe
100 105 110Gly Gln Ala Ala Ala
Gly Asp Lys Pro Ser Leu Phe Gly Gln Ala Ala 115
120 125Ala Gly Asp Lys Pro Ser Pro Phe Gly Gln Ala Ala
Ala Gly Asp Lys 130 135 140Pro Pro Pro
Phe Gly Gln Ala Ala Ala Gly Asp Lys Pro Pro Pro Phe145
150 155 160Gly Gln Ala Ala Ala Gly Asp
Lys Pro Ser Leu Phe Gly Gln Ala Ala 165
170 175Ala Gly Asp Lys Pro Ser Pro Phe Gly Gln Gly Thr
Ala Phe Asp Ala 180 185 190Ser
Arg Ser Thr Val Phe Ala Asn Ala Pro Gly Val Ala Gln Val 195
200 205411845DNATrypanosoma cruziCDS(1)..(1845)
41ttt aat cct tct acg gac aaa ttg aag cta aac caa caa aat aag cct
48Phe Asn Pro Ser Thr Asp Lys Leu Lys Leu Asn Gln Gln Asn Lys Pro1
5 10 15cat att gca aat aat aaa
caa aaa aca aca ctc gaa aaa act caa aca 96His Ile Ala Asn Asn Lys
Gln Lys Thr Thr Leu Glu Lys Thr Gln Thr 20 25
30gaa caa aaa aca gcg cca ttt gga cag ggc gca gca ggg
tgg aca aaa 144Glu Gln Lys Thr Ala Pro Phe Gly Gln Gly Ala Ala Gly
Trp Thr Lys 35 40 45cca tca cca
ttt gga cag gcc gca gca ggt gac aaa cca cca cca ttt 192Pro Ser Pro
Phe Gly Gln Ala Ala Ala Gly Asp Lys Pro Pro Pro Phe 50
55 60gga cag gcc gca gca ggt gac aaa cca cca cca ttt
gga cag gcc gca 240Gly Gln Ala Ala Ala Gly Asp Lys Pro Pro Pro Phe
Gly Gln Ala Ala65 70 75
80gca ggt gac aaa cca tca cta ttt gga cag gcc gca gca ggt gac aaa
288Ala Gly Asp Lys Pro Ser Leu Phe Gly Gln Ala Ala Ala Gly Asp Lys
85 90 95cca tca cca ttt gga cag
gcc gca gca ggt gac aaa cca cca cca ttt 336Pro Ser Pro Phe Gly Gln
Ala Ala Ala Gly Asp Lys Pro Pro Pro Phe 100
105 110gga cag gcc gca gca ggt gac aaa cca tca cta ttt
gga cag gcc gca 384Gly Gln Ala Ala Ala Gly Asp Lys Pro Ser Leu Phe
Gly Gln Ala Ala 115 120 125gca ggt
gac aaa cca tca cca ttt gga cag gcc gca gca ggt gac aaa 432Ala Gly
Asp Lys Pro Ser Pro Phe Gly Gln Ala Ala Ala Gly Asp Lys 130
135 140cca cca cca ttt gga cag gcc gca gca ggt gac
aaa cca cca cca ttt 480Pro Pro Pro Phe Gly Gln Ala Ala Ala Gly Asp
Lys Pro Pro Pro Phe145 150 155
160gga cag gcc gca gca ggt gac aaa cca tca cta ttt gga cag gcc gca
528Gly Gln Ala Ala Ala Gly Asp Lys Pro Ser Leu Phe Gly Gln Ala Ala
165 170 175gca ggt gac aaa cca
tca cca ttt gga cag gga act gcg ttt gat gcc 576Ala Gly Asp Lys Pro
Ser Pro Phe Gly Gln Gly Thr Ala Phe Asp Ala 180
185 190tct cga agc act gtg ttt gcg aat gcg cct ggt gtt
gcc cag gtg atg 624Ser Arg Ser Thr Val Phe Ala Asn Ala Pro Gly Val
Ala Gln Val Met 195 200 205gag cag
gag cgc agg cag ctg ctc gag aag gac ccg cgc agg aac gcg 672Glu Gln
Glu Arg Arg Gln Leu Leu Glu Lys Asp Pro Arg Arg Asn Ala 210
215 220aag gag atc gct gcg ctt gag gag agc atg aat
gcc cgc gca cag gag 720Lys Glu Ile Ala Ala Leu Glu Glu Ser Met Asn
Ala Arg Ala Gln Glu225 230 235
240ctg gca cgc gag aag aag ctt gct gac cgc gcg ttc ctc gac cag aag
768Leu Ala Arg Glu Lys Lys Leu Ala Asp Arg Ala Phe Leu Asp Gln Lys
245 250 255ccg gag ggc gtg ccg
ctg cga gag ctg ccg ctc gac gac gac agc gac 816Pro Glu Gly Val Pro
Leu Arg Glu Leu Pro Leu Asp Asp Asp Ser Asp 260
265 270ttt gtt gct atg gag cag gag cgc agg cag ctg ctc
gag aag gac ccg 864Phe Val Ala Met Glu Gln Glu Arg Arg Gln Leu Leu
Glu Lys Asp Pro 275 280 285cgc agg
aac gcg aag gag atc gct gcg ctt gag gag agc atg aat gcc 912Arg Arg
Asn Ala Lys Glu Ile Ala Ala Leu Glu Glu Ser Met Asn Ala 290
295 300cgc gca cag gag ctg gca cgc gag aag aag ctt
gct gac cgc gcg ttc 960Arg Ala Gln Glu Leu Ala Arg Glu Lys Lys Leu
Ala Asp Arg Ala Phe305 310 315
320ctc gac cag aag ccg gag ggc gtg ccg ctg cga gag ctg ccg ctc gac
1008Leu Asp Gln Lys Pro Glu Gly Val Pro Leu Arg Glu Leu Pro Leu Asp
325 330 335gac gac agc gac ttt
gtt gct atg gag cag gag cgc agg cag ctg ctc 1056Asp Asp Ser Asp Phe
Val Ala Met Glu Gln Glu Arg Arg Gln Leu Leu 340
345 350gag aag gac ccg cgc agg aac gcg aag gag atc gct
gcg ctt gag gag 1104Glu Lys Asp Pro Arg Arg Asn Ala Lys Glu Ile Ala
Ala Leu Glu Glu 355 360 365agc atg
aat gcc cgc gca cag gag ctg gca cgc gag aag aag ctt gct 1152Ser Met
Asn Ala Arg Ala Gln Glu Leu Ala Arg Glu Lys Lys Leu Ala 370
375 380gac cgc gcg ttc ctc gac cag aag ccg gag ggc
gtg ccg ctg cga gag 1200Asp Arg Ala Phe Leu Asp Gln Lys Pro Glu Gly
Val Pro Leu Arg Glu385 390 395
400ctg ccg ctc gac gac gac agc gac ttt gtt gct atg gag cag gag cgc
1248Leu Pro Leu Asp Asp Asp Ser Asp Phe Val Ala Met Glu Gln Glu Arg
405 410 415agg cag ctg ctc gag
aag gac ccg cgc agg aac gcg aag gag atc gct 1296Arg Gln Leu Leu Glu
Lys Asp Pro Arg Arg Asn Ala Lys Glu Ile Ala 420
425 430gcg ctt gag gag agc atg aat gcc cgc gca cag gag
ctg gca cgc gag 1344Ala Leu Glu Glu Ser Met Asn Ala Arg Ala Gln Glu
Leu Ala Arg Glu 435 440 445aag aag
ctt gct gac cgc gcg ttc ctc gac cag aag ccg gag ggc gtg 1392Lys Lys
Leu Ala Asp Arg Ala Phe Leu Asp Gln Lys Pro Glu Gly Val 450
455 460ccg ctg cga gag ctg ccg ctc gac gac gac agc
gac ttt gtt gct atg 1440Pro Leu Arg Glu Leu Pro Leu Asp Asp Asp Ser
Asp Phe Val Ala Met465 470 475
480gag cag gag cgc agg cag ctg ctc gag aag gac ccg cgc agg aac gcg
1488Glu Gln Glu Arg Arg Gln Leu Leu Glu Lys Asp Pro Arg Arg Asn Ala
485 490 495aag gag atc gct gcg
ctt gag gag agc atg aat gcc cgc gca cag gag 1536Lys Glu Ile Ala Ala
Leu Glu Glu Ser Met Asn Ala Arg Ala Gln Glu 500
505 510ctg gca cgc gag aag aag ctt gct gac cgc gcg ttc
ctc gac cag aag 1584Leu Ala Arg Glu Lys Lys Leu Ala Asp Arg Ala Phe
Leu Asp Gln Lys 515 520 525ccg gag
ggc gtg ccg ctg cga gag ctg ccg ctc gac gac gac agc gac 1632Pro Glu
Gly Val Pro Leu Arg Glu Leu Pro Leu Asp Asp Asp Ser Asp 530
535 540ttt gtt gct atg gag cag gag cgc agg cag ctg
ctc gag aag gac ccg 1680Phe Val Ala Met Glu Gln Glu Arg Arg Gln Leu
Leu Glu Lys Asp Pro545 550 555
560cgc agg aac gcg aag gag atc gct gcg ctt gag gag agc atg aat gcc
1728Arg Arg Asn Ala Lys Glu Ile Ala Ala Leu Glu Glu Ser Met Asn Ala
565 570 575cgc gca cag gag ctg
gca cgc gag aag aag ctt gct gac cgc gcg ttc 1776Arg Ala Gln Glu Leu
Ala Arg Glu Lys Lys Leu Ala Asp Arg Ala Phe 580
585 590ctc gac cag aag ccg gag ggc gtg ccg ctg cga gag
ctg ccg ctc gac 1824Leu Asp Gln Lys Pro Glu Gly Val Pro Leu Arg Glu
Leu Pro Leu Asp 595 600 605gac gac
agc gac ttt gtt gct 1845Asp Asp
Ser Asp Phe Val Ala 610 61542615PRTTrypanosoma cruzi
42Phe Asn Pro Ser Thr Asp Lys Leu Lys Leu Asn Gln Gln Asn Lys Pro1
5 10 15His Ile Ala Asn Asn Lys
Gln Lys Thr Thr Leu Glu Lys Thr Gln Thr 20 25
30Glu Gln Lys Thr Ala Pro Phe Gly Gln Gly Ala Ala Gly
Trp Thr Lys 35 40 45Pro Ser Pro
Phe Gly Gln Ala Ala Ala Gly Asp Lys Pro Pro Pro Phe 50
55 60Gly Gln Ala Ala Ala Gly Asp Lys Pro Pro Pro Phe
Gly Gln Ala Ala65 70 75
80Ala Gly Asp Lys Pro Ser Leu Phe Gly Gln Ala Ala Ala Gly Asp Lys
85 90 95Pro Ser Pro Phe Gly Gln
Ala Ala Ala Gly Asp Lys Pro Pro Pro Phe 100
105 110Gly Gln Ala Ala Ala Gly Asp Lys Pro Ser Leu Phe
Gly Gln Ala Ala 115 120 125Ala Gly
Asp Lys Pro Ser Pro Phe Gly Gln Ala Ala Ala Gly Asp Lys 130
135 140Pro Pro Pro Phe Gly Gln Ala Ala Ala Gly Asp
Lys Pro Pro Pro Phe145 150 155
160Gly Gln Ala Ala Ala Gly Asp Lys Pro Ser Leu Phe Gly Gln Ala Ala
165 170 175Ala Gly Asp Lys
Pro Ser Pro Phe Gly Gln Gly Thr Ala Phe Asp Ala 180
185 190Ser Arg Ser Thr Val Phe Ala Asn Ala Pro Gly
Val Ala Gln Val Met 195 200 205Glu
Gln Glu Arg Arg Gln Leu Leu Glu Lys Asp Pro Arg Arg Asn Ala 210
215 220Lys Glu Ile Ala Ala Leu Glu Glu Ser Met
Asn Ala Arg Ala Gln Glu225 230 235
240Leu Ala Arg Glu Lys Lys Leu Ala Asp Arg Ala Phe Leu Asp Gln
Lys 245 250 255Pro Glu Gly
Val Pro Leu Arg Glu Leu Pro Leu Asp Asp Asp Ser Asp 260
265 270Phe Val Ala Met Glu Gln Glu Arg Arg Gln
Leu Leu Glu Lys Asp Pro 275 280
285Arg Arg Asn Ala Lys Glu Ile Ala Ala Leu Glu Glu Ser Met Asn Ala 290
295 300Arg Ala Gln Glu Leu Ala Arg Glu
Lys Lys Leu Ala Asp Arg Ala Phe305 310
315 320Leu Asp Gln Lys Pro Glu Gly Val Pro Leu Arg Glu
Leu Pro Leu Asp 325 330
335Asp Asp Ser Asp Phe Val Ala Met Glu Gln Glu Arg Arg Gln Leu Leu
340 345 350Glu Lys Asp Pro Arg Arg
Asn Ala Lys Glu Ile Ala Ala Leu Glu Glu 355 360
365Ser Met Asn Ala Arg Ala Gln Glu Leu Ala Arg Glu Lys Lys
Leu Ala 370 375 380Asp Arg Ala Phe Leu
Asp Gln Lys Pro Glu Gly Val Pro Leu Arg Glu385 390
395 400Leu Pro Leu Asp Asp Asp Ser Asp Phe Val
Ala Met Glu Gln Glu Arg 405 410
415Arg Gln Leu Leu Glu Lys Asp Pro Arg Arg Asn Ala Lys Glu Ile Ala
420 425 430Ala Leu Glu Glu Ser
Met Asn Ala Arg Ala Gln Glu Leu Ala Arg Glu 435
440 445Lys Lys Leu Ala Asp Arg Ala Phe Leu Asp Gln Lys
Pro Glu Gly Val 450 455 460Pro Leu Arg
Glu Leu Pro Leu Asp Asp Asp Ser Asp Phe Val Ala Met465
470 475 480Glu Gln Glu Arg Arg Gln Leu
Leu Glu Lys Asp Pro Arg Arg Asn Ala 485
490 495Lys Glu Ile Ala Ala Leu Glu Glu Ser Met Asn Ala
Arg Ala Gln Glu 500 505 510Leu
Ala Arg Glu Lys Lys Leu Ala Asp Arg Ala Phe Leu Asp Gln Lys 515
520 525Pro Glu Gly Val Pro Leu Arg Glu Leu
Pro Leu Asp Asp Asp Ser Asp 530 535
540Phe Val Ala Met Glu Gln Glu Arg Arg Gln Leu Leu Glu Lys Asp Pro545
550 555 560Arg Arg Asn Ala
Lys Glu Ile Ala Ala Leu Glu Glu Ser Met Asn Ala 565
570 575Arg Ala Gln Glu Leu Ala Arg Glu Lys Lys
Leu Ala Asp Arg Ala Phe 580 585
590Leu Asp Gln Lys Pro Glu Gly Val Pro Leu Arg Glu Leu Pro Leu Asp
595 600 605Asp Asp Ser Asp Phe Val Ala
610 61543858DNATrypanosoma cruziCDS(1)..(858) 43gat cca
acg tat cgt ttt gca aac cac gcg ttc acg ctg gtg gcg tcg 48Asp Pro
Thr Tyr Arg Phe Ala Asn His Ala Phe Thr Leu Val Ala Ser1 5
10 15gtg acg att cac gag gtt ccg agc
gtc gcg agt cct ttg ctg ggt gcg 96Val Thr Ile His Glu Val Pro Ser
Val Ala Ser Pro Leu Leu Gly Ala 20 25
30agc ctg gac tct tct ggt ggc aaa aaa ctc ctg ggg ctc tcg tac
gac 144Ser Leu Asp Ser Ser Gly Gly Lys Lys Leu Leu Gly Leu Ser Tyr
Asp 35 40 45gag aag cac cag tgg
cag cca ata tac gga tca acg ccg gtg acg ccg 192Glu Lys His Gln Trp
Gln Pro Ile Tyr Gly Ser Thr Pro Val Thr Pro 50 55
60acc gga tcg tgg gag atg ggt aag agg tac cac gtg gtt ctt
acg atg 240Thr Gly Ser Trp Glu Met Gly Lys Arg Tyr His Val Val Leu
Thr Met65 70 75 80gcg
aat aaa att ggc tcc gtg tac att gat gga gaa cct ctg gag ggt 288Ala
Asn Lys Ile Gly Ser Val Tyr Ile Asp Gly Glu Pro Leu Glu Gly
85 90 95tca ggg cag acc gtt gtg cca
gac gag agg acg cct gac atc tcc cac 336Ser Gly Gln Thr Val Val Pro
Asp Glu Arg Thr Pro Asp Ile Ser His 100 105
110ttc tac gtt ggc ggg tat gga agg agt gat atg cca acc ata
agc cac 384Phe Tyr Val Gly Gly Tyr Gly Arg Ser Asp Met Pro Thr Ile
Ser His 115 120 125gtg acg gtg aat
aat gtt ctt ctt tac aac cgt cag ctg aat gcc gag 432Val Thr Val Asn
Asn Val Leu Leu Tyr Asn Arg Gln Leu Asn Ala Glu 130
135 140gag atc agg acc ttg ttc ttg agc cag gac ctg att
ggc acg gaa gca 480Glu Ile Arg Thr Leu Phe Leu Ser Gln Asp Leu Ile
Gly Thr Glu Ala145 150 155
160cac atg ggc agc agc agc ggc agc agt gcc cac ggt acg ccc tcg att
528His Met Gly Ser Ser Ser Gly Ser Ser Ala His Gly Thr Pro Ser Ile
165 170 175ccc gtt gac agc agt
gcc cac ggt aca ccc tcg act ccc gtt gac agc 576Pro Val Asp Ser Ser
Ala His Gly Thr Pro Ser Thr Pro Val Asp Ser 180
185 190agt gcc cac ggt acg ccc tcg act ccc gtt gac agc
agt gcc cac ggt 624Ser Ala His Gly Thr Pro Ser Thr Pro Val Asp Ser
Ser Ala His Gly 195 200 205aca ccc
tcg act ccc gtt gac agc agt gcc cac ggt aca ccc tcg act 672Thr Pro
Ser Thr Pro Val Asp Ser Ser Ala His Gly Thr Pro Ser Thr 210
215 220ccc gtt gac agc agt gcc cac ggt aag ccc tcg
act ccc gct gac agc 720Pro Val Asp Ser Ser Ala His Gly Lys Pro Ser
Thr Pro Ala Asp Ser225 230 235
240agt gcc cac agt acg ccc tcg act ccc gct gac agc agt gcc cac agt
768Ser Ala His Ser Thr Pro Ser Thr Pro Ala Asp Ser Ser Ala His Ser
245 250 255acg ccc tca att ccc
gct gac agc agt gcc cac agt acg ccc tca gct 816Thr Pro Ser Ile Pro
Ala Asp Ser Ser Ala His Ser Thr Pro Ser Ala 260
265 270ccc gct gac aac ggc gcc aat ggt acg gtt ttg att
ttg tcg 858Pro Ala Asp Asn Gly Ala Asn Gly Thr Val Leu Ile
Leu Ser 275 280
28544286PRTTrypanosoma cruzi 44Asp Pro Thr Tyr Arg Phe Ala Asn His Ala
Phe Thr Leu Val Ala Ser1 5 10
15Val Thr Ile His Glu Val Pro Ser Val Ala Ser Pro Leu Leu Gly Ala
20 25 30Ser Leu Asp Ser Ser Gly
Gly Lys Lys Leu Leu Gly Leu Ser Tyr Asp 35 40
45Glu Lys His Gln Trp Gln Pro Ile Tyr Gly Ser Thr Pro Val
Thr Pro 50 55 60Thr Gly Ser Trp Glu
Met Gly Lys Arg Tyr His Val Val Leu Thr Met65 70
75 80Ala Asn Lys Ile Gly Ser Val Tyr Ile Asp
Gly Glu Pro Leu Glu Gly 85 90
95Ser Gly Gln Thr Val Val Pro Asp Glu Arg Thr Pro Asp Ile Ser His
100 105 110Phe Tyr Val Gly Gly
Tyr Gly Arg Ser Asp Met Pro Thr Ile Ser His 115
120 125Val Thr Val Asn Asn Val Leu Leu Tyr Asn Arg Gln
Leu Asn Ala Glu 130 135 140Glu Ile Arg
Thr Leu Phe Leu Ser Gln Asp Leu Ile Gly Thr Glu Ala145
150 155 160His Met Gly Ser Ser Ser Gly
Ser Ser Ala His Gly Thr Pro Ser Ile 165
170 175Pro Val Asp Ser Ser Ala His Gly Thr Pro Ser Thr
Pro Val Asp Ser 180 185 190Ser
Ala His Gly Thr Pro Ser Thr Pro Val Asp Ser Ser Ala His Gly 195
200 205Thr Pro Ser Thr Pro Val Asp Ser Ser
Ala His Gly Thr Pro Ser Thr 210 215
220Pro Val Asp Ser Ser Ala His Gly Lys Pro Ser Thr Pro Ala Asp Ser225
230 235 240Ser Ala His Ser
Thr Pro Ser Thr Pro Ala Asp Ser Ser Ala His Ser 245
250 255Thr Pro Ser Ile Pro Ala Asp Ser Ser Ala
His Ser Thr Pro Ser Ala 260 265
270Pro Ala Asp Asn Gly Ala Asn Gly Thr Val Leu Ile Leu Ser 275
280 28545701DNATrypanosoma
cruziCDS(1)..(699) 45act cat gac gcg tac agg ccc gtt gat ccc tcg gcg tac
aag cgc gcc 48Thr His Asp Ala Tyr Arg Pro Val Asp Pro Ser Ala Tyr
Lys Arg Ala1 5 10 15ttg
ccg cag gaa gag caa gag gat gtg ggg ccg cgc cac gtt gat ccc 96Leu
Pro Gln Glu Glu Gln Glu Asp Val Gly Pro Arg His Val Asp Pro 20
25 30gac cac ttc cgc tcg acc tcg acg
act cat gac gcg tac agg ccc gtt 144Asp His Phe Arg Ser Thr Ser Thr
Thr His Asp Ala Tyr Arg Pro Val 35 40
45gat ccc tcg gcg tac aag cgc gcc ttg ccg cag gaa gag caa gag gat
192Asp Pro Ser Ala Tyr Lys Arg Ala Leu Pro Gln Glu Glu Gln Glu Asp
50 55 60gtg ggg ccg cgc cac gtt gat ccc
gac cac ttc cgc tcg acg act cat 240Val Gly Pro Arg His Val Asp Pro
Asp His Phe Arg Ser Thr Thr His65 70 75
80gac gcg tac agg ccc gtt gat ccc tcg gcg tac aag cgc
gcc ttg ccg 288Asp Ala Tyr Arg Pro Val Asp Pro Ser Ala Tyr Lys Arg
Ala Leu Pro 85 90 95cag
gaa gag caa gag gat gtg ggg ccg cgc cac gtt gat ccc gac cac 336Gln
Glu Glu Gln Glu Asp Val Gly Pro Arg His Val Asp Pro Asp His
100 105 110ttc cgc tcg acc tcg acg act
cat gac gcg tac agg ccc gtt gat ccc 384Phe Arg Ser Thr Ser Thr Thr
His Asp Ala Tyr Arg Pro Val Asp Pro 115 120
125tcg gcg tac aag cgc gcc ttg ccg cag gaa gag caa gag gat gtg
ggg 432Ser Ala Tyr Lys Arg Ala Leu Pro Gln Glu Glu Gln Glu Asp Val
Gly 130 135 140ccg cgc cac gtt gat ccc
gac cac ttc cgc tcg acc tcg acg act cat 480Pro Arg His Val Asp Pro
Asp His Phe Arg Ser Thr Ser Thr Thr His145 150
155 160gac gcg tac agg ccc gtt gat ccc tcg gcg tac
aag cgc gcc ttg ccg 528Asp Ala Tyr Arg Pro Val Asp Pro Ser Ala Tyr
Lys Arg Ala Leu Pro 165 170
175cag gaa gag caa gag gat gtg ggg ccg cgc cac gtt gat ccc gac cac
576Gln Glu Glu Gln Glu Asp Val Gly Pro Arg His Val Asp Pro Asp His
180 185 190ttc cgc tcg acg act cat
gac gcg tac agg ccc gtt gat ccc tcg gcg 624Phe Arg Ser Thr Thr His
Asp Ala Tyr Arg Pro Val Asp Pro Ser Ala 195 200
205tac aag cgc gcc ttg ccg cag gaa gag caa gag gat gtg ggg
ccg cgc 672Tyr Lys Arg Ala Leu Pro Gln Glu Glu Gln Glu Asp Val Gly
Pro Arg 210 215 220cac gtt gat ccc gac
cac ttc cgc tcg ac 701His Val Asp Pro Asp
His Phe Arg Ser225 23046233PRTTrypanosoma cruzi 46Thr His
Asp Ala Tyr Arg Pro Val Asp Pro Ser Ala Tyr Lys Arg Ala1 5
10 15Leu Pro Gln Glu Glu Gln Glu Asp
Val Gly Pro Arg His Val Asp Pro 20 25
30Asp His Phe Arg Ser Thr Ser Thr Thr His Asp Ala Tyr Arg Pro
Val 35 40 45Asp Pro Ser Ala Tyr
Lys Arg Ala Leu Pro Gln Glu Glu Gln Glu Asp 50 55
60Val Gly Pro Arg His Val Asp Pro Asp His Phe Arg Ser Thr
Thr His65 70 75 80Asp
Ala Tyr Arg Pro Val Asp Pro Ser Ala Tyr Lys Arg Ala Leu Pro
85 90 95Gln Glu Glu Gln Glu Asp Val
Gly Pro Arg His Val Asp Pro Asp His 100 105
110Phe Arg Ser Thr Ser Thr Thr His Asp Ala Tyr Arg Pro Val
Asp Pro 115 120 125Ser Ala Tyr Lys
Arg Ala Leu Pro Gln Glu Glu Gln Glu Asp Val Gly 130
135 140Pro Arg His Val Asp Pro Asp His Phe Arg Ser Thr
Ser Thr Thr His145 150 155
160Asp Ala Tyr Arg Pro Val Asp Pro Ser Ala Tyr Lys Arg Ala Leu Pro
165 170 175Gln Glu Glu Gln Glu
Asp Val Gly Pro Arg His Val Asp Pro Asp His 180
185 190Phe Arg Ser Thr Thr His Asp Ala Tyr Arg Pro Val
Asp Pro Ser Ala 195 200 205Tyr Lys
Arg Ala Leu Pro Gln Glu Glu Gln Glu Asp Val Gly Pro Arg 210
215 220His Val Asp Pro Asp His Phe Arg Ser225
230471557DNATrypanosoma cruziCDS(1)..(1557) 47gat cca acg tat
cgt ttt gca aac cac gcg ttc acg ctg gtg gcg tcg 48Asp Pro Thr Tyr
Arg Phe Ala Asn His Ala Phe Thr Leu Val Ala Ser1 5
10 15gtg acg att cac gag gtt ccg agc gtc gcg
agt cct ttg ctg ggt gcg 96Val Thr Ile His Glu Val Pro Ser Val Ala
Ser Pro Leu Leu Gly Ala 20 25
30agc ctg gac tct tct ggt ggc aaa aaa ctc ctg ggg ctc tcg tac gac
144Ser Leu Asp Ser Ser Gly Gly Lys Lys Leu Leu Gly Leu Ser Tyr Asp
35 40 45gag aag cac cag tgg cag cca ata
tac gga tca acg ccg gtg acg ccg 192Glu Lys His Gln Trp Gln Pro Ile
Tyr Gly Ser Thr Pro Val Thr Pro 50 55
60acc gga tcg tgg gag atg ggt aag agg tac cac gtg gtt ctt acg atg
240Thr Gly Ser Trp Glu Met Gly Lys Arg Tyr His Val Val Leu Thr Met65
70 75 80gcg aat aaa att ggc
tcc gtg tac att gat gga gaa cct ctg gag ggt 288Ala Asn Lys Ile Gly
Ser Val Tyr Ile Asp Gly Glu Pro Leu Glu Gly 85
90 95tca ggg cag acc gtt gtg cca gac gag agg acg
cct gac atc tcc cac 336Ser Gly Gln Thr Val Val Pro Asp Glu Arg Thr
Pro Asp Ile Ser His 100 105
110ttc tac gtt ggc ggg tat gga agg agt gat atg cca acc ata agc cac
384Phe Tyr Val Gly Gly Tyr Gly Arg Ser Asp Met Pro Thr Ile Ser His
115 120 125gtg acg gtg aat aat gtt ctt
ctt tac aac cgt cag ctg aat gcc gag 432Val Thr Val Asn Asn Val Leu
Leu Tyr Asn Arg Gln Leu Asn Ala Glu 130 135
140gag atc agg acc ttg ttc ttg agc cag gac ctg att ggc acg gaa gca
480Glu Ile Arg Thr Leu Phe Leu Ser Gln Asp Leu Ile Gly Thr Glu Ala145
150 155 160cac atg ggc agc
agc agc ggc agc agt gcc cac ggt acg ccc tcg att 528His Met Gly Ser
Ser Ser Gly Ser Ser Ala His Gly Thr Pro Ser Ile 165
170 175ccc gtt gac agc agt gcc cac ggt aca ccc
tcg act ccc gtt gac agc 576Pro Val Asp Ser Ser Ala His Gly Thr Pro
Ser Thr Pro Val Asp Ser 180 185
190agt gcc cac ggt acg ccc tcg act ccc gtt gac agc agt gcc cac ggt
624Ser Ala His Gly Thr Pro Ser Thr Pro Val Asp Ser Ser Ala His Gly
195 200 205aca ccc tcg act ccc gtt gac
agc agt gcc cac ggt aca ccc tcg act 672Thr Pro Ser Thr Pro Val Asp
Ser Ser Ala His Gly Thr Pro Ser Thr 210 215
220 ccc gtt gac agc agt gcc cac ggt aag ccc tcg act ccc gct gac agc
720Pro Val Asp Ser Ser Ala His Gly Lys Pro Ser Thr Pro Ala Asp Ser225
230 235 240agt gcc cac
agt acg ccc tcg act ccc gct gac agc agt gcc cac agt 768Ser Ala His
Ser Thr Pro Ser Thr Pro Ala Asp Ser Ser Ala His Ser 245
250 255acg ccc tca att ccc gct gac agc agt
gcc cac agt acg ccc tca gct 816Thr Pro Ser Ile Pro Ala Asp Ser Ser
Ala His Ser Thr Pro Ser Ala 260 265
270ccc gct gac aac ggc gcc aat ggt acg gtt ttg att ttg tcg act cat
864Pro Ala Asp Asn Gly Ala Asn Gly Thr Val Leu Ile Leu Ser Thr His
275 280 285gac gcg tac agg ccc gtt gat
ccc tcg gcg tac aag cgc gcc ttg ccg 912Asp Ala Tyr Arg Pro Val Asp
Pro Ser Ala Tyr Lys Arg Ala Leu Pro 290 295
300cag gaa gag caa gag gat gtg ggg ccg cgc cac gtt gat ccc gac cac
960Gln Glu Glu Gln Glu Asp Val Gly Pro Arg His Val Asp Pro Asp His305
310 315 320ttc cgc tcg acc
tcg acg act cat gac gcg tac agg ccc gtt gat ccc 1008Phe Arg Ser Thr
Ser Thr Thr His Asp Ala Tyr Arg Pro Val Asp Pro 325
330 335tcg gcg tac aag cgc gcc ttg ccg cag gaa
gag caa gag gat gtg ggg 1056Ser Ala Tyr Lys Arg Ala Leu Pro Gln Glu
Glu Gln Glu Asp Val Gly 340 345
350ccg cgc cac gtt gat ccc gac cac ttc cgc tcg acg act cat gac gcg
1104Pro Arg His Val Asp Pro Asp His Phe Arg Ser Thr Thr His Asp Ala
355 360 365tac agg ccc gtt gat ccc tcg
gcg tac aag cgc gcc ttg ccg cag gaa 1152Tyr Arg Pro Val Asp Pro Ser
Ala Tyr Lys Arg Ala Leu Pro Gln Glu 370 375
380gag caa gag gat gtg ggg ccg cgc cac gtt gat ccc gac cac ttc cgc
1200Glu Gln Glu Asp Val Gly Pro Arg His Val Asp Pro Asp His Phe Arg385
390 395 400tcg acc tcg acg
act cat gac gcg tac agg ccc gtt gat ccc tcg gcg 1248Ser Thr Ser Thr
Thr His Asp Ala Tyr Arg Pro Val Asp Pro Ser Ala 405
410 415tac aag cgc gcc ttg ccg cag gaa gag caa
gag gat gtg ggg ccg cgc 1296Tyr Lys Arg Ala Leu Pro Gln Glu Glu Gln
Glu Asp Val Gly Pro Arg 420 425
430cac gtt gat ccc gac cac ttc cgc tcg acc tcg acg act cat gac gcg
1344His Val Asp Pro Asp His Phe Arg Ser Thr Ser Thr Thr His Asp Ala
435 440 445tac agg ccc gtt gat ccc tcg
gcg tac aag cgc gcc ttg ccg cag gaa 1392Tyr Arg Pro Val Asp Pro Ser
Ala Tyr Lys Arg Ala Leu Pro Gln Glu 450 455
460gag caa gag gat gtg ggg ccg cgc cac gtt gat ccc gac cac ttc cgc
1440Glu Gln Glu Asp Val Gly Pro Arg His Val Asp Pro Asp His Phe Arg465
470 475 480tcg acg act cat
gac gcg tac agg ccc gtt gat ccc tcg gcg tac aag 1488Ser Thr Thr His
Asp Ala Tyr Arg Pro Val Asp Pro Ser Ala Tyr Lys 485
490 495cgc gcc ttg ccg cag gaa gag caa gag gat
gtg ggg ccg cgc cac gtt 1536Arg Ala Leu Pro Gln Glu Glu Gln Glu Asp
Val Gly Pro Arg His Val 500 505
510gat ccc gac cac ttc cgc tcg
1557Asp Pro Asp His Phe Arg Ser 51548519PRTTrypanosoma cruzi 48Asp
Pro Thr Tyr Arg Phe Ala Asn His Ala Phe Thr Leu Val Ala Ser1
5 10 15Val Thr Ile His Glu Val Pro
Ser Val Ala Ser Pro Leu Leu Gly Ala 20 25
30Ser Leu Asp Ser Ser Gly Gly Lys Lys Leu Leu Gly Leu Ser
Tyr Asp 35 40 45 Glu Lys His Gln
Trp Gln Pro Ile Tyr Gly Ser Thr Pro Val Thr Pro 50 55
60Thr Gly Ser Trp Glu Met Gly Lys Arg Tyr His Val Val
Leu Thr Met65 70 75
80Ala Asn Lys Ile Gly Ser Val Tyr Ile Asp Gly Glu Pro Leu Glu Gly
85 90 95Ser Gly Gln Thr Val Val
Pro Asp Glu Arg Thr Pro Asp Ile Ser His 100
105 110Phe Tyr Val Gly Gly Tyr Gly Arg Ser Asp Met Pro
Thr Ile Ser His 115 120 125Val Thr
Val Asn Asn Val Leu Leu Tyr Asn Arg Gln Leu Asn Ala Glu 130
135 140Glu Ile Arg Thr Leu Phe Leu Ser Gln Asp Leu
Ile Gly Thr Glu Ala145 150 155
160His Met Gly Ser Ser Ser Gly Ser Ser Ala His Gly Thr Pro Ser Ile
165 170 175Pro Val Asp Ser
Ser Ala His Gly Thr Pro Ser Thr Pro Val Asp Ser 180
185 190Ser Ala His Gly Thr Pro Ser Thr Pro Val Asp
Ser Ser Ala His Gly 195 200 205Thr
Pro Ser Thr Pro Val Asp Ser Ser Ala His Gly Thr Pro Ser Thr 210
215 220Pro Val Asp Ser Ser Ala His Gly Lys Pro
Ser Thr Pro Ala Asp Ser225 230 235
240Ser Ala His Ser Thr Pro Ser Thr Pro Ala Asp Ser Ser Ala His
Ser 245 250 255Thr Pro Ser
Ile Pro Ala Asp Ser Ser Ala His Ser Thr Pro Ser Ala 260
265 270Pro Ala Asp Asn Gly Ala Asn Gly Thr Val
Leu Ile Leu Ser Thr His 275 280
285Asp Ala Tyr Arg Pro Val Asp Pro Ser Ala Tyr Lys Arg Ala Leu Pro 290
295 300Gln Glu Glu Gln Glu Asp Val Gly
Pro Arg His Val Asp Pro Asp His305 310
315 320Phe Arg Ser Thr Ser Thr Thr His Asp Ala Tyr Arg
Pro Val Asp Pro 325 330
335Ser Ala Tyr Lys Arg Ala Leu Pro Gln Glu Glu Gln Glu Asp Val Gly
340 345 350Pro Arg His Val Asp Pro
Asp His Phe Arg Ser Thr Thr His Asp Ala 355 360
365Tyr Arg Pro Val Asp Pro Ser Ala Tyr Lys Arg Ala Leu Pro
Gln Glu 370 375 380Glu Gln Glu Asp Val
Gly Pro Arg His Val Asp Pro Asp His Phe Arg385 390
395 400Ser Thr Ser Thr Thr His Asp Ala Tyr Arg
Pro Val Asp Pro Ser Ala 405 410
415Tyr Lys Arg Ala Leu Pro Gln Glu Glu Gln Glu Asp Val Gly Pro Arg
420 425 430His Val Asp Pro Asp
His Phe Arg Ser Thr Ser Thr Thr His Asp Ala 435
440 445Tyr Arg Pro Val Asp Pro Ser Ala Tyr Lys Arg Ala
Leu Pro Gln Glu 450 455 460Glu Gln Glu
Asp Val Gly Pro Arg His Val Asp Pro Asp His Phe Arg465
470 475 480Ser Thr Thr His Asp Ala Tyr
Arg Pro Val Asp Pro Ser Ala Tyr Lys 485
490 495Arg Ala Leu Pro Gln Glu Glu Gln Glu Asp Val Gly
Pro Arg His Val 500 505 510Asp
Pro Asp His Phe Arg Ser 515491521DNATrypanosoma
cruziCDS(1)..(1521) 49atg gcc cga gct gtg gtg ctt gag gat gga gcg ctt tac
gtg gcg gac 48Met Ala Arg Ala Val Val Leu Glu Asp Gly Ala Leu Tyr
Val Ala Asp1 5 10 15aat
gcc aac aac ctc gtt cga gaa atc tcc aat ggc gtt gtc act tcg 96Asn
Ala Asn Asn Leu Val Arg Glu Ile Ser Asn Gly Val Val Thr Ser 20
25 30ttt att acg gaa gga ctg ctg ggc
cca tcg tac atc aaa ccg tac agc 144Phe Ile Thr Glu Gly Leu Leu Gly
Pro Ser Tyr Ile Lys Pro Tyr Ser 35 40
45cgt aca aat ggc gct cat gac ttg ttt gtg tcg gac acg ggc aaa tca
192Arg Thr Asn Gly Ala His Asp Leu Phe Val Ser Asp Thr Gly Lys Ser
50 55 60cgc atc att ttt gcc cca cct cag
aaa aaa acg ttc atc aca gtg ttt 240Arg Ile Ile Phe Ala Pro Pro Gln
Lys Lys Thr Phe Ile Thr Val Phe65 70 75
80ata aca gga ttc cag ccg gat gtt ctt caa att agc gag
aag agt cgt 288Ile Thr Gly Phe Gln Pro Asp Val Leu Gln Ile Ser Glu
Lys Ser Arg 85 90 95ttg
atg ttt gcc atc tgc aat tcc acg aaa att ctt gcg att aat atg 336Leu
Met Phe Ala Ile Cys Asn Ser Thr Lys Ile Leu Ala Ile Asn Met
100 105 110cag gga gcc aca acc ccg aag
gag tac tgg caa gtt gga aat gcg gac 384Gln Gly Ala Thr Thr Pro Lys
Glu Tyr Trp Gln Val Gly Asn Ala Asp 115 120
125tgc atg ggc tat cag agt tcc ctc atg ctc acg acc gag gag gat
aaa 432Cys Met Gly Tyr Gln Ser Ser Leu Met Leu Thr Thr Glu Glu Asp
Lys 130 135 140ctc ctc tac tac ggc ata
tta aat gga acc cca tcc atc atg tct tta 480Leu Leu Tyr Tyr Gly Ile
Leu Asn Gly Thr Pro Ser Ile Met Ser Leu145 150
155 160ccc gcc acc aaa acg aag acg gaa gca ccc aga
att tgc ccg gat gtg 528Pro Ala Thr Lys Thr Lys Thr Glu Ala Pro Arg
Ile Cys Pro Asp Val 165 170
175ttg ttg cag tgg cca cat ggg ccc att gtt tcg ctt gtg aat att aac
576Leu Leu Gln Trp Pro His Gly Pro Ile Val Ser Leu Val Asn Ile Asn
180 185 190aaa cat gca ttt tac gtt
gtt acc gcc tcc aat gta tac att gta cat 624Lys His Ala Phe Tyr Val
Val Thr Ala Ser Asn Val Tyr Ile Val His 195 200
205gat ggc tcg tat cat ccg act gga tcc atg gcc cag ctc caa
cag gca 672Asp Gly Ser Tyr His Pro Thr Gly Ser Met Ala Gln Leu Gln
Gln Ala 210 215 220gaa aat aat atc act
aat tcc aaa aaa gaa atg aca aag cta cga gaa 720Glu Asn Asn Ile Thr
Asn Ser Lys Lys Glu Met Thr Lys Leu Arg Glu225 230
235 240aaa gtg aaa aag gcc gag aaa gaa aaa ttg
gac gcc att aac cgg gca 768Lys Val Lys Lys Ala Glu Lys Glu Lys Leu
Asp Ala Ile Asn Arg Ala 245 250
255acc aag ctg gaa gag gaa cga aac caa gcg tac aaa gca gca cac aag
816Thr Lys Leu Glu Glu Glu Arg Asn Gln Ala Tyr Lys Ala Ala His Lys
260 265 270gca gag gag gaa aag gct
aaa aca ttt caa cgc ctt ata aca ttt gag 864Ala Glu Glu Glu Lys Ala
Lys Thr Phe Gln Arg Leu Ile Thr Phe Glu 275 280
285tcg gaa aat att aac tta aag aaa agg cca aat gac gca gtt
tca aat 912Ser Glu Asn Ile Asn Leu Lys Lys Arg Pro Asn Asp Ala Val
Ser Asn 290 295 300cgg gat aag aaa aaa
aat tct gaa acc gca aaa act gac gaa gta gag 960Arg Asp Lys Lys Lys
Asn Ser Glu Thr Ala Lys Thr Asp Glu Val Glu305 310
315 320aaa cag agg gcg gct gag gct gcc aag gcc
gtg gag acg gag aag cag 1008Lys Gln Arg Ala Ala Glu Ala Ala Lys Ala
Val Glu Thr Glu Lys Gln 325 330
335agg gca gct gag gcc acg aag gtt gcc gaa gcg gag aag cgg aag gca
1056Arg Ala Ala Glu Ala Thr Lys Val Ala Glu Ala Glu Lys Arg Lys Ala
340 345 350gct gag gcc gcc aag gcc
gtg gag acg gag aag cag agg gca gct gaa 1104Ala Glu Ala Ala Lys Ala
Val Glu Thr Glu Lys Gln Arg Ala Ala Glu 355 360
365gcc acg aag gtt gcc gaa gcg gag aag cag aag gca gct gag
gcc gcc 1152Ala Thr Lys Val Ala Glu Ala Glu Lys Gln Lys Ala Ala Glu
Ala Ala 370 375 380aag gcc gtg gag acg
gag aag cag agg gca gct gaa gcc acg aag gtt 1200Lys Ala Val Glu Thr
Glu Lys Gln Arg Ala Ala Glu Ala Thr Lys Val385 390
395 400gcc gaa gcg gag aag cag agg gca gct gaa
gcc atg aag gtt gcc gaa 1248Ala Glu Ala Glu Lys Gln Arg Ala Ala Glu
Ala Met Lys Val Ala Glu 405 410
415gcg gag aag cag aag gca gct gag gcc gcc aag gcc gtg gag acg gag
1296Ala Glu Lys Gln Lys Ala Ala Glu Ala Ala Lys Ala Val Glu Thr Glu
420 425 430aag cag agg gca gct gaa
gcc acg aag gtt gcc gaa gcg gag aag cag 1344Lys Gln Arg Ala Ala Glu
Ala Thr Lys Val Ala Glu Ala Glu Lys Gln 435 440
445aag gca gct gag gcc gcc aag gcc gtg gag acg gag aag cag
agg gca 1392Lys Ala Ala Glu Ala Ala Lys Ala Val Glu Thr Glu Lys Gln
Arg Ala 450 455 460gct gaa gcc acg aag
gtt gcc gaa gcg gag aag cag aag gca gct gag 1440Ala Glu Ala Thr Lys
Val Ala Glu Ala Glu Lys Gln Lys Ala Ala Glu465 470
475 480gcc gcc aag gcc gtg gag acg gag aag cag
agg gca gct gaa gcc acg 1488Ala Ala Lys Ala Val Glu Thr Glu Lys Gln
Arg Ala Ala Glu Ala Thr 485 490
495aag gtt gcc gaa gcg gag aag gat atc gat ccc
1521Lys Val Ala Glu Ala Glu Lys Asp Ile Asp Pro 500
50550507PRTTrypanosoma cruzi 50Met Ala Arg Ala Val Val Leu Glu Asp
Gly Ala Leu Tyr Val Ala Asp1 5 10
15Asn Ala Asn Asn Leu Val Arg Glu Ile Ser Asn Gly Val Val Thr
Ser 20 25 30Phe Ile Thr Glu
Gly Leu Leu Gly Pro Ser Tyr Ile Lys Pro Tyr Ser 35
40 45Arg Thr Asn Gly Ala His Asp Leu Phe Val Ser Asp
Thr Gly Lys Ser 50 55 60Arg Ile Ile
Phe Ala Pro Pro Gln Lys Lys Thr Phe Ile Thr Val Phe65 70
75 80Ile Thr Gly Phe Gln Pro Asp Val
Leu Gln Ile Ser Glu Lys Ser Arg 85 90
95Leu Met Phe Ala Ile Cys Asn Ser Thr Lys Ile Leu Ala Ile
Asn Met 100 105 110Gln Gly Ala
Thr Thr Pro Lys Glu Tyr Trp Gln Val Gly Asn Ala Asp 115
120 125Cys Met Gly Tyr Gln Ser Ser Leu Met Leu Thr
Thr Glu Glu Asp Lys 130 135 140Leu Leu
Tyr Tyr Gly Ile Leu Asn Gly Thr Pro Ser Ile Met Ser Leu145
150 155 160Pro Ala Thr Lys Thr Lys Thr
Glu Ala Pro Arg Ile Cys Pro Asp Val 165
170 175Leu Leu Gln Trp Pro His Gly Pro Ile Val Ser Leu
Val Asn Ile Asn 180 185 190Lys
His Ala Phe Tyr Val Val Thr Ala Ser Asn Val Tyr Ile Val His 195
200 205Asp Gly Ser Tyr His Pro Thr Gly Ser
Met Ala Gln Leu Gln Gln Ala 210 215
220Glu Asn Asn Ile Thr Asn Ser Lys Lys Glu Met Thr Lys Leu Arg Glu225
230 235 240Lys Val Lys Lys
Ala Glu Lys Glu Lys Leu Asp Ala Ile Asn Arg Ala 245
250 255Thr Lys Leu Glu Glu Glu Arg Asn Gln Ala
Tyr Lys Ala Ala His Lys 260 265
270Ala Glu Glu Glu Lys Ala Lys Thr Phe Gln Arg Leu Ile Thr Phe Glu
275 280 285Ser Glu Asn Ile Asn Leu Lys
Lys Arg Pro Asn Asp Ala Val Ser Asn 290 295
300Arg Asp Lys Lys Lys Asn Ser Glu Thr Ala Lys Thr Asp Glu Val
Glu305 310 315 320Lys Gln
Arg Ala Ala Glu Ala Ala Lys Ala Val Glu Thr Glu Lys Gln
325 330 335Arg Ala Ala Glu Ala Thr Lys
Val Ala Glu Ala Glu Lys Arg Lys Ala 340 345
350Ala Glu Ala Ala Lys Ala Val Glu Thr Glu Lys Gln Arg Ala
Ala Glu 355 360 365Ala Thr Lys Val
Ala Glu Ala Glu Lys Gln Lys Ala Ala Glu Ala Ala 370
375 380Lys Ala Val Glu Thr Glu Lys Gln Arg Ala Ala Glu
Ala Thr Lys Val385 390 395
400Ala Glu Ala Glu Lys Gln Arg Ala Ala Glu Ala Met Lys Val Ala Glu
405 410 415Ala Glu Lys Gln Lys
Ala Ala Glu Ala Ala Lys Ala Val Glu Thr Glu 420
425 430Lys Gln Arg Ala Ala Glu Ala Thr Lys Val Ala Glu
Ala Glu Lys Gln 435 440 445Lys Ala
Ala Glu Ala Ala Lys Ala Val Glu Thr Glu Lys Gln Arg Ala 450
455 460Ala Glu Ala Thr Lys Val Ala Glu Ala Glu Lys
Gln Lys Ala Ala Glu465 470 475
480Ala Ala Lys Ala Val Glu Thr Glu Lys Gln Arg Ala Ala Glu Ala Thr
485 490 495Lys Val Ala Glu
Ala Glu Lys Asp Ile Asp Pro 500
5055134PRTTrypanosoma cruzi 51Ser Thr Asp Lys Leu Lys Leu Asn Gln Gln Asn
Lys Pro His Ile Ala1 5 10
15Asn Asn Lys Gln Lys Thr Thr Leu Glu Lys Thr Gln Thr Glu Gln Lys
20 25 30Thr Ala5212PRTTrypanosoma
cruzi 52Pro Phe Gly Gln Ala Ala Ala Gly Asp Lys Pro Ser1 5
105321PRTTrypanosoma cruzi 53Gly Thr Ala Phe Asp Ala Ser
Arg Ser Thr Val Phe Ala Asn Ala Pro1 5 10
15Gly Val Ala Gln Val 205468PRTTrypanosoma
cruzi 54Met Glu Gln Glu Arg Arg Gln Leu Leu Glu Lys Asp Pro Arg Arg Asn1
5 10 15Ala Lys Glu Ile
Ala Ala Leu Glu Glu Ser Met Asn Ala Arg Ala Gln 20
25 30Glu Leu Ala Arg Glu Lys Lys Leu Ala Asp Arg
Ala Phe Leu Asp Gln 35 40 45Lys
Pro Glu Gly Val Pro Leu Arg Glu Leu Pro Leu Asp Asp Asp Ser 50
55 60Asp Phe Val Ala655585PRTTrypanosoma cruzi
55Met Ala Gln Leu Gln Gln Ala Glu Asn Asn Ile Thr Asn Ser Lys Lys1
5 10 15Glu Met Thr Lys Leu Arg
Glu Lys Val Lys Lys Ala Glu Lys Glu Lys 20 25
30Leu Asp Ala Ile Asn Arg Ala Thr Lys Leu Glu Glu Glu
Arg Asn Gln 35 40 45Ala Tyr Lys
Ala Ala His Lys Ala Glu Glu Glu Lys Ala Lys Thr Phe 50
55 60Gln Arg Leu Ile Thr Phe Glu Ser Glu Asn Ile Asn
Leu Lys Lys Arg65 70 75
80Pro Asn Asp Ala Val 855614PRTTrypanosoma cruzi 56Gln
Arg Ala Ala Glu Ala Ala Lys Ala Val Glu Thr Glu Lys1 5
1057214PRTTrypanosoma cruzi 57Asp Ile Asp Pro Met Gly Ala
Cys Gly Ser Lys Asp Ser Thr Ser Asp1 5 10
15Lys Gly Leu Ala Ser Asp Lys Asp Gly Lys Asn Ala Lys
Asp Arg Lys 20 25 30Glu Ala
Trp Glu Arg Ile Arg Gln Ala Ile Pro Arg Glu Lys Thr Ala 35
40 45Glu Ala Lys Gln Arg Arg Ile Glu Leu Phe
Lys Lys Phe Asp Lys Asn 50 55 60Glu
Thr Gly Lys Leu Cys Tyr Asp Glu Val His Ser Gly Cys Leu Glu65
70 75 80Val Leu Lys Leu Asp Glu
Phe Thr Pro Arg Val Arg Asp Ile Thr Lys 85
90 95Arg Ala Phe Asp Lys Ala Arg Ala Leu Gly Ser Lys
Leu Glu Asn Lys 100 105 110Gly
Ser Glu Asp Phe Val Glu Phe Leu Glu Phe Arg Leu Met Leu Cys 115
120 125Tyr Ile Tyr Asp Phe Phe Glu Leu Thr
Val Met Phe Asp Glu Ile Asp 130 135
140Ala Ser Gly Asn Met Leu Val Asp Glu Glu Glu Phe Lys Arg Ala Val145
150 155 160Pro Lys Leu Glu
Ala Trp Gly Ala Lys Val Glu Asp Pro Ala Ala Leu 165
170 175Phe Lys Glu Leu Asp Lys Asn Gly Thr Gly
Ser Val Thr Phe Asp Glu 180 185
190Phe Ala Ala Trp Ala Ser Ala Val Lys Leu Asp Ala Asp Gly Asp Pro
195 200 205Asp Asn Val Pro Asp Ile
210
User Contributions:
Comment about this patent or add new information about this topic:
People who visited this patent also read: | |
Patent application number | Title |
---|---|
20100195952 | MULTI-LAYER STRUCTURE |
20100195951 | MULTI-LAYER STRUCTURE AND METHOD FOR MANUFACTURING THE SAME |
20100195949 | ROLLING BEARING |
20100195948 | Separator for bearing assemblies with cyclic loads |
20100195947 | BEARING DEVICE FOR A WHEEL |