Patent application title: COMPARATIVE LIGAND MAPPING FROM MHC CLASS I POSITIVE CELLS
Inventors:
William H. Hildebrand (Edmond, OK, US)
William H. Hildebrand (Edmond, OK, US)
Heather D. Hickman (Oklahoma City, OK, US)
IPC8 Class: AC40B2008FI
USPC Class:
506 6
Class name: Combinatorial chemistry technology: method, library, apparatus method specially adapted for identifying a library member direct analysis of a library member, per se, by a physical method (e.g., spectroscopy, etc.)
Publication date: 2012-11-29
Patent application number: 20120302452
Abstract:
The present invention relates generally to a methodology for the
isolation, purification and identification of peptide ligands presented
by MHC positive cells. In particular, the methodology of the present
invention relates to the isolation, purification and identification of
these peptide ligands from soluble class I and class II MHC molecules
which may be uninfected, infected, or tumorgenic. The methodology of the
present invention broadly allows for these peptide ligands and their
comcomittant source proteins thereof to be identified and used as markers
for infected versus uninfected cells and/or tumorgenic versus
nontumorgenic cells with said identification being useful for marking or
targeting a cell for therapeutic treatment or priming the immune response
against infected cells.Claims:
1. A method for identifying at least one individual, endogenously loaded
peptide ligand for an individual class I MHC molecule that distinguishes
a transfected cell from a non-transfected cell, comprising the steps of:
providing a non-transfected cell line containing a construct that encodes
an individual soluble class I MHC molecule, the non-transfected cell line
being able to naturally process proteins into peptide ligands capable of
being loaded into antigen binding grooves of class I MHC molecules;
transfecting a portion of the non-transfected cell line with at least one
of a gene from a microorganism and a tumor gene, thereby providing a
transfected cell line; culturing the non-transfected cell line and the
transfected cell line under conditions which allow for expression of the
individual soluble class I MHC molecules from the construct, such
conditions also allowing for endogenous loading of a peptide ligand in
the antigen binding groove of each individual soluble class I MHC
molecule prior to secretion of the individual soluble class I MHC
molecules from the cell; isolating the secreted individual soluble class
I MHC molecules having the endogenously loaded peptide ligands bound
thereto from the non-transfected cell line and the transfected cell line;
separating the endogenously loaded peptide ligands from the individual
soluble class I MHC molecules from the non-transfected cell line and
separating the endogenously loaded peptide ligands from the individual
soluble class I MHC molecules from the transfected cell line; isolating
the endogenously loaded peptide ligands from the non-transfected cell
line and the endogenously loaded peptide ligands from the transfected
cell line; comparing the endogenously loaded peptide ligands isolated
from the transfected cell line to the endogenously loaded peptide ligands
isolated from the non-transfected cell line; and identifying at least one
individual, endogenously loaded peptide ligand presented by the
individual soluble class I MHC molecule on the transfected cell line that
is not presented by the individual soluble class I MHC molecule on the
non-transfected cell line.
2. The method of claim 1, further comprising the step of identifying a source protein from which the at least one individual, endogenously loaded peptide ligand presented by the individual soluble class I MHC molecule on the transfected cell line and not presented by the individual soluble class I MHC molecule on the non-transfected cell line is obtained.
3. The method of claim 1 wherein, in the step of identifying at least one individual, endogenously loaded peptide ligand presented by the individual soluble class I MHC molecule on the transfected cell line but not on the non-transfected cell line, the at least one individual, endogenously loaded peptide ligand is obtained from a protein encoded by at least one of the gene from a microorganism and a tumor gene with which the cell line was transfected to form the transfected cell line.
4. The method of claim 1 wherein, in the step of identifying at least one individual, endogenously loaded peptide ligand presented by the individual soluble class I MHC molecule on the transfected cell line but not on the non-transfected cell line, the at least one individual, endogenously loaded peptide ligand is obtained from a protein encoded by the non-transfected cell line.
5. The method of claim 4, wherein the protein encoded by the non-transfected cell line from which the at least one individual, endogenously loaded peptide ligand is obtained has increased expression in a tumor cell line.
6. The method of claim 1 wherein, in the step of providing a non-transfected cell line containing a construct that encodes an individual soluble class I MHC molecule, the construct further encodes a tag which is attached to the individual soluble class I MHC molecule and aids in isolating the individual soluble class I MHC molecule.
7. The method of claim 1, wherein the non-transfected cell line is class I MHC negative.
8. The method of claim 1, wherein the non-transfected cell line expresses endogenous class I MHC molecules.
9. The method of claim 1 wherein, in the step of providing a non-transfected cell line containing a construct that encodes an individual soluble class I MHC molecule, the non-transfected cell line containing the construct that encodes the individual soluble class I MHC molecule is produced by a method comprising the steps of: obtaining genomic DNA or cDNA encoding at least one class I MHC molecule; identifying an allele encoding an individual class I MHC molecule in the genomic DNA or cDNA; PCR amplifying the allele encoding the individual class I MHC molecule in a locus specific manner such that a PCR product produced therefrom encodes a truncated, soluble form of the individual class I MHC molecule; cloning the PCR product into an expression vector, thereby forming a construct that encodes the individual soluble class I MHC molecule; and transfecting the construct into a non-transfected cell line.
10. The method of claim 9 wherein, in the step of providing a non-transfected cell line containing a construct that encodes an individual soluble class I MHC molecule, the construct further encodes a tag which is attached to the individual soluble class I MHC molecule and aids in isolating the individual soluble class I MHC molecule.
11. The method of claim 10, wherein the tag is selected from the group consisting of a HIS tail and a FLAG tail.
12. The method of claim 10, wherein the tag is encoded by a PCR primer utilized in the step of PCR amplifying the allele encoding the individual class I MHC molecule.
13. The method of claim 11 wherein the tag is encoded by the expression vector into which the PCR product is cloned.
14. The method of claim 1 wherein, in the step of transfecting a portion of the non-transfected cell line, the portion of the non-transfected cell line is transfected with a gene from HIV.
15. The method of claim 1, wherein the transfected cell is further defined as a tumorigenic cell, and the non-transfected cell is further defined as a non-tumorigenic cell, whereby the step of transfecting a portion of the non-tumorigenic cell line with at least one of a gene from a microorganism and a tumor gene, thereby provides a transformed, tumorigenic cell line.
16. The method of claim 1, further comprising the steps of: determining the source protein from which the at least one individual, endogenously loaded peptide ligand is obtained; and identifying the source protein as a self protein if the source protein is not encoded by the gene from a microorganism or tumor gene with which the transfected cell line is transfected but is encoded by the non-transfected cell line.
17. The method of claim 16, wherein the transfected cell is further defined as a tumorigenic cell, and the non-transfected cell is further defined as a non-tumorigenic cell, whereby the step of transfecting a portion of the non-tumorigenic cell line with at least one of a gene from a microorganism and a tumor gene, thereby provides a transformed, tumorigenic cell line.
18. A method for identifying at least one individual, endogenously loaded peptide ligand for an individual class I MHC molecule that distinguishes a transfected cell from a non-transfected cell, comprising the steps of: providing a non-transfected cell line containing a construct that encodes an individual soluble class I MHC molecule, the non-transfected cell line being able to naturally process proteins into peptide ligands capable of being loaded into antigen binding grooves of class I MHC molecules; transfecting a portion of the non-transfected cell line with at least one of a gene from a microorganism and a tumor gene, thereby providing a transfected cell line; culturing the non-transfected cell line and the transfected cell line under conditions which allow for expression of the individual soluble class I MHC molecules from the construct, such conditions also allowing for endogenous loading of a peptide ligand in the antigen binding groove of each individual soluble class I MHC molecule prior to secretion of the individual soluble class I MHC molecules from the cell; isolating the secreted individual soluble class I MHC molecules having the endogenously loaded peptide ligands bound thereto from the non-transfected cell line and the transfected cell line; separating the endogenously loaded peptide ligands from the individual soluble class I MHC molecules from the non-transfected cell line and separating the endogenously loaded peptide ligands from the individual soluble class I MHC molecules from the transfected cell line; isolating the endogenously loaded peptide ligands from the non-transfected cell line and the endogenously loaded peptide ligands from the transfected cell line; comparing the endogenously loaded peptide ligands isolated from the non-transfected cell line to the endogenously loaded peptide ligands isolated from the transfected cell line; and identifying at least one individual, endogenously loaded peptide ligand presented by the individual soluble class I MHC molecule on the non-transfected cell line that is not presented by the individual soluble class I MHC molecule on the transfected cell line.
19. The method of claim 18 wherein, in the step of providing a non-transfected cell line containing a construct that encodes an individual soluble class I MHC molecule, the construct further encodes a tag which is attached to the individual soluble class I MHC molecule and aids in isolating the individual soluble class I MHC molecule.
20. The method of claim 18, wherein the non-transfected cell line is class I MHC negative.
21. The method of claim 18, wherein the non-transfected cell line expresses endogenous class I MHC molecules.
22. The method of claim 18 wherein, in the step of providing a non-transfected cell line containing a construct that encodes an individual soluble class I MHC molecule, the non-transfected cell line containing the construct that encodes the individual soluble class I MHC molecule is produced by a method comprising the steps of: obtaining genomic DNA or cDNA encoding at least one class I MHC molecule; identifying an allele encoding an individual class I MHC molecule in the genomic DNA or cDNA; PCR amplifying the allele encoding the individual class I MHC molecule in a locus specific manner such that a PCR product produced therefrom encodes a truncated, soluble form of the individual class I MHC molecule; cloning the PCR product into an expression vector, thereby forming a construct that encodes the individual soluble class I MHC molecule; and transfecting the construct into a non-transfected cell line.
23. The method of claim 22 wherein, in the step of providing a non-transfected cell line containing a construct that encodes an individual soluble class I MHC molecule, the construct further encodes a tag which is attached to the individual soluble class I MHC molecule and aids in isolating the individual soluble class I MHC molecule.
24. The method of claim 23, wherein the tag is selected from the group consisting of a HIS tail and a FLAG tail.
25. The method of claim 23, wherein the tag is encoded by a PCR primer utilized in the step of PCR amplifying the allele encoding the individual class I MHC molecule.
26. The method of claim 24 wherein the tag is encoded by the expression vector into which the PCR product is cloned.
27. The method of claim 18, further comprising the step of identifying a source protein from which the at least one individual, endogenously loaded peptide ligand presented by the individual soluble class I MHC molecule on the non-transfected cell line and not presented by the individual soluble class I MHC molecule on the transfected cell line is obtained.
28. The method of claim 18 wherein, in the step of transfecting a portion of the non-transfected cell line, the portion of the non-transfected cell line is transfected with a gene from HIV.
29. The method of claim 18, wherein the transfected cell is further defined as a tumorigenic cell, and the non-transfected cell is further defined as a non-tumorigenic cell, whereby the step of transfecting a portion of the non-tumorigenic cell line with at least one of a gene from a microorganism and a tumor gene, thereby provides a transformed, tumorigenic cell line.
30. The method of claim 18, further comprising the step of: determining the source protein from which the at least one endogenously loaded peptide ligand is obtained.
31. The method of claim 30, wherein the transfected cell is further defined as a tumorigenic cell, and the non-transfected cell is further defined as a non-tumorigenic cell, whereby the step of transfecting a portion of the non-tumorigenic cell line with at least one of a gene from a microorganism and a tumor gene, thereby provides a transformed, tumorigenic cell line.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a divisional of Ser. No. 12/951,588, filed Nov. 22, 2010; which is a continuation of U.S. Ser. No. 11/601,058, filed Nov. 17, 2006, now abandoned; which is a divisional of U.S. Ser. No. 09/974,366, filed Oct. 10, 2001, now U.S. Pat. No. 7,541,429, issued Jun. 2, 2009; which claims benefit under 35 U.S.C. 119(e) of provisional patent applications U.S. Ser. No. 60/240,143, filed Oct. 10, 2000; U.S. Ser. No. 60/299,452, filed Jun. 20, 2001; U.S. Ser. No. 60/256,410, filed Dec. 18, 2000; U.S. Ser. No. 60/256,409, filed Dec. 18, 2000; and U.S. Ser. No. 60/327,907, filed Oct. 9, 2001. The contents of each of the above-referenced patent applications are hereby expressly incorporated herein by reference in their entirety.
BACKGROUND OF THE INVENTION
[0003] 1. Field of the Invention
[0004] The present invention relates generally to a methodology for the isolation, purification and identification of peptide ligands presented by MHC positive cells. In particular, the methodology of the present invention relates to the isolation, purification and identification of these peptide ligands from soluble class I and class II MHC molecules which may be uninfected, infected, or tumorigenic. The methodology of the present invention broadly allows for these peptide ligands and their concomitant source proteins thereof to be identified and used as markers for infected versus uninfected cells and/or tumorigenic versus nontumorigenic cells with said identification being useful for marking or targeting a cell for therapeutic treatment or priming the immune response against infected cells.
[0005] 2. Description of the Background Art
[0006] Class I major histocompatibility complex (MHC) molecules, designated HLA class I in humans, bind and display peptide antigen ligands upon the cell surface. The peptide antigen ligands presented by the class I MHC molecule are derived from either normal endogenous proteins ("self") or foreign proteins ("nonself") introduced into the cell. Nonself proteins may be products of malignant transformation or intracellular pathogens such as viruses. In this manner, class I MHC molecules convey information regarding the internal fitness of a cell to immune effector cells including but not limited to, CD8.sup.+ cytotoxic T lymphocytes (CTLs), which are activated upon interaction with "nonself" peptides, thereby lysing or killing the cell presenting such "nonself" peptides.
[0007] Class II MHC molecules, designated HLA class II in humans, also bind and display peptide antigen ligands upon the cell surface. Unlike class I MHC molecules which are expressed on virtually all nucleated cells, class II MHC molecules are normally confined to specialized cells, such as B lymphocytes, macrophages, dendritic cells, and other antigen presenting cells which take up foreign antigens from the extracellular fluid via an endocytic pathway. The peptides they bind and present are derived from extracellular foreign antigens, such as products of bacteria that multiply outside of cells, wherein such products include protein toxins secreted by the bacteria that often times have deleterious and even lethal effects on the host (e.g., human). In this manner, class II molecules convey information regarding the fitness of the extracellular space in the vicinity of the cell displaying the class II molecule to immune effector cells, including but not limited to, CD4.sup.+ helper T cells, thereby helping to eliminate such pathogens the examination of such pathogens is accomplished by both helping B cells make antibodies against microbes, as well as toxins produced by such microbes, and by activating macrophages to destroy ingested microbes.
[0008] Class I and class II HLA molecules exhibit extensive polymorphism generated by systematic recombinatorial and point mutation events; as such, hundreds of different HLA types exist throughout the world's population, resulting in a large immunological diversity. Such extensive HLA diversity throughout the population results in tissue or organ transplant rejection between individuals as well as differing susceptibilities and/or resistances to infectious diseases. HLA molecules also contribute significantly to autoimmunity and cancer. Because HLA molecules mediate most, if not all, adaptive immune responses, large quantities of pure isolated HLA proteins are required in order to effectively study transplantation, autoimmunity disorders, and for vaccine development.
[0009] There are several applications in which purified, individual class I and class II MHC proteins are highly useful. Such applications include using MHC-peptide multimers as immunodiagnostic reagents for disease resistance/autoimmunity; assessing the binding of potentially therapeutic peptides; elution of peptides from MHC molecules to identify vaccine candidates; screening transplant patients for preformed MHC specific antibodies; and removal of anti-HLA antibodies from a patient. Since every individual has differing MHC molecules, the testing of numerous individual MHC molecules is a prerequisite for understanding the differences in disease susceptibility between individuals. Therefore, purified MHC molecules representative of the hundreds of different HLA types existing throughout the world's population are highly desirable for unraveling disease susceptibilities and resistances, as well as for designing therapeutics such as vaccines.
[0010] Class I HLA molecules alert the immune response to disorders within host cells. Peptides, which are derived from viral- and tumor-specific proteins within the cell, are loaded into the class I molecule's antigen binding groove in the endoplasmic reticulum of the cell and subsequently carried to the cell surface. Once the class I HLA molecule and its loaded peptide ligand are on the cell surface, the class I molecule and its peptide ligand are accessible to cytotoxic T lymphocytes (CTL). CTL survey the peptides presented by the class I molecule and destroy those cells harboring ligands derived from infectious or neoplastic agents within that cell.
[0011] While specific CTL targets have been identified, little is known about the breadth and nature of ligands presented on the surface of a diseased cell. From a basic science perspective, many outstanding questions have permeated through the art regarding peptide exhibition. For instance, it has been demonstrated that a virus can preferentially block expression of HLA class I molecules from a given locus while leaving expression at other loci intact. Similarly, there are numerous reports of cancerous cells that fail to express class I HLA at particular loci. However, there are no data describing how (or if) the three classical HLA class I loci differ in the immunoregulatory ligands they bind. It is therefore unclear how class I molecules from the different loci vary in their interaction with viral- and tumor-derived ligands and the number of peptides each will present.
[0012] Discerning virus- and tumor-specific ligands for CTL recognition is an important component of vaccine design. Ligands unique to tumorigenic or infected cells can be tested and incorporated into vaccines designed to evoke a protective CTL response. Several methodologies are currently employed to identify potentially protective peptide ligands. One approach uses T cell lines or clones to screen for biologically active ligands among chromatographic fractions of eluted peptides. (Cox et al., Science, vol 264, 1994, pages 716-719, which is expressly incorporated herein by reference in its entirety) This approach has been employed to identify peptides ligands specific to cancerous cells. A second technique utilizes predictive algorithms to identify peptides capable of binding to a particular class I molecule based upon previously determined motif and/or individual ligand sequences. (De Groot et al., Emerging Infectious Diseases, (7) 4, 2001, which is expressly incorporated herein by reference in its entirety) Peptides having high predicted probability of binding from a pathogen of interest can then be synthesized and tested for T cell reactivity in precursor, tetramer or ELISpot assays.
[0013] However, there has been no readily available source of individual HLA molecules. The quantities of HLA protein available have been small and typically consist of a mixture of different HLA molecules. Production of HLA molecules traditionally involves growth and lysis of cells expressing multiple HLA molecules. Ninety percent of the population is heterozygous at each of the HLA loci; codominant expression results in multiple HLA proteins expressed at each HLA locus. To purify native class I or class II molecules from mammalian cells requires time-consuming and cumbersome purification methods, and since each cell typically expresses multiple surface-bound HLA class I or class II molecules, HLA purification results in a mixture of many different HLA class I or class II molecules. When performing experiments using such a mixture of HLA molecules or performing experiments using a cell having multiple surface-bound HLA molecules, interpretation of results cannot directly distinguish between the different HLA molecules, and one cannot be certain that any particular HLA molecule is responsible for a given result. Therefore, a need existed in the art for a method of producing substantial quantities of individual HLA class I or class II molecules so that they can be readily purified and isolated independent of other HLA class I or class II molecules. Such individual HLA molecules, when provided in sufficient quantity and purity, would provide a powerful tool for studying and measuring immune responses.
[0014] Therefore, there exists a need in the art for improved methods of epitope discovery and comparative ligand mapping for class I and class II MHC molecules, including methods of distinguishing an infected/tumor cell from an uninfected/non-tumor cell. The present invention solves this need by coupling the production of soluble HLA molecules with an epitope isolation, discovery, and direct comparison methodology.
BRIEF DESCRIPTION OF THE DRAWINGS
[0015] FIG. 1. Overview of 2 stage PCR strategy to amplify a truncated version of the human class I MHC.
[0016] FIG. 2. Edman sequence analysis of soluble B*1501, B*1501-HIS and B*1501-FLAG. Residue intensity was categorized as either dominant (3.5-fold or more picomolar increase over previous round) or strong (2.5 to 3.5-fold increase over prior round).
[0017] FIG. 3. Representative MS ion maps from soluble B*1501, B*1501-HIS and B*1501-FLAG illustrating ion overlap between the molecules. Pooled, acid-eluted peptides were fractionated by RP-HPLC, and the individual fractions were MS scanned.
[0018] FIG. 4. Fragmentation pattern generated by MS/MS on an ion selected from fraction 11 of B*1501, B*1501-HIS and B*1501-FLAG peptides illustrating a sequence-level overlap between the three molecules.
[0019] FIG. 5. Flow chart of the epitope discovery of C-terminal-tagged sHLA molecules. Class I positive transfectants are infected with a pathogen of choice and sHLA preferentially purified utilizing the tag. Subtractive comparison of MS ion maps yields ions present only in infected cell, which are then MS/MS sequenced to derive class I epitopes.
[0020] FIG. 6. MS ion map from soluble B*0702 SupT1 cells uninfected and infected with HIV MN-1. Pooled, acid-eluted peptides were fractionated by RP-HPLC, and fraction #30 was MS scanned.
[0021] FIG. 7. MS ion map similar to FIG. 6 but zoomed in on the area from 482-488 amu to more clearly identify all ions in the immediate area.
[0022] FIG. 8. Fragmentation pattern generated by tandem mass spectrometry of the peptide ion 484.72 isolated from infected soluble B*0702 SupT1 cells.
[0023] FIG. 9. Results of a PubMed BLAST search with the sequence GPRTAALGLL identified in FIG. 8.
[0024] FIG. 10. Summary of Results of Entrez-PubMed search for the word "reticulocalbin".
[0025] FIG. 11. Results of a peptide-binding algorithm performed using Parker's Prediction using the entire source protein, reticulocalbin, which generates a list of peptides which are bound by the B*0702 HLA allele.
[0026] FIG. 12. Results of a peptide-binding algorithm performed using Rammensee's SYPEITHI Prediction using the entire source protein, reticulocalbin, which generates a list of peptides which are bound by the B*0702 HLA allele.
[0027] FIG. 13. Results of a predicted proteasomal cleavage of the complete reticulocalbin protein using the cleavage predictor PaProC.
[0028] FIG. 14. Results of a predicted proteasomal cleavage of the complete reticulocalbin protein using the cleavage predictor NetChop 2.0.
[0029] FIG. 15. Several high affinity peptides deriving from reticulocalbin were identified as peptides predicted to be presented by HLA-A*0201 and A*0101.
[0030] FIG. 16. MS ion maps from soluble B*0702 uninfected SupT1 cells of fractions 29 and 31 to determine that ion 484.72 was not present.
[0031] FIG. 17. Fragmentation patterns of soluble B*0702 uninfected SupT1 cells fraction 30 ion 484.72 under identical MS collision conditions to illustrate the absence of the reticulocalbin peptide in the uninfected cells.
[0032] FIG. 18. Comparison of the MS/MS fragmentation patterns of synthetic peptide GPRTAALGLL and peptide ion 484.72 isolated from infected soluble B*0702 SupT1 cells.
DETAILED DESCRIPTION OF THE INVENTION
[0033] Before explaining at least one embodiment of the invention in detail by way of exemplary drawings, experimentation, results, and laboratory procedures, it is to be understood that the invention is not limited in its application to the details of construction and the arrangement of the components set forth in the following description or illustrated in the drawings, experimentation and/or results. The invention is capable of other embodiments or of being practiced or carried out in various ways. As such, the language used herein is intended to be given the broadest possible scope and meaning; and the embodiments are meant to be exemplary--not exhaustive. Also, it is to be understood that the phraseology and terminology employed herein is for the purpose of description and should not be regarded as limiting.
[0034] The present invention generally relates to a method of epitope discovery and comparative ligand mapping as well as methods of distinguishing infected/tumor cells from uninfected/non-tumor cells. The present method broadly includes the following steps: (1) providing a cell line containing a construct that encodes an individual soluble class I or class II MHC molecule (wherein the cell line is capable of naturally processing self or nonself proteins into peptide ligands capable of being loaded into the antigen binding grooves of the class I or class II MHC molecules); (2) culturing the cell line under conditions which allow for expression of the individual soluble class I or class II MHC molecule from the construct, with such conditions also allowing for the endogenous loading of a peptide ligand (from the self or non-self processed protein) into the antigen binding groove of each individual soluble class I or class II MHC molecule prior to secretion of the soluble class I or class II MHC molecules having the peptide ligands bound thereto; and (4) separating the peptide ligands from the individual soluble class I or class II MHC molecules.
[0035] The methods of the present invention may, in one embodiment, utilize a method of producing MHC molecules (from genomic DNA or cDNA) that are secreted from mammalian cells in a bioreactor unit. Substantial quantities of individual MHC molecules are obtained by modifying class I or class II MHC molecules so that they are capable of being secreted, isolated, and purified. Secretion of soluble MHC molecules overcomes the disadvantages and defects of the prior art in relation to the quantity and purity of MHC molecules produced. Problems of quantity are overcome because the cells producing the MHC do not need to be detergent lysed or killed in order to obtain the MHC molecule. In this way the cells producing secreted MHC remain alive and therefore continue to produce MHC. Problems of purity are overcome because the only MHC molecule secreted from the cell is the one that has specifically been constructed to be secreted. Thus, transfection of vectors encoding such secreted MHC molecules into cells which may express endogenous, surface bound MHC provides a method of obtaining a highly concentrated form of the transfected MHC molecule as it is secreted from the cells. Greater purity is assured by transfecting the secreted MHC molecule into MHC deficient cell lines.
[0036] Production of the MHC molecules in a hollow fiber bioreactor unit allows cells to be cultured at a density substantially greater than conventional liquid phase tissue culture permits. Dense culturing of cells secreting MHC molecules further amplifies the ability to continuously harvest the transfected MHC molecules. Dense bioreactor cultures of MHC secreting cell lines allow for high concentrations of individual MHC proteins to be obtained. Highly concentrated individual MHC proteins provide an advantage in that most downstream protein purification strategies perform better as the concentration of the protein to be purified increases. Thus, the culturing of MHC secreting cells in bioreactors allows for a continuous production of individual MHC proteins in a concentrated form.
[0037] The method of producing MHC molecules utilized in the present invention begins by obtaining genomic or complementary DNA which encodes the desired MHC class I or class II molecule. Alleles at the locus which encode the desired MHC molecule are PCR amplified in a locus specific manner. These locus specific PCR products may include the entire coding region of the MHC molecule or a portion thereof. In one embodiment a nested or hemi-nested PCR is applied to produce a truncated form of the class I or class II gene so that it will be secreted rather than anchored to the cell surface. In another embodiment the PCR will directly truncate the MHC molecule.
[0038] Locus specific PCR products are cloned into a mammalian expression vector and screened with a variety of methods to identify a clone encoding the desired MHC molecule. The cloned MHC molecules are DNA sequenced to insure fidelity of the PCR. Faithful truncated clones of the desired MHC molecule are then transfected into a mammalian cell line. When such cell line is transfected with a vector encoding a recombinant class I molecule, such cell line may either lack endogenous class I MHC molecule expression or express endogenous class I MHC molecules. One of ordinary skill of the art would note the importance, given the present invention, that cells expressing endogenous class I MHC molecules may spontaneously release MHC into solution upon natural cell death. In cases where this small amount of spontaneously released MHC is a concern, the transfected class I MHC molecule can be "tagged" such that it can be specifically purified away from spontaneously released endogenous class I molecules in cells that express class I molecules. For example, a DNA fragment encoding a HIS tail may be attached to the protein by the PCR reaction or may be encoded by the vector into which the PCR fragment is cloned, and such HIS tail, therefore, further aids in the purification of the class I MHC molecules away from endogenous class I molecules. Tags beside a histidine tail have also been demonstrated to work, and one of ordinary skill in the art of tagging proteins for downstream purification would appreciate and know how to tag a MHC molecule in such a manner so as to increase the ease by which the MHC molecule may be purified.
[0039] Cloned genomic DNA fragments contain both exons and introns as well as other non-translated regions at the 5' and 3' termini of the gene. Following transfection into a cell line which transcribes the genomic DNA (gDNA) into RNA, cloned genomic DNA results in a protein product thereby removing introns and splicing the RNA to form messenger RNA (mRNA), which is then translated into an MHC protein. Transfection of MHC molecules encoded by gDNA therefore facilitates reisolation of the gDNA, mRNA/cDNA, and protein. Production of MHC molecules in non-mammalian cell lines such as insect and bacterial cells requires cDNA clones, as these lower cell types do not have the ability to splice introns out of RNA transcribed from a gDNA clone. In these instances the mammalian gDNA transfectants of the present invention provide a valuable source of RNA which can be reverse transcribed to form MHC cDNA. The cDNA can then be cloned, transferred into cells, and then translated into protein. In addition to producing secreted MHC, such gDNA transfectants therefore provide a ready source of mRNA, and therefore cDNA clones, which can then be transfected into non-mammalian cells for production of MHC. Thus, the present invention which starts with MHC genomic DNA clones allows for the production of MHC in cells from various species.
[0040] A key advantage of starting from gDNA is that viable cells containing the MHC molecule of interest are not needed. Since all individuals in the population have a different MHC repertoire, one would need to search more than 500,000 individuals to find someone with the same MHC complement as a desired individual--such a practical example of this principle is observed when trying to find a donor to match a recipient for bone marrow transplantation. Thus, if it is desired to produce a particular MHC molecule for use in an experiment or diagnostic, a person or cell expressing the MHC allele of interest would first need to be identified. Alternatively, in the method of the present invention, only a saliva sample, a hair root, an old freezer sample, or less than a milliliter (0.2 ml) of blood would be required to isolate the gDNA. Then, starting from gDNA, the MHC molecule of interest could be obtained via a gDNA clone as described herein, and following transfection of such clone into mammalian cells, the desired protein could be produced directly in mammalian cells or from cDNA in several species of cells using the methods of the present invention described herein.
[0041] Current experiments to obtain an MHC allele for protein expression typically start from mRNA, which requires a fresh sample of mammalian cells that express the MHC molecule of interest. Working from gDNA does not require gene expression or a fresh biological sample. It is also important to note that RNA is inherently unstable and is not as easily obtained as is gDNA. Therefore, if production of a particular MHC molecule starting from a cDNA clone is desired, a person or cell line that is expressing the allele of interest must traditionally first be identified in order to obtain RNA. Then a fresh sample of blood or cells must be obtained; experiments using the methodology of the present invention show that ≧5 milliliters of blood that is less than 3 days old is required to obtain sufficient RNA for MHC cDNA synthesis. Thus, by starting with gDNA, the breadth of MHC molecules that can be readily produced is expanded. This is a key factor in a system as polymorphic as the MHC system; hundreds of MHC molecules exist, and not all MHC molecules are readily available. This is especially true of MHC molecules unique to isolated populations or of MHC molecules unique to ethnic minorities. Starting class I or class II MHC molecule expression from the point of genomic DNA simplifies the isolation of the gene of interest and insures a more equitable means of producing MHC molecules for study; otherwise, one would be left to determine whose MHC molecules are chosen and not chosen for study, as well as to determine which ethnic population from which fresh samples cannot be obtained and therefore should not have their MHC molecules included in a diagnostic assay.
[0042] While cDNA may be substituted for genomic DNA as the starting material, production of cDNA for each of the desired HLA class I types will require hundreds of different, HLA typed, viable cell lines, each expressing a different HLA class I type. Alternatively, fresh samples are required from individuals with the various desired MHC types. The use of genomic DNA as the starting material allows for the production of clones for many HLA molecules from a single genomic DNA sequence, as the amplification process can be manipulated to mimic recombinatorial and gene conversion events. Several mutagenesis strategies exist whereby a given class I gDNA clone could be modified at either the level of gDNA or at the cDNA resulting from this gDNA clone. The process of producing MHC molecules utilized in the present invention does not require viable cells, and therefore the degradation which plagues RNA is not a problem.
[0043] The soluble class I MHC proteins produced by the method described herein are utilized in the methods of epitope discovery and comparative ligand mapping of the present invention. The methods of epitope discovery and comparative ligand mapping described herein which utilize secreted individual MHC molecules have several advantages over the prior art, which utilized MHC from cells expressing multiple membrane-bound MHCs. While the prior art method could distinguish if an epitope was presented on the surface of a cell, this prior art method is unable to directly distinguish in which specific MHC molecule the peptide epitope was bound. Lengthy purification processes might be used to try and obtain a single MHC molecule, but doing so limits the quantity and usefulness of the protein obtained. The novelty and flexibility of the current invention is that individual MHC specificities can be utilized in sufficient quantity through the use of recombinant, soluble MHC proteins.
[0044] Class I and class II MHC molecules are really a trimolecular complex consisting of an alpha chain, a beta chain, and the alpha/beta chain's peptide cargo (i.e., peptide ligand) which is presented on the cell surface to immune effector cells. Since it is the peptide cargo, and not the MHC alpha and beta chains, which marks a cell as infected, tumorigenic, or diseased, there is a great need to identify and characterize the peptide ligands bound by particular MHC molecules. For example, characterization of such peptide ligands greatly aids in determining how the peptides presented by a person with MHC-associated diabetes differ from the peptides presented by the MHC molecules associated with resistance to diabetes. As stated above, having a sufficient supply of an individual MHC molecule, and therefore that MHC molecule's bound peptides, provides a means for studying such diseases. Because the method of the present invention provides quantities of MHC protein previously unobtainable, unparalleled studies of MHC molecules and their important peptide cargo can now be facilitated.
[0045] Therefore, the present invention is also related to methods of epitope discovery and comparative ligand mapping which can be utilized to distinguish infected/tumor cells from uninfected/non-tumor cells by unique epitopes presented by MHC molecules in the disease or non-disease state.
[0046] Creation of sHLA Molecules from Genomic DNA (gDNA)
[0047] 1. Genomic DNA Extraction. 200 l of sample either blood, plasma, serum, buffy coat, body fluid or up to 5×106 lymphocytes in 200 l Phosphate buffered saline were used to extract genomic DNA using the QIAamp® DNA Blood Mini Kit blood and body fluid spin protocol. Genomic DNA quality and quantity was assessed using optical density readings at 260 nm and 280 nm.
TABLE-US-00001 TABLE I Primer name Sequence 5'-3' Locus Cut site Annealing s PP5UTA GCGCTCTAGACCCAGACGCCGAGGATGGCC A XbaI 5UT 3PPI4A GCCCTGACCCTGCTAAAGGT A Intron 4 PP5UTB GCGCTCTAGACCACCCGGACTCAGAATCTCCT B XbaI 5UT 3PPI4B TGCTTTCCCTGAGAAGAGAT B Intron 4 5UTB39 AGGCGAATTCCAGAGTCTCCTCAGACGCG B*39 EcoRI 5UT B39 5PKCE GGGCGAATTCCCGCCGCCACCATGCGGGTCATGGCGCC C EcoRI 5UT 3PPI4C TTCTGCTTTCCTGAGAAGAC C Intron 4 PP5UT GGGCGAATTCGGACTCAGAATCTCCCCAGACGCCGAG B EcoRI 5UT PP3PEI CCGCGAATTCTCATCTCAGGGTGAGGGGCT A, B, C EcoRI Exon 4 PP3PEIH CCGCAAGCTTTCATCTCAGGGTGAGGGGCT A, B, C HindIII Exon 4 3PEIHC7 CCGCAAGCTTTCAGCTCAGGGTGAGGGGCT Cw*07 HindIII Exon 4 indicates data missing or illegible when filed
[0048] 2.1 PCR Strategy. Primers were designed for HLA-A, -B and -C loci in order to amplify a truncated version of the human class I MHC using a 2 stage PCR strategy. The first stage PCR uses a primer set that amplify from the 5' Untranslated region to Intron 4. This amplicon is used as a template for the second PCR which results in a truncated version of the MHC Class I gene by utilizing a 3' primer that sits down in exon 4, the 5' primer remains the same as the 1st PCR. An overview can be seen in FIG. 1. The primers for each locus are listed in TABLE I. Different HLA-B locus alleles require primers with different restriction cut sites depending on the nucleotide sequence of the allele. Hence there are two 5' and two 3' truncating primers for the -B locus.
[0049] 2.2 Primary PCR. Materials: An Eppendorf Gradient Mastercycler is used for all PCR. (1) H2O:Dionized ultrafiltered water (DIUF) Fisher Scientific, W2-4,41. (2) PCR nucleotide mix (10 mM each deoxyribonucleoside triphosphate [dNTP]), Boehringer Manheim, #1814, 362. (3) 10×Pfx Amplification buffer, pH 9.0, GibcoBRL®, part #52806, formulation is proprietary information. (4) 50 mM MgSO4, GibcoBRL®, part #52044 (5) Platinum® Pfx DNA Polymerase (B Locus only), GibcoBRL®, 11708-013. (6) Pfu DNA Polymerase (A and C Locus), Promega, M7741. (7) Pfu DNA Polymerase 10× reaction Buffer with MgSO4, 200 mM Tris-HCL, pH 8.8, 100 mM KCl, 100 mM (NH4)2SO4, 20 mM MgSO4, 1 mg/ml nuclease free BSA, 1% Triton® X-100. (8) Amplification primers (in ng/ l) (see TABLE I): A locus: 5' sense PP5UTA (300); 3' antisense PPI4A (300); B locus (Not B*39's): sense PP5UTB (300); antisense PPI4B (300); B locus (B*39's): sense 5UTB39 (300); antisense PPI4B (300); C Locus: sense 5PKCE (300); antisense PPI4C (300). (9) gDNA Template.
[0050] 2.3 Secondary PCR (also used for colony PCR). (1) H2O:Dionized ultrafiltered water (DIUF) Fisher Scientific, W2-4,41. (2) PCR nucleotide mix (10 mM each deoxyribonucleoside triphosphate [dNTP]), Boehringer Manheim, #1814, 362. (3) Pfu DNA Polymerase (A and C Locus), Promega, M7741. (4) Pfu DNA Polymerase 10× reaction Buffer with MgSO4, 200 mM Tris-HCL, pH 8.8, 100 mM KCl, 100 mM (NH4)2SO4, 20 mM MgSO4, 1 mg/ml nuclease free BSA, 1% Triton® X-100. (5) Amplification primers (in ng/ l) see TABLE I: A-locus: 5' sense PP5UTA (300), 3' antisense PP3PEI (300); B-locus: sense PP5UTB (300), antisense PP3PEI (300); B-locus: sense PP5UT (300), antisense PP3PEIH (300); B-locus B39's: sense 5UTB39 (300), antisense PP3PEIH (300); C-locus: sense 5PKCE (300), antisense PP3PEI (300); C-locus Cw*7's: sense 5PKCE (300), antisense 3PEIHC7 (300). (6) Template 1:100 dilution of the primary PCR product.
[0051] 2.4 Gel Purification of PCR products and vectors. (1) Dark Reader Tansilluminator Model DR-45M, Clare Chemical Research. (2) SYBR Green, Molecular Probes Inc. (3) Quantum Prep Freeze 'N Squeeze DNA Gel Rxtraction Spin Columns, Bio-Rad Laboratories, 732-6165.
[0052] 2.5 Restriction digests, Ligation and Transformation. (1) Restriction enzymes from New England Biolabs: (a) EcoR I #R0101S; (b) Hind III #R0104S; (c) Xba I #R0145S. (2) T4 DNA Ligase, New England Biolabs, #M0202S. (3) pcDNA3.1(-), Invitrogen Corporation, V795-20. (4) 10× Buffers from New England Biolabs: (a) EcoR I buffer, 500 mM NaCl, 1000 mM Tris-HCL, 10 mM MgCL2, 0.25% Triton-X 100, pH 7.5; (b) T4 DNA ligase buffer, 500 mM Tris-HCL, 100 mM MgCL2, 100 mM DTT, 10 mM ATP, 250 ug/ml BSA, pH 7.5; (c) NEB buffer 2, 500 mM NaCl, 100 mM Tris-HCl, 100 mM MgCl2, 10 mM DDT, pH 7.9. (5) 100×BSA, New England Biolabs. (6) Z-Competent E. coli Transformation Buffer Set, Zymo Research, T3002. (7) E. coli strain JM109. (8) LB Plates with 100 g/ml ampicillin. (9) LB media with 100 g/ml ampicillin.
[0053] 2.6 Plasmid Extraction. Wizard Plus SV minipreps, Promega, #A1460.
[0054] 2.7 Sequencing of Clones. (1) Thermo Sequenase Primer Cycle Sequencing Kit, Amersham Pharmacia Biotech, 25-2538-01. (2) CY5 labelled primers (see TABLE II). (3) AlfExpress automated DNA sequencer, Amersham Pharmacia Biotech.
TABLE-US-00002 TABLE II Primer Name Sequence 5'-3' Seq. ID NO: T7Prom TAATACGACTCACTATAGGG 12 BGHrev TAGAAGGCACAGTCGAGG 13 PPI2E2R GTCGTGACCTGCGCCCC 14 PPI2E2F TTTCATTTTCAGTTTAGGCCA 15 ABCI3E4F GGTGTCCTGTCCATTCTCA 16
[0055] 2.8 Gel Casting. (1) PagePlus 40% concentrate, Amresco, E562, 500 ml. (2) Urea, Amersham Pharmacia Biotech, 17-0889-01,500g. (3) 3 N'N'N'N'-tetramethylethyleneiamine (TEMED), APB. (4) Ammonium persulphate (10% solution), APB. (5) Boric acid, APB. (6) EDTA-disodium salt, APB. (7) Tris, APB. (8) Bind-Saline, APB. (9) Isopropanol, Sigma. (10) Glacial Acetic Acid, Fisher Biotech. (11) DIUF water, Fisher Scientific. (12) EtOH 200-proof.
[0056] 2.9 Plasmid Preparation for Electroporation. Qiagen Plasmid Midi kit, Qiagen Inc., 12143.
[0057] 3.0 Electroporation. (1) Biorad Gene Pulser with capacitance extender, Bio-Rad Laboratories. (2) Gene Pulser Cuvette, Bio-Rad Laboratories. (3) Cytomix: 120 mM KCl, 0.15 mM CaCl2, 10 mMK2HPO4/KH2PO4, pH 7.6, 25 mM Hepes, pH 7.6, 2 mM EGTA, pH 7.6, 5 mM MgCl2, pH 7.6 with KOH. (4) RPMI 1640+ 20% Foetal Calf Serum+Pen/strep. (5) Haemacytometer. (6) Light Microscope. (7) CO2 37° Incubator. (8) Cells in log phase.
[0058] Primary PCR
[0059] A-Locus and C-Locus
TABLE-US-00003 10x Pfu buffer 5 μl 5' Primer (300 ng/μl) 1 μl 3' Primer (300 ng/μl) 1 μl dNTP's (10 mM each) 1 μl gDNA (50 ng/μl) 10 μl DIUF H20 31.4 μl Pfu DNA Polymerase 0.6 μl 96° C. 2 min. 95° C. 1 min 58° C. 1 min {close oversize brace} x35 73° C. 5 min 73° C. 10 min
[0060] B-Locus
TABLE-US-00004 10x Pfx buffer 5 μl 5' Primer (300 ng/μl) 1 μl 3' Primer (300 ng/μl) 1 μl dNTP's (10 mM each) 1.5 μl MgSO4 (50 mM) 1 μl gDNA (100 ng/μl) 1 μl DIUF H2O 40 μl Pfx DNA Polymerase 0.5 μl 94° C. 2 min. 94° C. 1 min 60° C. 1 min {close oversize brace} x35 68° C. 3.5 min 68° C. 5 min
[0061] Gel Purification of PCR (all PCR and Plasmids are Gel Purified)
[0062] Mix primary PCR with 5 l of 1×SYBR green and incubate at room temperature for 15 minutes then load on a 1% agarose gel. Visualize on the Dark Reader and purify using the Quantum Prep Freeze and Squeeze extraction kit according to the manufacturers instructions.
[0063] Secondary PCR
[0064] A, B and C Loci
TABLE-US-00005 10x Pfu buffer 5 μl 5' Primer (300 ng/μl) 0.5 μl 3' Primer (300 ng/μl) 0.5 μl dNTP's (10 mM each) 1 μl 1:100 1° PCR 10 μl DIUF H20 37.5 μl Pfu DNA Polymerase 0.5 μl 96° C. 2 min. 95° C. 1 min 58° C. 1 min {close oversize brace} x35 73° C. 4 min 73° C. 7 min
[0065] Restriction Digests
TABLE-US-00006 2° PCR (gel purified) 30 μl Restriction enzyme 1 X μl Restriction enzyme 2 X μl 10x buffer 5 μl 100x BSA 0.5 μl DIUF H2O 10.5 μl
[0066] The enzymes used will be determined by the cut sites incorporated into the PCR primers for each individual PCR. The expression vector pcDNA3.1 (-) will be cut in a similar manner.
[0067] Ligation
TABLE-US-00007 PcDNA3.1(--) cut with same enzymes as PCR 50 ng Cut PCR 100 ng 10x T4 DNA ligase buffer 2 μl T4 DNA Ligase 1 μl DIUF H20 up to 20 μl
[0068] Transformation
[0069] Transform JM109 using competent cells made using Z-competent E. coli Transformation Kit and Buffer Set and follow the manufacturers instructions.
[0070] Colony PCR
[0071] This will check for insert in any transformed cells. Follow the same protocol for the secondary PCR.
[0072] Mini Preps of Colonies with Insert
[0073] Use the Wizard Plus SV minipreps and follow the manufacturers instructions. Make glycerol stocks before beginning extraction protocol.
[0074] Sequencing of Positive Clones
[0075] Using the Thermo Sequenase Primer Cycle Sequencing Kit
TABLE-US-00008 A, C, G or T mix 3 μl CY5 Primer 1 pm/μl 1 μl DNA template 100 ng/μl 5 μl 96° C. .sup. 2 min 96° C. 30 sec {close oversize brace} x25 61° C. 30 sec
[0076] Add 6 l formamide loading buffer and load 10 l onto sequencing gel. Analyse sequence for good clones with no misincorporations.
[0077] Midi Preps
[0078] Prepare plasmid for electroporation using the Qiagen Plasmid Midi Kit according to the manufacturers instructions.
[0079] Electroporation
[0080] Electroporations are performed as described in "The Bw4 public epitope of HLA-B molecules confers reactivity with natural killer cell clones that express NKb1, a putative HLA receptor. Gumperz, J. E., V. Litwin, J. H. Phillips, L. L. Lanier and P. Parham. J. Exp. Med. 181:1133-1144, 1995, which is expressly incorporated herein by reference."
[0081] Screening for Production of Soluble HLA
[0082] An ELISA is used to screen for the production of soluble HLA. For biochemical analysis, monomorphic monoclonal antibodies are particularly useful for identification of HLA locus products and their subtypes.
[0083] W6/32 is one of the most common monoclonal antibodies (mAb) used to characterize human class I major histocompatibility complex (MHC) molecules. It is directed against monomorphic determinants on HLA-A, -B and -C HCs, which recognizes only mature complexed class I molecules and recognizes a conformational epitope on the intact MHC molecule containing both beta2-microglobulin (atm) and the heavy chain (HC). W6/32 binds a compact epitope on the class I molecule that includes both residue 3 of beta2m and residue 121 of the heavy chain (Ladasky J J, Shum B P, Canavez F, Seuanez H N, Parham P. Residue 3 of beta2-microglobulin affects binding of class I MHC molecules by the W6/32 antibody. Immunogenetics 1999 April; 49(4):312-20.). The constant portion of the molecule W6/32 binds to is recognized by CTLs and thus can inhibit cytotoxicity. The reactivity of W6/32 is sensitive to the amino terminus of human beta2-microglobulin (Shields M J, Ribaudo R K. Mapping of the monoclonal antibody W6/32: sensitivity to the amino terminus of beta2-microglobulin. Tissue Antigens 1998 May, 51(5):567-70). HLA-C could not be clearly identified in immunoprecipitations with W6/32 suggesting that HLA-C locus products may be associated only weakly with b2m, explaining some of the difficulties encountered in biochemical studies of HLA-C antigens [Stam, 1986 #1]. The polypeptides correlating with the C-locus products are recognized far better by HC-10 than by W6/32 which confirms that at least some of the C products may be associated with b2m more weakly than HLA-A and -B. W6/32 is available biotinylated (Serotec MCA81B) offering additional variations in ELISA procedures.
[0084] HC-10 is reactive with almost all HLA-B locus free heavy chains. The A2 heavy chains are only very weakly recognized by HC-10. Moreover, HC-10 reacts only with a few HLA-A locus heavy chains. In addition, HC-10 seems to react well with free heavy chains of HLA-C types. No evidence for reactivity of HC-10 with heavy-chain/b2m complex has been obtained. None of the immunoprecipitates obtained with HC-10 contained b2m [Stam, 1986 #1]. This indicates that HC-10 is directed against a site of the HLA class I heavy chain that includes the portion involved in interaction with the atm. The pattern of HC-10 precipitated material is qualitatively different from that isolated with W6/32.
[0085] TP25.99 detects a determinant in the alpha3 domain of HLA-ABC. It is found on denatured HLA-B (in Western) as well as partially or fully folded HLA-A, B, & C. It doesn't require a peptide or a2m, i.e., it works with the alpha 3 domain which folds without peptide. This makes it useful for HC determination.
[0086] Anti-human a2m (HRP) (DAKO P0174) recognizes denatured as well as complexed a2m. Although in principle anti-a2m reagents could be used for the purpose of identification of HLA molecules, they are less suitable when association of heavy chain and a2m is weak. The patterns of class I molecules precipitated with W6/32 and anti-a2m are usually indistinguishable [Vasilov, 1983 #10].
[0087] Rabbit anti-a2-microglobulin dissociates a2-microglobulin from heavy chain as a consequence of binding (Rogers, M. J., Appella, E., Pierotti, M. A., Invernizzi, G., and Parmiani, G. (1979) Proc Natl. Acad. Sci. U.S.A. 76, 1415-1419). It also has been reported that rabbit anti-human a2-microglobulin dissociactes a2-microglobulin from HLA heavy chains upon binding (Nakamuro, K., Tanigaki, N., and Pressman, D. (1977) Immunology 32, 139-146.). This anti-human a2m antibody is also available unconjugated (DAKO A0072).
[0088] The W6/32-HLA sandwich ELISA. Sandwich assays can be used to study a number of aspects of protein complexes. If antibodies are available to different components of a heteropolymer, a two-antibody assay can be designed to test for the presence of the complex. Using a variation of these assays, monoclonal antibodies can be used to test whether a given antigen is multimeric. If the same monoclonal antibody is used for both the solid phase and the label, monomeric antigens cannot be detected. Such combinations, however, may detect multimeric forms of the antigen. In these assays negative results may be generated both by multimeric antigen held in unfavorable steric positions as well as by monomeric antigens.
[0089] The W6/32--anti-a2m antibody sandwich assay is one of the best techniques for determining the presence and quantity of sHLA. Two antibody sandwich assays are quick and accurate, and if a source of pure antigen is available, the assay can be used to determine the absolute amounts of antigen in unknown samples. The assay requires two antibodies that bind to non-overlapping epitopes on the antigen. This assay is particularly useful to study a number of aspects of protein complexes.
[0090] To detect the antigen (sHLA), the wells of microtiter plates are coated with the specific (capture) antibody W6/32 followed by the incubation with test solutions containing antigen. Unbound antigen is washed out and a different antigen-specific antibody (anti-a2m) conjugated to HRP is added, followed by another incubation. Unbound conjugate is washed out and substrate is added. After another incubation, the degree of substrate hydrolysis is measured. The amount of substrate hydrolyzed is proportional to the amount of antigen in the test solution.
[0091] The major advantages of this technique are that the antigen does not need to be purified prior to use and that the assays are very specific. The sensitivity of the assay depends on 4 factors: (1) The number of capture antibody; (2) The avidity of the capture antibody for the antigen; (3) The avidity of the second antibody for the antigen; (4) The specific activity of the labeled second antibody.
[0092] Using an ELISA protocol template and label a clear 96-well polystyrene assay plate. Polystyrene is normally used as a microtiter plate. (Because it is not translucent, enzyme assays that will be quantitated by a plate reader should be performed in polystyrene and not PVC plates).
[0093] Coating of the W6/32 is performed in Tris buffered saline (TBS); pH 8.5. A coating solution of 8.0 g/ml of specific W6/32 antibody in TBS (pH 8.5) is prepared. (blue tube preparation stored at -20 C with a concentration of 0.2 mg/ml and a volume of 1 ml giving 0.2 mg per tube).
TABLE-US-00009 TABLE III No. of Total W6/32 TBS plates Volume antibody pH 8.5 1 10 ml 400 μl 9.6 ml 2 20 ml 800 μl 19.2 ml 3 30 ml 1200 μl 28.8 ml 4 40 ml 1600 μl 38.4 ml 5 50 ml 2000 μl 48.0 ml
[0094] Although this is well above the capacity of a microtiter plate, the binding will occur more rapidly. Higher concentrations will speed the binding of antigen to the polystyrene but the capacity of the plastic is only about 100 ng/well (300 ng/cm2), so the extra protein will not bind. (If using W6/32 of unknown composition or concentration, first titrate the amount of standard antibody solution needed to coat the plate versus a fixed, high concentration of labeled antigen. Plot the values and select the lowest level that will yield a strong signal. Do not include sodium azide in any solutions when horseradish peroxidase is used for detection.
[0095] Immediately coat the microtiter plate with 100 l of antigen solution per well using a multichannel pipet. Standard polystyrene will bind antibodies or antigens when the proteins are simply incubated with the plastic. The bonds that hold the proteins are non-covalent, but the exact types of interactions are not known. Shake the plate to ensure that the antigen solution is evenly distributed over the bottom of each well. Seal the plate with plate sealers (sealplate adhesive sealing film, nonsterile, 100 per unit; Phenix; LMT-Seal-EX) or sealing tape to Nunc-Immuno® Modules (#236366). Incubate at 4° C. overnight. Avoid detergents and extraneous proteins. Next day, remove the contents of the well by flicking the liquid into the sink or a suitable waste container. Remove last traces of solution by inverting the plate and blotting it against clean paper toweling. Complete removal of liquid at each step is essential for good performance.
[0096] Wash the plate 10 times with Wash Buffer (PBS containing 0.05% Tween-20) using a multi-channel ELISA washer. After the last wash, remove any remaining Wash Buffer by inverting the plate and blotting it against clean paper toweling. After the W6/32 is bound, the remaining sites on the plate must be saturated by incubating with blocking buffer made of 3% BSA in PBS. Fill the wells with 20011 blocking buffer. Cover the plates with an adhesive strip and incubate overnight at 4° C. Alternatively, incubate for at least 2 hours at room temperature which is, however, not the standard procedure. Blocked plates may be stored for at least 5 days at 4° C. Good pipetting practice is most important to produce reliable quantitative results. The tips are just as important a part of the system as the pipette itself. If they are of inferior quality or do not fit exactly, even the best pipette cannot produce satisfactory results. The pipette working position is always vertical: Otherwise causing too much liquid to be drawn in. The immersion depth should be only a few millimeters. Allow the pipetting button to retract gradually, observing the filling operation. There should be no turbulence developed in the tip, otherwise there is a risk of aerosols being formed and gases coming out of solution.
[0097] When maximum levels of accuracy are stipulated, prewetting should be used at all times. To do this, the required set volume is first drawn in one or two times using the same tip and then returned. Prewetting is absolutely necessary on the more difficult liquids such as 3% BSA. Do not prewet, if your intention is to mix your pipetted sample thoroughly with an already present solution. However, prewet only for volumes greater than 10 l. In the case of pipettes for volumes less than 10 l the residual liquid film is as a rule taken into account when designing and adjusting the instrument. The tips must be changed between each individual sample. With volumes<10 l special attention must also be paid to drawing in the liquid slowly, otherwise the sample will be significantly warmed up by the frictional heat generated. Then slowly withdraw the tip from the liquid, if necessary wiping off any drops clinging to the outside.
[0098] To dispense the set volume hold the tip at a slight angle, press it down uniformly as far as the first stop. In order to reduce the effects of surface tension, the tip should be in contact with the side of the container when the liquid is dispensed. After liquid has been discharged with the metering stroke, a short pause is made to enable the liquid running down the inside of the tip to collect at its lower end. Then press it down swiftly to the second stop, in order to blow out the tip with the extended stroke with which the residual liquid can be blown out. In cases that are not problematic (e.g., aqueous solutions) this brings about a rapid and virtually complete discharge of the set volume. In more difficult cases, a slower discharge and a longer pause before actuating the extended stroke can help. To determine the absolute amount of antigen (sHLA), sample values are compared with those obtained using known amounts of pure unlabeled antigen in a standard curve.
[0099] For accurate quantitation, all samples have to be run in triplicate, and the standard antigen-dilution series should be included on each plate. Pipetting should be preformed without delay to minimize differences in time of incubation between samples. All dilutions should be done in blocking buffer. Thus, prepare a standard antigen-dilution series by successive dilutions of the homologous antigen stock in 3% BSA in PBS blocking buffer. In order to measure the amount of antigen in a test sample, the standard antigen-dilution series needs to span most of the dynamic range of binding. This range spans from 5 to 100 ng sHLA/ml. A stock solution E of 1 g/ml should be prepared, aliquoted in volumes of 300 l and stored at 4 C. Prepare a 50 ml batch of standard at the time. (New batches need to be compared to the old batch before used in quantitation).
[0100] Use a tube of the standard stock solution E to prepare successive dilutions. While standard curves are necessary to accurately measure the amount of antigen in test samples, they are unnecessary for qualitative "yes/no" answers. For accurate quantitation, the test solutions containing sHLA should be assayed over a number of at least 4 dilutions to assure to be within the range of the standard curve. Prepare serial dilutions of each antigen test solution in blocking buffer (3% BSA in PBS). After mixing, prepare all dilutions in disposable U-bottom 96 well microtiter plates before adding them to the W6/32-coated plates with a multipipette. Add 150 l in each well. To further proceed, remove any remaining blocking buffer and wash the plate as described above. The plates are now ready for sample addition. Add 100 l of the sHLA containing test solutions and the standard antigen dilutions to the antibody-coated wells.
[0101] Cover the plates with an adhesive strip and incubate for exactly 1 hour at room temperature. After incubation, remove the unbound antigen by washing the plate 10× with Wash Buffer (PBS containing 0.05% Tween-20) as described. Prepare the appropriate developing reagent to detect sHLA. Use the second specific antibody, anti-human atm-HRP (DAKO P0174/0.4 mg/ml) conjugated to Horseradish Peroxidase (HRP). Dilute the anti-human atm-HRP in a ratio of 1:1000 in 3% BSA in PBS. (Do not include sodium azide in solutions when horseradish peroxidase is used for detection).
TABLE-US-00010 TABLE IV No. of Total anti-β2m-HRP 3% BSA plates Volume antibody in PBS 1 10 ml 10 μl 10 ml 2 20 ml 20 μl 2 ml 3 30 ml 30 μl 30 ml 4 40 ml 40 μl 40 ml 5 50 ml 50 μl 50 ml
[0102] Add 100 l of the secondary antibody dilution to each well. All dilutions should be done in blocking buffer. Cover with a new adhesive strip and incubate for 20 minutes at room temperature. Prepare the appropriate amount of substrate prior to the wash step. Bring the substrate to room temperature.
[0103] OPD (o-Phenylenediamine) is a peroxidase substrate suitable for use in ELISA procedures. The substrate produces a soluble end product that is yellow in color. The OPD reaction is stopped with 3NH2SO4, producing an orange-brown product and read at 492 nm. Prepare OPD fresh from tablets (Sigma, P6787; 2 mg/tablet). The solid tablets are convenient to use when small quantities of the substrate are required. After second antibody incubation, remove the unbound secondary reagent by washing the plate 10× with Wash Buffer (PBS containing 0.05% Tween-20). After the final wash, add 100 l of the OPD substrate solution to each well and allow to develop at room temperature for 10 minutes. Reagents of the developing system are light-sensitive, thus, avoid placing the plate in direct light. Prepare the 3NH2SO4 stop solution. After 10 minutes, add 100 l of stop solution per 100 l of reaction mixture to each well. Gently tap the plate to ensure thorough mixing.
[0104] Read the ELISA plate at a wavelength of 490 nm within a time period of 15 minutes after stopping the reaction. The background should be around 0.1. If your background is higher, you may have contaminated the substrate with a peroxidase. If the substrate background is low and the background in your assay is high, this may be due to insufficient blocking. Finally analyze your readings. Prepare a standard curve constructed from the data produced by serial dilutions of the standard antigen. To determine the absolute amount of antigen, compare these values with those obtained from the standard curve.
[0105] Creation of Transfectants and Production of Soluble Class I Molecules
[0106] Transfectants were established as previously described (Prilliman, K R et al., Immunogenetics 45:379, 1997, which is expressly incorporated herein by reference) with the following modifications: a cDNA clone of B*1501 containing the entire coding region of the molecule was PCR amplified in order to generate a construct devoid of the cytoplasmic domain using primers 5PXI (59-GGGCTCTAGAGGACTCAGAATCTCCCCAGAC GCCGAG-39) and 3PEI (59-CCGCGAATTCTCATCTCAGGGTGAG-39) as shown in TABLE V. Constructs were also created containing a C-terminal epitope tag consisting of either 6 consecutive histidines or the FLAG epitope (Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys). TABLE V Primers utilized to create B*1501-HIS and B*1501-FLAG were 5PXI and 3PEIHIS(59-CCGCGAATTCTCAGTGGTGGTGGTGGTGGTGCCATCTCAGGGTGAG-39) or 3PEIFLAG (59-CCGCGAATTCTCACTTGTCATCGTCGTCCTTGTAATCCCATCTCAGGGTGAG-39). PCR amplicons were purified using a Qiagen Spin PCR purification kit (Qiagen, Levsden, The Netherlands) and cloned into the mammalian expression vector pcDNA 3.1 (Invitrogen, Carlsbad, Calif., USA). TABLE V. After confirmation of insert integrity by bidirectional DNA sequencing, constructs were electroporated into the class I negative B-lymphoblastoid cell line 721.221 (Prilliman, K R et al., 1997, previously incorporated herein by reference). Transfectants were maintained in medium containing G418 post-electroporation and subcloned in order to isolate efficient producers of soluble class I as determined by ELISA (Prilliman, K R et al, 1997, previously incorporated herein by reference).
TABLE-US-00011 TABLE V Full-length Seq. Primer or ID name Sequence Truncating Notes NO: HLA5UT GGGCGTCGACGGACTCAGAATCT either 5' primer, SaI I cut site 17 CCCCAGACGCCGAG 5UTA GCGCGTCGACCCCAGACGCCGAG either 5' primer, SaI I cut site A-locus specific 18 GATGGCC 5PXI GGGCTCTAGAGGACTCAGAATCT either 5' primer, Xba I cut site 19 CCCCAGACGCCGAG CLSP23 CCGCGTCGACTCAGATTCTCCCC full-length 5' primer, SaI I cut site C-locus specific 20 AGACGCCGAGATG LDC3UTA CCGCAAGCTTAGAAACAAAGTCA full-length 3' primer, HindIII cut site A-locus specific 21 GGGTT CLSP1085 CCGCAAGCTTGGCAGCTGTCTCA full-length 3' primer, HindIII cut site C-locus specific 22 GGCTTTACAAG(CT)G 3UTA CCGCAAGCTTTTGGGGAGGGAGC full-length 3' primer, HindIII cut site A-locus specific 23 ACAGGTCAGCGTGGGAAG 3UTB CCGCAAGCTTCTGGGGAGGAAAC full-length 3' primer, HindIII cut site B-locus specific 24 ATAGGTCAGCATGGGAAC 3PEI CCGCGAATTCTCATCTCAGGGTG truncating 3' primer, EcoRI cut site 25 AG 3PEIHIS CCGCGAATTCTCAGTGGTGGTGG truncating 3' primer, EcoRI cut site adds hexa- 26 TGGTGGTGCCATCTCAGGGTGAG histidine tail 3PEIFLAG CCGCGATTCTCACTTGTCATCGT truncating 3' primer, EcoRI cut site adds FLAG- 27 CGTCCTTGTAATCCCATCTCAGG epitope GTGAG 5PKOZXB GGGCTCTAGACCGCCGCCACCAT either 5' primer, Xba I cut site C-locus specific 28 GCGGGTCATGGCGCC
[0107] Soluble B*1501, B*1501-HIS, and B*1501-FLAG were produced by culturing established transfectants in CP3000 hollow-fiber bioreactors as previously described by Prilliman et al, 1997, which has previously been incorporated herein by reference. Supernatants containing soluble class I molecules were collected in bioreactor harvests and purified on W6/32 affinity columns. At least 2 column purifications were performed per molecule.
[0108] Ligand Purification, Edman Sequencing, and Reverse-Phase HPLC Separation of Peptides
[0109] Peptide ligands were purified from class I molecules by acid elution (Prilliman, K R et al., Immunogenetics 48:89, 1998 which is expressly incorporated herein by reference) and further separated from heavy and light chains by passage through a stirred cell (Millipore, Bedford, Mass., USA) equipped with a 3-Kd cutoff membrane (Millipore). Approximately 1/100 volume of stirred cell flow through containing peptide eluted from either B*1501, B*1501-HIS, or B*1501-FLAG was subjected to 14 cycles of Edman degradation on a 492A pulsed liquid phase protein sequencer (Perkin-Elmer Applied Biosystems Division, Norwalk, Conn., USA) without the derivitization of cysteine. Edman motifs were derived by combining from multiple column elutions the picomolar yields of each amino acid and then calculating the fold increase over previous round as described in (Prilliman, K R et al, 1998, previously incorporated herein by reference) and are shown in FIG. 2.
[0110] Pooled peptide eluate was separated into fractions by RP-HPLC as previously described (Prilliman, K R et al, 1998, previously incorporated herein by reference). Briefly, 400-mg aliquots of peptides were dissolved in 100 ml of 10% acetic acid and loaded onto a 2.1 3 150 mm C18 column (Michrom Bioresources, Auburn, Calif., USA) using a gradient of 2%-10% acetonitrile with 0.06% TFA for 0.02 min followed by a 10%-60% gradient of the same for 60 min. Fractions were collected automatically at 1-min intervals with a flow rate of 180 ml/min.
[0111] Mass Spectrometric Ligand Analysis
[0112] RP-HPLC fractions were speed-vacuumed to dryness and reconstituted in 40 ml 50% methanol, 0.5% acetic acid. Approximately 6 ml from selected fractions were sprayed into an API-III triple quadrupole mass spectrometer (PE Sciex, Foster City, Calif., USA) using a NanoES ionization source inlet (Protana, Odense, Denmark). Scans were collected while using the following instrument settings: polarity--positive; needle voltage--1375 V; orifice voltage--65 V; N2 curtain gas--0.6 ml/min; step size--0.2 amu; dwell time--1.5 ms; and mass range--325-1400. Total ion traces generated from each molecule were compared visually in order to identify ions overlapping between molecules. Following identification of ion matches, individual ions were selected for MS/MS sequencing.
[0113] Sequences were predicted using the BioMultiView program (PE Sciex) algorithm predict sequence, and fragmentation patterns further assessed manually. Determinations of ion sequence homology to currently compiled sequences were performed using advanced BLAST searches against the nonredundant, human expressed sequence tag, and unfinished high throughput genomic sequences nucleotide databases currently available through the National Center for Biotechnology Information (National Institutes of Health, Bethesda, Md., USA).
[0114] The methodology of the present invention provides a direct comparative analysis of peptide ligands eluted from class I HLA molecules. In order to accomplish such comparative analyses, hollow-fiber bioreactors for class I ligand production were used along with reverse-phase HPLC for fractionating eluted ligands, and mass spectrometry for the mapping and sequencing of peptide ligands. The application of comparative ligand mapping also is applicable to cell lines that express endogenous class I. Prior to peptide sequence determination in class I positive cell lines, the effects of adding a C-terminal epitope tag to transfected class I molecules was found to have no deleterious effects. Either a tag consisting of 6 histidines (6-HIS) or a tag containing the epitope Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys (FLAG) was added to the C-terminus of soluble B*1501 through PCR. These constructs were then transfected into class I negative 721.221 cells and peptides purified as previously established (Prilliman, K R et al, 1998, previously incorporated herein by reference). Comparison of the two tailed transfectants with the untailed, soluble B*1501 allowed for the determination that tag addition had no effect on peptide binding specificity of the class I molecule and consequently had no deleterious effects on direct peptide ligand mapping and sequencing.
[0115] Edman Motifs
[0116] The most common means for discerning ligands presented by a particular class I molecule is Edman sequencing the pool of peptides eluted from that molecule. In order to demonstrate that tailing class I molecules with Cterminal tags does not disrupt endogenous peptide loading, Edman sequences of the peptide pools from B*1501, B*1501-HIS, and B*1501-FLAG was compared with previously published B*1501 data FIG. 2. Motifs were assigned to each of the various B*1501 molecules as shown in FIG. 2. At the anchor position 2 (P2) a dominant Q and subdominant M was seen in motifs as previously published by Falk et al. (Immunogenetics 41:165, 1995) and Barber et al. (J Exp Med 184:735, 1996). A more disparate P3 is seen in all molecules with F, K, N, P, R, and Y appearing; these results have also been previously reported by Falk and Barber. Again, a dominant Y and F are seen as the C-terminal anchors at P9 in all three molecules. The motif data for all three molecules are in close accord, therefore, with the published standard motifs.
[0117] Mass Spectrometric Profiles
[0118] Comparison of motifs for the surface bound, nontailed, and tailed B*1501 molecules identified no substantial differences in the pooled peptides bound by the various forms of B*1501 tested. However, the aim of the present invention is to subtractively compare the individual peptides bound by class I molecules from diseased and healthy cells. Subtractive analysis is accomplished through the comparison of mass spectrometric ion maps and, as such, the ion maps of tailed and untailed class I molecules were compared in order to determine the effect of tailing upon comparative peptide mapping.
[0119] Peptides derived from tailed and untailed B*1501 were separated into fractions via reverse phase HPLC (RP-HPLC). Each fraction was then scanned using an API-III mass spectrometer in order to identify ions present in each fraction. Overall ion scans from RP-HPLC fractions 9, 10, 11, 18, 19, and 20 were produced and visually compared in order to assess ions representing peptides overlapping between the three molecules. FIG. 3. depicts a representative section of the ion maps generated from each of the molecules. This comparison shows that the same pattern of ions is produced by the different B*1501 molecules analyzed here. The manual comparison of ion maps from each of the three fractions found little to no difference in the peptides bound by each of the three molecules.
[0120] Ligand Sequences
[0121] After identification of ion matches in MS chromatograms of each of the three molecules, individual ions were chosen for sequencing by tandem mass spectrometry in order to determine if ions were indeed matched at the peptide-sequence level. Ten ions from each fraction were initially selected for MS/MS sequence generation. Fragmentation patterns for each of the ions from each molecule were manually compared and identical fragmentation patterns were counted as peptide-sequence level matches, as illustrated in FIG. 4. Of the peptide fragmentation patterns examined, 52/57 (91%) were exact matches between the untailed molecules and the 6-HIStailed protein (TABLE VI). A more disparate pattern of fragmentation was identified in the FLAG-tailed ions selected for MS/MS sequencing: of the 57 ions selected for MS/MS fragmentation comparison, 39 (70%) fragmentation patterns matched between the FLAG-tailed and untailed molecules. Overall, 91 out of 113 (81%) spectra examined were in accord between the tailed molecules and soluble B*1501.
TABLE-US-00012 TABLE VI Molecules Ions Examined Ion Matches Percent Matched B*1501-HIS 57 52 91% B*1501-FLAG 56 39 70% B*1501-Tagged 113 91 81%
[0122] Several ligand sequences were clearly determined from the fragmentation patterns produced. The ligand QGLISRGYSY, deriving from human periplakin, was sequenced from those peptides eluted in fraction 18. A second ligand, AVRDISEASVF, an 11-mer matching a span of the 40S ribosomal protein S26, was identified in fraction 20. Notably, these two peptides lacked the strong consensus glutamine expected by the motif data, a phenomenon previously reported by our laboratory when sequencing B*1501-eluted ligands (Prilliman, K R et al, 1997, previously incorporated herein by reference). Both these ligands, however, terminate with an aromatic tyrosine or phenylalanine; these amino acids were both predicted to be strong anchors by Edman sequencing data and by previously published observations (Prilliman, K R et al, 1998, previously incorporated herein by reference).
[0123] One embodiment of the present invention contemplates characterizing peptide ligands bound by a given class I molecule by transfecting that molecule into a class I negative cell line and affinity purification of the class I molecule and bound peptide. Complications arise, however, when cell lines are chosen for study that already possess class I molecules. In this case, antibodies specific for one class I molecule must be used to selectively purify that class I molecule from others expressed by the cell. Because allele-specific antibodies recognize epitopes in and around the peptide binding groove, variations in the peptides found in the groove can alter antibody affinity for the class I molecule (Solheim, J C et al., J Immunol 151:5387, 1993; and Bluestone, J A et al., J Exp Med 176:1757, 1992). Altered antibody recognition can, in turn, bias the peptides available for elution and subsequent sequence analysis.
[0124] In order to selectively purify from a class I positive cell a transfected class I molecule and its peptide ligands in an unbiased way, it was necessary to alter the embodiment for class I purification in a non-class I positive cell. The C-terminal addition of a FLAG and 6-HIS tag to a class I molecule that had already been extensively characterized, B*1501 was shown to have little or no effect on peptide binding. This methodology was designed to allow purification of a single class I specificity from a complex mixture of endogenously expressed class I molecules. Ligands eluted from the tailed and untailed B*1501 molecule were compared to assess the effect of a tail addition on the peptide repertoire.
[0125] Pooled Edman sequencing is the commonly used method to determine the binding fingerprint of a given molecule, and this methodology was used to ascertain the large-scale effect of tail addition upon peptide binding. We subjected 1/100 of the peptides eluted from each class I MHC molecule to Edman degradation and derived motifs for each of the molecules. Both the HIS- and FLAG-tailed motifs matched published motifs for the soluble and membrane-bound B*1501. Each of the molecules exhibited motifs bearing a dominant P2 anchor of Q, a more disparate P3 in which multiple residues could be found, and another dominant anchor of Y or F at P9. Small differences in the picomolar amounts of each of the amino acids detected during Edman sequencing have been noted previously in consecutive runs with the same molecule and most likely reflect differences in cell handling and/or peptide isolation rather than disparities in bound peptides. Highly similar peptide motifs indicated that the peptide binding capabilities of class I MHC molecules are not drastically altered by the addition of a tag.
[0126] In order to insure the ligands were not skewed after tag addition, MS and MS/MS were used for the mapping and sequencing of individual peptides, respectively. Peptide mixtures subjected to MS provided ion chromatograms (FIG. 3) that were used to compare the degree of ion overlap between the three examined molecules. Extensive ion overlap indicates that the peptides bound by these tailed and untailed B*1501 molecules were nearly identical.
[0127] Selected ions were then MS/MS sequenced in order to confirm that mapped ion overlaps indeed represented exact ligand matches through comparison of fragmentation patterns between the three molecules (FIG. 4). Approximately 60 peptides were chosen initially for MS/MS--ten from each fraction. Overall, fragmentation patterns were exact matches in a majority of the peptides examined (TABLE VI). Fragmentation patterns categorized as nonmatches resulted from a mixture of peptides present at the same mass to charge ratio, one or more of which was present in the tagged molecule and not apparent in the spectra of the same ion from B*1501. Of the sequence-level matches, ligands derived from HIS-tailed molecules more closely matched those derived from B*1501 than those eluted from FLAG-tailed molecules. In total, 52/57 HIS peptides were exact matches, whereas 39/56 FLAG peptides were equivalent. Thus, the data indicates that the 6-HIS tag is less disruptive to endogenous peptide binding than is the FLAG-tag, although neither tag drastically altered the peptides bound by B*1501.
[0128] A handful of individual ligand sequences present in fractions of peptides eluted from all three molecules were determined by MS/MS. The two clearest sequences, AVRDISEASVF and QGLISRGYSY, demonstrate that tailed class I molecules indeed load endogenous peptide ligands. This supports the hypothesis that addition of a C-terminal tag does not abrogate the ability of the soluble HLA-B*1501 molecule to naturally bind endogenous peptides. Further, both peptide sequences closely matched those previously reported for B*1501 eluted peptides having a disparate N-terminus paired with a more conserved C-terminus consisting of either a phenylalanine or a tyrosine. Given the homologous Edman sequence, largely identical fragmentation patterns, and the peptide ligands shared between the three molecules, we conclude that addition of a C-terminal tag does not significantly alter the peptides bound by B*1501.
[0129] Mapping and subtractively comparing eluted peptides is a direct means for identifying differences and similarities in the individual ligands bound by a class I HLA molecule. Indeed, subtractive comparisons demonstrate how overlapping ligands bind across closely related HLA-B15 subtypes as well as pointing out which ligands are unique to virus-infected cells. Direct comparative analyses of eluted peptide ligands is well suited for a number of purposes, not the least of which is viral and cancer CTL epitope discovery. Addition of a C-terminal epitope tag provides a feasible method for production and purification of class I molecules, and therefore, peptide ligands in cell lines capable of sustaining viral infection or harboring neoplastic agents, as illustrated in FIG. 5. Direct peptide analysis from such lines should yield important information on host control of pathogenic elements as well as provide important building blocks for rational vaccine development.
[0130] The present invention further relates in particular to a novel method for detecting those peptide epitopes which distinguish the infected/tumor cell from the uninfected/non-tumor cell. The results obtained from the present inventive methodology cannot be predicted or ascertained indirectly; only with a direct epitope discovery method can the unique epitopes described herein be identified. Furthermore, only with this direct approach can it be ascertained that the source protein is degraded into potentially immunogenic peptide epitopes. Finally, this unique approach provides a glimpse of which proteins are uniquely up and down regulated in infected/tumor cells.
[0131] The utility of such HLA-presented peptide epitopes which mark the infected/tumor cell are three-fold. First, diagnostics designed to detect a disease state (i.e., infection or cancer) can use epitopes unique to infected/tumor cells to ascertain the presence/absence of a tumor/virus. Second, epitopes unique to infected/tumor cells represent vaccine candidates. Here, we describe epitopes which arise on the surface of cells infected with HIV. Such epitopes could not be predicted without natural virus infection and direct epitope discovery. The epitopes detected are derived from proteins unique to virus infected and tumor cells. These epitopes can be used for virus/tumor vaccine development and virus/tumor diagnostics. Third, the process indicates that particular proteins unique to virus infected cells are found in compartments of the host cell they would otherwise not be found in. Thus, we identify uniquely upregulated or trafficked host proteins for drug targeting to kill infected cells.
[0132] The present invention describes, in particular, peptide epitopes unique to HIV infected cells. Peptide epitopes unique to the HLA molecules of HIV infected cells were identified by direct comparison to HLA peptide epitopes from uninfected cells.
[0133] As such, and only by example, the present method is shown to be capable of identifying: (1) HLA presented peptide epitopes, derived from intracellular host proteins, that are unique to infected cells but not found on uninfected cells, and (2) that the intracellular source-proteins of the peptides are uniquely expressed/processed in HIV infected cells such that peptide fragments of the proteins can be presented by HLA on infected cells but not on uninfected cells.
[0134] The method of the present invention also, therefore, describes the unique expression of proteins in infected cells or, alternatively, the unique trafficking and processing of normally expressed host proteins such that peptide fragments thereof are presented by HLA molecules on infected cells. These HLA presented peptide fragments of intracellular proteins represent powerful alternatives for diagnosing virus infected cells and for targeting infected cells for destruction (i.e., vaccine development).
[0135] A group of the host source-proteins for HLA presented peptide epitopes unique to HIV infected cells represent source-proteins that are uniquely expressed in cancerous cells. For example, through using the methodology of the present invention a peptide fragment of reticulocalbin is uniquely found on HIV infected cells. A literature search indicates that the reticulocalbin gene is uniquely upregulated in cancer cells (breast cancer, liver cancer, colorectal cancer). Thus, the HLA presented peptide fragment of reticulocalbin which distinguishes HIV infected cells from uninfected cells can be inferred to also differentiate tumor cells from healthy non-tumor cells. Thus, HLA presented peptide fragments of host genes and gene products that distinguish the tumor cell and virus infected cell from healthy cells have been directly identified. The epitope discovery method of the present invention is also capable of identifiying host proteins that are uniquely expressed or uniquely processed on virus infected or tumor cells. HLA presented peptide fragments of such uniquely expressed or uniquely processed proteins can be used as vaccine epitopes and as diagnostic tools.
[0136] The methodology to target and detect virus infected cells may not be to target the virus-derived peptides. Rather, the methodology of the present invention indicates that the way to distinguish infected cells from healthy cells is through alterations in host encoded protein expression and processing. This is true for cancer as well as for virus infected cells. The methodology according to the present invention results in data which indicates without reservation that proteins/peptides distinguish virus/tumor cells from healthy cells.
[0137] Example of Comparative Ligand Mapping in Infected and Uninfected Cells Creation of Soluble Class I Construct
[0138] EBV-transformed cell lines expressing alleles of interest (particularly A*0201, B*0702, and Cw*0702) were grown and class I HLA typed through the sequenced-based-typing methodology described in Turner et al. 1998, J. Immunol, 161 (3) 1406-13) and U.S. Pat. No. 6,287,764 Hildebrand et al. both of which are expressly incorporated herein in their entirety by reference. Total RNA was 5pXI and 3pEI, producing a product lacking the cytoplasmic and transmembrane domains. Alternatively, a 3' primer encoding a hexa-histidine or FLAG epitope tag was placed on the C-terminus using the primers, 3pEIHIS or 3pEIFLAG (TABLE V). For the C-locus, a 5' primer was used encoding the Kozak consensus sequence. (Davis, et al. 1999. J. Exp. Med. 189: 1265-1274). Each construct was cut with the appropriate restriction endonculease (see TABLE V) and cloned into the mammalian expression vector pcDNA 3.1- (Invitrogen, Carlsbad, Calif.) encoding either a resistance gene for G418 sulfate or Zeocin (Invitrogen).
[0139] Transfection in Sup-T1 cells. Sup-T1 T cells were cultured in RPMI 1640+20% fetal calf serum at 37° C. and 5% CO2. Cells were split daily in order to maintain log-phase growth. Plasmid DNA was purified using either Qiagen Midi-prep kits (Qiagen, Santa Clarita) or Biorad Quantum Prep Midiprep Kit (Biorad, Hercules, Calif.) according to the manufacturer's protocol and resuspended in sterile DNAse-free water. Cells were electroporated with 30 gs of plasmid DNA at a voltage of 400 mV and a capacitance of 960 F. Decay constants were monitored throughout electroporation and only transfections with decay times under 25 mS were carried through to selection. Selection was performed on day 4 post-transfection with 0.45 mg/mL Zeocin (Invitrogen) selective medium containing 30% fetal calf with the pH adjusted visually to just higher than neutral. Cells were resuspended in selective medium at 2×10 6 cells per ml, fed until they no longer turned the wells yellow (using the pH indicated Phenol Red (Mediatech)), and allowed to sit until cells began to divide. After the appearance of active division, cells were slowly fed with selective medium until they reached medium (T-75) tissue culture flasks. Cells were then subcloned at limiting dilutions of 0.5, 1, and 1.5 cells per well in 96-well tissue culture plates. Cells were allowed to sit until well turned yellow; they were then gradually moved to 24 well plates and small (T-25) tissue culture flasks. Samples were taken for soluble class I ELISA, and the best producers of class I were frozen for later use at 5×10 6 cells/ml and stored at -135° C.
[0140] Soluble MHC class I ELISA. ELISAs were employed to test the concentration of the MHC class I/peptide complexes in cell culture supernatants. The monoclonal antibody W6/32 (ATCC, Manassas, Va.) was used to coat 96-well Nunc Starwell Maxi-sorp plates (VWR, West Chester, Pa.). One hundred ls of test sample containing class I was loaded into each well of the plate. Detection was with anti-aB2 microglobulin (light chain) antibody conjugated to horse-radish peroxidase followed by incubation with OPD (Sigma, St. Louis, Mo.). ELISA values were read by a SpectraMax 340 00A, Rom Version 2.04, February 1996, using the program Softmax Pro Version 2.2.1 from Molecular Devices. For determination of MHC class I complex in carboys prior to affinity purification (see below), each sample was tested in triplicate on at least 2 separate plates. Uninfected and infected harvest concentrations were read on the same plate and uninfected samples were brought to 1% Triton X 100 prior to loading on the ELISA plate. This was in an attempt to minimize variability in mass spectra generate due to large differences in the amount of peptide loaded onto affinity columns.
[0141] Full-length construct creation. Full-length constructs (in the pcDNA3.1-/G418 sulfate resistance vector) were created and transfected into the class I negative B-LCL 721.221 and T2. Both cell lines were cultured in RPMI-1640+10% fetal calf serum until growing at log phase. Cells were electroporated at 0.25 V and 960 F capacitance. After 2 days, the cells were pelleted and resuspended in selective medium consisting of RPMI-1640+20% FCS+1.5 mg/ml G418 sulfate (Mediatech, Herndon, Va.). Cells were treated in the same manner as above (Sup-T1 transfection) after this point.
[0142] Cell pharm production. Eight liters of Sup-T1 soluble MHC class I transfectants cultured in roller bottles in RPMI-1640+15% FCS+100 U penicillin/streptomycin were centrifuged for 10 min at 1100×g. Supernatant was discarded and a total of 3×10 9 total cells were resuspended in 200 mls of conditioned medium. Infected cells were then added to a feed bottle and inoculated through the ECS feed pump of a Unisyn CP2500 cell pharm (Unisyn, Hopkington, Mass.) into 30 kD molecular-weight cut-off hollow-fiber bioreactors previously primed with RPMI-1640 containing 20% fetal calf serum. Cells were allowed to incubate overnight in the bioreactor at a temperature of 37° C. and at a pH of 7.20 maintained automatically through CO2 injection into the medium reservoir of the system. No new medium was introduced into the system during this time period and the ICS recirculation was maintained at a low value of 400 mls/minutes. ECS feed was begun 12 hours post inoculation at a rate of 100 mls/day with 15% FCS supplemented RPMI-1640; ICS feed was likewise begun at a rate of 1 L/day. ECS recirculation was initiated at day 2 post-inoculation at a rate of 4 L/day. ECS and ICS samples were taken at 24-hour intervals and sHLA ELISAs (see above) and glucose tests performed. ECS and ICS feed rates as well as ECS and ICS recirculation rates were adjusted based on increasing concentrations of sHLA in the harvest and decreasing levels of glucose in the ICS medium.
[0143] Virus production and infection HIV MN-1 production. HIV MN-1 cloned virus (Genbank Accession Number M17449) was thawed from frozen stock and used to infect 25×10 6 non-transfected Sup-T1 (Denny C T, et. al. 1986. Nature. 320:549.51, which is expressly incorporated herein in its entirety by reference) T cells using standard methods. Cells were cultured in RPMI-1640+20% fetal bovine serum (MediaTech) for 5 days and observed for syncitia formation. Upon formation of syncitia, new cells were added in fresh RPMI-1640/20% FCS. Culture was continued for 5 more days when 100 mls of infected cells were removed. Supernatant was passed through a 0.45 um filter and cell-free virus was aliquotted and stored at -80° C. This process was continued until an appropriate amount of virus was harvested.
[0144] [ ] HIV-1 NL4-3 production. The infectious molecular clone pNL4-3 (Genbank Accession Number AF324493) was transformed into the Esherichia coli strain Top10F' (Invitrogen, Carlsbad, Calif.). Plasmid DNA was midiprepped from transformed cells using either the Qiagen Midi Prep Kit (Qiagen, Santa Clarita, Calif.) or the Biorad Quantum Prep Midiprep Kit (Biorad, Hercules, Calif.) according to the manufacturer's instructions. Plasmid DNA was used to transfect 293T cells (GenHunter Corporation, Nashville, Tenn.) using Roche's Fugene 6 reagent (Roche, Basel, Switzerland) following the manufacturer's protocol. Virus-containing supernatant was harvested at 24, 48, and 72 hours, clarified by centrifugation at 500×g for 10 min, aliquotted, and stored at -80° C. Sup-T1 transfectants containing either soluble A*0201, B*0702, or Cw*0702 were cocultured with virus resulting in high-titre virus. After 72 hours, infected cells were centrifuged at 1100×g for 10 minutes. Supernatant containing cell-free virus was removed, passed through a 0.45 m filter, aliquotted, and stored at -80° C. Virally-infected cells were resuspended in freeze medium (RPMI-1640+20% FCS+10% DMSO) at approximately 6×10 6 cells per ml and stored at -80° C.
[0145] Viral Titer Determination. One vial of frozen viral stock derived from either strain of HIV was thawed and used in a TCID50 assay scored two ways: 1) wells containing at least 3 syncitia were considered positive or 2) wells containing over 50 ng/ml p24 antigen as determined by ELISA were considered positive. The TCID50 was then calculated using the Spearman-Karber method (DAIDS Virology Manual for HIV Laboratories, January 1997). The average of both scoring methods was used as the final titer of the virus. As a second means of viral titer monitoring, viral stock was used undiluted in a p24 ELISA (Beckman Coulter, Miami, Fla.) in order to determine the ngs of p24 present in cell-free virus.
[0146] P24 ELISA. Determination of HIV p24 major core protein was determined by the commercially available Beckman Coulter p24 ELISA according to the manufacturer's instructions with the exceptions of the following modifications: samples were treated with 10% Triton-X 100 prior to removal from a BSL-3 facility, therefore the inactivation medium included in the kit was not used. Secondly, samples were serially diluted in water prior to use.
[0147] Hollow-fiber bioreactor culture of infected cells. All work including large-scale culture of HIV was performed in a Biosafety Level 3 Laboratory in accordance with guidelines set forth by the National Institutes of Health. HIV MN-1 frozen viral stock aliquots were thawed and pooled to a 100 ml total volume, containing approximately 5.5×10 6 TCID50's. Eight liters of Sup-T1 soluble MHC class I transfectants cultured in roller bottles in RPMI-1640+15% FCS+100 U penicillin/streptomycin were centrifuged for 10 min at 1100×g. Supernatant was discarded and a total of 3×10 9 total cells were resuspended in 200 mls of conditioned medium. The 100 mls of cell-free HIV MN-1 was then added to the resuspended cells and incubated at 37° C. in %5 CO2 for 2 hours with gentle shaking every 20 minutes. Infected cells were then added to a feed bottle and inoculated through the ECS feed pump of a Unisyn CP2500 cell pharm (Unisyn, Hopkington, Mass.) into 30 kD molecular-weight cut-off hollow-fiber bioreactors previously primed with RPMI-1640 containing 20% fetal calf serum. Cells were allowed to incubate overnight in the bioreactor at a temperature of 37° C. and at a pH of 7.20 maintained automatically through CO2 injection into the medium reservoir of the system. No new medium was introduced into the system during this time period and the recirculation was maintained at a low value of 400 mls/minutes. ECS feed was begun 12 hours post inoculation at a rate of 100 mls/day with 15% FCS supplemented RPMI-1640; ICS feed was likewise begun at a rate of 1 L/day. ECS and ICS samples were taken at 24-hour intervals, inactivated by addition of Triton-X 100 to 1%, and sHLA ELISAs, p24 ELISAs, and glucose tests performed as described above. ECS and ICS feed rates as well as ECS and ICS recirculation rates were adjusted based on increasing concentrations of sHLA in the harvest and decreasing levels of glucose in the ICS medium.
[0148] Soluble HLA purification. Soluble-HLA containing supernatant was removed in 1.9 L volumes from infected hollow-fiber bioreactors. Twenty-percent Triton-X 100 was sterilized and placed in 50 ml aliquots in 60 mls syringes; 2 syringes were injected into each 1.9 L harvest bottle as it was removed from the cell pharm, resulting in a final TX 100 percentage of 1%. Bottles were inverted gently several times to mix the TX 100 and stored at 4° C. for a minimum of 1 week. After 1 week, harvest was centrifuged at 2000×g for 10 minutes to remove cellular debris and pooled into 10 L carboys. An aliquot was then removed from the pooled, HIV-inactivated supernatant and used in a quantitative TCID50 assay (as described above) and used to initiate a coculture with Sup-T1's. Only after demonstration of a completely negative coculture as well as TCID50 were harvests removed from the BSL-3.
[0149] Class I/Peptide Production and Peptide Characterization Handling of MHC class I/peptide complexes from infected cells. Each 10 L of cell pharm harvest was separated strictly on a temporal basis during the cell pharm run. (This was an attempt to assess any epitopic changes that might occur temporally during infection as opposed to those that might occur more globally.) Harvest was treated exactly as described above, except for the removal of a 2 ml aliquot for tests in both a TCID50 assay and cell coculture assay to determine infectivity of the virus.
[0150] Affinity purification of infected and uninfected MHC class I complexes. Uninfected and infected harvest removed from CP2500 machines were treated in an identical manner post-removal from the cell pharm. Approximately 50 mgs total class I as measured by W6/32 ELISA (see above) were passed over a Pharmacia XK-50 (Amersham-Pharmacia Biotech, Piscataway, N.J.) column packed with 50 mls Sepharose Fast Flow 4B matrix (Amersham) coupled to W6/32 antibody. Bound class I complexes were washed first with 1 L 20 mM sodium phosphate wash buffer, followed by a wash with buffer containing the zwitterionic detergent Zwittergent 3-08 (Calbiochem, Merck KgaA, Darmstadt, Germany) at a concentration of 10 mM, plus NaCl at 50 mM, and 20 mM sodium phosphate. The zwittergent wash was monitored by UV absorption at a wavelength of 216 nm for removal of Triton-X 100 hydrophobically bound to the peptide complexes. After 1 L of wash had passed over the column (more than a sufficient amount for the UV to return to baseline), zwittergent buffer was removed with 2 L of 20 mM sodium phosphate wash buffer. Peptides were eluted post wash with freshly made 0.2N acetic acid, pH 2.7.
[0151] Peptide isolation and separation. Post-elution, peptide-containing eluate fractions were brought up to 10% glacial acetic acid concentration through addition of 100% glacial acetic acid. Fractions were then pooled into a model 8050 stirred cell (Millipore, Bedford, Mass.) ultrafiltration device containing a 3 kD molecular-weight cutoff regenerated cellulose membrane (Millipore). The device was capped and tubing parafilmed to prevent leaks and placed in a 78° C. water bath for 10 minutes. Post-removal, the peptide-containing elution buffer was allowed to cool to room temperature. The stirred cell was operated at a pressure of 55 psi under nitrogen flow. Peptides were collected in 50 ml conical centrifuge tubes (VWR, West Chester, Pa.), flash frozen in super-cooled ethanol, and lyophilized to dryness. Peptides were resuspended either in 10% acetic acid or 10% acetonitrile. Peptides were purified through a first-round of HPLC on a Haisil C-18 column (Higgins Analytical, Moutain View, Calif.), with an isocratic flow of 100% B (100% acetonitrile, 0.01% TFA) for 40 minutes. Following elution, peptide-containing fractions were pooled, speed-vacuumed to dryness, and resuspended in 150 ls of 10% acetic acid. Two igs of the base methyl violet were added to the peptide mixture in 10% acetic acid and this was loaded onto a Haisil C-18 column for fractionation. Peptides were fractionated by one of two methods, the latter resulting in increased peptide resolution. The first fractionation program was 2-10% B in 2 minutes, 10-60% B in 60 minutes, with 1 minute fraction collection. The second RP-HPLC gradient consisted of a 2-14% B in 2 minutes, 14-40% B in 60 minutes, 40-70% B in 20 minutes, with 1 minute fraction collection. Peptides eluting in a given fraction were monitored by UV absorbance at 216 nm. Separate but identical (down to the same buffer preparations) peptide purifications were done for each peptide-batch from uninfected and infected cells.
[0152] Mass-spectrometric mapping of fractionated peptides. Fractionated peptides were mapped by mass spectrometry to generate fraction-based ion maps. Fractions were speed-vacuumed to dryness and resuspended in 12 ls 50:50 methanol:water+0.05% acetic acid. Two ls were removed and sprayed via nanoelectrospray (Protana, Odense, Denmark) into a Q-Star quadrupole mass spectrometer with a time-of-flight detector (Perseptive SCIEX, Foster City, Calif.). Spectra were generated for masses in the range of 50-1200 amu using identical mass spectrometer settings for each fraction sprayed. Spectra were then base-line subtracted and analyzed using the programs BioMultiview version 1.5beta9 (Persceptive SCIEX) or BioAnalyst version 1.0 (Persceptive SCIEX). Spectra from the same fraction in uninfected/infected cells were manually aligned to the same mass range, locked, and 15 amu increments visually assessed for the presence of differences in the ions represented by the spectra (for an example, see Hickman et al. 2000. Human Immunology. 61:1339-1346 which is expressly incorporated herein by reference). Ions were selected for MS/MS sequencing based on upregulations or downregulation of 1.5 fold over the same ion in the uninfected cells, or the presence or absence of the ion in infected cells. Ions were thus categorized into multiple categories prior to MS/MS sequencing.
[0153] Tandem mass-spectrometric analysis of selected peptides. Lists of ions masses corresponding to each of the following categories were generated: 1) upregulated in infected cells, 2) downregulated in infected cells, 3) present only in infected cells, 4) absent in infected cells, and 5) no change in infected cells. The last category was generally disregarded for MS/MS analysis and the first 4 categories were subjected to MS/MS sequencing on the O-Star mass spectrometer. Peptide-containing fractions were sprayed into the mass spectrometer in 3 l aliquots. All MS settings were kept constant except for the Q0 and Cad gas settings, which were varied to achieve the best fragmentation. Fragmentation patterns generated were interpreted manually and with the aid of BioMultiView version 1.5 beta 9. No sequencing algorithms were used for interpretation of data, however multiple web-based applications were employed to aid in peptide identification including: MASCOT (Perkins, D N et al. 1999. Electrophoresis. 20(18):3551-3567), Protein Prospector (Clauser K. R. et al. 1999. Analytical Chemistry. 71:2871), PeptideSearch (narrador.emblheidelberg.de/GroupPages/PageLink/Peptidesearchpage.html) and BLAST search (www.ncbi.nlm.nih.gov/BLAST/).
[0154] Quality control of epitope changes. Multiple parameters were established before peptides identified in the above fashion were deemed "upregulated," "downregulated," etc. First, the peptide fractions before and after the fraction in which the peptide was identified were subjected to MS/MS at the same amu under the identical collision conditions employed in fragmentation of the peptide-of-interest and the spectra generated overlaid and compared. This was done to make sure that, in the unlikely event that the peptides had fractionated differently (even with methyl-violet base B standardization) there was not the presence of the peptide in an earlier or later fraction of the uninfected or infected peptides (and that the peptides had truly fractionated in an identical manner.) Secondly, the same amu that was used to identify the first peptide was then subjected to MS/MS in the alternate fraction (either infected or uninfected, whichever was opposite of the fraction in which the peptide was identified.) Spectra again were overlaid in order to prove conclusively that the fragmentation patterns did not match and thus the peptide was not present in the uninfected cells, or, in the case that the fragmentation patterns did match, that the peptides were upregulated in the infected cells. Finally, synthetic peptides were generated for each peptide identified. These peptides were resuspended in 10% acetic acid and RP-HPLC fractionated under the same conditions as employed for the original fractionation, ensuring that the peptide putatively identified had the same hydrophobicity as that of the ion MS/MS fragmented. This synthetic peptide was MS/MS fragmented under the same collision conditions as that of the ion, the spectra overlaid, and checked for an exact match with the original peptide fragment.
[0155] Functional Analysis\Literature Searches. After identification of epitopes, literature searches were performed on source proteins to determine their function within the infected cell. Broad inferences can be made from the function of the protein. Source proteins were classified into groups according to functions inside the cell. Again, broad inferences can be made as to the groups of proteins that would be available for specific presentation solely on infected cells. Secondly, source proteins were scanned for other possible epitopes which may be bound by other MHC class I alleles. Peptide binding predictions (Parker, K. C., et. al. 1994. J. Immunol. 152:163) were employed to determine if other peptides presented from the source proteins were predicted to bind. Proteasomal prediction algorithms (A. K. Nussbaum, et. al. 2001. Immunogenetics 53:87-94) were likewise employed to determine the likelihood of a peptide being created by the proteasome.
[0156] Sequence Identification. A discussion of the results seen with the application of this procedure is included using the peptide GPRTAALGLL as an example. Other examples and data obtained based on the methodology are listed in TABLE VII.
TABLE-US-00013 TABLE VII ION FRACTION SEQUENCE MW OBS'D MW Peptides Identified on Infected cells that are not present on Uninfected Cells 612.720 32INF EQMFEDIISL 1223.582 1223.418 509.680 31INF IPCLLISFL 1017.601 1017.334 469.180 31INF STTAICATGL 936.466 936.360 420.130 16INF APAQNPEL 838.426 838.259 500.190 28INF LVMAPRTVL 998.602 998.396 529.680 31INF APFI[NS]PADX 1057.388 523.166 12INF TPQSNRPVm 1044.500 1044.333 444.140 16INF AARPATSTL 887.495 887.280 470.650 16INF MAMMAALMA 940.413 939.410 490.620 16INF IATVDSYVI 979.240 563.640 16INF SPNQARAQAAL 1126.597 1126.364 30INF GPRTAALGLL 968.589 968.426 556.150 16INF NPNQNKNVAL 1111.586 1111.300 Peptides Identified on Uninfected cells that are not present on Infected cells 16UNINF GSHSMRY ION SOURCE PROTEIN START AA ACCESSION # Peptides Identified on Infected cells that are not present on Uninfected Cells 612.720 HIV MN-1, ENV 101 509.680 CHOLINERGIC RECEPTOR, 250 ALPHA-3 POLYPEPTIDE 469.180 UBIQUITIN-SPECIFIC PROTEASE 152 10720340 (SEQ ID NO: 43) 420.130 B-ASSOCIATED TRANSCRIPT PROTEIN 3 (BAT3) 500.190 HLA-B HEAVY CHAIN LEADER SEQUENCE 2 4566550 (SEQ ID NO: 44) 529.680 UNKNOWN, CLOSE TO SEVERAL cDNA's 523.166 RNA POLYMERASE II POLYPEPTIDE A 527 4505939 (SEQ ID NO: 45) 444.140 EUK, TRANSLATION INITIATION FACTOR 4 1073 Q04637 (SEQ ID NO: 46) 470.650 SPARC-LIKE PROTEIN 19 478522 (SEQ ID NO: 47) 490.620 TENASCIN-C (HEXABRACHION) 1823 13639246 (SEQ ID NO: 48) 563.640 POLYPYRIMIDINE TRACT-BINDING PROTEIN 1 141 131528 (SEQ ID NO: 49) RETICULOCALBIN 4 4506457 (SEQ ID NO: 50) 556.150 ELAV (HuR) 188 4503551 (SEQ ID NO: 51) Peptides Identified on Uninfected cells that are not present on Infected cells MHC CLASS I HEAVY CHAIN variable multiple (could derive from multiple alleles, i.e., HLA-B*0702 or HLA-G, etc.) ION CATEGORY SEQ ID NO: Peptides Identified on Infected cells that are not present on Uninfected Cells 612.720 HIV-DERIVED 29 509.680 30 469.180 31 420.130 MHC GENE PRODUCT 32 500.190 MHC GENE PRODUCT 33 529.680 UNKNOWN 34 523.166 RNA MACHINERY/ 35 BINDING PR 444.140 RNA MACHINERY/ 36 BINDING PR 470.650 TUMOR SUPPRESSOR 37 GENE? 490.620 TUMOR SUPPRESSOR 38 GENE? 563.640 RNA MACHINERY/ 39 BINDING PR TUMOR SUPPRESSOR 40 GENE? 556.150 RNA MACHINERY/ 41 BINDING PR Peptides Identified on Uninfected cells that are not present on Infected cells MHC CLASS I 42 Product
[0157] The first step in identification of an epitope present only on uninfected cells is performing MS ion mapping. In this case, the reversed-phase HPLC fraction 30 obtained from HIV as disclosed hereinabove (which contains a fraction of the total class I peptides) was sprayed into the mass spectrometer and an ion spectrum created. FIG. 6 shows the sections of ion map in which an ion was first identified as upregulated. The ion at 484.74 can be seen to predominate in the upper map, which is the spectrum generated from peptides from the infected cells. One can also see that there are other peptides which differ in their intensities between the uninfected cells from one spectrum to another. After a peptide is initially identified, the area of the spectrum in which the peptide is found is zoomed in on in order to more fully see all the ions in the immediate area (FIG. 7). After zooming in on the area from 482-488 amu, the ion at 484.72 can be seen to only be present in the infected cells (which are seen in the spectrum on the top). A large difference such as this is not always seen, sometimes more minor differences are chosen for sequence determination. This ion, however, was considered an extremely good candidate for further analysis.
[0158] After identification of the ion, the next step in the process is to sequence the peptide by using tandem mass spectrometry. FIG. 8 shows the spectrum generated when the peptide is fragmented. These fragments are used to discern the amino acid sequence of the peptide. The sequence of this peptide was determined to be GPRTAALGLL. This peptide was isolated from infected HLA-B*0702 molecules. One early quality control step is examining the peptide's sequence to see if it fits the sequences that were previously shown to be presented by this molecule. B*0702 binds peptides that have a G at their second position (P2) and an L as their C-terminal anchor. Based on this information, this sequence is likely to be a peptide presented by B*0702.
[0159] Descriptive characterization of peptide. Once the peptide sequence is obtained, information is gained on the source protein from which the peptide was derived in the cytosol of the infected cell. Initially, a BLAST search (available at the National Center for Biotechnology website) is done to provide protein information on the peptide. A BLAST search with the sequence GPRTAALGLL pulled up the protein reticulocalbin 2. After the source protein is known, information about the protein is ascertained first from the PubMed (again available at the National Center for Biotechnology website) and put into a format to which one can easily refer as seen in FIG. 9. All of the accession numbers for the protein, as well as the original description of the protein are included. This makes it easy to come back to the information for downstream use. Also, the protein sequence is copied, pasted, and saved as a text document for incorporation into later searches. The peptide is highlighted in the entire protein, giving some context as to where it is derived and how large the total protein is. This is the initial data gathering step post-sequence determination.
[0160] The next step in characterizing the ligand is doing literature searches on the source protein from which the peptide was derived. The protein is entered into the PubMed database and all entries with the word "reticulocalbin" are retrieved. FIG. 10 illustrates the listing that is done to summarize what has previously been described for this protein. It can be seen that for reticulocalbin, multiple articles have been published involving this protein. The literature is summarized in a paragraph following the PubMed listings and put into the report. For reticulocalbin, some of the most interesting points are that it is an ER resident protein, which can lead to speculation on why it is presented on infected cells. Secondly, it has been previously found to be upregulated in several other types of cancers, such as breast and colorectal cancers. This again leads to speculation that this protein may be broadly applicable to treat more maladies than those caused by HIV. It is also determined whether or not this protein has been previously cited as interacting with/or being interfered with by HIV. This was not seen for reticulocalbin and thus was not listed in the report, (although in some instances it is seen.) A broad understanding of the protein is gained through literature searches.
[0161] Predictive characterization of peptide. After the literature search, several secondary searches are performed. FIG. 11 illustrates the results of a peptide-binding algorithm performed using Parker's Prediction (which is described hereinabove). The entire source protein is used for input and the computer generates a list of peptides which are bound by the HLA allele chosen. In this case, B*0702 was chosen because that was the allele from which this peptide was derived. From the black arrow in the figure, it can be seen that the peptide sequenced by mass spectrometry is predicted to bind to HLA-B*0702 with a high affinity. Several other peptides are listed that are predicted to bind as well. FIG. 12 shows the same procedure being performed with the source peptide using another well-known search engine, SYPEITHI. (This engine can be found on the worldwide web using the URL: syfpeithi.bmi-heidelberg.com/Scripts/MHCServer.dll/EpPredict.htm.) Again, the results from this search engine for B*0702 shows that this peptide is predicted to bind to HLA-B*0702 with a high affinity. Also, multiple other peptides are predicted to be derived from this source protein and bound. This prediction allows us to determine several things. First, we can tell if the peptide is predicted to be bound by previous algorithms. This allows us to know how well the programs work, and/or if other people could identify this peptide (if they had the source protein) from peptide binding algorithms. All of this information can be translated into increasing importance for the present inventive methodology not only for the peptide but also for the source protein itself.
[0162] After peptide-binding algorithms are performed, searches are done to determine whether the peptides would be created by the proteasome during normal processing of proteins into peptides. It should be strongly noted that multiple pathways for class I peptide loading are now being demonstrated and that the cleavage algorithms for human proteasomes are not well established by any means. While a positive result may indicate that the proteasome is largely responsible for cleavage, a negative result by no means indicates that the peptide is not presented in the class I molecule. FIG. 13 shows the results of the first proteasomal cleavage done for the source protein reticulocalbin using the cleavage predictor PaProC (available at URL paproc.de/). The epitope is outlined. By this prediction software, the peptide is not predicted to be cleaved by the normal proteasome. This may mean that in infected cells, alternative pathways of MHC class I presentation are being used, particularly in reference to the reticulocalbin peptide. This, in turn, may present novel methods for therapeutics during viral infection. A second proteasomal cleavage search is also employed using the prediction software NetChop (available on the worldwide web) as seen in FIG. 14. By this prediction and other data from current literature in the field, the peptide would be created by the proteasome and cleaved to form the GPRTAALGLL identified.
[0163] A third round of analysis involves only the source protein. All other alleles are tested for peptide binding and lists of the highest binders generated. The proteasomal cleavage predictions are then referred to in order to elucidate how these peptides are generated. This information is useful for downstream testing of peptides and for determining whether or not this protein will be applicable for vaccine trials covering a broad range of HLA alleles. For reticulocalbin, multiple high-affinity peptides were demonstrated for differing HLA alleles (some examples of which are shown in FIG. 15) In this figure, several high affinity peptides deriving from reticulocalbin were identified for HLA-A*0201 and A*0101.
[0164] Quality control of sequence determination. There currently exists no direct means to score the quality of MS/MS sequence data. Once all descriptive and predictive steps are concluded, we return again to the original peptide sequence for quality control to ensure that the peptide is indeed what we have identified as the amino acid sequence and that the peptide is truly present only in infected cells. We employ these multiple steps so there is no doubt that the sequence is truly what we claim it to be before we move on to downstream applications involving the peptide.
[0165] Initially, we determine that the peptide is truly upregulated or present only in infected cells. For the reticulocalbin peptide, we determined that this peptide was probably only present in infected cells. In order to make certain that the peptide was truly absent in the uninfected cells and that there was no chance that our RP-HPLC fractionation had differed (remembering that we use internal controls for our fractionation as well) we generated ion spectra using MS from the fractions before and after the one in which we identified the peptide. In the case of the reticulocalbin peptide, we identified the peptide in fraction 30, so we performed MS on fractions 29 and 31 (FIG. 16). In FIG. 16, it can be seen that there is no substantial peak at the m/z 484.72. This indicated that there was not differential fractionation and that the peptide truly was absent from uninfected cells. In the case that there was a peptide peak in one of the before or after fractions, we would then turn to MS/MS to determine whether this peak represented the ion we were characterizing or another ion with the same mass-to-charge ratio.
[0166] After determining that the peptide is not present in another fraction, MS/MS was preformed on the same m/z in the uninfected spectrum (in the same fraction) in order to conclusively prove that there is no peptide present with the same sequence in the uninfected cells. In FIG. 17 one can see that the fragmentation patterns produced under identical MS collision conditions are totally different. This illustrates the absence of the reticulocalbin peptide in the uninfected cells.
[0167] Finally, in order to conclusively prove that the peptide sequence is the same as that originally identified, we synthesize synthetic peptides consisting of the same amino acids as the peptide sequence identified from the MS/MS fragmentation pattern. For the reticulocalbin peptide (i.e., the ion in fraction 30 at 484.72) we synthesized the peptide "GPRTAALGLL." We then took this peptide and did MS/MS on the peptide under identical conditions as previously used. FIG. 18 illustrates the spectrum generated from MS/MS of the endogenously loaded reticulocalbin peptide. Matching spectra, as seen here, are indicators that this peptide sequence is GPRTAALGLL as almost every amino acid combination will generate a completely different set of fragments, both in terms of production of fragments and in terms of intensity of those fragments present. FIG. 18 shows the MS/MS endogenous and synthetic "GPRTAALGLL" peptide under identical collision conditions. As can be seen, the MS/MS graphs are virtually identical.
[0168] In accordance with the present invention, one peptide ligand (i.e., "GPRTAALGLL") has been identified as being presented by the B*0702 class I MHC molecule in cells infected with the HIV MN-1 virus but not in uninfected cells. As one of ordinary skill in the art can appreciate the novelty and usefulness of the present methodology in directly identifying such peptide ligands and the importance such identification has for numerous therapeutic (vaccine development, drug targeting) and diagnostic tools. As such, numerous other peptide ligands have been uniquely identified in cells infected with HIV MN-1 (as opposed to uninfected cells_ and these results are summarized in TABLE VII. One of ordinary skill in the art given the present specification would be fully enabled to identify the "GPRTAALGLL" peptide ligand; as well as other uniquely presented peptide ligands found in cells infected with a microorganism of interest and/or tumorigenic cells.
[0169] As stated above, TABLE VII identifies the sequences of peptide ligands identified to date as being unique to HIV infected cells. Class I sHLA B*0702 was harvested for T cells infected and not infected with HIV. Peptide ligands were eluted from B*0702 and comparatively mapped on a mass spectrometer so that ions unique to infected cells were apparent. Ions unique to infected cells (and one ligand unique to uninfected cells) were subjected to mass spectrometric fragmentation for peptide sequencing. Column 1 indicates the ion selected for sequencing, column 2 is the HPLC fraction, column 3 is the peptide sequence, column 4 is the predicted molecular weight, column 5 is the molecular weight we found, column 6 is the source protein for the epitope sequenced, column 7 is where the epitope starts in the sequence of the source protein, column 8 is the accession number, and column 9 is a descriptor which briefly indicates what is known of that epitope and/or its source protein.
[0170] The methodology used herein is to use sHLA to determine what is unique to unhealthy cells as compared to healthy cells. Using sHLA to survey the contents of a cell provides a look at what is unique to unhealthy cells in terms of proteins that are processed into peptides. TABLE VII shows the utility of the method described herein for discovering epitopes and their source proteins which are unique to HIV infected cells. A detailed description of the peptide from Reticulocalbin is provided hereinabove. The other epitopes and corresponding source proteins described in TABLE VII were processed in the same manner as the reticulocalbin epitope and source protein were--i.e., as described herein above. The data summarized in TABLE VII shows that the epitope discovery technique described herein is capable of identifying sHLA bound epitopes and their corresponding source proteins which are unique to infected/unhealthy cells.
[0171] Likewise, and as is shown in TABLE VII, peptide ligands presented in individual class I MHC molecules in an uninfected cell that are not presented by individual class I MHC molecules in an uninfected cell can also be identified. The peptide "GSHSMRY", for example, was identified by the method of the present invention as being an individual class I MHC molecule which is presented in an uninfected cell but not in an infected cell.
[0172] The utility of this data is at least threefold. First, the data indicates what comes out of the cell with HLA. Such data can be used to target CTL to unhealthy cells. Second, antibodies can be targeted to specifically recognize HLA molecules carrying the ligand described. Third, realization of the source protein can lead to therapies and diagnostics which target the source protein. Thus, an epitope unique to unhealthy cells also indicates that the source protein is unique in the unhealthy cell.
[0173] The methods of epitope discovery and comparative ligand mapping described herein are not limited to cells infected by a microorganism such as HIV. Unhealthy cells analyzed by the epitope discovery process described herein can arise from virus infection or also cancerous transformation. In addition, the status of an unhealthy cell can also be mimicked by transfecting a particular gene known to be expressed during viral infection or tumor formation. For example, particular genes of HIV can be expressed in a cell line as described (Achour, A., et al., AIDS Res Hum Retroviruses, 1994. 10(1): p. 19-25; and Chiba, M., et al., CTL. Arch Virol, 1999. 144(8): p. 1469-85, all of which are expressly incorporated herein by reference) and then the epitope discovery process performed to identify how the expression of the transferred gene modifies epitope presentation by sHLA. In a similar fashion, genes known to be upregulated during cancer (Smith, E. S., et al., Nat Med, 2001. 7(8): p. 967-72, which is expressly incorporated herein by reference) can be transferred in cells with sHLA and epitope discovery then completed. Thus, epitope discovery with sHLA as described herein can be completed on cells infected with intact pathogens, cancerous cells or cell lines, or cells into which a particular cancer, viral, or bacterial gene has been transferred. In all these instances the sHLA described here will provide a means for detecting what changes in terms of epitope presentation and the source proteins for the epitopes.
[0174] Thus, in accordance with the present invention, there has been provided a methodology for epitope discovery and comparative ligand mapping which includes methodology for producing and manipulating Class I and Class II MHC molecules from gDNA that fully satisfies the objectives and advantages set forth herein above. Although the invention has been described in conjunction with the specific drawings, experimentation, results and language set forth herein above, it is evident that many alternatives, modifications, and variations will be apparent to those skilled in the art. Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the invention.
Sequence CWU
1
55130DNAArtificial SequencePrimer PP5UTA 1gcgctctaga cccagacgcc gaggatggcc
30220DNAArtificial SequencePrimer
3PPI4A 2gccctgaccc tgctaaaggt
20332DNAArtificial SequencePrimer PP5UTB 3gcgctctaga ccacccggac
tcagaatctc ct 32420DNAArtificial
SequencePrimer 3PPI4B 4tgctttccct gagaagagat
20529DNAArtificial SequencePrimer 5UTB39 5aggcgaattc
cagagtctcc tcagacgcg
29638DNAArtificial SequencePrimer 5PKCE 6gggcgaattc ccgccgccac catgcgggtc
atggcgcc 38720DNAArtificial SequencePrimer
3PPI4C 7ttctgctttc ctgagaagac
20837DNAArtificial SequencePrimer PP5UT 8gggcgaattc ggactcagaa
tctccccaga cgccgag 37930DNAArtificial
SequencePrimer PP3PEI 9ccgcgaattc tcatctcagg gtgaggggct
301030DNAArtificial SequencePrimer PP3PEIH
10ccgcaagctt tcatctcagg gtgaggggct
301130DNAArtificial SequencePrimer 3PEIHC7 11ccgcaagctt tcagctcagg
gtgaggggct 301220DNAArtificial
SequencePrimer T7Prom 12taatacgact cactataggg
201318DNAArtificial SequencePrimer BGHrev
13tagaaggcac agtcgagg
181417DNAArtificial SequencePrimer PPI2E2R 14gtcgtgacct gcgcccc
171521DNAArtificial
SequencePrimer PPI2E2F 15tttcattttc agtttaggcc a
211619DNAArtificial SequencePrimer ABCI3E4F
16ggtgtcctgt ccattctca
191737DNAArtificial SequencePrimer HLA5UT 17gggcgtcgac ggactcagaa
tctccccaga cgccgag 371830DNAArtificial
SequencePrimer 5UTA 18gcgcgtcgac cccagacgcc gaggatggcc
301937DNAArtificial SequencePrimer 5PXI 19gggctctaga
ggactcagaa tctccccaga cgccgag
372036DNAArtificial SequencePrimer CLSP23 20ccgcgtcgac tcagattctc
cccagacgcc gagatg 362128DNAArtificial
SequencePrimer LDC3UTA 21ccgcaagctt agaaacaaag tcagggtt
282237DNAArtificial SequencePrimer CLSP1085
22ccgcaagctt ggcagctgtc tcaggcttta caagctg
372341DNAArtificial SequencePrimer 3UTA 23ccgcaagctt ttggggaggg
agcacaggtc agcgtgggaa g 412441DNAArtificial
SequencePrimer 3UTB 24ccgcaagctt ctggggagga aacataggtc agcatgggaa c
412525DNAArtificial SequencePrimer 3PEI 25ccgcgaattc
tcatctcagg gtgag
252646DNAArtificial SequencePrimer 3PEIHIS 26ccgcgaattc tcagtggtgg
tggtggtggt gccatctcag ggtgag 462751DNAArtificial
SequencePrimer 3PEIFLAG 27ccgcgattct cacttgtcat cgtcgtcctt gtaatcccat
ctcagggtga g 512838DNAArtificial SequencePrimer 5PKOZXB
28gggctctaga ccgccgccac catgcgggtc atggcgcc
382910PRTHuman immunodeficiency virus 29Glu Gln Met Phe Glu Asp Ile Ile
Ser Leu1 5 10309PRTHomo sapiens 30Ile Pro
Cys Leu Leu Ile Ser Phe Leu1 53110PRTHomo sapiens 31Ser Thr
Thr Ala Ile Cys Ala Thr Gly Leu1 5
10328PRTHomo sapiens 32Ala Pro Ala Gln Asn Pro Glu Leu1
5339PRTHomo sapiens 33Leu Val Met Ala Pro Arg Thr Val Leu1
53410PRTHomo sapiensMISC_FEATURE(10)..(10)Xaa = any amino acid 34Ala Pro
Phe Ile Asn Ser Pro Ala Asp Xaa1 5
10359PRTHomo sapiens 35Thr Pro Gln Ser Asn Arg Pro Val Met1
5369PRTHomo sapiens 36Ala Ala Arg Pro Ala Thr Ser Thr Leu1
5379PRTHomo sapiens 37Met Ala Met Met Ala Ala Leu Met Ala1
5389PRTHomo sapiens 38Ile Ala Thr Val Asp Ser Tyr Val Ile1
53911PRTHomo sapiens 39Ser Pro Asn Gln Ala Arg Ala Gln Ala Ala Leu1
5 104010PRTHomo sapiens 40Gly Pro Arg Thr Ala
Ala Leu Gly Leu Leu1 5 104110PRTHomo
sapiens 41Asn Pro Asn Gln Asn Lys Asn Val Ala Leu1 5
10427PRTHomo sapiens 42Gly Ser His Ser Met Arg Tyr1
543521PRTHomo sapiens 43Met Glu Cys Pro His Leu Ser Ser Ser Val Cys
Ile Ala Pro Asp Ser1 5 10
15Ala Lys Phe Pro Asn Gly Ser Pro Ser Ser Trp Cys Cys Ser Val Cys
20 25 30Arg Ser Asn Lys Ser Pro Trp
Val Cys Leu Thr Cys Ser Ser Val His 35 40
45Cys Gly Arg Tyr Val Asn Gly His Ala Lys Lys His Tyr Glu Asp
Ala 50 55 60Gln Val Pro Leu Thr Asn
His Lys Lys Ser Glu Lys Gln Asp Lys Val65 70
75 80Gln His Thr Val Cys Met Asp Cys Ser Ser Tyr
Ser Thr Tyr Cys Tyr 85 90
95Arg Cys Asp Asp Phe Val Val Asn Asp Thr Lys Leu Gly Leu Val Gln
100 105 110Lys Val Arg Glu His Leu
Gln Asn Leu Glu Asn Ser Ala Phe Thr Ala 115 120
125Asp Arg His Lys Lys Arg Lys Leu Leu Glu Asn Ser Thr Leu
Asn Ser 130 135 140Lys Leu Leu Lys Val
Asn Gly Ser Thr Thr Ala Ile Cys Ala Thr Gly145 150
155 160Leu Arg Asn Leu Gly Asn Thr Cys Phe Met
Asn Ala Ile Leu Gln Ser 165 170
175Leu Ser Asn Ile Glu Gln Phe Cys Cys Tyr Phe Lys Glu Leu Pro Ala
180 185 190Val Glu Leu Arg Asn
Gly Lys Thr Ala Gly Arg Arg Thr Tyr His Thr 195
200 205Arg Ser Gln Gly Asp Asn Asn Val Ser Leu Val Glu
Glu Phe Arg Lys 210 215 220Thr Leu Cys
Ala Leu Trp Gln Gly Ser Gln Thr Ala Phe Ser Pro Glu225
230 235 240Ser Leu Phe Tyr Val Val Trp
Lys Ile Met Pro Asn Phe Arg Gly Tyr 245
250 255Gln Gln Gln Asp Ala His Glu Phe Asn Ala Leu Pro
Phe Gly Pro Pro 260 265 270Thr
Leu Gly Asn Phe Arg Ala Val Ser Thr Val Phe Pro Ala Gln Gln 275
280 285Phe Cys Arg Arg Ile Leu Leu Cys Leu
Gln Val Asn Lys Cys Cys Ile 290 295
300Asn Gly Ala Ser Thr Val Val Thr Ala Ile Phe Gly Gly Ile Leu Gln305
310 315 320Asn Glu Val Asn
Cys Leu Ile Cys Gly Thr Glu Ser Arg Lys Phe Asp 325
330 335Pro Phe Leu Asp Leu Ser Leu Asp Ile Pro
Ser Gln Phe Arg Ser Lys 340 345
350Arg Ser Lys Asn Gln Glu Asn Gly Pro Val Cys Ser Leu Arg Asp Cys
355 360 365Leu Arg Ser Phe Thr Asp Leu
Glu Glu Leu Asp Glu Thr Glu Leu Tyr 370 375
380Met Cys His Lys Cys Lys Lys Lys Gln Lys Ser Thr Lys Lys Phe
Trp385 390 395 400Ile Gln
Lys Leu Pro Lys Val Leu Cys Leu His Leu Lys Arg Phe His
405 410 415Trp Thr Ala Tyr Leu Arg Asn
Lys Val Asp Thr Tyr Val Glu Phe Pro 420 425
430Leu Arg Gly Leu Asp Met Lys Trp Tyr Leu Leu Glu Pro Glu
Asn Ser 435 440 445Gly Pro Glu Ser
Cys Leu Tyr Asp Leu Ala Ala Val Val Val His His 450
455 460Gly Ser Gly Val Gly Ser Gly His Tyr Thr Ala Tyr
Ala Thr His Glu465 470 475
480Gly Arg Trp Phe His Phe Asn Asp Ser Thr Val Thr Leu Thr Asp Glu
485 490 495Glu Thr Val Val Lys
Ala Lys Ala Tyr Ile Leu Phe Tyr Val Glu His 500
505 510Gln Ala Lys Ala Gly Ser Asp Lys Leu 515
52044206PRTHomo sapiens 44Met Leu Val Met Ala Pro Arg Thr
Val Leu Leu Leu Leu Ser Ala Ala1 5 10
15Leu Ala Leu Thr Glu Thr Trp Ala Gly Ser His Ser Met Arg
Tyr Phe 20 25 30Tyr Thr Ala
Val Ser Arg Pro Gly Arg Gly Glu Pro Arg Phe Ile Ser 35
40 45Val Gly Tyr Val Asp Asp Thr Gln Phe Val Arg
Phe Asp Ser Asp Ala 50 55 60Ala Ser
Pro Arg Glu Glu Pro Arg Ala Pro Trp Ile Glu Gln Glu Gly65
70 75 80Pro Glu Tyr Trp Asp Arg Asn
Thr Gln Ile Cys Lys Thr Asn Thr Gln 85 90
95Thr Asp Arg Glu Ser Leu Arg Asn Leu Arg Gly Tyr Tyr
Asn Gln Ser 100 105 110Glu Ala
Gly Ser His Thr Leu Gln Ser Met Tyr Gly Cys Asp Val Gly 115
120 125Pro Asp Gly Arg Leu Leu Arg Gly Tyr Asn
Gln Phe Ala Tyr Asp Gly 130 135 140Lys
Asp Tyr Ile Ala Leu Asn Glu Asp Leu Ser Ser Trp Thr Ala Ala145
150 155 160Asp Thr Ala Ala Gln Ile
Thr Gln Arg Lys Trp Glu Ala Ala Arg Glu 165
170 175Ala Glu Gln Leu Arg Ala Tyr Leu Glu Gly Thr Cys
Val Glu Trp Leu 180 185 190Arg
Arg His Leu Glu Asn Gly Lys Glu Thr Leu Gln Arg Ala 195
200 205451970PRTHomo sapiens 45 Met His Gly Gly Gly
Pro Pro Ser Gly Asp Ser Ala Cys Pro Leu Arg1 5
10 15Thr Ile Lys Arg Val Gln Phe Gly Val Leu Ser
Pro Asp Glu Leu Lys 20 25
30Arg Met Ser Val Thr Glu Gly Gly Ile Lys Tyr Pro Glu Thr Thr Glu
35 40 45Gly Gly Arg Pro Lys Leu Gly Gly
Leu Met Asp Pro Arg Gln Gly Val 50 55
60Ile Glu Arg Thr Gly Arg Cys Gln Thr Cys Ala Gly Asn Met Thr Glu65
70 75 80Cys Pro Gly His Phe
Gly His Ile Glu Leu Ala Lys Pro Val Phe His 85
90 95Val Gly Phe Leu Val Lys Thr Met Lys Val Leu
Arg Cys Val Cys Phe 100 105
110Phe Cys Ser Lys Leu Leu Val Asp Ser Asn Asn Pro Lys Ile Lys Asp
115 120 125Ile Leu Ala Lys Ser Lys Gly
Gln Pro Lys Lys Arg Leu Thr His Val 130 135
140Tyr Asp Leu Cys Lys Gly Lys Asn Ile Cys Glu Gly Gly Glu Glu
Met145 150 155 160Asp Asn
Lys Phe Gly Val Glu Gln Pro Glu Gly Asp Glu Asp Leu Thr
165 170 175Lys Glu Lys Gly His Gly Gly
Cys Gly Arg Tyr Gln Pro Arg Ile Arg 180 185
190Arg Ser Gly Leu Glu Leu Tyr Ala Glu Trp Lys His Val Asn
Glu Asp 195 200 205Ser Gln Glu Lys
Lys Ile Leu Leu Ser Pro Glu Arg Val His Glu Ile 210
215 220Phe Lys Arg Ile Ser Asp Glu Glu Cys Phe Val Leu
Gly Met Glu Pro225 230 235
240Arg Tyr Ala Arg Pro Glu Trp Met Ile Val Thr Val Leu Pro Val Pro
245 250 255Pro Leu Ser Val Arg
Pro Ala Val Val Met Gln Gly Ser Ala Arg Asn 260
265 270Gln Asp Asp Leu Thr His Lys Leu Ala Asp Ile Val
Lys Ile Asn Asn 275 280 285Gln Leu
Arg Arg Asn Glu Gln Asn Gly Ala Ala Ala His Val Ile Ala 290
295 300Glu Asp Val Lys Leu Leu Gln Phe His Val Ala
Thr Met Val Asp Asn305 310 315
320Glu Leu Pro Gly Leu Pro Arg Ala Met Gln Lys Ser Gly Arg Pro Leu
325 330 335Lys Ser Leu Lys
Gln Arg Leu Lys Gly Lys Glu Gly Arg Val Arg Gly 340
345 350Asn Leu Met Gly Lys Arg Val Asp Phe Ser Ala
Arg Thr Val Ile Thr 355 360 365Pro
Asp Pro Asn Leu Ser Ile Asp Gln Val Gly Val Pro Arg Ser Ile 370
375 380Ala Ala Asn Met Thr Phe Ala Glu Ile Val
Thr Pro Phe Asn Ile Asp385 390 395
400Arg Leu Gln Glu Leu Val Arg Arg Gly Asn Ser Gln Tyr Pro Gly
Ala 405 410 415Lys Tyr Ile
Ile Arg Asp Asn Gly Asp Arg Ile Asp Leu Arg Phe His 420
425 430Pro Lys Pro Ser Asp Leu His Leu Gln Thr
Gly Tyr Lys Val Glu Arg 435 440
445His Met Cys Asp Gly Asp Ile Val Ile Phe Asn Arg Gln Pro Thr Leu 450
455 460His Lys Met Ser Met Met Gly His
Arg Val Arg Ile Leu Pro Trp Ser465 470
475 480Thr Phe Arg Leu Asn Leu Ser Val Thr Thr Pro Tyr
Asn Ala Asp Phe 485 490
495Asp Gly Asp Glu Met Asn Leu His Leu Pro Gln Ser Leu Glu Thr Arg
500 505 510Ala Glu Ile Gln Glu Leu
Ala Met Val Pro Arg Met Ile Val Thr Pro 515 520
525Gln Ser Asn Arg Pro Val Met Gly Ile Val Gln Asp Thr Leu
Thr Ala 530 535 540Val Arg Lys Phe Thr
Lys Arg Asp Val Phe Leu Glu Arg Gly Glu Val545 550
555 560Met Asn Leu Leu Met Phe Leu Ser Thr Trp
Asp Gly Lys Val Pro Gln 565 570
575Pro Ala Ile Leu Lys Pro Arg Pro Leu Trp Thr Gly Lys Gln Ile Phe
580 585 590Ser Leu Ile Ile Pro
Gly His Ile Asn Cys Ile Arg Thr His Ser Thr 595
600 605His Pro Asp Asp Glu Asp Ser Gly Pro Tyr Lys His
Ile Ser Pro Gly 610 615 620Asp Thr Lys
Val Val Val Glu Asn Gly Glu Leu Ile Met Gly Ile Leu625
630 635 640Cys Lys Lys Ser Leu Gly Thr
Ser Ala Gly Ser Leu Val His Ile Ser 645
650 655Tyr Leu Glu Met Gly His Asp Ile Thr Arg Leu Phe
Tyr Ser Asn Ile 660 665 670Gln
Thr Val Ile Asn Asn Trp Leu Leu Ile Glu Gly His Thr Ile Gly 675
680 685Ile Gly Asp Ser Ile Ala Asp Ser Lys
Thr Tyr Gln Asp Ile Gln Asn 690 695
700Thr Ile Lys Lys Ala Lys Gln Asp Val Ile Glu Val Ile Glu Lys Ala705
710 715 720His Asn Asn Glu
Leu Glu Pro Thr Pro Gly Asn Thr Leu Arg Gln Thr 725
730 735Phe Glu Asn Gln Val Asn Arg Ile Leu Asn
Asp Ala Arg Asp Lys Thr 740 745
750Gly Ser Ser Ala Gln Lys Ser Leu Ser Glu Tyr Asn Asn Phe Lys Ser
755 760 765Met Val Val Ser Gly Ala Lys
Gly Ser Lys Ile Asn Ile Ser Gln Val 770 775
780Ile Ala Val Val Gly Gln Gln Asn Val Glu Gly Lys Arg Ile Pro
Phe785 790 795 800Gly Phe
Lys His Arg Thr Leu Pro His Phe Ile Lys Asp Asp Tyr Gly
805 810 815Pro Glu Ser Arg Gly Phe Val
Glu Asn Ser Tyr Leu Ala Gly Leu Thr 820 825
830Pro Thr Glu Phe Phe Phe His Ala Met Gly Gly Arg Glu Gly
Leu Ile 835 840 845Asp Thr Ala Val
Lys Thr Ala Glu Thr Gly Tyr Ile Gln Arg Arg Leu 850
855 860Ile Lys Ser Met Glu Ser Val Met Val Lys Tyr Asp
Ala Thr Val Arg865 870 875
880Asn Ser Ile Asn Gln Val Val Gln Leu Arg Tyr Gly Glu Asp Gly Leu
885 890 895Ala Gly Glu Ser Val
Glu Phe Gln Asn Leu Ala Thr Leu Lys Pro Ser 900
905 910Asn Lys Ala Phe Glu Lys Lys Phe Arg Phe Asp Tyr
Thr Asn Glu Arg 915 920 925Ala Leu
Arg Arg Thr Leu Gln Glu Asp Leu Val Lys Asp Val Leu Ser 930
935 940Asn Ala His Ile Gln Asn Glu Leu Glu Arg Glu
Phe Glu Arg Met Arg945 950 955
960Glu Asp Arg Glu Val Leu Arg Val Ile Phe Pro Thr Gly Asp Ser Lys
965 970 975Val Val Leu Pro
Cys Asn Leu Leu Arg Met Ile Trp Asn Ala Gln Lys 980
985 990Ile Phe His Ile Asn Pro Arg Leu Pro Ser Asp
Leu His Pro Ile Lys 995 1000
1005Val Val Glu Gly Val Lys Glu Leu Ser Lys Lys Leu Val Ile Val
1010 1015 1020Asn Gly Asp Asp Pro Leu
Ser Arg Gln Ala Gln Glu Asn Ala Thr 1025 1030
1035Leu Leu Phe Asn Ile His Leu Arg Ser Thr Leu Cys Ser Arg
Arg 1040 1045 1050Met Ala Glu Glu Phe
Arg Leu Ser Gly Glu Ala Phe Asp Trp Leu 1055 1060
1065Leu Gly Glu Ile Glu Ser Lys Phe Asn Gln Ala Ile Ala
His Pro 1070 1075 1080Gly Glu Met Val
Gly Ala Leu Ala Ala Gln Ser Leu Gly Glu Pro 1085
1090 1095Ala Thr Gln Met Thr Leu Asn Thr Phe His Tyr
Ala Gly Val Ser 1100 1105 1110Ala Lys
Asn Val Thr Leu Gly Val Pro Arg Leu Lys Glu Leu Ile 1115
1120 1125Asn Ile Ser Lys Lys Pro Lys Thr Pro Ser
Leu Thr Val Phe Leu 1130 1135 1140Leu
Gly Gln Ser Ala Arg Asp Ala Glu Arg Ala Lys Asp Ile Leu 1145
1150 1155Cys Arg Leu Glu His Thr Thr Leu Arg
Lys Val Thr Ala Asn Thr 1160 1165
1170Ala Ile Tyr Tyr Asp Pro Asn Pro Gln Ser Thr Val Val Ala Glu
1175 1180 1185Asp Gln Glu Trp Val Asn
Val Tyr Tyr Glu Met Pro Asp Phe Asp 1190 1195
1200Val Ala Arg Ile Ser Pro Trp Leu Leu Arg Val Glu Leu Asp
Arg 1205 1210 1215Lys His Met Thr Asp
Arg Lys Leu Thr Met Glu Gln Ile Ala Glu 1220 1225
1230Lys Ile Asn Ala Gly Phe Gly Asp Asp Leu Asn Cys Ile
Phe Asn 1235 1240 1245Asp Asp Asn Ala
Glu Lys Leu Val Leu Arg Ile Arg Ile Met Asn 1250
1255 1260Ser Asp Glu Asn Lys Met Gln Glu Glu Glu Glu
Val Val Asp Lys 1265 1270 1275Met Asp
Asp Asp Val Phe Leu Arg Cys Ile Glu Ser Asn Met Leu 1280
1285 1290Thr Asp Met Thr Leu Gln Gly Ile Glu Gln
Ile Ser Lys Val Tyr 1295 1300 1305Met
His Leu Pro Gln Thr Asp Asn Lys Lys Lys Ile Ile Ile Thr 1310
1315 1320Glu Asp Gly Glu Phe Lys Ala Leu Gln
Glu Trp Ile Leu Glu Thr 1325 1330
1335Asp Gly Val Ser Leu Met Arg Val Leu Ser Glu Lys Asp Val Asp
1340 1345 1350Pro Val Arg Thr Thr Ser
Asn Asp Ile Val Glu Ile Phe Thr Val 1355 1360
1365Leu Gly Ile Glu Ala Val Arg Lys Ala Leu Glu Arg Glu Leu
Tyr 1370 1375 1380His Val Ile Ser Phe
Asp Gly Ser Tyr Val Asn Tyr Arg His Leu 1385 1390
1395Ala Leu Leu Cys Asp Thr Met Thr Cys Arg Gly His Leu
Met Ala 1400 1405 1410Ile Thr Arg His
Gly Val Asn Arg Gln Asp Thr Gly Pro Leu Met 1415
1420 1425Lys Cys Ser Phe Glu Glu Thr Val Asp Val Leu
Met Glu Ala Ala 1430 1435 1440Ala His
Gly Glu Ser Asp Pro Met Lys Gly Val Ser Glu Asn Ile 1445
1450 1455Met Leu Gly Gln Leu Ala Pro Ala Gly Thr
Gly Cys Phe Asp Leu 1460 1465 1470Leu
Leu Asp Ala Glu Lys Cys Lys Tyr Gly Met Glu Ile Pro Thr 1475
1480 1485Asn Ile Pro Gly Leu Gly Ala Ala Gly
Pro Thr Gly Met Phe Phe 1490 1495
1500Gly Ser Ala Pro Ser Pro Met Gly Gly Ile Ser Pro Ala Met Thr
1505 1510 1515Pro Trp Asn Gln Gly Ala
Thr Pro Ala Tyr Gly Ala Trp Ser Pro 1520 1525
1530Ser Val Gly Ser Gly Met Thr Pro Gly Ala Ala Gly Phe Ser
Pro 1535 1540 1545Ser Ala Ala Ser Asp
Ala Ser Gly Phe Ser Pro Gly Tyr Ser Pro 1550 1555
1560Ala Trp Ser Pro Thr Pro Gly Ser Pro Gly Ser Pro Gly
Pro Ser 1565 1570 1575Ser Pro Tyr Ile
Pro Ser Pro Gly Gly Ala Met Ser Pro Ser Tyr 1580
1585 1590Ser Pro Thr Ser Pro Ala Tyr Glu Pro Arg Ser
Pro Gly Gly Tyr 1595 1600 1605Thr Pro
Gln Ser Pro Ser Tyr Ser Pro Thr Ser Pro Ser Tyr Ser 1610
1615 1620Pro Thr Ser Pro Ser Tyr Ser Pro Thr Ser
Pro Asn Tyr Ser Pro 1625 1630 1635Thr
Ser Pro Ser Tyr Ser Pro Thr Ser Pro Ser Tyr Ser Pro Thr 1640
1645 1650Ser Pro Ser Tyr Ser Pro Thr Ser Pro
Ser Tyr Ser Pro Thr Ser 1655 1660
1665Pro Ser Tyr Ser Pro Thr Ser Pro Ser Tyr Ser Pro Thr Ser Pro
1670 1675 1680Ser Tyr Ser Pro Thr Ser
Pro Ser Tyr Ser Pro Thr Ser Pro Ser 1685 1690
1695Tyr Ser Pro Thr Ser Pro Ser Tyr Ser Pro Thr Ser Pro Ser
Tyr 1700 1705 1710Ser Pro Thr Ser Pro
Ser Tyr Ser Pro Thr Ser Pro Ser Tyr Ser 1715 1720
1725Pro Thr Ser Pro Ser Tyr Ser Pro Thr Ser Pro Ser Tyr
Ser Pro 1730 1735 1740Thr Ser Pro Asn
Tyr Ser Pro Thr Ser Pro Asn Tyr Thr Pro Thr 1745
1750 1755Ser Pro Ser Tyr Ser Pro Thr Ser Pro Ser Tyr
Ser Pro Thr Ser 1760 1765 1770Pro Asn
Tyr Thr Pro Thr Ser Pro Asn Tyr Ser Pro Thr Ser Pro 1775
1780 1785Ser Tyr Ser Pro Thr Ser Pro Ser Tyr Ser
Pro Thr Ser Pro Ser 1790 1795 1800Tyr
Ser Pro Ser Ser Pro Arg Tyr Thr Pro Gln Ser Pro Thr Tyr 1805
1810 1815Thr Pro Ser Ser Pro Ser Tyr Ser Pro
Ser Ser Pro Ser Tyr Ser 1820 1825
1830Pro Thr Ser Pro Lys Tyr Thr Pro Thr Ser Pro Ser Tyr Ser Pro
1835 1840 1845Ser Ser Pro Glu Tyr Thr
Pro Thr Ser Pro Lys Tyr Ser Pro Thr 1850 1855
1860Ser Pro Lys Tyr Ser Pro Thr Ser Pro Lys Tyr Ser Pro Thr
Ser 1865 1870 1875Pro Thr Tyr Ser Pro
Thr Thr Pro Lys Tyr Ser Pro Thr Ser Pro 1880 1885
1890Thr Tyr Ser Pro Thr Ser Pro Val Tyr Thr Pro Thr Ser
Pro Lys 1895 1900 1905Tyr Ser Pro Thr
Ser Pro Thr Tyr Ser Pro Thr Ser Pro Lys Tyr 1910
1915 1920Ser Pro Thr Ser Pro Thr Tyr Ser Pro Thr Ser
Pro Lys Gly Ser 1925 1930 1935Thr Tyr
Ser Pro Thr Ser Pro Gly Tyr Ser Pro Thr Ser Pro Thr 1940
1945 1950Tyr Ser Leu Thr Ser Pro Ala Ile Ser Pro
Asp Asp Ser Asp Glu 1955 1960 1965Glu
Asn 1970461600PRTHomo sapiens 46Met Asn Lys Ala Pro Gln Ser Thr Gly
Pro Pro Pro Ala Pro Ser Pro1 5 10
15Gly Leu Pro Gln Pro Ala Phe Pro Pro Gly Gln Thr Ala Pro Val
Val 20 25 30Phe Ser Thr Pro
Gln Ala Thr Gln Met Asn Thr Pro Ser Gln Pro Arg 35
40 45Gln His Phe Tyr Pro Ser Arg Ala Gln Pro Pro Ser
Ser Ala Ala Ser 50 55 60Arg Val Gln
Ser Ala Ala Pro Ala Arg Pro Gly Pro Ala Ala His Val65 70
75 80Tyr Pro Ala Gly Ser Gln Val Met
Met Ile Pro Ser Gln Ile Ser Tyr 85 90
95Pro Ala Ser Gln Gly Ala Tyr Tyr Ile Pro Gly Gln Gly Arg
Ser Thr 100 105 110Tyr Val Val
Pro Thr Gln Gln Tyr Pro Val Gln Pro Gly Ala Pro Gly 115
120 125Phe Tyr Pro Gly Ala Ser Pro Thr Glu Leu Gly
Thr Tyr Ala Gly Ala 130 135 140Tyr Tyr
Pro Ala Arg Gly Val Gln Gln Phe Pro Thr Gly Val Ala Pro145
150 155 160Ala Pro Val Leu Met Asn Gln
Pro Pro Gln Ile Ala Pro Lys Arg Glu 165
170 175Arg Lys Thr Ile Arg Ile Arg Asp Pro Asn Gln Gly
Gly Lys Asp Ile 180 185 190Thr
Glu Glu Ile Met Ser Gly Ala Arg Thr Ala Ser Thr Pro Thr Pro 195
200 205Pro Gln Thr Gly Gly Gly Leu Glu Pro
Gln Ala Asn Gly Glu Thr Pro 210 215
220Gln Val Ala Val Ile Val Arg Pro Asp Asp Arg Ser Gln Gly Ala Ile225
230 235 240Ile Ala Asp Arg
Pro Gly Leu Pro Gly Pro Glu His Ser Pro Ser Glu 245
250 255Ser Gln Pro Ser Ser Pro Ser Pro Thr Pro
Ser Pro Ser Pro Val Leu 260 265
270Glu Pro Gly Ser Glu Pro Asn Leu Ala Val Leu Ser Ile Pro Gly Asp
275 280 285Thr Met Thr Thr Ile Gln Met
Ser Val Glu Glu Ser Thr Pro Ile Ser 290 295
300Arg Glu Thr Gly Glu Pro Tyr Arg Leu Ser Pro Glu Pro Thr Pro
Leu305 310 315 320Ala Glu
Pro Ile Leu Glu Val Glu Val Thr Leu Ser Lys Pro Val Pro
325 330 335Glu Ser Glu Phe Ser Ser Ser
Pro Leu Gln Ala Pro Thr Pro Leu Ala 340 345
350Ser His Thr Val Glu Ile His Glu Pro Asn Gly Met Val Pro
Ser Glu 355 360 365Asp Leu Glu Pro
Glu Val Glu Ser Ser Pro Glu Leu Ala Pro Pro Pro 370
375 380Ala Cys Pro Ser Glu Ser Pro Val Pro Ile Ala Pro
Thr Ala Gln Pro385 390 395
400Glu Glu Leu Leu Asn Gly Ala Pro Ser Pro Pro Ala Val Asp Leu Ser
405 410 415Pro Val Ser Glu Pro
Glu Glu Gln Ala Lys Glu Val Thr Ala Ser Val 420
425 430Ala Pro Pro Thr Ile Pro Ser Ala Thr Pro Ala Thr
Ala Pro Ser Ala 435 440 445Thr Ser
Pro Ala Gln Glu Glu Glu Met Glu Glu Glu Glu Glu Glu Glu 450
455 460Glu Gly Glu Ala Gly Glu Ala Gly Glu Ala Glu
Ser Glu Lys Gly Gly465 470 475
480Glu Glu Leu Leu Pro Pro Glu Ser Thr Pro Ile Pro Ala Asn Leu Ser
485 490 495Gln Asn Leu Glu
Ala Ala Ala Ala Thr Gln Val Ala Val Ser Val Pro 500
505 510Lys Arg Arg Arg Lys Ile Lys Glu Leu Asn Lys
Lys Glu Ala Val Gly 515 520 525Asp
Leu Leu Asp Ala Phe Lys Glu Ala Asn Pro Ala Val Pro Glu Val 530
535 540Glu Asn Gln Pro Pro Ala Gly Ser Asn Pro
Gly Pro Glu Ser Glu Gly545 550 555
560Ser Gly Val Pro Pro Arg Pro Glu Glu Ala Asp Glu Thr Trp Asp
Ser 565 570 575Lys Glu Asp
Lys Ile His Asn Ala Glu Asn Ile Gln Pro Gly Glu Gln 580
585 590Lys Tyr Glu Tyr Lys Ser Asp Gln Trp Lys
Pro Pro Asn Leu Glu Glu 595 600
605Lys Lys Arg Tyr Asp Arg Glu Phe Leu Leu Gly Phe Gln Phe Ile Phe 610
615 620Ala Ser Met Gln Lys Pro Glu Gly
Leu Pro His Ile Ser Asp Val Val625 630
635 640Leu Asp Lys Ala Asn Lys Thr Pro Leu Arg Pro Leu
Asp Pro Thr Arg 645 650
655Leu Gln Gly Ile Asn Cys Gly Pro Asp Phe Thr Pro Ser Phe Ala Asn
660 665 670Leu Gly Arg Thr Thr Leu
Ser Thr Arg Gly Pro Pro Arg Gly Gly Pro 675 680
685Gly Gly Glu Leu Pro Arg Gly Pro Gln Ala Gly Leu Gly Pro
Arg Arg 690 695 700Ser Gln Gln Gly Pro
Arg Lys Glu Pro Arg Lys Ile Ile Ala Thr Val705 710
715 720Leu Met Thr Glu Asp Ile Lys Leu Asn Lys
Ala Glu Lys Ala Trp Lys 725 730
735Pro Ser Ser Lys Arg Thr Ala Ala Asp Lys Asp Arg Gly Glu Glu Asp
740 745 750Ala Asp Gly Ser Lys
Thr Gln Asp Leu Phe Arg Arg Val Arg Ser Ile 755
760 765Leu Asn Lys Leu Thr Pro Gln Met Phe Gln Gln Leu
Met Lys Gln Val 770 775 780Thr Gln Leu
Ala Ile Asp Thr Glu Glu Arg Leu Lys Gly Val Ile Asp785
790 795 800Leu Ile Phe Glu Lys Ala Ile
Ser Glu Pro Asn Phe Ser Val Ala Tyr 805
810 815Ala Asn Met Cys Arg Cys Leu Met Ala Leu Lys Val
Pro Thr Thr Glu 820 825 830Lys
Pro Thr Val Thr Val Asn Phe Arg Lys Leu Leu Leu Asn Arg Cys 835
840 845Gln Lys Glu Phe Glu Lys Asp Lys Asp
Asp Asp Glu Val Phe Glu Lys 850 855
860Lys Gln Lys Glu Met Asp Glu Ala Ala Thr Ala Glu Glu Arg Gly Arg865
870 875 880Leu Lys Glu Glu
Leu Glu Glu Ala Arg Asp Ile Ala Arg Arg Arg Ser 885
890 895Leu Gly Asn Ile Lys Phe Ile Gly Glu Leu
Phe Lys Leu Lys Met Leu 900 905
910Thr Glu Ala Ile Met His Asp Cys Val Val Lys Leu Leu Lys Asn His
915 920 925Asp Glu Glu Ser Leu Glu Cys
Leu Cys Arg Leu Leu Thr Thr Ile Gly 930 935
940Lys Asp Leu Asp Phe Glu Lys Ala Lys Pro Arg Met Asp Gln Tyr
Phe945 950 955 960Asn Gln
Met Glu Lys Ile Ile Lys Glu Lys Lys Thr Ser Ser Arg Ile
965 970 975Arg Phe Met Leu Gln Asp Val
Leu Asp Leu Arg Gly Ser Asn Trp Val 980 985
990Pro Arg Arg Gly Asp Gln Gly Pro Lys Thr Ile Asp Gln Ile
His Lys 995 1000 1005Glu Ala Glu
Met Glu Glu His Arg Glu His Ile Lys Val Gln Gln 1010
1015 1020Leu Met Ala Lys Gly Ser Asp Lys Arg Arg Gly
Gly Pro Pro Gly 1025 1030 1035Pro Pro
Ile Ser Arg Gly Leu Pro Leu Val Asp Asp Gly Gly Trp 1040
1045 1050Asn Thr Val Pro Ile Ser Lys Gly Ser Arg
Pro Ile Asp Thr Ser 1055 1060 1065Arg
Leu Thr Lys Ile Thr Lys Pro Gly Ser Ile Asp Ser Asn Asn 1070
1075 1080Gln Leu Phe Ala Pro Gly Gly Arg Leu
Ser Trp Gly Lys Gly Ser 1085 1090
1095Ser Gly Gly Ser Gly Ala Lys Pro Ser Asp Ala Ala Ser Glu Ala
1100 1105 1110Ala Arg Pro Ala Thr Ser
Thr Leu Asn Arg Phe Ser Ala Leu Gln 1115 1120
1125Gln Ala Val Pro Thr Glu Ser Thr Asp Asn Arg Arg Val Val
Gln 1130 1135 1140Arg Ser Ser Leu Ser
Arg Glu Arg Gly Glu Lys Ala Gly Asp Arg 1145 1150
1155Gly Asp Arg Leu Glu Arg Ser Glu Arg Gly Gly Asp Arg
Gly Asp 1160 1165 1170Arg Leu Asp Arg
Ala Arg Thr Pro Ala Thr Lys Arg Ser Phe Ser 1175
1180 1185Lys Glu Val Glu Glu Arg Ser Arg Glu Arg Pro
Ser Gln Pro Glu 1190 1195 1200Gly Leu
Arg Lys Ala Ala Ser Leu Thr Glu Asp Arg Asp Arg Gly 1205
1210 1215Arg Asp Ala Val Lys Arg Glu Ala Ala Leu
Pro Pro Val Ser Pro 1220 1225 1230Leu
Lys Ala Ala Leu Ser Glu Glu Glu Leu Glu Lys Lys Ser Lys 1235
1240 1245Ala Ile Ile Glu Glu Tyr Leu His Leu
Asn Asp Met Lys Glu Ala 1250 1255
1260Val Gln Cys Val Gln Glu Leu Ala Ser Pro Ser Leu Leu Phe Ile
1265 1270 1275Phe Val Arg His Gly Val
Glu Ser Thr Leu Glu Arg Ser Ala Ile 1280 1285
1290Ala Arg Glu His Met Gly Gln Leu Leu His Gln Leu Leu Cys
Ala 1295 1300 1305Gly His Leu Ser Thr
Ala Gln Tyr Tyr Gln Gly Leu Tyr Glu Ile 1310 1315
1320Leu Glu Leu Ala Glu Asp Met Glu Ile Asp Ile Pro His
Val Trp 1325 1330 1335Leu Tyr Leu Ala
Glu Leu Val Thr Pro Ile Leu Gln Glu Gly Gly 1340
1345 1350Val Pro Met Gly Glu Leu Phe Arg Glu Ile Thr
Lys Pro Leu Arg 1355 1360 1365Pro Leu
Gly Lys Ala Ala Ser Leu Leu Leu Glu Ile Leu Gly Leu 1370
1375 1380Leu Cys Lys Ser Met Gly Pro Lys Lys Val
Gly Thr Leu Trp Arg 1385 1390 1395Glu
Ala Gly Leu Ser Trp Lys Glu Phe Leu Pro Glu Gly Gln Asp 1400
1405 1410Ile Gly Ala Phe Val Ala Glu Gln Lys
Val Glu Tyr Thr Leu Gly 1415 1420
1425Glu Glu Ser Glu Ala Pro Gly Gln Arg Ala Leu Pro Ser Glu Glu
1430 1435 1440Leu Asn Arg Gln Leu Glu
Lys Leu Leu Lys Glu Gly Ser Ser Asn 1445 1450
1455Gln Arg Val Phe Asp Trp Ile Glu Ala Asn Leu Ser Glu Gln
Gln 1460 1465 1470Ile Val Ser Asn Thr
Leu Val Arg Ala Leu Met Thr Ala Val Cys 1475 1480
1485Tyr Ser Ala Ile Ile Phe Glu Thr Pro Leu Arg Val Asp
Val Ala 1490 1495 1500Val Leu Lys Ala
Arg Ala Lys Leu Leu Gln Lys Tyr Leu Cys Asp 1505
1510 1515Glu Gln Lys Glu Leu Gln Ala Leu Tyr Ala Leu
Gln Ala Leu Val 1520 1525 1530Val Thr
Leu Glu Gln Pro Pro Asn Leu Leu Arg Met Phe Phe Asp 1535
1540 1545Ala Leu Tyr Asp Glu Asp Val Val Lys Glu
Asp Ala Phe Tyr Ser 1550 1555 1560Trp
Glu Ser Ser Lys Asp Pro Ala Glu Gln Gln Gly Lys Gly Val 1565
1570 1575Ala Leu Lys Ser Val Thr Ala Phe Phe
Lys Trp Leu Arg Glu Ala 1580 1585
1590Glu Glu Glu Ser Asp His Asn 1595 16004725PRTHomo
sapiensmisc_feature(8)..(8)Xaa can be any naturally occurring amino acid
47Asp Ile Ser Gly Leu Thr Pro Xaa Lys Glu Ser Lys Gln Phe Ala Lys1
5 10 15Xaa Glu Lys Gln Xaa Xaa
Lys Lys Leu 20 25482201PRTHomo sapiens 48Met
Gly Ala Met Thr Gln Leu Leu Ala Gly Val Phe Leu Ala Phe Leu1
5 10 15Ala Leu Ala Thr Glu Gly Gly
Val Leu Lys Lys Val Ile Arg His Lys 20 25
30Arg Gln Ser Gly Val Asn Ala Thr Leu Pro Glu Glu Asn Gln
Pro Val 35 40 45Val Phe Asn His
Val Tyr Asn Ile Lys Leu Pro Val Gly Ser Gln Cys 50 55
60Ser Val Asp Leu Glu Ser Ala Ser Gly Glu Lys Asp Leu
Ala Pro Pro65 70 75
80Ser Glu Pro Ser Glu Ser Phe Gln Glu His Thr Val Asp Gly Glu Asn
85 90 95Gln Ile Val Phe Thr His
Arg Ile Asn Ile Pro Arg Arg Ala Cys Gly 100
105 110Cys Ala Ala Ala Pro Asp Val Lys Glu Leu Leu Ser
Arg Leu Glu Glu 115 120 125Leu Glu
Asn Leu Val Ser Ser Leu Arg Glu Gln Cys Thr Ala Gly Ala 130
135 140Gly Cys Cys Leu Gln Pro Ala Thr Gly Arg Leu
Asp Thr Arg Pro Phe145 150 155
160Cys Ser Gly Arg Gly Asn Phe Ser Thr Glu Gly Cys Gly Cys Val Cys
165 170 175Glu Pro Gly Trp
Lys Gly Pro Asn Cys Ser Glu Pro Glu Cys Pro Gly 180
185 190Asn Cys His Leu Arg Gly Arg Cys Ile Asp Gly
Gln Cys Ile Cys Asp 195 200 205Asp
Gly Phe Thr Gly Glu Asp Cys Ser Gln Leu Ala Cys Pro Ser Asp 210
215 220Cys Asn Asp Gln Gly Lys Cys Val Asn Gly
Val Cys Ile Cys Phe Glu225 230 235
240Gly Tyr Ala Gly Ala Asp Cys Ser Arg Glu Ile Cys Pro Val Pro
Cys 245 250 255Ser Glu Glu
His Gly Thr Cys Val Asp Gly Leu Cys Val Cys His Asp 260
265 270Gly Phe Ala Gly Asp Asp Cys Asn Lys Pro
Leu Cys Leu Asn Asn Cys 275 280
285Tyr Asn Arg Gly Arg Cys Val Glu Asn Glu Cys Val Cys Asp Glu Gly 290
295 300Phe Thr Gly Glu Asp Cys Ser Glu
Leu Ile Cys Pro Asn Asp Cys Phe305 310
315 320Asp Arg Gly Arg Cys Ile Asn Gly Thr Cys Tyr Cys
Glu Glu Gly Phe 325 330
335Thr Gly Glu Asp Cys Gly Lys Pro Thr Cys Pro His Ala Cys His Thr
340 345 350Gln Gly Arg Cys Glu Glu
Gly Gln Cys Val Cys Asp Glu Gly Phe Ala 355 360
365Gly Val Asp Cys Ser Glu Lys Arg Cys Pro Ala Asp Cys His
Asn Arg 370 375 380Gly Arg Cys Val Asp
Gly Arg Cys Glu Cys Asp Asp Gly Phe Thr Gly385 390
395 400Ala Asp Cys Gly Glu Leu Lys Cys Pro Asn
Gly Cys Ser Gly His Gly 405 410
415Arg Cys Val Asn Gly Gln Cys Val Cys Asp Glu Gly Tyr Thr Gly Glu
420 425 430Asp Cys Ser Gln Leu
Arg Cys Pro Asn Asp Cys His Ser Arg Gly Arg 435
440 445Cys Val Glu Gly Lys Cys Val Cys Glu Gln Gly Phe
Lys Gly Tyr Asp 450 455 460Cys Ser Asp
Met Ser Cys Pro Asn Asp Cys His Gln His Gly Arg Cys465
470 475 480Val Asn Gly Met Cys Val Cys
Asp Asp Gly Tyr Thr Gly Glu Asp Cys 485
490 495Arg Asp Arg Gln Cys Pro Arg Asp Cys Ser Asn Arg
Gly Leu Cys Val 500 505 510Asp
Gly Gln Cys Val Cys Glu Asp Gly Phe Thr Gly Pro Asp Cys Ala 515
520 525Glu Leu Ser Cys Pro Asn Asp Cys His
Gly Gln Gly Arg Cys Val Asn 530 535
540Gly Gln Cys Val Cys His Glu Gly Phe Met Gly Lys Asp Cys Lys Glu545
550 555 560Gln Arg Cys Pro
Ser Asp Cys His Gly Gln Gly Arg Cys Val Asp Gly 565
570 575Gln Cys Ile Cys His Glu Gly Phe Thr Gly
Leu Asp Cys Gly Gln His 580 585
590Ser Cys Pro Ser Asp Cys Asn Asn Leu Gly Gln Cys Val Ser Gly Arg
595 600 605Cys Ile Cys Asn Glu Gly Tyr
Ser Gly Glu Asp Cys Ser Glu Val Ser 610 615
620Pro Pro Lys Asp Leu Val Val Thr Glu Val Thr Glu Glu Thr Val
Asn625 630 635 640Leu Ala
Trp Asp Asn Glu Met Arg Val Thr Glu Tyr Leu Val Val Tyr
645 650 655Thr Pro Thr His Glu Gly Gly
Leu Glu Met Gln Phe Arg Val Pro Gly 660 665
670Asp Gln Thr Ser Thr Ile Ile Gln Glu Leu Glu Pro Gly Val
Glu Tyr 675 680 685Phe Ile Arg Val
Phe Ala Ile Leu Glu Asn Lys Lys Ser Ile Pro Val 690
695 700Ser Ala Arg Val Ala Thr Tyr Leu Pro Ala Pro Glu
Gly Leu Lys Phe705 710 715
720Lys Ser Ile Lys Glu Thr Ser Val Glu Val Glu Trp Asp Pro Leu Asp
725 730 735Ile Ala Phe Glu Thr
Trp Glu Ile Ile Phe Arg Asn Met Asn Lys Glu 740
745 750Asp Glu Gly Glu Ile Thr Lys Ser Leu Arg Arg Pro
Glu Thr Ser Tyr 755 760 765Arg Gln
Thr Gly Leu Ala Pro Gly Gln Glu Tyr Glu Ile Ser Leu His 770
775 780Ile Val Lys Asn Asn Thr Arg Gly Pro Gly Leu
Lys Arg Val Thr Thr785 790 795
800Thr Arg Leu Asp Ala Pro Ser Gln Ile Glu Val Lys Asp Val Thr Asp
805 810 815Thr Thr Ala Leu
Ile Thr Trp Phe Lys Pro Leu Ala Glu Ile Asp Gly 820
825 830Ile Glu Leu Thr Tyr Gly Ile Lys Asp Val Pro
Gly Asp Arg Thr Thr 835 840 845Ile
Asp Leu Thr Glu Asp Glu Asn Gln Tyr Ser Ile Gly Asn Leu Lys 850
855 860Pro Asp Thr Glu Tyr Glu Val Ser Leu Ile
Ser Arg Arg Gly Asp Met865 870 875
880Ser Ser Asn Pro Ala Lys Glu Thr Phe Thr Thr Gly Leu Asp Ala
Pro 885 890 895Arg Asn Leu
Arg Arg Val Ser Gln Thr Asp Asn Ser Ile Thr Leu Glu 900
905 910Trp Arg Asn Gly Lys Ala Ala Ile Asp Ser
Tyr Arg Ile Lys Tyr Ala 915 920
925Pro Ile Ser Gly Gly Asp His Ala Glu Val Asp Val Pro Lys Ser Gln 930
935 940Gln Ala Thr Thr Lys Thr Thr Leu
Thr Gly Leu Arg Pro Gly Thr Glu945 950
955 960Tyr Gly Ile Gly Val Ser Ala Val Lys Glu Asp Lys
Glu Ser Asn Pro 965 970
975Ala Thr Ile Asn Ala Ala Thr Glu Leu Asp Thr Pro Lys Asp Leu Gln
980 985 990Val Ser Glu Thr Ala Glu
Thr Ser Leu Thr Leu Leu Trp Lys Thr Pro 995 1000
1005Leu Ala Lys Phe Asp Arg Tyr Arg Leu Asn Tyr Ser
Leu Pro Thr 1010 1015 1020Gly Gln Trp
Val Gly Val Gln Leu Pro Arg Asn Thr Thr Ser Tyr 1025
1030 1035Val Leu Arg Gly Leu Glu Pro Gly Gln Glu Tyr
Asn Val Leu Leu 1040 1045 1050Thr Ala
Glu Lys Gly Arg His Lys Ser Lys Pro Ala Arg Val Lys 1055
1060 1065Ala Ser Thr Glu Gln Ala Pro Glu Leu Glu
Asn Leu Thr Val Thr 1070 1075 1080Glu
Val Gly Trp Asp Gly Leu Arg Leu Asn Trp Thr Ala Ala Asp 1085
1090 1095Gln Ala Tyr Glu His Phe Ile Ile Gln
Val Gln Glu Ala Asn Lys 1100 1105
1110Val Glu Ala Ala Arg Asn Leu Thr Val Pro Gly Ser Leu Arg Ala
1115 1120 1125Val Asp Ile Pro Gly Leu
Lys Ala Ala Thr Pro Tyr Thr Val Ser 1130 1135
1140Ile Tyr Gly Val Ile Gln Gly Tyr Arg Thr Pro Val Leu Ser
Ala 1145 1150 1155Glu Ala Ser Thr Gly
Glu Thr Pro Asn Leu Gly Glu Val Val Val 1160 1165
1170Ala Glu Val Gly Trp Asp Ala Leu Lys Leu Asn Trp Thr
Ala Pro 1175 1180 1185Glu Gly Ala Tyr
Glu Tyr Phe Phe Ile Gln Val Gln Glu Ala Asp 1190
1195 1200Thr Val Glu Ala Ala Gln Asn Leu Thr Val Pro
Gly Gly Leu Arg 1205 1210 1215Ser Thr
Asp Leu Pro Gly Leu Lys Ala Ala Thr His Tyr Thr Ile 1220
1225 1230Thr Ile Arg Gly Val Thr Gln Asp Phe Ser
Thr Thr Pro Leu Ser 1235 1240 1245Val
Glu Val Leu Thr Glu Glu Val Pro Asp Met Gly Asn Leu Thr 1250
1255 1260Val Thr Glu Val Ser Trp Asp Ala Leu
Arg Leu Asn Trp Thr Thr 1265 1270
1275Pro Asp Gly Thr Tyr Asp Gln Phe Thr Ile Gln Val Gln Glu Ala
1280 1285 1290Asp Gln Val Glu Glu Ala
His Asn Leu Thr Val Pro Gly Ser Leu 1295 1300
1305Arg Ser Met Glu Ile Pro Gly Leu Arg Ala Gly Thr Pro Tyr
Thr 1310 1315 1320Val Thr Leu His Gly
Glu Val Arg Gly His Ser Thr Arg Pro Leu 1325 1330
1335Ala Val Glu Val Val Thr Glu Asp Leu Pro Gln Leu Gly
Asp Leu 1340 1345 1350Ala Val Ser Glu
Val Gly Trp Asp Gly Leu Arg Leu Asn Trp Thr 1355
1360 1365Ala Ala Asp Asn Ala Tyr Glu His Phe Val Ile
Gln Val Gln Glu 1370 1375 1380Val Asn
Lys Val Glu Ala Ala Gln Asn Leu Thr Leu Pro Gly Ser 1385
1390 1395Leu Arg Ala Val Asp Ile Pro Gly Leu Glu
Ala Ala Thr Pro Tyr 1400 1405 1410Arg
Val Ser Ile Tyr Gly Val Ile Arg Gly Tyr Arg Thr Pro Val 1415
1420 1425Leu Ser Ala Glu Ala Ser Thr Ala Lys
Glu Pro Glu Ile Gly Asn 1430 1435
1440Leu Asn Val Ser Asp Ile Thr Pro Glu Ser Phe Asn Leu Ser Trp
1445 1450 1455Met Ala Thr Asp Gly Ile
Phe Glu Thr Phe Thr Ile Glu Ile Ile 1460 1465
1470Asp Ser Asn Arg Leu Leu Glu Thr Val Glu Tyr Asn Ile Ser
Gly 1475 1480 1485Ala Glu Arg Thr Ala
His Ile Ser Gly Leu Pro Pro Ser Thr Asp 1490 1495
1500Phe Ile Val Tyr Leu Ser Gly Leu Ala Pro Ser Ile Arg
Thr Lys 1505 1510 1515Thr Ile Ser Ala
Thr Ala Thr Thr Glu Ala Leu Pro Leu Leu Glu 1520
1525 1530Asn Leu Thr Ile Ser Asp Ile Asn Pro Tyr Gly
Phe Thr Val Ser 1535 1540 1545Trp Met
Ala Ser Glu Asn Ala Phe Asp Ser Phe Leu Val Thr Val 1550
1555 1560Val Asp Ser Gly Lys Leu Leu Asp Pro Gln
Glu Phe Thr Leu Ser 1565 1570 1575Gly
Thr Gln Arg Lys Leu Glu Leu Arg Gly Leu Ile Thr Gly Ile 1580
1585 1590Gly Tyr Glu Val Met Val Ser Gly Phe
Thr Gln Gly His Gln Thr 1595 1600
1605Lys Pro Leu Arg Ala Glu Ile Val Thr Glu Ala Glu Pro Glu Val
1610 1615 1620Asp Asn Leu Leu Val Ser
Asp Ala Thr Pro Asp Gly Phe Arg Leu 1625 1630
1635Ser Trp Thr Ala Asp Glu Gly Val Phe Asp Asn Phe Val Leu
Lys 1640 1645 1650Ile Arg Asp Thr Lys
Lys Gln Ser Glu Pro Leu Glu Ile Thr Leu 1655 1660
1665Leu Ala Pro Glu Arg Thr Arg Asp Ile Thr Gly Leu Arg
Glu Ala 1670 1675 1680Thr Glu Tyr Glu
Ile Glu Leu Tyr Gly Ile Ser Lys Gly Arg Arg 1685
1690 1695Ser Gln Thr Val Ser Ala Ile Ala Thr Thr Ala
Met Gly Ser Pro 1700 1705 1710Lys Glu
Val Ile Phe Ser Asp Ile Thr Glu Asn Ser Ala Thr Val 1715
1720 1725Ser Trp Arg Ala Pro Thr Ala Gln Val Glu
Ser Phe Arg Ile Thr 1730 1735 1740Tyr
Val Pro Ile Thr Gly Gly Thr Pro Ser Met Val Thr Val Asp 1745
1750 1755Gly Thr Lys Thr Gln Thr Arg Leu Val
Lys Leu Ile Pro Gly Val 1760 1765
1770Glu Tyr Leu Val Ser Ile Ile Ala Met Lys Gly Phe Glu Glu Ser
1775 1780 1785Glu Pro Val Ser Gly Ser
Phe Thr Thr Ala Leu Asp Gly Pro Ser 1790 1795
1800Gly Leu Val Thr Ala Asn Ile Thr Asp Ser Glu Ala Leu Ala
Arg 1805 1810 1815Trp Gln Pro Ala Ile
Ala Thr Val Asp Ser Tyr Val Ile Ser Tyr 1820 1825
1830Thr Gly Glu Lys Val Pro Glu Ile Thr Arg Thr Val Ser
Gly Asn 1835 1840 1845Thr Val Glu Tyr
Ala Leu Thr Asp Leu Glu Pro Ala Thr Glu Tyr 1850
1855 1860Thr Leu Arg Ile Phe Ala Glu Lys Gly Pro Gln
Lys Ser Ser Thr 1865 1870 1875Ile Thr
Ala Lys Phe Thr Thr Asp Leu Asp Ser Pro Arg Asp Leu 1880
1885 1890Thr Ala Thr Glu Val Gln Ser Glu Thr Ala
Leu Leu Thr Trp Arg 1895 1900 1905Pro
Pro Arg Ala Ser Val Thr Gly Tyr Leu Leu Val Tyr Glu Ser 1910
1915 1920Val Asp Gly Thr Val Lys Glu Val Ile
Val Gly Pro Asp Thr Thr 1925 1930
1935Ser Tyr Ser Leu Ala Asp Leu Ser Pro Ser Thr His Tyr Thr Ala
1940 1945 1950Lys Ile Gln Ala Leu Asn
Gly Pro Leu Arg Ser Asn Met Ile Gln 1955 1960
1965Thr Ile Phe Thr Thr Ile Gly Leu Leu Tyr Pro Phe Pro Lys
Asp 1970 1975 1980Cys Ser Gln Ala Met
Leu Asn Gly Asp Thr Thr Ser Gly Leu Tyr 1985 1990
1995Thr Ile Tyr Leu Asn Gly Asp Lys Ala Glu Ala Leu Glu
Val Phe 2000 2005 2010Cys Asp Met Thr
Ser Asp Gly Gly Gly Trp Ile Val Phe Leu Arg 2015
2020 2025Arg Lys Asn Gly Arg Glu Asn Phe Tyr Gln Asn
Trp Lys Ala Tyr 2030 2035 2040Ala Ala
Gly Phe Gly Asp Arg Arg Glu Glu Phe Trp Leu Gly Leu 2045
2050 2055Asp Asn Leu Asn Lys Ile Thr Ala Gln Gly
Gln Tyr Glu Leu Arg 2060 2065 2070Val
Asp Leu Arg Asp His Gly Glu Thr Ala Phe Ala Val Tyr Asp 2075
2080 2085Lys Phe Ser Val Gly Asp Ala Lys Thr
Arg Tyr Lys Leu Lys Val 2090 2095
2100Glu Gly Tyr Ser Gly Thr Ala Gly Asp Ser Met Ala Tyr His Asn
2105 2110 2115Gly Arg Ser Phe Ser Thr
Phe Asp Lys Asp Thr Asp Ser Ala Ile 2120 2125
2130Thr Asn Cys Ala Leu Ser Tyr Lys Gly Ala Phe Trp Tyr Arg
Asn 2135 2140 2145Cys His Arg Val Asn
Leu Met Gly Arg Tyr Gly Asp Asn Asn His 2150 2155
2160Ser Gln Gly Val Asn Trp Phe His Trp Lys Gly His Glu
His Ser 2165 2170 2175Ile Gln Phe Ala
Glu Met Lys Leu Arg Pro Ser Asn Phe Arg Asn 2180
2185 2190Leu Glu Gly Arg Arg Lys Arg Ala 2195
220049531PRTHomo sapiens 49Met Asp Gly Ile Val Pro Asp Ile Ala
Val Gly Thr Lys Arg Gly Ser1 5 10
15Asp Glu Leu Phe Ser Thr Cys Val Thr Asn Gly Pro Phe Ile Met
Ser 20 25 30Ser Asn Ser Ala
Ser Ala Ala Asn Gly Asn Asp Ser Lys Lys Phe Lys 35
40 45Gly Asp Ser Arg Ser Ala Gly Val Pro Ser Arg Val
Ile His Ile Arg 50 55 60Lys Leu Pro
Ile Asp Val Thr Glu Gly Glu Val Ile Ser Leu Gly Leu65 70
75 80Pro Phe Gly Lys Val Thr Asn Leu
Leu Met Leu Lys Gly Lys Asn Gln 85 90
95Ala Phe Ile Glu Met Asn Thr Glu Glu Ala Ala Asn Thr Met
Val Asn 100 105 110Tyr Tyr Thr
Ser Val Thr Pro Val Leu Arg Gly Gln Pro Ile Tyr Ile 115
120 125Gln Phe Ser Asn His Lys Glu Leu Lys Thr Asp
Ser Ser Pro Asn Gln 130 135 140Ala Arg
Ala Gln Ala Ala Leu Gln Ala Val Asn Ser Val Gln Ser Gly145
150 155 160Asn Leu Ala Leu Ala Ala Ser
Ala Ala Ala Val Asp Ala Gly Met Ala 165
170 175Met Ala Gly Gln Ser Pro Val Leu Arg Ile Ile Val
Glu Asn Leu Phe 180 185 190Tyr
Pro Val Thr Leu Asp Val Leu His Gln Ile Phe Ser Lys Phe Gly 195
200 205Thr Val Leu Lys Ile Ile Thr Phe Thr
Lys Asn Asn Gln Phe Gln Ala 210 215
220Leu Leu Gln Tyr Ala Asp Pro Val Ser Ala Gln His Ala Lys Leu Ser225
230 235 240Leu Asp Gly Gln
Asn Ile Tyr Asn Ala Cys Cys Thr Leu Arg Ile Asp 245
250 255Phe Ser Lys Leu Thr Ser Leu Asn Val Lys
Tyr Asn Asn Asp Lys Ser 260 265
270Arg Asp Tyr Thr Arg Pro Asp Leu Pro Ser Gly Asp Ser Gln Pro Ser
275 280 285Leu Asp Gln Thr Met Ala Ala
Ala Phe Gly Leu Ser Val Pro Asn Val 290 295
300His Gly Ala Leu Ala Pro Leu Ala Ile Pro Ser Ala Ala Ala Ala
Ala305 310 315 320Ala Ala
Ala Gly Arg Ile Ala Ile Pro Gly Leu Ala Gly Ala Gly Asn
325 330 335Ser Val Leu Leu Val Ser Asn
Leu Asn Pro Glu Arg Val Thr Pro Gln 340 345
350Ser Leu Phe Ile Leu Phe Gly Val Tyr Gly Asp Val Gln Arg
Val Lys 355 360 365Ile Leu Phe Asn
Lys Lys Glu Asn Ala Leu Val Gln Met Ala Asp Gly 370
375 380Asn Gln Ala Gln Leu Ala Met Ser His Leu Asn Gly
His Lys Leu His385 390 395
400Gly Lys Pro Ile Arg Ile Thr Leu Ser Lys His Gln Asn Val Gln Leu
405 410 415Pro Arg Glu Gly Gln
Glu Asp Gln Gly Leu Thr Lys Asp Tyr Gly Asn 420
425 430Ser Pro Leu His Arg Phe Lys Lys Pro Gly Ser Lys
Asn Phe Gln Asn 435 440 445Ile Phe
Pro Pro Ser Ala Thr Leu His Leu Ser Asn Ile Pro Pro Ser 450
455 460Val Ser Glu Glu Asp Leu Lys Val Leu Phe Ser
Ser Asn Gly Gly Val465 470 475
480Val Lys Gly Phe Lys Phe Phe Gln Lys Asp Arg Lys Met Ala Leu Ile
485 490 495Gln Met Gly Ser
Val Glu Glu Ala Val Gln Ala Leu Ile Asp Leu His 500
505 510Asn His Asp Leu Gly Glu Asn His His Leu Arg
Val Ser Phe Ser Lys 515 520 525Ser
Thr Ile 53050317PRTHomo sapiens 50Met Arg Leu Gly Pro Arg Thr Ala Ala
Leu Gly Leu Leu Leu Leu Cys1 5 10
15Ala Ala Ala Ala Gly Ala Gly Lys Ala Glu Glu Leu His Tyr Pro
Leu 20 25 30Gly Glu Arg Arg
Ser Asp Tyr Asp Arg Glu Ala Leu Leu Gly Val Gln 35
40 45Glu Asp Val Asp Glu Tyr Val Lys Leu Gly His Glu
Glu Gln Gln Lys 50 55 60Arg Leu Gln
Ala Ile Ile Lys Lys Ile Asp Leu Asp Ser Asp Gly Phe65 70
75 80Leu Thr Glu Ser Glu Leu Ser Ser
Trp Ile Gln Met Ser Phe Lys His 85 90
95Tyr Ala Met Gln Glu Ala Lys Gln Gln Phe Val Glu Tyr Asp
Lys Asn 100 105 110Ser Asp Asp
Thr Val Thr Trp Asp Glu Tyr Asn Ile Gln Met Tyr Asp 115
120 125Arg Val Ile Asp Phe Asp Glu Asn Thr Ala Leu
Asp Asp Ala Glu Glu 130 135 140Glu Ser
Phe Arg Lys Leu His Leu Lys Asp Lys Lys Arg Phe Glu Lys145
150 155 160Ala Asn Gln Asp Ser Gly Pro
Gly Leu Ser Leu Glu Glu Phe Ile Ala 165
170 175Phe Glu His Pro Glu Glu Val Asp Tyr Met Thr Glu
Phe Val Ile Gln 180 185 190Glu
Ala Leu Glu Glu His Asp Lys Asn Gly Asp Gly Phe Val Ser Leu 195
200 205Glu Glu Phe Leu Gly Asp Tyr Arg Trp
Asp Pro Thr Ala Asn Glu Asp 210 215
220Pro Glu Trp Ile Leu Val Glu Lys Asp Arg Phe Val Asn Asp Tyr Asp225
230 235 240Lys Asp Asn Asp
Gly Arg Leu Asp Pro Gln Glu Leu Leu Pro Trp Val 245
250 255Val Pro Asn Asn Gln Gly Ile Ala Gln Glu
Glu Ala Leu His Leu Ile 260 265
270Asp Glu Met Asp Leu Asn Gly Asp Lys Lys Leu Ser Glu Glu Glu Ile
275 280 285Leu Glu Asn Pro Asp Leu Phe
Leu Thr Ser Glu Ala Thr Asp Tyr Gly 290 295
300Arg Gln Leu His Asp Asp Tyr Phe Tyr His Asp Glu Leu305
310 31551326PRTHomo sapiens 51Met Ser Asn Gly Tyr
Glu Asp His Met Ala Glu Asp Cys Arg Gly Asp1 5
10 15Ile Gly Arg Thr Asn Leu Ile Val Asn Tyr Leu
Pro Gln Asn Met Thr 20 25
30Gln Asp Glu Leu Arg Ser Leu Phe Ser Ser Ile Gly Glu Val Glu Ser
35 40 45Ala Lys Leu Ile Arg Asp Lys Val
Ala Gly His Ser Leu Gly Tyr Gly 50 55
60Phe Val Asn Tyr Val Thr Ala Lys Asp Ala Glu Arg Ala Ile Asn Thr65
70 75 80Leu Asn Gly Leu Arg
Leu Gln Ser Lys Thr Ile Lys Val Ser Tyr Ala 85
90 95Arg Pro Ser Ser Glu Val Ile Lys Asp Ala Asn
Leu Tyr Ile Ser Gly 100 105
110Leu Pro Arg Thr Met Thr Gln Lys Asp Val Glu Asp Met Phe Ser Arg
115 120 125Phe Gly Arg Ile Ile Asn Ser
Arg Val Leu Val Asp Gln Thr Thr Gly 130 135
140Leu Ser Arg Gly Val Ala Phe Ile Arg Phe Asp Lys Arg Ser Glu
Ala145 150 155 160Glu Glu
Ala Ile Thr Ser Phe Asn Gly His Lys Pro Pro Gly Ser Ser
165 170 175Glu Pro Ile Ala Val Lys Phe
Ala Ala Asn Pro Asn Gln Asn Lys Asn 180 185
190Val Ala Leu Leu Ser Gln Leu Tyr His Ser Pro Ala Arg Arg
Phe Gly 195 200 205Gly Pro Val His
His Gln Ala Gln Arg Phe Arg Phe Ser Pro Met Gly 210
215 220Val Asp His Met Ser Gly Leu Ser Gly Val Asn Val
Pro Gly Asn Ala225 230 235
240Ser Ser Gly Trp Cys Ile Phe Ile Tyr Asn Leu Gly Gln Asp Ala Asp
245 250 255Glu Gly Ile Leu Trp
Gln Met Phe Gly Pro Phe Gly Ala Val Thr Asn 260
265 270Val Lys Val Ile Arg Asp Phe Asn Thr Asn Lys Cys
Lys Gly Phe Gly 275 280 285Phe Val
Thr Met Thr Asn Tyr Glu Glu Ala Ala Met Ala Ile Ala Ser 290
295 300Leu Asn Gly Tyr Arg Leu Gly Asp Lys Ile Leu
Gln Val Ser Phe Lys305 310 315
320Thr Asn Lys Ser His Lys 3255225DNAArtificial
SequencePrimer 3PEI 52ccgcgaattc tcatctcagg gtgag
25538PRTArtificial SequenceFLAG epitope 53Asp Tyr Lys
Asp Asp Asp Asp Lys1 55410PRTHomo sapiens 54Gln Gly Leu Ile
Ser Arg Gly Tyr Ser Tyr1 5 105511PRTHomo
sapiens 55Ala Val Arg Asp Ile Ser Glu Ala Ser Val Phe1 5
10
User Contributions:
Comment about this patent or add new information about this topic: