Patent application title: WATER-SOLUBLE TRANS-MEMBRANE PROTEINS AND METHODS FOR THE PREPARATION AND USE THEREOF
Inventors:
IPC8 Class: AG16B1500FI
USPC Class:
1 1
Class name:
Publication date: 2020-06-04
Patent application number: 20200176073
Abstract:
The present invention is directed to water-soluble membrane proteins,
methods for the preparation thereof and methods of use thereof.Claims:
1. A computer implemented method for executing a procedure to select a
water-soluble variant of a G Protein-Coupled Receptor (GPCR), the method
comprising: (1) entering a sequence of the GPCR for analysis; (2)
obtaining a variant of the GPCR, wherein a plurality of hydrophobic amino
acids in the transmembrane (TM) domain alpha-helical segments ("TM
regions") of the GPCR are substituted, wherein: (a) said hydrophobic
amino acids are selected from the group consisting of Leucine (L),
Isoleucine (I), Valine (V), and Phenylalanine (F); (b) each said Leucine
(L) is independently substituted by Glutamine (Q), Asparagine (N), or
Serine (S); (c) each said Isoleucine (I) and said Valine (V) are
independently substituted by Threonine (T), Asparagine (N), or Serine
(S); and, (d) each said Phenylalanine is substituted by Tyrosine (Y);
and, subsequently, (3) obtaining an .alpha.-helical secondary structure
result for the variant to verify maintenance of .alpha.-helical secondary
structures in the variant; and (4) obtaining a trans-membrane region
result for the variant to verify water solubility of the variant, thereby
selecting the water-soluble variant of the GPCR.
2. The method of claim 1, wherein step (3) is performed prior to, concurrently with, or after step (4).
3. The method of claim 1, wherein in step (2), one subset of said plurality of hydrophobic amino acids in one and the same TM region of the GPCR are substituted to generate one member of a library of potential variants, and one or more different subsets of said plurality of hydrophobic amino acids are substituted to generate additional members of the library.
4. The method of claim 3, further comprising ranking all members of said library based on a combined score, wherein the combined score is a weighed combination of the .alpha.-helical secondary structure prediction result and the trans-membrane region prediction result.
5. The method of claim 1 further comprising ranking the variant using a ranking function.
6-7. (canceled)
8. The method of claim 5 wherein the ranking function includes a secondary structure component and a water solubility component.
9. The method of claim 8 wherein the ranking function includes a weighting value for the secondary structure component and/or the water solubility component.
10. The method of claim 4, further comprising: selecting N members with the highest combined scores to form a first library of potential variants for said TM region, wherein N is a pre-determined integer (e.g., 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more).
11. The method of claim 10, further comprising generating one library of potential variants for 1, 2, 3, 4, 5, or all 6 other TM regions of the GPCR.
12. The method of claim 11 further comprising: replacing two or more TM regions of the GPCR with corresponding TM regions from the libraries of potential variants, to create a library of combinatory variants.
13. The method of claim 1, wherein substantially all said leucines are substituted by glutamines, substantially all said isoleucines are substituted by threonines, substantially all said valines are substituted by threonines, or substantially all said phenylalanines are substituted by tyrosines.
14-16. (canceled)
17. The method of claim 1, wherein one or more said leucines are not substituted, one or more said isoleucines are not substituted, one or more said valines are not substituted, or one or more said phenylalanines are not substituted.
18-20. (canceled)
21. The method of claim 1, further comprising producing/expressing said combinatory variants.
22. The method of claim 1, further comprising testing said combinatory variants for ligand binding (e.g., in yeast two-hybrid system), wherein those having substantially the same ligand binding compared to that of the GPCR are selected.
23. The method of claim 1, further comprising testing said combinatory variants for a biological function of the GPCR, wherein those having substantially the same biological function compared to that of the GPCR are selected.
24-26. (canceled)
27. The method of claim 1, wherein the TM regions of the GPCR are predicted based on the sequence of the GPCR.
28. The method of claim 27, wherein the TM regions of the GPCR are predicted using TMHMM 2.0 (TransMembrane prediction using Hidden Markov Models) software module.
29-33. (canceled)
34. A water-soluble variant of a G Protein-Coupled Receptor (GPCR), wherein: a plurality of hydrophobic amino acids in the transmembrane (TM) domain alpha-helical segments ("TM regions") of the GPCR are substituted, wherein: (a) said hydrophobic amino acids are selected from the group consisting of Leucine (L), Isoleucine (I), Valine (V), and Phenylalanine (F); (b) each said Leucine (L) is independently substituted by Glutamine (Q), Asparagine (N), or Serine (S); (c) each said Isoleucine (I) and said Valine (V) are independently substituted by Threonine (T), Asparagine (N), or Serine (S); and, (d) each said Phenylalanine is substituted by Tyrosine (Y); and, subsequently, all seven TM regions of the variant maintains .alpha.-helical secondary structures; and, there is no predicted trans-membrane region.
35-58. (canceled)
59. A non-transitory computer readable medium having stored thereon a sequence of instructions to perform the method of any of claims claim 1.
60. A data processing system operative to select a water soluble variant of a G protein coupled receptor comprising: a data processor operative to perform substitution of amino acids as claimed in claim 1 wherein the processor ranks a protein variant with a ranking function; and a memory connected to the data processor that stores a substituted amino acid structure and a rank.
Description:
REFERENCE TO RELATED APPLICATIONS
[0001] This application is a Continuation of U.S. patent application Ser. No. 14/723,399 filed on May 27, 2015, now U.S. Pat. No. 10,373,702, which is a Continuation-in-Part of U.S. patent application Ser. No. 14/669,753, now abandoned, and a Continuation-in-Part of International Application No. PCT/US2015/022780, both filed on Mar. 26, 2015; both of which claim the benefit of the filing dates under 35 U.S.C. .sctn. 119(e) of U.S. Provisional Application No. 62/117,550 filed on Feb. 18, 2015, U.S. Provisional Application No. 61/993,783 filed on May 15, 2014, and U.S. Provisional Application No. 61/971,388 filed on Mar. 27, 2014.
[0002] U.S. Ser. No. 14/723,399 also claims the benefit of the filing date under 35 U.S.C. .sctn. 119(e) to U.S. Provisional Application No. 62/117,550 filed on Feb. 18, 2015, U.S. Provisional Application No. 61/993,783 filed on May 15, 2014, and U.S. Provisional Application No. 61/971,388 filed on Mar. 27, 2014.
[0003] The entire contents of each of the above referenced applications, including all drawings and sequence listings, are incorporated herein by reference.
REFERENCE TO SEQUENCE LISTINGS
[0004] This application contains a Sequence Listing which has been submitted electronically via EFS-web in ASCII format, and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jan. 22, 2020, is named 122288-08908_SeqList.txt, and is 371,513 bytes in size.
BACKGROUND OF THE INVENTION
[0005] Membrane proteins play vital roles in all living systems. Approximately .about.30% of all genes in almost all sequenced genomes code for membrane proteins. However, our detailed understanding of their structure and function lags far behind that of soluble proteins. As of March 2015, there are over 100,000 structures in the Protein Data Bank. However, there are only 945 membrane protein structures with 530 unique structures including 28 G-protein coupled receptors and no tetraspanin membrane proteins.
[0006] There are several bottlenecks in elucidating the structure and function of membrane receptors and their recognition and ligand-binding properties although they are of great interest. The most critical and challenging task is that it is extremely difficult to produce milligram quantities of soluble and stable receptors. Inexpensive large-scale production methods are desperately needed, and have thus been the focus of extensive research. It is only possible to conduct detailed structural studies once these preliminary obstacles have been surmounted.
[0007] Zhang et al. (U.S. Pat. No. 8,637,452), incorporated herein by reference, describes an improved process for water solubilizing GPCRs wherein certain hydrophobic amino acids located in the transmembrane regions were substituted by polar amino acids. However, the process is labor-intensive. Further, while the modified transmembrane regions met the water-soluble criteria, improvements in water solubility and ligand binding are desired. Therefore, there is a need in the art for improved methods of studying G-protein coupled receptors.
SUMMARY OF THE INVENTION
[0008] The present invention is directed to a method of designing, selecting and/or producing water-soluble membrane proteins and peptides, peptides (and transmembrane domains) designed, selected or produced therefrom, compositions comprising said peptides, and methods of use thereof. In particular, the method relates to a process for designing a library of water soluble membrane peptides, such as GPCR variants and tetraspanin membrane proteins, using the "QTY Principle," changing the water-insoluble amino acids (Leu, Ile, Val and Phe, or the simple letter code L, I, V, F) into water-soluble, non-ionic amino acids (Gln, Thr and Tyr, or the simple letter code Q, T, Y). Furthermore, two additional non-ionic amino acids Asn (N) and Ser (S) may also be used for the substitution for L, I and V but not for F. In the embodiments discussed below, it is to be understood that Asn (N) and Ser (S) are envisioned as being substitutable for Q and T (as a variant is described) or L, I or V (as a native protein is described). For the purposes of brevity, however, the application does not further elaborate the details of these alternative embodiments as these are known to those skilled in the art as a result of the teaching herein.
[0009] The invention encompasses a modified, synthetic, and/or non-naturally occurring, .alpha.-helical domain(s) and water-soluble polypeptide (e.g., "sGPCR") comprising such modified .alpha.-helical domain(s), wherein the modified .alpha.-helical domain(s) comprise an amino acid sequence in which a plurality of hydrophobic amino acid residues (L, I V, F) within a .alpha.-helical domain of a native membrane protein are replaced with hydrophilic, non-ionic amino acid residues (Q, T, T, Y, respectively, or "Q, T, Y") and/or N and S. The invention also encompasses a method of preparing a water-soluble polypeptide comprising replacing a plurality of hydrophobic amino acid residues (L, I, V, F) within the .alpha.-helical domain(s) of a native membrane protein with hydrophilic, non-ionic amino acid residues (Q/N/S, T/N/S, Y). The invention additionally encompasses a polypeptide prepared by replacing a plurality of hydrophobic amino acid residues (L, I, V, F) within the .alpha.-helical domain of a native membrane protein with hydrophilic, non-ionic amino acid residues (Q/N/S, T/N/S, Y, respectively). The variant can be characterized by the name of the parent or native protein (e.g., CXCR4) followed by the abbreviation "QTY" (e.g., CXCR4-QTY).
[0010] Thus one aspect of the invention provides a computer implemented method for executing a procedure to select a water-soluble variant of a membrane protein (e.g., a G Protein-Coupled Receptor (GPCR)), the method comprising: (1) entering a sequence of the membrane protein (e.g., GPCR) for analysis; (2) obtaining a variant of the membrane protein (e.g., GPCR), wherein a plurality of hydrophobic amino acids in the transmembrane (TM) domain alpha-helical segments ("TM regions") of the membrane protein (e.g., GPCR) are substituted, wherein: (a) said hydrophobic amino acids are selected from the group consisting of Leucine (L), Isoleucine (I), Valine (V), and Phenylalanine (F); (b) each said Leucine (L) is independently substituted by Glutamine (Q), Asparagine (N), or Serine (S); (c) each said Isoleucine (I) and said Valine (V) are independently substituted by Threonine (T), Asparagine (N), or Serine (S); and, (d) each said Phenylalanine is substituted by Tyrosine (Y); and, subsequently, (3) obtaining an .alpha.-helical secondary structure result for the variant to verify maintenance of .alpha.-helical secondary structures in the variant; (4) obtaining a trans-membrane region result for the variant to verify water solubility of the variant, thereby designing the water-soluble variant of the membrane protein (e.g., GPCR).
[0011] In certain embodiments, step (3) is performed prior to, concurrently with, or after step (4). Additional steps, as described herein, can be incorporated into the above processing sequence. Processing preferably uses computational steps preformed by a data processing system. The system utilizes automated computational systems and methods to select protein variants.
[0012] In certain embodiments, in step (2), one subset of said plurality of hydrophobic amino acids in one and the same TM region of the GPCR are substituted to generate one member of a library of potential variants, and one or more different subsets of said plurality of hydrophobic amino acids are substituted to generate additional members of the library. In certain embodiments, the method may further comprising ranking all members of said library based on a combined score, wherein the combined score is a weighed combination of the .alpha.-helical secondary structure prediction result and the trans-membrane region prediction result. In certain embodiments, the method further comprises ranking the variant using a ranking function. In certain embodiments, the ranking function may include a secondary structure component and a water solubility component. For example, the ranking function may include a weighting value for the secondary structure component and/or the water solubility component. In certain embodiments, the method further comprises performing the method with a data processor, which may further comprise a memory connected thereto.
[0013] In certain embodiments, the method may further comprising selecting N members with the highest combined scores to form a first library of potential variants for said TM region, wherein N is a pre-determined integer (e.g., 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more). In certain embodiments, the method may further comprising generating one library of potential variants for 1, 2, 3, 4, 5, or all 6 other TM regions of the GPCR. In certain embodiments, the method may further comprising replacing two or more TM regions of the GPCR with corresponding TM regions from the libraries of potential variants, to create a library of combinatory variants. In certain embodiments, the method further comprises producing/expressing said combinatory variants. In certain embodiments, the method further comprises testing said combinatory variants for ligand binding (e.g., in yeast two-hybrid system), wherein those having substantially the same ligand binding compared to that of the GPCR are selected. In certain embodiments, the method further comprises testing said combinatory variants for a biological function of the GPCR, wherein those having substantially the same biological function compared to that of the GPCR are selected.
[0014] Certain water-soluble polypeptides of the invention possess the ability to bind the ligand which normally binds to the wild type or native membrane protein (e.g., GPCR). In certain embodiments, the amino acids within potential ligand binding sites of the native membrane protein (e.g., GPCRs) are not replaced and/or the sequences of the extracellular and/or intracellular domains of the native membrane proteins (e.g., GPCRs) are identical.
[0015] The (non-ionic) hydrophilic residues (which replace one or more hydrophobic residues in the .alpha.-helical domain of a native membrane protein) are selected from the group consisting of: glutamine (Q), threonine (T), tyrosine (Y), Asparagine (N), and serine (S), and any combinations thereof. In additional aspects, the hydrophobic residues selected from leucine (L), isoleucine (I), valine (V), and phenylalanine (F) are replaced. In certain embodiments, the phenylalanine residues of the .alpha.-helical domain of the protein are replaced with tyrosine; each of the isoleucine and/or valine residues of the .alpha.-helical domain of the protein are independently replaced with threonine (or S or N); and/or each of the leucine residues of the .alpha.-helical domain of the protein are independently replaced with glutamine (or S or N).
[0016] In certain embodiments, substantially all (e.g., 96%, 97%, 98%, 99%, or 100%), or 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95% of said leucines are substituted by glutamines. In certain embodiments, substantially all (e.g., 96%, 97%, 98%, 99%, or 100%), or 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95% of said isoleucines are substituted by threonines. In certain embodiments, substantially all (e.g., 96%, 97%, 98%, 99%, or 100%), or 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95% of said valines are substituted by threonines. In certain embodiments, substantially all (e.g., 96%, 97%, 98%, 99%, or 100%), or 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95% of said phenylalanines are substituted by tyrosines. In certain embodiments, one or more (e.g., 1, 2, or 3) said leucines are not substituted. In certain embodiments, one or more (e.g., 1, 2, or 3) said isoleucines are not substituted. In certain embodiments, one or more (e.g., 1, 2, or 3) said valines are not substituted. In certain embodiments, one or more (e.g., 1, 2, or 3) said phenylalanines are not substituted.
[0017] In certain embodiments, the library of combinatory variants comprises less than about 2 million members. In certain embodiments, the sequence of the GPCR comprises information about the TM regions of the GPCR. In certain embodiments, the sequence of the GPCR is obtained from a protein structure database (e.g., PDB, UniProt). In certain embodiments, the TM regions of the GPCR are predicted based on the sequence of the GPCR. For example, the TM regions of the GPCR can be predicted using TMHMM 2.0 (TransMembrane prediction using Hidden Markov Models) software module/package. In certain embodiments, the TMHMM 2.0 software module/package utilizes a dynamic baseline for peak searching.
[0018] In certain embodiments, the method further comprises providing a polynucleotide sequence for each variants of the GPCR. The polynucleotide sequence may be codon optimized for expression in a host (e.g., a bacterium such as E. coli, a yeast such as S. cerevisae or S. pombe, an insect cell such as Sf9 cell, a non-human mammalian cell, or a human cell).
[0019] In certain embodiments, the scripted procedure can comprise VBA scripts. In certain embodiments, the scripted procedure is operable in a Linux system (e.g., Ubuntu 12.04 LTS), a Unix system, a Microsoft Windows operating system, an Android operating system, or an Apple iOS operating system. Different programming language including C.sup.++T Java Script, MATLAB, etc. can be used in conjunction with implementations of the present invention. Coded instructions can be stored on a memory device, such as a non-transitory computer readable medium, that can be used with a computer system known to those skilled in the art.
[0020] In certain embodiments, the .alpha.-helical domain is one of 7-transmembrane .alpha.-helical domains in a native membrane protein is a G-protein coupled receptor (GPCR). In some embodiments, the GPCR is selected from the group consisting of: purinergic receptors (P2Y.sub.1, P2Y.sub.2, P2Y.sub.4, P2Y.sub.6), M.sub.1 and M.sub.3 muscarinic acetylcholine receptors, receptors for thrombin (protease-activated receptor (PAR)-1, PAR-2), thromboxane (TXA.sub.2), sphingosine 1-phosphate (S1P.sub.2, S1P.sub.3, S1P.sub.4 and S1P.sub.5), lysophosphatidic acid (LPA.sub.1, LPA.sub.2, LPA.sub.3), angiotensin II (AT.sub.1), serotonin (5-HT.sub.2c and 5-HT.sub.4), somatostatin (sst.sub.5), endothelin (ET.sub.A and ET.sub.B), cholecystokinin (CCK.sub.1), V.sub.1a vasopressin receptors, D.sub.5 dopamine receptors, fMLP formyl peptide receptors, GAL.sub.2 galanin receptors, EP.sub.3 prostanoid receptors, A.sub.1 adenosine receptors, .alpha..sub.1 adrenergic receptors, BB.sub.2 bombesin receptors, B.sub.2 bradykinin receptors, calcium-sensing receptors, chemokine receptors, KSHV-ORF74 chemokine receptors, NK.sub.1 tachykinin receptors, thyroid-stimulating hormone (TSH) receptors, protease-activated receptors, neuropeptide receptors, adenosine A2B receptors, P2Y purinoceptors, metabolic glutamate receptors, GRK5, GPCR-30, and CXCR4.
[0021] In other embodiments, the native membrane protein or membrane protein is an integral membrane protein. In a further aspect, the native membrane protein is a mammalian protein. The proteins of the invention are preferably human. In certain embodiments, references to specific GPCR proteins (e.g., CXCR4) refer to mammalian GPCRs, such as non-human mammalian GPCRs, or human GPCRs.
[0022] In some embodiments, the .alpha.-helical domain is one of 7-transmembrane .alpha.-helical domains in a G-protein coupled receptor (GPCR) variant modified, for example, in the extracellular or intracellular loops to improve or alter ligand binding, as described elsewhere in the literature. For the purposes of this invention, the word "native" or "wild type" is intended to refer to the protein (or .alpha.-helical domain) prior to water solubilization in accordance with the methods described herein.
[0023] In certain embodiments, the membrane protein can be a tetraspanin membrane protein characterized by 4 transmembrane alpha-helices. Approximately 54 human tetraspanin membrane proteins have been reviewed and annotated. Many are known to mediate cellular signal transduction events that play a critical role in regulation of cell development, activation, growth and motility. For example, CD81 receptor plays a critical role as the receptor for Hepatitis C virus entry and plasmodium infection. CD81 gene is localized in the tumor-suppressor gene region and can be a candidate for mediating cancer malignancies. CD151 is involved in enhanced cell motility, invasion and metastasis of cancer cells. Expression of CD63 correlates with the invasiveness of ovarian cancer. A characteristic of a tetraspanin membrane protein is a Cysteine-cysteine-glycine motif in the second, or large, extracellular loop.
[0024] Another aspect of the invention provides a water-soluble variant of a G Protein-Coupled Receptor (GPCR), wherein: (1) a plurality of hydrophobic amino acids in the transmembrane (TM) domain alpha-helical segments ("TM regions") of the GPCR are substituted, wherein: (a) said hydrophobic amino acids are selected from the group consisting of Leucine (L), Isoleucine (I), Valine (V), and Phenylalanine (F); (b) each said Leucine (L) is independently substituted by Glutamine (Q), Asparagine (N), or Serine (S); (c) each said Isoleucine (I) and said Valine (V) are independently substituted by Threonine (T), Asparagine (N), or Serine (S); and, (d) each said Phenylalanine is substituted by Tyrosine (Y); and, subsequently, (2) all seven TM regions of the variant maintains .alpha.-helical secondary structures; and, (3) there is no predicted trans-membrane region.
[0025] In certain embodiments, the water-soluble variant comprises one or more amino acid sequences selected from the group consisting of SEQ ID NOs: 4-11, 13-20, 22-29, 31-38, 40-47, 49-56, and 58-64. It may further comprise one or more amino acid sequences selected from the group consisting of SEQ ID NOs: 3, 12, 21, 30, 39, 48, and 57. In certain embodiments, the water-soluble variant binds to a CXCR4 ligand.
[0026] In certain embodiments, the water-soluble variant comprises one or more amino acid sequences selected from the group consisting of SEQ ID NOs: 69-76, 78-85, 87, 89-96, 98-105, 107-114 and 116-123. It may further comprise one or more amino acid sequences selected from the group consisting of SEQ ID NOs: 68, 77, 86, 88, 97, 106, 115 and 124. In certain embodiments, the water-soluble variant binds to a CX3CR1 ligand.
[0027] In certain embodiments, the water-soluble variant comprises one or more amino acid sequences selected from the group consisting of SEQ ID NOs: 128-135, 137-144, 146-153, 155-162, 164-171, 173 and 175-182. It may further comprise one or more amino acid sequences selected from the group consisting of SEQ ID NOs: 127, 136, 145, 154, 163, 172, 174 and 183. In certain embodiments, the water-soluble variant binds to a CCR3 ligand.
[0028] In certain embodiments, the water-soluble variant comprises one or more amino acid sequences selected from the group consisting of SEQ ID NOs: 187-194, 196-203, 205-206, 208, 210-217, 219-225, 227-234. It may further comprise one or more amino acid sequences selected from the group consisting of SEQ ID NOs: 186, 195, 204, 207, 209, 218, 226, and 235. In certain embodiments, the water-soluble variant binds to a CCR5 ligand.
[0029] In certain embodiments, the water-soluble variant comprises one or more amino acid sequences selected from the group consisting of SEQ ID NOs: 236-243, 245-252, 254-261, 263-270, 272, 274-281, and 283-290. It may further comprise one or more amino acid sequences selected from the group consisting of SEQ ID NOs: 235, 244, 253, 262, 271, 273, 282 and 291. In certain embodiments, the water-soluble variant binds to a CXCR3 ligand.
[0030] In certain embodiments, the water-soluble variant comprises one or more transmembrane domains as set forth in any one of SEQ ID NOs: 2, 67, 126, 185, 327, 293, 295, 297, 299, 301, 303, 305, 307, 309, 311, 313, 315, 317, 319, 321, 323 or 325. In certain embodiments, the variant is water soluble and binds a ligand of a homologous native transmembrane protein.
[0031] Another aspect of the invention provides a method of producing a protein in a bacterium (e.g., an E. coli), comprising: (a) culturing the bacterium in a growth medium under a condition suitable for protein production; (b) fractioning a lysate of the bacterium to produce a soluble fraction and the insoluble pellet fraction; and, (c) isolating the protein from the soluble fraction; wherein: (1) the protein is a variant G-protein couple receptor (GPCR) of any one of claims 29-46; and, (2) the yield of the protein is at least 20 mg/L (e.g., 30 mg/L, 40 mg/L, 50 mg/L or more) of growth medium.
[0032] In certain embodiments, the bacterium is E. coli BL21, and the growth medium is LB medium. In certain embodiments, the protein is encoded by a plasmid in the bacterium. In certain embodiments, expression of the protein is under the control of an inducible promoter, such as an inducible promoter inducible by IPTG. In certain embodiments, the lysate is produced by sonication. In certain embodiments, the soluble fraction is produced by centrifuging the lysate at 14,500.times.g or more.
[0033] Another aspect of the invention provides a method of treatment for a disorder or disease that is mediated by the activity a membrane protein in a subject in need thereof, comprising administering to said subject an effective amount of a water-soluble polypeptide described herein.
[0034] In certain embodiments, the water-soluble polypeptide retains the ligand-binding activity of the membrane protein. Examples of disorders and diseases that can be treated by administering a water-soluble peptide of the invention include, but are not limited to, cancer (such as, small cell lung cancer, melanoma, triple negative breast cancer), Parkinson's disease, cardiovascular disease, hypertension, and bronchial asthma.
[0035] Another aspect of the invention provides a pharmaceutical composition comprising a therapeutically effective amount of a water-soluble polypeptide of the invention and pharmaceutically acceptable carrier or diluent.
[0036] In yet another aspect, the invention provides a cell transfected with a subject water-soluble peptide comprising a modified .alpha.-helical domain. In certain embodiments, the cell is an animal cell (e.g., human, non-human mammalian, insect, avian, fish, reptile, amphibian, or other cell), yeast or a bacterial cell.
[0037] The invention also includes a computer implemented method performed on a computer system, the method comprising one or more of the methods (or steps thereof) as described herein. Computer systems including a non-transient computer readable medium having computer-executable instructions stored thereon, the computer-executable instructions when executed by the computer system causing the computer system to perform the methods the computer-executable instructions when executed by the computer system causing the computer system to perform the methods contemplated herein. Additionally, computer systems comprising at least one memory to store sequence data and quantitative results described herein and at least one processor coupled to the memory, the processor being configured to perform the methods described herein are contemplated. A user interface, such as a graphical user interface (GUI) in conjunction with an electronic display device can be used to select processing parameters that are operative to control the selection process, including computational methods described herein.
[0038] Another aspect of the invention provides a non-transitory computer readable medium having stored thereon a sequence of instructions to perform any of the methods of the invention.
[0039] A further aspect of the invention provides a data processing system operative to select a water-soluble variant of a G Protein-Coupled Receptor comprising: a data processor operative to perform substitution of amino acids as in any of the methods of the invention, wherein the system ranks a protein variant with a ranking function.
[0040] It should be understood that all embodiments of the invention, including those described only under one aspect of the invention (e.g., screening method), are to be construed to be applicable to all aspects of the invention (e.g., water-soluble proteins or methods of use), and are to be construed to be combinable with any one or more additional embodiments of the invention unless explicitly disclaimed or otherwise improper, as should be readily understood by one of skill in the art.
BRIEF DESCRIPTION OF THE DRAWINGS
[0041] The foregoing and other objects, features and advantages of the invention will be apparent from the following more particular description of the representative embodiments of the invention, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating the principles of the invention.
[0042] FIGS. 1A-1D is the general illustration for the QTY Code that systematically substitutes the hydrophobic amino acids L, I, V and F to Q, T, T, Y, respectively (FIG. 1A). The molecular shapes of amino acids leucine and glutamine are similar; likewise, molecular shapes of isoleucine and valine are similar to threonine; and molecular shapes of phenylalanine and tyrosine are similar. Leucine, isoleucine, valine and phenylalanine are hydrophobic and cannot bind with water molecules. In contrast, glutamine can bind with 4 water molecules, 2 hydrogen donors, and 2 hydrogen acceptors; the --OH group on threonine and tyrosine can bind to 3 water molecules, 1 hydrogen donor and 2 acceptors.
[0043] FIG. 1B is a side view of an alpha helix. After applying the QTY Code of systematic amino acid changes, the alpha helix become water-soluble. FIG. 1C is a top view of an alpha helix before and after QTY Code substitution: the helix on the left is the natural membrane helix with mostly hydrophobic amino acids, the helix on the right is the same helix after applying QTY Code substitution. The helix now has most hydrophilic amino acids (FIG. 1D). Before QTY Code, the GPCR membrane proteins are surrounded by hydrophobic lipid molecules to embed them inside the lipid membrane (left portion of FIG. 1D). After applying QTY Code, the GPCR membrane proteins become water-soluble and no long need detergent to surround it for stabilization (right portion of FIG. 1D).
[0044] FIG. 2 is a TMHMM prediction for the transmembrane domain regions for CXCR4. The prediction shows 7 distinctive hydrophobic transmembrane segments. In contrast, in a TMHMM prediction for a variant of CXCR4 subject to the QTY substitution method of the invention (CXCR4-QTY), there are no distinctive 7 hydrophobic transmembrane segments visible anymore.
[0045] FIG. 3 illustrates the predicted alpha helical wheel structure of the fully QTY Code modified TM1 domain of CXCR4.
[0046] FIG. 4 is an illustration of the potential variants in each of the seven TM regions of a GPCR CXCR4.
[0047] FIGS. 5, 6, 7, and 8 are sequence alignments of the wild type proteins and QTY variants of CXCR4, CXCR3, CCR3 and CCR5, respectively. QTY Code is only applied to the 7 hydrophobic transmembrane segments, but not the extracellular and intracellular segments.
[0048] FIG. 9A is a flowchart of a representative embodiment of the process.
[0049] FIG. 9B is another flowchart of a representative embodiment of the process.
[0050] FIG. 10 is an illustration of the computer systems of the invention.
[0051] FIGS. 11A and 11B are schematic illustration of flowcharts setting forth processing steps of certain preferred embodiments of the invention.
DETAILED DESCRIPTION OF THE INVENTION
[0052] A description of preferred embodiments of the invention follows. The words "a" or "an" are meant to encompass one or more, unless otherwise specified.
[0053] In some aspects, the invention is directed to the use of the QTY (Glutamine, threonine and tyrosine) replacement (or "QTY Code") method (or "principle") to change the 7-transmembrane .alpha.-helix hydrophobic residues leucine (L), isoleucine (I), valine (V), and phenylalanine (F) of a native protein to the hydrophilic residues glutamine (Q), threonine (T) and tyrosine (Y). In certain embodiments, as described above, Asn (N) and Ser (S) can also be used as substitute residues for L, I and/or V, but not F. This invention can convert a water insoluble, native membrane protein to a more water-soluble counterpart that still maintains some or substantially all functions of the native protein.
[0054] The invention includes a process for designing water-soluble peptides. The process is described in terms of GPCR proteins as an illustrative example, with specificity in the first instance to human CCR3, CCR5, CXCR4, and CX3CR1. However, the general principle of the invention also applies to other proteins with transmembrane (.alpha.-helical) regions.
[0055] GPCRs typically have 7-transmembrane alpha-helices (7TM) and 8 loops (8NTM) connected by the seven TM regions. These transmembrane segments may be referred to as TM1, TM2, TM3, TM4, TM5, TM6 and TM7. The 8 non-transmembrane loops are divided into 4 extracellular loops EL1, EL2, EL3, and EL4, and 4 intracellular loops, IL1, IL2, IL3, and IL4, thus a total of 8 loops (including the N- and C-terminal loops that are each only connected to one TM region, and each has a free end). Thus a 7TM GPCR protein can be divided into 15 fragments based on the transmembrane and non-transmembrane features.
[0056] One aspect of the invention provides a process of operating a computer program to execute a scripted procedure to select, or make a water-soluble variant of a membrane protein (e.g., a G Protein-Coupled Receptor (GPCR)), the method comprising:
[0057] (1) entering a sequence of the membrane protein (e.g., GPCR) for analysis;
[0058] (2) obtaining a variant of the membrane protein (e.g., GPCR), wherein a plurality of hydrophobic amino acids in the transmembrane (TM) domain alpha-helical segments ("TM regions") of the membrane protein (e.g., GPCR) are substituted, wherein:
[0059] (a) said hydrophobic amino acids are selected from the group consisting of Leucine (L), Isoleucine (I), Valine (V), and Phenylalanine (F);
[0060] (b) each said Leucine (L) is independently substituted by Glutamine (Q), Asparagine (N), or Serine (S);
[0061] (c) each said Isoleucine (I) and said Valine (V) are independently substituted by Threonine (T), Asparagine (N), or Serine (S); and,
[0062] (d) each said Phenylalanine is substituted by Tyrosine (Y); and, subsequently,
[0063] (3) obtaining an .alpha.-helical secondary structure result for the variant to verify maintenance of .alpha.-helical secondary structures in the variant;
[0064] (4) obtaining a trans-membrane region result for the variant to verify water solubility of the variant, thereby selecting the water-soluble variant of the membrane protein (e.g., GPCR).
[0065] As used herein, "water-soluble variant of the (trans)membrane protein" or "water-soluble (trans)membrane variant" may be used interchangeably.
[0066] The exact sequence of carrying out the steps of the invention may be variable. For example, in certain embodiments, step (3) is performed prior to step (4). In certain embodiments, step (3) is performed concurrently with step (4). In certain embodiments, step (3) is performed after step (4).
[0067] In certain embodiments, the plurality of hydrophobic amino acids are randomly selected from all potential hydrophobic amino acids L, I, V, and F located on all TM regions of the protein. In certain embodiments, the plurality of hydrophobic amino acids is about 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% of all the potential hydrophobic amino acids L, I, V, and F located on all TM regions of the protein. In certain embodiments, the plurality of hydrophobic amino acids is no less than about 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50% of all the potential hydrophobic amino acids L, I, V, and F located on all TM regions of the protein. In certain embodiments, the plurality of hydrophobic amino acids is no more than about 95%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, or 50% of all the potential hydrophobic amino acids L, I, V, and F located on all TM regions of the protein. In certain embodiments, the randomly selected hydrophobic amino acids L, I, V, and F may be roughly evenly distributed on all TM regions, or may be preferentially or exclusively distributed on 1, 2, 3, 4, 5, or 6 TM regions.
[0068] In certain embodiments, every potential hydrophobic amino acids L, I, V, and F on all TM regions of the protein are substituted. For example, all L are independently substituted by Q (or S or N); and/or all I and V are independently substituted by T (or S or N); and/or all F are substituted by Y. In certain embodiments, all L are substituted by Q, all I and V are substituted by T, and all F are substituted by Y.
[0069] In certain embodiments, instead of randomly substituting selected hydrophobic amino acids L, I, V, and F in all TM regions, all substitutions can first be limited to any one of the TM regions (such as the most N-terminal or C-terminal TM region), and only desired substitution variants are selected as members of a library of potential variants. All members of the library differ in the substitutions in the chosen TM region, either due to the positions substituted (e.g., the 3.sup.rd vs. the 10.sup.th residue in the TM region is substituted), or due to the identity of the substituent residues (e.g., S vs. T for an I or V substitution), or both. The desired substitution variants are selected based on a pre-determined criteria, such as a scoring system that takes into consideration the .alpha.-helical secondary structure prediction result and/or the trans-membrane region prediction result.
[0070] This process can be repeated for 1, 2, 3, 4, 5, 6 additional TM regions of the protein, or all the remaining TM regions of the protein, each iteration creates a library of potential variants that can be stored in an electronic memory or database. Within the same library, all variants differ in the substitutions in the chosen TM region (see above), but are otherwise the same in the remaining TM regions and non-TM regions.
[0071] Domain swapping or shuffling using sequences from two or more such libraries creates combinatory variants having hydrophobic amino acids L, I, V, F substitutions in two or more TM regions. Depending on the number of members in each library, the total possible combinations of combinatory variants can approach millions with just a few members in each library. For example, for a GPCR having 7 TM regions, if there are 8 members in each of the seven libraries, the total number of combinatory variants based on the libraries will be 8.sup.7 or about 2.1 million. In certain embodiments, the library of combinatory variants comprises less than about 5, 4, 3, 2, 1, or 0.5 million members.
[0072] Thus in certain embodiments, in step (2), one subset of said plurality of hydrophobic amino acids in one and the same TM region of the protein (e.g., GPCR) are substituted to generate one member of a library of potential variants, and one or more different subsets of said plurality of hydrophobic amino acids are substituted to generate additional members of the library.
[0073] In certain embodiments, the method further comprises ranking all members of said library based on a combined score, wherein the combined score is a weighed combination of the .alpha.-helical secondary structure prediction result and the trans-membrane region prediction result.
[0074] As one of ordinary skill in the art would appreciate, the domains having different sequences will likely predict different water solubilities and propensities for alpha helical formation. One can assign "a score" to a specific predicted water solubility or range of solubilities, propensity to form alpha helical structure or range of propensities. The score can be quantitative (0,1) where 0 can represent, for example, a domain with an unacceptable predicted water solubility and 1 can represent, for example, a domain with an acceptable predicted water solubility. This score can be based on a threshold value, for example. Or, the score can be assessed on a scale, for example, between 1 and 10 establishing characterizing increasing degrees of water solubility. Or, the score can be quantitative, such as in describing the predicted solubility in terms of mg/mL. Upon assessing a score to each domain, the domain variants can be readily compared (or ranked) by one or, preferably, both of the scores to select domain variants that are both water soluble and form alpha helices. Thus, preferred embodiments can utilize a ranking function that can be used to compute the ranking data. Note also that water soluble proteins made based on the currently described system can be analyzed and characterized to provide input to the system such that those combinations of substitution that are not effective to achieve a given biological function can be used to constrain the computational model, thereby enabling a more efficient processing of the information.
[0075] For example, using the methods of the invention, one or more variants can be designed and produced in vitro and/or in vivo, and one or more biological functions of the variants can be determined based on any of many art-recognized methods. For GPCR, for example, ligand binding and/or downstream signal transduction by the variants can be compared to that of the wild-type GPCR, and the patterns of QTY substitution used to generate a specific variant can be associated with an enhanced, maintained, or diminished biological activity. Such structural-functional relationship information obtained based on one or more variants can be used for machine learning or impart additional constrain on the computational model of the invention, to more efficiently rank the variants created by the methods of the invention. Thus new potential variants having substitution patterns more closely matching that of a known successful variant can be ranked higher that another potential variant having substitution patterns less closely matching that of the known successful variant, or more closely matching that of a known unsuccessful variant.
[0076] The TMHMM program, when run as a standalone version of the software module/package (e.g., one for the Linux system), produces a score of between 0 and 1 that can be used to predict the propensity of forming transmembrane regions/proteins. The score can be used as a quantitative prediction for water solubility in the methods of the invention.
[0077] Thus in certain embodiment, the .alpha.-helical secondary structure component of a ranking function can be a quantitative score, such as 0.5 or 1 for having no predicted .alpha.-helical secondary structures, and 0 for having maintained predicted .alpha.-helical secondary structures. In certain embodiments, the trans-membrane region result can be provided by a TM region prediction program, such as TMHMM 2.0, which provides a numeric value between 0 and 1, with 0 being no predicted TM region, and 1 being the strongest propensity of forming TM region(s). Thus the two scores can be combined, either directly or with weighing, such that the combined score represents an overall assessment of maintained secondary structure as well as predicted water solubility (as measured by propensity to form TM regions). For example, a combined score of 0 indicates that the variant has no predicted TM region, while having maintained predicted .alpha.-helical secondary structures, and is thus a desired variant. Meanwhile, a variant has strong propensity to form TM region (due to the presence of large number of hydrophobic residues, for example), tends to have a larger combined score and thus undesirable under this scoring scheme.
[0078] In certain embodiments, the method includes eliminating variants having an .alpha.-helical secondary structure prediction result tending to show that the .alpha.-helical secondary structures are destroyed or disrupted. In certain embodiments, the method includes eliminating variants having trans-membrane region prediction result tending to show strong propensity to form TM regions. Thus the system can include a beaming module in which variants can be excluded from further selection processing.
[0079] In certain embodiments, the ranking function can be selected to include a weighing scheme that assigns 5%, 10%, 20%, 25%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or 95% weight to the .alpha.-helical secondary structure prediction result, and the remaining to the trans-membrane region prediction result. The user can either manually select the weighting features, or the software can automatically select the weighting features depending on the desired characteristics such as biological function.
[0080] In certain embodiments, the method further comprises selecting N members with the highest combined scores to form a first library of potential variants for said TM region, wherein N is a pre-determined integer (e.g., 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or more).
[0081] In certain embodiments, the method further comprises generating one library of potential variants for 1, 2, 3, 4, 5, 6, or all the remaining TM regions of the protein (e.g., GPCR). Each entry in the library can include fields used to define attributes of that entry, including the ranking data generated by one or more ranking functions.
[0082] In certain embodiments, the method further comprises replacing two or more (e.g., all) TM regions of the protein (e.g., GPCR) with corresponding TM regions from the libraries of potential variants, to create a library of combinatory variants. As used herein, "corresponding TM regions" refer to the TM regions in the libraries of potential variants that are the same or homologous to the TM regions of the protein (e.g., GPCR) that are being combined. For example, if the 2.sup.nd and 3.sup.rd TM regions from the N-terminal of a GPCR are to be substituted, TM region sequences from the library having substitutions only in the 2.sup.nd TM regions, and TM region sequences from the library having substitutions only in the 3.sup.rd TM regions, are imported/pasted/transferred into the 2.sup.nd and 3.sup.rd TM regions of the GPCR to create combinatory variants.
[0083] In certain embodiments, substantially all (e.g., 96%, 97%, 98%, 99%, or 100%), or 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95% of said leucines are substituted by glutamines. In certain embodiments, substantially all (e.g., 96%, 97%, 98%, 99%, or 100%), or 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95% of said isoleucines are substituted by threonines. In certain embodiments, substantially all (e.g., 96%, 97%, 98%, 99%, or 100%), or 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95% of said valines are substituted by threonines. In certain embodiments, wherein substantially all (e.g., 96%, 97%, 98%, 99%, or 100%), or 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95% of said phenylalanines are substituted by tyrosines. In certain embodiments, one or more (e.g., 1, 2, or 3) said leucines are not substituted. In certain embodiments, one or more (e.g., 1, 2, or 3) said isoleucines are not substituted. In certain embodiments, one or more (e.g., 1, 2, or 3) said valines are not substituted. In certain embodiments, one or more (e.g., 1, 2, or 3) said phenylalanines are not substituted.
[0084] In certain embodiments, the method further comprises producing/expressing said combinatory variants. In certain embodiments, the method further comprises testing said combinatory variants for ligand binding (e.g., in vitro, or in a biological system such as yeast two-hybrid system), wherein those having substantially the same ligand binding compared to that of the GPCR are selected. In certain embodiments, the method further comprises testing said combinatory variants for a biological function of the GPCR, wherein those having substantially the same biological function compared to that of the GPCR are selected.
[0085] In certain embodiments, the sequence of the TM protein (e.g., GPCR) contains information about the TM regions of the protein, e.g., the location of one or more transmembrane regions of the TM protein, such as the location of all TM regions. Such sequences may belong to proteins having resolved crystal structure with defined TM regions. Such sequences may also belong to proteins having annotated TM region information based on prior research, and such information is readily available from a public or proprietary database, such as PDB, UniProt, GenBank, EMBL, DBJ, etc.
[0086] The Protein Data Bank (PDB) is a weekly updated repository for the three-dimensional structural data of large biological molecules, such as proteins and nucleic acids. The data, typically obtained by X-ray crystallography or NMR spectroscopy and submitted by biologists and biochemists from around the world, are freely accessible on the Internet via the websites of its member organizations (PDBe, PDBj, and RCSB). The PDB is overseen by the Worldwide Protein Data Bank, wwPDB. The PDB is a key resource in areas of structural biology, such as structural genomics, and most major scientific journals, and some funding agencies, now require scientists to submit their structure data to the PDB.
[0087] If the contents of the PDB are thought of as primary data, then there are hundreds of derived (i.e., secondary) databases that categorize the data differently. For example, both SCOP and CATH categorize structures according to type of structure and assumed evolutionary relations; GO categorize structures based on genes; while crystallographic database store information about the 3D structure of the proteins. All such publically available database may be used to provide input sequence information, including information about the existence and position of transmembrane regions.
[0088] Another publically available database that can provide sequence information for use in the methods of the invention is UniProt. UniProt is a comprehensive, high-quality and freely accessible database of protein sequence and functional information, many entries being derived from genome sequencing projects. It contains a large amount of information about the biological function of proteins derived from the research literature.
[0089] UniProt provides four core databases: UniProtKB (with sub-parts Swiss-Prot and TrEMBL), UniParc, UniRef, and UniMes. Among them, UniProtKB/Swiss-Prot is a manually annotated, non-redundant protein sequence database that combines information extracted from scientific literature and biocurator-evaluated computational analysis. The aim of UniProtKB/Swiss-Prot is to provide all known relevant information about a particular protein. Annotation is regularly reviewed to keep up with current scientific findings. The manual annotation of an entry involves detailed analysis of the protein sequence and of the scientific literature. Sequences from the same gene and the same species are merged into the same database entry. Differences between sequences are identified, and their cause documented (e.g., alternative splicing, natural variation, etc.). Computer-predictions are manually evaluated, and relevant results selected for inclusion in the entry. These predictions include post-translational modifications, transmembrane domains and topology, signal peptides, domain identification, and protein family classification, all may be used to provide useful sequence information pertaining to the TM regions used in the methods of the invention.
[0090] In certain embodiments, the sequence of the TM protein (e.g., GPCR) does not contain information about the location of one or more (e.g., any) transmembrane regions. However, the TM region(s) can be predicted based on sequence homology with a related protein having known TM regions. For example, the related protein may be a homologous protein in a different species.
[0091] In certain embodiments, the sequence of the TM protein (e.g., GPCR) does not contain information about the location of one or more (e.g., any) transmembrane regions, and such information is not readily available based on known information. In this embodiment, the invention provides computation of TM regions using art-recognized methods, such as the TMHMM 2.0 (TransMembrane prediction using Hidden Markov Models) program, developed by Center for Biological Sequence Analysis. See further details regarding this below.
[0092] In certain embodiments, the method further comprises providing a polynucleotide sequence for each variants of the protein (e.g., GPCR). Such polynucleotide sequence can be readily generated based on the protein sequence of the protein (e.g., GPCR), and the known genetic code. In certain embodiments, the polynucleotide sequence is codon optimized for expression in a host. The host may be a bacterium such as E. coli, a yeast such as S. cerevisae or S. pombe, an insect cell such as Sf9 cell, a non-human mammalian cell, or a human cell.
[0093] In certain embodiments, the protein is a GPCR, such as one selected from the group consisting of: purinergic receptors (P2Y.sub.1, P2Y.sub.2, P2Y.sub.4, P2Y.sub.6), M.sub.1 and M.sub.3 muscarinic acetylcholine receptors, receptors for thrombin (protease-activated receptor (PAR)-1, PAR-2), thromboxane (TXA.sub.2), sphingosine 1-phosphate (S1P.sub.2, S1P.sub.3, S1P.sub.4 and S1P.sub.5), lysophosphatidic acid (LPA.sub.1, LPA.sub.2, LPA.sub.3), angiotensin II (AT.sub.1), serotonin (5-HT.sub.2c and 5-HT.sub.4), somatostatin (sst.sub.5), endothelin (ET.sub.A and ET.sub.B), cholecystokinin (CCK.sub.1), V.sub.1a vasopressin receptors, D.sub.5 dopamine receptors, fMLP formyl peptide receptors, GAL.sub.2 galanin receptors, EP.sub.3 prostanoid receptors, A.sub.1 adenosine receptors, .alpha..sub.1 adrenergic receptors, BB.sub.2 bombesin receptors, B.sub.2 bradykinin receptors, calcium-sensing receptors, chemokine receptors, KSHV-ORF74 chemokine receptors, NK.sub.1 tachykinin receptors, thyroid-stimulating hormone (TSH) receptors, protease-activated receptors, neuropeptide receptors, adenosine A2B receptors, P2Y purinoceptors, metabolic glutamate receptors, GRK5, GPCR-30, and CXCR4.
[0094] In certain embodiments, the scripted procedure of the method comprises VBA scripts.
[0095] In certain embodiments, the scripted procedure is operable in a Linux system (e.g., Ubuntu 12.04 LTS), a Microsoft Windows operative system, or an Apple iOS operative system.
[0096] In certain embodiments, the process comprises all, or substantially all, of the following steps:
[0097] (1) identifying a first transmembrane region of a (trans)membrane protein, if necessary, by predicting an alpha-helical structure of the protein (e.g., a GPCR);
[0098] (2) modifying a plurality of hydrophobic amino acids via the QTY Code, as defined herein to obtain a modified first transmembrane sequence;
[0099] (3) scoring the propensity of the alpha-helical structure of the first modified transmembrane sequence of (2) (e.g., in the context of a modified (trans)membrane protein having the first modified transmembrane sequence) to arrive at a structure score;
[0100] (4) scoring the water solubility prediction of the first modified transmembrane sequence of (2) (e.g., in the context of a modified (trans)membrane protein having the first modified transmembrane sequence) to arrive at a solubility score;
[0101] (5) repeating steps (2) through (4) to arrive at a first library of putative water soluble first modified transmembrane variants;
[0102] (6) comparing the structure scores and solubility scores of each putative water soluble first modified transmembrane variants in the first library and, preferably ranking the putative water soluble first modified transmembrane variants using said structure scores and solubility scores;
[0103] (7) selecting a plurality of putative water soluble first modified transmembrane variants (wherein the plurality is the integer, H, or preferably less than 10, 9, 8, 7, 6, 5 or 4) to arrive at a second library of putative water soluble first modified transmembrane variants;
[0104] (8) repeating steps (1) through (7) for a second, third, fourth, fifth, sixth, seventh or, preferably, all transmembrane regions of the protein (the sum of the transmembrane regions modified by the method being an integer n);
[0105] (9) identifying the amino acid sequences of the protein which are not included in any transmembrane region modified in steps (1) through (8), and including any extracellular or intracellular domain of the protein;
[0106] (10) generating combinatorial variants of putative water soluble modified transmembrane protein (see above); and,
[0107] (11) optionally, identifying a nucleic acid sequence for each putative water soluble modified transmembrane variant.
[0108] Using the nucleic acid sequences identified in the above process, nucleic acid sequences for each putative water-soluble modified transmembrane variant and each non-transmembrane domains (including the extracellular and intracellular domains) can be generated and combinatorially expressed to create a library of up to putative water-soluble transmembrane protein variants. For example, where H is 8 and n is 7, a library of approximately 2 million water-soluble protein variants can be designed.
[0109] Another aspect of the invention pertains to the expression of the water-soluble variant proteins (e.g., GPCR) designed based on the methods of the invention. This aspect of the invention is partly based on the surprising finding that the water-soluble variant proteins (e.g., GPCR) designed based on the methods of the invention can achieve high levels of expression in both in vitro cell-free expression system and expression in commonly used cell-based expression systems, such as E. coli. In addition, the expressed proteins are highly soluble, and can be easily purified from the soluble fraction of the expression system, such as the soluble fraction from the lysate of an E. coli culture, as opposed to the insoluble aggregates or pellets in which most membrane proteins are typically found.
[0110] Thus one aspect of the invention provides a method of producing a protein in a bacterium (e.g., an E. coli), comprising:
[0111] (a) culturing the bacterium in a growth medium under a condition suitable for protein production;
[0112] (b) fractioning a lysate of the bacterium to produce a soluble fraction and the insoluble pellet fraction; and,
[0113] (c) isolating the protein from the soluble fraction;
[0114] wherein:
[0115] (1) the protein is a subject variant protein (e.g., G-protein couple receptor (GPCR)) of the invention; and,
[0116] (2) the yield of the protein is at least 20 mg/L (e.g., 30 mg/L, 40 mg/L, 50 mg/L or more) of growth medium.
[0117] In certain embodiments, the bacterium is E. coli BL21, and the growth medium is LB medium. In certain embodiments, the protein is encoded by a plasmid in the bacterium. In certain embodiments, expression of the protein is under the control of an inducible promoter. For example, the inducible promoter may be inducible by IPTG. In certain embodiments, the lysate is produced by sonication. In certain embodiments, the soluble fraction is produced by centrifuging the lysate at 14,500.times.g or more.
[0118] With the general aspects of the inventions described above, certain features or specific embodiments of the invention are further described below.
Transmembrane Region Prediction
[0119] Certain methods of the invention comprise a step of predicting a transmembrane region of a protein, such as GPCR. There are many programs and software known in the art relating to the TM region, and any of which may be used individually or in combination in the methods of the invention where a TM region prediction step is called for. These programs usually have a very simple user interface, typically requiring the user to provide an input sequence of a specified format (such as FASTA or plain text), and provides prediction results using text or graphics or both. Some programs also offer more advanced features, such as allowing the user to specify certain parameters to fine tune the prediction results. All such programs can be used in the methods of the invention.
[0120] One exemplary TM region prediction program is TMHMM (hosted by Center for Biological Sequence Analysis, Technical University of Denmark), which method predicts 97-98% TM region helices correctly. It predicts transmembrane helices in proteins using the Hidden Markov Model. The input protein sequence can be the FASTA format, and the output can be presented as an html page with an image of predicted locations for the TM regions. In a study by Moller et al., entitled "Evaluation of Methods for the Prediction of Membrane Spanning Regions," Bioinformatics 17(7):646-653, 2001, TMHMM was determined to be the best performing transmembrane prediction program at the time of evaluation.
[0121] The programs compared in that study include the following, all can be used to predict TM region in the methods of the invention: TMHMM 1.0, 2.0, and a retrained version of 2.0 (Sonnhammer et al., Int. Conf. Intell. Syst. Mol. Biol. AAAI Press, Montreal, Canada, pp. 176-182, 1998; Krogh et al., J Mol Biol. 305(3):567-80, 2001); MEMSAT 1.5 (Jones et al., Biochemistry 33:3038-3049, 1994); Eisenberg (Eisenberg et al., Nature 299:371-374, 1982); Kyte/Doolittle (Kyte and Doolittle, J. Mol. Biol. 157:105-132, 1982); TMAP (Persson and Argos, J. Protein Chem. 16:453-457, 1997); DAS (Cserzo et al., Protein Eng. 10:673-676, 1997); HMMTOP (Tusnady and Simon, J. Mol. Biol. 283:489-506, 1998); SOSUI (Hirokawa et al., Bioinformatics 14:378-379, 1998); PHD (Rost et al., Int. Conf. Intell. Syst. Mol. Biol. AAAI Press, St. Louis, USA, pp. 192-200, 1996); TMpred (Hofmann and Stoffel, Biol. Chem. Hoppe-Seyler 374:166, 1993); KKD (Klein et al., Biochim. Biophys. Acta. 815:468-476, 1985); ALOM2 (Nakai and Kanehisa, Genomics 14:489-911, 1992); and Toppred 2 (Claros and Heijne, Comput. Appl. Biosci. 10:685-686, 1994). All references cited are incorporated herein by reference.
[0122] The principals of TMHMM is described in Krogh et al., Predicting transmembrane protein topology with a hidden Markov model: Application to complete genomes. Journal of Molecular Biology, 305(3):567-580, January 2001 (incorporated by reference); and Sonnhammer et al., A hidden Markov model for predicting transmembrane helices in protein sequences. In J. Glasgow, T. Littlejohn, F. Major, R. Lathrop, D. Sankoff, and C. Sensen, editors, Proceedings of the Sixth International Conference on Intelligent Systems for Molecular Biology, pages 175-182, Menlo Park, Calif., 1998, AAAI Press (incorporated by reference).
[0123] DAS (Dense Alignment Surface, Cserzo et al., "Prediction of transmembrane alpha-helices in procariotic membrane proteins: the Dense Alignment Surface method," Prot. Eng. 10(6): 673-676, 1997, Stockholm University, Sweden) predicts transmembrane regions using the Dense Alignment Surface method. DAS is based on low-stringency dot-plots of the query sequence against a set of library sequences--non-homologous membrane proteins--using a previously derived, special scoring matrix. The method provides a high precision hyrdophobicity profile for the query from which the location of the potential transmembrane segments can be obtained. The novelty of the DAS-TMfilter algorithm is a second prediction cycle to predict TM segments in the sequences of the TM-library. To use the DAS server, user enters a protein sequence at www dot sbc dot su dot se slash .about.miklos slash DAS, and the DA server will predict a TM region of the input sequence.
[0124] HMMTOP (Hungarian Academy of Sciences, Budapest) is an automatic server for predicting transmembrane helices and topology of proteins using Hidden Markov Model, developed by G. E. Tusnady, at the Institute of Enzymology. The method used by this prediction server is described in G. E Tusnady and I. Simon (1998) "Principles Governing Amino Acid Composition of Integral Membrane Proteins: Applications to Topology Prediction." J. Mol. Biol. 283: 489-506 (incorporated by reference). The new features of HMMTOP 2.0 version is described in `G. E Tusnady and I. Simon (2001) "The HMMTOP transmembrane topology prediction server," Bioinformatics 17: 849-850 (incorporated by reference).
[0125] MEMSAT2 Transmembrane Prediction Page (www dot sacs dot ucsf dot edu slash cgi-bin slash memsat dot py) predicts transmembrane segments in a protein using FASTA format or plain text as input. A related program, the MEMSAT (1.5) software, is copyrighted by Dr. David Jones (Jones et al., Biochemistry 33:3038-3049, 1994). The latest version of MEMSTAT, MEMSAT V3, is a widely used all-helical membrane protein prediction method MEMSAT. The method was benchmarked on a test set of transmembrane proteins of known topology. From sequence data MEMSAT was estimated to have an accuracy of over 78% at predicting the structure of all-helical transmembrane proteins and the location of their constituent helical elements within a membrane. MEMSATSVM is highly accurate predictor of transmembrane helix topology. It is capable of discriminating signal peptides and identifying the cytosolic and extra-cellular loops. MEMSAT3 and MEMSATSVM are both parts of the PSIPRED Protein Sequence Analysis Workbench, which aggregates several structure prediction methods into one location at the University College London.
[0126] The Phobius server (phobius dot sbc dot su dot se) is for prediction of transmembrane topology and signal peptides from the amino acid sequence of a protein in FASTA format. Phobius is described in Lukas et al., "A Combined Transmembrane Topology and Signal Peptide Prediction Method," Journal of Molecular Biology 338(5):1027-1036, 2004). PoyPhobius is described in: Lukas et al., "An HMM posterior decoder for sequence feature prediction that includes homology information," Bioinformatics, 21 (Suppl 1):i251-i257, 2005. And the Phobius webserver is described in: Lukas et al., "Advantages of combined transmembrane topology and signal peptide prediction--the Phobius web server," Nucleic Acids Res. 35:W429-32, 2007 (all cited art incorporated by reference).
[0127] SOSUI is for the discrimination of membrane proteins and soluble ones together with the prediction of transmembrane helices. SOSUI predicts transmembrane regions using Hydrophobicity Analysis for Topology and Probe Helix Method for Tertial Structure. The accuracy of the classification of proteins is said to be as high as 99%, and the corresponding value for the transmembrane helix prediction is said to be about 97%. The system SOSUI is available through internet access www dot tuat dot ac dot jp slash mitaku slash sosui.
[0128] TMPred (European Molecular Biology Network, Swiss node) predicts transmembrane regions and protein orientation in a query sequence. Specifically, the TMPred algorithm is based on the statistical analysis of TMbase, a database of naturally occurring transmembrane proteins. The prediction is made using a combination of several weight-matrices for scoring. See Hofmann & Stoffel (1993) "TMbase--A database of membrane spanning proteins segments," Biol. Chem. Hoppe-Seyler, 374:166.
[0129] The SPLIT 4.0 server is a membrane protein secondary structure prediction server (split dot pmfst dot hr slash split slash 4) that predicts the transmembrane (TM) secondary structures of membrane proteins in SWISS-PROT format, using the method of preference functions. See Juretic et al., "Basic charge clusters and predictions of membrane protein topology," J. Chem. Inf. Comput. Sci., 42:620-632, 2002 (incorporated by reference).
[0130] PRED-TMR predicts transmembrane domains in proteins using solely the protein sequence itself. The algorithm refines a standard hydrophobicity analysis with a detection of potential termini ("edges," starts and ends) of transmembrane regions. This allows both to discard highly hydrophobic regions not delimited by clear start and end configurations and to confirm putative transmembrane segments not distinguishable by their hydrophobic composition. The accuracy obtained on a test set of 101 non-homologous transmembrane proteins with reliable topologies compares well with that of other popular existing methods. Only a slight decrease in prediction accuracy was observed when the algorithm was applied to all transmembrane proteins of the SwissProt database (release 35). See Pasquier et al., "A novel method for predicting transmembrane segments in proteins based on a statistical analysis of the SwissProt database: the PRED-TMR algorithm," Protein Eng., 12(5):381-385, 1999 (incorporated by reference).
[0131] In the related PRED-TMR2, the application has been extended with a pre-processing stage represented by an artificial neural network which is able to discriminate with a high accuracy transmembrane proteins from soluble or fibrous ones. Applied on several test sets of transmembrane proteins, the system gives a perfect prediction rating of 100% by classifying all the sequences in the transmembrane class. Applied on 995 non-transmembrane protein extracted from the PDBSELECT database, the neural network predicts falsely 23 of them to be transmembrane (97.7% of correct assignment). See Pasquier and Hamodrakas, "An hierarchical artificial neural network system for the classification of transmembrane proteins," Protein Eng., 12(8):631-634, 1999 (incorporated by reference).
Protein Alpha Helical Secondary Structure Prediction
[0132] Certain methods of the invention comprise a step of predicting alpha helical secondary structure of a protein, such as GPCR. There are many such programs and software known in the art, and any of which may be used individually or in combination in the methods of the invention where alpha helical secondary structure prediction step is called for. All such programs can be used in the methods of the invention.
[0133] Early methods of secondary-structure prediction were restricted to predicting the three predominate states: helix, sheet, or random coil. These methods were based on the helix- or sheet-forming propensities of individual amino acids, sometimes coupled with rules for estimating the free energy of forming secondary structure elements. Such methods were typically .about.60% accurate in predicting which of the three states (helix/sheet/coil) a residue adopts. The first widely used technique to predict protein secondary structure from the amino acid sequence was the Chou-Fasman method.
[0134] A significant increase in accuracy (to nearly .about.80%) was made by taking advantage of information provided by multiple sequence alignment; knowing the full distribution of amino acids that occur at a position (and in its vicinity, typically .about.7 residues on either side) throughout evolution provides a much better picture of the structural tendencies near that position. For example, a given protein might have a glycine at a given position, which by itself might suggest a random coil. However, multiple sequence alignment might reveal that helix-favoring amino acids occur at that position (and nearby positions) in 95% of homologous proteins throughout evolution. Moreover, by examining the average hydrophobicity at that and nearby positions, the same alignment might also suggest a pattern of residue solvent accessibility consistent with an .alpha.-helix. Taken together, these factors would suggest that the glycine of the original protein adopts .alpha.-helical structure, rather than random coil. Thus in the methods of the invention, the alpha helical secondary structure prediction program may combine all the available data to form a 3-state prediction, including neural networks, hidden Markov models and support vector machines. Such prediction methods also provide a confidence score for their predictions at every position.
[0135] Secondary-structure prediction methods are continuously benchmarked, e.g., EVA (benchmark) EVA is a continuously running benchmark project for assessing the quality of protein structure prediction and secondary structure prediction methods. Methods for predicting both secondary structure and tertiary structure--including homology modeling, protein threading, and contact order prediction--are compared to results from each week's newly solved protein structures deposited in the Protein Data Bank (PDB). The project aims to determine the prediction accuracy that would be expected for non-expert users of common, publicly available prediction webservers.
[0136] Based on these tests, the most accurate methods at present are Psipred, SAM (Karplus, "SAM-T08, HMM-based protein structure prediction," Nucleic Acids Res. (2009) 37 (Web Server issue): W492-497. doi:10.1093/nar/gkp403); PORTER (Pollastri & McLysaght, "Porter: a new, accurate server for protein secondary structure prediction," Bioinformatics 21 (8):1719-1720, 2005); PROF (Yachdav et al. (2014). "PredictProtein--an open resource for online prediction of protein structural and functional features," Nucleic Acids Res. 42 (Web Server issue): W337-343. doi:10.1093/nar/gku366); and SABLE (Adamczak et al. (2005) "Combining prediction of secondary structure and solvent accessibility in proteins," Proteins 59 (3): 467-475. doi:10.1002/prot.20441). In addition, the standard method for assigning secondary-structure classes (helix/strand/coil) to PDB structures is DSSP (Kabsch W and Sander (1983) "Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features," Biopolymers 22 (12): 2577-2637. doi:10.1002/bip.360221211), against which the predictions are benchmarked. All incorporated by reference and all can be used in the methods of the invention.
[0137] The DSSP algorithm is the standard method for assigning secondary structure to the amino acids of a protein, given the atomic-resolution coordinates of the protein. DSSP begins by identifying the intra-backbone hydrogen bonds of the protein using a purely electrostatic definition, assuming partial charges of -0.42 e and +0.20 e to the carbonyl oxygen and amide hydrogen respectively, their opposites assigned to the carbonyl carbon and amide nitrogen. A hydrogen bond is identified if E in the following equation is less than -0.5 kcal/mol:
E = 0.084 { 1 r ON + 1 r CH - 1 r OH - 1 r CN } 332 kcal / mol ##EQU00001##
[0138] Based on this, eight types of secondary structure are assigned. The 3.sub.10 helix, .alpha. helix and .pi. helix have symbols G, H and I and are recognized by having a repetitive sequence of hydrogen bonds in which the residues are three, four, or five residues apart respectively. Two types of beta sheet structures exist; a beta bridge has symbol B while longer sets of hydrogen bonds and beta bulges have symbol E. T is used for turns, featuring hydrogen bonds typical of helices, S is used for regions of high curvature (where the angle between
C i .alpha. C i + 2 .alpha. .fwdarw. and C 1 - 2 .alpha. C i .alpha. .fwdarw. ##EQU00002##
is less than 70.degree.), and a blank (or space) is used if no other rule applies, referring to loops. These eight types are usually grouped into three larger classes: helix (G, H and I), strand (E and B) and loop (all others).
[0139] PSIPRED (Psi-blast based secondary structure prediction) is a technique used to investigate protein structure. It employs neural network, machine learning methods in its algorithm. It is a server-side program, featuring a website serving as a front-end interface, which can predict a protein's secondary structure (beta sheets, alpha helices and coils) from the primary sequence. See bioinf dot cs dot ucl dot ac dot uk slash psipred. The idea of this method is a machine learning method that uses the information of the evolutionarily related proteins to predict the secondary structure of a new amino acid sequence. Specifically, PSIBLAST is used to find related sequences and to build a position-specific scoring matrix. This matrix is processed by a neural network, which was constructed and trained to predict the secondary structure of the input sequence. The prediction method or algorithm is split into three stages: Generation of a sequence profile, Prediction of initial secondary structure, and Filtering of the predicted structure. PSIPRED works to normalize the sequence profile generated by PSIBLAST. Then, by using neural networking, initial secondary structure is predicted. For each amino acid in the sequence the neural network is fed with a window of 15 acids. There is additional information attached, indicating if the window spans the N or C terminus of the chain. This results in a final input layer of 315 input units, divided into 15 groups of 21 units. The network has a single hidden layer of 75 units and 3 output nodes (one for each secondary structure element: helix, sheet, coil). A second neural network is used for filtering the predicted structure of the first network. This network is also fed with a window of 15 positions. The indicator on the possible position of the window at a chain terminus is also forwarded. This results in 60 input units, divided into 15 groups of four. The network has a single hidden layer of 60 units and results in three output nodes (one for each secondary structure element: helix, sheet, coil). The three final output nodes deliver a score for each secondary structure element for the central position of the window. Using the secondary structure with the highest score, PSIPRED generates the protein prediction. The Q3 value is the fraction of residues predicted correctly in the secondary structure states, namely helix, strand and coil.
Step-by-Step Description of an Exemplary Embodiment:
[0140] With the invention generally described above, certain non-limiting but illustrative embodiments are described below with reference to representative flow charts in the figures.
[0141] FIG. 9A illustrates one embodiment of the invention that is non-limiting. It generally illustrates a method 200 of the invention in which selected hydrophobic amino acids L, I, V, and F in the TM region of the proteins (e.g., GPCR) are replaced according to the "QTY Code" of the invention, without limiting the substitutions in any particular TM region/domain.
[0142] In that specific embodiment, the process starts 202 by acquiring or reading 204 an input of a protein sequence which may or may not be a transmembrane protein. The protein sequence can then be subject to TM region prediction 206 (if such information is not already available from the input protein sequence) and alpha-helical secondary structure prediction based on any of art-recognized methods. The TM region prediction, for example, can be performed using a program 240 such as the TMHMM program. If the prediction does not yield any TM region at 242, it may be possible that one or more different TM region prediction programs 250, such as SOSUI, can be used to predict the presence/absence of TM region. If no TM region is predicted based on such programs at 252, it is likely that no TM region exists in the protein 254, and the process will terminate 260.
[0143] On the other hand, if one or more TM region(s) are predicted by any of the suitable programs at 242, the TM region protein sequences are obtained 244, and the QTY Code of the invention can be applied to the hydrophobic amino acids L, I, V, and F within such TM region(s). More specifically, according to the QTY code, each leucine in the TM regions can be independently substituted 212 by glutamine (Q), serine (S), or asparagine (N), or remain unsubstituted; each isoleucine and valine in the TM regions can be independently substituted by threonine (T), serine (S), or asparagine (N), or remain unsubstituted; and each phenylalanine in the TM regions can be substituted by tyrosine (Y), or remain unsubstituted. The result of such QTY substitution produces one or more putative water-soluble variants of the original transmembrane protein. Note that the number of substitutions made for each amino acid in a region can be selected as a parameter.
[0144] Next, the alpha-helical secondary structures in each putative water-soluble variant can be predicted using any art-recognized programs, such as PORTER 210. The result can be compared to that of the original protein 208, preferably predicted using the same program (e.g., PORTER). Note that the alpha-helical secondary structure of the original protein can be predicted using any art-recognized program, wither before, concurrently, or after the TM region prediction step of the original protein.
[0145] If the result of the alpha-helical secondary structure prediction shows that the potential water-soluble variant has maintained or largely maintained the same alpha-helical secondary structure as the original protein at 214, it suggests that the specific pattern of QTY substitution in that variant does not or does not significantly affect the alpha-helical secondary structure in the original protein. The TM regions' prediction can then be conducted 220, verified 222, and the mutant sequence generated 224. Optionally, if the result shows that one or more of the alpha-helical secondary structure(s) in the original protein is destroyed at 214, the variant can be discarded at this step as undesirable, thus terminating the process.
[0146] On the other hand, the method of the invention also requires the predicted QTY variant to show less or no propensity to form TM region, as compared to the original protein. Thus the putative water-soluble variant can be subject to TM region prediction, such as using the same TM region prediction program used for the initial TM region prediction (if necessary) in the original protein. If the result shows that significant TM region still exist, the variant may be discarded. On the other hand, if the result shows that no TM region exists, or the propensity of forming TM regions is low, the variant can be selected as the desired variant having enhanced water-solubility over the original protein, while having maintained the alpha-helical secondary structure and hence likely the function of the original protein.
[0147] If desired, additional steps can be performed to provide further characterization of the resulting water-soluble variant. Such additional characterization may include calculating 226 the pI of the variant and compare it to that of the original protein. The pI should have no change or very little change (i.e. less than 30 percent, or preferably less than 20 percent or more preferably less than 10 percent). Other additional characterization may include creating a helical wheel model 246 (such as the one shown in FIG. 3) to show the location and any clustering of the QTY substitutions on any particular TM regions.
[0148] Another illustrative embodiment of the invention for designing the transmembrane regions of a protein (e.g., a GPCR) by the QTY Code of the invention can be performed on a computer system, using the representative process 10 described in FIG. 9B, some of the detailed steps are further described below. Many of the steps are optional or can be combined according to the methods of the invention.
[0149] 1: In step 1, a computer interface of a computer system receives a protein sequence, selected for analysis, and data descriptive of the protein (e.g., the sequence) entered, uploaded or inputted 12 through a computer interface of a computer system. The data entered can be a protein name, a database reference, or a protein sequence. For example, the protein sequence can be uploaded through a computer interface.
[0150] 2: In step 2, additional data about the protein can be identified, determined, obtained and/or entered, including its name or sequence and entered via the computer interface. One source to obtain 20 protein data is a database named UniProt (www dot uniprot dot org). Alternatively, the method of the invention can store data relating to the protein, or related sequences to the protein, for later retrieval by the user in this step. In embodiments, the program can prompt the user to select a database or file for retrieving additional data (e.g., sequence data) relating to the protein selected for analysis.
[0151] 3: In step 3, the user can enter, upload, or obtain data identifying the transmembrane regions. For example, the user can be prompted to obtain the data from a public source, such as from UniProt. The information can be verified 30 and collected from the database for use in Step 5.
[0152] 4: Alternatively or additionally, if the TM region information is not readily available from the input protein sequence, the transmembrane region can nevertheless be established 40 by any art recognized methods. Transmembrane regions are generally characterized by an alpha helical conformation. Transmembrane helix prediction can be predicted, for example, using a software module/package named TMHMM 2.0 (TransMembrane prediction using Hidden Markov Models), developed by Center for Biological Sequence Analysis (www dot cbs dot dtu dot dk slash services slash TMHMM). A version of the software may have problems on peak finding and sometimes fails to find 7-TM regions for a GPCR. Therefore, a modified version of the program may be used when necessary, wherein the peak searching method executed by the computer system introduces a dynamic baseline. Here, for example, in the case of a GPCR, if all seven TM regions using the initial baseline value are not found, the baseline can be changed to a lower value. For example, the default baseline may be set at 0.2. To identify a missing seventh transmembrane region, one can set the baseline value to 0.1. If more than seven TM regions are found, the baseline can be changed to a higher value, such as 0.15, to eliminate spurious TM prediction. For example, when the CCR-2 amino acid sequence was subjected to the TMHMM 2.0 software, only 6 transmembrane regions were initially identified. When the TMHMM 2.0 baseline value was set to 0.07, however, a correct total of 7 transmembrane regions were identified. The result of the TM region prediction is then provided to step 5.
[0153] 5: in step 5, after identifying the TM data either through de novo prediction or through obtaining such information through the initial sequence input, the sequence of a GPCR is divided 50 into a total of 15 fragments (i.e., 7-transmembrane segments (7TM) 52 and 8 non-transmembrane segments (8NTM)) 54 according to the TM region information. That is, there should be 7TM and 8 NTM fragments for each typical GPCR.
[0154] It is understood that the system can execute one or more, such as all of the steps described above, using a computer interface for input by a user. It is also understood that the system can omit one or more of the steps described above, or combine two or more steps.
[0155] 6: In step 6, QTY substitution 60 is performed partially, on a selected subsets of hydrophobic amino acids L, I, V, and F within a given TM region of the protein. Specifically, a first transmembrane region (typically, but not necessarily, the transmembrane region which is most proximal to the N-terminal of the protein) is first selected for variation. Some or all of the hydrophobic amino acids (L, I, V, and F) in the first transmembrane region are then substituted with the corresponding non-ionic hydrophilic amino acids (Q/S/N, T/S/N, T/S/N, or Y). It is understood that the amino acid is not actually substituted into the protein in this context. Rather, the amino acid designation is substituted in the sequence for modeling. Thus, the term "sequence" is intended to include "sequence data." Typically, most or all of the hydrophobic amino acids are selected for substitution. If less than all amino acids are selected, it may be desirable to select the internal hydrophobic amino acids leaving one or more N and/or C terminal amino acids of the transmembrane regions hydrophobic. Additionally or alternatively, it may be desirable to select to replace all of the leucines (L) in a transmembrane region. Additionally or alternatively, it may be desirable to select and replace all of the isoleucines (I) in a transmembrane region. Additionally or alternatively, it may be desirable to select to replace all of the valines (V) in a transmembrane region. Additionally or alternatively, it may be desirable to select to replace all of the phenylalanines (F) in a transmembrane region. Additionally or alternatively, it can be beneficial to retain one or more phenylalanines in the transmembrane region. Additionally or alternatively, it can be beneficial to retain one or more valines in the transmembrane region. Additionally or alternatively, it can be beneficial to retain one or more leucines in the transmembrane region. Additionally or alternatively, it can be beneficial to retain one or more isoleucines in the transmembrane region. Additionally or alternatively, it can be beneficial to retain one or more hydrophobic amino acids in the transmembrane region where the wild type sequence is characterized by three or more contiguous hydrophobic amino acids.
[0156] 7: In step 7, the transmembrane region so designed is put back into the context of the original protein. That is, the mutated or re-designed TM region 62 with the QTY substitutions is swapped into the corresponding TM region of the original protein, to create the transmembrane variants 70 or "putative variants," since each sets of substitution creates one specific putative variant for that TM region. Together, these related putative variants form a first library of putative variants.
[0157] 8: In steps 82 and 84, each putative variant is then subjected to the transmembrane region prediction process (84), as discussed herein (e.g., loss of predicted TM region). The variant is also assessed a score for the sequence's propensity to form an alpha helix (82). The variant is also subjected to a water solubility prediction process, as discussed herein. For example, the variant is assessed a score for the sequence's propensity to be water soluble. Such score may be based on a predicted propensity to form TM regions, with strong propensity to form TM regions being associated with poor water solubility, and low or now propensity to form TM regions being associated with high water solubility. Of course, complete water solubility at all concentrations is not required for most commercial purposes. Water solubility is preferably determined to be that required for functionality at the predicted conditions of use (e.g., in a ligand binding assay).
[0158] 9: In step 9, putative variants that predict loss of alpha helical structure and/or "water insolubility" (predicted at the expected conditions of use) are discarded. Putative variants that predict alpha helical structure and water solubility can be selected, such as by using the combined score or rank 90 that is a weighted combination based on a ranking function of the alpha-helical secondary structure prediction result and the TM region/water solubility prediction result. For example, one can select transmembrane variants that are highly water soluble, or are characterized by 0, 1, 2, or 3 hydrophobic amino acids (e.g., higher weight for the water solubility prediction result), with a possible expectation that alpha helical structure can be compromised. Alternatively or additionally, one can select highly alpha-helical structures (e.g., higher weight for the alpha helical secondary structure prediction result), characterized by 3, 4, 5 or 6 hydrophobic amino acids.
[0159] 10: In step 10, the putative variants in the same library 94 can be sorted or ranked 100 based on the score calculation scheme outlined above. Then a pre-determined number of putative variants can be selected as the final members in the first putative variant library. For example, in the combined score described above, a score of 0 means no propensity to form TM region, and complete maintenance of the original alpha helical secondary structure, and is thus the most desired putative variant. A slightly higher score may indicate a slight propensity to form TM region (or a less propensity of being water soluble). Thus the putative variant is less desirable but may still be selected based on its superior combined score compared to the other putative variants in the library.
[0160] In certain embodiments, a pre-determined number of desired putative variants can be selected, such as 10, 9, 8, 7, 6, 5, 4, 3, 2, or 1.
[0161] These steps (e.g., steps 6-10) can be repeated for a second, third, fourth, fifth, sixth and/or seventh (or more) transmembrane region or domain to create one putative variant library for each such TM regions or domains.
[0162] 11: In step 11, one can select 110 a combination of the TM regions or domains with the putative variants and the unsubstituted non-TM regions. For example, one, two, three or four domains with putative variants possessing high alpha-helical structure scores and one, two, three, four, five, or six domains with putative variants possessing high water solubility scores can be combined. In another example, one can combine a domain/TM region that is characterized by all hydrophobic amino acids being substituted by a hydrophilic amino acid, thus maximizing the water solubility score, and a second domain/TM region that retains 3, 4, or 5 hydrophobic amino acids in a plurality of variant selections. Such selected putative variants can be "shuffled," as is known in the art, with the extracellular and intracellular domains to create an initial combinatorial library of putative water-soluble protein variants.
[0163] In certain embodiments, all or a fraction of the putative water soluble protein variants of the initial combinatorial library designed as described herein can be made (produced or expressed in vitro or in a host cell) and screened for water solubility and/or ligand binding, preferably in a high through-put screen Amplification of the library, for example, can result in less than 100% of the putative water-soluble protein combinatorial variants from being expressed. A reporter system can be used to screen ligand binding, as is well known in the art. Using the methods of the invention, one can rapidly identify a library of putative water soluble modified transmembrane combinatorial variants that contain functionally combined extracellular and intracellular domains, and generate water soluble protein variants possessing the proper 3 dimensional structure of the wild type protein, and retaining ligand binding function (including binding affinity), or other functions. The software can include a learning module in which verified functionality of protein variants is used to eliminate certain variants or rank them differently.
[0164] In certain embodiments, to be practical experimentally, the initial combinatorial library has about 2 million potentially water-soluble GPCR, or CXCR4, variants. Of course, a library of more or less variants can be designed as well. Smaller libraries maybe preferred in certain embodiments since they can be optimized based on analysis of the research results as described herein. Analysis of research results is likely to establish trends to optimize the number of domain variants to shuffle and the assumptions for selecting domain variants.
[0165] In certain embodiments, certain hydrophobic amino acids in the TM region of the transmembrane proteins are selected for modification based on the helical forming propensity also known as "the helix prediction score" (see www dot proteopedia dot org slash wiki slash index dot php slash Main_Page). The varied fragments are randomly assembled to form about 2M (8.sup.7) variants of full-length GPCR genes. The predicted number of variants can generally be characterized by the formula H.sup.n, where n=the number of transmembrane regions modified and/or varied by the method (in the example of GPCR, n=7) and H=the number of putative variants in each transmembrane region available for generating the combinatorial variants.
[0166] Once the initial combinatorial library, or selection of the domain variants to be shuffled, is selected, nucleic acid molecules, or DNA or cDNA molecules, encoding the proteins in the initial combinatorial library can be designed. The nucleic acid molecules are preferably designed to provide codon optimization and intron deletions for the expression systems selected to produce a library of coding sequences. For example, if the expression system is E. coli, codons optimized for E. coli expression can be selected. See www dot dna20 dot com slash resources slash genedesigner. In addition, a promoter region, such as a promoter suitable for expression in the expression system (e.g., E. coli) is selected and operatively connected to the coding sequences in the library of coding sequences.
[0167] The initial library of coding sequences, or a portion thereof, is then expressed to produce a library of putative water soluble GPCRs. The library is then subjected to a ligand binding assay. In the binding assay, the putative water soluble GPCRs are contacted with the ligand, preferably in an aqueous medium and ligand binding is detected.
[0168] The invention includes transmembrane domain variants, and nucleic acid molecules encoding same, obtained, or obtainable, from the methods described herein.
[0169] The invention also contemplates water soluble GPCR variants ("sGPCRs") characterized by a plurality of transmembrane domains independently characterized by at least 50%, preferably at least about 60%, more preferably at least about 70% or 80%, such as at least about 90%) of the hydrophobic amino acid residues (L, I, V, and F) of a native transmembrane protein (e.g., GPCR) substituted by a Q, T, T, or Y, respectively). The sGPCRs of the invention are characterized by water solubility and ligand binding. In particular, the sGPCR binds the same natural ligand as the corresponding native GPCR.
[0170] The invention further encompasses a method of treatment for a disorder or disease that is mediated by the activity of a membrane protein, comprising the use of a water-soluble polypeptide to treat said disorders and diseases, wherein said water-soluble polypeptide comprises a modified .alpha.-helical domain, and wherein said water-soluble polypeptide retains the ligand-binding activity of the native membrane protein. Examples of such disorders and diseases include, but are not limited to, cancer, small cell lung cancer, melanoma, breast cancer, Parkinson's disease, cardiovascular disease, hypertension, and asthma.
[0171] As described herein, the water-soluble peptides described herein can be used for the treatment of conditions or diseases mediated by the activity of a membrane protein. In certain aspects, the water-soluble peptides can act as "decoys" for the membrane receptor and bind to the ligand that otherwise activates the membrane receptor. As such, the water-soluble peptides described herein can be used to reduce the activity of a membrane protein. These water-soluble peptides can remain in the circulation and competitively bind to specific ligands, thereby reducing the activity of membrane bound receptors. For example, the GPCR CXCR4 is over-expressed in small cell lung cancer and facilitates metastasis of tumor cells. Binding of this ligand by a water-soluble peptide such as that described herein may significantly reduce metastasis.
[0172] The chemokine receptor, CXCR4, is known in viral research as a major coreceptor for the entry of T cell line-tropic HIV (Feng et al. (1996) Science 272: 872-877; Davis et al. (1997) J Exp Med 186: 1793-1798; Zaitseva et al. (1997) Nat Med 3: 1369-1375; Sanchez et al. (1997) J Biol Chem 272: 27529-27531). Stromal cell derived factor 1 (SDF-1) is a chemokine that interacts specifically with CXCR4. When SDF-1 binds to CXCR4, CXCR4 activates Gai protein-mediated signaling (pertussis toxin-sensitive) (Chen et al. (1998) Mol Pharmacol 53:177-181), including downstream kinase pathways such as Ras/MAP Kinases and phosphatidylinositol 3-kinase (PI3K)/Akt in lymphocyte, megakaryocytes, and hematopoietic stem cells (Bleul et al. (1996) Nature 382: 829-833; Deng et al. (1997) Nature 388: 296-300; Kijowski et al. (2001) Stem Cells 19: 453-466; Majka et al. (2001) Folia. Histochem. Cytobiol. 39: 235-244; Sotsios et al. (1999) J. Immunol. 163: 5954-5963; Vlahakis et al. (2002) J. Immunol. 169: 5546-5554). In mice transplanted with human lymph nodes, SDF-1 induces CXCR4-positive cell migration into the transplanted lymph node (Blades et al. (2002) J. Immunol. 168: 4308-4317).
[0173] Recently, studies have shown that CXCR4 interactions may regulate the migration of metastatic cells. Hypoxia, a reduction in partial oxygen pressure, is a microenvironmental change that occurs in most solid tumors and is a major inducer of tumor angiogenesis and therapeutic resistance. Hypoxia increases CXCR4 levels (Staller et al. (2003) Nature 425: 307-311). Microarray analysis on a sub-population of cells from a bone metastatic model with elevated metastatic activity showed that one of the genes increased in the metastatic phenotype was CXCR4. Furthermore, overexpression CXCR4 in isolated cells significantly increased the metastatic activity (Kang et al. (2003) Cancer Cell 3: 537-549). In samples collected from various breast cancer patients, Muller et al. (Muller et al. (2001) Nature 410: 50-56) found that CXCR4 expression level is higher in primary tumors relative to normal mammary gland or epithelial cells. Moreover, CXCR4 antibody treatment has been shown to inhibit metastasis to regional lymph nodes when compared to control isotypes that all metastasized to lymph nodes and lungs (Muller et al. (2001). As such a decoy therapy model is suitable for treating CXCR4 mediated diseases and disorders.
[0174] In another embodiment of the invention relates to the treatment of a disease or disorder involving CXCR4-dependent chemotaxis, wherein the disease is associated with aberrant leukocyte recruitment or activation. The disease is selected from the group consisting of arthritis, psoriasis, multiple sclerosis, ulcerative colitis, Crohn's disease, allergy, asthma, AIDS associated encephalitis, AIDS related maculopapular skin eruption, AIDS related interstitial pneumonia, AIDS related enteropathy, AIDS related periportal hepatic inflammation and AIDS related glomerulo nephritis.
[0175] In another aspect, the invention relates to the treatment of a disease or disorder selected from arthritis, lymphoma, non-small lung cancer, lung cancer, breast cancer, prostate cancer, multiple sclerosis, central nervous system developmental disease, dementia, Parkinson's disease, Alzheimer's disease, tumor, fibroma, astrocytoma, myeloma, glioblastoma, an inflammatory disease, an organ transplantation rejection, AIDS, HIV-infection or angiogenesis.
[0176] The invention also encompasses a pharmaceutical composition comprising said water-soluble polypeptide and a pharmaceutically acceptable carrier or diluent.
[0177] The compositions can also include, depending on the formulation desired, pharmaceutically-acceptable, non-toxic carriers or diluents, which are defined as vehicles commonly used to formulate pharmaceutical compositions for animal or human administration. The diluent is selected so as not to affect the biological activity of the pharmacologic agent or composition. Examples of such diluents are distilled water, physiological phosphate-buffered saline, Ringer's solutions, dextrose solution, and Hank's solution. In addition, the pharmaceutical composition or formulation may also include other carriers, adjuvants, or nontoxic, nontherapeutic, nonimmunogenic stabilizers and the like. Pharmaceutical compositions can also include large, slowly metabolized macromolecules such as proteins, polysaccharides such as chitosan, polylactic acids, polyglycolic acids and copolymers (such as latex functionalized SEPHAROSE.TM., agarose, cellulose, and the like), polymeric amino acids, amino acid copolymers, and lipid aggregates (such as oil droplets or liposomes).
[0178] The compositions can be administered parenterally such as, for example, by intravenous, intramuscular, intrathecal or subcutaneous injection. Parenteral administration can be accomplished by incorporating a composition into a solution or suspension. Such solutions or suspensions may also include sterile diluents such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents. Parenteral formulations may also include antibacterial agents such as, for example, benzyl alcohol or methyl parabens, antioxidants such as, for example, ascorbic acid or sodium bisulfite and chelating agents such as EDTA. Buffers such as acetates, citrates or phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose may also be added. The parenteral preparation can be enclosed in ampules, disposable syringes or multiple dose vials made of glass or plastic.
[0179] Additionally, auxiliary substances, such as wetting or emulsifying agents, surfactants, pH buffering substances and the like can be present in compositions. Other components of pharmaceutical compositions are those of petroleum, animal, vegetable, or synthetic origin, for example, peanut oil, soybean oil, and mineral oil. In general, glycols such as propylene glycol or polyethylene glycol are preferred liquid carriers, particularly for injectable solutions.
[0180] Injectable formulations can be prepared either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to injection can also be prepared. The preparation also can also be emulsified or encapsulated in liposomes or micro particles such as polylactide, polyglycolide, or copolymer for enhanced adjuvant effect, as discussed above. Langer, Science 249: 1527, 1990; and Hanes, Advanced Drug Delivery Reviews 28: 97-119, 1997. The compositions and pharmacologic agents described herein can be administered in the form of a depot injection or implant preparation which can be formulated in such a manner as to permit a sustained or pulsatile release of the active ingredient.
[0181] Transdermal administration includes percutaneous absorption of the composition through the skin. Transdermal formulations include patches, ointments, creams, gels, salves and the like. Transdermal delivery can be achieved using a skin patch or using transferosomes. See Paul et al., Eur. J. Immunol. 25: 3521-24, 1995; and Cevc et al., Biochem. Biophys. Acta 1368: 201-15, 1998.
[0182] "Treating" or "treatment" includes preventing or delaying the onset of the symptoms, complications, or biochemical indicia of a disease, alleviating or ameliorating the symptoms or arresting or inhibiting further development of the disease, condition, or disorder. A "patient" is a human subject in need of treatment.
[0183] An "effective amount" refers to that amount of the therapeutic agent that is sufficient to ameliorate of one or more symptoms of a disorder and/or prevent advancement of a disorder, cause regression of the disorder and/or to achieve a desired effect.
Computer System
[0184] Various aspects and functions described herein may be implemented as specialized hardware or software components executing in one or more computer systems. There are many examples of computer systems that are currently in use. These examples include, among others, network appliances, personal computers, workstations, mainframes, networked clients, servers, media servers, application servers, database servers, and web servers. Other examples of computer systems may include mobile computing devices, such as cellular phones and personal digital assistants, and network equipment, such as load balancers, routers, and switches. Further, aspects may be located on a single computer system or may be distributed among a plurality of computer systems connected to one or more communications networks.
[0185] For example, various aspects, functions, and processes may be distributed among one or more computer systems configured to provide a service to one or more client computers, or to perform an overall task as part of a distributed system. Additionally, aspects may be performed on a client-server or multi-tier system that includes components distributed among one or more server systems that perform various functions. Consequently, embodiments are not limited to executing on any particular system or group of systems. Further, aspects, functions, and processes may be implemented in software, hardware or firmware, or any combination thereof. Thus, aspects, functions, and processes may be implemented within methods, acts, systems, system elements and components using a variety of hardware and software configurations, and examples are not limited to any particular distributed architecture, network, or communication protocol.
[0186] Referring to FIG. 10, there is illustrated a block diagram of a distributed computer system 300, in which various aspects and functions are practiced. As shown, the distributed computer system 300 includes one or more computer systems that exchange information. More specifically, the distributed computer system 300 includes computer systems 302, 304, and 306. As shown, the computer systems 302, 304, and 306 are interconnected by, and may exchange data through, a communication network 308. The network 308 may include any communication network through which computer systems may exchange data. To exchange data using the network 308, the computer systems 302, 304, and 306 and the network 308 may use various methods, protocols and standards. Examples of these protocols and standards include NAS, Web, storage and other data movement protocols suitable for use in a big data environment. To ensure data transfer is secure, the computer systems 302, 304, and 306 may transmit data via the network 308 using a variety of security measures including, for example, SSL or VPN technologies. While the distributed computer system 300 illustrates three networked computer systems, the distributed computer system 300 is not so limited and may include any number of computer systems and computing devices, networked using any medium and communication protocol.
[0187] As illustrated in FIG. 10, the computer system 302 includes a processor 310, a memory 312, an interconnection element 314, an interface 316 and data storage element 318. To implement at least some of the aspects, functions, and processes disclosed herein, the processor 310 performs a series of instructions that result in manipulated data. The processor 310 may be any type of processor, multiprocessor or controller. Example processors may include a commercially available processor such as an Intel Xeon, Itanium, Core, Celeron, or Pentium processor; an AMD Opteron processor; an Apple A4 or A5 processor; a Sun UltraSPARC processor; an IBM Power5+ processor; an IBM mainframe chip; or a quantum computer. The processor 310 is connected to other system components, including one or more memory devices 312, by the interconnection element 314.
[0188] The memory 312 stores programs (e.g., sequences of instructions coded to be executable by the processor 310) and data during operation of the computer system 302. Thus, the memory 312 may be a relatively high performance, volatile, random access memory such as a dynamic random access memory ("DRAM") or static memory ("SRAM"). However, the memory 312 may include any device for storing data, such as a disk drive or other nonvolatile storage device. Various examples may organize the memory 312 into particularized and, in some cases, unique structures to perform the functions disclosed herein. These data structures may be sized and organized to store values for particular data and types of data.
[0189] Components of the computer system 302 are coupled by an interconnection element such as the interconnection element 314. The interconnection element 314 may include any communication coupling between system components such as one or more physical busses in conformance with specialized or standard computing bus technologies such as IDE, SCSI, PCI and InfiniBand. The interconnection element 314 enables communications, including instructions and data, to be exchanged between system components of the computer system 302.
[0190] The computer system 302 also includes one or more interface devices 316 such as input devices, output devices and combination input/output devices. Interface devices may receive input or provide output. More particularly, output devices may render information for external presentation. Input devices may accept information from external sources. Examples of interface devices include keyboards, mouse devices, trackballs, microphones, touch screens, printing devices, display screens, speakers, network interface cards, etc. Interface devices allow the computer system 302 to exchange information and to communicate with external entities, such as users and other systems.
[0191] The data storage element 318 includes a computer readable and writeable nonvolatile, or non-transitory, data storage medium in which instructions are stored that define a program or other object that is executed by the processor 310. The data storage element 318 also may include information that is recorded, on or in, the medium, and that is processed by the processor 310 during execution of the program. More specifically, the information may be stored in one or more data structures specifically configured to conserve storage space or increase data exchange performance. The instructions may be persistently stored as encoded signals, and the instructions may cause the processor 310 to perform any of the functions described herein. The medium may, for example, be optical disk, magnetic disk or flash memory, among others. In operation, the processor 310 or some other controller causes data to be read from the nonvolatile recording medium into another memory, such as the memory 312, that allows for faster access to the information by the processor 310 than does the storage medium included in the data storage element 318. The memory may be located in the data storage element 318 or in the memory 312, however, the processor 310 manipulates the data within the memory, and then copies the data to the storage medium associated with the data storage element 318 after processing is completed. A variety of components may manage data movement between the storage medium and other memory elements and examples are not limited to particular data management components. Further, examples are not limited to a particular memory system or data storage system.
[0192] Although the computer system 302 is shown by way of example as one type of computer system upon which various aspects and functions may be practiced, aspects and functions are not limited to being implemented on the computer system 302 as shown in FIG. 10. Various aspects and functions may be practiced on one or more computers having a different architectures or components than that shown in FIG. 10. For instance, the computer system 302 may include specially programmed, special-purpose hardware, such as an application-specific integrated circuit ("ASIC") tailored to perform a particular operation disclosed herein. While another example may perform the same function using a grid of several general-purpose computing devices running MAC OS System X with Motorola PowerPC processors and several specialized computing devices running proprietary hardware and operating systems.
[0193] The computer system 302 may be a computer system including an operating system that manages at least a portion of the hardware elements included in the computer system 302. In some examples, a processor or controller, such as the processor 310, executes an operating system. Examples of a particular operating system that may be executed include a Windows-based operating system, such as, Windows NT, Windows 2000 (Windows ME), Windows XP, Windows Vista or Windows 7 operating systems, available from the Microsoft Corporation, a MAC OS System X operating system or an iOS operating system available from Apple Computer, one of many Linux-based operating system distributions, for example, the Enterprise Linux operating system available from Red Hat Inc., a Solaris operating system available from Oracle Corporation, or a UNIX operating systems available from various sources. Many other operating systems may be used, and examples are not limited to any particular operating system.
[0194] The processor 310 and operating system together define a computer platform for which application programs in high-level programming languages are written. These component applications may be executable, intermediate, bytecode or interpreted code which communicates over a communication network, for example, the Internet, using a communication protocol, for example, TCP/IP. Similarly, aspects may be implemented using an object-oriented programming language, such as .Net, SmallTalk, Java, C.sup.++, Ada, C# (C-Sharp), Python, or JavaScript. Other object-oriented programming languages may also be used. Alternatively, functional, scripting, or logical programming languages may be used.
[0195] Additionally, various aspects and functions may be implemented in a non-programmed environment. For example, documents created in HTML, XML or other formats, when viewed in a window of a browser program, can render aspects of a graphical-user interface or perform other functions. Further, various examples may be implemented as programmed or non-programmed elements, or any combination thereof. For example, a web page may be implemented using HTML while a data object called from within the web page may be written in C.sup.++. Thus, the examples are not limited to a specific programming language and any suitable programming language could be used. Accordingly, the functional components disclosed herein may include a wide variety of elements (e.g., specialized hardware, executable code, data structures or objects) that are configured to perform the functions described herein.
[0196] In some examples, the components disclosed herein may read parameters that affect the functions performed by the components. These parameters may be physically stored in any form of suitable memory including volatile memory (such as RAM) or nonvolatile memory (such as a magnetic hard drive). In addition, the parameters may be logically stored in a propriety data structure (such as a database or file defined by a user space application) or in a commonly shared data structure (such as an application registry that is defined by an operating system). In addition, some examples provide for both system and user interfaces that allow external entities to modify the parameters and thereby configure the behavior of the components.
[0197] The software is generally depicted in FIG. 11A to perform a computational method in which the user selects operating parameters to execute a procedure on a computer 402, as previously described herein, where one or more sequences are entered 404, and substitutions are performed 408. The system is operative to verify secondary structures 408 and verify water solubility for the one or more variants. As shown in FIG. 11B, the program can include additional processing options in addition to those previously described, wherein one or more ranking functions 442 can be stored, the user can select or the system can automatically select 444 the ranking function to be used. The system can then generate a rank 446 as described herein, and then a user can make 448 a selected variant to measure function 448, and subsequently enter functional data to modify the processing sequence 450 based thereon.
[0198] The invention will be better understood in connection with the following example, which is intended as an illustration only and not limiting of the scope of the invention. Various changes and modifications to the disclosed embodiments will be apparent to those skilled in the art and such changes and may be made without departing from the spirit of the invention and the scope of the appended claims.
EXAMPLES
Example 1: CXC Chemokine Receptor Type 4 Isoform a (CXCR4)
[0199] CXCR4 is a chemokine receptor 356 amino acids in length. It has a pI of about 8.61 and a Molecular Weight of 40221.19 Da. The sequence for CXCR4, as published in the literature, is:
TABLE-US-00001 (SEQ ID NO. 1) MSIPLPLLQIYTSDNYTEEMGSGDYDSMKEPCFREENANFNKIFLPTIYS IIFLTGIVGNGLVILVMGYQKKLRSMTDKYRLHLSVADLLFVITLPFWAV DAVANWYFGNFLCKAVHVIYTVNLYSSVLILAFISLDRYLAIVHATNSQR PRKLLAEKVVYVGVWIPALLLTIPDFIFANVSEADDRYICDRFYPNDLWV VVFQFQHIMVGLILPGIVILSCYCIIISKLSHSKGHQKRKALKTTVILIL AFFACWLPYYIGISIDSFILLEIIKQGCEFENTVHKWISITEALAFFHCC LNPILYAFLGAKFKTSAQHALTSVSRGSSLKILSKGKRGGHSSVSTESES SSFHSS.
[0200] Subjecting the sequence to TMHMM results in the identification of the transmembrane domains as depicted in FIG. 3.
[0201] Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) results in the following sequence:
TABLE-US-00002 SEQ ID NO: 2) 1 MSIPLPLLQIYTSDNYTEEMGSGDYDSMKEPCFREENANFNKIFLPTTYSTTYQTGTTGN 61 GQTTQTMGYQKKLRSMTDKYRQHQSTADQQYTTTQPYWATDAVANWYFGNFLCKATHTTY 121 TTNQYSSTQTQAYTSQDRYLAIVHATNSQRPRKLLAEKTTYTGTWTPAQQQTTPDYTYAN 181 VSEADDRYICDRFYPNDLWVVVYQYQHTMTGQTQPGTTTQSCYCTIISKLSHSKGHQKRK 241 ALKTTTTQTQAYYACWQPYYTGTSTDSYILLEIIKQGCEFENTVHKWTSTTEAQAYYHCC 301 QNPTQYAYQGAKFKTSAQHALTSVSRGSSLKILSKGKRGGHSSVSTESESSSFHSS.
[0202] The predicted pI of the protein is 8.54 and the Molecular Weight is 40551.64 Da. Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising Amino Acids 47-70 of SEQ ID NO: 2 (TM1), and proteins comprising the same. As an example, FIG. 3 represents the alpha-helical prediction of the TM1 sequence. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences of SEQ ID NO: 2 (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in SEQ ID NO: 2 or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V and F amino acids, as set forth in SEQ ID NO: 1.
[0203] The native protein sequence for CXCR4 (differing in the N-terminal amino acids) was subjected to the method a second time. The program output divided the native sequence into the extracellular and intracellular regions and selected 8 transmembrane domain variants for each transmembrane domain. The results are illustrated in FIG. 4 and in the following table:
TABLE-US-00003 (SEQ ID NO. 3; EC1) MEGISIYTSDNYTEEMGSGDYDSMKEPCFREENANFNK TM 1 Variants: (SEQ ID NO. 4) IFLPTTYSTTFQTGTTGNGQVTQVM (SEQ ID NO. 5) IFQPTTYSTTFQTGTTGNGQVTQVM (SEQ ID NO. 6) IFQPTTYSTTFQTGTTGNGQVTQTM (SEQ ID NO. 7) IFQPTTYSTTYQTGTTGNGQVTQTM (SEQ ID NO. 8) IFQPTTYSTTYQTGTTGNGQTTQVM (SEQ ID NO. 9) IFQPTTYSTTYQTGTTGNGQTIQTM (SEQ ID NO. 10) IFQPTTYSTTYQTGTTGNGQTTQTM (SEQ ID NO. 11) TYQPTTYSTTYQTGTTGNGQTTQTM (SEQ ID NO. 12; IC1) GYQKKLRSMTDKYR TM 2 Variants: (SEQ ID NO. 13) LHLSTADQQFTTTQPFWAVDAV (SEQ ID NO. 14) LHLSVADQQYTTTQPFWATDAV (SEQ ID NO. 15) LHQSVADQQYVTTQPFWATDAT (SEQ ID NO. 16) QHQSVADQQFTTTQPFWATDAT (SEQ ID NO. 17) LHQSVADQQYTITQPYWATDAT (SEQ ID NO. 18) QHLSVADQQYTITQPYWATDAT (SEQ ID NO. 19) QHLSTADQQYVTTQPYWATDAT (SEQ ID NO. 20) QHQSTADQQYTTTQPYWATDAT (SEQ ID NO. 21; EC2) ANWYFGNFLCK TM 3 Variants: (SEQ ID NO. 22) AVHVTYTVNQYSSVQIQAFT (SEQ ID NO. 23) AVHTTYTVNQYSSVQIQAFT (SEQ ID NO. 24) AVHTTYTVNQYSSVQTQAFT (SEQ ID NO. 25) ATHTTYTVNQYSSVQTQAFT (SEQ ID NO. 26) ATHTIYTTNQYSSVQTQAFT (SEQ ID NO. 27) AVHTTYTTNQYSSVQTQAFT (SEQ ID NO. 28) ATHTTYTTNQYSSVQTQAFT (SEQ ID NO. 29) ATHTTYTTNQYSSTQTQAYT (SEQ ID NO. 30; IC2) SLDRYLAIVHATNSQRPRKLLAEK TM 4 Variants: (SEQ ID NO. 31) VTYTGVWTPAQQQTIPDFIF (SEQ ID NO. 32) TTYTGTWIPAQQQTIPDFIF (SEQ ID NO. 33) TTYTGTWTPAQQQTIPDFIF (SEQ ID NO. 34) TTYTGTWTPAQQQTIPDFIY (SEQ ID NO. 35) TTYVGTWTPAQQQTTPDYIF (SEQ ID NO. 36) TTYVGTWTPAQQQTTPDFIY (SEQ ID NO. 37) TTYTGVWTPAQQQTTPDYTF (SEQ ID NO. 38) TTYTGTWTPAQQQTTPDYTY (SEQ ID NO. 39; EC3) ANVSEADDRYICDRFYPNDLW TM 5 Variants: (SEQ ID NO. 40) VVVFQFQHTMVGQTQPGTTTQ (SEQ ID NO. 41) VVVFQFQHTMTGQTQPGTTTQ (SEQ ID NO. 42) VVVFQYQHTMTGQTQPGTTTQ (SEQ ID NO. 43) VVVYQYQHTMTGQTQPGTTTQ (SEQ ID NO. 44) TVVFQYQHTMTGQTQPGTTTQ (SEQ ID NO. 45) VVTFQYQHTMTGQTQPGTTTQ (SEQ ID NO. 46) TVVYQYQHTMTGQTQPGTTTQ (SEQ ID NO. 47) TTTYQYQHTMTGQTQPGTTTQ (SEQ ID NO. 48; IC3) SCYCIIISKLSHSKGHQKRKALKTT TM 6 Variants: (SEQ ID NO. 49) VTQIQAFFACWQPYYTGTST (SEQ ID NO. 50) VIQIQAYFACWQPYYTGTST (SEQ ID NO. 51) VIQIQAYYACWQPYYTGTST (SEQ ID NO. 52) VIQTQAFYACWQPYYTGTST (SEQ ID NO. 53) VIQTQAYFACWQPYYTGTST (SEQ ID NO. 54) VTQIQAFYACWQPYYTGTST (SEQ ID NO. 55) VIQTQAYYACWQPYYTGTST (SEQ ID NO. 56) TTQTQAYYACWQPYYTGTST (SEQ ID NO. 57; EC4) DSFILLEIIKQGCEFENTVHK TM 7 Variants (SEQ ID NO. 58) WISITEAQAFFHCCLNPIQY (SEQ ID NO. 59) WISITEAQAFYHCCLNPIQY (SEQ ID NO. 60) WISITEAQAYFHCCQNPTLY (SEQ ID NO. 61) WISTTEALAFYHCCQNPTQY (SEQ ID NO. 62) WISTTEALAYFHCCQNPTQY (SEQ ID NO. 63) WISITEALAYYHCCQNPTQY (SEQ ID NO. 64) WISTTEALAYYHCCQNPTQY WTSTTEAQAYYHCCQNPTQY (SEQ ID NO. 65; IC4) AFLGAKFKTSAQHALTSVSRGSSLKILSKGKRGGHSSVSTESESSSFHS S.
[0204] It is believed that it is clear from the above, that the sequences (SEQ ID NOs: 3, 12, 21, 30, 39, 48, 57 and 65) before, between and after each list of transmembrane domain variants are the N', intermediary and C' extracellular and intracellular regions, respectively.
[0205] The sequences above were then used to generate coding sequences, as is known in the art, suitable for expression in the expression system, in this case yeast. The coding sequences were then shuffled and expressed to produce a library comprising a plurality of proteins each having SEQ ID NOs: 3, 12, 21, 30, 39, 48, 57 and 65 with one transmembrane domain variant from each variant list in between the respective intracellular and extracellular domain.
[0206] The library so produced was then assayed for CXCR4 cognate ligand, SDF1a (or CCL12) on a plasmid expressed in yeast binding inside living yeast cells. Ligand binding was detected by gene activation from the yeast 2-hybrid system and samples were then sequenced. Nineteen CXCR4 variants were sequenced. The results are shown in FIG. 5.
Example 2: CXC Chemokine Receptor Type 3 Isoform b (CX3CR1)
[0207] CX3CR1 is a chemokine receptor 355 amino acids in length. It has a pI of about 6.74 and a Molecular Weight of 40396.4 Da. The subjecting of the sequence to TMHMM results in the identification of the transmembrane domains. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line), aligned with the wild type (top line):
TABLE-US-00004 MDQFPESVTENFEYDDLAEACYIGDIVVFGTVFLSIFYSVIFAIGLVGNLLVVFALTNSK |||||||||||||||||||||||||||||||||*|**||***|*|**||*****|*|||| MDQFPESVTENFEYDDLAEACYIGDIVVFGTVFQSTYYSTTYATGQTGNQQTTYAQTNSK KPKSVTDIYLLNLALSDLLFVATLPFWTHYLINEKGLHNAMCKFTTAFFFIGFFGSIFFI |||||||*|**|*|*||****||*|*||||*||||||||||||||||****|**||**** KPKSVTDTYQQNQAQSDQQYTATQPYWTHYQINEKGLHNAMCKFTTAYYYTGYYGSTYYT TVISIDRYLAIVLAANSMNNRTVQHGVTISLGVWAAAILVAAPQFMFTKQKENECLGDYP |**|*|||||||||||||||||||||*|*|*|*||||***||||*|*||||||||||||| TTTSTDRYLAIVLAANSMNNRTVQHGTTTSQGTWAAATQTAAPQYMYTKQKENECLGDYP EVLQEIWPVLRNVETNFLGFLLPLLIMSYCYFRIIQTLFSCKNHKKAKAIKLILLVVIVF ||||||||||||||||**|***|***|||||*|**||**||||||||||||********* EVLQEIWPVLRNVETNYQGYQQPQQTMSYCYYRTTQTQYSCKNHKKAKAIKQTQQTTTTY FLFWTPYNVMIFLETLKLYDFFPSCDMRKDLRLALSVTETVAFSHCCLNPLIYAFAGEKF ***|||||*|***|||||||||||||||||||||*|*|||*|*||||*||**||*||||| YQYWTPYNTMTYQETLKLYDFFPSCDMRKDLRLAQSTTETTAYSHCCQNPQTYAYAGEKF RRYLYHLYGKCLAVLCGRSVHVDFSSSESQRSRHGSVLSSNFTYHTSDGDALLLL (SEQ ID NO. 66) ||||||||||||||||||||||||||||||||||||||||||||||||||||||| RRYLYHLYGKCLAVLCGRSVHVDFSSSESQRSRHGSVLSSNFTYHTSDGDALLLL. (SEQ ID NO. 67)
[0208] The predicted pI of the protein variant is 6.74 and the Molecular Weight is 41027.17 Da. Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising the underlined Amino Acids of SEQ ID NO: 67. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences of SEQ ID NO: 66 (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in SEQ ID NO: 67 or homologous sequences retaining one, two, three or, possibly four or more of the native V, L, I and F amino acids, as set forth in SEQ ID NO: 66.
[0209] The native protein sequence for CX3CR1 was subjected to the method a second time. The program output divided the native sequence into the extracellular and intracellular regions and selected 8 transmembrane domain variants for each transmembrane domain. The results are illustrated in the following table:
TABLE-US-00005 (SEQ ID NO. 68) MDQFPESVTENFEYDDLAEACYIGDIVVFGT TM 1 Variants: (SEQ ID NO. 69) TYQSTYYSTTFATGQVGNQQVVFALTNS (SEQ ID NO. 70) TYQSTYYSTTYATGQVGNQQVVFALTNS (SEQ ID NO. 71) TYQSTYYSTTYATGQVGNQQVVFAQTNS (SEQ ID NO. 72) TYQSTYYSTTYATGQTGNLQVTFAQTNS (SEQ ID NO. 73) TYQSTYYSTTYATGQTGNQLVTFAQTNS (SEQ ID NO. 74) TYQSTYYSTTYATGQTGNQQVVFAQTNS (SEQ ID NO. 75) TYQSTYYSTTYATGQTGNLQVTYAQTNS (SEQ ID NO. 76) TYQSTYYSTTYATGQTGNQQTTYAQTNS (SEQ ID NO. 77) KKPKSVTDIY TM 2 Variants (SEQ ID NO. 78) LLNQAQSDQLFVATQPFWTHY (SEQ ID NO. 79) LLNQAQSDQQFVATQPFWTHY (SEQ ID NO. 80) QQNLAQSDQQFVATQPFWTHY (SEQ ID NO. 81) LQNLAQSDQQYTATQPFWTHY (SEQ ID NO. 82) QLNLAQSDQQYTATQPFWTHY (SEQ ID NO. 83) LLNQAQSDQQFTATQPYWTHY (SEQ ID NO. 84) QQNLAQSDQQFTATQPYWTHY (SEQ ID NO. 85) QQNQAQSDQQYTATQPYWTHY (SEQ ID NO. 86) LINEKGLHNAMCK TM3 Variant (SEQ ID NO. 87) YTTAYYYTGYYGSTYYTTTTST (SEQ ID NO. 88) DRYLAIVLAANSMNNRT TM4 Variants: (SEQ ID NO. 89) VQHGTTTSQGTWAAATQVAAPQFMF (SEQ ID NO. 90) VQHGVTTSQGTWAAATQTAAPQFMF (SEQ ID NO. 91) VQHGTTTSQGVWAAATQTAAPQFMY (SEQ ID NO. 92) VQHGTTTSQGTWAAAIQTAAPQFMY (SEQ ID NO. 93) VQHGTTTSQGTWAAATQTAAPQFMF (SEQ ID NO. 94) VQHGTTISQGTWAAATQTAAPQYMF (SEQ ID NO. 95) VQHGTTTSQGTWAAATQTAAPQFMY (SEQ ID NO. 96) TQHGTTTSQGTWAAATQTAAPQYMY (SEQ ID NO. 97) TKQKENECLGDYPEVLQEIWPVLRNVET TM5 Variants: (SEQ ID NO. 98) NFLGFQQPQQIMSYCYFRIT (SEQ ID NO. 99) NFQGFLQPQQTMSYCYFRIT (SEQ ID NO. 100) NFQGFLQPQQTMSYCYFRTT (SEQ ID NO. 101) NFQGFQQPQQTMSYCYYRIT (SEQ ID NO. 102) NFQGFLQPQQTMSYCYYRTT (SEQ ID NO. 103) NFQGYLQPQQTMSYCYFRTT (SEQ ID NO. 104) NYQGFQQPQQTMSYCYFRTT (SEQ ID NO. 105) NYQGYQQPQQTMSYCYYRTT (SEQ ID NO. 106) QTLFSCKNHKKAKAIK TM6 Variants: (SEQ ID NO. 107) LIQQTTTTFYQFWTPYNTMTFQETL (SEQ ID NO. 108) LIQQTTTTFYQYWTPYNVMTFQETQ (SEQ ID NO. 109) LIQQTTTTYYQFWTPYNTMTFQETQ (SEQ ID NO. 110) QIQQTTTTFYQYWTPYNTMTFQETQ (SEQ ID NO. 111) LTQQTTTTYYQFWTPYNTMTFQETQ (SEQ ID NO. 112) QIQQTTTTFFQYWTPYNTMTYQETQ (SEQ ID NO. 113) QIQQTTTTFYQYWTPYNTMTYQETQ (SEQ ID NO. 114) QTQQTTTTYYQYWTPYNTMTYQETQ (SEQ ID NO. 115) KLYDFFPSCDMRKDLRL TM7 Variants: (SEQ ID NO. 116) ALSVTETVAFSHCCQNPQIYAFAG (SEQ ID NO. 117) AQSVTETTAFSHCCQNPLIYAFAG (SEQ ID NO. 118) ALSVTETVAFSHCCQNPQTYAYAG (SEQ ID NO. 119) AQSVTETTAFSHCCQNPQIYAYAG (SEQ ID NO. 120) ALSVTETTAFSHCCQNPQTYAYAG (SEQ ID NO. 121) ALSTTETTAYSHCCQNPQIYAFAG (SEQ ID NO. 122) ALSVTETTAYSHCCQNPQTYAYAG (SEQ ID NO. 123) AQSTTETTAYSHCCQNPQTYAYAG (SEQ ID NO. 124) EKFRRYLYHLYGKCLAVLCGRSVHVDFSSSESQRSRHGSVLSSNFTYHTS DGDALLLL.
[0210] As in Example 1 above, that the sequences before, between and after each list of transmembrane domain variants are the N', intermediary and C' intra or extracellular regions, respectively.
[0211] The sequences above were then used to generate coding sequences, as is known in the art, suitable for expression in the expression system, in this case yeast. The coding sequences were then shuffled and expressed to produce a library comprising a plurality of proteins each having SEQ ID NOs: 68, 77, 86, 88, 97, 106, and 115 with one transmembrane domain variant from each variant list in between the respective intracellular and extracellular domain.
[0212] The library so produced was then assayed for CX3CR1 cognate ligand (CXCL1) binding in an aqueous medium, as described in Example 1. Ligand binding was detected and samples were then sequenced. Seven variants were sequenced. The results are shown in FIG. 6.
Example 3: CCR3 Variants
[0213] The method of Example 1 was repeated for Chemokine Receptor Type 3 isoform 3.
TABLE-US-00006 Name pI MW (Da) WT 8.87 43122.3 MT 8.78 43531.64
[0214] Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line), aligned with the wild type (top line):
TABLE-US-00007 MPFGIRMLLRAHKPGRSEMTTSLDTVETFGTTSYYDDVGLLCEKADTRALMAQFVPPLYS |||||||||||||||||||||||||||||||||||||||||||||||||||||||||*|| MPFGIRMLLRAHKPGRSEMTTSLDTVETFGTTSYYDDVGLLCEKADTRALMAQFVPPQYS LVFTVGLLGNVVVVMILIKYRRLRIMTNIYLLNLAISDLLFLVTLPFWIHYVRGHNWVFG ***|*|**||****|***||||||||||*|**|*|*||*****|*|*|*||||||||||| QTYTTGQQGNTTTTMTQTKYRRLRIMTNTYQQNQATSDQQYQTTQPYWTHYVRGHNWVFG HGMCKLLSGFYHTGLYSEIFFIILLTIDRYLAIVHAVFALRARTVTFGVITSIVTWGLAV ||||||||||||||||||*******|*|||*|**||**|*||||||||**||**|||*|* HGMCKLLSGFYHTGLYSETYYTTQQTTDRYQATTHATYAQRARTVTFGTTTSTTTWGQAT LAALPEFIFYETEELFEETLCSALYPEDTVYSWRHFHTLRMTIFCLVLPLLVMAICYTGI *||*||***|||||||||||||||||||||||||||||||||**|***|***||*||||* QAAQPEYTYYETEELFEETLCSALYPEDIVYSWRHFHTLRMTTYCQTQPQQTMATCYTGT IKTLLRCPSKKKYKAIRLIFVIMAVFFIFWTPYNVAILLSSYQSILFGNDCERSKHLDLV *||||||||||||||||*****||*****|||||*|***|||||||||||||||||||** TKTLLLRCPSKKKYKAIRQTYTTMATYYTYWTPYNIATQQSSYSILFGNDCERSKHLDQT MLVTEVIAYSHCCMNPVIYAFVGERFRKYLRHFFHRHLLMHLGRYIPFLPSEKLERTSSV |**||**|||||||||**||**|||||||||||||||||||||||||||||||||||||| MQTTETTAYSHCCMNPTTYAYTGERFRKYLRHFFHRHLLMHLGRYIPFLPSEKLERTSSV SPSTAEPELSIVF (SEQ ID NO. 125) ||||||||||||| SPSTAEPELSIVF (SEQ ID NO. 126)
[0215] Each of the predicted transmembrane regions have been underlined and exemplify a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising the underlined Amino Acids of SEQ ID NO: 126. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences of SEQ ID NO: 126 (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in SEQ ID NO: 126 or homologous sequences retaining one, two, three or, possibly four or more of the native V, L I and F amino acids, as set forth in SEQ ID NO: 125.
[0216] The native protein sequence for CCR3 was subjected to the method a second time (noting a difference in the N terminal sequence). The program output divided the native sequence into the extracellular and intracellular regions and selected 8 transmembrane domain variants for each transmembrane domain. The results are illustrated in the following table:
TABLE-US-00008 (SEQ ID NO. 127) MTTSLDTVETFGTTSYYDDVGLLCEKADTRALMA TM1 Variants: (SEQ ID NO. 128) QFVPPQYSQTFTTGQQGNVTVTMTQIKY (SEQ ID NO. 129) QFVPPQYSQTFTTGQQGNTTVTMTQIKY (SEQ ID NO. 130) QFVPPQYSQTYTTGQQGNTTVTMTQIKY (SEQ ID NO. 131) QFTPPQYSQTYTTGQQGNVTTTMTQIKY (SEQ ID NO. 132) QFTPPQYSQTYTTGQQGNTVTTMTQIKY (SEQ ID NO. 133) QFTPPQYSQTYTTGQQGNTTVTMTQIKY (SEQ ID NO. 134) QFTPPQYSQTYTTGQQGNTTTTMTQIKY (SEQ ID NO. 135) QYTPPQYSQTYTTGQQGNTTTTMTQTKY (SEQ ID NO. 136) RRLRIMTNIY TM2 Variants: (SEQ ID NO. 137) LLNQATSDQQFQVTQPFWIHY (SEQ ID NO. 138) LQNQAISDQLFQTTQPFWTHY (SEQ ID NO. 139) QQNLAISDQQFQTTQPFWTHY (SEQ ID NO. 140) QLNQAISDQQFQTTQPYWTHY (SEQ ID NO. 141) QQNLAISDQQYQVTQPYWTHY (SEQ ID NO. 142) LQNQATSDQLFQTTQPYWTHY (SEQ ID NO. 143) QQNQAISDQQYQVTQPYWTHY (SEQ ID NO. 144) QQNQATSDQQYQTTQPYWTHY (SEQ ID NO. 145) VRGHNWVFGHGMCK TM3 Variants: (SEQ ID NO. 146) LQSGFYHTGQYSETFFTTQQTT (SEQ ID NO. 147) QLSGFYHTGQYSETFFTTQQTT (SEQ ID NO. 148) QLSGFYHTGQYSETFYTTQQTT (SEQ ID NO. 149) QLSGFYHTGQYSETYFTTQQTT (SEQ ID NO. 150) QLSGYYHTGQYSETFFTTQQTT (SEQ ID NO. 151) QQSGFYHTGQYSETFFTTQQTT (SEQ ID NO. 152) QQSGFYHTGQYSETFYTTQQTT (SEQ ID NO. 153) QQSGYYHTGQYSETYYTTQQTT (SEQ ID NO. 154) DRYLAIVHAVFALRART TM4 Variants: (SEQ ID NO. 155) TTFGTTTSTVTWGQAVQAAQPEFIF (SEQ ID NO. 156) TTFGTTTSTTTWGQAVQAAQPEFIF (SEQ ID NO. 157) TTYGTTTSTTTWGQAVQAAQPEFIF (SEQ ID NO. 158) TTYGTTTSTTTWGQAVQAAQPEFTF (SEQ ID NO. 159) TTYGTTTSTTTWGQATQAAQPEFIF (SEQ ID NO. 160) TTFGTTTSTTTWGQATQAAQPEFIY (SEQ ID NO. 161) TTYGTTTSTTTWGQATQAAQPEFIY (SEQ ID NO. 162) TTYGTTTSTTTWGQATQAAQPEYTY (SEQ ID NO. 163) YETEELFEETLCSALYPEDTVYSWRHFHTLRM TM5 Variants: (SEQ ID NO. 164) TIFCQVQPQQTMATCYTGTT (SEQ ID NO. 165) TIFCQTQPQQVMATCYTGTT (SEQ ID NO. 166) TIFCQTQPQQTMATCYTGIT (SEQ ID NO. 167) TIFCQTQPQQTMATCYTGTI (SEQ ID NO. 168) TTFCQVQPQQVMATCYTGTT (SEQ ID NO. 169) TIYCQVQPQQVMATCYTGTT (SEQ ID NO. 170) TIFCQTQPQQTMATCYTGTT (SEQ ID NO. 171) TTYCQTQPQQTMATCYTGTT (SEQ ID NO. 172) KTLLRCPSKKKYKAIR TM 6 Variant: (SEQ ID NO. 173) QTYTTMATYYTYWTPYNTATQQSSY (SEQ ID NO. 174) QSILFGNDCERSKHLDL TM7 Variants: (SEQ ID NO. 175) VMQVTEVTAYSHCCMNPVTYAFTG (SEQ ID NO. 176) VMQVTEVTAYSHCCMNPTTYAYVG (SEQ ID NO. 177) VMLTTEVTAYSHCCMNPTTYAFTG (SEQ ID NO. 178) VMQVTETTAYSHCCMNPVTYAYTG (SEQ ID NO. 179) TMQVTETIAYSHCCMNPTTYAFTG (SEQ ID NO. 180) TMQVTETTAYSHCCMNPTTYAFVG (SEQ ID NO. 181) VMQTTETIAYSHCCMNPTTYAYTG (SEQ ID NO. 182) TMQTTETTAYSHCCMNPTTYAYTG (SEQ ID NO: 183) ERFRKYLRHFFHRHLLMHLGRYIPFLPSEKLERTSSVSPSTAEPELSIV F.
[0217] As in Example 1 above, the sequences before, between and after each list of transmembrane domain variants are the N', intermediary and C' intra or extracellular regions, respectively.
[0218] The sequences above were then used to generate coding sequences, as is known in the art, suitable for expression in the expression system, in this case yeast. The coding sequences were then shuffled and expressed to produce a library comprising a plurality of proteins each having SEQ ID NOs: 127, 136, 145, 154, 163, 172, 174 and 183 with one transmembrane domain variant from each variant list in between the respective intracellular and extracellular domain.
[0219] The library so produced was then assayed for CCR3 cognate ligand, CCL3, binding in an aqueous medium, as described in Example 1. Ligand binding was detected and samples were then sequenced. Eleven variants were sequenced. The results are shown in FIG. 7.
Example 4: CCR5 Variants
[0220] The method of Example 1 was repeated for Chemokine Receptor Type 5 isoform 3.
TABLE-US-00009 Name pI MW (Da) WT 9.21 40524.05 MT 9.06 41058.3
[0221] Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line), aligned with the wild type (top line):
TABLE-US-00010 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR |||||||||||||||||||||||||||||||||||*||*****|**|||*******|||| MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPQYSQTYTYGYTGNMQTTQTQTNCKR LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMCQLLTGLYFIGFFSGIFFII ||||||*|**|*|*||*****|*|*|||||||||||||||||**||*|**|**||***** LKSMTDTYQQNQATSDQYYQQTTPYWAHYAAAQWDFGNTMCQQQTGQYYTGYYSGTYYTT LLTIDRYLAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTCSS **|*|||||||||||||||||||*|**||**||**|**||*||***|||||||||||||| QQTTDRYLAVVHAVFALKARTVTYGTTTSTTTWTTATYASQPGTTYTRSQKEGLHYTCSS HFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTI |||||||||||||||||****|***|***|**||||**||*||||||||||||||***|* HFPYSQYQFWKNFQTLKTTTQGQTQPQQTMTTCYSGTQKTQLRCRNEKKRHRAVRQTYTT MIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQVTETLGMTHCCINPIIYAFV |**|***|||||*****||||||||||||||||||||||||||||||||||*||**||** MTTYYQYWAPYNTTQQQNTFQEFFGLNNCSSSNRLDQAMQVTETLGMTHCCTNPTTYAYT GEKFRNYLLVFFQKHIAKRFCKCCSIFQQEAPERASSVYTRSTGEQEISVGL (SEQ ID NO: 184) |||*|||*****|||||||||||||||||||||||||||||||||||||||| GEKYRNYQQTYYQKHIAKRFCKCCSIFQQEAPERASSVYTRSTGEQEISVGL. (SEQ ID NO: 185)
[0222] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising the underlined Amino Acids of SEQ ID NO: 185. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences of SEQ ID NO: 185 (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in SEQ ID NO: 185 or homologous sequences retaining one, two, three or, possibly four or more of the native V, L I and F amino acids, as set forth in SEQ ID NO: 184.
[0223] The native protein sequence for CCR5 was subjected to the method a second time (noting a difference in the N terminal sequence). The program output divided the native sequence into the extracellular and intracellular regions and selected 8 transmembrane domain variants for each transmembrane domain. The results are illustrated in the following table:
TABLE-US-00011 (SEQ ID NO. 186) MDYQVSSPIYDINYYTSEPCQKINVKQIAA TM1 Variants: (SEQ ID NO. 187) RLQPPQYSQTFTFGFTGNMQVTQTQINC (SEQ ID NO. 188) RLQPPQYSQTFTFGYTGNMQVTQTQINC (SEQ ID NO. 189) RQQPPQYSQTFTFGFTGNMQTTQTQINC (SEQ ID NO. 190) RQQPPQYSQTFTYGFTGNMQTTQTQINC (SEQ ID NO. 191) RQQPPQYSQTYTFGFTGNMQTTQTQINC (SEQ ID NO. 192) RQQPPQYSQTFTFGYTGNMQTTQTQINC (SEQ ID NO. 193) RQQPPQYSQTYTFGYTGNMQTTQTQINC (SEQ ID NO. 194) RQQPPQYSQTYTYGYTGNMQTTQTQTNC (SEQ ID NO. 195) KRLKSMTDIY TM2 Variants: (SEQ ID NO. 196) LQNQAISDQFFQQTVPFWAHY (SEQ ID NO. 197) LQNQAISDQFFQQTTPFWAHY (SEQ ID NO. 198) LQNQAISDQFFQQTTPYWAHY (SEQ ID NO. 199) LQNQAISDQFYQQTTPYWAHY (SEQ ID NO. 200) LQNQAISDQYFQQTTPYWAHY (SEQ ID NO. 201) LQNQATSDQFFQQTTPYWAHY (SEQ ID NO. 202) LQNQAISDQYYQQTTPYWAHY (SEQ ID NO. 203) QQNQATSDQYYQQTTPYWAHY (SEQ ID NO. 204) AAAQWDFGNTMCQ TM3 Variants: (SEQ ID NO. 205) QQTGQYFTGYYSGTYYTTQQTT (SEQ ID NO. 206) QQTGQYYTGYYSGTYYTTQQTT (SEQ ID NO. 207) DRYLAVVHAVFALKART TM4 Variant: (SEQ ID NO. 208) TTYGTTTSTTTWTTATYASQPGTTY (SEQ ID NO. 209) TRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKI TM5 Variants: (SEQ ID NO. 210) VIQGQVQPQQVMVTCYSGIQ (SEQ ID NO. 211) VIQGQVQPQQVMTTCYSGIQ (SEQ ID NO. 212) VIQGQVQPQQTMTTCYSGIQ (SEQ ID NO. 213) VTQGQVQPQQTMVTCYSGTQ (SEQ ID NO. 214) TIQGQVQPQQVMTTCYSGTQ (SEQ ID NO. 215) TIQGQVQPQQTMVTCYSGTQ (SEQ ID NO. 216) TTQGQVQPQQVMTTCYSGTQ (SEQ ID NO. 217) TTQGQTQPQQTMTTCYSGTQ (SEQ ID NO. 218) KTLLRCRNEKKRHRAVR TM6 Variants: (SEQ ID NO. 219) QTFTTMTTYYQFWAPYNIVQQLNTF (SEQ ID NO. 220) QTFTTMTTYYQFWAPYNTVQQLNTF (SEQ ID NO. 221) QTFTTMTTYYQYWAPYNTVQQLNTF (SEQ ID NO. 222) QTFTTMTTYYQYWAPYNTVQQQNTF (SEQ ID NO. 223) QTYTTMTTYYQYWAPYNTVQQLNTF (SEQ ID NO. 224) QTFTTMTTYYQYWAPYNTTQQLNTF (SEQ ID NO. 225) QTYTTMTTYYQYWAPYNTVQQQNTF (SEQ ID NO. 225) QTYTTMTTYYQYWAPYNTTQQQNTY (SEQ ID NO. 226) QEFFGLNNCSSSNRLDQ TM7 Variants: (SEQ ID NO. 227) AMQVTETQGMTHCCINPIIYAFVG (SEQ ID NO. 228) AMQVTETLGMTHCCTNPIIYAFTG (SEQ ID NO. 229) AMQVTETQGMTHCCINPTIYAYVG (SEQ ID NO. 230) AMQTTETQGMTHCCINPITYAFTG (SEQ ID NO. 231) AMQTTETQGMTHCCINPTIYAFTG (SEQ ID NO. 232) AMQVTETQGMTHCCTNPTIYAYVG (SEQ ID NO. 233) AMQTTETQGMTHCCINPTTYAYVG (SEQ ID NO. 234) AMQTTETQGMTHCCTNPTTYAYTG (SEQ ID NO. 235) EKFRNYLLVFFQKHIAKRFCKCCSIFQQEAPERASSVYTRSTGEQEISVG L.
[0224] As in Example 1 above, the sequences before, between and after each list of transmembrane domain variants are the N', intermediary and C' intra or extracellular regions, respectively.
[0225] The sequences above were then used to generate coding sequences, as is known in the art, suitable for expression in the expression system, in this case yeast. The coding sequences were then shuffled and expressed to produce a library comprising a plurality of proteins each having SEQ ID NO:s, 186, 195, 204, 207, 209, 218, 226, and 235 with one transmembrane domain variant from each variant list in between the respective intracellular and extracellular domain.
[0226] The library so produced was then assayed for CCR5 cognate ligand, CCL5, binding in an aqueous medium, as described in Example 1. Ligand binding was detected and samples were then sequenced. One variant was sequenced. The results are shown in FIG. 8.
Example 5: CXCR3 Variants
[0227] The method of Example 1 was repeated for the CXC chemokine receptor type 3 isoform 2. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (SEQ ID NO: 325, lower line), aligned with the wild type (SEQ ID NO: 324, top line):
TABLE-US-00012 MELRKYGPGRLAGTVIGGAAQSKSQTKSDSITKEFLPGLYTAPSSPFPPSQVSDHQVLND |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| MELRKYGPGRLAGTVIGGAAQSKSQTKSDSITKEFLPGLYTAPSSPFPPSQVSDHQVLND AEVAALLENFSSSYDYGENESDSCCTSPPCPQDFSLNFDRAFLPALYSLLFLLGLLGNGA |||||||||||||||||||||||||||||||||||||||||||||*||*****|**|||| AEVAALLENFSSSYDYGENESDSCCTSPPCPQDFSLNFDRAFLPAQYSQQYQQGQQGNGA VAAVLLSRRTALSSTDTFLLHLAVADTLLVLTLPLWAVDAAVQWVFGSGLCKVAGALFNI *||***|||||||||||***|*|*|||****|*|*||*||||||||||||||||||**|* TAATQQSRRTALSSTDTYQQHQATADTQQTQTQPQWATDAAVQWVFGSGLCKVAGAQYNT NFYAGALLLACISFDRYLNIVHATQLYRRGPPARVTLTCLAVWGLCLLFALPDFIFLSAH |*||||***||*|*||||||||||||||||||||*|*||*|*||*|***|*||****||| NYYAGAQQQACTSYDRYLNIVHATQLYRRGPPARTTQTCQATWGQCQQYAQPDYTYQSAH HDERLNATHCQYNFPQVGRTALRVLQLVAGFLLPLLVMAYCYAHILAVLLVSRGQRRLRA |||||||||||||||||||||||||||*||***|***|||||||**|***|||||||||| HDERLNATHCQYNFPQVGRTALRVLQLTAGYQQPQQTMAYCYAHTQATQQVSRGQRRLRA MRLVVVVVVAFALCWTPYHLVVLVDILMDLGALARNCGRESRVDVAKSVTSGLGYMHCCL ||*******|*|*||||||***||||||||||||||||||||||||||||||*||||||* MRQTTTTTTAYAQCWTPYHQTTLVDILMDLGALARNCGRESRVDVAKSVTSGQGYMHCCQ NPLLYAFVGVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSETSEASYSGL ||**||**|*||||||||||||||||||||||||||||||||||||||||||||| NPQQYAYTGTKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSETSEASYSGL
[0228] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in SEQ ID NO: 325 or homologous sequences retaining one, two, three or, possibly four or more of the native V, L I and F amino acids, as set forth in SEQ ID NO: 324.
[0229] As discussed above, the native protein sequence for CXCR3 was subjected to the method. The program output divided the native sequence into the extracellular and intracellular regions and selected 8 transmembrane domain variants for each transmembrane domain. The results are illustrated in the following table:
TABLE-US-00013 (SEQ ID NO. 235) MVLEVSDHQVLNDAEVAALLENFSSSYDYGENESDSCCTSPPCPQDFSLN FDR TM 1 Variants: (SEQ ID NO. 236) AFLPALYSQQFQQGQQGNGAVAATQLS (SEQ ID NO. 237) AFQPALYSQQFQQGQQGNGAVAAVQQS (SEQ ID NO. 238) AFQPAQYSQQFLQGQQGNGAVAATQQS (SEQ ID NO. 239) AYQPALYSLQYQQGQQGNGATAAVQQS (SEQ ID NO. 240) AYQPALYSQLFQQGQQGNGATAATQQS (SEQ ID NO. 241) AFQPALYSLQYQQGQQGNGATAATQQS (SEQ ID NO. 242) AYQPAQYSLQYQQGQQGNGATAAVQQS (SEQ ID NO. 243) AYQPAQYSQQYQQGQQGNGATAATQQS (SEQ ID NO. 244) RRTALSSTD TM 2 Variants: (SEQ ID NO. 245) TFLQHLAVADTQQVQTLPQWA (SEQ ID NO.: 246) TFLQHQAVADTQLVQTQPQWA (SEQ ID NO.: 247) TFQQHLAVADTQQVQTQPQWA (SEQ ID NO.: 248) TYLQHQAVADTQQVQTQPQWA (SEQ ID NO.: 249) TYQLHQAVADTQQVQTQPQWA (SEQ ID NO.: 250) TYQQHLAVADTQQVQTQPQWA (SEQ ID NO.: 251) TYQQHQAVADTQQVQTQPQWA (SEQ ID NO.: 252) TYQQHQATADTQQTQTQPQWA (SEQ ID NO.: 253) VDAAVQWVFGSGLCK TM 3 Variants: (SEQ ID NO.: 254) TAGAQYNTNFYAGAQQQACISF (SEQ ID NO.: 255) TAGAQYNTNFYAGAQLQACTSF (SEQ ID NO.: 256) TAGAQYNTNFYAGAQQLACTSF (SEQ ID NO.: 257) TAGAQFNTNYYAGAQQQACISF (SEQ ID NO.: 258) TAGAQYNTNYYAGAQQQACISF (SEQ ID NO.: 259) TAGAQYNTNYYAGAQLQACTSF (SEQ ID NO.: 260) TAGAQYNTNYYAGAQQLACTSF (SEQ ID NO.: 261) TAGAQYNTNYYAGAQQQACTSY (SEQ ID NO.: 262) DRYLNIVHATQLYRRGPPARVT TM 4 Variants: (SEQ ID NO.: 263) LTCQAVWGQCQQFAQPDFIF (SEQ ID NO.: 264) QTCQAVWGQCQQFAQPDFIF (SEQ ID NO.: 265) QTCQATWGQCQQFAQPDFIF (SEQ ID NO.: 266) QTCQATWGQCQQYAQPDFIF (SEQ ID NO.: 267) QTCQATWGQCQQFAQPDFTF (SEQ ID NO.: 268) QTCQATWGQCQQFAQPDYIF (SEQ ID NO.: 269) QTCQATWGQCQQYAQPDYIF (SEQ ID NO.: 270) QTCQATWGQCQQYAQPDYTY (SEQ ID NO.: 271) LSAHHDERLNATHCQYNFPQVGR TM 5 Variant: (SEQ ID NO.: 272) TAQRTQQQTAGYQQPQQTMAY (SEQ ID NO.: 273) CYAHILAVLLVSRGQRRLRAMR TM 6 Variants: (SEQ ID NO.: 274) QVTTTTVAFAQCWTPYHQVVQV (SEQ ID NO.: 275) QVTTTTVAFAQCWTPYHQTVQV (SEQ ID NO.: 276) QVTTTTTAFAQCWTPYHQTVQV (SEQ ID NO.: 277) QVTTTTTAYAQCWTPYHQTVQV (SEQ ID NO.: 278) QVTTTTTAFAQCWTPYHQTTQV (SEQ ID NO.: 279) QTTTTTVAFAQCWTPYHQTTQV (SEQ ID NO.: 280) QVTTTTTAYAQCWTPYHQTTQV (SEQ ID NO.: 281) QTTTTTTAYAQCWTPYHQTTQT (SEQ ID NO.: 282) DILMDLGALARNCGRESRVDV TM 7 Variants: (SEQ ID NO.: 283) AKSVTSGQGYMHCCLNPLQYAFV (SEQ ID NO.: 284) AKSVTSGQGYMHCCLNPQLYAFT (SEQ ID NO.: 285) AKSVTSGQGYMHCCLNPLQYAFT (SEQ ID NO.: 286) AKSTTSGQGYMHCCLNPQQYAFV (SEQ ID NO.: 287) AKSTTSGQGYMHCCQNPLQYAFV (SEQ ID NO.: 288) AKSTTSGQGYMHCCQNPQLYAFV (SEQ ID NO.: 289) AKSTTSGQGYMHCCQNPLQYAFT (SEQ ID NO.: 290) AKSTTSGQGYMHCCQNPQQYAYT (SEQ ID NO.: 291) GVKFRERMWMLLLRLGCPNQRGLQRQPSSSRRDSSWSETSEASYSGL.
[0230] The sequences above can be used to generate coding sequences, as is known in the art, suitable for expression in the expression system, in this case yeast. The coding sequences were then shuffled and expressed to produce a library comprising a plurality of proteins each having the intracellular and extracellular loops with one transmembrane domain variant from each variant list in between the respective intracellular and extracellular domain.
[0231] The library so produced can then be assayed for cognate ligand binding in an aqueous medium, as described in Example 1.
Example 6: CCR-1 C-C Chemokine Receptor Type 1
[0232] Example 1 was repeated for the title protein.
TABLE-US-00014 Name pI MW (Da) WT 8.38 41172.64 MT 8.31 41583.78
[0233] Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 293), aligned with the wild type (top line SEQ ID NO: 292):
TABLE-US-00015 METPNTTEDYDTTTEFDYGDATPCQKVNERAFGAQLLPPLYSLVFVIGLVGNILVVLVLV ||||||||||||||||||||||||||||||||||||*||*||*****|**||*******| METPNTTEDYDTTTEFDYGDATPCQKVNERAFGAQLQPPQYSQTYTTGQTGNTQTTQTQV QYKRLKNMTSIYLLNLAISDLLFLFTLPFWIDYKLKDDWVFGDAMCKILSGFYYTGLYSE ||||||||||*|**|*|*||*****|*|*|*||||||||||||||||**||*||||*||| QYKRLKNMTSTYQQNQATSDQQYQYTQPYWTDYKLKDDWVFGDAMCKTQSGYYYTGQYSE IFFIILLTIDRYLAIVHAVFALRARTVTFGVITSIIIWALAILASMPGLYFSKTQWEFTH *******|||||||||||||||||||*|*|**||***||*|**|||||*||||||||||| TYYTTQQTIDRYLAIVHAVFALRARTTTYGTTTSTTTWAQATQASMPGQYFSKTQWEFTH HTCSLHFPHESLREWKLFQALKLNLFGLVLPLLVMIICYTGIIKILLRRPNEKKSKAVRL ||||||||||||||||||||||||**|***|***|**||||**|***||||||||||||* HTCSLHFPHESLREWKLFQALKLNQYGQTQPQQTMTTCYTGTTKTQQRRPNEKKSKAVRQ IFVIMIIFFLFWTPYNLTILISVFQDFLFTHECEQSRHLDLAVQVTEVIAYTHCCVNPVI ****|******|||||*|***|||||||||||||||||||||*|*||**||||||*||** TYTTMTTYYQYWTPYNQTTQTSVFQDFLFTHECEQSRHLDLATQTTETTAYTHCCTNPTT YAFVGERFRKYLRQLFHRRVAVHLVKWLPFLSVDRLERVSSTSPSTGEHELSAGF ||**||||||||||||||||||||||||||||||||||||||||||||||||||| YAYTGERFRKYLRQLFHRRVAVHLVKWLPFLSVDRLERVSSTSPSTGEHELSAGF
[0234] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native V, L I and F amino acids, as set forth in the wild type sequence.
[0235] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 7: CCR-2 C-C Chemokine Receptor Type 2 Isoform A
[0236] Example 1 was repeated for the title protein. Replacing each of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 295), aligned with the wild type (top line SEQ ID NO: 294):
TABLE-US-00016 MLSTSRSRFIRNTNESGEEVTTFFDYDYGAPCHKFDVKQIGAQLLPPLYSLVFIFGFVGN ||||||||||||||||||||||||||||||||||||||||||||*||*||*****|**|| MLSTSRSRFIRNTNESGEEVTTFFDYDYGAPCHKFDVKQIGAQLQPPQYSQTYTYGYTGN MLVVLILINCKKLKCLTDIYLLNLAISDLLFLITLPLWAHSAANEWVFGNAMCKLFTGLY |******|||||||||||||**|*|*||*****|*|*|||||||||||||||||||||*| MQTTQTQINCKKLKCLTDIYQQNQATSDQQYQTTQPQWAHSAANEWVFGNAMCKLFTGQY HIGYFGGIFFIILLTIDRYLAIVHAVFALKARTVTFGVVTSVITWLVAVFASVPGIIFTK |*||*||*******|*|||||||||||||||||||*|**||**||**|**||*||***|| HTGYYGGTYYTTQQTTDRYLAIVHAVFALKARTVTYGTTTSTTTWQTATYASTPGTTYTK CQKEDSVYVCGPYFPRGWNNFHTIMRNILGLVLPLLIMVICYSGILKTLLRCRNEKKRHR |||||||||||||||||||||||||||**|***|***|**||||**||**|||||||||| CQKEDSVYVCGPYFPRGWNNFHTIMRNTQGQTQPQQTMTTCYSGTQKTQQRCRNEKKRHR AVRVIFTIMIVYFLFWTPYNIVILLNTFQEFFGLSNCESTSQLDQATQVTETLGMTHCCI |*|***|*|**|***|||||****||||||||||||||||||||||||||||*||||||* TRTTYTTMTTYYQYWTPYNTTTQLNTFQEFFGLSNCESTSQLDQATQVTETQGMTHCCT NPIIYAFVGEKFRSLFHIALGCRIAPLQKPVCGGPGVRPGKNVKVTTQGLLDGRGKGKSI ||**||**|||||||||||||||||||||||||||||||||||||||||||||||||||| NPTTYAYTGEKFRSLFHIALGCRIAPLQKPVCGGPGVRPGKNVKVTTQGLLDGRGKGKSI GRAPEASLQDKEGA |||||||||||||| GRAPEASLQDKEGA
[0237] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native V, L I and F amino acids, as set forth in the wild type sequence.
[0238] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 8: CCR-4 C-C Chemokine Receptor Type 4
[0239] Example 1 was repeated for the title protein. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 297), aligned with the wild type (top line SEQ ID NO: 296):
TABLE-US-00017 MNPTDIADTTLDESIYSNYYLYESIPKPCTKEGIKAFGELFLPPLYSLVFVFGLLGNSVV |||||||||||||||||||||||||||||||||||||||||||||||*****|**|||** MNPTDIADTTLDESIYSNYYLYESIPKPCTKEGIKAFGELFLPPLYSQTYTYGQQGNSTT VLVLFKYKRLRSMTDVYLLNLAISDLLFVFSLPFWGYYAADQWVFGLGLCKMISWMYLVG *****||||||||||*|**|*|*||*****|*|*||||||||||||||||||*||||**| TQTQYKYKRLRSMTDTYQQNQATSDQQYTYSQPYWGYYAADQWVFGLGLCKMTSWMYQTG FYSGIFFVMLMSIDRYLAIVHAVFSLRARTLTYGVITSLATWSVAVFASLPGFLFSTCYT *|||****|*||||||||||||||||||||*|||**||*||||*|**||*||***||||| YYSGTYYTMQMSIDRYLAIVHAVFSLRARTQTYGTTTSQATWSTATYASQPGYQYSTCYT ERNHTYCKTKYSLNSTTWKVLSSLEINILGLVIPLGIMLFCYSMIIRTLQHCKNEKKNKA |||||||||||||||||||||||||*|**|***|*|*|**||||**|||||||||||||| ERNHTYCKTKYSLNSTTWKVLSSLETNTQGQTTPQGTMQYCYSMTTRTLQHCKNEKKNKA VKMIFAVVVLFLGFWTPYNIVLFLETLVELEVLQDCTFERYLDYAIQATETLAFVHCCLN |||**|******|*|||||*****|||||||||||||||||||||||||||*|**|||*| VKMTYATTTQYQGYWTPYNTTQYQETLVELEVLQDCTFERYLDYAIQATETQAYTHCCQN PIIYFFLGEKFRKYILQLFKTCRGLFVLCQYCGLLQIYSADTPSSSYTQSTMDHDLHDAL |**|***||||||||||||||||||||||||||||||||||||||||||||||||||||| PTTYYYQGEKFRKYILQLFKTCRGLFVLCQYCGLLQIYSADTPSSSYTQSTMDHDLHDAL
[0240] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native V, L I and F amino acids, as set forth in the wild type sequence.
[0241] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 9: CCR-6 C-C Chemokine Receptor Type 6
[0242] Example 1 was repeated for the title protein. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 299), aligned with the wild type (top line SEQ ID NO: 298):
TABLE-US-00018 MSGESMNFSDVFDSSEDYFVSVNTSYYSVDSEMLLCSLQEVRQFSRLFVPIAYSLICVFG ||||||||||||||||||||||||||||||||||||||||||||||||||*|||**|**| MSGESMNFSDVFDSSEDYFVSVNTSYYSVDSEMLLCSLQEVRQFSRLFVPTAYSQTCTYG LLGNILVVITFAFYKKARSMTDVYLLNMAIADILFVLTLPFWAVSHATGAWVFSNATCKL **||*****|*|*|||||||||||**|||*||*****|*|*||*|||||||||||||||| QQGNTQTTTTYAYYKKARSMTDVYQQNMATADTQYTQTQPYWATSHATGAWVFSNATCKL LKGIYAINFNCGMLLLTCISMDRYIAIVQATKSFRLRSRTLPRSKIICLVVWGLSVIISS |||*||*|*||||***||*|||||*||||||||||||||||||||**|***||*|***|| LKGTYATNYNCGMQQQTCTSMDRYTAIVQATKSFRLRSRTLPRSKTTCQTTWGQSTTTSS STFVFNQKYNTQGSDVCEPKYQTVSEPIRWKLLMLGLELLFGFFIPLMFMIFCYTFIVKT ||***||||||||||||||||||||||||||||||||||**|***|*|*|**|||***|| STYTYNQKYNTQGSDVCEPKYQTVSEPIRWKLLMLGLELQYGYYTPQMYMTYCYTYTTKT LVQAQNSKRHKAIRVIIAVVLVFLACQIPHNMVLLVTAANLGKMNRSCQSEKLIGYTKTV **||||||||||||***|******|||*||||****|||||||||||||||||||||||| QTQAQNSKRHKAIRTTTATTQTYQACQTPHNMTQQTTAANLGKMNRSCQSEKLIGYTKTV TEVLAFLHCCLNPVLYAFIGQKFRNYFLKILKDLWCVRRKYKSSGFSCAGRYSENISRQT ||**|**|||*||**||**||||||||||||||||||||||||||||||||||||||||| TETQAYQHCCQNPTQYAYTGQKFRNYFLKILKDLWCVRRKYKSSGFSCAGRYSENISRQT SETADNDNASSFTM |||||||||||||| SETADNDNASSFTM
[0243] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I, V and F amino acids, as set forth in the wild type sequence.
[0244] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 10: CCR-7 C-C Chemokine Receptor Type 7 Precursor
[0245] Example 1 was repeated for the title protein. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 301), aligned with the wild type (top line SEQ ID NO: 300):
TABLE-US-00019 MDLGKPMKSVLVVALLVIFQVCLCQDEVTDDYIGDNTTVDYTLFESLCSKKDVRNFKAWF |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| MDLGKPMKSVLVVALLVIFQVCLCQDEVTDDYIGDNTTVDYTLFESLCSKKDVRNFKAWF LPIMYSIICFVGLLGNGLVVLTYIYFKRLKTMTDTYLLNLAVADILFLLTLPFWAYSAAK ||*|||**|**|**|||****||*||||||||||||**|*|*||*****|*|*||||||| LPTMYSTTCYTGQQGNGQTTQTYTYFKRLKTMTDTYQQNQATADTQYQQTQPYWAYSAAK SWVFGVHFCKLIFAIYKMSFFSGMLLLLCISIDRYVAIVQAVSAHRHRARVLLISKLSCV ||||||||||***|*||||**|||****|*||||||||||||||||||||****||*||* SWVFGVHFCKQTYATYKMSYYSGMQQQQCTSIDRYVAIVQAVSAHRHRARTQQTSKQSCT GIWILATVLSIPELLYSDLQRSSSEQAMRCSLITEHVEAFITIQVAQMVIGFLVPLLAMS |*|**||**|*|||||||||||||||||||||||||||||||||||||**|***|**||| GTWTQATTQSTPELLYSDLQRSSSEQAMRCSLITEHVEAFITIQVAQMTTGYQTPQQAMS FCYLVIIRTLLQARNFERNKAIKVIIAVVVVFIVFQLPYNGVVLAQTVANFNITSSTCEL *||****||**||||||||||||***|********|*||||***|||||||||||||||| YCYQTTTRTQQQARNFERNKAIKTTTATTTTYTTYQQPYNGTTQAQTVANFNITSSTCEL SKQLNIAYDVTYSLACVRCCVNPFLYAFIGVKFRNDLFKLFKDLGCLSQEQLRQWSSCRH |||||||||*|||*||*|||*||**||*|||||||||||||||||||||||||||||||| SKQLNIAYDTTYSQACTRCCTNPYQYAYIGVKFRNDLFKLFKDLGCLSQEQLRQWSSCRH IRRSSMSVEAETTTTFSP |||||||||||||||||| IRRSSMSVEAETTTTFSP
[0246] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V, and F amino acids, as set forth in the wild type sequence.
[0247] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 11: CCR-8 C-C Chemokine Receptor Type 8
[0248] Example 1 was repeated for the title protein. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO.:303), aligned with the wild type (top line SEQ ID NO. 302):
TABLE-US-00020 MDYTLDLSVTTVTDYYYPDIFSSPCDAELIQTNGKLLLAVFYCLLFVFSLLGNSLVILVL |||||||||||||||||||||||||||||||||||||||**||*****|**|||****** MDYTLDLSVTTVTDYYYPDIFSSPCDAELIQTNGKLLLATYYCOOYTYSOOGNSOTTOTO VVCKKLRSITDVYLLNLALSDLLFVFSFPFQTYYLLDQWVFGTVMCKVVSGFYYIGFYSS **|||||||||||**|*|*||*****|*I*||||**|||||||||||||||*||*|*||| TTCKKLRSITDVYQQNOAQSDQQYTYSYPYOTYYQQDQWVFGTVMCKVVSGYYYTGYYSS MFFITLMSVDRYLAVVHAVYALKVRTIRMGTTLCLAVWLTAIMATIPLLVFYQVASEDGV |***|*||*|||||||||||||||||||||||||*|*|*||*|||*|****||*|||||| MYYTTQMSTDRYLAVVHAVYALKVRTIRMGTTLCQATWQTATMATTPQQTYYQTASEDGV LQCYSFYNQQTLKWKIFTNFKMNILGLLIPFTIFMFCYIKILHQLKRCQNHNKTKAIRLV |||||||||||||||**||*|||**|***|*|**|*||||||||||||||||||||||** LQCYSFYNQQTLKWKTYTNYKMNTQGQQTPYTTYMYCYIKILHQLKRCQNHNKTKAIRQT LIVVIASLLFWVPFNVVLFLTSLHSMHILDGCSISQQLTYATHVTEIISFTHCCVNPVIY *****||***|*|*|*****||||||||||||||||||||||||||**|*||||*||**| QTTTTASQQYWTPYNTTQYQTSLHSMHILDGCSISQQLTYATHVTETTSYTHCCTNPTTY AFVGEKEKKHLSEIFQKSCSQIENYLGRQMPRESCEKSSSCQQHSSRSSSVDYIL |**|||||||||||||||||||||||||||||||||||||||||||||||||||| AYTGEKEKKHLSEIFQKSCSQIENYLGRQMPRESCEKSSSCQQHSSRSSSVDYIL
[0249] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V, and F amino acids, as set forth in the wild type sequence.
[0250] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 12: CCR-9 C-C Chemokine Receptor Type 9 Isoform B
[0251] Example 1 was repeated for the title protein. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 305), aligned with the wild type (top line SEQ ID NO: 304):
TABLE-US-00021 MADDYGSESTSSMEDYVNFNFTDFYCEKNNVRQFASHFLPPLYWLVFIVGALGNSLVILV |||||||||||||||||||||||||||||||||||||||||*||*****||*|||***** MADDYGSESTSSMEDYVNFNFTDFYCEKNNVRQFASHFLPPQYWQTYTTGAQGNSQTTQT YWYCTRVKTMTDMFLLNLAIADLLFLVTLPFWAIAAADQWKFQTFMCKVVNSMYKMNFYS |||||||||||||***|*|*||*****|*|*||*|||||||||||||||||||||||*|| YWYCTRVKTMTDMYQQNQATADQQYQTTQPYWATAAADQWKFQTFMCKVVNSMYKMNYYS CVLLIMCISVDRYIAIAQAMRAHTWREKRLLYSKMVCFTIWVLAAALCIPEILYSQIKEE |****||*I*|||*I*|||||||||||||**||||*I*|*|**|||*|*||||||||||| CTQQTMCTSTDRYTATAQAMRAHTWREKRQQYSKMTCYTTWTQAAAQCTPEILYSQIKEE SGIAICTMVYPSDESTKLKSAVLTLKVILGFFLPFVVMACCYTIIIHTLIQAKKSSKHKA |||||||||||||||||||||||||||**|***|***||||||***||**|||||||||| SGIAICTMVYPSDESTKLKSAVLTLKVTQGYYQPYTTMACCYTTITHTQTQAKKSSKHKA LKVTITVLTVFVLSQFPYNCILLVQTIDAYAMFISNCAVSTNIDICFQVTQTIAFFHSCL ||*I*|**|****||*||||****||||||||||||||||||||||*|*|||*|**|||* LKTTTTTQTTYTQSQYPYNCTQQTQTIDAYAMFISNCAVSTNIDICYQTTQTTAYYHSCQ NPVLYVFVGERFRRDLVKTLKNLGCISQAQWVSFTRREGSLKLSSMLLETTSGALSL ||**|***||||||||||||||||||||||||||||||||||||||||||||||||| NPTQYTYTGERFRRDLVKTLKNLGCISQAQWVSFTRREGSLKLSSMLLETTSGALSL
[0252] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V, and F amino acids, as set forth in the wild type sequence.
[0253] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 13: CCR-10 C-C Chemokine Receptor Type 10
[0254] Example 1 was repeated for the title protein. Replacing each of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 307), aligned with the wild type (top line SEQ ID NO: 306):
TABLE-US-00022 MGTEATEQVSWGHYSGDEEDAYSAEPLPELCYKADVQAFSRAFQPSVSLTVAALGLAGNG ||||||||||||||||||||||||||||||||||||||||||||||*|*|*||*|*|||| MGTEATEQVSWGHYSGDEEDAYSAEPLPELCYKADVQAFSRAFQPSTSQTTAAQGQAGNG LVLATHLAARRAARSPTSAHLLQLALADLLLALTLPFAAAGALQGWSLGSATCRTISGLY ***|||*|||||||||||||**|*I*||***|*|*|*|||||*|||||||||||||||*| QTQATHQAARRAARSPTSAHQQQQAQADQQQACTQPYAAAGAQQGWSLGSATCRTISGQY SASFHAGELFLACISADRYVAIARALPAGPRPSTPGRAHLVSVIVWLLSLLLALPALLFS |||*|||****||*|||||||||||||||||||||||||**|***|**|***|*||***| SASYHAGYQYQACTSADRYVAIARALPAGPRPSTPGRAHQTSTTTWQQSQQQAQPAQQYS QDGQREGQRRCRLIFPEGLTQTVKGASAVAQVALGFALPLGVMVACYALLGRTLLAARGP ||||||||||||||||||||||||||||*||*|*|*|*|*|*|*||||**|||||||||| QDGQREGQRRCRLIFPEGLTQTVKGASATAQTAQGYAQPQGTMTACYAQQGRTLLAARGP ERRRALRVVVALVAAFVVLQLPYSLALLLDTADLLAARERSCPASKRKDVALLVTSGLAL |||||||***|**||****|*|||*|***||||||||||||||||||||*|***|||*|* ERRRALRTTTAQTAAYTTQQQPYSQAQQQDTADLLAARERSCPASKRKDTAQQTTSGQAQ ARCGLNPVLYAFLGLRFRQDLRRLLRGGSCPSGPQPRRGCPRRPRLSSCSAPTETHSLSW ||||*||**||**||||||||||||||||||||||||||||||||||||||||||||||| ARCGQNPTQYAYQGLRFRQDLRRLLRGGSCPSGPQPRRGCPRRPRLSSCSAPTETHSLSW DN || DN
[0255] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V and F amino acids, as set forth in the wild type sequence. The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 14: CXCR1 Chemokine Receptor Type 1
[0256] Example 1 was repeated for the title protein. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 309), aligned with the wild type (top line SEQ ID NO: 308):
TABLE-US-00023 MSNITDPQMWDFDDLNFTGMPPADEDYSPCMLETETLNKYVVIIAYALVFLLSLLGNSLV ||||||||||||||||||||||||||||||||||||||||****|||*****|**|||** MSNITDPQMWDFDDLNFTGMPPADEDYSPCMLETETLNKYTTTTAYAQTYQQSQQGNSQT MLVILYSRVGRSVTDVYLLNLALADLLFALTLPIWAASKVNGWIFGTFLCKVVSLLKEVN |****||||||||||*|**|*|*||***|*|*|*|||||||||||||||||||||||||| MQTTQYSRVGRSVTDTYQQNQAQADQQYAQTQPTWAASKVNGWIFGTFLCKVVSLLKEVN FYSGILLLACISVDRYLAIVHATRTLTQKRHLVKFVCLGCWGLSMNLSLPFFLFRQAYHP *|||****||*|*|||*|**|||||||||||**|**|*||||*|||*|*|****|||||| YYSGTQQQACTSTDRYQATTHATRTLTQKRHQTKYTCQGCWGQSMNQSQPYYQYRQAYHP NNSSPVCYEVLGNDTAKWRMVLRILPHTFGFIVPLFVMLFCYGFTLRTLFKAHMGQKHRA ||||||||||||||||||||||||||||*|***|***|**|||*|*||**|||||||||| NNSSPVCYEVLGNDTAKWRMVLRILPHTYGYTTPQYTMQYCYGYTQRTQYKAHMGQKHRA MRVIFAVVLIFLLCWLPYNLVLLADTLMRTQVIQESCERRNNIGRALDATEILGFLHSCL ||***|*******||*|||***||||||||||||||||||||||||||||||*|**|||* MRTTYATTQTYQQCWQPYNQTQLADTLMRTQVIQESCERRNNIGRALDATEIQGYQHSCQ NPIIYAFIGQNFRHGFLKILAMHGLVSKEFLARHRVTSYTSSSVNVSSNL ||**||**|||||||||||||||||||||||||||||||||||||||||| NPTTYAYTGQNFRHGFLKILAMHGLVSKEFLARHRVTSYTSSSVNVSSNL
[0257] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V and F amino acids, as set forth in the wild type sequence.
[0258] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 15: CXR Chemokine Receptor 1 CXR1
[0259] Example 1 was repeated for the title protein. Replacing each of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 311), aligned with the wild type (top line SEQ ID NO: 310):
TABLE-US-00024 MESSGNPESTTFFYYDLQSQPCENQAWVFATLATTVLYCLVFLLSLVGNSLVLWVLVKYE |||||||||||||||||||||||||||||||||||**||*****|**|||***|**|||| MESSGNPESTTFFYYDLQSQPCENQAWVFATLATTTQYCQTYQQSQTGNSQTQWTQVKYE SLESLTNIFILNLCLSDLVFACLLPVWISPYHWGWVLGDFLCKLLNMIFSISLYSSIFFL |||||||****|*|*||***||**|*|*||||||||||||||||||||||*|*|||**** SLESLTNTYTQNQCQSDQTYACQQPTWISPYHWGWVLGDFLCKLLNMIFSTSQYSSTYYQ TIMTIHRYLSVVSPLSTLRVPTLRCRVLVTMAVWVASILSSILDTIFHKVLSSGCDYSEL |*||*|||*|**||||||||||||||***|||*|*||**||**||**||||||||||||| TTMTTHRYQSTTSPLSTLRVPTLRCRTQTTMATWTASTQSSTQDTTYHKVLSSGCDYSEL TWYLTSVYQHNLFFLLSLGIILFCYVEILRTLFRSRSKRRHRTVKLIFAIVVAYFLSWGP ||||||*||||*****|*|****||*|**||||||||||||||||***|***||**|||| TWYLTSTYQHNQYYQQSQGTTQYCYTETQRTLFRSRSKRRHRTVKQTYATTTAYYQSWGP YNFTLFLQTLFRTQIIRSCEAKQQLEYALLICRNLAFSHCCFNPVLYVFVGVKFRTHLKH ||*|***|||||||||||||||||||||***|||*|*||||*||**|***|||||||||| YNYTQYQQTLFRTQIIRSCEAKQQLEYAQQTCRNQAYSHCCYNPTQYTYTGVKFRTHLKH VLRQFWFCRLQAPSPASIPHSPGAFAYEGASFY ||||||||||||||||||||||||||||||||| VLRQFWFCRLQAPSPASIPHSPGAFAYEGASFY
[0260] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native V, L I and F amino acids, as set forth in the wild type sequence.
[0261] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 16: CXCR2 Chemokine Receptor Type 2
[0262] Example 1 was repeated for the title protein. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 313), aligned with the wild type (top line SEQ ID NO: 312):
TABLE-US-00025 MEDFNMESDSFEDFWKGEDLSNYSYSSTLPPFLLDAAPCEPESLEINKYFVVIIYALVFL ||||||||||||||||||||||||||||||||||||||||||||||||||****||**** MEDFNMESDSFEDFWKGEDLSNYSYSSTLPPFLLDAAPCEPESLEINKYFTTTTYAQTYQ LSLLGNSLVMLVILYSRVGRSVTDVYLLNLALADLLFALTLPIWAASKVNGWIFGTFLCK *|**|||**|***|||||||||||*|**|*|*||***|*|*|*||||||||||||||||| QSQQGNSQTMQTTLYSRVGRSVTDTYQQNQAQADQQYAQTQPTWAASKVNGWIFGTFLCK VVSLLKEVNFYSGILLLACISVDRYLAIVHATRTLTQKRYLVKFICLSIWGLSLLLALPV |||||||*|*|||****||*|*|||*|**|||||||||||**|**|*|*||*|***|*|* VVSLLKETNYYSGTQQQACTSTDRYQATTHATRTLTQKRYQTKYTCQSTWGQSQQQAQPT LLFRRTVYSSNVSPACYEDMGNNTANWRMLLRILPQSFGFIVPLLIMLFCYGFTLRTLFK ***||||||||||||||||||||||||||||||||||*|***|***|**|||*|*||**| QQYRRTVYSSNVSPACYEDMGNNTANWRMLLRILPQSYGYTTPQQTMQYCYGYTQRTQYK AHMGQKHRAMRVIFAVVLIFLLCWLPYNLVLLADTLMRTQVIQETCERRNHIDRALDATE |||||||||||***|*******||*|||***||||||||||||||||||||||||||||| AHMGQKHRAMRTTYATTQTYQQCWQPYNQTQLADTLMRTQVIQETCERRNHIDRALDATE ILGILHSCLNPLIYAFIGQKFRHGLLKILAIHGLISKDSLPKDSRPSFVGSSSGHTSTTL **|**|||*||**||**||||||||||||||||||||||||||||||||||||||||||| TQGTQHSCQNPQTYAYTGQKFRHGLLKILAIHGLISKDSLPKDSRPSFVGSSSGHTSTTL
[0263] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V and F amino acids, as set forth in the wild type sequence.
[0264] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 17: CCR-10 C-C Chemokine Receptor Type 10
[0265] Example 1 was repeated for the title protein. Replacing each of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 315), aligned with the wild type (top line SEQ ID NO: 314):
TABLE-US-00026 MNYPLTLEMDLENLEDLFWELDRLDNYNDTSLVENHLCPATEGPLMASFKAVFVPVAYSL |||||||||||||||||||||||||||||||||||||||||||||||||||||*|*|||* MNYPLTLEMDLENLEDLFWELDRLDNYNDTSLVENHLCPATEGPLMASFKAVFTPTAYSQ IFLLGVIGNVLVLVILERHRQTRSSTETFLFHLAVADLLLVFILPFAVAEGSVGWVLGTF ****|**||*******||||||||||||***|*|*||*******|*|*|||||||||||| TYQQGTTGNTQTQTTQERHRQTRSSTETYQYHQATADQQQTYTQPYATAEGSVGWVLGTF LCKTVIALHKVNFYCSSLLLACIAVDRYLAIVHAVHAYRHRRLLSIHITCGTIWLVGFLL |||||*|*||*|*||||***||*|*||||||||||||||||||||*|*||||*|**|*** LCKTVTAQHKTNYYCSSQQQACTATDRYLAIVHAVHAYRHRRLLSTHTTCGTTWQTGYQQ ALPEILFAKVSQGHHNNSLPRCTFSQENQAETHAWFTSRFLYHVAGFLLPMLVMGWCYVG |*||***||||||||||||||||||||||||||||||||**||*||***||**|||||*| AQPETQYAKVSQGHHNNSLPRCTFSQENQAETHAWFTSRYQYHTAGYQQPMQTMGWCYTG VVHRLRQAQRRPQRQKAVRVAILVTSIFFLCWSPYHIVIFLDTLARLKAVDNTCKLNGSL **|||||||||||||||*|*|***||****||||||****|||||||||||||||||||* TTHRLRQAQRRPQRQKATRTATQTTSTYYQCWSPYHTTTYLDTLARLKAVDNTCKLNGSQ PVAITMCEFLGLAHCCLNPMLYTFAGVKFRSDLSRLLTKLGCTGPASLCQLFPSWRRSSL |*|*||||**|*||||*|||*||||||||||||||||||||||||||||||||||||||| PTATTMCEYQGQAHCCQNPMQYTFAGVKFRSDLSRLLTKLGCTGPASLCQLFPSWRRSSL SESENATSLTTF |||||||||||| SESENATSLTTF
[0266] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V and F amino acids, as set forth in the wild type sequence.
[0267] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 18: CXCR6 Chemokine Receptor Type 6
[0268] Example 1 was repeated for the title protein. Replacing each of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 317), aligned with the wild type (top line SEQ ID NO: 316):
TABLE-US-00027 MAEHDYHEDYGFSSFNDSSQEEHQDFLQFSKVFLPCMYLVVFVCGLVGNSLVLVISIFYH ||||||||||||||||||||||||||||||||||||||*****||**|||*****|**|| MAEHDYHEDYGFSSFNDSSQEEHQDFLQFSKVFLPCMYQTTYTCGQTGNSQTQTTSTYYH KLQSLTDVFLVNLPLADLVFVCTLPFWAYAGIHEWVFGQVMCKSLLGIYTINFYTSMLIL |||||||****|*|*||****||*|*||||||||||||||||||||||||*|*||||*** KLQSLTDTYQTNQPQADQTYTCTQPYWAYAGIHEWVFGQVMCKSLLGIYTTNYYTSMQTQ TCITVDRFIVVVKATKAYNQQAKRMTWGKVTSLLIWVISLLVSLPQIIYGNVFNLDKLIC ||*|*||*****||||||||||||||||||||***|**|***|*||**|||**|*||||| TCTTTDRYTTTTKATKAYNQQAKRMTWGKVTSQQTWTTSQQTSQPQTTYGNTYNQDKLIC GYHDEAISTVVLATQMTLGFFLPLLTMIVCYSVIIKTLLHAGGFQKHRSLKIIFLVMAVF |||||||||***|||||*|***|**||**||||||||||||||||||||||*****||** GYHDEAISTTTQATQMTQGYYQPQQTMTTCYSVIIKTLLHAGGFQKHRSLKTTYQTMATY LLTQMPFNLMKFIRSTHWEYYAMTSFHYTIMVTEAIAYLRACLNPVLYAFVSLKFRKNFW **||||*|*||**||||||||||||||||*|*|||*||*|||*||**||**||||||||| QQTQMPYNQMKYTRSTHWEYYAMTSFHYTTMTTEATAYQRACQNPTQYAYTSLKFRKNFW KLVKDIGCLPYLGVSHQWKSSEDNSKTFSASHNVEATSMFQL |||||||||||||||||||||||||||||||||||||||||| KLVKDIGCLPYLGVSHQWKSSEDNSKTFSASHNVEATSMFQL
[0269] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V and F amino acids, as set forth in the wild type sequence.
[0270] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 19: CXCR7 Chemokine Receptor Type 7
[0271] Example 1 was repeated for the title protein. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 319), aligned with the wild type (top line SEQ ID NO: 318):
TABLE-US-00028 MDLHLFDYSEPGNFSDISWPCNSSDCIVVDTVMCPNMPNKSVLLYTLSFIYIFIFVIGMI ||||||||||||||||||||||||||||||||||||||||||||||*|**|******||* MDLHLFDYSEPGNFSDISWPCNSSDCIVVDTVMCPNMPNKSVLLYTQSYTYTYTYTTGMT ANSVVVWVNIQAKTTGYDTHCYILNLAIADLWVVLTIPVWVVSLVQHNQWPMGELTCKVT |||***|*||||||||||||||**|*|*||*|***|*|*|**|*||||||||||||||*| ANSTTTWTNIQAKTTGYDTHCYTQNQATADQWTTQTTPTWTTSQVQHNQWPMGELTCKTT HLIFSINLFGSIFFLTCMSVDRYLSITYFTNTPSSRKKMVRRVVCILVWLLAFCVSLPDT |***|*|**||****||||*|||||||||||||||||||*||**|***|**|*|*|*||| HQTYSTNQYGSTYYQTCMSTDRYLSITYFTNTPSSRKKMTRRTTCTQTWQQAYCTSQPDT YYLKTVTSASNNETYCRSFYPEHSIKEWLIGMELVSVVLGFAVPFSIIAVFYFLLARAIS |||||||||||||||||||||||||||||||||**|***|*|*|*|**|**|***||||| YYLKTVTSASNNETYCRSFYPEHSIKEWLIGMEQTSTTQGYATPYSTTATYYYQQARAIS ASSDQEKHSSRKIIFSYVVVFLVCWLPYHVAVLLDIFSILHYIPFTCRLEHALFTALHVT ||||||||||||||*||******||*|||*|***|**|||||||||||||||||||*|*| ASSDQEKHSSRKIIYSYTTTYQTCWQPYHTATQQDTYSILHYIPFTCRLEHALFTAQHTT QCLSLVHCCVNPVLYSFINRNYRYELMKAFIFKYSAKTGLTKLIDASRVSETEYSALEQS ||*|**|||*||**||**|||||||||||||||||||||||||||||||||||||||||| QCQSQTHCCTNPTQYSYTNRNYRYELMKAFIFKYSAKTGLTKLIDASRVSETEYSALEQS TK || TK
[0272] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V and F amino acids, as set forth in the wild type sequence.
[0273] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 20: CLR-1a Chemokine Like Receptor 1 Isoform a
[0274] Example 1 was repeated for the title protein. Replacing all or substantially all of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 321), aligned with the wild type (top line SEQ ID NO: 320):
TABLE-US-00029 MRMEDEDYNTSISYGDEYPDYLDSIVVLEDLSPLEARVTRIFLVVVYSIVCFLGILGNGL ||||||||||||||||||||||||||||||||||||||||******||**|**|**|||* MRMEDEDYNTSISYGDEYPDYLDSIVVLEDLSPLEARVTRTYQTTTYSTTCYQGTQGNGQ VIIIATFKMKKTVNMVWFLNLAVADFLFNVFLPIHITYAAMDYHWVFGTAMCKISNFLLI ***||||||||||||*|**|*|*||***|***|*|*|||||||||||||||||||||*** TTTIATFKMKKTVNMTWYQNQATADYQYNTYQPTHTTYAAMDYHWVFGTAMCKISNFQQT HNMFTSVFLLTIISSDRCISVLLPVWSQNHRSVRLAYMACMVIWVLAFFLSSPSLVFRDT |||*||****|**|||||||||||||||||||||*||||||**|**|***||||***||| HNMYTSTYQQTTTSSDRCISVLLPVWSQNHRSVRQAYMACMTTWTQAYYQSSPSQTYRDT ANLHGKISCFNNFSLSTPGSSSWPTHSQMDPVGYSRHMVVTVTRFLCGFLVPVLIITACY ||||||||||||||||||||||||||||||||||||||||||||**||***|****|||| ANLHGKISCFNNFSLSTPGSSSWPTHSQMDPVGYSRHMVVTVTRYQCGYQTPTQTTTACY LTIVCKLQRNRLAKTKKPFKIIVTIIITFFLCWCPYHTLNLLELHHTAMPGSVFSLGLPL *|**||*|||||||||||*|***|***|***|||||||*|*||||||||||||||*|*|* QTTTCKQQRNRLAKTKKPYKTTTTTTTTYYQCWCPYHTQNQLELHHTAMPGSVFSQGQPQ ATALAIANSCMNPILYVFMGQDFKKFKVALFSRLVNALSEDTGHSSYPSHRSFTKMSSMN |||*|*|||||||**|**|||||||||||||||||||||||||||||||||||||||||| ATAQATANSCMNPTQYTYMGQDFKKFKVALFSRLVNALSEDTGHSSYPSHRSFTKMSSMN ERTSMNERETGML ||||||||||||| ERTSMNERETGML
[0275] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I V and F amino acids, as set forth in the wild type sequence.
[0276] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 21: DARIA Duffy Antigen/Chemokine Receptor Isoform a
[0277] Example 1 was repeated for the title protein. Replacing each of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 323), aligned with the wild type (top line SEQ ID NO: 322):
TABLE-US-00030 MASSGYVLQAELSPSTENSSQLDFEDVWNSSYGVNDSFPDGDYGANLEAAAPCHSCNLLD |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| MASSGYVLQAELSPSTENSSQLDFEDVWNSSYGVNDSFPDGDYGANLEAAAPCHSCNLLD DSALPFFILTSVLGILASSTVLFMLFRPLFRWQLCPGWPVLAQLAVGSALFSIVVPVLAP |||*|****||**|**||||***|*||||||||||||||**||*|*|||**|***|**|| DSAQPYYTQTSTQGTQASSTTQYMQFRPLFRWQLCPGWPTQAQQATGSAQYSTTTPTQAP GLGSTRSSALCSLGYCVWYGSAFAQALLLGCHASLGHRLGAGQVPGLTLGLTVGIWGVAA ||||||||||||||||*|||||*|||***|||||*|||||||||||||*|*|*|*||*|| GLGSTRSSALCSLGYCTWYGSAYAQAQQQGCHASQGHRLGAGQVPGLTQGQTTGTWGTAA LLTLPVTLASGASGGLCTLIYSTELKALQATHTVACLAIFVLLPLGLFGAKGLKKALGMG **|*I*|*|||||||||||||||||||||||||*||*|*****|*|**||||*||||||| QQTQPTTQASGASGGLCTLIYSTELKALQATHTTACQATYTQQPQGQYGAKGQKKALGMG PGPWMNILWAWFIFWWPHGVVLGLDFLVRSKLLLLSTCLAQQALDLLLNLAEALAILHCV ||||||**|||***|||||***|*|***|||||||||||||||||||*I*|||*|**||* PGPWMNTQWAWYTYWWPHGTTQGQDYQTRSKLLLLSTCLAQQALDLLQNQAEAQATQHCT ATPLLLALFCHQATRTLLPSLPLPEGWSSHLDTLGSKS |||***|**||||||||||||||||||||||||||||| ATPQQQAQYCHQATRTLLPSLPLPEGWSSHLDTLGSKS
[0278] Each of the predicted transmembrane regions has been underlined and exemplified a fully modified domain of the invention. Thus, for example, the invention includes a transmembrane domain comprising each underlined domain. Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native L, I, V and F amino acids, as set forth in the wild type sequence. The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 22: CD81 Antigen
[0279] CD81 may play an important role in the regulation of lymphoma cell growth and interacts with a 16 kDa Leu-13 protein to form a complex possibly involved in signal transduction.CD81 may act as a viral receptor for HCV.
[0280] Example 1 was repeated for the title protein. Replacing each of the hydrophobic amino acids, L, I V, and F, with Q, T and Y (respectively) within the transmembrane domains results in the following sequence (lower line SEQ ID NO: 325), aligned with the wild type (top line SEQ ID NO: 324):
TABLE-US-00031 WT: 1 MGVEGCTKCIKYLLFVFNFVFWLAGGVILGVALWLRHDPQTTNLLYLELGDKPAPNTFYV ||||||||||||*****|***|*|||***|*|*||||||||||||||||||||||||||| MT: 1 MGVEGCTKCIKYQQYTYNYTYWQAGGTTQGTAQWLRHDPQTTNLLYLELGDKPAPNTFYV WT: 61 GIYILIAVGAVMMFVGFLGCYGAIQESQCLLGTFFTCLVILFACEVAAGIWGFVNKDQIA |||***|*||*||**|**|||||*|||||**||**||*****|||*|||*|||||||||| MT: 61 GTYTQTATGATMMYTGYQGCYGATQESQCQQGTYYTCQTTQYACETAAGTWGFVNKDQIA WT: 121 KDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSVLKNNLCPSGSK |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| MT: 121 KDVKQFYDQALQQAVVDDDANNAKAVVKTFHETLDCCGSSTLTALTTSVLKNNLCPSGSK WT: 181 IISNLFKEDCHQKIDDLFSGKLYLIGIAAIVVAVIMIFEMILSMVLCCGIRNSSVY |||||||||||||||||||||*|**|*||***|**|**||**|||||||||||||| MT: 181 TTSNLFKEDCHQKIDDLFSGKQYQTGTAATTTATTMTYEMTQSMVLCCGIRNSSVY
[0281] The predicted transmembrane regions exemplify modified domains of the invention and include (SEQ ID NOs: 326, 327, 328, 329, 330, 331, 332, 333, respectively):
TABLE-US-00032 TM1-wt: LFVFNFVFWLAGGVILGVALW ****|***|*|||***|*|*| TM1-mt: QYTYNYTYWQAGGTTQGTAQW TM2-wt: LIAVGAVMMFVGFLGCYGAIQ **|*||*||**|**|||||*| TM2-mt: QTATGATMMYTGYQGCYGATQ TM3-wt: LGTFFTCLVILFACEVAAGIWGF *||**||*****|||*|||*||| TM3-mt: QGTYYTCQTTQYACETAAGTWGF
[0282] Thus, for example, the invention includes a transmembrane domain comprising each modified or "mt" domain Preferably the protein comprising TM1 herein includes one or more (e.g., all) of the extracellular and intracellular loop sequences (the sequences which have not been underlined). In addition or alternatively, the protein comprising the TM1 herein includes one or more additional transmembrane regions (the underlined sequences) in the depicted protein or homologous sequences retaining one, two, three or, possibly four or more of the native V, L I and F amino acids, as set forth in the wild type sequence.
[0283] The wild type sequence can be subject to the process as discussed above to select additional transmembrane domain variants as described in Example 1. Coding sequences can be designed, shuffled and proteins expressed. The expressed proteins can be assayed for ligand binding, as described herein.
Example 23: E. coli Expression of QTY Variants and a CXCR4-QTY Variant
[0284] 1. Large-Scale Production of CXCR4-QTY in E. coli BL21 (DE3)
[0285] A water-soluble GPCR CXCR4 was produced it in E. coli with a yield estimated to be .about.20 mg purified protein per liter of routine LB culture media. The estimated cost of production is about $0.25 per milligram. Advantageously, this approach can be used to easily obtain grams of quantity of water-soluble GPCRs, which in turn can facilitate their structural determinations.
2. Determining where the Water-Soluble CXCR4-QTY is Produced in E. coli Cells
[0286] A water-soluble CXCR4-QTY was cloned into pET vector. We first carried out a small-scale E. coli culture study to assess the location of produced CXCR4-QTY protein (150 ml culture). After culturing the cells, induced with IPTG at 24.degree. C. for 4 hours, we collected and sonicated the cells and divided into 2 fractions through centrifugation at 14,637.times.g (12,000 rmp). We then used Western blot analysis of the specific anti-rho-tag monoclonal antibody to detect where the CXCR4-QTY protein was. We observed that the CXCR4-QTY protein was in the supernatant fraction and no protein was observed in the pellet fraction, thus suggesting the protein is fully water-soluble.
3. The Estimated Yield CXCR4-QTY Produced in Soluble Fraction of E. coli Cells
[0287] We then carried out another 150 ml culture and obtained .about.6 mg 1D4 monoclonal antibody-purified CXCR4-QTY. Because we under-estimated the yield (we did not anticipate the surprisingly high yield), we did not use enough affinity 1D4 rho-tag monoclonal antibody beads to capture the produced CXCR4-QTY. Thus, a significant amount CXCR4-QTY protein did not bind to the beads due to the fact that not enough beads were added during purification, and the protein was in the flow-through lane and was further washed out. Despite the significant loss, we are still able to obtain .about.6 mg for the 150 ml culture as seen from the lanes 8-10 (Elution fractions).
4. Measuring the Thermo-Stability of Purified Water-Soluble CXCR4-QTY Protein
[0288] In most cases, structure determines function in proteins. Thus it is important to know if the purified CXCR4-QTY protein produced in E. coli still folds correctly in the typical alpha-helical structure with .about.50% alpha-helix. We performed secondary structural measurement using Circular dichroism (CD). We observed the CD spectra of purified CXCR4-QTY protein at various temperatures. We measured the thermo-stability of purified CXCR4-QTY protein. We observed that the purified CXCR4-QTY protein is relatively stable up to 55.degree. C., the protein was only partially and gradually denatured, the CD signal reduction was .about.15%. Between 55.degree. C. and 65.degree. C., the denaturation increased toward 65.degree. C., the denaturation transition took place between 65.degree. C. and 75.degree. C. and the protein was nearly fully denatured at 75.degree. C.
[0289] We plotted the temperature vs the ellipticity at 222 nm to obtain the melting temperature (Tm) of purified water-soluble CXCR4-QTY protein. From the plot, we estimated that the Tm for purified CXCR4-QTY protein is .about.67.degree. C. This Tm suggests the purified water-soluble CXCR4-QTY protein is quite stable compared to many other soluble proteins. This thermo-stability characteristics facilitates obtaining diffracting crystals, since it is known that the better the thermo-stability, the better the crystal lattice packing, and thus the better the chances to obtain structures.
5. Additional G Protein-Coupled Receptors
[0290] We selected 10 G protein-coupled receptors (GPCRs) to design the water-soluble form, using the QTY method that is described in Zhang et al., Water Soluble Membrane Proteins and Methods for the Preparation and Use Thereof, U.S. Patent Publication No. 2012/0252719 A ("Zhang"). Alternatively, the proteins described herein can be selected.
6. Molecular Cloning of the Genes.
[0291] We successfully verified the GPCR native and QTY genes in the cell-free protein expression plasmid vector pIVex2.3d and E. coli pET28a and pET-duet-1 plasmid vectors.
7. Water-Soluble GPCR Productions
[0292] We have produced several native and QTY proteins. When producing native GPCR in the cell-free system, a detergent Brij35 is required, without the detergent, the proteins precipitate upon production. On the other hand, we tested QTY variants in the present and absent of detergent. Without the detergent, the cell-free system produced soluble proteins.
[0293] We cloned the QTY variants into E. coli in vivo expression system, pET28a and pET-duet-1 plasmid vectors for E. coli cell protein production in E. coli BL21 (DE3) strain. We have purified several water-soluble GPCR proteins, including CXCR4 and CCR5, which we have used for secondary structural analysis. We have performed ligand-binding studies for CXCR4 with its natural ligand CCL12 (SDF1a). We carried out E. coli production and purification of water-soluble GPCR CCR5e variant. The CCR5e variant had 58 amino acid changes (.about.18% change). The water-soluble GPCR CCR5e variant was purified to homogeneity using the specific monoclonal antibody rhodopsin-tag. The blue stain showed a single band on the SDS gel indicating the purity. Estimated from the protein size marker, it appears to be a pure homo-dimer (the native membrane-bound CXCR4 crystal structure was a dimer. The Western-blot verified the monomer and homo-dimmer of CCR5e variant that is common in GPCRs.
8. QTY CCR5e Secondary Structural Studies.
[0294] We obtained water-soluble QTY variant of GPCR CCR5e. Then we carried out secondary structural analyses using an Aviv Model 410 circular dichroism instrument and confirmed that the GPCR QTY CCR5-e variant has a typical alpha-helical structure. We also carried out experiments in various temperatures to determine the CCR5e variant Tm, namely, the thermo-stability of the water-soluble CCR5e variant. From the experiments, we determined the Tm of CCR5e variant is about 46.degree. C. This Tm is good for crystal screen experiments.
9. Ligand-Binding Studies of CXCR4 with CCL12 (SDF1a).
[0295] In order to be certain the designed water-soluble QTY GPCRs still maintain their biological function, namely recognize and bind to their natural ligands, we first used an ELISA measurement to study water-soluble CXCR4 with its natural ligand CCL12 (also called SDF1a). The assay concentrations range from 50 nM to 10 .mu.M. The measured Kd is .about.80 nM. The Kd of native membrane-bound CXCR4 with SDF1a is about 100 nM. So the Kd of water-soluble CXCR4 is within acceptable range. Further experiments using more sensitive SPR or other measurement may produce more accurate Kd.
[0296] While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims.
Sequence CWU
1
1
3791356PRTUnknownDescription of Unknown Mammalian CXCR4 polypeptide
1Met Ser Ile Pro Leu Pro Leu Leu Gln Ile Tyr Thr Ser Asp Asn Tyr1
5 10 15Thr Glu Glu Met Gly Ser
Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys 20 25
30Phe Arg Glu Glu Asn Ala Asn Phe Asn Lys Ile Phe Leu
Pro Thr Ile 35 40 45Tyr Ser Ile
Ile Phe Leu Thr Gly Ile Val Gly Asn Gly Leu Val Ile 50
55 60Leu Val Met Gly Tyr Gln Lys Lys Leu Arg Ser Met
Thr Asp Lys Tyr65 70 75
80Arg Leu His Leu Ser Val Ala Asp Leu Leu Phe Val Ile Thr Leu Pro
85 90 95Phe Trp Ala Val Asp Ala
Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu 100
105 110Cys Lys Ala Val His Val Ile Tyr Thr Val Asn Leu
Tyr Ser Ser Val 115 120 125Leu Ile
Leu Ala Phe Ile Ser Leu Asp Arg Tyr Leu Ala Ile Val His 130
135 140Ala Thr Asn Ser Gln Arg Pro Arg Lys Leu Leu
Ala Glu Lys Val Val145 150 155
160Tyr Val Gly Val Trp Ile Pro Ala Leu Leu Leu Thr Ile Pro Asp Phe
165 170 175Ile Phe Ala Asn
Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg 180
185 190Phe Tyr Pro Asn Asp Leu Trp Val Val Val Phe
Gln Phe Gln His Ile 195 200 205Met
Val Gly Leu Ile Leu Pro Gly Ile Val Ile Leu Ser Cys Tyr Cys 210
215 220Ile Ile Ile Ser Lys Leu Ser His Ser Lys
Gly His Gln Lys Arg Lys225 230 235
240Ala Leu Lys Thr Thr Val Ile Leu Ile Leu Ala Phe Phe Ala Cys
Trp 245 250 255Leu Pro Tyr
Tyr Ile Gly Ile Ser Ile Asp Ser Phe Ile Leu Leu Glu 260
265 270Ile Ile Lys Gln Gly Cys Glu Phe Glu Asn
Thr Val His Lys Trp Ile 275 280
285Ser Ile Thr Glu Ala Leu Ala Phe Phe His Cys Cys Leu Asn Pro Ile 290
295 300Leu Tyr Ala Phe Leu Gly Ala Lys
Phe Lys Thr Ser Ala Gln His Ala305 310
315 320Leu Thr Ser Val Ser Arg Gly Ser Ser Leu Lys Ile
Leu Ser Lys Gly 325 330
335Lys Arg Gly Gly His Ser Ser Val Ser Thr Glu Ser Glu Ser Ser Ser
340 345 350Phe His Ser Ser
3552356PRTArtificial SequenceDescription of Artificial Sequence Synthetic
polypeptide 2Met Ser Ile Pro Leu Pro Leu Leu Gln Ile Tyr Thr Ser Asp
Asn Tyr1 5 10 15Thr Glu
Glu Met Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys 20
25 30Phe Arg Glu Glu Asn Ala Asn Phe Asn
Lys Ile Phe Leu Pro Thr Thr 35 40
45Tyr Ser Thr Thr Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Thr Thr 50
55 60Gln Thr Met Gly Tyr Gln Lys Lys Leu
Arg Ser Met Thr Asp Lys Tyr65 70 75
80Arg Gln His Gln Ser Thr Ala Asp Gln Gln Tyr Thr Thr Thr
Gln Pro 85 90 95Tyr Trp
Ala Thr Asp Ala Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu 100
105 110Cys Lys Ala Thr His Thr Thr Tyr Thr
Thr Asn Gln Tyr Ser Ser Thr 115 120
125Gln Thr Gln Ala Tyr Thr Ser Gln Asp Arg Tyr Leu Ala Ile Val His
130 135 140Ala Thr Asn Ser Gln Arg Pro
Arg Lys Leu Leu Ala Glu Lys Thr Thr145 150
155 160Tyr Thr Gly Thr Trp Thr Pro Ala Gln Gln Gln Thr
Thr Pro Asp Tyr 165 170
175Thr Tyr Ala Asn Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg
180 185 190Phe Tyr Pro Asn Asp Leu
Trp Val Val Val Tyr Gln Tyr Gln His Thr 195 200
205Met Thr Gly Gln Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys
Tyr Cys 210 215 220Thr Ile Ile Ser Lys
Leu Ser His Ser Lys Gly His Gln Lys Arg Lys225 230
235 240Ala Leu Lys Thr Thr Thr Thr Gln Thr Gln
Ala Tyr Tyr Ala Cys Trp 245 250
255Gln Pro Tyr Tyr Thr Gly Thr Ser Thr Asp Ser Tyr Ile Leu Leu Glu
260 265 270Ile Ile Lys Gln Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Thr 275
280 285Ser Thr Thr Glu Ala Gln Ala Tyr Tyr His Cys Cys
Gln Asn Pro Thr 290 295 300Gln Tyr Ala
Tyr Gln Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala305
310 315 320Leu Thr Ser Val Ser Arg Gly
Ser Ser Leu Lys Ile Leu Ser Lys Gly 325
330 335Lys Arg Gly Gly His Ser Ser Val Ser Thr Glu Ser
Glu Ser Ser Ser 340 345 350Phe
His Ser Ser 355338PRTUnknownDescription of Unknown Mammalian
CXCR4 polypeptide 3Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn Tyr Thr
Glu Glu Met1 5 10 15Gly
Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu 20
25 30Asn Ala Asn Phe Asn Lys
35425PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 4Ile Phe Leu Pro Thr Thr Tyr Ser Thr Thr Phe Gln Thr Gly Thr
Thr1 5 10 15Gly Asn Gly
Gln Val Thr Gln Val Met 20 25525PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 5Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr Phe Gln Thr Gly Thr Thr1
5 10 15Gly Asn Gly Gln Val Thr Gln
Val Met 20 25625PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 6Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr Phe Gln Thr Gly Thr Thr1
5 10 15Gly Asn Gly Gln Val Thr Gln
Thr Met 20 25725PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 7Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr Tyr Gln Thr Gly Thr Thr1
5 10 15Gly Asn Gly Gln Val Thr Gln
Thr Met 20 25825PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 8Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr Tyr Gln Thr Gly Thr Thr1
5 10 15Gly Asn Gly Gln Thr Thr Gln
Val Met 20 25925PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 9Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr Tyr Gln Thr Gly Thr Thr1
5 10 15Gly Asn Gly Gln Thr Ile Gln
Thr Met 20 251025PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 10Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr Tyr Gln Thr Gly Thr Thr1
5 10 15Gly Asn Gly Gln Thr Thr Gln
Thr Met 20 251125PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 11Thr
Tyr Gln Pro Thr Thr Tyr Ser Thr Thr Tyr Gln Thr Gly Thr Thr1
5 10 15Gly Asn Gly Gln Thr Thr Gln
Thr Met 20 251214PRTUnknownDescription of
Unknown Mammalian CXCR4 peptide 12Gly Tyr Gln Lys Lys Leu Arg Ser
Met Thr Asp Lys Tyr Arg1 5
101322PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 13Leu His Leu Ser Thr Ala Asp Gln Gln Phe Thr Thr Thr Gln Pro
Phe1 5 10 15Trp Ala Val
Asp Ala Val 201422PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 14Leu His Leu Ser Val Ala Asp
Gln Gln Tyr Thr Thr Thr Gln Pro Phe1 5 10
15Trp Ala Thr Asp Ala Val 201522PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 15Leu
His Gln Ser Val Ala Asp Gln Gln Tyr Val Thr Thr Gln Pro Phe1
5 10 15Trp Ala Thr Asp Ala Thr
201622PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 16Gln His Gln Ser Val Ala Asp Gln Gln Phe Thr Thr
Thr Gln Pro Phe1 5 10
15Trp Ala Thr Asp Ala Thr 201722PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 17Leu
His Gln Ser Val Ala Asp Gln Gln Tyr Thr Ile Thr Gln Pro Tyr1
5 10 15Trp Ala Thr Asp Ala Thr
201822PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 18Gln His Leu Ser Val Ala Asp Gln Gln Tyr Thr Ile
Thr Gln Pro Tyr1 5 10
15Trp Ala Thr Asp Ala Thr 201922PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 19Gln
His Leu Ser Thr Ala Asp Gln Gln Tyr Val Thr Thr Gln Pro Tyr1
5 10 15Trp Ala Thr Asp Ala Thr
202022PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 20Gln His Gln Ser Thr Ala Asp Gln Gln Tyr Thr Thr
Thr Gln Pro Tyr1 5 10
15Trp Ala Thr Asp Ala Thr 202111PRTUnknownDescription of
Unknown Mammalian CXCR4 peptide 21Ala Asn Trp Tyr Phe Gly Asn Phe
Leu Cys Lys1 5 102220PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 22Ala
Val His Val Thr Tyr Thr Val Asn Gln Tyr Ser Ser Val Gln Ile1
5 10 15Gln Ala Phe Thr
202320PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 23Ala Val His Thr Thr Tyr Thr Val Asn Gln Tyr Ser Ser Val Gln
Ile1 5 10 15Gln Ala Phe
Thr 202420PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 24Ala Val His Thr Thr Tyr Thr Val Asn Gln
Tyr Ser Ser Val Gln Thr1 5 10
15Gln Ala Phe Thr 202520PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 25Ala Thr His Thr Thr Tyr
Thr Val Asn Gln Tyr Ser Ser Val Gln Thr1 5
10 15Gln Ala Phe Thr 202620PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 26Ala
Thr His Thr Ile Tyr Thr Thr Asn Gln Tyr Ser Ser Val Gln Thr1
5 10 15Gln Ala Phe Thr
202720PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 27Ala Val His Thr Thr Tyr Thr Thr Asn Gln Tyr Ser Ser Val Gln
Thr1 5 10 15Gln Ala Phe
Thr 202820PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 28Ala Thr His Thr Thr Tyr Thr Thr Asn Gln
Tyr Ser Ser Val Gln Thr1 5 10
15Gln Ala Phe Thr 202920PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 29Ala Thr His Thr Thr Tyr
Thr Thr Asn Gln Tyr Ser Ser Thr Gln Thr1 5
10 15Gln Ala Tyr Thr
203024PRTUnknownDescription of Unknown Mammalian CXCR4 peptide 30Ser
Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr Asn Ser Gln Arg1
5 10 15Pro Arg Lys Leu Leu Ala Glu
Lys 203120PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 31Val Thr Tyr Thr Gly Val Trp Thr Pro Ala
Gln Gln Gln Thr Ile Pro1 5 10
15Asp Phe Ile Phe 203220PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 32Thr Thr Tyr Thr Gly Thr
Trp Ile Pro Ala Gln Gln Gln Thr Ile Pro1 5
10 15Asp Phe Ile Phe 203320PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 33Thr
Thr Tyr Thr Gly Thr Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro1
5 10 15Asp Phe Ile Phe
203420PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 34Thr Thr Tyr Thr Gly Thr Trp Thr Pro Ala Gln Gln Gln Thr Ile
Pro1 5 10 15Asp Phe Ile
Tyr 203520PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 35Thr Thr Tyr Val Gly Thr Trp Thr Pro Ala
Gln Gln Gln Thr Thr Pro1 5 10
15Asp Tyr Ile Phe 203620PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 36Thr Thr Tyr Val Gly Thr
Trp Thr Pro Ala Gln Gln Gln Thr Thr Pro1 5
10 15Asp Phe Ile Tyr 203720PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 37Thr
Thr Tyr Thr Gly Val Trp Thr Pro Ala Gln Gln Gln Thr Thr Pro1
5 10 15Asp Tyr Thr Phe
203820PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 38Thr Thr Tyr Thr Gly Thr Trp Thr Pro Ala Gln Gln Gln Thr Thr
Pro1 5 10 15Asp Tyr Thr
Tyr 203921PRTUnknownDescription of Unknown Mammalian
CXCR4 peptide 39Ala Asn Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg
Phe Tyr1 5 10 15Pro Asn
Asp Leu Trp 204021PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 40Val Val Val Phe Gln Phe Gln
His Thr Met Val Gly Gln Thr Gln Pro1 5 10
15Gly Thr Thr Thr Gln 204121PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 41Val
Val Val Phe Gln Phe Gln His Thr Met Thr Gly Gln Thr Gln Pro1
5 10 15Gly Thr Thr Thr Gln
204221PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 42Val Val Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln Thr Gln
Pro1 5 10 15Gly Thr Thr
Thr Gln 204321PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 43Val Val Val Tyr Gln Tyr Gln His Thr Met
Thr Gly Gln Thr Gln Pro1 5 10
15Gly Thr Thr Thr Gln 204421PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 44Thr
Val Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln Thr Gln Pro1
5 10 15Gly Thr Thr Thr Gln
204521PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 45Val Val Thr Phe Gln Tyr Gln His Thr Met Thr Gly Gln Thr Gln
Pro1 5 10 15Gly Thr Thr
Thr Gln 204621PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 46Thr Val Val Tyr Gln Tyr Gln His Thr Met
Thr Gly Gln Thr Gln Pro1 5 10
15Gly Thr Thr Thr Gln 204721PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 47Thr
Thr Thr Tyr Gln Tyr Gln His Thr Met Thr Gly Gln Thr Gln Pro1
5 10 15Gly Thr Thr Thr Gln
204825PRTUnknownDescription of Unknown Mammalian CXCR4 peptide 48Ser
Cys Tyr Cys Ile Ile Ile Ser Lys Leu Ser His Ser Lys Gly His1
5 10 15Gln Lys Arg Lys Ala Leu Lys
Thr Thr 20 254920PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 49Val
Thr Gln Ile Gln Ala Phe Phe Ala Cys Trp Gln Pro Tyr Tyr Thr1
5 10 15Gly Thr Ser Thr
205020PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 50Val Ile Gln Ile Gln Ala Tyr Phe Ala Cys Trp Gln Pro Tyr Tyr
Thr1 5 10 15Gly Thr Ser
Thr 205120PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 51Val Ile Gln Ile Gln Ala Tyr Tyr Ala Cys
Trp Gln Pro Tyr Tyr Thr1 5 10
15Gly Thr Ser Thr 205220PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 52Val Ile Gln Thr Gln Ala
Phe Tyr Ala Cys Trp Gln Pro Tyr Tyr Thr1 5
10 15Gly Thr Ser Thr 205320PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 53Val
Ile Gln Thr Gln Ala Tyr Phe Ala Cys Trp Gln Pro Tyr Tyr Thr1
5 10 15Gly Thr Ser Thr
205420PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 54Val Thr Gln Ile Gln Ala Phe Tyr Ala Cys Trp Gln Pro Tyr Tyr
Thr1 5 10 15Gly Thr Ser
Thr 205520PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 55Val Ile Gln Thr Gln Ala Tyr Tyr Ala Cys
Trp Gln Pro Tyr Tyr Thr1 5 10
15Gly Thr Ser Thr 205620PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 56Thr Thr Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr Thr1 5
10 15Gly Thr Ser Thr
205721PRTUnknownDescription of Unknown Mammalian CXCR4 peptide 57Asp
Ser Phe Ile Leu Leu Glu Ile Ile Lys Gln Gly Cys Glu Phe Glu1
5 10 15Asn Thr Val His Lys
205820PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 58Trp Ile Ser Ile Thr Glu Ala Gln Ala Phe Phe His Cys Cys Leu
Asn1 5 10 15Pro Ile Gln
Tyr 205920PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 59Trp Ile Ser Ile Thr Glu Ala Gln Ala Phe
Tyr His Cys Cys Leu Asn1 5 10
15Pro Ile Gln Tyr 206020PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 60Trp Ile Ser Ile Thr Glu
Ala Gln Ala Tyr Phe His Cys Cys Gln Asn1 5
10 15Pro Thr Leu Tyr 206120PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 61Trp
Ile Ser Thr Thr Glu Ala Leu Ala Phe Tyr His Cys Cys Gln Asn1
5 10 15Pro Thr Gln Tyr
206220PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 62Trp Ile Ser Thr Thr Glu Ala Leu Ala Tyr Phe His Cys Cys Gln
Asn1 5 10 15Pro Thr Gln
Tyr 206320PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 63Trp Ile Ser Ile Thr Glu Ala Leu Ala Tyr
Tyr His Cys Cys Gln Asn1 5 10
15Pro Thr Gln Tyr 206420PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 64Trp Ile Ser Thr Thr Glu
Ala Leu Ala Tyr Tyr His Cys Cys Gln Asn1 5
10 15Pro Thr Gln Tyr
206550PRTUnknownDescription of Unknown Mammalian CXCR4 polypeptide
65Ala Phe Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr1
5 10 15Ser Val Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg 20 25
30Gly Gly His Ser Ser Val Ser Thr Glu Ser Glu Ser Ser
Ser Phe His 35 40 45Ser Ser
5066355PRTUnknownDescription of Unknown Mammalian CX3CR1 polypeptide
66Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn Phe Glu Tyr Asp Asp1
5 10 15Leu Ala Glu Ala Cys Tyr
Ile Gly Asp Ile Val Val Phe Gly Thr Val 20 25
30Phe Leu Ser Ile Phe Tyr Ser Val Ile Phe Ala Ile Gly
Leu Val Gly 35 40 45Asn Leu Leu
Val Val Phe Ala Leu Thr Asn Ser Lys Lys Pro Lys Ser 50
55 60Val Thr Asp Ile Tyr Leu Leu Asn Leu Ala Leu Ser
Asp Leu Leu Phe65 70 75
80Val Ala Thr Leu Pro Phe Trp Thr His Tyr Leu Ile Asn Glu Lys Gly
85 90 95Leu His Asn Ala Met Cys
Lys Phe Thr Thr Ala Phe Phe Phe Ile Gly 100
105 110Phe Phe Gly Ser Ile Phe Phe Ile Thr Val Ile Ser
Ile Asp Arg Tyr 115 120 125Leu Ala
Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr Val Gln 130
135 140His Gly Val Thr Ile Ser Leu Gly Val Trp Ala
Ala Ala Ile Leu Val145 150 155
160Ala Ala Pro Gln Phe Met Phe Thr Lys Gln Lys Glu Asn Glu Cys Leu
165 170 175Gly Asp Tyr Pro
Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn 180
185 190Val Glu Thr Asn Phe Leu Gly Phe Leu Leu Pro
Leu Leu Ile Met Ser 195 200 205Tyr
Cys Tyr Phe Arg Ile Ile Gln Thr Leu Phe Ser Cys Lys Asn His 210
215 220Lys Lys Ala Lys Ala Ile Lys Leu Ile Leu
Leu Val Val Ile Val Phe225 230 235
240Phe Leu Phe Trp Thr Pro Tyr Asn Val Met Ile Phe Leu Glu Thr
Leu 245 250 255Lys Leu Tyr
Asp Phe Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg 260
265 270Leu Ala Leu Ser Val Thr Glu Thr Val Ala
Phe Ser His Cys Cys Leu 275 280
285Asn Pro Leu Ile Tyr Ala Phe Ala Gly Glu Lys Phe Arg Arg Tyr Leu 290
295 300Tyr His Leu Tyr Gly Lys Cys Leu
Ala Val Leu Cys Gly Arg Ser Val305 310
315 320His Val Asp Phe Ser Ser Ser Glu Ser Gln Arg Ser
Arg His Gly Ser 325 330
335Val Leu Ser Ser Asn Phe Thr Tyr His Thr Ser Asp Gly Asp Ala Leu
340 345 350Leu Leu Leu
35567355PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 67Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn
Phe Glu Tyr Asp Asp1 5 10
15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val Phe Gly Thr Val
20 25 30Phe Gln Ser Thr Tyr Tyr Ser
Thr Thr Tyr Ala Thr Gly Gln Thr Gly 35 40
45Asn Gln Gln Thr Thr Tyr Ala Gln Thr Asn Ser Lys Lys Pro Lys
Ser 50 55 60Val Thr Asp Thr Tyr Gln
Gln Asn Gln Ala Gln Ser Asp Gln Gln Tyr65 70
75 80Thr Ala Thr Gln Pro Tyr Trp Thr His Tyr Gln
Ile Asn Glu Lys Gly 85 90
95Leu His Asn Ala Met Cys Lys Phe Thr Thr Ala Tyr Tyr Tyr Thr Gly
100 105 110Tyr Tyr Gly Ser Thr Tyr
Tyr Thr Thr Thr Thr Ser Thr Asp Arg Tyr 115 120
125Leu Ala Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr
Val Gln 130 135 140His Gly Thr Thr Thr
Ser Gln Gly Thr Trp Ala Ala Ala Thr Gln Thr145 150
155 160Ala Ala Pro Gln Tyr Met Tyr Thr Lys Gln
Lys Glu Asn Glu Cys Leu 165 170
175Gly Asp Tyr Pro Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn
180 185 190Val Glu Thr Asn Tyr
Gln Gly Tyr Gln Gln Pro Gln Gln Thr Met Ser 195
200 205Tyr Cys Tyr Tyr Arg Thr Thr Gln Thr Gln Tyr Ser
Cys Lys Asn His 210 215 220Lys Lys Ala
Lys Ala Ile Lys Gln Thr Gln Gln Thr Thr Thr Thr Tyr225
230 235 240Tyr Gln Tyr Trp Thr Pro Tyr
Asn Thr Met Thr Tyr Gln Glu Thr Leu 245
250 255Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp Met Arg
Lys Asp Leu Arg 260 265 270Leu
Ala Gln Ser Thr Thr Glu Thr Thr Ala Tyr Ser His Cys Cys Gln 275
280 285Asn Pro Gln Thr Tyr Ala Tyr Ala Gly
Glu Lys Phe Arg Arg Tyr Leu 290 295
300Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu Cys Gly Arg Ser Val305
310 315 320His Val Asp Phe
Ser Ser Ser Glu Ser Gln Arg Ser Arg His Gly Ser 325
330 335Val Leu Ser Ser Asn Phe Thr Tyr His Thr
Ser Asp Gly Asp Ala Leu 340 345
350Leu Leu Leu 3556831PRTUnknownDescription of Unknown Mammalian
CX3CR1 polypeptide 68Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn Phe
Glu Tyr Asp Asp1 5 10
15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val Phe Gly Thr
20 25 306928PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 69Thr
Tyr Gln Ser Thr Tyr Tyr Ser Thr Thr Phe Ala Thr Gly Gln Val1
5 10 15Gly Asn Gln Gln Val Val Phe
Ala Leu Thr Asn Ser 20 257028PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 70Thr
Tyr Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly Gln Val1
5 10 15Gly Asn Gln Gln Val Val Phe
Ala Leu Thr Asn Ser 20 257128PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 71Thr
Tyr Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly Gln Val1
5 10 15Gly Asn Gln Gln Val Val Phe
Ala Gln Thr Asn Ser 20 257228PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 72Thr
Tyr Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly Gln Thr1
5 10 15Gly Asn Leu Gln Val Thr Phe
Ala Gln Thr Asn Ser 20 257328PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 73Thr
Tyr Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly Gln Thr1
5 10 15Gly Asn Gln Leu Val Thr Phe
Ala Gln Thr Asn Ser 20 257428PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 74Thr
Tyr Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly Gln Thr1
5 10 15Gly Asn Gln Gln Val Val Phe
Ala Gln Thr Asn Ser 20 257528PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 75Thr
Tyr Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly Gln Thr1
5 10 15Gly Asn Leu Gln Val Thr Tyr
Ala Gln Thr Asn Ser 20 257628PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 76Thr
Tyr Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly Gln Thr1
5 10 15Gly Asn Gln Gln Thr Thr Tyr
Ala Gln Thr Asn Ser 20
257710PRTUnknownDescription of Unknown Mammalian CX3CR1 peptide
77Lys Lys Pro Lys Ser Val Thr Asp Ile Tyr1 5
107821PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 78Leu Leu Asn Gln Ala Gln Ser Asp Gln Leu Phe Val
Ala Thr Gln Pro1 5 10
15Phe Trp Thr His Tyr 207921PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 79Leu Leu Asn Gln Ala Gln
Ser Asp Gln Gln Phe Val Ala Thr Gln Pro1 5
10 15Phe Trp Thr His Tyr 208021PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 80Gln
Gln Asn Leu Ala Gln Ser Asp Gln Gln Phe Val Ala Thr Gln Pro1
5 10 15Phe Trp Thr His Tyr
208121PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 81Leu Gln Asn Leu Ala Gln Ser Asp Gln Gln Tyr Thr Ala Thr Gln
Pro1 5 10 15Phe Trp Thr
His Tyr 208221PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 82Gln Leu Asn Leu Ala Gln Ser Asp Gln Gln
Tyr Thr Ala Thr Gln Pro1 5 10
15Phe Trp Thr His Tyr 208321PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 83Leu
Leu Asn Gln Ala Gln Ser Asp Gln Gln Phe Thr Ala Thr Gln Pro1
5 10 15Tyr Trp Thr His Tyr
208421PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 84Gln Gln Asn Leu Ala Gln Ser Asp Gln Gln Phe Thr Ala Thr Gln
Pro1 5 10 15Tyr Trp Thr
His Tyr 208521PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 85Gln Gln Asn Gln Ala Gln Ser Asp Gln Gln
Tyr Thr Ala Thr Gln Pro1 5 10
15Tyr Trp Thr His Tyr 208613PRTUnknownDescription of
Unknown Mammalian CX3CR1 peptide 86Leu Ile Asn Glu Lys Gly Leu His
Asn Ala Met Cys Lys1 5
108722PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 87Tyr Thr Thr Ala Tyr Tyr Tyr Thr Gly Tyr Tyr Gly Ser Thr Tyr
Tyr1 5 10 15Thr Thr Thr
Thr Ser Thr 208817PRTUnknownDescription of Unknown Mammalian
CX3CR1 peptide 88Asp Arg Tyr Leu Ala Ile Val Leu Ala Ala Asn Ser Met
Asn Asn Arg1 5 10
15Thr8925PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 89Val Gln His Gly Thr Thr Thr Ser Gln Gly Thr Trp
Ala Ala Ala Thr1 5 10
15Gln Val Ala Ala Pro Gln Phe Met Phe 20
259025PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 90Val Gln His Gly Val Thr Thr Ser Gln Gly Thr Trp Ala Ala Ala
Thr1 5 10 15Gln Thr Ala
Ala Pro Gln Phe Met Phe 20
259125PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 91Val Gln His Gly Thr Thr Thr Ser Gln Gly Val Trp Ala Ala Ala
Thr1 5 10 15Gln Thr Ala
Ala Pro Gln Phe Met Tyr 20
259225PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 92Val Gln His Gly Thr Thr Thr Ser Gln Gly Thr Trp Ala Ala Ala
Ile1 5 10 15Gln Thr Ala
Ala Pro Gln Phe Met Tyr 20
259325PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 93Val Gln His Gly Thr Thr Thr Ser Gln Gly Thr Trp Ala Ala Ala
Thr1 5 10 15Gln Thr Ala
Ala Pro Gln Phe Met Phe 20
259425PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 94Val Gln His Gly Thr Thr Ile Ser Gln Gly Thr Trp Ala Ala Ala
Thr1 5 10 15Gln Thr Ala
Ala Pro Gln Tyr Met Phe 20
259525PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 95Val Gln His Gly Thr Thr Thr Ser Gln Gly Thr Trp Ala Ala Ala
Thr1 5 10 15Gln Thr Ala
Ala Pro Gln Phe Met Tyr 20
259625PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 96Thr Gln His Gly Thr Thr Thr Ser Gln Gly Thr Trp Ala Ala Ala
Thr1 5 10 15Gln Thr Ala
Ala Pro Gln Tyr Met Tyr 20
259728PRTUnknownDescription of Unknown Mammalian CX3CR1 peptide
97Thr Lys Gln Lys Glu Asn Glu Cys Leu Gly Asp Tyr Pro Glu Val Leu1
5 10 15Gln Glu Ile Trp Pro Val
Leu Arg Asn Val Glu Thr 20
259820PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 98Asn Phe Leu Gly Phe Gln Gln Pro Gln Gln Ile Met Ser Tyr Cys
Tyr1 5 10 15Phe Arg Ile
Thr 209920PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 99Asn Phe Gln Gly Phe Leu Gln Pro Gln Gln
Thr Met Ser Tyr Cys Tyr1 5 10
15Phe Arg Ile Thr 2010020PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 100Asn
Phe Gln Gly Phe Leu Gln Pro Gln Gln Thr Met Ser Tyr Cys Tyr1
5 10 15Phe Arg Thr Thr
2010120PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 101Asn Phe Gln Gly Phe Gln Gln Pro Gln Gln Thr Met Ser Tyr
Cys Tyr1 5 10 15Tyr Arg
Ile Thr 2010220PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 102Asn Phe Gln Gly Phe Leu Gln Pro Gln
Gln Thr Met Ser Tyr Cys Tyr1 5 10
15Tyr Arg Thr Thr 2010320PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 103Asn
Phe Gln Gly Tyr Leu Gln Pro Gln Gln Thr Met Ser Tyr Cys Tyr1
5 10 15Phe Arg Thr Thr
2010420PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 104Asn Tyr Gln Gly Phe Gln Gln Pro Gln Gln Thr Met Ser Tyr
Cys Tyr1 5 10 15Phe Arg
Thr Thr 2010520PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 105Asn Tyr Gln Gly Tyr Gln Gln Pro Gln
Gln Thr Met Ser Tyr Cys Tyr1 5 10
15Tyr Arg Thr Thr 2010616PRTUnknownDescription of
Unknown Mammalian CX3CR1 peptide 106Gln Thr Leu Phe Ser Cys Lys Asn
His Lys Lys Ala Lys Ala Ile Lys1 5 10
1510725PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 107Leu Ile Gln Gln Thr Thr Thr Thr Phe
Tyr Gln Phe Trp Thr Pro Tyr1 5 10
15Asn Thr Met Thr Phe Gln Glu Thr Leu 20
2510825PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 108Leu Ile Gln Gln Thr Thr Thr Thr Phe Tyr Gln Tyr
Trp Thr Pro Tyr1 5 10
15Asn Val Met Thr Phe Gln Glu Thr Gln 20
2510925PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 109Leu Ile Gln Gln Thr Thr Thr Thr Tyr Tyr Gln Phe Trp Thr
Pro Tyr1 5 10 15Asn Thr
Met Thr Phe Gln Glu Thr Gln 20
2511025PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 110Gln Ile Gln Gln Thr Thr Thr Thr Phe Tyr Gln Tyr Trp Thr
Pro Tyr1 5 10 15Asn Thr
Met Thr Phe Gln Glu Thr Gln 20
2511125PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 111Leu Thr Gln Gln Thr Thr Thr Thr Tyr Tyr Gln Phe Trp Thr
Pro Tyr1 5 10 15Asn Thr
Met Thr Phe Gln Glu Thr Gln 20
2511225PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 112Gln Ile Gln Gln Thr Thr Thr Thr Phe Phe Gln Tyr Trp Thr
Pro Tyr1 5 10 15Asn Thr
Met Thr Tyr Gln Glu Thr Gln 20
2511325PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 113Gln Ile Gln Gln Thr Thr Thr Thr Phe Tyr Gln Tyr Trp Thr
Pro Tyr1 5 10 15Asn Thr
Met Thr Tyr Gln Glu Thr Gln 20
2511425PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 114Gln Thr Gln Gln Thr Thr Thr Thr Tyr Tyr Gln Tyr Trp Thr
Pro Tyr1 5 10 15Asn Thr
Met Thr Tyr Gln Glu Thr Gln 20
2511517PRTUnknownDescription of Unknown Mammalian CX3CR1 peptide
115Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg1
5 10 15Leu11624PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 116Ala
Leu Ser Val Thr Glu Thr Val Ala Phe Ser His Cys Cys Gln Asn1
5 10 15Pro Gln Ile Tyr Ala Phe Ala
Gly 2011724PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 117Ala Gln Ser Val Thr Glu Thr Thr Ala
Phe Ser His Cys Cys Gln Asn1 5 10
15Pro Leu Ile Tyr Ala Phe Ala Gly
2011824PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 118Ala Leu Ser Val Thr Glu Thr Val Ala Phe Ser His Cys Cys
Gln Asn1 5 10 15Pro Gln
Thr Tyr Ala Tyr Ala Gly 2011924PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 119Ala
Gln Ser Val Thr Glu Thr Thr Ala Phe Ser His Cys Cys Gln Asn1
5 10 15Pro Gln Ile Tyr Ala Tyr Ala
Gly 2012024PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 120Ala Leu Ser Val Thr Glu Thr Thr Ala
Phe Ser His Cys Cys Gln Asn1 5 10
15Pro Gln Thr Tyr Ala Tyr Ala Gly
2012124PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 121Ala Leu Ser Thr Thr Glu Thr Thr Ala Tyr Ser His Cys Cys
Gln Asn1 5 10 15Pro Gln
Ile Tyr Ala Phe Ala Gly 2012224PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 122Ala
Leu Ser Val Thr Glu Thr Thr Ala Tyr Ser His Cys Cys Gln Asn1
5 10 15Pro Gln Thr Tyr Ala Tyr Ala
Gly 2012324PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 123Ala Gln Ser Thr Thr Glu Thr Thr Ala
Tyr Ser His Cys Cys Gln Asn1 5 10
15Pro Gln Thr Tyr Ala Tyr Ala Gly
2012458PRTUnknownDescription of Unknown Mammalian CX3CR1 polypeptide
124Glu Lys Phe Arg Arg Tyr Leu Tyr His Leu Tyr Gly Lys Cys Leu Ala1
5 10 15Val Leu Cys Gly Arg Ser
Val His Val Asp Phe Ser Ser Ser Glu Ser 20 25
30Gln Arg Ser Arg His Gly Ser Val Leu Ser Ser Asn Phe
Thr Tyr His 35 40 45Thr Ser Asp
Gly Asp Ala Leu Leu Leu Leu 50
55125373PRTUnknownDescription of Unknown Mammalian CCR3 polypeptide
125Met Pro Phe Gly Ile Arg Met Leu Leu Arg Ala His Lys Pro Gly Arg1
5 10 15Ser Glu Met Thr Thr Ser
Leu Asp Thr Val Glu Thr Phe Gly Thr Thr 20 25
30Ser Tyr Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala
Asp Thr Arg 35 40 45Ala Leu Met
Ala Gln Phe Val Pro Pro Leu Tyr Ser Leu Val Phe Thr 50
55 60Val Gly Leu Leu Gly Asn Val Val Val Val Met Ile
Leu Ile Lys Tyr65 70 75
80Arg Arg Leu Arg Ile Met Thr Asn Ile Tyr Leu Leu Asn Leu Ala Ile
85 90 95Ser Asp Leu Leu Phe Leu
Val Thr Leu Pro Phe Trp Ile His Tyr Val 100
105 110Arg Gly His Asn Trp Val Phe Gly His Gly Met Cys
Lys Leu Leu Ser 115 120 125Gly Phe
Tyr His Thr Gly Leu Tyr Ser Glu Ile Phe Phe Ile Ile Leu 130
135 140Leu Thr Ile Asp Arg Tyr Leu Ala Ile Val His
Ala Val Phe Ala Leu145 150 155
160Arg Ala Arg Thr Val Thr Phe Gly Val Ile Thr Ser Ile Val Thr Trp
165 170 175Gly Leu Ala Val
Leu Ala Ala Leu Pro Glu Phe Ile Phe Tyr Glu Thr 180
185 190Glu Glu Leu Phe Glu Glu Thr Leu Cys Ser Ala
Leu Tyr Pro Glu Asp 195 200 205Thr
Val Tyr Ser Trp Arg His Phe His Thr Leu Arg Met Thr Ile Phe 210
215 220Cys Leu Val Leu Pro Leu Leu Val Met Ala
Ile Cys Tyr Thr Gly Ile225 230 235
240Ile Lys Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys Tyr Lys Ala
Ile 245 250 255Arg Leu Ile
Phe Val Ile Met Ala Val Phe Phe Ile Phe Trp Thr Pro 260
265 270Tyr Asn Val Ala Ile Leu Leu Ser Ser Tyr
Gln Ser Ile Leu Phe Gly 275 280
285Asn Asp Cys Glu Arg Ser Lys His Leu Asp Leu Val Met Leu Val Thr 290
295 300Glu Val Ile Ala Tyr Ser His Cys
Cys Met Asn Pro Val Ile Tyr Ala305 310
315 320Phe Val Gly Glu Arg Phe Arg Lys Tyr Leu Arg His
Phe Phe His Arg 325 330
335His Leu Leu Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu
340 345 350Lys Leu Glu Arg Thr Ser
Ser Val Ser Pro Ser Thr Ala Glu Pro Glu 355 360
365Leu Ser Ile Val Phe 370126373PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
126Met Pro Phe Gly Ile Arg Met Leu Leu Arg Ala His Lys Pro Gly Arg1
5 10 15Ser Glu Met Thr Thr Ser
Leu Asp Thr Val Glu Thr Phe Gly Thr Thr 20 25
30Ser Tyr Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala
Asp Thr Arg 35 40 45Ala Leu Met
Ala Gln Phe Val Pro Pro Gln Tyr Ser Gln Thr Tyr Thr 50
55 60Thr Gly Gln Gln Gly Asn Thr Thr Thr Thr Met Thr
Gln Thr Lys Tyr65 70 75
80Arg Arg Leu Arg Ile Met Thr Asn Thr Tyr Gln Gln Asn Gln Ala Thr
85 90 95Ser Asp Gln Gln Tyr Gln
Thr Thr Gln Pro Tyr Trp Thr His Tyr Val 100
105 110Arg Gly His Asn Trp Val Phe Gly His Gly Met Cys
Lys Leu Leu Ser 115 120 125Gly Phe
Tyr His Thr Gly Leu Tyr Ser Glu Thr Tyr Tyr Thr Thr Gln 130
135 140Gln Thr Thr Asp Arg Tyr Gln Ala Thr Thr His
Ala Thr Tyr Ala Gln145 150 155
160Arg Ala Arg Thr Val Thr Phe Gly Thr Thr Thr Ser Thr Thr Thr Trp
165 170 175Gly Gln Ala Thr
Gln Ala Ala Gln Pro Glu Tyr Thr Tyr Tyr Glu Thr 180
185 190Glu Glu Leu Phe Glu Glu Thr Leu Cys Ser Ala
Leu Tyr Pro Glu Asp 195 200 205Thr
Val Tyr Ser Trp Arg His Phe His Thr Leu Arg Met Thr Thr Tyr 210
215 220Cys Gln Thr Gln Pro Gln Gln Thr Met Ala
Thr Cys Tyr Thr Gly Thr225 230 235
240Thr Lys Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys Tyr Lys Ala
Ile 245 250 255Arg Gln Thr
Tyr Thr Thr Met Ala Thr Tyr Tyr Thr Tyr Trp Thr Pro 260
265 270Tyr Asn Thr Ala Thr Gln Gln Ser Ser Tyr
Gln Ser Ile Leu Phe Gly 275 280
285Asn Asp Cys Glu Arg Ser Lys His Leu Asp Gln Thr Met Gln Thr Thr 290
295 300Glu Thr Thr Ala Tyr Ser His Cys
Cys Met Asn Pro Thr Thr Tyr Ala305 310
315 320Tyr Thr Gly Glu Arg Phe Arg Lys Tyr Leu Arg His
Phe Phe His Arg 325 330
335His Leu Leu Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu
340 345 350Lys Leu Glu Arg Thr Ser
Ser Val Ser Pro Ser Thr Ala Glu Pro Glu 355 360
365Leu Ser Ile Val Phe 37012734PRTUnknownDescription of
Unknown Mammalian CCR3 polypeptide 127Met Thr Thr Ser Leu Asp Thr
Val Glu Thr Phe Gly Thr Thr Ser Tyr1 5 10
15Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala Asp Thr
Arg Ala Leu 20 25 30Met
Ala12828PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 128Gln Phe Val Pro Pro Gln Tyr Ser Gln Thr Phe Thr
Thr Gly Gln Gln1 5 10
15Gly Asn Val Thr Val Thr Met Thr Gln Ile Lys Tyr 20
2512928PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 129Gln Phe Val Pro Pro Gln Tyr Ser Gln Thr Phe Thr
Thr Gly Gln Gln1 5 10
15Gly Asn Thr Thr Val Thr Met Thr Gln Ile Lys Tyr 20
2513028PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 130Gln Phe Val Pro Pro Gln Tyr Ser Gln Thr Tyr Thr
Thr Gly Gln Gln1 5 10
15Gly Asn Thr Thr Val Thr Met Thr Gln Ile Lys Tyr 20
2513128PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 131Gln Phe Thr Pro Pro Gln Tyr Ser Gln Thr Tyr Thr
Thr Gly Gln Gln1 5 10
15Gly Asn Val Thr Thr Thr Met Thr Gln Ile Lys Tyr 20
2513228PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 132Gln Phe Thr Pro Pro Gln Tyr Ser Gln Thr Tyr Thr
Thr Gly Gln Gln1 5 10
15Gly Asn Thr Val Thr Thr Met Thr Gln Ile Lys Tyr 20
2513328PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 133Gln Phe Thr Pro Pro Gln Tyr Ser Gln Thr Tyr Thr
Thr Gly Gln Gln1 5 10
15Gly Asn Thr Thr Val Thr Met Thr Gln Ile Lys Tyr 20
2513428PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 134Gln Phe Thr Pro Pro Gln Tyr Ser Gln Thr Tyr Thr
Thr Gly Gln Gln1 5 10
15Gly Asn Thr Thr Thr Thr Met Thr Gln Ile Lys Tyr 20
2513528PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 135Gln Tyr Thr Pro Pro Gln Tyr Ser Gln Thr Tyr Thr
Thr Gly Gln Gln1 5 10
15Gly Asn Thr Thr Thr Thr Met Thr Gln Thr Lys Tyr 20
2513610PRTUnknownDescription of Unknown Mammalian CCR3
peptide 136Arg Arg Leu Arg Ile Met Thr Asn Ile Tyr1 5
1013721PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 137Leu Leu Asn Gln Ala Thr Ser Asp Gln
Gln Phe Gln Val Thr Gln Pro1 5 10
15Phe Trp Ile His Tyr 2013821PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 138Leu
Gln Asn Gln Ala Ile Ser Asp Gln Leu Phe Gln Thr Thr Gln Pro1
5 10 15Phe Trp Thr His Tyr
2013921PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 139Gln Gln Asn Leu Ala Ile Ser Asp Gln Gln Phe Gln Thr Thr
Gln Pro1 5 10 15Phe Trp
Thr His Tyr 2014021PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 140Gln Leu Asn Gln Ala Ile Ser
Asp Gln Gln Phe Gln Thr Thr Gln Pro1 5 10
15Tyr Trp Thr His Tyr 2014121PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 141Gln
Gln Asn Leu Ala Ile Ser Asp Gln Gln Tyr Gln Val Thr Gln Pro1
5 10 15Tyr Trp Thr His Tyr
2014221PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 142Leu Gln Asn Gln Ala Thr Ser Asp Gln Leu Phe Gln Thr Thr
Gln Pro1 5 10 15Tyr Trp
Thr His Tyr 2014321PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 143Gln Gln Asn Gln Ala Ile Ser
Asp Gln Gln Tyr Gln Val Thr Gln Pro1 5 10
15Tyr Trp Thr His Tyr 2014421PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 144Gln
Gln Asn Gln Ala Thr Ser Asp Gln Gln Tyr Gln Thr Thr Gln Pro1
5 10 15Tyr Trp Thr His Tyr
2014514PRTUnknownDescription of Unknown Mammalian CCR3 peptide
145Val Arg Gly His Asn Trp Val Phe Gly His Gly Met Cys Lys1
5 1014622PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 146Leu Gln Ser Gly Phe Tyr His
Thr Gly Gln Tyr Ser Glu Thr Phe Phe1 5 10
15Thr Thr Gln Gln Thr Thr
2014722PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 147Gln Leu Ser Gly Phe Tyr His Thr Gly Gln Tyr Ser Glu Thr
Phe Phe1 5 10 15Thr Thr
Gln Gln Thr Thr 2014822PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 148Gln Leu Ser Gly Phe Tyr His
Thr Gly Gln Tyr Ser Glu Thr Phe Tyr1 5 10
15Thr Thr Gln Gln Thr Thr
2014922PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 149Gln Leu Ser Gly Phe Tyr His Thr Gly Gln Tyr Ser Glu Thr
Tyr Phe1 5 10 15Thr Thr
Gln Gln Thr Thr 2015022PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 150Gln Leu Ser Gly Tyr Tyr His
Thr Gly Gln Tyr Ser Glu Thr Phe Phe1 5 10
15Thr Thr Gln Gln Thr Thr
2015122PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 151Gln Gln Ser Gly Phe Tyr His Thr Gly Gln Tyr Ser Glu Thr
Phe Phe1 5 10 15Thr Thr
Gln Gln Thr Thr 2015222PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 152Gln Gln Ser Gly Phe Tyr His
Thr Gly Gln Tyr Ser Glu Thr Phe Tyr1 5 10
15Thr Thr Gln Gln Thr Thr
2015322PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 153Gln Gln Ser Gly Tyr Tyr His Thr Gly Gln Tyr Ser Glu Thr
Tyr Tyr1 5 10 15Thr Thr
Gln Gln Thr Thr 2015417PRTUnknownDescription of Unknown
Mammalian CCR3 peptide 154Asp Arg Tyr Leu Ala Ile Val His Ala Val
Phe Ala Leu Arg Ala Arg1 5 10
15Thr15525PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 155Thr Thr Phe Gly Thr Thr Thr Ser Thr Val Thr Trp
Gly Gln Ala Val1 5 10
15Gln Ala Ala Gln Pro Glu Phe Ile Phe 20
2515625PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 156Thr Thr Phe Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly Gln
Ala Val1 5 10 15Gln Ala
Ala Gln Pro Glu Phe Ile Phe 20
2515725PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 157Thr Thr Tyr Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly Gln
Ala Val1 5 10 15Gln Ala
Ala Gln Pro Glu Phe Ile Phe 20
2515825PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 158Thr Thr Tyr Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly Gln
Ala Val1 5 10 15Gln Ala
Ala Gln Pro Glu Phe Thr Phe 20
2515925PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 159Thr Thr Tyr Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly Gln
Ala Thr1 5 10 15Gln Ala
Ala Gln Pro Glu Phe Ile Phe 20
2516025PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 160Thr Thr Phe Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly Gln
Ala Thr1 5 10 15Gln Ala
Ala Gln Pro Glu Phe Ile Tyr 20
2516125PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 161Thr Thr Tyr Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly Gln
Ala Thr1 5 10 15Gln Ala
Ala Gln Pro Glu Phe Ile Tyr 20
2516225PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 162Thr Thr Tyr Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly Gln
Ala Thr1 5 10 15Gln Ala
Ala Gln Pro Glu Tyr Thr Tyr 20
2516332PRTUnknownDescription of Unknown Mammalian CCR3 polypeptide
163Tyr Glu Thr Glu Glu Leu Phe Glu Glu Thr Leu Cys Ser Ala Leu Tyr1
5 10 15Pro Glu Asp Thr Val Tyr
Ser Trp Arg His Phe His Thr Leu Arg Met 20 25
3016420PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 164Thr Ile Phe Cys Gln Val Gln Pro Gln
Gln Thr Met Ala Thr Cys Tyr1 5 10
15Thr Gly Thr Thr 2016520PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 165Thr
Ile Phe Cys Gln Thr Gln Pro Gln Gln Val Met Ala Thr Cys Tyr1
5 10 15Thr Gly Thr Thr
2016620PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 166Thr Ile Phe Cys Gln Thr Gln Pro Gln Gln Thr Met Ala Thr
Cys Tyr1 5 10 15Thr Gly
Ile Thr 2016720PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 167Thr Ile Phe Cys Gln Thr Gln Pro Gln
Gln Thr Met Ala Thr Cys Tyr1 5 10
15Thr Gly Thr Ile 2016820PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 168Thr
Thr Phe Cys Gln Val Gln Pro Gln Gln Val Met Ala Thr Cys Tyr1
5 10 15Thr Gly Thr Thr
2016920PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 169Thr Ile Tyr Cys Gln Val Gln Pro Gln Gln Val Met Ala Thr
Cys Tyr1 5 10 15Thr Gly
Thr Thr 2017020PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 170Thr Ile Phe Cys Gln Thr Gln Pro Gln
Gln Thr Met Ala Thr Cys Tyr1 5 10
15Thr Gly Thr Thr 2017120PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 171Thr
Thr Tyr Cys Gln Thr Gln Pro Gln Gln Thr Met Ala Thr Cys Tyr1
5 10 15Thr Gly Thr Thr
2017216PRTUnknownDescription of Unknown Mammalian CCR3 peptide
172Lys Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys Tyr Lys Ala Ile Arg1
5 10 1517325PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 173Gln
Thr Tyr Thr Thr Met Ala Thr Tyr Tyr Thr Tyr Trp Thr Pro Tyr1
5 10 15Asn Thr Ala Thr Gln Gln Ser
Ser Tyr 20 2517417PRTUnknownDescription of
Unknown Mammalian CCR3 peptide 174Gln Ser Ile Leu Phe Gly Asn Asp
Cys Glu Arg Ser Lys His Leu Asp1 5 10
15Leu17524PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 175Val Met Gln Val Thr Glu Val Thr Ala
Tyr Ser His Cys Cys Met Asn1 5 10
15Pro Val Thr Tyr Ala Phe Thr Gly
2017624PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 176Val Met Gln Val Thr Glu Val Thr Ala Tyr Ser His Cys Cys
Met Asn1 5 10 15Pro Thr
Thr Tyr Ala Tyr Val Gly 2017724PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 177Val
Met Leu Thr Thr Glu Val Thr Ala Tyr Ser His Cys Cys Met Asn1
5 10 15Pro Thr Thr Tyr Ala Phe Thr
Gly 2017824PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 178Val Met Gln Val Thr Glu Thr Thr Ala
Tyr Ser His Cys Cys Met Asn1 5 10
15Pro Val Thr Tyr Ala Tyr Thr Gly
2017924PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 179Thr Met Gln Val Thr Glu Thr Ile Ala Tyr Ser His Cys Cys
Met Asn1 5 10 15Pro Thr
Thr Tyr Ala Phe Thr Gly 2018024PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 180Thr
Met Gln Val Thr Glu Thr Thr Ala Tyr Ser His Cys Cys Met Asn1
5 10 15Pro Thr Thr Tyr Ala Phe Val
Gly 2018124PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 181Val Met Gln Thr Thr Glu Thr Ile Ala
Tyr Ser His Cys Cys Met Asn1 5 10
15Pro Thr Thr Tyr Ala Tyr Thr Gly
2018224PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 182Thr Met Gln Thr Thr Glu Thr Thr Ala Tyr Ser His Cys Cys
Met Asn1 5 10 15Pro Thr
Thr Tyr Ala Tyr Thr Gly 2018350PRTUnknownDescription of
Unknown Mammalian CCR3 polypeptide 183Glu Arg Phe Arg Lys Tyr Leu
Arg His Phe Phe His Arg His Leu Leu1 5 10
15Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu
Lys Leu Glu 20 25 30Arg Thr
Ser Ser Val Ser Pro Ser Thr Ala Glu Pro Glu Leu Ser Ile 35
40 45Val Phe 50184352PRTUnknownDescription
of Unknown Mammalian CCR5 polypeptide 184Met Asp Tyr Gln Val Ser Ser
Pro Ile Tyr Asp Ile Asn Tyr Tyr Thr1 5 10
15Ser Glu Pro Cys Gln Lys Ile Asn Val Lys Gln Ile Ala
Ala Arg Leu 20 25 30Leu Pro
Pro Leu Tyr Ser Leu Val Phe Ile Phe Gly Phe Val Gly Asn 35
40 45Met Leu Val Ile Leu Ile Leu Ile Asn Cys
Lys Arg Leu Lys Ser Met 50 55 60Thr
Asp Ile Tyr Leu Leu Asn Leu Ala Ile Ser Asp Leu Phe Phe Leu65
70 75 80Leu Thr Val Pro Phe Trp
Ala His Tyr Ala Ala Ala Gln Trp Asp Phe 85
90 95Gly Asn Thr Met Cys Gln Leu Leu Thr Gly Leu Tyr
Phe Ile Gly Phe 100 105 110Phe
Ser Gly Ile Phe Phe Ile Ile Leu Leu Thr Ile Asp Arg Tyr Leu 115
120 125Ala Val Val His Ala Val Phe Ala Leu
Lys Ala Arg Thr Val Thr Phe 130 135
140Gly Val Val Thr Ser Val Ile Thr Trp Val Val Ala Val Phe Ala Ser145
150 155 160Leu Pro Gly Ile
Ile Phe Thr Arg Ser Gln Lys Glu Gly Leu His Tyr 165
170 175Thr Cys Ser Ser His Phe Pro Tyr Ser Gln
Tyr Gln Phe Trp Lys Asn 180 185
190Phe Gln Thr Leu Lys Ile Val Ile Leu Gly Leu Val Leu Pro Leu Leu
195 200 205Val Met Val Ile Cys Tyr Ser
Gly Ile Leu Lys Thr Leu Leu Arg Cys 210 215
220Arg Asn Glu Lys Lys Arg His Arg Ala Val Arg Leu Ile Phe Thr
Ile225 230 235 240Met Ile
Val Tyr Phe Leu Phe Trp Ala Pro Tyr Asn Ile Val Leu Leu
245 250 255Leu Asn Thr Phe Gln Glu Phe
Phe Gly Leu Asn Asn Cys Ser Ser Ser 260 265
270Asn Arg Leu Asp Gln Ala Met Gln Val Thr Glu Thr Leu Gly
Met Thr 275 280 285His Cys Cys Ile
Asn Pro Ile Ile Tyr Ala Phe Val Gly Glu Lys Phe 290
295 300Arg Asn Tyr Leu Leu Val Phe Phe Gln Lys His Ile
Ala Lys Arg Phe305 310 315
320Cys Lys Cys Cys Ser Ile Phe Gln Gln Glu Ala Pro Glu Arg Ala Ser
325 330 335Ser Val Tyr Thr Arg
Ser Thr Gly Glu Gln Glu Ile Ser Val Gly Leu 340
345 350185352PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 185Met Asp Tyr Gln Val Ser
Ser Pro Ile Tyr Asp Ile Asn Tyr Tyr Thr1 5
10 15Ser Glu Pro Cys Gln Lys Ile Asn Val Lys Gln Ile
Ala Ala Arg Leu 20 25 30Leu
Pro Pro Gln Tyr Ser Gln Thr Tyr Thr Tyr Gly Tyr Thr Gly Asn 35
40 45Met Gln Thr Thr Gln Thr Gln Thr Asn
Cys Lys Arg Leu Lys Ser Met 50 55
60Thr Asp Thr Tyr Gln Gln Asn Gln Ala Thr Ser Asp Gln Tyr Tyr Gln65
70 75 80Gln Thr Thr Pro Tyr
Trp Ala His Tyr Ala Ala Ala Gln Trp Asp Phe 85
90 95Gly Asn Thr Met Cys Gln Gln Gln Thr Gly Gln
Tyr Tyr Thr Gly Tyr 100 105
110Tyr Ser Gly Thr Tyr Tyr Thr Thr Gln Gln Thr Thr Asp Arg Tyr Leu
115 120 125Ala Val Val His Ala Val Phe
Ala Leu Lys Ala Arg Thr Val Thr Tyr 130 135
140Gly Thr Thr Thr Ser Thr Thr Thr Trp Thr Thr Ala Thr Tyr Ala
Ser145 150 155 160Gln Pro
Gly Thr Thr Tyr Thr Arg Ser Gln Lys Glu Gly Leu His Tyr
165 170 175Thr Cys Ser Ser His Phe Pro
Tyr Ser Gln Tyr Gln Phe Trp Lys Asn 180 185
190Phe Gln Thr Leu Lys Thr Thr Thr Gln Gly Gln Thr Gln Pro
Gln Gln 195 200 205Thr Met Thr Thr
Cys Tyr Ser Gly Thr Gln Lys Thr Gln Leu Arg Cys 210
215 220Arg Asn Glu Lys Lys Arg His Arg Ala Val Arg Gln
Thr Tyr Thr Thr225 230 235
240Met Thr Thr Tyr Tyr Gln Tyr Trp Ala Pro Tyr Asn Thr Thr Gln Gln
245 250 255Gln Asn Thr Phe Gln
Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser 260
265 270Asn Arg Leu Asp Gln Ala Met Gln Val Thr Glu Thr
Leu Gly Met Thr 275 280 285His Cys
Cys Thr Asn Pro Thr Thr Tyr Ala Tyr Thr Gly Glu Lys Tyr 290
295 300Arg Asn Tyr Gln Gln Thr Tyr Tyr Gln Lys His
Ile Ala Lys Arg Phe305 310 315
320Cys Lys Cys Cys Ser Ile Phe Gln Gln Glu Ala Pro Glu Arg Ala Ser
325 330 335Ser Val Tyr Thr
Arg Ser Thr Gly Glu Gln Glu Ile Ser Val Gly Leu 340
345 35018630PRTUnknownDescription of Unknown
Mammalian CCR5 polypeptide 186Met Asp Tyr Gln Val Ser Ser Pro Ile
Tyr Asp Ile Asn Tyr Tyr Thr1 5 10
15Ser Glu Pro Cys Gln Lys Ile Asn Val Lys Gln Ile Ala Ala
20 25 3018728PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 187Arg
Leu Gln Pro Pro Gln Tyr Ser Gln Thr Phe Thr Phe Gly Phe Thr1
5 10 15Gly Asn Met Gln Val Thr Gln
Thr Gln Ile Asn Cys 20 2518828PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 188Arg
Leu Gln Pro Pro Gln Tyr Ser Gln Thr Phe Thr Phe Gly Tyr Thr1
5 10 15Gly Asn Met Gln Val Thr Gln
Thr Gln Ile Asn Cys 20 2518928PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 189Arg
Gln Gln Pro Pro Gln Tyr Ser Gln Thr Phe Thr Phe Gly Phe Thr1
5 10 15Gly Asn Met Gln Thr Thr Gln
Thr Gln Ile Asn Cys 20 2519028PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 190Arg
Gln Gln Pro Pro Gln Tyr Ser Gln Thr Phe Thr Tyr Gly Phe Thr1
5 10 15Gly Asn Met Gln Thr Thr Gln
Thr Gln Ile Asn Cys 20 2519128PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 191Arg
Gln Gln Pro Pro Gln Tyr Ser Gln Thr Tyr Thr Phe Gly Phe Thr1
5 10 15Gly Asn Met Gln Thr Thr Gln
Thr Gln Ile Asn Cys 20 2519228PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 192Arg
Gln Gln Pro Pro Gln Tyr Ser Gln Thr Phe Thr Phe Gly Tyr Thr1
5 10 15Gly Asn Met Gln Thr Thr Gln
Thr Gln Ile Asn Cys 20 2519328PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 193Arg
Gln Gln Pro Pro Gln Tyr Ser Gln Thr Tyr Thr Phe Gly Tyr Thr1
5 10 15Gly Asn Met Gln Thr Thr Gln
Thr Gln Ile Asn Cys 20 2519428PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 194Arg
Gln Gln Pro Pro Gln Tyr Ser Gln Thr Tyr Thr Tyr Gly Tyr Thr1
5 10 15Gly Asn Met Gln Thr Thr Gln
Thr Gln Thr Asn Cys 20
2519510PRTUnknownDescription of Unknown Mammalian CCR5 peptide
195Lys Arg Leu Lys Ser Met Thr Asp Ile Tyr1 5
1019621PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 196Leu Gln Asn Gln Ala Ile Ser Asp Gln Phe Phe Gln
Gln Thr Val Pro1 5 10
15Phe Trp Ala His Tyr 2019721PRTArtificial SequenceDescription
of Artificial Sequence Synthetic peptide 197Leu Gln Asn Gln Ala Ile
Ser Asp Gln Phe Phe Gln Gln Thr Thr Pro1 5
10 15Phe Trp Ala His Tyr
2019821PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 198Leu Gln Asn Gln Ala Ile Ser Asp Gln Phe Phe Gln Gln Thr
Thr Pro1 5 10 15Tyr Trp
Ala His Tyr 2019921PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 199Leu Gln Asn Gln Ala Ile Ser
Asp Gln Phe Tyr Gln Gln Thr Thr Pro1 5 10
15Tyr Trp Ala His Tyr 2020021PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 200Leu
Gln Asn Gln Ala Ile Ser Asp Gln Tyr Phe Gln Gln Thr Thr Pro1
5 10 15Tyr Trp Ala His Tyr
2020121PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 201Leu Gln Asn Gln Ala Thr Ser Asp Gln Phe Phe Gln Gln Thr
Thr Pro1 5 10 15Tyr Trp
Ala His Tyr 2020221PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 202Leu Gln Asn Gln Ala Ile Ser
Asp Gln Tyr Tyr Gln Gln Thr Thr Pro1 5 10
15Tyr Trp Ala His Tyr 2020321PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 203Gln
Gln Asn Gln Ala Thr Ser Asp Gln Tyr Tyr Gln Gln Thr Thr Pro1
5 10 15Tyr Trp Ala His Tyr
2020413PRTUnknownDescription of Unknown Mammalian CCR5 peptide
204Ala Ala Ala Gln Trp Asp Phe Gly Asn Thr Met Cys Gln1 5
1020522PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 205Gln Gln Thr Gly Gln Tyr Phe Thr Gly
Tyr Tyr Ser Gly Thr Tyr Tyr1 5 10
15Thr Thr Gln Gln Thr Thr 2020622PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 206Gln
Gln Thr Gly Gln Tyr Tyr Thr Gly Tyr Tyr Ser Gly Thr Tyr Tyr1
5 10 15Thr Thr Gln Gln Thr Thr
2020717PRTUnknownDescription of Unknown Mammalian CCR5 peptide
207Asp Arg Tyr Leu Ala Val Val His Ala Val Phe Ala Leu Lys Ala Arg1
5 10 15Thr20825PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 208Thr
Thr Tyr Gly Thr Thr Thr Ser Thr Thr Thr Trp Thr Thr Ala Thr1
5 10 15Tyr Ala Ser Gln Pro Gly Thr
Thr Tyr 20 2520932PRTUnknownDescription of
Unknown Mammalian CCR5 polypeptide 209Thr Arg Ser Gln Lys Glu Gly
Leu His Tyr Thr Cys Ser Ser His Phe1 5 10
15Pro Tyr Ser Gln Tyr Gln Phe Trp Lys Asn Phe Gln Thr
Leu Lys Ile 20 25
3021020PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 210Val Ile Gln Gly Gln Val Gln Pro Gln Gln Val Met Val Thr
Cys Tyr1 5 10 15Ser Gly
Ile Gln 2021120PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 211Val Ile Gln Gly Gln Val Gln Pro Gln
Gln Val Met Thr Thr Cys Tyr1 5 10
15Ser Gly Ile Gln 2021220PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 212Val
Ile Gln Gly Gln Val Gln Pro Gln Gln Thr Met Thr Thr Cys Tyr1
5 10 15Ser Gly Ile Gln
2021320PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 213Val Thr Gln Gly Gln Val Gln Pro Gln Gln Thr Met Val Thr
Cys Tyr1 5 10 15Ser Gly
Thr Gln 2021420PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 214Thr Ile Gln Gly Gln Val Gln Pro Gln
Gln Val Met Thr Thr Cys Tyr1 5 10
15Ser Gly Thr Gln 2021520PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 215Thr
Ile Gln Gly Gln Val Gln Pro Gln Gln Thr Met Val Thr Cys Tyr1
5 10 15Ser Gly Thr Gln
2021620PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 216Thr Thr Gln Gly Gln Val Gln Pro Gln Gln Val Met Thr Thr
Cys Tyr1 5 10 15Ser Gly
Thr Gln 2021720PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 217Thr Thr Gln Gly Gln Thr Gln Pro Gln
Gln Thr Met Thr Thr Cys Tyr1 5 10
15Ser Gly Thr Gln 2021817PRTUnknownDescription of
Unknown Mammalian CCR5 peptide 218Lys Thr Leu Leu Arg Cys Arg Asn
Glu Lys Lys Arg His Arg Ala Val1 5 10
15Arg21925PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 219Gln Thr Phe Thr Thr Met Thr Thr Tyr
Tyr Gln Phe Trp Ala Pro Tyr1 5 10
15Asn Ile Val Gln Gln Leu Asn Thr Phe 20
2522025PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 220Gln Thr Phe Thr Thr Met Thr Thr Tyr Tyr Gln Phe
Trp Ala Pro Tyr1 5 10
15Asn Thr Val Gln Gln Leu Asn Thr Phe 20
2522125PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 221Gln Thr Phe Thr Thr Met Thr Thr Tyr Tyr Gln Tyr Trp Ala
Pro Tyr1 5 10 15Asn Thr
Val Gln Gln Leu Asn Thr Phe 20
2522225PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 222Gln Thr Phe Thr Thr Met Thr Thr Tyr Tyr Gln Tyr Trp Ala
Pro Tyr1 5 10 15Asn Thr
Val Gln Gln Gln Asn Thr Phe 20
2522325PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 223Gln Thr Tyr Thr Thr Met Thr Thr Tyr Tyr Gln Tyr Trp Ala
Pro Tyr1 5 10 15Asn Thr
Val Gln Gln Leu Asn Thr Phe 20
2522425PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 224Gln Thr Phe Thr Thr Met Thr Thr Tyr Tyr Gln Tyr Trp Ala
Pro Tyr1 5 10 15Asn Thr
Thr Gln Gln Leu Asn Thr Phe 20
2522525PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 225Gln Thr Tyr Thr Thr Met Thr Thr Tyr Tyr Gln Tyr Trp Ala
Pro Tyr1 5 10 15Asn Thr
Val Gln Gln Gln Asn Thr Phe 20
2522617PRTUnknownDescription of Unknown Mammalian CCR5 peptide
226Gln Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser Asn Arg Leu Asp1
5 10 15Gln22724PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 227Ala
Met Gln Val Thr Glu Thr Gln Gly Met Thr His Cys Cys Ile Asn1
5 10 15Pro Ile Ile Tyr Ala Phe Val
Gly 2022824PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 228Ala Met Gln Val Thr Glu Thr Leu Gly
Met Thr His Cys Cys Thr Asn1 5 10
15Pro Ile Ile Tyr Ala Phe Thr Gly
2022924PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 229Ala Met Gln Val Thr Glu Thr Gln Gly Met Thr His Cys Cys
Ile Asn1 5 10 15Pro Thr
Ile Tyr Ala Tyr Val Gly 2023024PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 230Ala
Met Gln Thr Thr Glu Thr Gln Gly Met Thr His Cys Cys Ile Asn1
5 10 15Pro Ile Thr Tyr Ala Phe Thr
Gly 2023124PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 231Ala Met Gln Thr Thr Glu Thr Gln Gly
Met Thr His Cys Cys Ile Asn1 5 10
15Pro Thr Ile Tyr Ala Phe Thr Gly
2023224PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 232Ala Met Gln Val Thr Glu Thr Gln Gly Met Thr His Cys Cys
Thr Asn1 5 10 15Pro Thr
Ile Tyr Ala Tyr Val Gly 2023324PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 233Ala
Met Gln Thr Thr Glu Thr Gln Gly Met Thr His Cys Cys Ile Asn1
5 10 15Pro Thr Thr Tyr Ala Tyr Val
Gly 2023424PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 234Ala Met Gln Thr Thr Glu Thr Gln Gly
Met Thr His Cys Cys Thr Asn1 5 10
15Pro Thr Thr Tyr Ala Tyr Thr Gly
2023551PRTUnknownDescription of Unknown Mammalian CCR5 polypeptide
235Glu Lys Phe Arg Asn Tyr Leu Leu Val Phe Phe Gln Lys His Ile Ala1
5 10 15Lys Arg Phe Cys Lys Cys
Cys Ser Ile Phe Gln Gln Glu Ala Pro Glu 20 25
30Arg Ala Ser Ser Val Tyr Thr Arg Ser Thr Gly Glu Gln
Glu Ile Ser 35 40 45Val Gly Leu
5023627PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 236Ala Phe Leu Pro Ala Leu Tyr Ser Gln Gln Phe Gln
Gln Gly Gln Gln1 5 10
15Gly Asn Gly Ala Val Ala Ala Thr Gln Leu Ser 20
2523727PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 237Ala Phe Gln Pro Ala Leu Tyr Ser Gln Gln Phe Gln
Gln Gly Gln Gln1 5 10
15Gly Asn Gly Ala Val Ala Ala Val Gln Gln Ser 20
2523827PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 238Ala Phe Gln Pro Ala Gln Tyr Ser Gln Gln Phe Leu
Gln Gly Gln Gln1 5 10
15Gly Asn Gly Ala Val Ala Ala Thr Gln Gln Ser 20
2523927PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 239Ala Tyr Gln Pro Ala Leu Tyr Ser Leu Gln Tyr Gln
Gln Gly Gln Gln1 5 10
15Gly Asn Gly Ala Thr Ala Ala Val Gln Gln Ser 20
2524027PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 240Ala Tyr Gln Pro Ala Leu Tyr Ser Gln Leu Phe Gln
Gln Gly Gln Gln1 5 10
15Gly Asn Gly Ala Thr Ala Ala Thr Gln Gln Ser 20
2524127PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 241Ala Phe Gln Pro Ala Leu Tyr Ser Leu Gln Tyr Gln
Gln Gly Gln Gln1 5 10
15Gly Asn Gly Ala Thr Ala Ala Thr Gln Gln Ser 20
2524227PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 242Ala Tyr Gln Pro Ala Gln Tyr Ser Leu Gln Tyr Gln
Gln Gly Gln Gln1 5 10
15Gly Asn Gly Ala Thr Ala Ala Val Gln Gln Ser 20
2524327PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 243Ala Tyr Gln Pro Ala Gln Tyr Ser Gln Gln Tyr Gln
Gln Gly Gln Gln1 5 10
15Gly Asn Gly Ala Thr Ala Ala Thr Gln Gln Ser 20
252449PRTUnknownDescription of Unknown Mammalian CXCR3 peptide
244Arg Arg Thr Ala Leu Ser Ser Thr Asp1 524521PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 245Thr
Phe Leu Gln His Leu Ala Val Ala Asp Thr Gln Gln Val Gln Thr1
5 10 15Leu Pro Gln Trp Ala
2024621PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 246Thr Phe Leu Gln His Gln Ala Val Ala Asp Thr Gln Leu Val
Gln Thr1 5 10 15Gln Pro
Gln Trp Ala 2024721PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 247Thr Phe Gln Gln His Leu Ala
Val Ala Asp Thr Gln Gln Val Gln Thr1 5 10
15Gln Pro Gln Trp Ala 2024821PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 248Thr
Tyr Leu Gln His Gln Ala Val Ala Asp Thr Gln Gln Val Gln Thr1
5 10 15Gln Pro Gln Trp Ala
2024921PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 249Thr Tyr Gln Leu His Gln Ala Val Ala Asp Thr Gln Gln Val
Gln Thr1 5 10 15Gln Pro
Gln Trp Ala 2025021PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 250Thr Tyr Gln Gln His Leu Ala
Val Ala Asp Thr Gln Gln Val Gln Thr1 5 10
15Gln Pro Gln Trp Ala 2025121PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 251Thr
Tyr Gln Gln His Gln Ala Val Ala Asp Thr Gln Gln Val Gln Thr1
5 10 15Gln Pro Gln Trp Ala
2025221PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 252Thr Tyr Gln Gln His Gln Ala Thr Ala Asp Thr Gln Gln Thr
Gln Thr1 5 10 15Gln Pro
Gln Trp Ala 2025315PRTUnknownDescription of Unknown Mammalian
CXCR3 peptide 253Val Asp Ala Ala Val Gln Trp Val Phe Gly Ser Gly Leu
Cys Lys1 5 10
1525422PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 254Thr Ala Gly Ala Gln Tyr Asn Thr Asn Phe Tyr Ala Gly Ala
Gln Gln1 5 10 15Gln Ala
Cys Ile Ser Phe 2025522PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 255Thr Ala Gly Ala Gln Tyr Asn
Thr Asn Phe Tyr Ala Gly Ala Gln Leu1 5 10
15Gln Ala Cys Thr Ser Phe
2025622PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 256Thr Ala Gly Ala Gln Tyr Asn Thr Asn Phe Tyr Ala Gly Ala
Gln Gln1 5 10 15Leu Ala
Cys Thr Ser Phe 2025722PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 257Thr Ala Gly Ala Gln Phe Asn
Thr Asn Tyr Tyr Ala Gly Ala Gln Gln1 5 10
15Gln Ala Cys Ile Ser Phe
2025822PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 258Thr Ala Gly Ala Gln Tyr Asn Thr Asn Tyr Tyr Ala Gly Ala
Gln Gln1 5 10 15Gln Ala
Cys Ile Ser Phe 2025922PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 259Thr Ala Gly Ala Gln Tyr Asn
Thr Asn Tyr Tyr Ala Gly Ala Gln Leu1 5 10
15Gln Ala Cys Thr Ser Phe
2026022PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 260Thr Ala Gly Ala Gln Tyr Asn Thr Asn Tyr Tyr Ala Gly Ala
Gln Gln1 5 10 15Leu Ala
Cys Thr Ser Phe 2026122PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 261Thr Ala Gly Ala Gln Tyr Asn
Thr Asn Tyr Tyr Ala Gly Ala Gln Gln1 5 10
15Gln Ala Cys Thr Ser Tyr
2026222PRTUnknownDescription of Unknown Mammalian CXCR3 peptide
262Asp Arg Tyr Leu Asn Ile Val His Ala Thr Gln Leu Tyr Arg Arg Gly1
5 10 15Pro Pro Ala Arg Val Thr
2026320PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 263Leu Thr Cys Gln Ala Val Trp Gly Gln
Cys Gln Gln Phe Ala Gln Pro1 5 10
15Asp Phe Ile Phe 2026420PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 264Gln
Thr Cys Gln Ala Val Trp Gly Gln Cys Gln Gln Phe Ala Gln Pro1
5 10 15Asp Phe Ile Phe
2026520PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 265Gln Thr Cys Gln Ala Thr Trp Gly Gln Cys Gln Gln Phe Ala
Gln Pro1 5 10 15Asp Phe
Ile Phe 2026620PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 266Gln Thr Cys Gln Ala Thr Trp Gly Gln
Cys Gln Gln Tyr Ala Gln Pro1 5 10
15Asp Phe Ile Phe 2026720PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 267Gln
Thr Cys Gln Ala Thr Trp Gly Gln Cys Gln Gln Phe Ala Gln Pro1
5 10 15Asp Phe Thr Phe
2026820PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 268Gln Thr Cys Gln Ala Thr Trp Gly Gln Cys Gln Gln Phe Ala
Gln Pro1 5 10 15Asp Tyr
Ile Phe 2026920PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 269Gln Thr Cys Gln Ala Thr Trp Gly Gln
Cys Gln Gln Tyr Ala Gln Pro1 5 10
15Asp Tyr Ile Phe 2027020PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 270Gln
Thr Cys Gln Ala Thr Trp Gly Gln Cys Gln Gln Tyr Ala Gln Pro1
5 10 15Asp Tyr Thr Tyr
2027123PRTUnknownDescription of Unknown Mammalian CXCR3 peptide
271Leu Ser Ala His His Asp Glu Arg Leu Asn Ala Thr His Cys Gln Tyr1
5 10 15Asn Phe Pro Gln Val Gly
Arg 2027221PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 272Thr Ala Gln Arg Thr Gln Gln Gln Thr
Ala Gly Tyr Gln Gln Pro Gln1 5 10
15Gln Thr Met Ala Tyr 2027322PRTUnknownDescription of
Unknown Mammalian CXCR3 peptide 273Cys Tyr Ala His Ile Leu Ala Val
Leu Leu Val Ser Arg Gly Gln Arg1 5 10
15Arg Leu Arg Ala Met Arg 2027422PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 274Gln
Val Thr Thr Thr Thr Val Ala Phe Ala Gln Cys Trp Thr Pro Tyr1
5 10 15His Gln Val Val Gln Val
2027522PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 275Gln Val Thr Thr Thr Thr Val Ala Phe Ala Gln Cys
Trp Thr Pro Tyr1 5 10
15His Gln Thr Val Gln Val 2027622PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 276Gln
Val Thr Thr Thr Thr Thr Ala Phe Ala Gln Cys Trp Thr Pro Tyr1
5 10 15His Gln Thr Val Gln Val
2027722PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 277Gln Val Thr Thr Thr Thr Thr Ala Tyr Ala Gln Cys
Trp Thr Pro Tyr1 5 10
15His Gln Thr Val Gln Val 2027822PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 278Gln
Val Thr Thr Thr Thr Thr Ala Phe Ala Gln Cys Trp Thr Pro Tyr1
5 10 15His Gln Thr Thr Gln Val
2027922PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 279Gln Thr Thr Thr Thr Thr Val Ala Phe Ala Gln Cys
Trp Thr Pro Tyr1 5 10
15His Gln Thr Thr Gln Val 2028022PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 280Gln
Val Thr Thr Thr Thr Thr Ala Tyr Ala Gln Cys Trp Thr Pro Tyr1
5 10 15His Gln Thr Thr Gln Val
2028122PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 281Gln Thr Thr Thr Thr Thr Thr Ala Tyr Ala Gln Cys
Trp Thr Pro Tyr1 5 10
15His Gln Thr Thr Gln Thr 2028221PRTUnknownDescription of
Unknown Mammalian CXCR3 peptide 282Asp Ile Leu Met Asp Leu Gly Ala
Leu Ala Arg Asn Cys Gly Arg Glu1 5 10
15Ser Arg Val Asp Val 2028323PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 283Ala
Lys Ser Val Thr Ser Gly Gln Gly Tyr Met His Cys Cys Leu Asn1
5 10 15Pro Leu Gln Tyr Ala Phe Val
2028423PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 284Ala Lys Ser Val Thr Ser Gly Gln Gly Tyr Met His
Cys Cys Leu Asn1 5 10
15Pro Gln Leu Tyr Ala Phe Thr 2028523PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 285Ala
Lys Ser Val Thr Ser Gly Gln Gly Tyr Met His Cys Cys Leu Asn1
5 10 15Pro Leu Gln Tyr Ala Phe Thr
2028623PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 286Ala Lys Ser Thr Thr Ser Gly Gln Gly Tyr Met His
Cys Cys Leu Asn1 5 10
15Pro Gln Gln Tyr Ala Phe Val 2028723PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 287Ala
Lys Ser Thr Thr Ser Gly Gln Gly Tyr Met His Cys Cys Gln Asn1
5 10 15Pro Leu Gln Tyr Ala Phe Val
2028823PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 288Ala Lys Ser Thr Thr Ser Gly Gln Gly Tyr Met His
Cys Cys Gln Asn1 5 10
15Pro Gln Leu Tyr Ala Phe Val 2028923PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 289Ala
Lys Ser Thr Thr Ser Gly Gln Gly Tyr Met His Cys Cys Gln Asn1
5 10 15Pro Leu Gln Tyr Ala Phe Thr
2029023PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 290Ala Lys Ser Thr Thr Ser Gly Gln Gly Tyr Met His
Cys Cys Gln Asn1 5 10
15Pro Gln Gln Tyr Ala Tyr Thr 2029147PRTUnknownDescription of
Unknown Mammalian CXCR3 polypeptide 291Gly Val Lys Phe Arg Glu Arg
Met Trp Met Leu Leu Leu Arg Leu Gly1 5 10
15Cys Pro Asn Gln Arg Gly Leu Gln Arg Gln Pro Ser Ser
Ser Arg Arg 20 25 30Asp Ser
Ser Trp Ser Glu Thr Ser Glu Ala Ser Tyr Ser Gly Leu 35
40 45292355PRTUnknownDescription of Unknown
Mammalian CCR-1 polypeptide 292Met Glu Thr Pro Asn Thr Thr Glu Asp
Tyr Asp Thr Thr Thr Glu Phe1 5 10
15Asp Tyr Gly Asp Ala Thr Pro Cys Gln Lys Val Asn Glu Arg Ala
Phe 20 25 30Gly Ala Gln Leu
Leu Pro Pro Leu Tyr Ser Leu Val Phe Val Ile Gly 35
40 45Leu Val Gly Asn Ile Leu Val Val Leu Val Leu Val
Gln Tyr Lys Arg 50 55 60Leu Lys Asn
Met Thr Ser Ile Tyr Leu Leu Asn Leu Ala Ile Ser Asp65 70
75 80Leu Leu Phe Leu Phe Thr Leu Pro
Phe Trp Ile Asp Tyr Lys Leu Lys 85 90
95Asp Asp Trp Val Phe Gly Asp Ala Met Cys Lys Ile Leu Ser
Gly Phe 100 105 110Tyr Tyr Thr
Gly Leu Tyr Ser Glu Ile Phe Phe Ile Ile Leu Leu Thr 115
120 125Ile Asp Arg Tyr Leu Ala Ile Val His Ala Val
Phe Ala Leu Arg Ala 130 135 140Arg Thr
Val Thr Phe Gly Val Ile Thr Ser Ile Ile Ile Trp Ala Leu145
150 155 160Ala Ile Leu Ala Ser Met Pro
Gly Leu Tyr Phe Ser Lys Thr Gln Trp 165
170 175Glu Phe Thr His His Thr Cys Ser Leu His Phe Pro
His Glu Ser Leu 180 185 190Arg
Glu Trp Lys Leu Phe Gln Ala Leu Lys Leu Asn Leu Phe Gly Leu 195
200 205Val Leu Pro Leu Leu Val Met Ile Ile
Cys Tyr Thr Gly Ile Ile Lys 210 215
220Ile Leu Leu Arg Arg Pro Asn Glu Lys Lys Ser Lys Ala Val Arg Leu225
230 235 240Ile Phe Val Ile
Met Ile Ile Phe Phe Leu Phe Trp Thr Pro Tyr Asn 245
250 255Leu Thr Ile Leu Ile Ser Val Phe Gln Asp
Phe Leu Phe Thr His Glu 260 265
270Cys Glu Gln Ser Arg His Leu Asp Leu Ala Val Gln Val Thr Glu Val
275 280 285Ile Ala Tyr Thr His Cys Cys
Val Asn Pro Val Ile Tyr Ala Phe Val 290 295
300Gly Glu Arg Phe Arg Lys Tyr Leu Arg Gln Leu Phe His Arg Arg
Val305 310 315 320Ala Val
His Leu Val Lys Trp Leu Pro Phe Leu Ser Val Asp Arg Leu
325 330 335Glu Arg Val Ser Ser Thr Ser
Pro Ser Thr Gly Glu His Glu Leu Ser 340 345
350Ala Gly Phe 355293355PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
293Met Glu Thr Pro Asn Thr Thr Glu Asp Tyr Asp Thr Thr Thr Glu Phe1
5 10 15Asp Tyr Gly Asp Ala Thr
Pro Cys Gln Lys Val Asn Glu Arg Ala Phe 20 25
30Gly Ala Gln Leu Gln Pro Pro Gln Tyr Ser Gln Thr Tyr
Thr Thr Gly 35 40 45Gln Thr Gly
Asn Thr Gln Thr Thr Gln Thr Gln Val Gln Tyr Lys Arg 50
55 60Leu Lys Asn Met Thr Ser Thr Tyr Gln Gln Asn Gln
Ala Thr Ser Asp65 70 75
80Gln Gln Tyr Gln Tyr Thr Gln Pro Tyr Trp Thr Asp Tyr Lys Leu Lys
85 90 95Asp Asp Trp Val Phe Gly
Asp Ala Met Cys Lys Thr Gln Ser Gly Tyr 100
105 110Tyr Tyr Thr Gly Gln Tyr Ser Glu Thr Tyr Tyr Thr
Thr Gln Gln Thr 115 120 125Ile Asp
Arg Tyr Leu Ala Ile Val His Ala Val Phe Ala Leu Arg Ala 130
135 140Arg Thr Thr Thr Tyr Gly Thr Thr Thr Ser Thr
Thr Thr Trp Ala Gln145 150 155
160Ala Thr Gln Ala Ser Met Pro Gly Gln Tyr Phe Ser Lys Thr Gln Trp
165 170 175Glu Phe Thr His
His Thr Cys Ser Leu His Phe Pro His Glu Ser Leu 180
185 190Arg Glu Trp Lys Leu Phe Gln Ala Leu Lys Leu
Asn Gln Tyr Gly Gln 195 200 205Thr
Gln Pro Gln Gln Thr Met Thr Thr Cys Tyr Thr Gly Thr Thr Lys 210
215 220Thr Gln Gln Arg Arg Pro Asn Glu Lys Lys
Ser Lys Ala Val Arg Gln225 230 235
240Thr Tyr Thr Thr Met Thr Thr Tyr Tyr Gln Tyr Trp Thr Pro Tyr
Asn 245 250 255Gln Thr Thr
Gln Thr Ser Val Phe Gln Asp Phe Leu Phe Thr His Glu 260
265 270Cys Glu Gln Ser Arg His Leu Asp Leu Ala
Thr Gln Thr Thr Glu Thr 275 280
285Thr Ala Tyr Thr His Cys Cys Thr Asn Pro Thr Thr Tyr Ala Tyr Thr 290
295 300Gly Glu Arg Phe Arg Lys Tyr Leu
Arg Gln Leu Phe His Arg Arg Val305 310
315 320Ala Val His Leu Val Lys Trp Leu Pro Phe Leu Ser
Val Asp Arg Leu 325 330
335Glu Arg Val Ser Ser Thr Ser Pro Ser Thr Gly Glu His Glu Leu Ser
340 345 350Ala Gly Phe
355294374PRTUnknownDescription of Unknown Mammalian CCR-2
polypeptide 294Met Leu Ser Thr Ser Arg Ser Arg Phe Ile Arg Asn Thr Asn
Glu Ser1 5 10 15Gly Glu
Glu Val Thr Thr Phe Phe Asp Tyr Asp Tyr Gly Ala Pro Cys 20
25 30His Lys Phe Asp Val Lys Gln Ile Gly
Ala Gln Leu Leu Pro Pro Leu 35 40
45Tyr Ser Leu Val Phe Ile Phe Gly Phe Val Gly Asn Met Leu Val Val 50
55 60Leu Ile Leu Ile Asn Cys Lys Lys Leu
Lys Cys Leu Thr Asp Ile Tyr65 70 75
80Leu Leu Asn Leu Ala Ile Ser Asp Leu Leu Phe Leu Ile Thr
Leu Pro 85 90 95Leu Trp
Ala His Ser Ala Ala Asn Glu Trp Val Phe Gly Asn Ala Met 100
105 110Cys Lys Leu Phe Thr Gly Leu Tyr His
Ile Gly Tyr Phe Gly Gly Ile 115 120
125Phe Phe Ile Ile Leu Leu Thr Ile Asp Arg Tyr Leu Ala Ile Val His
130 135 140Ala Val Phe Ala Leu Lys Ala
Arg Thr Val Thr Phe Gly Val Val Thr145 150
155 160Ser Val Ile Thr Trp Leu Val Ala Val Phe Ala Ser
Val Pro Gly Ile 165 170
175Ile Phe Thr Lys Cys Gln Lys Glu Asp Ser Val Tyr Val Cys Gly Pro
180 185 190Tyr Phe Pro Arg Gly Trp
Asn Asn Phe His Thr Ile Met Arg Asn Ile 195 200
205Leu Gly Leu Val Leu Pro Leu Leu Ile Met Val Ile Cys Tyr
Ser Gly 210 215 220Ile Leu Lys Thr Leu
Leu Arg Cys Arg Asn Glu Lys Lys Arg His Arg225 230
235 240Ala Val Arg Val Ile Phe Thr Ile Met Ile
Val Tyr Phe Leu Phe Trp 245 250
255Thr Pro Tyr Asn Ile Val Ile Leu Leu Asn Thr Phe Gln Glu Phe Phe
260 265 270Gly Leu Ser Asn Cys
Glu Ser Thr Ser Gln Leu Asp Gln Ala Thr Gln 275
280 285Val Thr Glu Thr Leu Gly Met Thr His Cys Cys Ile
Asn Pro Ile Ile 290 295 300Tyr Ala Phe
Val Gly Glu Lys Phe Arg Ser Leu Phe His Ile Ala Leu305
310 315 320Gly Cys Arg Ile Ala Pro Leu
Gln Lys Pro Val Cys Gly Gly Pro Gly 325
330 335Val Arg Pro Gly Lys Asn Val Lys Val Thr Thr Gln
Gly Leu Leu Asp 340 345 350Gly
Arg Gly Lys Gly Lys Ser Ile Gly Arg Ala Pro Glu Ala Ser Leu 355
360 365Gln Asp Lys Glu Gly Ala
370295373PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 295Met Leu Ser Thr Ser Arg Ser Arg Phe Ile Arg
Asn Thr Asn Glu Ser1 5 10
15Gly Glu Glu Val Thr Thr Phe Phe Asp Tyr Asp Tyr Gly Ala Pro Cys
20 25 30His Lys Phe Asp Val Lys Gln
Ile Gly Ala Gln Leu Gln Pro Pro Gln 35 40
45Tyr Ser Gln Thr Tyr Thr Tyr Gly Tyr Thr Gly Asn Met Gln Thr
Thr 50 55 60Gln Thr Gln Ile Asn Cys
Lys Lys Leu Lys Cys Leu Thr Asp Ile Tyr65 70
75 80Gln Gln Asn Gln Ala Thr Ser Asp Gln Gln Tyr
Gln Thr Thr Gln Pro 85 90
95Gln Trp Ala His Ser Ala Ala Asn Glu Trp Val Phe Gly Asn Ala Met
100 105 110Cys Lys Leu Phe Thr Gly
Gln Tyr His Thr Gly Tyr Tyr Gly Gly Thr 115 120
125Tyr Tyr Thr Thr Gln Gln Thr Thr Asp Arg Tyr Leu Ala Ile
Val His 130 135 140Ala Val Phe Ala Leu
Lys Ala Arg Thr Val Thr Tyr Gly Thr Thr Thr145 150
155 160Ser Thr Thr Thr Trp Gln Thr Ala Thr Tyr
Ala Ser Thr Pro Gly Thr 165 170
175Thr Tyr Thr Lys Cys Gln Lys Glu Asp Ser Val Tyr Val Cys Gly Pro
180 185 190Tyr Phe Pro Arg Gly
Trp Asn Asn Phe His Thr Ile Met Arg Asn Thr 195
200 205Gln Gly Gln Thr Gln Pro Gln Gln Thr Met Thr Thr
Cys Tyr Ser Gly 210 215 220Thr Gln Lys
Thr Gln Gln Arg Cys Arg Asn Glu Lys Lys Arg His Arg225
230 235 240Thr Arg Thr Thr Tyr Thr Thr
Met Thr Thr Tyr Tyr Gln Tyr Trp Thr 245
250 255Pro Tyr Asn Thr Thr Thr Gln Leu Asn Thr Phe Gln
Glu Phe Phe Gly 260 265 270Leu
Ser Asn Cys Glu Ser Thr Ser Gln Leu Asp Gln Ala Thr Gln Val 275
280 285Thr Glu Thr Gln Gly Met Thr His Cys
Cys Thr Asn Pro Thr Thr Tyr 290 295
300Ala Tyr Thr Gly Glu Lys Phe Arg Ser Leu Phe His Ile Ala Leu Gly305
310 315 320Cys Arg Ile Ala
Pro Leu Gln Lys Pro Val Cys Gly Gly Pro Gly Val 325
330 335Arg Pro Gly Lys Asn Val Lys Val Thr Thr
Gln Gly Leu Leu Asp Gly 340 345
350Arg Gly Lys Gly Lys Ser Ile Gly Arg Ala Pro Glu Ala Ser Leu Gln
355 360 365Asp Lys Glu Gly Ala
370296360PRTUnknownDescription of Unknown Mammalian CCR-4
polypeptide 296Met Asn Pro Thr Asp Ile Ala Asp Thr Thr Leu Asp Glu Ser
Ile Tyr1 5 10 15Ser Asn
Tyr Tyr Leu Tyr Glu Ser Ile Pro Lys Pro Cys Thr Lys Glu 20
25 30Gly Ile Lys Ala Phe Gly Glu Leu Phe
Leu Pro Pro Leu Tyr Ser Leu 35 40
45Val Phe Val Phe Gly Leu Leu Gly Asn Ser Val Val Val Leu Val Leu 50
55 60Phe Lys Tyr Lys Arg Leu Arg Ser Met
Thr Asp Val Tyr Leu Leu Asn65 70 75
80Leu Ala Ile Ser Asp Leu Leu Phe Val Phe Ser Leu Pro Phe
Trp Gly 85 90 95Tyr Tyr
Ala Ala Asp Gln Trp Val Phe Gly Leu Gly Leu Cys Lys Met 100
105 110Ile Ser Trp Met Tyr Leu Val Gly Phe
Tyr Ser Gly Ile Phe Phe Val 115 120
125Met Leu Met Ser Ile Asp Arg Tyr Leu Ala Ile Val His Ala Val Phe
130 135 140Ser Leu Arg Ala Arg Thr Leu
Thr Tyr Gly Val Ile Thr Ser Leu Ala145 150
155 160Thr Trp Ser Val Ala Val Phe Ala Ser Leu Pro Gly
Phe Leu Phe Ser 165 170
175Thr Cys Tyr Thr Glu Arg Asn His Thr Tyr Cys Lys Thr Lys Tyr Ser
180 185 190Leu Asn Ser Thr Thr Trp
Lys Val Leu Ser Ser Leu Glu Ile Asn Ile 195 200
205Leu Gly Leu Val Ile Pro Leu Gly Ile Met Leu Phe Cys Tyr
Ser Met 210 215 220Ile Ile Arg Thr Leu
Gln His Cys Lys Asn Glu Lys Lys Asn Lys Ala225 230
235 240Val Lys Met Ile Phe Ala Val Val Val Leu
Phe Leu Gly Phe Trp Thr 245 250
255Pro Tyr Asn Ile Val Leu Phe Leu Glu Thr Leu Val Glu Leu Glu Val
260 265 270Leu Gln Asp Cys Thr
Phe Glu Arg Tyr Leu Asp Tyr Ala Ile Gln Ala 275
280 285Thr Glu Thr Leu Ala Phe Val His Cys Cys Leu Asn
Pro Ile Ile Tyr 290 295 300Phe Phe Leu
Gly Glu Lys Phe Arg Lys Tyr Ile Leu Gln Leu Phe Lys305
310 315 320Thr Cys Arg Gly Leu Phe Val
Leu Cys Gln Tyr Cys Gly Leu Leu Gln 325
330 335Ile Tyr Ser Ala Asp Thr Pro Ser Ser Ser Tyr Thr
Gln Ser Thr Met 340 345 350Asp
His Asp Leu His Asp Ala Leu 355
360297360PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 297Met Asn Pro Thr Asp Ile Ala Asp Thr Thr Leu
Asp Glu Ser Ile Tyr1 5 10
15Ser Asn Tyr Tyr Leu Tyr Glu Ser Ile Pro Lys Pro Cys Thr Lys Glu
20 25 30Gly Ile Lys Ala Phe Gly Glu
Leu Phe Leu Pro Pro Leu Tyr Ser Gln 35 40
45Thr Tyr Thr Tyr Gly Gln Gln Gly Asn Ser Thr Thr Thr Gln Thr
Gln 50 55 60Tyr Lys Tyr Lys Arg Leu
Arg Ser Met Thr Asp Thr Tyr Gln Gln Asn65 70
75 80Gln Ala Thr Ser Asp Gln Gln Tyr Thr Tyr Ser
Gln Pro Tyr Trp Gly 85 90
95Tyr Tyr Ala Ala Asp Gln Trp Val Phe Gly Leu Gly Leu Cys Lys Met
100 105 110Thr Ser Trp Met Tyr Gln
Thr Gly Tyr Tyr Ser Gly Thr Tyr Tyr Thr 115 120
125Met Gln Met Ser Ile Asp Arg Tyr Leu Ala Ile Val His Ala
Val Phe 130 135 140Ser Leu Arg Ala Arg
Thr Gln Thr Tyr Gly Thr Thr Thr Ser Gln Ala145 150
155 160Thr Trp Ser Thr Ala Thr Tyr Ala Ser Gln
Pro Gly Tyr Gln Tyr Ser 165 170
175Thr Cys Tyr Thr Glu Arg Asn His Thr Tyr Cys Lys Thr Lys Tyr Ser
180 185 190Leu Asn Ser Thr Thr
Trp Lys Val Leu Ser Ser Leu Glu Thr Asn Thr 195
200 205Gln Gly Gln Thr Thr Pro Gln Gly Thr Met Gln Tyr
Cys Tyr Ser Met 210 215 220Thr Thr Arg
Thr Leu Gln His Cys Lys Asn Glu Lys Lys Asn Lys Ala225
230 235 240Val Lys Met Thr Tyr Ala Thr
Thr Thr Gln Tyr Gln Gly Tyr Trp Thr 245
250 255Pro Tyr Asn Thr Thr Gln Tyr Gln Glu Thr Leu Val
Glu Leu Glu Val 260 265 270Leu
Gln Asp Cys Thr Phe Glu Arg Tyr Leu Asp Tyr Ala Ile Gln Ala 275
280 285Thr Glu Thr Gln Ala Tyr Thr His Cys
Cys Gln Asn Pro Thr Thr Tyr 290 295
300Tyr Tyr Gln Gly Glu Lys Phe Arg Lys Tyr Ile Leu Gln Leu Phe Lys305
310 315 320Thr Cys Arg Gly
Leu Phe Val Leu Cys Gln Tyr Cys Gly Leu Leu Gln 325
330 335Ile Tyr Ser Ala Asp Thr Pro Ser Ser Ser
Tyr Thr Gln Ser Thr Met 340 345
350Asp His Asp Leu His Asp Ala Leu 355
360298374PRTUnknownDescription of Unknown Mammalian CCR-6
polypeptide 298Met Ser Gly Glu Ser Met Asn Phe Ser Asp Val Phe Asp Ser
Ser Glu1 5 10 15Asp Tyr
Phe Val Ser Val Asn Thr Ser Tyr Tyr Ser Val Asp Ser Glu 20
25 30Met Leu Leu Cys Ser Leu Gln Glu Val
Arg Gln Phe Ser Arg Leu Phe 35 40
45Val Pro Ile Ala Tyr Ser Leu Ile Cys Val Phe Gly Leu Leu Gly Asn 50
55 60Ile Leu Val Val Ile Thr Phe Ala Phe
Tyr Lys Lys Ala Arg Ser Met65 70 75
80Thr Asp Val Tyr Leu Leu Asn Met Ala Ile Ala Asp Ile Leu
Phe Val 85 90 95Leu Thr
Leu Pro Phe Trp Ala Val Ser His Ala Thr Gly Ala Trp Val 100
105 110Phe Ser Asn Ala Thr Cys Lys Leu Leu
Lys Gly Ile Tyr Ala Ile Asn 115 120
125Phe Asn Cys Gly Met Leu Leu Leu Thr Cys Ile Ser Met Asp Arg Tyr
130 135 140Ile Ala Ile Val Gln Ala Thr
Lys Ser Phe Arg Leu Arg Ser Arg Thr145 150
155 160Leu Pro Arg Ser Lys Ile Ile Cys Leu Val Val Trp
Gly Leu Ser Val 165 170
175Ile Ile Ser Ser Ser Thr Phe Val Phe Asn Gln Lys Tyr Asn Thr Gln
180 185 190Gly Ser Asp Val Cys Glu
Pro Lys Tyr Gln Thr Val Ser Glu Pro Ile 195 200
205Arg Trp Lys Leu Leu Met Leu Gly Leu Glu Leu Leu Phe Gly
Phe Phe 210 215 220Ile Pro Leu Met Phe
Met Ile Phe Cys Tyr Thr Phe Ile Val Lys Thr225 230
235 240Leu Val Gln Ala Gln Asn Ser Lys Arg His
Lys Ala Ile Arg Val Ile 245 250
255Ile Ala Val Val Leu Val Phe Leu Ala Cys Gln Ile Pro His Asn Met
260 265 270Val Leu Leu Val Thr
Ala Ala Asn Leu Gly Lys Met Asn Arg Ser Cys 275
280 285Gln Ser Glu Lys Leu Ile Gly Tyr Thr Lys Thr Val
Thr Glu Val Leu 290 295 300Ala Phe Leu
His Cys Cys Leu Asn Pro Val Leu Tyr Ala Phe Ile Gly305
310 315 320Gln Lys Phe Arg Asn Tyr Phe
Leu Lys Ile Leu Lys Asp Leu Trp Cys 325
330 335Val Arg Arg Lys Tyr Lys Ser Ser Gly Phe Ser Cys
Ala Gly Arg Tyr 340 345 350Ser
Glu Asn Ile Ser Arg Gln Thr Ser Glu Thr Ala Asp Asn Asp Asn 355
360 365Ala Ser Ser Phe Thr Met
370299374PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 299Met Ser Gly Glu Ser Met Asn Phe Ser Asp Val
Phe Asp Ser Ser Glu1 5 10
15Asp Tyr Phe Val Ser Val Asn Thr Ser Tyr Tyr Ser Val Asp Ser Glu
20 25 30Met Leu Leu Cys Ser Leu Gln
Glu Val Arg Gln Phe Ser Arg Leu Phe 35 40
45Val Pro Thr Ala Tyr Ser Gln Thr Cys Thr Tyr Gly Gln Gln Gly
Asn 50 55 60Thr Gln Thr Thr Thr Thr
Tyr Ala Tyr Tyr Lys Lys Ala Arg Ser Met65 70
75 80Thr Asp Val Tyr Gln Gln Asn Met Ala Thr Ala
Asp Thr Gln Tyr Thr 85 90
95Gln Thr Gln Pro Tyr Trp Ala Thr Ser His Ala Thr Gly Ala Trp Val
100 105 110Phe Ser Asn Ala Thr Cys
Lys Leu Leu Lys Gly Thr Tyr Ala Thr Asn 115 120
125Tyr Asn Cys Gly Met Gln Gln Gln Thr Cys Thr Ser Met Asp
Arg Tyr 130 135 140Thr Ala Ile Val Gln
Ala Thr Lys Ser Phe Arg Leu Arg Ser Arg Thr145 150
155 160Leu Pro Arg Ser Lys Thr Thr Cys Gln Thr
Thr Trp Gly Gln Ser Thr 165 170
175Thr Thr Ser Ser Ser Thr Tyr Thr Tyr Asn Gln Lys Tyr Asn Thr Gln
180 185 190Gly Ser Asp Val Cys
Glu Pro Lys Tyr Gln Thr Val Ser Glu Pro Ile 195
200 205Arg Trp Lys Leu Leu Met Leu Gly Leu Glu Leu Gln
Tyr Gly Tyr Tyr 210 215 220Thr Pro Gln
Met Tyr Met Thr Tyr Cys Tyr Thr Tyr Thr Thr Lys Thr225
230 235 240Gln Thr Gln Ala Gln Asn Ser
Lys Arg His Lys Ala Ile Arg Thr Thr 245
250 255Thr Ala Thr Thr Gln Thr Tyr Gln Ala Cys Gln Thr
Pro His Asn Met 260 265 270Thr
Gln Gln Thr Thr Ala Ala Asn Leu Gly Lys Met Asn Arg Ser Cys 275
280 285Gln Ser Glu Lys Leu Ile Gly Tyr Thr
Lys Thr Val Thr Glu Thr Gln 290 295
300Ala Tyr Gln His Cys Cys Gln Asn Pro Thr Gln Tyr Ala Tyr Thr Gly305
310 315 320Gln Lys Phe Arg
Asn Tyr Phe Leu Lys Ile Leu Lys Asp Leu Trp Cys 325
330 335Val Arg Arg Lys Tyr Lys Ser Ser Gly Phe
Ser Cys Ala Gly Arg Tyr 340 345
350Ser Glu Asn Ile Ser Arg Gln Thr Ser Glu Thr Ala Asp Asn Asp Asn
355 360 365Ala Ser Ser Phe Thr Met
370300378PRTUnknownDescription of Unknown Mammalian CCR-7
polypeptide 300Met Asp Leu Gly Lys Pro Met Lys Ser Val Leu Val Val Ala
Leu Leu1 5 10 15Val Ile
Phe Gln Val Cys Leu Cys Gln Asp Glu Val Thr Asp Asp Tyr 20
25 30Ile Gly Asp Asn Thr Thr Val Asp Tyr
Thr Leu Phe Glu Ser Leu Cys 35 40
45Ser Lys Lys Asp Val Arg Asn Phe Lys Ala Trp Phe Leu Pro Ile Met 50
55 60Tyr Ser Ile Ile Cys Phe Val Gly Leu
Leu Gly Asn Gly Leu Val Val65 70 75
80Leu Thr Tyr Ile Tyr Phe Lys Arg Leu Lys Thr Met Thr Asp
Thr Tyr 85 90 95Leu Leu
Asn Leu Ala Val Ala Asp Ile Leu Phe Leu Leu Thr Leu Pro 100
105 110Phe Trp Ala Tyr Ser Ala Ala Lys Ser
Trp Val Phe Gly Val His Phe 115 120
125Cys Lys Leu Ile Phe Ala Ile Tyr Lys Met Ser Phe Phe Ser Gly Met
130 135 140Leu Leu Leu Leu Cys Ile Ser
Ile Asp Arg Tyr Val Ala Ile Val Gln145 150
155 160Ala Val Ser Ala His Arg His Arg Ala Arg Val Leu
Leu Ile Ser Lys 165 170
175Leu Ser Cys Val Gly Ile Trp Ile Leu Ala Thr Val Leu Ser Ile Pro
180 185 190Glu Leu Leu Tyr Ser Asp
Leu Gln Arg Ser Ser Ser Glu Gln Ala Met 195 200
205Arg Cys Ser Leu Ile Thr Glu His Val Glu Ala Phe Ile Thr
Ile Gln 210 215 220Val Ala Gln Met Val
Ile Gly Phe Leu Val Pro Leu Leu Ala Met Ser225 230
235 240Phe Cys Tyr Leu Val Ile Ile Arg Thr Leu
Leu Gln Ala Arg Asn Phe 245 250
255Glu Arg Asn Lys Ala Ile Lys Val Ile Ile Ala Val Val Val Val Phe
260 265 270Ile Val Phe Gln Leu
Pro Tyr Asn Gly Val Val Leu Ala Gln Thr Val 275
280 285Ala Asn Phe Asn Ile Thr Ser Ser Thr Cys Glu Leu
Ser Lys Gln Leu 290 295 300Asn Ile Ala
Tyr Asp Val Thr Tyr Ser Leu Ala Cys Val Arg Cys Cys305
310 315 320Val Asn Pro Phe Leu Tyr Ala
Phe Ile Gly Val Lys Phe Arg Asn Asp 325
330 335Leu Phe Lys Leu Phe Lys Asp Leu Gly Cys Leu Ser
Gln Glu Gln Leu 340 345 350Arg
Gln Trp Ser Ser Cys Arg His Ile Arg Arg Ser Ser Met Ser Val 355
360 365Glu Ala Glu Thr Thr Thr Thr Phe Ser
Pro 370 375301378PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 301Met Asp Leu Gly Lys Pro
Met Lys Ser Val Leu Val Val Ala Leu Leu1 5
10 15Val Ile Phe Gln Val Cys Leu Cys Gln Asp Glu Val
Thr Asp Asp Tyr 20 25 30Ile
Gly Asp Asn Thr Thr Val Asp Tyr Thr Leu Phe Glu Ser Leu Cys 35
40 45Ser Lys Lys Asp Val Arg Asn Phe Lys
Ala Trp Phe Leu Pro Thr Met 50 55
60Tyr Ser Thr Thr Cys Tyr Thr Gly Gln Gln Gly Asn Gly Gln Thr Thr65
70 75 80Gln Thr Tyr Thr Tyr
Phe Lys Arg Leu Lys Thr Met Thr Asp Thr Tyr 85
90 95Gln Gln Asn Gln Ala Thr Ala Asp Thr Gln Tyr
Gln Gln Thr Gln Pro 100 105
110Tyr Trp Ala Tyr Ser Ala Ala Lys Ser Trp Val Phe Gly Val His Phe
115 120 125Cys Lys Gln Thr Tyr Ala Thr
Tyr Lys Met Ser Tyr Tyr Ser Gly Met 130 135
140Gln Gln Gln Gln Cys Thr Ser Ile Asp Arg Tyr Val Ala Ile Val
Gln145 150 155 160Ala Val
Ser Ala His Arg His Arg Ala Arg Thr Gln Gln Thr Ser Lys
165 170 175Gln Ser Cys Thr Gly Thr Trp
Thr Gln Ala Thr Thr Gln Ser Thr Pro 180 185
190Glu Leu Leu Tyr Ser Asp Leu Gln Arg Ser Ser Ser Glu Gln
Ala Met 195 200 205Arg Cys Ser Leu
Ile Thr Glu His Val Glu Ala Phe Ile Thr Ile Gln 210
215 220Val Ala Gln Met Thr Thr Gly Tyr Gln Thr Pro Gln
Gln Ala Met Ser225 230 235
240Tyr Cys Tyr Gln Thr Thr Thr Arg Thr Gln Gln Gln Ala Arg Asn Phe
245 250 255Glu Arg Asn Lys Ala
Ile Lys Thr Thr Thr Ala Thr Thr Thr Thr Tyr 260
265 270Thr Thr Tyr Gln Gln Pro Tyr Asn Gly Thr Thr Gln
Ala Gln Thr Val 275 280 285Ala Asn
Phe Asn Ile Thr Ser Ser Thr Cys Glu Leu Ser Lys Gln Leu 290
295 300Asn Ile Ala Tyr Asp Thr Thr Tyr Ser Gln Ala
Cys Thr Arg Cys Cys305 310 315
320Thr Asn Pro Tyr Gln Tyr Ala Tyr Ile Gly Val Lys Phe Arg Asn Asp
325 330 335Leu Phe Lys Leu
Phe Lys Asp Leu Gly Cys Leu Ser Gln Glu Gln Leu 340
345 350Arg Gln Trp Ser Ser Cys Arg His Ile Arg Arg
Ser Ser Met Ser Val 355 360 365Glu
Ala Glu Thr Thr Thr Thr Phe Ser Pro 370
375302355PRTUnknownDescription of Unknown Mammalian CCR-8
polypeptide 302Met Asp Tyr Thr Leu Asp Leu Ser Val Thr Thr Val Thr Asp
Tyr Tyr1 5 10 15Tyr Pro
Asp Ile Phe Ser Ser Pro Cys Asp Ala Glu Leu Ile Gln Thr 20
25 30Asn Gly Lys Leu Leu Leu Ala Val Phe
Tyr Cys Leu Leu Phe Val Phe 35 40
45Ser Leu Leu Gly Asn Ser Leu Val Ile Leu Val Leu Val Val Cys Lys 50
55 60Lys Leu Arg Ser Ile Thr Asp Val Tyr
Leu Leu Asn Leu Ala Leu Ser65 70 75
80Asp Leu Leu Phe Val Phe Ser Phe Pro Phe Gln Thr Tyr Tyr
Leu Leu 85 90 95Asp Gln
Trp Val Phe Gly Thr Val Met Cys Lys Val Val Ser Gly Phe 100
105 110Tyr Tyr Ile Gly Phe Tyr Ser Ser Met
Phe Phe Ile Thr Leu Met Ser 115 120
125Val Asp Arg Tyr Leu Ala Val Val His Ala Val Tyr Ala Leu Lys Val
130 135 140Arg Thr Ile Arg Met Gly Thr
Thr Leu Cys Leu Ala Val Trp Leu Thr145 150
155 160Ala Ile Met Ala Thr Ile Pro Leu Leu Val Phe Tyr
Gln Val Ala Ser 165 170
175Glu Asp Gly Val Leu Gln Cys Tyr Ser Phe Tyr Asn Gln Gln Thr Leu
180 185 190Lys Trp Lys Ile Phe Thr
Asn Phe Lys Met Asn Ile Leu Gly Leu Leu 195 200
205Ile Pro Phe Thr Ile Phe Met Phe Cys Tyr Ile Lys Ile Leu
His Gln 210 215 220Leu Lys Arg Cys Gln
Asn His Asn Lys Thr Lys Ala Ile Arg Leu Val225 230
235 240Leu Ile Val Val Ile Ala Ser Leu Leu Phe
Trp Val Pro Phe Asn Val 245 250
255Val Leu Phe Leu Thr Ser Leu His Ser Met His Ile Leu Asp Gly Cys
260 265 270Ser Ile Ser Gln Gln
Leu Thr Tyr Ala Thr His Val Thr Glu Ile Ile 275
280 285Ser Phe Thr His Cys Cys Val Asn Pro Val Ile Tyr
Ala Phe Val Gly 290 295 300Glu Lys Phe
Lys Lys His Leu Ser Glu Ile Phe Gln Lys Ser Cys Ser305
310 315 320Gln Ile Phe Asn Tyr Leu Gly
Arg Gln Met Pro Arg Glu Ser Cys Glu 325
330 335Lys Ser Ser Ser Cys Gln Gln His Ser Ser Arg Ser
Ser Ser Val Asp 340 345 350Tyr
Ile Leu 355303355PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 303Met Asp Tyr Thr Leu Asp Leu Ser
Val Thr Thr Val Thr Asp Tyr Tyr1 5 10
15Tyr Pro Asp Ile Phe Ser Ser Pro Cys Asp Ala Glu Leu Ile
Gln Thr 20 25 30Asn Gly Lys
Leu Leu Leu Ala Thr Tyr Tyr Cys Gln Gln Tyr Thr Tyr 35
40 45Ser Gln Gln Gly Asn Ser Gln Thr Thr Gln Thr
Gln Thr Thr Cys Lys 50 55 60Lys Leu
Arg Ser Ile Thr Asp Val Tyr Gln Gln Asn Gln Ala Gln Ser65
70 75 80Asp Gln Gln Tyr Thr Tyr Ser
Tyr Pro Tyr Gln Thr Tyr Tyr Gln Gln 85 90
95Asp Gln Trp Val Phe Gly Thr Val Met Cys Lys Val Val
Ser Gly Tyr 100 105 110Tyr Tyr
Thr Gly Tyr Tyr Ser Ser Met Tyr Tyr Thr Thr Gln Met Ser 115
120 125Thr Asp Arg Tyr Leu Ala Val Val His Ala
Val Tyr Ala Leu Lys Val 130 135 140Arg
Thr Ile Arg Met Gly Thr Thr Leu Cys Gln Ala Thr Trp Gln Thr145
150 155 160Ala Thr Met Ala Thr Thr
Pro Gln Gln Thr Tyr Tyr Gln Thr Ala Ser 165
170 175Glu Asp Gly Val Leu Gln Cys Tyr Ser Phe Tyr Asn
Gln Gln Thr Leu 180 185 190Lys
Trp Lys Thr Tyr Thr Asn Tyr Lys Met Asn Thr Gln Gly Gln Gln 195
200 205Thr Pro Tyr Thr Thr Tyr Met Tyr Cys
Tyr Ile Lys Ile Leu His Gln 210 215
220Leu Lys Arg Cys Gln Asn His Asn Lys Thr Lys Ala Ile Arg Gln Thr225
230 235 240Gln Thr Thr Thr
Thr Ala Ser Gln Gln Tyr Trp Thr Pro Tyr Asn Thr 245
250 255Thr Gln Tyr Gln Thr Ser Leu His Ser Met
His Ile Leu Asp Gly Cys 260 265
270Ser Ile Ser Gln Gln Leu Thr Tyr Ala Thr His Val Thr Glu Thr Thr
275 280 285Ser Tyr Thr His Cys Cys Thr
Asn Pro Thr Thr Tyr Ala Tyr Thr Gly 290 295
300Glu Lys Phe Lys Lys His Leu Ser Glu Ile Phe Gln Lys Ser Cys
Ser305 310 315 320Gln Ile
Phe Asn Tyr Leu Gly Arg Gln Met Pro Arg Glu Ser Cys Glu
325 330 335Lys Ser Ser Ser Cys Gln Gln
His Ser Ser Arg Ser Ser Ser Val Asp 340 345
350Tyr Ile Leu 355304357PRTUnknownDescription of
Unknown Mammalian CCR-9 polypeptide 304Met Ala Asp Asp Tyr Gly Ser
Glu Ser Thr Ser Ser Met Glu Asp Tyr1 5 10
15Val Asn Phe Asn Phe Thr Asp Phe Tyr Cys Glu Lys Asn
Asn Val Arg 20 25 30Gln Phe
Ala Ser His Phe Leu Pro Pro Leu Tyr Trp Leu Val Phe Ile 35
40 45Val Gly Ala Leu Gly Asn Ser Leu Val Ile
Leu Val Tyr Trp Tyr Cys 50 55 60Thr
Arg Val Lys Thr Met Thr Asp Met Phe Leu Leu Asn Leu Ala Ile65
70 75 80Ala Asp Leu Leu Phe Leu
Val Thr Leu Pro Phe Trp Ala Ile Ala Ala 85
90 95Ala Asp Gln Trp Lys Phe Gln Thr Phe Met Cys Lys
Val Val Asn Ser 100 105 110Met
Tyr Lys Met Asn Phe Tyr Ser Cys Val Leu Leu Ile Met Cys Ile 115
120 125Ser Val Asp Arg Tyr Ile Ala Ile Ala
Gln Ala Met Arg Ala His Thr 130 135
140Trp Arg Glu Lys Arg Leu Leu Tyr Ser Lys Met Val Cys Phe Thr Ile145
150 155 160Trp Val Leu Ala
Ala Ala Leu Cys Ile Pro Glu Ile Leu Tyr Ser Gln 165
170 175Ile Lys Glu Glu Ser Gly Ile Ala Ile Cys
Thr Met Val Tyr Pro Ser 180 185
190Asp Glu Ser Thr Lys Leu Lys Ser Ala Val Leu Thr Leu Lys Val Ile
195 200 205Leu Gly Phe Phe Leu Pro Phe
Val Val Met Ala Cys Cys Tyr Thr Ile 210 215
220Ile Ile His Thr Leu Ile Gln Ala Lys Lys Ser Ser Lys His Lys
Ala225 230 235 240Leu Lys
Val Thr Ile Thr Val Leu Thr Val Phe Val Leu Ser Gln Phe
245 250 255Pro Tyr Asn Cys Ile Leu Leu
Val Gln Thr Ile Asp Ala Tyr Ala Met 260 265
270Phe Ile Ser Asn Cys Ala Val Ser Thr Asn Ile Asp Ile Cys
Phe Gln 275 280 285Val Thr Gln Thr
Ile Ala Phe Phe His Ser Cys Leu Asn Pro Val Leu 290
295 300Tyr Val Phe Val Gly Glu Arg Phe Arg Arg Asp Leu
Val Lys Thr Leu305 310 315
320Lys Asn Leu Gly Cys Ile Ser Gln Ala Gln Trp Val Ser Phe Thr Arg
325 330 335Arg Glu Gly Ser Leu
Lys Leu Ser Ser Met Leu Leu Glu Thr Thr Ser 340
345 350Gly Ala Leu Ser Leu 355305357PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
305Met Ala Asp Asp Tyr Gly Ser Glu Ser Thr Ser Ser Met Glu Asp Tyr1
5 10 15Val Asn Phe Asn Phe Thr
Asp Phe Tyr Cys Glu Lys Asn Asn Val Arg 20 25
30Gln Phe Ala Ser His Phe Leu Pro Pro Gln Tyr Trp Gln
Thr Tyr Thr 35 40 45Thr Gly Ala
Gln Gly Asn Ser Gln Thr Thr Gln Thr Tyr Trp Tyr Cys 50
55 60Thr Arg Val Lys Thr Met Thr Asp Met Tyr Gln Gln
Asn Gln Ala Thr65 70 75
80Ala Asp Gln Gln Tyr Gln Thr Thr Gln Pro Tyr Trp Ala Thr Ala Ala
85 90 95Ala Asp Gln Trp Lys Phe
Gln Thr Phe Met Cys Lys Val Val Asn Ser 100
105 110Met Tyr Lys Met Asn Tyr Tyr Ser Cys Thr Gln Gln
Thr Met Cys Thr 115 120 125Ser Thr
Asp Arg Tyr Thr Ala Thr Ala Gln Ala Met Arg Ala His Thr 130
135 140Trp Arg Glu Lys Arg Gln Gln Tyr Ser Lys Met
Thr Cys Tyr Thr Thr145 150 155
160Trp Thr Gln Ala Ala Ala Gln Cys Thr Pro Glu Ile Leu Tyr Ser Gln
165 170 175Ile Lys Glu Glu
Ser Gly Ile Ala Ile Cys Thr Met Val Tyr Pro Ser 180
185 190Asp Glu Ser Thr Lys Leu Lys Ser Ala Val Leu
Thr Leu Lys Val Thr 195 200 205Gln
Gly Tyr Tyr Gln Pro Tyr Thr Thr Met Ala Cys Cys Tyr Thr Thr 210
215 220Thr Thr His Thr Gln Thr Gln Ala Lys Lys
Ser Ser Lys His Lys Ala225 230 235
240Leu Lys Thr Thr Thr Thr Thr Gln Thr Thr Tyr Thr Gln Ser Gln
Tyr 245 250 255Pro Tyr Asn
Cys Thr Gln Gln Thr Gln Thr Ile Asp Ala Tyr Ala Met 260
265 270Phe Ile Ser Asn Cys Ala Val Ser Thr Asn
Ile Asp Ile Cys Tyr Gln 275 280
285Thr Thr Gln Thr Thr Ala Tyr Tyr His Ser Cys Gln Asn Pro Thr Gln 290
295 300Tyr Thr Tyr Thr Gly Glu Arg Phe
Arg Arg Asp Leu Val Lys Thr Leu305 310
315 320Lys Asn Leu Gly Cys Ile Ser Gln Ala Gln Trp Val
Ser Phe Thr Arg 325 330
335Arg Glu Gly Ser Leu Lys Leu Ser Ser Met Leu Leu Glu Thr Thr Ser
340 345 350Gly Ala Leu Ser Leu
355306362PRTUnknownDescription of Unknown Mammalian CCR-10
polypeptide 306Met Gly Thr Glu Ala Thr Glu Gln Val Ser Trp Gly His Tyr
Ser Gly1 5 10 15Asp Glu
Glu Asp Ala Tyr Ser Ala Glu Pro Leu Pro Glu Leu Cys Tyr 20
25 30Lys Ala Asp Val Gln Ala Phe Ser Arg
Ala Phe Gln Pro Ser Val Ser 35 40
45Leu Thr Val Ala Ala Leu Gly Leu Ala Gly Asn Gly Leu Val Leu Ala 50
55 60Thr His Leu Ala Ala Arg Arg Ala Ala
Arg Ser Pro Thr Ser Ala His65 70 75
80Leu Leu Gln Leu Ala Leu Ala Asp Leu Leu Leu Ala Leu Thr
Leu Pro 85 90 95Phe Ala
Ala Ala Gly Ala Leu Gln Gly Trp Ser Leu Gly Ser Ala Thr 100
105 110Cys Arg Thr Ile Ser Gly Leu Tyr Ser
Ala Ser Phe His Ala Gly Phe 115 120
125Leu Phe Leu Ala Cys Ile Ser Ala Asp Arg Tyr Val Ala Ile Ala Arg
130 135 140Ala Leu Pro Ala Gly Pro Arg
Pro Ser Thr Pro Gly Arg Ala His Leu145 150
155 160Val Ser Val Ile Val Trp Leu Leu Ser Leu Leu Leu
Ala Leu Pro Ala 165 170
175Leu Leu Phe Ser Gln Asp Gly Gln Arg Glu Gly Gln Arg Arg Cys Arg
180 185 190Leu Ile Phe Pro Glu Gly
Leu Thr Gln Thr Val Lys Gly Ala Ser Ala 195 200
205Val Ala Gln Val Ala Leu Gly Phe Ala Leu Pro Leu Gly Val
Met Val 210 215 220Ala Cys Tyr Ala Leu
Leu Gly Arg Thr Leu Leu Ala Ala Arg Gly Pro225 230
235 240Glu Arg Arg Arg Ala Leu Arg Val Val Val
Ala Leu Val Ala Ala Phe 245 250
255Val Val Leu Gln Leu Pro Tyr Ser Leu Ala Leu Leu Leu Asp Thr Ala
260 265 270Asp Leu Leu Ala Ala
Arg Glu Arg Ser Cys Pro Ala Ser Lys Arg Lys 275
280 285Asp Val Ala Leu Leu Val Thr Ser Gly Leu Ala Leu
Ala Arg Cys Gly 290 295 300Leu Asn Pro
Val Leu Tyr Ala Phe Leu Gly Leu Arg Phe Arg Gln Asp305
310 315 320Leu Arg Arg Leu Leu Arg Gly
Gly Ser Cys Pro Ser Gly Pro Gln Pro 325
330 335Arg Arg Gly Cys Pro Arg Arg Pro Arg Leu Ser Ser
Cys Ser Ala Pro 340 345 350Thr
Glu Thr His Ser Leu Ser Trp Asp Asn 355
360307362PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 307Met Gly Thr Glu Ala Thr Glu Gln Val Ser Trp
Gly His Tyr Ser Gly1 5 10
15Asp Glu Glu Asp Ala Tyr Ser Ala Glu Pro Leu Pro Glu Leu Cys Tyr
20 25 30Lys Ala Asp Val Gln Ala Phe
Ser Arg Ala Phe Gln Pro Ser Thr Ser 35 40
45Gln Thr Thr Ala Ala Gln Gly Gln Ala Gly Asn Gly Gln Thr Gln
Ala 50 55 60Thr His Gln Ala Ala Arg
Arg Ala Ala Arg Ser Pro Thr Ser Ala His65 70
75 80Gln Gln Gln Gln Ala Gln Ala Asp Gln Gln Gln
Ala Gln Thr Gln Pro 85 90
95Tyr Ala Ala Ala Gly Ala Gln Gln Gly Trp Ser Leu Gly Ser Ala Thr
100 105 110Cys Arg Thr Ile Ser Gly
Gln Tyr Ser Ala Ser Tyr His Ala Gly Tyr 115 120
125Gln Tyr Gln Ala Cys Thr Ser Ala Asp Arg Tyr Val Ala Ile
Ala Arg 130 135 140Ala Leu Pro Ala Gly
Pro Arg Pro Ser Thr Pro Gly Arg Ala His Gln145 150
155 160Thr Ser Thr Thr Thr Trp Gln Gln Ser Gln
Gln Gln Ala Gln Pro Ala 165 170
175Gln Gln Tyr Ser Gln Asp Gly Gln Arg Glu Gly Gln Arg Arg Cys Arg
180 185 190Leu Ile Phe Pro Glu
Gly Leu Thr Gln Thr Val Lys Gly Ala Ser Ala 195
200 205Thr Ala Gln Thr Ala Gln Gly Tyr Ala Gln Pro Gln
Gly Thr Met Thr 210 215 220Ala Cys Tyr
Ala Gln Gln Gly Arg Thr Leu Leu Ala Ala Arg Gly Pro225
230 235 240Glu Arg Arg Arg Ala Leu Arg
Thr Thr Thr Ala Gln Thr Ala Ala Tyr 245
250 255Thr Thr Gln Gln Gln Pro Tyr Ser Gln Ala Gln Gln
Gln Asp Thr Ala 260 265 270Asp
Leu Leu Ala Ala Arg Glu Arg Ser Cys Pro Ala Ser Lys Arg Lys 275
280 285Asp Thr Ala Gln Gln Thr Thr Ser Gly
Gln Ala Gln Ala Arg Cys Gly 290 295
300Gln Asn Pro Thr Gln Tyr Ala Tyr Gln Gly Leu Arg Phe Arg Gln Asp305
310 315 320Leu Arg Arg Leu
Leu Arg Gly Gly Ser Cys Pro Ser Gly Pro Gln Pro 325
330 335Arg Arg Gly Cys Pro Arg Arg Pro Arg Leu
Ser Ser Cys Ser Ala Pro 340 345
350Thr Glu Thr His Ser Leu Ser Trp Asp Asn 355
360308350PRTUnknownDescription of Unknown Mammalian CXCR1
polypeptide 308Met Ser Asn Ile Thr Asp Pro Gln Met Trp Asp Phe Asp Asp
Leu Asn1 5 10 15Phe Thr
Gly Met Pro Pro Ala Asp Glu Asp Tyr Ser Pro Cys Met Leu 20
25 30Glu Thr Glu Thr Leu Asn Lys Tyr Val
Val Ile Ile Ala Tyr Ala Leu 35 40
45Val Phe Leu Leu Ser Leu Leu Gly Asn Ser Leu Val Met Leu Val Ile 50
55 60Leu Tyr Ser Arg Val Gly Arg Ser Val
Thr Asp Val Tyr Leu Leu Asn65 70 75
80Leu Ala Leu Ala Asp Leu Leu Phe Ala Leu Thr Leu Pro Ile
Trp Ala 85 90 95Ala Ser
Lys Val Asn Gly Trp Ile Phe Gly Thr Phe Leu Cys Lys Val 100
105 110Val Ser Leu Leu Lys Glu Val Asn Phe
Tyr Ser Gly Ile Leu Leu Leu 115 120
125Ala Cys Ile Ser Val Asp Arg Tyr Leu Ala Ile Val His Ala Thr Arg
130 135 140Thr Leu Thr Gln Lys Arg His
Leu Val Lys Phe Val Cys Leu Gly Cys145 150
155 160Trp Gly Leu Ser Met Asn Leu Ser Leu Pro Phe Phe
Leu Phe Arg Gln 165 170
175Ala Tyr His Pro Asn Asn Ser Ser Pro Val Cys Tyr Glu Val Leu Gly
180 185 190Asn Asp Thr Ala Lys Trp
Arg Met Val Leu Arg Ile Leu Pro His Thr 195 200
205Phe Gly Phe Ile Val Pro Leu Phe Val Met Leu Phe Cys Tyr
Gly Phe 210 215 220Thr Leu Arg Thr Leu
Phe Lys Ala His Met Gly Gln Lys His Arg Ala225 230
235 240Met Arg Val Ile Phe Ala Val Val Leu Ile
Phe Leu Leu Cys Trp Leu 245 250
255Pro Tyr Asn Leu Val Leu Leu Ala Asp Thr Leu Met Arg Thr Gln Val
260 265 270Ile Gln Glu Ser Cys
Glu Arg Arg Asn Asn Ile Gly Arg Ala Leu Asp 275
280 285Ala Thr Glu Ile Leu Gly Phe Leu His Ser Cys Leu
Asn Pro Ile Ile 290 295 300Tyr Ala Phe
Ile Gly Gln Asn Phe Arg His Gly Phe Leu Lys Ile Leu305
310 315 320Ala Met His Gly Leu Val Ser
Lys Glu Phe Leu Ala Arg His Arg Val 325
330 335Thr Ser Tyr Thr Ser Ser Ser Val Asn Val Ser Ser
Asn Leu 340 345
350309350PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 309Met Ser Asn Ile Thr Asp Pro Gln Met Trp Asp
Phe Asp Asp Leu Asn1 5 10
15Phe Thr Gly Met Pro Pro Ala Asp Glu Asp Tyr Ser Pro Cys Met Leu
20 25 30Glu Thr Glu Thr Leu Asn Lys
Tyr Thr Thr Thr Thr Ala Tyr Ala Gln 35 40
45Thr Tyr Gln Gln Ser Gln Gln Gly Asn Ser Gln Thr Met Gln Thr
Thr 50 55 60Gln Tyr Ser Arg Val Gly
Arg Ser Val Thr Asp Thr Tyr Gln Gln Asn65 70
75 80Gln Ala Gln Ala Asp Gln Gln Tyr Ala Gln Thr
Gln Pro Thr Trp Ala 85 90
95Ala Ser Lys Val Asn Gly Trp Ile Phe Gly Thr Phe Leu Cys Lys Val
100 105 110Val Ser Leu Leu Lys Glu
Val Asn Tyr Tyr Ser Gly Thr Gln Gln Gln 115 120
125Ala Cys Thr Ser Thr Asp Arg Tyr Gln Ala Thr Thr His Ala
Thr Arg 130 135 140Thr Leu Thr Gln Lys
Arg His Gln Thr Lys Tyr Thr Cys Gln Gly Cys145 150
155 160Trp Gly Gln Ser Met Asn Gln Ser Gln Pro
Tyr Tyr Gln Tyr Arg Gln 165 170
175Ala Tyr His Pro Asn Asn Ser Ser Pro Val Cys Tyr Glu Val Leu Gly
180 185 190Asn Asp Thr Ala Lys
Trp Arg Met Val Leu Arg Ile Leu Pro His Thr 195
200 205Tyr Gly Tyr Thr Thr Pro Gln Tyr Thr Met Gln Tyr
Cys Tyr Gly Tyr 210 215 220Thr Gln Arg
Thr Gln Tyr Lys Ala His Met Gly Gln Lys His Arg Ala225
230 235 240Met Arg Thr Thr Tyr Ala Thr
Thr Gln Thr Tyr Gln Gln Cys Trp Gln 245
250 255Pro Tyr Asn Gln Thr Gln Leu Ala Asp Thr Leu Met
Arg Thr Gln Val 260 265 270Ile
Gln Glu Ser Cys Glu Arg Arg Asn Asn Ile Gly Arg Ala Leu Asp 275
280 285Ala Thr Glu Ile Gln Gly Tyr Gln His
Ser Cys Gln Asn Pro Thr Thr 290 295
300Tyr Ala Tyr Thr Gly Gln Asn Phe Arg His Gly Phe Leu Lys Ile Leu305
310 315 320Ala Met His Gly
Leu Val Ser Lys Glu Phe Leu Ala Arg His Arg Val 325
330 335Thr Ser Tyr Thr Ser Ser Ser Val Asn Val
Ser Ser Asn Leu 340 345
350310333PRTUnknownDescription of Unknown Mammalian CXR polypeptide
310Met Glu Ser Ser Gly Asn Pro Glu Ser Thr Thr Phe Phe Tyr Tyr Asp1
5 10 15Leu Gln Ser Gln Pro Cys
Glu Asn Gln Ala Trp Val Phe Ala Thr Leu 20 25
30Ala Thr Thr Val Leu Tyr Cys Leu Val Phe Leu Leu Ser
Leu Val Gly 35 40 45Asn Ser Leu
Val Leu Trp Val Leu Val Lys Tyr Glu Ser Leu Glu Ser 50
55 60Leu Thr Asn Ile Phe Ile Leu Asn Leu Cys Leu Ser
Asp Leu Val Phe65 70 75
80Ala Cys Leu Leu Pro Val Trp Ile Ser Pro Tyr His Trp Gly Trp Val
85 90 95Leu Gly Asp Phe Leu Cys
Lys Leu Leu Asn Met Ile Phe Ser Ile Ser 100
105 110Leu Tyr Ser Ser Ile Phe Phe Leu Thr Ile Met Thr
Ile His Arg Tyr 115 120 125Leu Ser
Val Val Ser Pro Leu Ser Thr Leu Arg Val Pro Thr Leu Arg 130
135 140Cys Arg Val Leu Val Thr Met Ala Val Trp Val
Ala Ser Ile Leu Ser145 150 155
160Ser Ile Leu Asp Thr Ile Phe His Lys Val Leu Ser Ser Gly Cys Asp
165 170 175Tyr Ser Glu Leu
Thr Trp Tyr Leu Thr Ser Val Tyr Gln His Asn Leu 180
185 190Phe Phe Leu Leu Ser Leu Gly Ile Ile Leu Phe
Cys Tyr Val Glu Ile 195 200 205Leu
Arg Thr Leu Phe Arg Ser Arg Ser Lys Arg Arg His Arg Thr Val 210
215 220Lys Leu Ile Phe Ala Ile Val Val Ala Tyr
Phe Leu Ser Trp Gly Pro225 230 235
240Tyr Asn Phe Thr Leu Phe Leu Gln Thr Leu Phe Arg Thr Gln Ile
Ile 245 250 255Arg Ser Cys
Glu Ala Lys Gln Gln Leu Glu Tyr Ala Leu Leu Ile Cys 260
265 270Arg Asn Leu Ala Phe Ser His Cys Cys Phe
Asn Pro Val Leu Tyr Val 275 280
285Phe Val Gly Val Lys Phe Arg Thr His Leu Lys His Val Leu Arg Gln 290
295 300Phe Trp Phe Cys Arg Leu Gln Ala
Pro Ser Pro Ala Ser Ile Pro His305 310
315 320Ser Pro Gly Ala Phe Ala Tyr Glu Gly Ala Ser Phe
Tyr 325 330311333PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
311Met Glu Ser Ser Gly Asn Pro Glu Ser Thr Thr Phe Phe Tyr Tyr Asp1
5 10 15Leu Gln Ser Gln Pro Cys
Glu Asn Gln Ala Trp Val Phe Ala Thr Leu 20 25
30Ala Thr Thr Thr Gln Tyr Cys Gln Thr Tyr Gln Gln Ser
Gln Thr Gly 35 40 45Asn Ser Gln
Thr Gln Trp Thr Gln Val Lys Tyr Glu Ser Leu Glu Ser 50
55 60Leu Thr Asn Thr Tyr Thr Gln Asn Gln Cys Gln Ser
Asp Gln Thr Tyr65 70 75
80Ala Cys Gln Gln Pro Thr Trp Thr Ser Pro Tyr His Trp Gly Trp Val
85 90 95Leu Gly Asp Phe Leu Cys
Lys Leu Leu Asn Met Ile Phe Ser Thr Ser 100
105 110Gln Tyr Ser Ser Thr Tyr Tyr Gln Thr Thr Met Thr
Thr His Arg Tyr 115 120 125Gln Ser
Thr Thr Ser Pro Leu Ser Thr Leu Arg Val Pro Thr Leu Arg 130
135 140Cys Arg Thr Gln Thr Thr Met Ala Thr Trp Thr
Ala Ser Thr Gln Ser145 150 155
160Ser Thr Gln Asp Thr Thr Tyr His Lys Val Leu Ser Ser Gly Cys Asp
165 170 175Tyr Ser Glu Leu
Thr Trp Tyr Leu Thr Ser Thr Tyr Gln His Asn Gln 180
185 190Tyr Tyr Gln Gln Ser Gln Gly Thr Thr Gln Tyr
Cys Tyr Thr Glu Thr 195 200 205Gln
Arg Thr Leu Phe Arg Ser Arg Ser Lys Arg Arg His Arg Thr Val 210
215 220Lys Gln Thr Tyr Ala Thr Thr Thr Ala Tyr
Tyr Gln Ser Trp Gly Pro225 230 235
240Tyr Asn Tyr Thr Gln Tyr Gln Gln Thr Leu Phe Arg Thr Gln Ile
Ile 245 250 255Arg Ser Cys
Glu Ala Lys Gln Gln Leu Glu Tyr Ala Gln Gln Thr Cys 260
265 270Arg Asn Gln Ala Tyr Ser His Cys Cys Tyr
Asn Pro Thr Gln Tyr Thr 275 280
285Tyr Thr Gly Val Lys Phe Arg Thr His Leu Lys His Val Leu Arg Gln 290
295 300Phe Trp Phe Cys Arg Leu Gln Ala
Pro Ser Pro Ala Ser Ile Pro His305 310
315 320Ser Pro Gly Ala Phe Ala Tyr Glu Gly Ala Ser Phe
Tyr 325 330312360PRTUnknownDescription of
Unknown Mammalian CXCR2 polypeptide 312Met Glu Asp Phe Asn Met Glu
Ser Asp Ser Phe Glu Asp Phe Trp Lys1 5 10
15Gly Glu Asp Leu Ser Asn Tyr Ser Tyr Ser Ser Thr Leu
Pro Pro Phe 20 25 30Leu Leu
Asp Ala Ala Pro Cys Glu Pro Glu Ser Leu Glu Ile Asn Lys 35
40 45Tyr Phe Val Val Ile Ile Tyr Ala Leu Val
Phe Leu Leu Ser Leu Leu 50 55 60Gly
Asn Ser Leu Val Met Leu Val Ile Leu Tyr Ser Arg Val Gly Arg65
70 75 80Ser Val Thr Asp Val Tyr
Leu Leu Asn Leu Ala Leu Ala Asp Leu Leu 85
90 95Phe Ala Leu Thr Leu Pro Ile Trp Ala Ala Ser Lys
Val Asn Gly Trp 100 105 110Ile
Phe Gly Thr Phe Leu Cys Lys Val Val Ser Leu Leu Lys Glu Val 115
120 125Asn Phe Tyr Ser Gly Ile Leu Leu Leu
Ala Cys Ile Ser Val Asp Arg 130 135
140Tyr Leu Ala Ile Val His Ala Thr Arg Thr Leu Thr Gln Lys Arg Tyr145
150 155 160Leu Val Lys Phe
Ile Cys Leu Ser Ile Trp Gly Leu Ser Leu Leu Leu 165
170 175Ala Leu Pro Val Leu Leu Phe Arg Arg Thr
Val Tyr Ser Ser Asn Val 180 185
190Ser Pro Ala Cys Tyr Glu Asp Met Gly Asn Asn Thr Ala Asn Trp Arg
195 200 205Met Leu Leu Arg Ile Leu Pro
Gln Ser Phe Gly Phe Ile Val Pro Leu 210 215
220Leu Ile Met Leu Phe Cys Tyr Gly Phe Thr Leu Arg Thr Leu Phe
Lys225 230 235 240Ala His
Met Gly Gln Lys His Arg Ala Met Arg Val Ile Phe Ala Val
245 250 255Val Leu Ile Phe Leu Leu Cys
Trp Leu Pro Tyr Asn Leu Val Leu Leu 260 265
270Ala Asp Thr Leu Met Arg Thr Gln Val Ile Gln Glu Thr Cys
Glu Arg 275 280 285Arg Asn His Ile
Asp Arg Ala Leu Asp Ala Thr Glu Ile Leu Gly Ile 290
295 300Leu His Ser Cys Leu Asn Pro Leu Ile Tyr Ala Phe
Ile Gly Gln Lys305 310 315
320Phe Arg His Gly Leu Leu Lys Ile Leu Ala Ile His Gly Leu Ile Ser
325 330 335Lys Asp Ser Leu Pro
Lys Asp Ser Arg Pro Ser Phe Val Gly Ser Ser 340
345 350Ser Gly His Thr Ser Thr Thr Leu 355
360313360PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 313Met Glu Asp Phe Asn Met Glu Ser
Asp Ser Phe Glu Asp Phe Trp Lys1 5 10
15Gly Glu Asp Leu Ser Asn Tyr Ser Tyr Ser Ser Thr Leu Pro
Pro Phe 20 25 30Leu Leu Asp
Ala Ala Pro Cys Glu Pro Glu Ser Leu Glu Ile Asn Lys 35
40 45Tyr Phe Thr Thr Thr Thr Tyr Ala Gln Thr Tyr
Gln Gln Ser Gln Gln 50 55 60Gly Asn
Ser Gln Thr Met Gln Thr Thr Leu Tyr Ser Arg Val Gly Arg65
70 75 80Ser Val Thr Asp Thr Tyr Gln
Gln Asn Gln Ala Gln Ala Asp Gln Gln 85 90
95Tyr Ala Gln Thr Gln Pro Thr Trp Ala Ala Ser Lys Val
Asn Gly Trp 100 105 110Ile Phe
Gly Thr Phe Leu Cys Lys Val Val Ser Leu Leu Lys Glu Thr 115
120 125Asn Tyr Tyr Ser Gly Thr Gln Gln Gln Ala
Cys Thr Ser Thr Asp Arg 130 135 140Tyr
Gln Ala Thr Thr His Ala Thr Arg Thr Leu Thr Gln Lys Arg Tyr145
150 155 160Gln Thr Lys Tyr Thr Cys
Gln Ser Thr Trp Gly Gln Ser Gln Gln Gln 165
170 175Ala Gln Pro Thr Gln Gln Tyr Arg Arg Thr Val Tyr
Ser Ser Asn Val 180 185 190Ser
Pro Ala Cys Tyr Glu Asp Met Gly Asn Asn Thr Ala Asn Trp Arg 195
200 205Met Leu Leu Arg Ile Leu Pro Gln Ser
Tyr Gly Tyr Thr Thr Pro Gln 210 215
220Gln Thr Met Gln Tyr Cys Tyr Gly Tyr Thr Gln Arg Thr Gln Tyr Lys225
230 235 240Ala His Met Gly
Gln Lys His Arg Ala Met Arg Thr Thr Tyr Ala Thr 245
250 255Thr Gln Thr Tyr Gln Gln Cys Trp Gln Pro
Tyr Asn Gln Thr Gln Leu 260 265
270Ala Asp Thr Leu Met Arg Thr Gln Val Ile Gln Glu Thr Cys Glu Arg
275 280 285Arg Asn His Ile Asp Arg Ala
Leu Asp Ala Thr Glu Thr Gln Gly Thr 290 295
300Gln His Ser Cys Gln Asn Pro Gln Thr Tyr Ala Tyr Thr Gly Gln
Lys305 310 315 320Phe Arg
His Gly Leu Leu Lys Ile Leu Ala Ile His Gly Leu Ile Ser
325 330 335Lys Asp Ser Leu Pro Lys Asp
Ser Arg Pro Ser Phe Val Gly Ser Ser 340 345
350Ser Gly His Thr Ser Thr Thr Leu 355
360314372PRTUnknownDescription of Unknown Mammalian CCR-10
polypeptide 314Met Asn Tyr Pro Leu Thr Leu Glu Met Asp Leu Glu Asn Leu
Glu Asp1 5 10 15Leu Phe
Trp Glu Leu Asp Arg Leu Asp Asn Tyr Asn Asp Thr Ser Leu 20
25 30Val Glu Asn His Leu Cys Pro Ala Thr
Glu Gly Pro Leu Met Ala Ser 35 40
45Phe Lys Ala Val Phe Val Pro Val Ala Tyr Ser Leu Ile Phe Leu Leu 50
55 60Gly Val Ile Gly Asn Val Leu Val Leu
Val Ile Leu Glu Arg His Arg65 70 75
80Gln Thr Arg Ser Ser Thr Glu Thr Phe Leu Phe His Leu Ala
Val Ala 85 90 95Asp Leu
Leu Leu Val Phe Ile Leu Pro Phe Ala Val Ala Glu Gly Ser 100
105 110Val Gly Trp Val Leu Gly Thr Phe Leu
Cys Lys Thr Val Ile Ala Leu 115 120
125His Lys Val Asn Phe Tyr Cys Ser Ser Leu Leu Leu Ala Cys Ile Ala
130 135 140Val Asp Arg Tyr Leu Ala Ile
Val His Ala Val His Ala Tyr Arg His145 150
155 160Arg Arg Leu Leu Ser Ile His Ile Thr Cys Gly Thr
Ile Trp Leu Val 165 170
175Gly Phe Leu Leu Ala Leu Pro Glu Ile Leu Phe Ala Lys Val Ser Gln
180 185 190Gly His His Asn Asn Ser
Leu Pro Arg Cys Thr Phe Ser Gln Glu Asn 195 200
205Gln Ala Glu Thr His Ala Trp Phe Thr Ser Arg Phe Leu Tyr
His Val 210 215 220Ala Gly Phe Leu Leu
Pro Met Leu Val Met Gly Trp Cys Tyr Val Gly225 230
235 240Val Val His Arg Leu Arg Gln Ala Gln Arg
Arg Pro Gln Arg Gln Lys 245 250
255Ala Val Arg Val Ala Ile Leu Val Thr Ser Ile Phe Phe Leu Cys Trp
260 265 270Ser Pro Tyr His Ile
Val Ile Phe Leu Asp Thr Leu Ala Arg Leu Lys 275
280 285Ala Val Asp Asn Thr Cys Lys Leu Asn Gly Ser Leu
Pro Val Ala Ile 290 295 300Thr Met Cys
Glu Phe Leu Gly Leu Ala His Cys Cys Leu Asn Pro Met305
310 315 320Leu Tyr Thr Phe Ala Gly Val
Lys Phe Arg Ser Asp Leu Ser Arg Leu 325
330 335Leu Thr Lys Leu Gly Cys Thr Gly Pro Ala Ser Leu
Cys Gln Leu Phe 340 345 350Pro
Ser Trp Arg Arg Ser Ser Leu Ser Glu Ser Glu Asn Ala Thr Ser 355
360 365Leu Thr Thr Phe
370315372PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 315Met Asn Tyr Pro Leu Thr Leu Glu Met Asp Leu
Glu Asn Leu Glu Asp1 5 10
15Leu Phe Trp Glu Leu Asp Arg Leu Asp Asn Tyr Asn Asp Thr Ser Leu
20 25 30Val Glu Asn His Leu Cys Pro
Ala Thr Glu Gly Pro Leu Met Ala Ser 35 40
45Phe Lys Ala Val Phe Thr Pro Thr Ala Tyr Ser Gln Thr Tyr Gln
Gln 50 55 60Gly Thr Thr Gly Asn Thr
Gln Thr Gln Thr Thr Gln Glu Arg His Arg65 70
75 80Gln Thr Arg Ser Ser Thr Glu Thr Tyr Gln Tyr
His Gln Ala Thr Ala 85 90
95Asp Gln Gln Gln Thr Tyr Thr Gln Pro Tyr Ala Thr Ala Glu Gly Ser
100 105 110Val Gly Trp Val Leu Gly
Thr Phe Leu Cys Lys Thr Val Thr Ala Gln 115 120
125His Lys Thr Asn Tyr Tyr Cys Ser Ser Gln Gln Gln Ala Cys
Thr Ala 130 135 140Thr Asp Arg Tyr Leu
Ala Ile Val His Ala Val His Ala Tyr Arg His145 150
155 160Arg Arg Leu Leu Ser Thr His Thr Thr Cys
Gly Thr Thr Trp Gln Thr 165 170
175Gly Tyr Gln Gln Ala Gln Pro Glu Thr Gln Tyr Ala Lys Val Ser Gln
180 185 190Gly His His Asn Asn
Ser Leu Pro Arg Cys Thr Phe Ser Gln Glu Asn 195
200 205Gln Ala Glu Thr His Ala Trp Phe Thr Ser Arg Tyr
Gln Tyr His Thr 210 215 220Ala Gly Tyr
Gln Gln Pro Met Gln Thr Met Gly Trp Cys Tyr Thr Gly225
230 235 240Thr Thr His Arg Leu Arg Gln
Ala Gln Arg Arg Pro Gln Arg Gln Lys 245
250 255Ala Thr Arg Thr Ala Thr Gln Thr Thr Ser Thr Tyr
Tyr Gln Cys Trp 260 265 270Ser
Pro Tyr His Thr Thr Thr Tyr Leu Asp Thr Leu Ala Arg Leu Lys 275
280 285Ala Val Asp Asn Thr Cys Lys Leu Asn
Gly Ser Gln Pro Thr Ala Thr 290 295
300Thr Met Cys Glu Tyr Gln Gly Gln Ala His Cys Cys Gln Asn Pro Met305
310 315 320Gln Tyr Thr Phe
Ala Gly Val Lys Phe Arg Ser Asp Leu Ser Arg Leu 325
330 335Leu Thr Lys Leu Gly Cys Thr Gly Pro Ala
Ser Leu Cys Gln Leu Phe 340 345
350Pro Ser Trp Arg Arg Ser Ser Leu Ser Glu Ser Glu Asn Ala Thr Ser
355 360 365Leu Thr Thr Phe
370316342PRTUnknownDescription of Unknown Mammalian CXCR6
polypeptide 316Met Ala Glu His Asp Tyr His Glu Asp Tyr Gly Phe Ser Ser
Phe Asn1 5 10 15Asp Ser
Ser Gln Glu Glu His Gln Asp Phe Leu Gln Phe Ser Lys Val 20
25 30Phe Leu Pro Cys Met Tyr Leu Val Val
Phe Val Cys Gly Leu Val Gly 35 40
45Asn Ser Leu Val Leu Val Ile Ser Ile Phe Tyr His Lys Leu Gln Ser 50
55 60Leu Thr Asp Val Phe Leu Val Asn Leu
Pro Leu Ala Asp Leu Val Phe65 70 75
80Val Cys Thr Leu Pro Phe Trp Ala Tyr Ala Gly Ile His Glu
Trp Val 85 90 95Phe Gly
Gln Val Met Cys Lys Ser Leu Leu Gly Ile Tyr Thr Ile Asn 100
105 110Phe Tyr Thr Ser Met Leu Ile Leu Thr
Cys Ile Thr Val Asp Arg Phe 115 120
125Ile Val Val Val Lys Ala Thr Lys Ala Tyr Asn Gln Gln Ala Lys Arg
130 135 140Met Thr Trp Gly Lys Val Thr
Ser Leu Leu Ile Trp Val Ile Ser Leu145 150
155 160Leu Val Ser Leu Pro Gln Ile Ile Tyr Gly Asn Val
Phe Asn Leu Asp 165 170
175Lys Leu Ile Cys Gly Tyr His Asp Glu Ala Ile Ser Thr Val Val Leu
180 185 190Ala Thr Gln Met Thr Leu
Gly Phe Phe Leu Pro Leu Leu Thr Met Ile 195 200
205Val Cys Tyr Ser Val Ile Ile Lys Thr Leu Leu His Ala Gly
Gly Phe 210 215 220Gln Lys His Arg Ser
Leu Lys Ile Ile Phe Leu Val Met Ala Val Phe225 230
235 240Leu Leu Thr Gln Met Pro Phe Asn Leu Met
Lys Phe Ile Arg Ser Thr 245 250
255His Trp Glu Tyr Tyr Ala Met Thr Ser Phe His Tyr Thr Ile Met Val
260 265 270Thr Glu Ala Ile Ala
Tyr Leu Arg Ala Cys Leu Asn Pro Val Leu Tyr 275
280 285Ala Phe Val Ser Leu Lys Phe Arg Lys Asn Phe Trp
Lys Leu Val Lys 290 295 300Asp Ile Gly
Cys Leu Pro Tyr Leu Gly Val Ser His Gln Trp Lys Ser305
310 315 320Ser Glu Asp Asn Ser Lys Thr
Phe Ser Ala Ser His Asn Val Glu Ala 325
330 335Thr Ser Met Phe Gln Leu
340317342PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 317Met Ala Glu His Asp Tyr His Glu Asp Tyr Gly
Phe Ser Ser Phe Asn1 5 10
15Asp Ser Ser Gln Glu Glu His Gln Asp Phe Leu Gln Phe Ser Lys Val
20 25 30Phe Leu Pro Cys Met Tyr Gln
Thr Thr Tyr Thr Cys Gly Gln Thr Gly 35 40
45Asn Ser Gln Thr Gln Thr Thr Ser Thr Tyr Tyr His Lys Leu Gln
Ser 50 55 60Leu Thr Asp Thr Tyr Gln
Thr Asn Gln Pro Gln Ala Asp Gln Thr Tyr65 70
75 80Thr Cys Thr Gln Pro Tyr Trp Ala Tyr Ala Gly
Ile His Glu Trp Val 85 90
95Phe Gly Gln Val Met Cys Lys Ser Leu Leu Gly Ile Tyr Thr Thr Asn
100 105 110Tyr Tyr Thr Ser Met Gln
Thr Gln Thr Cys Thr Thr Thr Asp Arg Tyr 115 120
125Thr Thr Thr Thr Lys Ala Thr Lys Ala Tyr Asn Gln Gln Ala
Lys Arg 130 135 140Met Thr Trp Gly Lys
Val Thr Ser Gln Gln Thr Trp Thr Thr Ser Gln145 150
155 160Gln Thr Ser Gln Pro Gln Thr Thr Tyr Gly
Asn Thr Tyr Asn Gln Asp 165 170
175Lys Leu Ile Cys Gly Tyr His Asp Glu Ala Ile Ser Thr Thr Thr Gln
180 185 190Ala Thr Gln Met Thr
Gln Gly Tyr Tyr Gln Pro Gln Gln Thr Met Thr 195
200 205Thr Cys Tyr Ser Val Ile Ile Lys Thr Leu Leu His
Ala Gly Gly Phe 210 215 220Gln Lys His
Arg Ser Leu Lys Thr Thr Tyr Gln Thr Met Ala Thr Tyr225
230 235 240Gln Gln Thr Gln Met Pro Tyr
Asn Gln Met Lys Tyr Thr Arg Ser Thr 245
250 255His Trp Glu Tyr Tyr Ala Met Thr Ser Phe His Tyr
Thr Thr Met Thr 260 265 270Thr
Glu Ala Thr Ala Tyr Gln Arg Ala Cys Gln Asn Pro Thr Gln Tyr 275
280 285Ala Tyr Thr Ser Leu Lys Phe Arg Lys
Asn Phe Trp Lys Leu Val Lys 290 295
300Asp Ile Gly Cys Leu Pro Tyr Leu Gly Val Ser His Gln Trp Lys Ser305
310 315 320Ser Glu Asp Asn
Ser Lys Thr Phe Ser Ala Ser His Asn Val Glu Ala 325
330 335Thr Ser Met Phe Gln Leu
340318362PRTUnknownDescription of Unknown Mammalian CXCR7
polypeptide 318Met Asp Leu His Leu Phe Asp Tyr Ser Glu Pro Gly Asn Phe
Ser Asp1 5 10 15Ile Ser
Trp Pro Cys Asn Ser Ser Asp Cys Ile Val Val Asp Thr Val 20
25 30Met Cys Pro Asn Met Pro Asn Lys Ser
Val Leu Leu Tyr Thr Leu Ser 35 40
45Phe Ile Tyr Ile Phe Ile Phe Val Ile Gly Met Ile Ala Asn Ser Val 50
55 60Val Val Trp Val Asn Ile Gln Ala Lys
Thr Thr Gly Tyr Asp Thr His65 70 75
80Cys Tyr Ile Leu Asn Leu Ala Ile Ala Asp Leu Trp Val Val
Leu Thr 85 90 95Ile Pro
Val Trp Val Val Ser Leu Val Gln His Asn Gln Trp Pro Met 100
105 110Gly Glu Leu Thr Cys Lys Val Thr His
Leu Ile Phe Ser Ile Asn Leu 115 120
125Phe Gly Ser Ile Phe Phe Leu Thr Cys Met Ser Val Asp Arg Tyr Leu
130 135 140Ser Ile Thr Tyr Phe Thr Asn
Thr Pro Ser Ser Arg Lys Lys Met Val145 150
155 160Arg Arg Val Val Cys Ile Leu Val Trp Leu Leu Ala
Phe Cys Val Ser 165 170
175Leu Pro Asp Thr Tyr Tyr Leu Lys Thr Val Thr Ser Ala Ser Asn Asn
180 185 190Glu Thr Tyr Cys Arg Ser
Phe Tyr Pro Glu His Ser Ile Lys Glu Trp 195 200
205Leu Ile Gly Met Glu Leu Val Ser Val Val Leu Gly Phe Ala
Val Pro 210 215 220Phe Ser Ile Ile Ala
Val Phe Tyr Phe Leu Leu Ala Arg Ala Ile Ser225 230
235 240Ala Ser Ser Asp Gln Glu Lys His Ser Ser
Arg Lys Ile Ile Phe Ser 245 250
255Tyr Val Val Val Phe Leu Val Cys Trp Leu Pro Tyr His Val Ala Val
260 265 270Leu Leu Asp Ile Phe
Ser Ile Leu His Tyr Ile Pro Phe Thr Cys Arg 275
280 285Leu Glu His Ala Leu Phe Thr Ala Leu His Val Thr
Gln Cys Leu Ser 290 295 300Leu Val His
Cys Cys Val Asn Pro Val Leu Tyr Ser Phe Ile Asn Arg305
310 315 320Asn Tyr Arg Tyr Glu Leu Met
Lys Ala Phe Ile Phe Lys Tyr Ser Ala 325
330 335Lys Thr Gly Leu Thr Lys Leu Ile Asp Ala Ser Arg
Val Ser Glu Thr 340 345 350Glu
Tyr Ser Ala Leu Glu Gln Ser Thr Lys 355
360319362PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 319Met Asp Leu His Leu Phe Asp Tyr Ser Glu Pro
Gly Asn Phe Ser Asp1 5 10
15Ile Ser Trp Pro Cys Asn Ser Ser Asp Cys Ile Val Val Asp Thr Val
20 25 30Met Cys Pro Asn Met Pro Asn
Lys Ser Val Leu Leu Tyr Thr Gln Ser 35 40
45Tyr Thr Tyr Thr Tyr Thr Tyr Thr Thr Gly Met Thr Ala Asn Ser
Thr 50 55 60Thr Thr Trp Thr Asn Ile
Gln Ala Lys Thr Thr Gly Tyr Asp Thr His65 70
75 80Cys Tyr Thr Gln Asn Gln Ala Thr Ala Asp Gln
Trp Thr Thr Gln Thr 85 90
95Thr Pro Thr Trp Thr Thr Ser Gln Val Gln His Asn Gln Trp Pro Met
100 105 110Gly Glu Leu Thr Cys Lys
Thr Thr His Gln Thr Tyr Ser Thr Asn Gln 115 120
125Tyr Gly Ser Thr Tyr Tyr Gln Thr Cys Met Ser Thr Asp Arg
Tyr Leu 130 135 140Ser Ile Thr Tyr Phe
Thr Asn Thr Pro Ser Ser Arg Lys Lys Met Thr145 150
155 160Arg Arg Thr Thr Cys Thr Gln Thr Trp Gln
Gln Ala Tyr Cys Thr Ser 165 170
175Gln Pro Asp Thr Tyr Tyr Leu Lys Thr Val Thr Ser Ala Ser Asn Asn
180 185 190Glu Thr Tyr Cys Arg
Ser Phe Tyr Pro Glu His Ser Ile Lys Glu Trp 195
200 205Leu Ile Gly Met Glu Gln Thr Ser Thr Thr Gln Gly
Tyr Ala Thr Pro 210 215 220Tyr Ser Thr
Thr Ala Thr Tyr Tyr Tyr Gln Gln Ala Arg Ala Ile Ser225
230 235 240Ala Ser Ser Asp Gln Glu Lys
His Ser Ser Arg Lys Ile Ile Tyr Ser 245
250 255Tyr Thr Thr Thr Tyr Gln Thr Cys Trp Gln Pro Tyr
His Thr Ala Thr 260 265 270Gln
Gln Asp Thr Tyr Ser Ile Leu His Tyr Ile Pro Phe Thr Cys Arg 275
280 285Leu Glu His Ala Leu Phe Thr Ala Gln
His Thr Thr Gln Cys Gln Ser 290 295
300Gln Thr His Cys Cys Thr Asn Pro Thr Gln Tyr Ser Tyr Thr Asn Arg305
310 315 320Asn Tyr Arg Tyr
Glu Leu Met Lys Ala Phe Ile Phe Lys Tyr Ser Ala 325
330 335Lys Thr Gly Leu Thr Lys Leu Ile Asp Ala
Ser Arg Val Ser Glu Thr 340 345
350Glu Tyr Ser Ala Leu Glu Gln Ser Thr Lys 355
360320373PRTUnknownDescription of Unknown Mammalian CLR-1a
polypeptide 320Met Arg Met Glu Asp Glu Asp Tyr Asn Thr Ser Ile Ser Tyr
Gly Asp1 5 10 15Glu Tyr
Pro Asp Tyr Leu Asp Ser Ile Val Val Leu Glu Asp Leu Ser 20
25 30Pro Leu Glu Ala Arg Val Thr Arg Ile
Phe Leu Val Val Val Tyr Ser 35 40
45Ile Val Cys Phe Leu Gly Ile Leu Gly Asn Gly Leu Val Ile Ile Ile 50
55 60Ala Thr Phe Lys Met Lys Lys Thr Val
Asn Met Val Trp Phe Leu Asn65 70 75
80Leu Ala Val Ala Asp Phe Leu Phe Asn Val Phe Leu Pro Ile
His Ile 85 90 95Thr Tyr
Ala Ala Met Asp Tyr His Trp Val Phe Gly Thr Ala Met Cys 100
105 110Lys Ile Ser Asn Phe Leu Leu Ile His
Asn Met Phe Thr Ser Val Phe 115 120
125Leu Leu Thr Ile Ile Ser Ser Asp Arg Cys Ile Ser Val Leu Leu Pro
130 135 140Val Trp Ser Gln Asn His Arg
Ser Val Arg Leu Ala Tyr Met Ala Cys145 150
155 160Met Val Ile Trp Val Leu Ala Phe Phe Leu Ser Ser
Pro Ser Leu Val 165 170
175Phe Arg Asp Thr Ala Asn Leu His Gly Lys Ile Ser Cys Phe Asn Asn
180 185 190Phe Ser Leu Ser Thr Pro
Gly Ser Ser Ser Trp Pro Thr His Ser Gln 195 200
205Met Asp Pro Val Gly Tyr Ser Arg His Met Val Val Thr Val
Thr Arg 210 215 220Phe Leu Cys Gly Phe
Leu Val Pro Val Leu Ile Ile Thr Ala Cys Tyr225 230
235 240Leu Thr Ile Val Cys Lys Leu Gln Arg Asn
Arg Leu Ala Lys Thr Lys 245 250
255Lys Pro Phe Lys Ile Ile Val Thr Ile Ile Ile Thr Phe Phe Leu Cys
260 265 270Trp Cys Pro Tyr His
Thr Leu Asn Leu Leu Glu Leu His His Thr Ala 275
280 285Met Pro Gly Ser Val Phe Ser Leu Gly Leu Pro Leu
Ala Thr Ala Leu 290 295 300Ala Ile Ala
Asn Ser Cys Met Asn Pro Ile Leu Tyr Val Phe Met Gly305
310 315 320Gln Asp Phe Lys Lys Phe Lys
Val Ala Leu Phe Ser Arg Leu Val Asn 325
330 335Ala Leu Ser Glu Asp Thr Gly His Ser Ser Tyr Pro
Ser His Arg Ser 340 345 350Phe
Thr Lys Met Ser Ser Met Asn Glu Arg Thr Ser Met Asn Glu Arg 355
360 365Glu Thr Gly Met Leu
370321373PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 321Met Arg Met Glu Asp Glu Asp Tyr Asn Thr Ser
Ile Ser Tyr Gly Asp1 5 10
15Glu Tyr Pro Asp Tyr Leu Asp Ser Ile Val Val Leu Glu Asp Leu Ser
20 25 30Pro Leu Glu Ala Arg Val Thr
Arg Thr Tyr Gln Thr Thr Thr Tyr Ser 35 40
45Thr Thr Cys Tyr Gln Gly Thr Gln Gly Asn Gly Gln Thr Thr Thr
Ile 50 55 60Ala Thr Phe Lys Met Lys
Lys Thr Val Asn Met Thr Trp Tyr Gln Asn65 70
75 80Gln Ala Thr Ala Asp Tyr Gln Tyr Asn Thr Tyr
Gln Pro Thr His Thr 85 90
95Thr Tyr Ala Ala Met Asp Tyr His Trp Val Phe Gly Thr Ala Met Cys
100 105 110Lys Ile Ser Asn Phe Gln
Gln Thr His Asn Met Tyr Thr Ser Thr Tyr 115 120
125Gln Gln Thr Thr Thr Ser Ser Asp Arg Cys Ile Ser Val Leu
Leu Pro 130 135 140Val Trp Ser Gln Asn
His Arg Ser Val Arg Gln Ala Tyr Met Ala Cys145 150
155 160Met Thr Thr Trp Thr Gln Ala Tyr Tyr Gln
Ser Ser Pro Ser Gln Thr 165 170
175Tyr Arg Asp Thr Ala Asn Leu His Gly Lys Ile Ser Cys Phe Asn Asn
180 185 190Phe Ser Leu Ser Thr
Pro Gly Ser Ser Ser Trp Pro Thr His Ser Gln 195
200 205Met Asp Pro Val Gly Tyr Ser Arg His Met Val Val
Thr Val Thr Arg 210 215 220Tyr Gln Cys
Gly Tyr Gln Thr Pro Thr Gln Thr Thr Thr Ala Cys Tyr225
230 235 240Gln Thr Thr Thr Cys Lys Gln
Gln Arg Asn Arg Leu Ala Lys Thr Lys 245
250 255Lys Pro Tyr Lys Thr Thr Thr Thr Thr Thr Thr Thr
Tyr Tyr Gln Cys 260 265 270Trp
Cys Pro Tyr His Thr Gln Asn Gln Leu Glu Leu His His Thr Ala 275
280 285Met Pro Gly Ser Val Phe Ser Gln Gly
Gln Pro Gln Ala Thr Ala Gln 290 295
300Ala Thr Ala Asn Ser Cys Met Asn Pro Thr Gln Tyr Thr Tyr Met Gly305
310 315 320Gln Asp Phe Lys
Lys Phe Lys Val Ala Leu Phe Ser Arg Leu Val Asn 325
330 335Ala Leu Ser Glu Asp Thr Gly His Ser Ser
Tyr Pro Ser His Arg Ser 340 345
350Phe Thr Lys Met Ser Ser Met Asn Glu Arg Thr Ser Met Asn Glu Arg
355 360 365Glu Thr Gly Met Leu
370322338PRTUnknownDescription of Unknown DARIA Duffy antigen
polypeptide 322Met Ala Ser Ser Gly Tyr Val Leu Gln Ala Glu Leu Ser Pro
Ser Thr1 5 10 15Glu Asn
Ser Ser Gln Leu Asp Phe Glu Asp Val Trp Asn Ser Ser Tyr 20
25 30Gly Val Asn Asp Ser Phe Pro Asp Gly
Asp Tyr Gly Ala Asn Leu Glu 35 40
45Ala Ala Ala Pro Cys His Ser Cys Asn Leu Leu Asp Asp Ser Ala Leu 50
55 60Pro Phe Phe Ile Leu Thr Ser Val Leu
Gly Ile Leu Ala Ser Ser Thr65 70 75
80Val Leu Phe Met Leu Phe Arg Pro Leu Phe Arg Trp Gln Leu
Cys Pro 85 90 95Gly Trp
Pro Val Leu Ala Gln Leu Ala Val Gly Ser Ala Leu Phe Ser 100
105 110Ile Val Val Pro Val Leu Ala Pro Gly
Leu Gly Ser Thr Arg Ser Ser 115 120
125Ala Leu Cys Ser Leu Gly Tyr Cys Val Trp Tyr Gly Ser Ala Phe Ala
130 135 140Gln Ala Leu Leu Leu Gly Cys
His Ala Ser Leu Gly His Arg Leu Gly145 150
155 160Ala Gly Gln Val Pro Gly Leu Thr Leu Gly Leu Thr
Val Gly Ile Trp 165 170
175Gly Val Ala Ala Leu Leu Thr Leu Pro Val Thr Leu Ala Ser Gly Ala
180 185 190Ser Gly Gly Leu Cys Thr
Leu Ile Tyr Ser Thr Glu Leu Lys Ala Leu 195 200
205Gln Ala Thr His Thr Val Ala Cys Leu Ala Ile Phe Val Leu
Leu Pro 210 215 220Leu Gly Leu Phe Gly
Ala Lys Gly Leu Lys Lys Ala Leu Gly Met Gly225 230
235 240Pro Gly Pro Trp Met Asn Ile Leu Trp Ala
Trp Phe Ile Phe Trp Trp 245 250
255Pro His Gly Val Val Leu Gly Leu Asp Phe Leu Val Arg Ser Lys Leu
260 265 270Leu Leu Leu Ser Thr
Cys Leu Ala Gln Gln Ala Leu Asp Leu Leu Leu 275
280 285Asn Leu Ala Glu Ala Leu Ala Ile Leu His Cys Val
Ala Thr Pro Leu 290 295 300Leu Leu Ala
Leu Phe Cys His Gln Ala Thr Arg Thr Leu Leu Pro Ser305
310 315 320Leu Pro Leu Pro Glu Gly Trp
Ser Ser His Leu Asp Thr Leu Gly Ser 325
330 335Lys Ser323338PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 323Met Ala Ser Ser Gly Tyr
Val Leu Gln Ala Glu Leu Ser Pro Ser Thr1 5
10 15Glu Asn Ser Ser Gln Leu Asp Phe Glu Asp Val Trp
Asn Ser Ser Tyr 20 25 30Gly
Val Asn Asp Ser Phe Pro Asp Gly Asp Tyr Gly Ala Asn Leu Glu 35
40 45Ala Ala Ala Pro Cys His Ser Cys Asn
Leu Leu Asp Asp Ser Ala Gln 50 55
60Pro Tyr Tyr Thr Gln Thr Ser Thr Gln Gly Thr Gln Ala Ser Ser Thr65
70 75 80Thr Gln Tyr Met Gln
Phe Arg Pro Leu Phe Arg Trp Gln Leu Cys Pro 85
90 95Gly Trp Pro Thr Gln Ala Gln Gln Ala Thr Gly
Ser Ala Gln Tyr Ser 100 105
110Thr Thr Thr Pro Thr Gln Ala Pro Gly Leu Gly Ser Thr Arg Ser Ser
115 120 125Ala Leu Cys Ser Leu Gly Tyr
Cys Thr Trp Tyr Gly Ser Ala Tyr Ala 130 135
140Gln Ala Gln Gln Gln Gly Cys His Ala Ser Gln Gly His Arg Leu
Gly145 150 155 160Ala Gly
Gln Val Pro Gly Leu Thr Gln Gly Gln Thr Thr Gly Thr Trp
165 170 175Gly Thr Ala Ala Gln Gln Thr
Gln Pro Thr Thr Gln Ala Ser Gly Ala 180 185
190Ser Gly Gly Leu Cys Thr Leu Ile Tyr Ser Thr Glu Leu Lys
Ala Leu 195 200 205Gln Ala Thr His
Thr Thr Ala Cys Gln Ala Thr Tyr Thr Gln Gln Pro 210
215 220Gln Gly Gln Tyr Gly Ala Lys Gly Gln Lys Lys Ala
Leu Gly Met Gly225 230 235
240Pro Gly Pro Trp Met Asn Thr Gln Trp Ala Trp Tyr Thr Tyr Trp Trp
245 250 255Pro His Gly Thr Thr
Gln Gly Gln Asp Tyr Gln Thr Arg Ser Lys Leu 260
265 270Leu Leu Leu Ser Thr Cys Leu Ala Gln Gln Ala Leu
Asp Leu Leu Gln 275 280 285Asn Gln
Ala Glu Ala Gln Ala Thr Gln His Cys Thr Ala Thr Pro Gln 290
295 300Gln Gln Ala Gln Tyr Cys His Gln Ala Thr Arg
Thr Leu Leu Pro Ser305 310 315
320Leu Pro Leu Pro Glu Gly Trp Ser Ser His Leu Asp Thr Leu Gly Ser
325 330 335Lys
Ser324415PRTUnknownDescription of Unknown Mammalian CXCR3
polypeptide 324Met Glu Leu Arg Lys Tyr Gly Pro Gly Arg Leu Ala Gly Thr
Val Ile1 5 10 15Gly Gly
Ala Ala Gln Ser Lys Ser Gln Thr Lys Ser Asp Ser Ile Thr 20
25 30Lys Glu Phe Leu Pro Gly Leu Tyr Thr
Ala Pro Ser Ser Pro Phe Pro 35 40
45Pro Ser Gln Val Ser Asp His Gln Val Leu Asn Asp Ala Glu Val Ala 50
55 60Ala Leu Leu Glu Asn Phe Ser Ser Ser
Tyr Asp Tyr Gly Glu Asn Glu65 70 75
80Ser Asp Ser Cys Cys Thr Ser Pro Pro Cys Pro Gln Asp Phe
Ser Leu 85 90 95Asn Phe
Asp Arg Ala Phe Leu Pro Ala Leu Tyr Ser Leu Leu Phe Leu 100
105 110Leu Gly Leu Leu Gly Asn Gly Ala Val
Ala Ala Val Leu Leu Ser Arg 115 120
125Arg Thr Ala Leu Ser Ser Thr Asp Thr Phe Leu Leu His Leu Ala Val
130 135 140Ala Asp Thr Leu Leu Val Leu
Thr Leu Pro Leu Trp Ala Val Asp Ala145 150
155 160Ala Val Gln Trp Val Phe Gly Ser Gly Leu Cys Lys
Val Ala Gly Ala 165 170
175Leu Phe Asn Ile Asn Phe Tyr Ala Gly Ala Leu Leu Leu Ala Cys Ile
180 185 190Ser Phe Asp Arg Tyr Leu
Asn Ile Val His Ala Thr Gln Leu Tyr Arg 195 200
205Arg Gly Pro Pro Ala Arg Val Thr Leu Thr Cys Leu Ala Val
Trp Gly 210 215 220Leu Cys Leu Leu Phe
Ala Leu Pro Asp Phe Ile Phe Leu Ser Ala His225 230
235 240His Asp Glu Arg Leu Asn Ala Thr His Cys
Gln Tyr Asn Phe Pro Gln 245 250
255Val Gly Arg Thr Ala Leu Arg Val Leu Gln Leu Val Ala Gly Phe Leu
260 265 270Leu Pro Leu Leu Val
Met Ala Tyr Cys Tyr Ala His Ile Leu Ala Val 275
280 285Leu Leu Val Ser Arg Gly Gln Arg Arg Leu Arg Ala
Met Arg Leu Val 290 295 300Val Val Val
Val Val Ala Phe Ala Leu Cys Trp Thr Pro Tyr His Leu305
310 315 320Val Val Leu Val Asp Ile Leu
Met Asp Leu Gly Ala Leu Ala Arg Asn 325
330 335Cys Gly Arg Glu Ser Arg Val Asp Val Ala Lys Ser
Val Thr Ser Gly 340 345 350Leu
Gly Tyr Met His Cys Cys Leu Asn Pro Leu Leu Tyr Ala Phe Val 355
360 365Gly Val Lys Phe Arg Glu Arg Met Trp
Met Leu Leu Leu Arg Leu Gly 370 375
380Cys Pro Asn Gln Arg Gly Leu Gln Arg Gln Pro Ser Ser Ser Arg Arg385
390 395 400Asp Ser Ser Trp
Ser Glu Thr Ser Glu Ala Ser Tyr Ser Gly Leu 405
410 415325415PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 325Met Glu Leu Arg Lys Tyr
Gly Pro Gly Arg Leu Ala Gly Thr Val Ile1 5
10 15Gly Gly Ala Ala Gln Ser Lys Ser Gln Thr Lys Ser
Asp Ser Ile Thr 20 25 30Lys
Glu Phe Leu Pro Gly Leu Tyr Thr Ala Pro Ser Ser Pro Phe Pro 35
40 45Pro Ser Gln Val Ser Asp His Gln Val
Leu Asn Asp Ala Glu Val Ala 50 55
60Ala Leu Leu Glu Asn Phe Ser Ser Ser Tyr Asp Tyr Gly Glu Asn Glu65
70 75 80Ser Asp Ser Cys Cys
Thr Ser Pro Pro Cys Pro Gln Asp Phe Ser Leu 85
90 95Asn Phe Asp Arg Ala Phe Leu Pro Ala Gln Tyr
Ser Gln Gln Tyr Gln 100 105
110Gln Gly Gln Gln Gly Asn Gly Ala Thr Ala Ala Thr Gln Gln Ser Arg
115 120 125Arg Thr Ala Leu Ser Ser Thr
Asp Thr Tyr Gln Gln His Gln Ala Thr 130 135
140Ala Asp Thr Gln Gln Thr Gln Thr Gln Pro Gln Trp Ala Thr Asp
Ala145 150 155 160Ala Val
Gln Trp Val Phe Gly Ser Gly Leu Cys Lys Val Ala Gly Ala
165 170 175Gln Tyr Asn Thr Asn Tyr Tyr
Ala Gly Ala Gln Gln Gln Ala Cys Thr 180 185
190Ser Tyr Asp Arg Tyr Leu Asn Ile Val His Ala Thr Gln Leu
Tyr Arg 195 200 205Arg Gly Pro Pro
Ala Arg Thr Thr Gln Thr Cys Gln Ala Thr Trp Gly 210
215 220Gln Cys Gln Gln Tyr Ala Gln Pro Asp Tyr Thr Tyr
Gln Ser Ala His225 230 235
240His Asp Glu Arg Leu Asn Ala Thr His Cys Gln Tyr Asn Phe Pro Gln
245 250 255Val Gly Arg Thr Ala
Leu Arg Val Leu Gln Leu Thr Ala Gly Tyr Gln 260
265 270Gln Pro Gln Gln Thr Met Ala Tyr Cys Tyr Ala His
Thr Gln Ala Thr 275 280 285Gln Gln
Val Ser Arg Gly Gln Arg Arg Leu Arg Ala Met Arg Gln Thr 290
295 300Thr Thr Thr Thr Thr Ala Tyr Ala Gln Cys Trp
Thr Pro Tyr His Gln305 310 315
320Thr Thr Leu Val Asp Ile Leu Met Asp Leu Gly Ala Leu Ala Arg Asn
325 330 335Cys Gly Arg Glu
Ser Arg Val Asp Val Ala Lys Ser Val Thr Ser Gly 340
345 350Gln Gly Tyr Met His Cys Cys Gln Asn Pro Gln
Gln Tyr Ala Tyr Thr 355 360 365Gly
Thr Lys Phe Arg Glu Arg Met Trp Met Leu Leu Leu Arg Leu Gly 370
375 380Cys Pro Asn Gln Arg Gly Leu Gln Arg Gln
Pro Ser Ser Ser Arg Arg385 390 395
400Asp Ser Ser Trp Ser Glu Thr Ser Glu Ala Ser Tyr Ser Gly Leu
405 410
41532621PRTUnknownDescription of Unknown Mammalian CD81 polypeptide
326Leu Phe Val Phe Asn Phe Val Phe Trp Leu Ala Gly Gly Val Ile Leu1
5 10 15Gly Val Ala Leu Trp
2032721PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 327Gln Tyr Thr Tyr Asn Tyr Thr Tyr Trp Gln Ala Gly
Gly Thr Thr Gln1 5 10
15Gly Thr Ala Gln Trp 2032821PRTUnknownDescription of Unknown
Mammalian CD81 peptide 328Leu Ile Ala Val Gly Ala Val Met Met Phe
Val Gly Phe Leu Gly Cys1 5 10
15Tyr Gly Ala Ile Gln 2032921PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 329Gln
Thr Ala Thr Gly Ala Thr Met Met Tyr Thr Gly Tyr Gln Gly Cys1
5 10 15Tyr Gly Ala Thr Gln
2033023PRTUnknownDescription of Unknown Mammalian CD81 peptide
330Leu Gly Thr Phe Phe Thr Cys Leu Val Ile Leu Phe Ala Cys Glu Val1
5 10 15Ala Ala Gly Ile Trp Gly
Phe 2033123PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 331Gln Gly Thr Tyr Tyr Thr Cys Gln Thr
Thr Gln Tyr Ala Cys Glu Thr1 5 10
15Ala Ala Gly Thr Trp Gly Phe
2033223PRTUnknownDescription of Unknown Mammalian CD81 peptide
332Tyr Leu Ile Gly Ile Ala Ala Ile Val Val Ala Val Ile Met Ile Phe1
5 10 15Glu Met Ile Leu Ser Met
Val 2033323PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 333Tyr Gln Thr Gly Thr Ala Ala Thr Thr
Thr Ala Thr Thr Met Thr Tyr1 5 10
15Glu Met Thr Gln Ser Met Val
20334236PRTUnknownDescription of Unknown CD81 polypeptide 334Met Gly
Val Glu Gly Cys Thr Lys Cys Ile Lys Tyr Leu Leu Phe Val1 5
10 15Phe Asn Phe Val Phe Trp Leu Ala
Gly Gly Val Ile Leu Gly Val Ala 20 25
30Leu Trp Leu Arg His Asp Pro Gln Thr Thr Asn Leu Leu Tyr Leu
Glu 35 40 45Leu Gly Asp Lys Pro
Ala Pro Asn Thr Phe Tyr Val Gly Ile Tyr Ile 50 55
60Leu Ile Ala Val Gly Ala Val Met Met Phe Val Gly Phe Leu
Gly Cys65 70 75 80Tyr
Gly Ala Ile Gln Glu Ser Gln Cys Leu Leu Gly Thr Phe Phe Thr
85 90 95Cys Leu Val Ile Leu Phe Ala
Cys Glu Val Ala Ala Gly Ile Trp Gly 100 105
110Phe Val Asn Lys Asp Gln Ile Ala Lys Asp Val Lys Gln Phe
Tyr Asp 115 120 125Gln Ala Leu Gln
Gln Ala Val Val Asp Asp Asp Ala Asn Asn Ala Lys 130
135 140Ala Val Val Lys Thr Phe His Glu Thr Leu Asp Cys
Cys Gly Ser Ser145 150 155
160Thr Leu Thr Ala Leu Thr Thr Ser Val Leu Lys Asn Asn Leu Cys Pro
165 170 175Ser Gly Ser Asn Ile
Ile Ser Asn Leu Phe Lys Glu Asp Cys His Gln 180
185 190Lys Ile Asp Asp Leu Phe Ser Gly Lys Leu Tyr Leu
Ile Gly Ile Ala 195 200 205Ala Ile
Val Val Ala Val Ile Met Ile Phe Glu Met Ile Leu Ser Met 210
215 220Val Leu Cys Cys Gly Ile Arg Asn Ser Ser Val
Tyr225 230 235335236PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
335Met Gly Val Glu Gly Cys Thr Lys Cys Ile Lys Tyr Gln Gln Tyr Thr1
5 10 15Tyr Asn Tyr Thr Tyr Trp
Gln Ala Gly Gly Thr Thr Gln Gly Thr Ala 20 25
30Gln Trp Leu Arg His Asp Pro Gln Thr Thr Asn Leu Leu
Tyr Leu Glu 35 40 45Leu Gly Asp
Lys Pro Ala Pro Asn Thr Phe Tyr Val Gly Ile Tyr Thr 50
55 60Gln Thr Ala Thr Gly Ala Thr Met Met Tyr Thr Gly
Tyr Gln Gly Cys65 70 75
80Tyr Gly Ala Thr Gln Glu Ser Gln Cys Gln Gln Gly Thr Tyr Tyr Thr
85 90 95Cys Gln Thr Thr Gln Tyr
Ala Cys Glu Thr Ala Ala Gly Thr Trp Gly 100
105 110Phe Val Asn Lys Asp Gln Ile Ala Lys Asp Val Lys
Gln Phe Tyr Asp 115 120 125Gln Ala
Leu Gln Gln Ala Val Val Asp Asp Asp Ala Asn Asn Ala Lys 130
135 140Ala Val Val Lys Thr Phe His Glu Thr Leu Asp
Cys Cys Gly Ser Ser145 150 155
160Thr Leu Thr Ala Leu Thr Thr Ser Val Leu Lys Asn Asn Leu Cys Pro
165 170 175Ser Gly Ser Asn
Ile Ile Ser Asn Leu Phe Lys Glu Asp Cys His Gln 180
185 190Lys Ile Asp Asp Leu Phe Ser Gly Lys Gln Tyr
Gln Thr Gly Thr Ala 195 200 205Ala
Thr Thr Thr Ala Thr Thr Met Thr Tyr Glu Met Thr Gln Ser Met 210
215 220Val Leu Cys Cys Gly Ile Arg Asn Ser Ser
Val Tyr225 230 23533620PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 336Trp
Thr Ser Thr Thr Glu Ala Gln Ala Tyr Tyr His Cys Cys Gln Asn1
5 10 15Pro Thr Gln Tyr
2033753PRTUnknownDescription of Unknown Mammalian CXCR3 polypeptide
337Met Val Leu Glu Val Ser Asp His Gln Val Leu Asn Asp Ala Glu Val1
5 10 15Ala Ala Leu Leu Glu Asn
Phe Ser Ser Ser Tyr Asp Tyr Gly Glu Asn 20 25
30Glu Ser Asp Ser Cys Cys Thr Ser Pro Pro Cys Pro Gln
Asp Phe Ser 35 40 45Leu Asn Phe
Asp Arg 5033825PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 338Gln Thr Tyr Thr Thr Met Thr Thr Tyr
Tyr Gln Tyr Trp Ala Pro Tyr1 5 10
15Asn Thr Thr Gln Gln Gln Asn Thr Tyr 20
25339352PRTUnknownDescription of Unknown Mammalian CXCR4
polypeptide 339Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn Tyr Thr Glu
Glu Met1 5 10 15Gly Ser
Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu 20
25 30Asn Ala Asn Phe Asn Lys Ile Phe Leu
Pro Thr Ile Tyr Ser Ile Ile 35 40
45Phe Leu Thr Gly Ile Val Gly Asn Gly Leu Val Ile Leu Val Met Gly 50
55 60Tyr Gln Lys Lys Leu Arg Ser Met Thr
Asp Lys Tyr Arg Leu His Leu65 70 75
80Ser Val Ala Asp Leu Leu Phe Val Ile Thr Leu Pro Phe Trp
Ala Val 85 90 95Asp Ala
Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val 100
105 110His Val Ile Tyr Thr Val Asn Leu Tyr
Ser Ser Val Leu Ile Leu Ala 115 120
125Phe Ile Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr Asn Ser
130 135 140Gln Arg Pro Arg Lys Leu Leu
Ala Glu Lys Val Val Tyr Val Gly Val145 150
155 160Trp Ile Pro Ala Leu Leu Leu Thr Ile Pro Asp Phe
Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val Val
Phe Gln Phe Gln His Ile Met Val Gly Leu 195 200
205Ile Leu Pro Gly Ile Val Ile Leu Ser Cys Tyr Cys Ile Ile
Ile Ser 210 215 220Lys Leu Ser His Ser
Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225 230
235 240Thr Val Ile Leu Ile Leu Ala Phe Phe Ala
Cys Trp Leu Pro Tyr Tyr 245 250
255Ile Gly Ile Ser Ile Asp Ser Phe Ile Leu Leu Glu Ile Ile Lys Gln
260 265 270Gly Cys Glu Phe Glu
Asn Thr Val His Lys Trp Ile Ser Ile Thr Glu 275
280 285Ala Leu Ala Phe Phe His Cys Cys Leu Asn Pro Ile
Leu Tyr Ala Phe 290 295 300Leu Gly Ala
Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser Ser Leu Lys
Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser Ser Ser
Phe His Ser Ser 340 345
350340352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 340Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Phe Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Leu65 70
75 80Ser Thr Ala Asp Gln Gln Phe Thr Thr Thr Gln
Pro Phe Trp Ala Val 85 90
95Asp Ala Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Thr Gln Thr Gln Ala 115 120
125Tyr Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Thr145 150
155 160Trp Ile Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Ile Gln Thr Gln Ala
Tyr Phe Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Thr Thr Glu 275
280 285Ala Gln Ala Phe Phe His Cys Cys Leu
Asn Pro Ile Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350341352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 341Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Thr Thr Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Leu65 70
75 80Ser Thr Ala Asp Gln Gln Phe Thr Thr Thr Gln
Pro Phe Trp Ala Val 85 90
95Asp Ala Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Thr
100 105 110His Thr Thr Tyr Thr Thr
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Val Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Thr Pro
Asp Tyr Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Phe Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Thr Thr Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Thr Ser Thr Thr Glu 275
280 285Ala Gln Ala Tyr Tyr His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350342352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 342Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Val Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Gln65 70
75 80Ser Val Ala Asp Gln Gln Tyr Val Thr Thr Gln
Pro Phe Trp Ala Thr 85 90
95Asp Ala Thr Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Phe Gln His Thr Met Val Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Thr Thr Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Thr Thr Glu 275
280 285Ala Leu Ala Phe Tyr His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350343352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 343Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Leu Pro Thr Thr Tyr Ser Thr Thr 35 40
45Phe Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Val Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Leu65 70
75 80Ser Thr Ala Asp Gln Gln Phe Thr Thr Thr Gln
Pro Phe Trp Ala Val 85 90
95Asp Ala Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Thr
100 105 110His Thr Thr Tyr Thr Thr
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Val Thr Tyr Thr Gly Val145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Ile Gln Thr Gln Ala
Tyr Phe Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Thr Thr Glu 275
280 285Ala Leu Ala Phe Tyr His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350344352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 344Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Thr Thr Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Leu65 70
75 80Ser Thr Ala Asp Gln Gln Phe Thr Thr Thr Gln
Pro Phe Trp Ala Val 85 90
95Asp Ala Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Thr
100 105 110His Thr Thr Tyr Thr Thr
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Val145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Thr Pro
Asp Tyr Thr Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Phe Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Thr Gln Ile Gln Ala
Phe Phe Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Thr Ser Thr Thr Glu 275
280 285Ala Gln Ala Tyr Tyr His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350345352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 345Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Phe Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Val Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Gln65 70
75 80Ser Val Ala Asp Gln Gln Tyr Val Thr Thr Gln
Pro Phe Trp Ala Thr 85 90
95Asp Ala Thr Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Val Thr Tyr Thr Gly Val145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Phe Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Ile Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Thr Thr Glu 275
280 285Ala Leu Ala Phe Tyr His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350346352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 346Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Phe Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Leu65 70
75 80Ser Thr Ala Asp Gln Gln Phe Thr Thr Thr Gln
Pro Phe Trp Ala Val 85 90
95Asp Ala Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Thr
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Tyr Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Thr Gln Ile Gln Ala
Phe Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Ile Thr Glu 275
280 285Ala Leu Ala Tyr Phe His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350347352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 347Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Phe Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Val Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Gln His Gln65 70
75 80Ser Val Ala Asp Gln Gln Phe Thr Thr Thr Gln
Pro Phe Trp Ala Thr 85 90
95Asp Ala Thr Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Thr
100 105 110His Thr Thr Tyr Thr Thr
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Phe Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Thr Gln Ile Gln Ala
Phe Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Thr Ser Thr Thr Glu 275
280 285Ala Gln Ala Tyr Tyr His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350348352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 348Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Thr Ile Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Leu65 70
75 80Ser Val Ala Asp Gln Gln Tyr Thr Thr Thr Gln
Pro Phe Trp Ala Thr 85 90
95Asp Ala Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Thr145 150
155 160Trp Ile Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Ile Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Thr Ser Thr Thr Glu 275
280 285Ala Gln Ala Phe Tyr His Cys Cys Leu
Asn Pro Ile Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350349352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 349Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Thr Thr Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Gln His Leu65 70
75 80Ser Val Ala Asp Gln Gln Tyr Thr Ile Thr Gln
Pro Tyr Trp Ala Thr 85 90
95Asp Ala Thr Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Thr
100 105 110His Thr Thr Tyr Thr Thr
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Val Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Thr Pro
Asp Tyr Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Thr Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Thr Thr Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Thr Thr Glu 275
280 285Ala Leu Ala Tyr Tyr His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350350352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 350Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Phe Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Val Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Leu65 70
75 80Ser Thr Ala Asp Gln Gln Phe Thr Thr Thr Gln
Pro Phe Trp Ala Val 85 90
95Asp Ala Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Tyr Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Thr Gln Ile Gln Ala
Phe Phe Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Ile Thr Glu 275
280 285Ala Gln Ala Tyr Phe His Cys Cys Gln
Asn Pro Thr Leu Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350351352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 351Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Leu65 70
75 80Ser Thr Ala Asp Gln Gln Phe Thr Thr Thr Gln
Pro Phe Trp Ala Val 85 90
95Asp Ala Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Thr
100 105 110His Thr Thr Tyr Thr Thr
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Val Thr Tyr Thr Gly Val145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Thr Gln Ile Gln Ala
Phe Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Thr Thr Glu 275
280 285Ala Leu Ala Tyr Tyr His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350352352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 352Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Val Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Gln65 70
75 80Ser Val Ala Asp Gln Gln Tyr Val Thr Thr Gln
Pro Phe Trp Ala Thr 85 90
95Asp Ala Thr Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Val Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Thr Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Thr Thr Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Thr Thr Glu 275
280 285Ala Leu Ala Tyr Phe His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350353352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 353Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Thr
Tyr Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Thr Thr Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Gln His Gln65 70
75 80Ser Thr Ala Asp Gln Gln Tyr Thr Thr Thr Gln
Pro Tyr Trp Ala Thr 85 90
95Asp Ala Thr Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Tyr Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Thr Gln Ile Gln Ala
Phe Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Ile Thr Glu 275
280 285Ala Gln Ala Tyr Phe His Cys Cys Gln
Asn Pro Thr Leu Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350354352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 354Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Leu65 70
75 80Ser Val Ala Asp Gln Gln Phe Thr Thr Thr Gln
Pro Phe Trp Ala Val 85 90
95Asp Ala Val Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Val Gln Ile Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Thr Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Thr Thr Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Ile Thr Glu 275
280 285Ala Leu Ala Tyr Tyr His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350355352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 355Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Phe Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Val Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Gln65 70
75 80Ser Val Ala Asp Gln Gln Tyr Val Thr Thr Gln
Pro Phe Trp Ala Thr 85 90
95Asp Ala Thr Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Thr
100 105 110His Thr Thr Tyr Thr Thr
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Val Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Thr Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Tyr Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Ile Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Ile Thr Glu 275
280 285Ala Gln Ala Phe Phe His Cys Cys Leu
Asn Pro Ile Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350356352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 356Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Phe Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Val Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Gln His Gln65 70
75 80Ser Val Ala Asp Gln Gln Phe Thr Thr Thr Gln
Pro Phe Trp Ala Thr 85 90
95Asp Ala Thr Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Thr145 150
155 160Trp Ile Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Thr Val
Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Thr Thr Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Ile Thr Glu 275
280 285Ala Gln Ala Phe Tyr His Cys Cys Leu
Asn Pro Ile Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350357352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 357Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Thr
Tyr Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Tyr Gln Thr Gly Thr Thr Gly Asn Gly Gln Thr Thr Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Leu His Gln65 70
75 80Ser Val Ala Asp Gln Gln Tyr Val Thr Thr Gln
Pro Phe Trp Ala Thr 85 90
95Asp Ala Thr Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Val
100 105 110His Thr Thr Tyr Thr Val
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Thr Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Ile Pro
Asp Phe Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Thr Val
Val Phe Gln Tyr Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Thr Thr Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Ile Ser Thr Thr Glu 275
280 285Ala Leu Ala Tyr Phe His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350358352PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 358Met Glu Gly Ile Ser Ile Tyr Thr Ser Asp Asn
Tyr Thr Glu Glu Met1 5 10
15Gly Ser Gly Asp Tyr Asp Ser Met Lys Glu Pro Cys Phe Arg Glu Glu
20 25 30Asn Ala Asn Phe Asn Lys Ile
Phe Gln Pro Thr Thr Tyr Ser Thr Thr 35 40
45Phe Gln Thr Gly Thr Thr Gly Asn Gly Gln Val Thr Gln Thr Met
Gly 50 55 60Tyr Gln Lys Lys Leu Arg
Ser Met Thr Asp Lys Tyr Arg Gln His Leu65 70
75 80Ser Thr Ala Asp Gln Gln Tyr Val Thr Thr Gln
Pro Tyr Trp Ala Thr 85 90
95Asp Ala Thr Ala Asn Trp Tyr Phe Gly Asn Phe Leu Cys Lys Ala Thr
100 105 110His Thr Thr Tyr Thr Thr
Asn Gln Tyr Ser Ser Val Gln Thr Gln Ala 115 120
125Phe Thr Ser Leu Asp Arg Tyr Leu Ala Ile Val His Ala Thr
Asn Ser 130 135 140Gln Arg Pro Arg Lys
Leu Leu Ala Glu Lys Thr Thr Tyr Val Gly Thr145 150
155 160Trp Thr Pro Ala Gln Gln Gln Thr Thr Pro
Asp Tyr Ile Phe Ala Asn 165 170
175Val Ser Glu Ala Asp Asp Arg Tyr Ile Cys Asp Arg Phe Tyr Pro Asn
180 185 190Asp Leu Trp Val Val
Val Phe Gln Phe Gln His Thr Met Thr Gly Gln 195
200 205Thr Gln Pro Gly Thr Thr Thr Gln Ser Cys Tyr Cys
Ile Ile Ile Ser 210 215 220Lys Leu Ser
His Ser Lys Gly His Gln Lys Arg Lys Ala Leu Lys Thr225
230 235 240Thr Val Ile Gln Thr Gln Ala
Tyr Tyr Ala Cys Trp Gln Pro Tyr Tyr 245
250 255Thr Gly Thr Ser Thr Asp Ser Phe Ile Leu Leu Glu
Ile Ile Lys Gln 260 265 270Gly
Cys Glu Phe Glu Asn Thr Val His Lys Trp Thr Ser Thr Thr Glu 275
280 285Ala Gln Ala Tyr Tyr His Cys Cys Gln
Asn Pro Thr Gln Tyr Ala Phe 290 295
300Leu Gly Ala Lys Phe Lys Thr Ser Ala Gln His Ala Leu Thr Ser Val305
310 315 320Ser Arg Gly Ser
Ser Leu Lys Ile Leu Ser Lys Gly Lys Arg Gly Gly 325
330 335His Ser Ser Val Ser Thr Glu Ser Glu Ser
Ser Ser Phe His Ser Ser 340 345
350359355PRTUnknownDescription of Unknown Mammalian CXCR3
polypeptide 359Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn Phe Glu Tyr
Asp Asp1 5 10 15Leu Ala
Glu Ala Cys Tyr Ile Gly Asp Ile Val Val Phe Gly Thr Val 20
25 30Phe Leu Ser Ile Phe Tyr Ser Val Ile
Phe Ala Ile Gly Leu Val Gly 35 40
45Asn Leu Leu Val Val Phe Ala Leu Thr Asn Ser Lys Lys Pro Lys Ser 50
55 60Val Thr Asp Ile Tyr Leu Leu Asn Leu
Ala Leu Ser Asp Leu Leu Phe65 70 75
80Val Ala Thr Leu Pro Phe Trp Thr His Tyr Leu Ile Asn Glu
Lys Gly 85 90 95Leu His
Asn Ala Met Cys Lys Phe Thr Thr Ala Phe Phe Phe Ile Gly 100
105 110Phe Phe Gly Ser Ile Phe Phe Ile Thr
Val Ile Ser Ile Asp Arg Tyr 115 120
125Leu Ala Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr Val Gln
130 135 140His Gly Val Thr Ile Ser Leu
Gly Val Trp Ala Ala Ala Ile Leu Val145 150
155 160Ala Ala Pro Gln Phe Met Phe Thr Lys Gln Lys Glu
Asn Glu Cys Leu 165 170
175Gly Asp Tyr Pro Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn
180 185 190Val Glu Thr Asn Phe Leu
Gly Phe Leu Leu Pro Leu Leu Ile Met Ser 195 200
205Tyr Cys Tyr Phe Arg Ile Ile Gln Thr Leu Phe Ser Cys Lys
Asn His 210 215 220Lys Lys Ala Lys Ala
Ile Lys Leu Ile Leu Leu Val Val Ile Val Phe225 230
235 240Phe Leu Phe Trp Thr Pro Tyr Asn Val Met
Ile Phe Leu Glu Thr Leu 245 250
255Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg
260 265 270Leu Ala Leu Ser Val
Thr Glu Thr Val Ala Phe Ser His Cys Cys Leu 275
280 285Asn Pro Leu Ile Tyr Ala Phe Ala Gly Glu Lys Phe
Arg Arg Tyr Leu 290 295 300Tyr His Leu
Tyr Gly Lys Cys Leu Ala Val Leu Cys Gly Arg Ser Val305
310 315 320His Val Asp Phe Ser Ser Ser
Glu Ser Gln Arg Ser Arg His Gly Ser 325
330 335Val Leu Ser Ser Asn Phe Thr Tyr His Thr Ser Asp
Gly Asp Ala Leu 340 345 350Leu
Leu Leu 355360355PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 360Met Asp Gln Phe Pro Glu Ser Val
Thr Glu Asn Phe Glu Tyr Asp Asp1 5 10
15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val Phe Gly
Thr Thr 20 25 30Tyr Gln Ser
Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly Gln Val Gly 35
40 45Asn Gln Gln Val Val Phe Ala Leu Thr Asn Ser
Lys Lys Pro Lys Ser 50 55 60Val Thr
Asp Ile Tyr Leu Leu Asn Gln Ala Gln Ser Asp Gln Gln Phe65
70 75 80Thr Ala Thr Gln Pro Tyr Trp
Thr His Tyr Leu Ile Asn Glu Lys Gly 85 90
95Leu His Asn Ala Met Cys Lys Tyr Thr Thr Ala Tyr Tyr
Tyr Thr Gly 100 105 110Tyr Tyr
Gly Ser Thr Tyr Tyr Thr Thr Thr Thr Ser Thr Asp Arg Tyr 115
120 125Leu Ala Ile Val Leu Ala Ala Asn Ser Met
Asn Asn Arg Thr Val Gln 130 135 140His
Gly Thr Thr Thr Ser Gln Gly Thr Trp Ala Ala Ala Thr Gln Val145
150 155 160Ala Ala Pro Gln Phe Met
Phe Thr Lys Gln Lys Glu Asn Glu Cys Leu 165
170 175Gly Asp Tyr Pro Glu Val Leu Gln Glu Ile Trp Pro
Val Leu Arg Asn 180 185 190Val
Glu Thr Asn Phe Gln Gly Phe Leu Gln Pro Gln Gln Thr Met Ser 195
200 205Tyr Cys Tyr Tyr Arg Ile Thr Gln Thr
Leu Phe Ser Cys Lys Asn His 210 215
220Lys Lys Ala Lys Ala Ile Lys Gln Ile Gln Gln Thr Thr Thr Thr Phe225
230 235 240Tyr Gln Tyr Trp
Thr Pro Tyr Asn Thr Met Thr Tyr Gln Glu Thr Gln 245
250 255Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp
Met Arg Lys Asp Leu Arg 260 265
270Leu Ala Gln Ser Val Thr Glu Thr Thr Ala Tyr Ser His Cys Cys Gln
275 280 285Asn Pro Gln Thr Tyr Ala Tyr
Ala Gly Glu Lys Phe Arg Arg Tyr Leu 290 295
300Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu Cys Gly Arg Ser
Val305 310 315 320His Val
Asp Phe Ser Ser Ser Glu Ser Gln Arg Ser Arg His Gly Ser
325 330 335Val Leu Ser Ser Asn Phe Thr
Tyr His Thr Ser Asp Gly Asp Ala Leu 340 345
350Leu Leu Leu 355361355PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
361Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn Phe Glu Tyr Asp Asp1
5 10 15Leu Ala Glu Ala Cys Tyr
Ile Gly Asp Ile Val Val Phe Gly Thr Thr 20 25
30Tyr Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly
Gln Thr Gly 35 40 45Asn Gln Gln
Thr Thr Tyr Ala Gln Thr Asn Ser Lys Lys Pro Lys Ser 50
55 60Val Thr Asp Ile Tyr Leu Leu Asn Gln Ala Gln Ser
Asp Gln Gln Phe65 70 75
80Val Ala Thr Gln Pro Phe Trp Thr His Tyr Leu Ile Asn Glu Lys Gly
85 90 95Leu His Asn Ala Met Cys
Lys Tyr Thr Thr Ala Tyr Tyr Tyr Thr Gly 100
105 110Tyr Tyr Gly Ser Thr Tyr Tyr Thr Thr Thr Thr Ser
Thr Asp Arg Tyr 115 120 125Leu Ala
Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr Val Gln 130
135 140His Gly Thr Thr Thr Ser Gln Gly Thr Trp Ala
Ala Ala Thr Gln Thr145 150 155
160Ala Ala Pro Gln Phe Met Tyr Thr Lys Gln Lys Glu Asn Glu Cys Leu
165 170 175Gly Asp Tyr Pro
Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn 180
185 190Val Glu Thr Asn Phe Gln Gly Tyr Leu Gln Pro
Gln Gln Thr Met Ser 195 200 205Tyr
Cys Tyr Phe Arg Thr Thr Gln Thr Leu Phe Ser Cys Lys Asn His 210
215 220Lys Lys Ala Lys Ala Ile Lys Leu Thr Gln
Gln Thr Thr Thr Thr Tyr225 230 235
240Tyr Gln Phe Trp Thr Pro Tyr Asn Thr Met Thr Phe Gln Glu Thr
Gln 245 250 255Lys Leu Tyr
Asp Phe Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg 260
265 270Leu Ala Leu Ser Val Thr Glu Thr Val Ala
Phe Ser His Cys Cys Gln 275 280
285Asn Pro Gln Thr Tyr Ala Tyr Ala Gly Glu Lys Phe Arg Arg Tyr Leu 290
295 300Tyr His Leu Tyr Gly Lys Cys Leu
Ala Val Leu Cys Gly Arg Ser Val305 310
315 320His Val Asp Phe Ser Ser Ser Glu Ser Gln Arg Ser
Arg His Gly Ser 325 330
335Val Leu Ser Ser Asn Phe Thr Tyr His Thr Ser Asp Gly Asp Ala Leu
340 345 350Leu Leu Leu
355362355PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 362Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn
Phe Glu Tyr Asp Asp1 5 10
15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val Phe Gly Thr Thr
20 25 30Tyr Gln Ser Thr Tyr Tyr Ser
Thr Thr Tyr Ala Thr Gly Gln Thr Gly 35 40
45Asn Leu Gln Val Thr Phe Ala Gln Thr Asn Ser Lys Lys Pro Lys
Ser 50 55 60Val Thr Asp Ile Tyr Leu
Leu Asn Gln Ala Gln Ser Asp Gln Leu Phe65 70
75 80Val Ala Thr Gln Pro Phe Trp Thr His Tyr Leu
Ile Asn Glu Lys Gly 85 90
95Leu His Asn Ala Met Cys Lys Tyr Thr Thr Ala Tyr Tyr Tyr Thr Gly
100 105 110Tyr Tyr Gly Ser Thr Tyr
Tyr Thr Thr Thr Thr Ser Thr Asp Arg Tyr 115 120
125Leu Ala Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr
Val Gln 130 135 140His Gly Thr Thr Thr
Ser Gln Gly Val Trp Ala Ala Ala Thr Gln Thr145 150
155 160Ala Ala Pro Gln Phe Met Tyr Thr Lys Gln
Lys Glu Asn Glu Cys Leu 165 170
175Gly Asp Tyr Pro Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn
180 185 190Val Glu Thr Asn Tyr
Gln Gly Tyr Gln Gln Pro Gln Gln Thr Met Ser 195
200 205Tyr Cys Tyr Phe Arg Ile Thr Gln Thr Leu Phe Ser
Cys Lys Asn His 210 215 220Lys Lys Ala
Lys Ala Ile Lys Gln Ile Gln Gln Thr Thr Thr Thr Phe225
230 235 240Phe Gln Tyr Trp Thr Pro Tyr
Asn Thr Met Thr Tyr Gln Glu Thr Gln 245
250 255Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp Met Arg
Lys Asp Leu Arg 260 265 270Leu
Ala Gln Ser Val Thr Glu Thr Thr Ala Phe Ser His Cys Cys Gln 275
280 285Asn Pro Gln Ile Tyr Ala Tyr Ala Gly
Glu Lys Phe Arg Arg Tyr Leu 290 295
300Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu Cys Gly Arg Ser Val305
310 315 320His Val Asp Phe
Ser Ser Ser Glu Ser Gln Arg Ser Arg His Gly Ser 325
330 335Val Leu Ser Ser Asn Phe Thr Tyr His Thr
Ser Asp Gly Asp Ala Leu 340 345
350Leu Leu Leu 355363355PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 363Met Asp Gln Phe Pro Glu
Ser Val Thr Glu Asn Phe Glu Tyr Asp Asp1 5
10 15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val
Phe Gly Thr Thr 20 25 30Tyr
Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly Gln Thr Gly 35
40 45Asn Leu Gln Val Thr Phe Ala Gln Thr
Asn Ser Lys Lys Pro Lys Ser 50 55
60Val Thr Asp Ile Tyr Leu Gln Asn Leu Ala Gln Ser Asp Gln Gln Tyr65
70 75 80Thr Ala Thr Gln Pro
Phe Trp Thr His Tyr Leu Ile Asn Glu Lys Gly 85
90 95Leu His Asn Ala Met Cys Lys Tyr Thr Thr Ala
Tyr Tyr Tyr Thr Gly 100 105
110Tyr Tyr Gly Ser Thr Tyr Tyr Thr Thr Thr Thr Ser Thr Asp Arg Tyr
115 120 125Leu Ala Ile Val Leu Ala Ala
Asn Ser Met Asn Asn Arg Thr Val Gln 130 135
140His Gly Val Thr Thr Ser Gln Gly Thr Trp Ala Ala Ala Thr Gln
Thr145 150 155 160Ala Ala
Pro Gln Phe Met Phe Thr Lys Gln Lys Glu Asn Glu Cys Leu
165 170 175Gly Asp Tyr Pro Glu Val Leu
Gln Glu Ile Trp Pro Val Leu Arg Asn 180 185
190Val Glu Thr Asn Phe Gln Gly Phe Leu Gln Pro Gln Gln Thr
Met Ser 195 200 205Tyr Cys Tyr Phe
Arg Thr Thr Gln Thr Leu Phe Ser Cys Lys Asn His 210
215 220Lys Lys Ala Lys Ala Ile Lys Leu Ile Gln Gln Thr
Thr Thr Thr Phe225 230 235
240Tyr Gln Tyr Trp Thr Pro Tyr Asn Val Met Thr Phe Gln Glu Thr Gln
245 250 255Lys Leu Tyr Asp Phe
Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg 260
265 270Leu Ala Leu Ser Thr Thr Glu Thr Thr Ala Tyr Ser
His Cys Cys Gln 275 280 285Asn Pro
Gln Thr Tyr Ala Tyr Ala Gly Glu Lys Phe Arg Arg Tyr Leu 290
295 300Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu
Cys Gly Arg Ser Val305 310 315
320His Val Asp Phe Ser Ser Ser Glu Ser Gln Arg Ser Arg His Gly Ser
325 330 335Val Leu Ser Ser
Asn Phe Thr Tyr His Thr Ser Asp Gly Asp Ala Leu 340
345 350Leu Leu Leu 355364355PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
364Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn Phe Glu Tyr Asp Asp1
5 10 15Leu Ala Glu Ala Cys Tyr
Ile Gly Asp Ile Val Val Phe Gly Thr Thr 20 25
30Tyr Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly
Gln Thr Gly 35 40 45Asn Leu Gln
Val Thr Phe Ala Gln Thr Asn Ser Lys Lys Pro Lys Ser 50
55 60Val Thr Asp Ile Tyr Gln Gln Asn Gln Ala Gln Ser
Asp Gln Gln Tyr65 70 75
80Thr Ala Thr Gln Pro Tyr Trp Thr His Tyr Leu Ile Asn Glu Lys Gly
85 90 95Leu His Asn Ala Met Cys
Lys Tyr Thr Thr Ala Tyr Tyr Tyr Thr Gly 100
105 110Tyr Tyr Gly Ser Thr Tyr Tyr Thr Thr Thr Thr Ser
Thr Asp Arg Tyr 115 120 125Leu Ala
Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr Val Gln 130
135 140His Gly Thr Thr Thr Ser Gln Gly Val Trp Ala
Ala Ala Thr Gln Thr145 150 155
160Ala Ala Pro Gln Phe Met Tyr Thr Lys Gln Lys Glu Asn Glu Cys Leu
165 170 175Gly Asp Tyr Pro
Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn 180
185 190Val Glu Thr Asn Phe Gln Gly Phe Leu Gln Pro
Gln Gln Thr Met Ser 195 200 205Tyr
Cys Tyr Phe Arg Ile Thr Gln Thr Leu Phe Ser Cys Lys Asn His 210
215 220Lys Lys Ala Lys Ala Ile Lys Leu Ile Gln
Gln Thr Thr Thr Thr Phe225 230 235
240Tyr Gln Phe Trp Thr Pro Tyr Asn Thr Met Thr Phe Gln Glu Thr
Leu 245 250 255Lys Leu Tyr
Asp Phe Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg 260
265 270Leu Ala Gln Ser Thr Thr Glu Thr Thr Ala
Tyr Ser His Cys Cys Gln 275 280
285Asn Pro Gln Thr Tyr Ala Tyr Ala Gly Glu Lys Phe Arg Arg Tyr Leu 290
295 300Tyr His Leu Tyr Gly Lys Cys Leu
Ala Val Leu Cys Gly Arg Ser Val305 310
315 320His Val Asp Phe Ser Ser Ser Glu Ser Gln Arg Ser
Arg His Gly Ser 325 330
335Val Leu Ser Ser Asn Phe Thr Tyr His Thr Ser Asp Gly Asp Ala Leu
340 345 350Leu Leu Leu
355365355PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 365Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn
Phe Glu Tyr Asp Asp1 5 10
15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val Phe Gly Thr Thr
20 25 30Tyr Gln Ser Thr Tyr Tyr Ser
Thr Thr Tyr Ala Thr Gly Gln Val Gly 35 40
45Asn Gln Gln Val Val Phe Ala Leu Thr Asn Ser Lys Lys Pro Lys
Ser 50 55 60Val Thr Asp Ile Tyr Gln
Gln Asn Leu Ala Gln Ser Asp Gln Gln Phe65 70
75 80Thr Ala Thr Gln Pro Tyr Trp Thr His Tyr Leu
Ile Asn Glu Lys Gly 85 90
95Leu His Asn Ala Met Cys Lys Tyr Thr Thr Ala Tyr Tyr Tyr Thr Gly
100 105 110Tyr Tyr Gly Ser Thr Tyr
Tyr Thr Thr Thr Thr Ser Thr Asp Arg Tyr 115 120
125Leu Ala Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr
Val Gln 130 135 140His Gly Thr Thr Thr
Ser Gln Gly Thr Trp Ala Ala Ala Thr Gln Thr145 150
155 160Ala Ala Pro Gln Phe Met Phe Thr Lys Gln
Lys Glu Asn Glu Cys Leu 165 170
175Gly Asp Tyr Pro Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn
180 185 190Val Glu Thr Asn Tyr
Gln Gly Tyr Gln Gln Pro Gln Gln Thr Met Ser 195
200 205Tyr Cys Tyr Tyr Arg Thr Thr Gln Thr Leu Phe Ser
Cys Lys Asn His 210 215 220Lys Lys Ala
Lys Ala Ile Lys Leu Ile Gln Gln Thr Thr Thr Thr Phe225
230 235 240Tyr Gln Phe Trp Thr Pro Tyr
Asn Thr Met Thr Phe Gln Glu Thr Leu 245
250 255Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp Met Arg
Lys Asp Leu Arg 260 265 270Leu
Ala Leu Ser Val Thr Glu Thr Val Ala Phe Ser His Cys Cys Gln 275
280 285Asn Pro Gln Ile Tyr Ala Tyr Ala Gly
Glu Lys Phe Arg Arg Tyr Leu 290 295
300Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu Cys Gly Arg Ser Val305
310 315 320His Val Asp Phe
Ser Ser Ser Glu Ser Gln Arg Ser Arg His Gly Ser 325
330 335Val Leu Ser Ser Asn Phe Thr Tyr His Thr
Ser Asp Gly Asp Ala Leu 340 345
350Leu Leu Leu 355366355PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 366Met Asp Gln Phe Pro Glu
Ser Val Thr Glu Asn Phe Glu Tyr Asp Asp1 5
10 15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val
Phe Gly Thr Thr 20 25 30Tyr
Gln Ser Thr Tyr Tyr Ser Thr Thr Tyr Ala Thr Gly Gln Val Gly 35
40 45Asn Gln Gln Val Val Phe Ala Leu Thr
Asn Ser Lys Lys Pro Lys Ser 50 55
60Val Thr Asp Ile Tyr Leu Leu Asn Gln Ala Gln Ser Asp Gln Gln Phe65
70 75 80Thr Ala Thr Gln Pro
Tyr Trp Thr His Tyr Leu Ile Asn Glu Lys Gly 85
90 95Leu His Asn Ala Met Cys Lys Tyr Thr Thr Ala
Tyr Tyr Tyr Thr Gly 100 105
110Tyr Tyr Gly Ser Thr Tyr Tyr Thr Thr Thr Thr Ser Thr Asp Arg Tyr
115 120 125Leu Ala Ile Val Leu Ala Ala
Asn Ser Met Asn Asn Arg Thr Val Gln 130 135
140His Gly Thr Thr Thr Ser Gln Gly Thr Trp Ala Ala Ala Thr Gln
Val145 150 155 160Ala Ala
Pro Gln Phe Met Phe Thr Lys Gln Lys Glu Asn Glu Cys Leu
165 170 175Gly Asp Tyr Pro Glu Val Leu
Gln Glu Ile Trp Pro Val Leu Arg Asn 180 185
190Val Glu Thr Asn Phe Gln Gly Phe Leu Gln Pro Gln Gln Thr
Met Ser 195 200 205Tyr Cys Tyr Tyr
Arg Ile Thr Gln Thr Leu Phe Ser Cys Lys Asn His 210
215 220Lys Lys Ala Lys Ala Ile Lys Gln Ile Gln Gln Thr
Thr Thr Thr Phe225 230 235
240Tyr Gln Tyr Trp Thr Pro Tyr Asn Thr Met Thr Tyr Gln Glu Thr Gln
245 250 255Lys Leu Tyr Asp Phe
Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg 260
265 270Leu Ala Gln Ser Val Thr Glu Thr Thr Ala Tyr Ser
His Cys Cys Gln 275 280 285Asn Pro
Gln Thr Tyr Ala Tyr Ala Gly Glu Lys Phe Arg Arg Tyr Leu 290
295 300Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu
Cys Gly Arg Ser Val305 310 315
320His Val Asp Phe Ser Ser Ser Glu Ser Gln Arg Ser Arg His Gly Ser
325 330 335Val Leu Ser Ser
Asn Phe Thr Tyr His Thr Ser Asp Gly Asp Ala Leu 340
345 350Leu Leu Leu
355367355PRTUnknownDescription of Unknown Mammalian CCR3 polypeptide
367Met Thr Thr Ser Leu Asp Thr Val Glu Thr Phe Gly Thr Thr Ser Tyr1
5 10 15Tyr Asp Asp Val Gly Leu
Leu Cys Glu Lys Ala Asp Thr Arg Ala Leu 20 25
30Met Ala Gln Phe Val Pro Pro Leu Tyr Ser Leu Val Phe
Thr Val Gly 35 40 45Leu Leu Gly
Asn Val Val Val Val Met Ile Leu Ile Lys Tyr Arg Arg 50
55 60Leu Arg Ile Met Thr Asn Ile Tyr Leu Leu Asn Leu
Ala Ile Ser Asp65 70 75
80Leu Leu Phe Leu Val Thr Leu Pro Phe Trp Ile His Tyr Val Arg Gly
85 90 95His Asn Trp Val Phe Gly
His Gly Met Cys Lys Leu Leu Ser Gly Phe 100
105 110Tyr His Thr Gly Leu Tyr Ser Glu Ile Phe Phe Ile
Ile Leu Leu Thr 115 120 125Ile Asp
Arg Tyr Leu Ala Ile Val His Ala Val Phe Ala Leu Arg Ala 130
135 140Arg Thr Val Thr Phe Gly Val Ile Thr Ser Ile
Val Thr Trp Gly Leu145 150 155
160Ala Val Leu Ala Ala Leu Pro Glu Phe Ile Phe Tyr Glu Thr Glu Glu
165 170 175Leu Phe Glu Glu
Thr Leu Cys Ser Ala Leu Tyr Pro Glu Asp Thr Val 180
185 190Tyr Ser Trp Arg His Phe His Thr Leu Arg Met
Thr Ile Phe Cys Leu 195 200 205Val
Leu Pro Leu Leu Val Met Ala Ile Cys Tyr Thr Gly Ile Ile Lys 210
215 220Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys
Tyr Lys Ala Ile Arg Leu225 230 235
240Ile Phe Val Ile Met Ala Val Phe Phe Ile Phe Trp Thr Pro Tyr
Asn 245 250 255Val Ala Ile
Leu Leu Ser Ser Tyr Gln Ser Ile Leu Phe Gly Asn Asp 260
265 270Cys Glu Arg Ser Lys His Leu Asp Leu Val
Met Leu Val Thr Glu Val 275 280
285Ile Ala Tyr Ser His Cys Cys Met Asn Pro Val Ile Tyr Ala Phe Val 290
295 300Gly Glu Arg Phe Arg Lys Tyr Leu
Arg His Phe Phe His Arg His Leu305 310
315 320Leu Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro
Ser Glu Lys Leu 325 330
335Glu Arg Thr Ser Ser Val Ser Pro Ser Thr Ala Glu Pro Glu Leu Ser
340 345 350Ile Val Phe
355368355PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 368Met Thr Thr Ser Leu Asp Thr Val Glu Thr Phe
Gly Thr Thr Ser Tyr1 5 10
15Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala Asp Thr Arg Ala Leu
20 25 30Met Ala Gln Phe Val Pro Pro
Gln Tyr Ser Gln Thr Tyr Thr Thr Gly 35 40
45Gln Gln Gly Asn Thr Thr Val Thr Met Thr Gln Ile Lys Tyr Arg
Arg 50 55 60Leu Arg Ile Met Thr Asn
Ile Tyr Gln Gln Asn Gln Ala Ile Ser Asp65 70
75 80Gln Gln Tyr Gln Val Thr Gln Pro Tyr Trp Thr
His Tyr Val Arg Gly 85 90
95His Asn Trp Val Phe Gly His Gly Met Cys Lys Gln Leu Ser Gly Tyr
100 105 110Tyr His Thr Gly Gln Tyr
Ser Glu Thr Tyr Tyr Thr Thr Gln Gln Thr 115 120
125Thr Asp Arg Tyr Leu Ala Ile Val His Ala Val Phe Ala Leu
Arg Ala 130 135 140Arg Thr Thr Thr Phe
Gly Thr Thr Thr Ser Thr Val Thr Trp Gly Gln145 150
155 160Ala Val Gln Ala Ala Gln Pro Glu Phe Ile
Phe Tyr Glu Thr Glu Glu 165 170
175Leu Phe Glu Glu Thr Leu Cys Ser Ala Leu Tyr Pro Glu Asp Thr Val
180 185 190Tyr Ser Trp Arg His
Phe His Thr Leu Arg Met Thr Ile Tyr Cys Gln 195
200 205Val Gln Pro Gln Gln Val Met Ala Thr Cys Tyr Thr
Gly Thr Thr Lys 210 215 220Thr Leu Leu
Arg Cys Pro Ser Lys Lys Lys Tyr Lys Ala Ile Arg Gln225
230 235 240Thr Tyr Thr Thr Met Ala Thr
Tyr Tyr Thr Tyr Trp Thr Pro Tyr Asn 245
250 255Thr Ala Thr Gln Gln Ser Ser Tyr Gln Ser Ile Leu
Phe Gly Asn Asp 260 265 270Cys
Glu Arg Ser Lys His Leu Asp Leu Thr Met Gln Thr Thr Glu Thr 275
280 285Thr Ala Tyr Ser His Cys Cys Met Asn
Pro Thr Thr Tyr Ala Tyr Val 290 295
300Gly Glu Arg Phe Arg Met Tyr Leu Arg His Phe Phe His Arg His Leu305
310 315 320Leu Met His Leu
Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu Lys Leu 325
330 335Glu Arg Thr Ser Ser Val Ser Pro Ser Thr
Ala Glu Pro Glu Leu Ser 340 345
350Ile Val Phe 355369355PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 369Met Thr Thr Ser Leu Asp
Thr Val Glu Thr Phe Gly Thr Thr Ser Tyr1 5
10 15Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala Asp
Thr Arg Ala Leu 20 25 30Met
Ala Gln Phe Thr Pro Pro Gln Tyr Ser Gln Thr Phe Thr Thr Gly 35
40 45Gln Gln Gly Asn Thr Thr Val Thr Met
Thr Gln Ile Lys Tyr Arg Arg 50 55
60Leu Arg Ile Met Thr Asn Ile Tyr Leu Gln Asn Gln Ala Ile Ser Asp65
70 75 80Gln Leu Phe Gln Thr
Thr Gln Pro Tyr Trp Thr His Tyr Val Arg Gly 85
90 95His Asn Trp Val Phe Gly His Gly Met Cys Lys
Gln Leu Ser Gly Phe 100 105
110Tyr His Thr Gly Gln Tyr Ser Glu Thr Phe Tyr Thr Thr Gln Gln Thr
115 120 125Thr Asp Arg Tyr Leu Ala Ile
Val His Ala Val Phe Ala Leu Arg Ala 130 135
140Arg Thr Thr Thr Tyr Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly
Gln145 150 155 160Ala Thr
Gln Ala Ala Gln Pro Glu Phe Ile Tyr Tyr Glu Thr Glu Glu
165 170 175Leu Phe Glu Glu Thr Leu Cys
Ser Ala Leu Tyr Pro Glu Asp Thr Val 180 185
190Tyr Ser Trp Arg His Phe His Thr Leu Arg Met Thr Ile Tyr
Cys Gln 195 200 205Val Gln Pro Gln
Gln Val Met Ala Thr Cys Tyr Thr Gly Thr Thr Lys 210
215 220Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys Tyr Lys
Ala Ile Arg Gln225 230 235
240Thr Tyr Thr Thr Met Ala Thr Tyr Tyr Thr Tyr Trp Thr Pro Tyr Asn
245 250 255Thr Ala Thr Gln Gln
Ser Ser Tyr Gln Ser Ile Leu Phe Gly Asn Asp 260
265 270Cys Glu Arg Ser Lys His Leu Asp Leu Val Met Gln
Val Thr Glu Thr 275 280 285Thr Ala
Tyr Ser His Cys Cys Met Asn Pro Val Thr Tyr Ala Tyr Thr 290
295 300Gly Glu Arg Phe Arg Lys Tyr Leu Arg His Phe
Phe His Arg His Leu305 310 315
320Leu Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu Lys Leu
325 330 335Glu Arg Thr Ser
Ser Val Ser Pro Ser Thr Ala Glu Pro Glu Leu Ser 340
345 350Ile Val Phe 355370355PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
370Met Thr Thr Ser Leu Asp Thr Val Glu Thr Phe Gly Thr Thr Ser Tyr1
5 10 15Tyr Asp Asp Val Gly Leu
Leu Cys Glu Lys Ala Asp Thr Arg Ala Leu 20 25
30Met Ala Gln Phe Thr Pro Pro Gln Tyr Ser Gln Thr Tyr
Thr Thr Gly 35 40 45Gln Gln Gly
Asn Thr Thr Val Thr Met Thr Gln Ile Lys Tyr Arg Arg 50
55 60Leu Arg Ile Met Thr Asn Ile Tyr Gln Gln Asn Leu
Ala Ile Ser Asp65 70 75
80Gln Gln Phe Gln Thr Thr Gln Pro Phe Trp Thr His Tyr Val Arg Gly
85 90 95His Asn Trp Val Phe Gly
His Gly Met Cys Lys Gln Gln Ser Gly Phe 100
105 110Tyr His Thr Gly Gln Tyr Ser Glu Thr Phe Phe Thr
Thr Gln Gln Thr 115 120 125Thr Asp
Arg Tyr Leu Ala Ile Val His Ala Val Phe Ala Leu Arg Ala 130
135 140Arg Thr Thr Thr Tyr Gly Thr Thr Thr Ser Thr
Thr Thr Trp Gly Gln145 150 155
160Ala Thr Gln Ala Ala Gln Pro Glu Phe Ile Tyr Tyr Glu Thr Glu Glu
165 170 175Leu Phe Glu Glu
Thr Leu Cys Ser Ala Leu Tyr Pro Glu Asp Thr Val 180
185 190Tyr Ser Trp Arg His Phe His Thr Leu Arg Met
Thr Thr Tyr Cys Gln 195 200 205Thr
Gln Pro Gln Gln Thr Met Ala Thr Cys Tyr Thr Gly Thr Thr Lys 210
215 220Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys
Tyr Lys Ala Ile Arg Gln225 230 235
240Thr Tyr Thr Thr Met Ala Thr Tyr Tyr Thr Tyr Trp Thr Pro Tyr
Asn 245 250 255Thr Ala Thr
Gln Gln Ser Ser Tyr Gln Ser Ile Leu Phe Gly Asn Asp 260
265 270Cys Glu Arg Ser Lys His Leu Asp Leu Val
Met Gln Val Thr Glu Thr 275 280
285Thr Ala Tyr Ser His Cys Cys Met Asn Pro Thr Thr Tyr Ala Phe Thr 290
295 300Gly Glu Arg Phe Arg Lys Tyr Leu
Arg His Phe Phe His Arg His Leu305 310
315 320Leu Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro
Ser Glu Lys Leu 325 330
335Glu Arg Thr Ser Ser Val Ser Pro Ser Thr Ala Glu Pro Glu Leu Ser
340 345 350Ile Val Phe
355371355PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 371Met Thr Thr Ser Leu Asp Thr Val Glu Thr Phe
Gly Thr Thr Ser Tyr1 5 10
15Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala Asp Thr Arg Ala Leu
20 25 30Met Ala Gln Phe Thr Pro Pro
Gln Tyr Ser Gln Thr Tyr Thr Thr Gly 35 40
45Gln Gln Gly Asn Val Thr Val Thr Met Thr Gln Ile Lys Tyr Arg
Arg 50 55 60Leu Arg Ile Met Thr Asn
Ile Tyr Leu Gln Asn Gln Ala Ile Ser Asp65 70
75 80Gln Leu Phe Gln Thr Thr Gln Pro Phe Trp Thr
His Tyr Val Arg Gly 85 90
95His Asn Trp Val Phe Gly His Gly Met Cys Lys Gln Gln Ser Gly Phe
100 105 110Tyr His Thr Gly Gln Tyr
Ser Glu Thr Phe Tyr Thr Thr Gln Gln Thr 115 120
125Thr Asp Arg Tyr Leu Ala Ile Val His Ala Val Phe Ala Leu
Arg Ala 130 135 140Arg Thr Thr Thr Tyr
Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly Gln145 150
155 160Ala Val Gln Ala Ala Gln Pro Glu Phe Thr
Phe Tyr Glu Thr Glu Glu 165 170
175Leu Phe Glu Glu Thr Leu Cys Ser Ala Leu Tyr Pro Glu Asp Thr Val
180 185 190Tyr Ser Trp Arg His
Phe His Thr Leu Arg Met Thr Ile Phe Cys Gln 195
200 205Thr Gln Pro Gln Gln Thr Met Ala Thr Cys Tyr Thr
Gly Ile Thr Lys 210 215 220Thr Leu Leu
Arg Cys Pro Ser Lys Lys Lys Tyr Lys Ala Ile Arg Gln225
230 235 240Thr Tyr Thr Thr Met Ala Thr
Tyr Tyr Thr Tyr Trp Thr Pro Tyr Asn 245
250 255Thr Ala Thr Gln Gln Ser Ser Tyr Gln Ser Ile Leu
Phe Gly Asn Asp 260 265 270Cys
Glu Arg Ser Lys His Leu Asp Leu Thr Met Gln Thr Thr Glu Thr 275
280 285Thr Ala Tyr Ser His Cys Cys Met Asn
Pro Thr Thr Tyr Ala Tyr Thr 290 295
300Gly Glu Arg Phe Arg Lys Tyr Leu Arg His Phe Phe His Arg His Leu305
310 315 320Leu Met His Leu
Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu Lys Leu 325
330 335Glu Arg Thr Ser Ser Val Ser Pro Ser Thr
Ala Glu Pro Glu Leu Ser 340 345
350Ile Val Phe 355372355PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 372Met Thr Thr Ser Leu Asp
Thr Val Glu Thr Phe Gly Thr Thr Ser Tyr1 5
10 15Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala Asp
Thr Arg Ala Leu 20 25 30Met
Ala Gln Phe Thr Pro Pro Gln Tyr Ser Gln Thr Phe Thr Thr Gly 35
40 45Gln Gln Gly Asn Thr Thr Val Thr Met
Thr Gln Ile Lys Tyr Arg Arg 50 55
60Leu Arg Ile Met Thr Asn Ile Tyr Leu Gln Asn Gln Ala Ile Ser Asp65
70 75 80Gln Leu Phe Gln Thr
Thr Gln Pro Tyr Trp Thr His Tyr Val Arg Gly 85
90 95His Asn Trp Val Phe Gly His Gly Met Cys Lys
Gln Gln Ser Gly Phe 100 105
110Tyr His Thr Gly Gln Tyr Ser Glu Thr Phe Phe Thr Thr Gln Gln Thr
115 120 125Thr Asp Arg Tyr Leu Ala Ile
Val His Ala Val Phe Ala Leu Arg Ala 130 135
140Arg Thr Thr Thr Tyr Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly
Gln145 150 155 160Ala Val
Gln Ala Ala Gln Pro Glu Phe Ile Phe Tyr Glu Thr Glu Glu
165 170 175Leu Phe Glu Glu Thr Leu Cys
Ser Ala Leu Tyr Pro Glu Asp Thr Val 180 185
190Tyr Ser Trp Arg His Phe His Thr Leu Arg Met Thr Thr Tyr
Cys Gln 195 200 205Thr Gln Pro Gln
Gln Thr Met Ala Thr Cys Tyr Thr Gly Thr Thr Lys 210
215 220Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys Tyr Glu
Ala Ile Arg Gln225 230 235
240Thr Tyr Thr Thr Met Ala Thr Tyr Tyr Thr Tyr Trp Thr Pro Tyr Asn
245 250 255Thr Ala Thr Gln Gln
Ser Ser Tyr Gln Ser Ile Leu Phe Gly Asn Asp 260
265 270Cys Glu Arg Ser Lys His Leu Asp Leu Thr Met Gln
Val Thr Glu Thr 275 280 285Ile Ala
Tyr Ser His Cys Cys Met Asn Pro Thr Thr Tyr Ala Phe Thr 290
295 300Gly Glu Arg Phe Arg Lys Tyr Leu Arg His Phe
Phe His Arg His Leu305 310 315
320Leu Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu Lys Leu
325 330 335Glu Arg Thr Ser
Ser Val Ser Pro Ser Thr Ala Glu Pro Glu Leu Ser 340
345 350Ile Val Phe 355373355PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
373Met Thr Thr Ser Leu Asp Thr Val Glu Thr Phe Gly Thr Thr Ser Tyr1
5 10 15Tyr Asp Asp Val Gly Leu
Leu Cys Glu Lys Ala Asp Thr Arg Ala Leu 20 25
30Met Ala Gln Phe Val Pro Pro Gln Tyr Ser Gln Thr Tyr
Thr Thr Gly 35 40 45Gln Gln Gly
Asn Thr Thr Val Thr Met Thr Gln Ile Lys Tyr Arg Arg 50
55 60Leu Arg Ile Met Thr Asn Ile Tyr Gln Gln Asn Leu
Ala Ile Ser Asp65 70 75
80Gln Gln Tyr Gln Val Thr Gln Pro Tyr Trp Thr His Tyr Val Arg Gly
85 90 95His Asn Trp Val Phe Gly
His Gly Met Cys Lys Gln Leu Ser Gly Phe 100
105 110Tyr His Thr Gly Gln Tyr Ser Glu Thr Phe Phe Thr
Thr Gln Gln Thr 115 120 125Thr Asp
Arg Tyr Leu Ala Ile Val His Ala Val Phe Ala Leu Arg Ala 130
135 140Arg Thr Thr Thr Tyr Gly Thr Thr Thr Ser Thr
Thr Thr Trp Gly Gln145 150 155
160Ala Thr Gln Ala Ala Gln Pro Glu Phe Ile Tyr Tyr Glu Thr Glu Glu
165 170 175Leu Phe Glu Glu
Thr Leu Cys Ser Ala Leu Tyr Pro Glu Asp Thr Val 180
185 190Tyr Ser Trp Arg His Phe His Thr Leu Arg Met
Thr Ile Phe Cys Gln 195 200 205Thr
Gln Pro Gln Gln Thr Met Ala Thr Cys Tyr Thr Gly Thr Ile Lys 210
215 220Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys
Tyr Lys Ala Ile Arg Gln225 230 235
240Thr Tyr Thr Thr Met Ala Thr Tyr Tyr Thr Tyr Trp Thr Pro Tyr
Asn 245 250 255Thr Ala Thr
Gln Gln Ser Ser Tyr Gln Ser Ile Leu Phe Gly Asn Asp 260
265 270Cys Glu Arg Ser Lys His Leu Asp Leu Val
Met Gln Thr Thr Glu Thr 275 280
285Ile Ala Tyr Ser His Cys Cys Met Asn Pro Thr Thr Tyr Ala Tyr Thr 290
295 300Gly Glu Arg Phe Arg Lys Tyr Leu
Arg His Phe Phe His Arg His Leu305 310
315 320Leu Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro
Ser Glu Lys Leu 325 330
335Glu Arg Thr Ser Ser Val Ser Pro Ser Thr Ala Glu Pro Glu Leu Ser
340 345 350Ile Val Phe
355374355PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 374Met Thr Thr Ser Leu Asp Thr Val Glu Thr Phe
Gly Thr Thr Ser Tyr1 5 10
15Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala Asp Thr Arg Ala Leu
20 25 30Met Ala Gln Phe Val Pro Pro
Gln Tyr Ser Gln Thr Phe Thr Thr Gly 35 40
45Gln Gln Gly Asn Thr Thr Val Thr Met Thr Gln Ile Lys Tyr Arg
Arg 50 55 60Leu Arg Ile Met Thr Asn
Ile Tyr Gln Gln Asn Leu Ala Ile Ser Asp65 70
75 80Gln Gln Tyr Gln Val Thr Gln Pro Phe Trp Ile
His Tyr Val Arg Gly 85 90
95His Asn Trp Val Phe Gly His Gly Met Cys Lys Gln Leu Ser Gly Tyr
100 105 110Tyr His Thr Gly Gln Tyr
Ser Glu Thr Phe Phe Thr Thr Gln Gln Thr 115 120
125Thr Asp Arg Tyr Leu Ala Ile Val His Ala Val Phe Ala Leu
Arg Ala 130 135 140Arg Thr Thr Thr Phe
Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly Gln145 150
155 160Ala Val Gln Ala Ala Gln Pro Glu Phe Ile
Phe Tyr Glu Thr Glu Glu 165 170
175Leu Phe Glu Glu Thr Leu Cys Ser Ala Leu Tyr Pro Glu Asp Thr Val
180 185 190Tyr Ser Trp Arg His
Phe His Thr Leu Arg Met Thr Ile Tyr Cys Gln 195
200 205Val Gln Pro Gln Gln Val Met Ala Thr Cys Tyr Thr
Gly Thr Thr Lys 210 215 220Thr Pro Leu
Arg Cys Pro Ser Lys Lys Lys Tyr Lys Ala Ile Arg Gln225
230 235 240Thr Tyr Thr Thr Met Ala Thr
Tyr Tyr Thr Tyr Trp Thr Pro Tyr Asn 245
250 255Thr Ala Thr Gln Gln Ser Ser Tyr Gln Ser Ile Leu
Phe Gly Asn Asp 260 265 270Cys
Glu Arg Ser Lys His Leu Asp Leu Thr Met Gln Val Thr Glu Thr 275
280 285Ile Ala Tyr Ser His Cys Cys Met Asn
Pro Thr Thr Tyr Ala Tyr Thr 290 295
300Gly Glu Arg Phe Arg Lys Tyr Leu Arg His Phe Phe His Arg His Leu305
310 315 320Leu Met His Leu
Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu Lys Leu 325
330 335Glu Arg Thr Ser Ser Val Ser Pro Ser Thr
Ala Glu Pro Glu Leu Ser 340 345
350Ile Val Phe 355375355PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 375Met Thr Thr Ser Leu Asp
Thr Val Glu Thr Phe Gly Thr Thr Ser Tyr1 5
10 15Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala Asp
Thr Arg Ala Leu 20 25 30Met
Ala Gln Phe Thr Pro Pro Gln Tyr Ser Gln Thr Tyr Thr Thr Gly 35
40 45Gln Gln Gly Asn Thr Val Thr Thr Met
Thr Gln Ile Lys Tyr Arg Arg 50 55
60Leu Arg Ile Met Thr Asn Ile Tyr Gln Gln Asn Leu Ala Ile Ser Asp65
70 75 80Gln Gln Phe Gln Thr
Thr Gln Pro Phe Trp Thr His Tyr Val Arg Gly 85
90 95His Asn Trp Val Phe Gly His Gly Met Cys Lys
Gln Leu Ser Gly Tyr 100 105
110Tyr His Thr Gly Gln Tyr Ser Glu Thr Phe Phe Thr Thr Gln Gln Thr
115 120 125Thr Asp Arg Tyr Leu Ala Ile
Val His Ala Val Phe Ala Leu Arg Ala 130 135
140Arg Thr Thr Thr Tyr Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly
Gln145 150 155 160Ala Thr
Gln Ala Ala Gln Pro Glu Phe Ile Phe Tyr Glu Thr Glu Glu
165 170 175Leu Phe Glu Glu Thr Leu Cys
Ser Ala Leu Tyr Pro Glu Asp Thr Val 180 185
190Tyr Ser Trp Arg His Phe His Thr Leu Arg Met Thr Ile Phe
Cys Gln 195 200 205Thr Gln Pro Gln
Gln Thr Met Ala Thr Cys Tyr Thr Gly Thr Thr Lys 210
215 220Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys Tyr Lys
Ala Ile Arg Gln225 230 235
240Thr Tyr Thr Thr Met Ala Thr Tyr Tyr Thr Tyr Trp Thr Pro Tyr Asn
245 250 255Thr Ala Thr Gln Gln
Ser Ser Tyr Gln Ser Ile Leu Phe Gly Asn Asp 260
265 270Cys Glu Arg Ser Lys His Leu Asp Leu Val Met Leu
Thr Thr Glu Val 275 280 285Thr Ala
Tyr Ser His Cys Cys Met Asn Pro Thr Thr Tyr Ala Phe Thr 290
295 300Gly Gly Arg Phe Arg Lys Tyr Leu Arg His Phe
Phe His Arg His Leu305 310 315
320Leu Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu Lys Leu
325 330 335Glu Arg Thr Ser
Ser Val Ser Pro Ser Thr Ala Glu Pro Glu Leu Ser 340
345 350Ile Val Phe 355376355PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
376Met Thr Thr Ser Leu Asp Thr Val Glu Thr Phe Gly Thr Thr Ser Tyr1
5 10 15Tyr Asp Asp Val Gly Leu
Leu Cys Glu Lys Ala Asp Thr Arg Ala Leu 20 25
30Met Ala Gln Phe Thr Pro Pro Gln Tyr Ser Gln Thr Tyr
Thr Thr Gly 35 40 45Gln Gln Gly
Asn Thr Thr Thr Thr Met Thr Gln Thr Lys Tyr Arg Arg 50
55 60Leu Arg Ile Met Thr Asn Ile Tyr Leu Gln Asn Gln
Ala Thr Ser Asp65 70 75
80Gln Leu Phe Gln Thr Thr Gln Pro Tyr Trp Thr His Tyr Val Arg Gly
85 90 95His Asn Trp Val Phe Gly
His Gly Met Cys Lys Gln Gln Ser Gly Phe 100
105 110Tyr His Thr Gly Gln Tyr Ser Glu Thr Phe Phe Thr
Thr Gln Gln Thr 115 120 125Thr Asp
Arg Tyr Leu Ala Ile Val His Ala Val Phe Ala Leu Arg Ala 130
135 140Arg Thr Thr Thr Phe Gly Thr Thr Thr Ser Thr
Thr Thr Trp Gly Gln145 150 155
160Ala Thr Gln Ala Ala Gln Pro Glu Tyr Thr Tyr Tyr Glu Thr Glu Glu
165 170 175Leu Phe Glu Glu
Thr Leu Cys Ser Ala Leu Tyr Pro Glu Asp Thr Val 180
185 190Tyr Ser Trp Arg His Phe His Thr Leu Arg Met
Thr Ile Phe Cys Gln 195 200 205Val
Gln Pro Gln Gln Thr Met Ala Thr Cys Tyr Thr Gly Thr Thr Lys 210
215 220Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys
Tyr Lys Ala Ile Arg Gln225 230 235
240Thr Tyr Thr Thr Met Ala Thr Tyr Tyr Thr Tyr Trp Thr Pro Tyr
Asn 245 250 255Thr Ala Thr
Gln Gln Ser Ser Tyr Gln Ser Ile Leu Phe Gly Asn Asp 260
265 270Cys Glu Arg Ser Lys His Leu Asp Leu Thr
Met Gln Val Thr Glu Thr 275 280
285Ile Ala Tyr Ser His Cys Cys Met Asn Pro Thr Thr Tyr Ala Tyr Thr 290
295 300Gly Glu Arg Phe Arg Lys Tyr Leu
Arg His Phe Phe His Arg His Leu305 310
315 320Leu Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro
Ser Glu Lys Leu 325 330
335Glu Arg Thr Ser Ser Val Ser Pro Ser Thr Ala Glu Pro Glu Leu Ser
340 345 350Ile Val Phe
355377355PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 377Met Thr Thr Ser Leu Asp Thr Val Glu Thr Phe
Gly Thr Thr Ser Tyr1 5 10
15Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala Asp Thr Arg Ala Leu
20 25 30Met Ala Gln Phe Thr Pro Pro
Gln Tyr Ser Gln Thr Tyr Thr Thr Gly 35 40
45Gln Gln Gly Asn Thr Thr Thr Thr Met Thr Gln Ile Lys Tyr Arg
Arg 50 55 60Leu Arg Ile Met Thr Asn
Ile Tyr Gln Gln Asn Gln Ala Thr Ser Asp65 70
75 80Gln Gln Tyr Gln Thr Thr Gln Pro Tyr Trp Thr
His Tyr Val Arg Gly 85 90
95His Asn Trp Val Phe Gly His Gly Met Cys Lys Gln Gln Ser Gly Phe
100 105 110Tyr His Thr Gly Gln Tyr
Ser Glu Thr Tyr Tyr Thr Thr Gln Gln Thr 115 120
125Thr Asp Arg Tyr Leu Ala Ile Val His Ala Val Phe Ala Leu
Arg Ala 130 135 140Arg Thr Thr Thr Phe
Gly Thr Thr Thr Ser Thr Val Thr Trp Gly Gln145 150
155 160Ala Val Gln Ala Ala Gln Pro Glu Phe Thr
Phe Tyr Glu Thr Glu Glu 165 170
175Leu Phe Glu Glu Thr Leu Cys Ser Ala Leu Tyr Pro Glu Asp Thr Val
180 185 190Tyr Ser Trp Arg His
Phe His Thr Leu Arg Met Thr Ile Phe Cys Gln 195
200 205Thr Gln Pro Gln Gln Thr Met Ala Thr Cys Tyr Thr
Gly Thr Thr Lys 210 215 220Thr Leu Leu
Arg Cys Pro Ser Lys Lys Lys Tyr Lys Ala Ile Arg Gln225
230 235 240Thr Tyr Thr Thr Met Ala Thr
Tyr Tyr Thr Tyr Trp Thr Pro Tyr Asn 245
250 255Thr Ala Thr Gln Gln Ser Ser Tyr Gln Ser Ile Leu
Phe Gly Asn Asp 260 265 270Cys
Glu Arg Ser Lys His Leu Asp Leu Thr Met Gln Val Thr Glu Thr 275
280 285Ile Ala Tyr Ser His Cys Cys Met Asn
Pro Thr Thr Tyr Ala Phe Thr 290 295
300Gly Glu Arg Phe Arg Lys Tyr Leu Arg His Phe Phe His Arg His Leu305
310 315 320Leu Met His Leu
Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu Lys Leu 325
330 335Glu Arg Thr Ser Ser Val Ser Pro Ser Thr
Ala Glu Pro Glu Leu Ser 340 345
350Ile Val Phe 355378355PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 378Met Thr Thr Ser Leu Asp
Thr Val Glu Thr Phe Gly Thr Thr Ser Tyr1 5
10 15Tyr Asp Asp Val Gly Leu Leu Cys Glu Lys Ala Asp
Thr Arg Ala Leu 20 25 30Met
Ala Gln Phe Thr Pro Pro Gln Tyr Ser Gln Thr Tyr Thr Thr Gly 35
40 45Gln Gln Gly Asn Thr Val Thr Thr Met
Thr Gln Ile Lys Tyr Arg Arg 50 55
60Leu Arg Ile Met Thr Asn Ile Tyr Leu Leu Asn Gln Ala Thr Ser Asp65
70 75 80Gln Gln Phe Gln Val
Thr Gln Pro Phe Trp Ile His Tyr Val Arg Gly 85
90 95His Asn Trp Val Phe Gly His Gly Met Cys Lys
Gln Gln Ser Gly Phe 100 105
110Tyr His Thr Gly Gln Tyr Ser Glu Thr Phe Tyr Thr Thr Gln Gln Thr
115 120 125Thr Asp Arg Tyr Leu Ala Ile
Val His Ala Val Phe Ala Leu Arg Ala 130 135
140Arg Thr Thr Thr Tyr Gly Thr Thr Thr Ser Thr Thr Thr Trp Gly
Gln145 150 155 160Ala Thr
Gln Ala Ala Gln Pro Glu Phe Ile Tyr Tyr Glu Thr Glu Glu
165 170 175Leu Phe Glu Glu Thr Leu Cys
Ser Ala Leu Tyr Pro Glu Asp Thr Val 180 185
190Tyr Ser Trp Arg His Phe His Thr Leu Arg Met Thr Ile Tyr
Cys Gln 195 200 205Val Gln Pro Gln
Gln Val Met Ala Thr Cys Tyr Thr Gly Thr Thr Lys 210
215 220Thr Leu Leu Arg Cys Pro Ser Lys Lys Lys Tyr Lys
Ala Ile Arg Gln225 230 235
240Thr Tyr Thr Thr Met Ala Thr Tyr Tyr Thr Tyr Trp Thr Pro Tyr Asn
245 250 255Thr Ala Thr Gln Gln
Ser Ser Tyr Gln Ser Ile Leu Phe Gly Asn Asp 260
265 270Cys Glu Arg Ser Lys His Leu Asp Leu Thr Met Gln
Thr Thr Glu Thr 275 280 285Thr Ala
Tyr Ser His Cys Cys Met Asn Pro Thr Thr Tyr Ala Tyr Thr 290
295 300Gly Glu Arg Phe Arg Lys Tyr Leu Arg His Phe
Phe His Arg His Leu305 310 315
320Leu Met His Leu Gly Arg Tyr Ile Pro Phe Leu Pro Ser Glu Lys Leu
325 330 335Glu Arg Thr Ser
Ser Val Ser Pro Ser Thr Ala Glu Pro Glu Leu Ser 340
345 350Ile Val Phe 355379352PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
379Met Asp Tyr Gln Val Ser Ser Pro Ile Tyr Asp Ile Asn Tyr Tyr Thr1
5 10 15Ser Glu Pro Cys Gln Lys
Ile Asn Val Lys Gln Ile Ala Ala Arg Leu 20 25
30Gln Pro Pro Gln Tyr Ser Gln Thr Phe Thr Phe Gly Phe
Thr Gly Asn 35 40 45Met Gln Val
Thr Gln Thr Gln Ile Asn Cys Lys Arg Leu Lys Ser Met 50
55 60Thr Asp Ile Tyr Leu Gln Asn Gln Ala Ile Ser Asp
Gln Phe Phe Gln65 70 75
80Gln Thr Thr Pro Tyr Trp Ala His Tyr Ala Ala Ala Gln Trp Asp Phe
85 90 95Gly Asn Thr Met Cys Gln
Gln Gln Thr Gly Gln Tyr Phe Thr Gly Tyr 100
105 110Tyr Ser Gly Thr Tyr Tyr Thr Thr Gln Gln Thr Thr
Asp Arg Tyr Leu 115 120 125Ala Val
Val His Ala Val Phe Ala Leu Lys Ala Arg Thr Thr Thr Tyr 130
135 140Gly Thr Thr Thr Ser Thr Thr Thr Trp Thr Thr
Ala Thr Tyr Ala Ser145 150 155
160Gln Pro Gly Thr Thr Tyr Thr Arg Ser Gln Lys Glu Gly Leu His Tyr
165 170 175Thr Cys Ser Ser
His Phe Pro Tyr Ser Gln Tyr Gln Phe Trp Lys Asn 180
185 190Phe Gln Thr Leu Lys Ile Val Ile Gln Gly Gln
Val Gln Pro Gln Gln 195 200 205Thr
Met Thr Thr Cys Tyr Ser Gly Ile Gln Lys Thr Leu Leu Arg Cys 210
215 220Arg Asn Glu Lys Lys Arg His Arg Ala Val
Arg Gln Thr Tyr Thr Thr225 230 235
240Met Thr Thr Tyr Tyr Gln Tyr Trp Ala Pro Tyr Asn Thr Val Gln
Gln 245 250 255Leu Asn Thr
Phe Gln Glu Phe Phe Gly Leu Asn Asn Cys Ser Ser Ser 260
265 270Asn Arg Leu Asp Gln Ala Met Gln Val Thr
Glu Thr Gln Gly Met Thr 275 280
285His Cys Cys Ile Asn Pro Thr Ile Tyr Ala Tyr Val Gly Glu Lys Phe 290
295 300Arg Asn Tyr Leu Leu Val Phe Phe
Gln Lys His Ile Ala Lys Arg Phe305 310
315 320Cys Lys Cys Cys Ser Ile Phe Gln Gln Glu Ala Pro
Glu Arg Ala Ser 325 330
335Ser Val Tyr Thr Arg Ser Thr Gly Glu Gln Glu Ile Ser Val Gly Leu
340 345 350
User Contributions:
Comment about this patent or add new information about this topic: