Patent application title: METHODS OF USING CHIMERIC ANTIGEN RECEPTORS TARGETING B-CELL MATURATION ANTIGEN AND USES THEREOF
Inventors:
Renier J. Brentjens (New York, NY, US)
Eric L. Smith (Brookline, MA, US)
Cheng Liu (Emeryville, CA, US)
Cheng Liu (Emeryville, CA, US)
Assignees:
MEMORIAL SLOAN-KETTERING CANCER CENTER
Eureka Therapeutics, Inc.
IPC8 Class:
USPC Class:
1 1
Class name:
Publication date: 2021-11-11
Patent application number: 20210346432
Abstract:
The presently disclosed subject matter provides for methods and
compositions for treating multiple myeloma. It relates to chimeric
antigen receptors (CARs) that specifically target B cell maturation
antigen (BCMA), and immunoresponsive cells comprising such CARs. The
presently disclosed BCMA-specific CARs have enhanced immune-activating
properties, including anti-tumor activity.Claims:
1-58. (canceled)
59. A chimeric antigen receptor (CAR), comprising an extracellular antigen-binding domain that binds to B cell maturation antigen (BCMA), a transmembrane domain and an intracellular domain, wherein the extracellular antigen-binding domain comprises: (a) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:1, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:2; (b) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:5, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:6; (c) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:9, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:10; (d) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:13, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:14; (e) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:17, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:18; (f) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:21, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth SEQ ID NO:22; (g) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:25, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:26; (h) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:29, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:30; (i) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:33, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:34; (j) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:37, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:38; (k) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:41, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:42; (l) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:45, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:46; (m) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:49, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:50; (n) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:53, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:54; (o) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:57, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:58; (p) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:61, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:62; or (q) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:65, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:66.
60. A chimeric antigen receptor (CAR), comprising an extracellular antigen-binding domain that binds to B cell maturation antigen (BCMA), a transmembrane domain, and an intracellular domain, wherein the extracellular antigen-binding domain comprises: (a) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 89, 90 and 91, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 92, 93, and 94, respectively; (b) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 95, 96, and 97, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 98, 99, and 100, respectively; (c) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 101, 102 and 103, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 104, 105, and 106, respectively; (d) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 107, 108, and 109, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 110, 111, and 112, respectively; (e) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 113, 114, and 115, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 116, 117, and 118, respectively; (f) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 119, 120, and 121, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 122, 123, and 124, respectively; (g) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 125, 126, and 127, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 128, 129, and 130, respectively; (h) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 131, 132, and 133, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 134, 135, and 136, respectively; (i) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 137, 138, and 139, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 140, 141, and 142, respectively; (j) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 143, 144, and 145, respectively; the light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 146, 147 and 148, respectively; (k) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 149, 150, and 151, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 152, 153, and 154, respectively; (l) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 155, 156, and 157, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 158, 159, and 160, respectively; (m) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 161, 162, and 163, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 164, 165, and 166, respectively; (n) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 167, 168, and 169, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 170, 171, and 172, respectively (o) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 173, 174, and 175, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 176, 177, and 178, respectively; (p) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 179, 180, and 181, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 182, 183, and 184, respectively; or (q) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 185, 186, and 187, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 188, 189, and 190, respectively.
61. The CAR of claim 59, wherein: (a) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:1, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:2; (b) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:5, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:6; (c) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:9, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:10; (d) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:13, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:14; (e) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:17, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:18; (f) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:21, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:22; (g) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:25, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:26; (h) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:29, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:30; (i) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:33, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:34; (j) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:37, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:38; (k) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:41, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:42; (l) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:45, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:46; (m) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:49, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:50; (n) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:53, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:54 (o) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:57, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:58; (p) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:61, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:62; or (q) the heavy chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:65, and the light chain variable region comprises an amino acid sequence that has at least about 90% sequence identity to the sequence set forth in SEQ ID NO:66.
62. The CAR of claim 59, wherein: (a) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:1, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:2; (b) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:5, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:6; (c) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:9, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:10; (d) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:13, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:14; (e) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:17, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:18; (f) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:21, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:22; (g) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:25, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:26; (h) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:29, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:30; (i) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:33, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:34; (j) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:37, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:38; (k) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:41, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:42; (l) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:45, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:46; (m) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:49, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:50; (n) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:53, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:54; (o) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:57, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:58; (p) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:61, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:62; or (q) the heavy chain variable region comprises the amino acid sequence set forth in SEQ ID NO:65, and the light chain variable region comprises the amino acid sequence set forth in SEQ ID NO:66.
63. The CAR of claim 59, wherein the extracellular antigen-binding domain comprises the amino acid sequence set forth in SEQ ID NO: 72, SEQ ID NO: 73, SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID NO: 76, SEQ ID NO: 77, SEQ ID NO: 78, SEQ ID NO: 79, SEQ ID NO: 80, SEQ ID NO: 81, SEQ ID NO: 82, SEQ ID NO: 83, SEQ ID NO: 84, SEQ ID NO: 85, SEQ ID NO: 86, SEQ ID NO: 87, or SEQ ID NO: 88.
64. An immunoresponsive cell comprising the CAR of claim 59.
65. An immunoresponsive cell comprising the CAR of claim 60.
66. A nucleic acid molecule encoding the CAR of claim 59.
67. A nucleic acid molecule encoding the CAR of claim 60.
68. A vector comprising the nucleic acid molecule of claim 66.
69. A vector comprising the nucleic acid molecule of claim 67.
70. A host cell comprising the nucleic acid molecule of claim 66.
71. A host cell comprising the nucleic acid molecule of claim 67.
72. A T cell comprising the CAR of claim 59.
73. The T cell of claim 72, wherein the T cell is a CD4+ T cell and/or a CD8+ T cell.
74. A T cell comprising the CAR of claim 60.
75. The T cell of claim 74, wherein the T cell is a CD4+ T cell and/or a CD8+ T cell.
76. A pharmaceutical composition comprising an effective amount of the immunoresponsive cell of claim 64 and a pharmaceutically acceptable excipient.
77. A pharmaceutical composition comprising an effective amount of the immunoresponsive cell of claim 65 and a pharmaceutically acceptable excipient.
78. A pharmaceutical composition comprising an effective amount of the T cell of claim 72 and a pharmaceutically acceptable excipient.
79. A pharmaceutical composition comprising an effective amount of the T cell of claim 73 and a pharmaceutically acceptable excipient.
80. A method for producing an immunoresponsive cell that binds to B cell maturation antigen (BMCA), comprising introducing into the immunoresponsive cell a nucleic acid sequence that encodes the chimeric antigen receptor (CAR) of claim 59.
81. A kit for treating a tumor, the kit comprising the immunoresponsive cell of claim 64 and written instructions for using the immunoresponsive cell for treating a subject having a tumor.
82. A kit for treating a tumor, the kit comprising the T cell of claim 72 and written instructions for using the immunoresponsive cell for treating a subject having a tumor.
83. The kit of claim 82, wherein the tumor is selected from the group consisting of multiple myeloma, Non-Hodgkin Lymphoma, Hodgkin Lymphoma, Chronic Lymphocytic Leukemia (CLL), glioblastoma, and Waldenstrom's Macroglobulinemia.
84. A method of reducing tumor burden in a subject, the method comprising administering a composition comprising an immunoresponsive cell, wherein the immunoresponsive cell comprises an extracellular antigen-binding domain that binds to B cell maturation antigen (BCMA), a transmembrane domain and an intracellular domain, and wherein the extracellular antigen-binding domain comprises: (a) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:1, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:2; (b) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:5, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:6; (c) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:9, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:10; (d) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:13, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:14; (e) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:17, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:18; (f) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:21, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:22; (g) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:25, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:26; (h) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:29, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:30; (i) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:33, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:34; (j) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:37, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:38; (k) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:41, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:42; (l) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:45, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:46; (m) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:49, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:50; (n) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:53, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:54; (o) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:57, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:58; (p) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:61, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:62; or (q) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:65, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:66.
85. A method of reducing tumor burden in a subject, the method comprising administering a composition comprising an immunoresponsive cell, wherein the immunoresponsive cell comprises an extracellular antigen-binding domain that binds to B cell maturation antigen (BCMA), a transmembrane domain and an intracellular domain, and wherein the extracellular antigen-binding domain comprises: (a) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 89, 90 and 91, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 92, 93, and 94, respectively; (b) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 95, 96, and 97, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 98, 99, and 100, respectively; (c) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 101, 102 and 103, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 104, 105, and 106, respectively; (d) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 107, 108, and 109, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 110, 111, and 112, respectively; (e) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 113, 114, and 115, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 116, 117, and 118, respectively; (f) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 119, 120, and 121, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 122, 123, and 124, respectively; (g) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 125, 126, and 127, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 128, 129, and 130, respectively; (h) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 131, 132, and 133, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 134, 135, and 136, respectively; (i) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 137, 138, and 139, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 140, 141, and 142, respectively; (j) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 143, 144, and 145, respectively; the light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 146, 147 and 148, respectively; (k) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 149, 150, and 151, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 152, 153, and 154, respectively; (l) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 155, 156, and 157, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 158, 159, and 160, respectively; (m) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 161, 162, and 163, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 164, 165, and 166, respectively; (n) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 167, 168, and 169, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 170, 171, and 172, respectively (o) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 173, 174, and 175, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 176, 177, and 178, respectively; (p) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 179, 180, and 181, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 182, 183, and 184, respectively; or (q) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 185, 186, and 187, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 188, 189, and 190, respectively.
86. A method of reducing tumor burden in a subject, the method comprising administering a composition comprising a T cell, wherein the T cell comprises an extracellular antigen-binding domain that binds to B cell maturation antigen (BCMA), a transmembrane domain and an intracellular domain, and wherein the extracellular antigen-binding domain comprises: (a) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:1, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:2; (b) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:5, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:6; (c) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:9, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:10; (d) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:13, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:14; (e) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:17, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:18; (f) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:21, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:22; (g) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:25, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:26; (h) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:29, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:30; (i) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:33, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:34; (j) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:37, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:38; (k) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:41, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:42; (l) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:45, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:46; (m) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:49, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:50; (n) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:53, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:54; (o) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:57, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:58; (p) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:61, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:62; or (q) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 of the heavy chain variable region sequence set forth in SEQ ID NO:65, and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 of the light chain variable region sequence set forth in SEQ ID NO:66.
87. A method of reducing tumor burden in a subject, the method comprising administering a composition comprising a T cell, wherein the T cell comprises an extracellular antigen-binding domain that binds to B cell maturation antigen (BCMA), a transmembrane domain and an intracellular domain, and wherein the extracellular antigen-binding domain comprises: (a) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 89, 90 and 91, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 92, 93, and 94, respectively; (b) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 95, 96, and 97, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 98, 99, and 100, respectively; (c) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 101, 102 and 103, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 104, 105, and 106, respectively; (d) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 107, 108, and 109, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 110, 111, and 112, respectively; (e) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 113, 114, and 115, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 116, 117, and 118, respectively; (f) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 119, 120, and 121, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 122, 123, and 124, respectively; (g) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 125, 126, and 127, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 128, 129, and 130, respectively; (h) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 131, 132, and 133, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 134, 135, and 136, respectively; (i) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 137, 138, and 139, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 140, 141, and 142, respectively; (j) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 143, 144, and 145, respectively; the light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 146, 147 and 148, respectively; (k) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 149, 150, and 151, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 152, 153, and 154, respectively; (l) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 155, 156, and 157, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 158, 159, and 160, respectively; (m) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 161, 162, and 163, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 164, 165, and 166, respectively; (n) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 167, 168, and 169, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 170, 171, and 172, respectively (o) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 173, 174, and 175, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 176, 177, and 178, respectively; (p) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 179, 180, and 181, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 182, 183, and 184, respectively; or (q) a heavy chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 185, 186, and 187, respectively; and a light chain variable region comprising a CDR1, a CDR2, and a CDR3 sequence set forth in SEQ ID NOs: 188, 189, and 190, respectively.
88. The method of claim 86, wherein the T cell is a CD4+ T cell and/or a CD8+ T cell.
89. The method of claim 87, wherein the T cell is a CD4+ T cell and/or a CD8+ T cell.
90. The method of claim 84, wherein the method reduces the number of tumor cells, reduces tumor size, or eradicates the tumor in the subject.
91. The method of claim 84, wherein the tumor is selected from the group consisting of multiple myeloma, Non-Hodgkin Lymphoma, Hodgkin Lymphoma, Chronic Lymphocytic Leukemia (CLL), glioblastoma, and Waldenstrom's Macroglobulinemia.
92. The method of claim 86, wherein the method reduces the number of tumor cells, reduces tumor size, or eradicates the tumor in the subject.
93. The method of claim 86, wherein the tumor is selected from the group consisting of multiple myeloma, Non-Hodgkin Lymphoma, Hodgkin Lymphoma, Chronic Lymphocytic Leukemia (CLL), glioblastoma, and Waldenstrom's Macroglobulinemia.
94. The method of claim 84, wherein from about 10.sup.4 to about 10.sup.10, from about 10.sup.5 to about 10.sup.9, or from about 10.sup.6 to about 10.sup.8 immunoresponsive cells are administered to the subject.
95. The method of claim 86, wherein from about 10.sup.4 to about 10.sup.10, from about 10.sup.5 to about 10.sup.9, or from about 10.sup.6 to about 10.sup.8 T cells are administered to the subject.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application is a Divisional of U.S. patent application Ser. No. 16/844,610, filed Apr. 9, 2020, which is a Divisional of U.S. patent application Ser. No. 15/613,638, filed Jun. 5, 2017 now U.S. Pat. No. 10,918,665, which is a Continuation of International Application Serial No. PCT/US2015/064112, filed Dec. 4, 2015, which claims priority to U.S. Provisional Patent Application Ser. No. 62/088,309, filed Dec. 5, 2014, the contents of each of which are incorporated by reference in their entirety, and to each of which priority is claimed.
SEQUENCE LISTING
[0002] The specification further incorporates by reference the Sequence Listing submitted herewith via EFS on Apr. 12, 2021. Pursuant to 37 C.F.R. .sctn. 1.52(e)(5), the Sequence Listing text file, identified as 0893330379_SL.txt, is 234,929 bytes and was created on Apr. 12, 2021. The Sequence Listing electronically filed herewith, does not extend beyond the scope of the specification and thus does not contain new matter.
INTRODUCTION
[0003] The presently disclosed subject matter provides for methods and compositions for treating cancer. It relates to chimeric antigen receptors (CARs) that specifically target B-cell maturation antigen (BCMA), immunoresponsive cells comprising such CARs, and methods of using such cells for treating cancer (e.g., multiple myeloma).
BACKGROUND OF THE INVENTION
[0004] Cell-based immunotherapy is a therapy with curative potential for the treatment of cancer. T cells and other immune cells may be modified to target tumor antigens through the introduction of genetic material coding for artificial or synthetic receptors for antigen, termed Chimeric Antigen Receptors (CARs), specific to selected antigens. Targeted T cell therapy using CARs has shown recent clinical success in treating hematologic malignancies.
[0005] Multiple myeloma (MM) is the second most common hematologic malignancy..sup.8 Approximately 25% of patients have high-risk cytogenetics, which portends a median survival of less then 2 years..sup.9,10 While recent strides have been made, regardless of cytogenetics, the disease is still considered incurable outside the immuno-therapeutic graft versus myeloma (GvM) effect of an allogeneic transplant. However, allogeneic transplants are limited by ineligibility and high rates of transplant-associated morbidity and mortality..sup.11 Similar to the GvM effect, a potentially curative T cell effect may be achieved with minimal toxicity through autologous adoptive T cell therapy.
[0006] Myeloma may be an ideal disease to test adoptive T cell therapy. First, as indicated above, allogeneic transplants demonstrate that the T cell can be a curative treatment, even with minimal or no concomitant chemotherapy such as after non-myeloablative transplants or post-transplantation donor lymphocyte infusions. Second, conditioning chemotherapy, possibly through the mechanism of depleting regulatory T cells (Tregs), enhances the efficacy of adoptive T cell therapy,.sup.4,12 as such, the immediate post-autologous transplant period could be an optimal time to administer T cells, and myeloma is one of the few diseases where autologous stem cell transplantation is the standard of care. Third, the immunomodulatory drug lenalidomide may improve CAR based therapy, as has been shown in mice,.sup.13 and lenalidomide is commonly used to treat MM. Fourth, adoptive T cell therapy works best in bone marrow predominant disease such as ALL,.sup.6,7 when compared to solid tumors or extra-medullary CLL,.sup.4 and similar to ALL, myeloma is a disease of the bone marrow.
[0007] While there are various reasons to expect that adoptive T cell therapy may work well in MM, expanding adoptive T cell therapy to myeloma also poses unique challenges. Unlike other B-cell malignancies, CD19 expression is seen in only 2% of myeloma patients..sup.14 Furthermore, unlike CD19, the common extracellular immunophenotypic markers in myeloma (CD138, CD38, and CD56) are all co-expressed on other essential cell types, and we predict CARs to any of these targets would lead to unacceptable "off tumor, on target" toxicity.sup.6 which can be fatal even in targets where antibodies are well tolerated, as was the case with a HER2 targeted CAR..sup.15 Accordingly, there are needs for novel therapeutic strategies to design CARs targeting antigens that are highly expressed in MM cells and limited expression in normal tissues for treating multiple myeloma, which strategies capable of inducing potent tumor eradication with minimal toxicity and immunogenicity.
SUMMARY OF THE INVENTION
[0008] The presently disclosed subject matter generally provides chimeric antigen receptors (CARs) that specifically target B-cell maturation antigen (BCMA), immunoresponsive cells comprising such CARs, and uses of these CARs and immunoresponsive cells for treating multiple myeloma.
[0009] The presently disclosed subject matter provides CARs. In one non-limiting example, the CAR comprises an extracellular antigen-binding domain, a transmembrane domain and an intracellular domain, where the extracellular antigen-binding domain is a human single-chain variable fragment (scFv) that specifically binds to B cell maturation antigen (BCMA). In certain embodiments, the human scFv specifically binds to BCMA with a binding affinity (K.sub.D) of from about 1.times.10.sup.-9M to about 3.times.10.sup.-6M. In certain embodiments, the human scFv specifically binds to BCMA with a binding affinity (K.sub.D) of from about 1.times.10.sup.-9M to about 1.times.10.sup.-8M.
[0010] In certain embodiments, the human scFv comprises a heavy chain variable region comprising an amino acid sequence that is at least about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98% or about 99% homologous to an amino acid sequence selected from the group consisting of SEQ ID NOS: 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, and 65.
[0011] In certain embodiments, the human scFv comprises a light chain variable region comprising an amino acid sequence that is at least about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98% or about 99% homologous to an amino acid sequence selected from the group consisting of SEQ ID NOS: 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54, 58, 62, and 66.
[0012] In certain embodiments, the human scFv comprises (a) a heavy chain variable region comprising an amino acid sequence that is at least about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98% or about 99% homologous to an amino acid sequence selected from the group consisting of SEQ ID NO: 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, and 65; and (b) a light chain variable region comprising an amino acid sequence that is at least about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, about 89%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98% or about 99% homologous to an amino acid sequence selected from the group consisting of SEQ ID NOS: 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54, 58, 62, and 66.
[0013] In certain embodiments, the human scFv comprises a heavy chain variable region comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, and 65, and conservative modifications thereof.
[0014] In certain embodiments, the human scFv comprises a light chain variable region comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54, 58, 62, and 66, and conservative modifications thereof.
[0015] In certain embodiments, the human scFv comprises (a) a heavy chain variable region comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, and 65, and conservative modifications thereof, and (b) a light chain variable region comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54, 58, 62, and 66, and conservative modifications thereof.
[0016] In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having a sequence selected from the group consisting of: SEQ ID NOS:1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, and 65. In certain embodiments, the human scFv comprises a light chain variable region comprising amino acids having a sequence selected from the group consisting of: SEQ ID NOS:2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54, 58, 62, and 66. In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:21. In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:53. In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:57. In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:61. In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:65. In certain embodiments, the human scFv comprises a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:22. In certain embodiments, the human scFv comprises a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:54. In certain embodiments, the human scFv comprises a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:58. In certain embodiments, the human scFv comprises a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:62. In certain embodiments, the human scFv comprises a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:66. In certain embodiments, the extracellular antigen-binding domain comprises (a) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:1, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:2; (b) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:5, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:6; (c) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:9, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:10; (d) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:13, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:14; (e) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:17, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:18; (f) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:21, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:22; (g) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:25, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:26; (h) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:29, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:30; (i) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:33, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:34; (j) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:37, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:38; (k) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:41, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:42; (1) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:45, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:46; (m) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:49, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:50; (n) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:53, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:54; (o) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:57, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:58; (p) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:61, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:62; or (q) a heavy chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:65, and a light chain variable region comprising amino acids having a sequence set forth in SEQ ID NO:66. In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:21; and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:22. In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:53; and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:54. In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:57; and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:58. In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:61; and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:62. In certain embodiments, the human scFv comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:65; and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:66.
[0017] In certain non-limiting embodiments, the human scFv comprises both of said heavy and light chains, optionally with a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. For example, in certain non-limiting embodiments, the human scFv comprises (i) a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:53 and (ii) a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:54, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the human scFv comprises (i) a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:21 and (ii) a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:22, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the human scFv comprises (i) a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:57 and (ii) a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:58, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the human scFv comprises (i) a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:61 and (ii) a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:62, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the human scFv comprises (i) a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:65 and (ii) a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:66, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region.
[0018] In certain embodiments, the human scFv comprises (a) a heavy chain variable region CDR3 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 91, 97, 103, 109, 115, 121, 127, 133, 139, 145, 151, 157, 163, 169, 175, 181, and 187; and (b) a light chain variable region CDR3 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 94, 100, 106, 112, 118, 124, 130, 136, 142, 148, 154, 160, 166, 172, 178, 184, and 190.
[0019] In certain embodiments, the human scFv comprises (a) a heavy chain variable region CDR2 comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 90, 96, 102, 108, 114, 120, 126, 132, 138, 144, 150, 156, 162, 168, 174, 180, and 186, and conservative modifications thereof; and (b) a light chain variable region CDR2 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 93, 99, 105, 111, 117, 123, 129, 135, 141, 147, 153, 159, 165, 171, 177, 183, and 189, and conservative modifications thereof.
[0020] In certain embodiments, the human scFv comprises (a) a heavy chain variable region CDR1 comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 89, 95, 101, 107, 113, 119, 125, 131, 137, 143, 149, 155, 161, 167, 173, 179, and 185, and conservative modifications thereof; and (b) a light chain variable region CDR1 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 92, 98, 104, 110, 116, 122, 128, 134, 140, 146, 152, 158, 164, 170, 176, 182, and 188, and conservative modifications thereof. In certain embodiments, the human scFv comprises: (a) a heavy chain variable region CDR1 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 89, 95, 101, 107, 113, 119, 125, 131, 137, 143, 149, 155, 161, 167, 173, 179, and 185; (b) a heavy chain variable region CDR2 comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 90, 96, 102, 108, 114, 120, 126, 132, 138, 144, 150, 156, 162, 168, 174, 180, and 186; (c) a heavy chain variable region CDR3 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 91, 97, 103, 109, 115, 121, 127, 133, 139, 145, 151, 157, 163, 169, 175, 181, and 187; (d) a light chain variable region CDR1 comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 92, 98, 104, 110, 116, 122, 128, 134, 140, 146, 152, 158, 164, 170, 176, 182, and 188; (e) a light chain variable region CDR2 comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 93, 99, 105, 111, 117, 123, 129, 135, 141, 147, 153, 159, 165, 171, 177, 183, and 189; and (f) a light chain variable region CDR3 comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 94, 100, 106, 112, 118, 124, 130, 136, 142, 148, 154, 160, 166, 172, 178, 184, and 190. In certain embodiments, the human scFv comprises (a) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 89 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 90 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 91 or conservative modifications thereof; (b) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 95 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 96 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 97 or conservative modifications thereof; (c) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 101 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 102 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 103 or conservative modifications thereof; (d) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 107 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 108 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 109 or conservative modifications thereof; (e) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 113 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 114 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 115 or conservative modifications thereof; (f) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 119 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 120 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 121 or conservative modifications thereof; (g) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 125 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 126 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 127 or conservative modifications thereof; (h) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 131 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 132 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 133 or conservative modifications thereof; (i) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 137 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 138 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 139 or conservative modifications thereof; (j) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 143 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 144 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 145 or conservative modifications thereof; (k) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 149 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 150 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 151 or conservative modifications thereof; (1) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 155 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 156 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 157 or conservative modifications thereof; (m) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 161 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 162 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 163 or conservative modifications thereof; (n) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 167 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 168 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 169 or conservative modifications thereof; (o) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 173 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 174 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 175 or conservative modifications thereof; (p) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 179 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:180 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 181 or conservative modifications thereof; or (q) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 185 or conservative modifications thereof; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 186 or conservative modifications thereof; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 187 or conservative modifications thereof. In certain embodiments, the human scFv comprises: a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 167; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 168; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 169. In certain embodiments, the human scFv comprises: a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 119; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 120; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 121. In certain embodiments, the human scFv comprises: a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 173; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 174; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 175. In certain embodiments, the human scFv comprises: a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 179; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 180; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 181. In certain embodiments, the human scFv comprises: a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 185; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 186; and a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 187.
[0021] In certain embodiments, the human scFv comprises (a) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 92 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:93 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 94 or conservative modifications thereof; (b) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 98 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:99 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 100 or conservative modifications thereof; (c) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 104 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:105 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 106 or conservative modifications thereof; (d) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 110 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:111 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 112 or conservative modifications thereof; (e) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 116 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:117 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 118 or conservative modifications thereof; (f) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 122 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:123 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 124 or conservative modifications thereof; (g) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 128 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:129 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 130 or conservative modifications thereof; (h) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 134 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:135 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 136 or conservative modifications thereof; (i) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 140 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:141 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 142 or conservative modifications thereof; (j) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 146 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:147 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 148 or conservative modifications thereof; (k) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 152 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:153 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 154 or conservative modifications thereof; (l) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 158 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:159 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 160 or conservative modifications thereof; (m) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 164 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:165 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 166 or conservative modifications thereof; (n) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 170 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 171 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 172 or conservative modifications thereof; (o) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 176 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 177 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 178 or conservative modifications thereof; (p) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 182 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 183 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 184 or conservative modifications thereof; or (q) a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 188 or conservative modifications thereof; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 189 or conservative modifications thereof; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 190 or conservative modifications thereof. In certain embodiments, the human scFv comprises: a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 122; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 123; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 124. In certain embodiments, the human scFv comprises: a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 170; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 171; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 172. In certain embodiments, the human scFv comprises: a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 176; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 177; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 178. In certain embodiments, the human scFv comprises: a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 182; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 183; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 184. In certain embodiments, the human scFv comprises: a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 188; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 189; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 190.
[0022] In certain embodiments, the human scFv comprises: (a) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 89; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 90; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 91; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 92; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 93; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 94; (b) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 95; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 96; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 97; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 98; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 99; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 100; (c) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 101; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 102; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 103; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 104; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 105; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 106; (d) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 107; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 108; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 109; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 110; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 111; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 112; (e) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 113; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 114; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 115; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 116; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 117; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 118; (f) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 119; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 120; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 121; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 122; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 123; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 124; (g) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 125; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 126; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 127; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 128; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 129; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 130; (h) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 131; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 132; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 133; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 134; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 135; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 136; (i) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 137; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 138; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 139; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 140; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 141; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 142; (j) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 143; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 144; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 145; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 146; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SE ID NO: 147; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 148; (k) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 149; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 150; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 151; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 152; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 153; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 154; (1) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 155; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 156; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 157; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 158; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 159; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 160; (m) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 161; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 162; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 163; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 164; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:165; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 166; (n) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 167; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 168; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 169; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 170; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 171; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 172; (o) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 173; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 174; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 175; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 176; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 177; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 178; (p) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 179; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 180; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 181; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 182; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 183; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 184; or (q) a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 185; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 186; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 187; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 188; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 189; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 190. In certain embodiments, the human scFv comprises: a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 119; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 120; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 121; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 122; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 123; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 124. In certain embodiments, the human scFv comprises: a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 167; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 168; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 169; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 170; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 171; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 172. In certain embodiments, the human scFv comprises: a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 173; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 174; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 175; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 176; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 177; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 178. In certain embodiments, the human scFv comprises: a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 179; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 180; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 181; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 182; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 183; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 184. In certain embodiments, the human scFv comprises: a heavy chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 185; a heavy chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 186; a heavy chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 187; a light chain variable region CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 188; a light chain variable region CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 189; and a light chain variable region CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 190.
[0023] In certain embodiments, the BCMA comprises the amino acid sequence set forth in SEQ ID NO:71. In certain embodiments, the human scFv binds to an epitope region comprising amino acids 14-22 of SEQ ID NO:71. In certain embodiments, the human scFv that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO:71 comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:21 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:22, In certain embodiments, the human scFv that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO:71 comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:119, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:120, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:121, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:122, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:123, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:124.
[0024] In accordance with the presently disclosed subject matter, the extracellular antigen-binding domain is covalently joined to a transmembrane domain. The extracellular antigen-binding domain can comprise a signal peptide that is covalently joined to the 5' terminus of the extracellular antigen-binding domain. In certain embodiments, the transmembrane domain of the CAR comprises a CD8 polypeptide, a CD28 polypeptide, a CD3.zeta. polypeptide, a CD4 polypeptide, a 4-1BB polypeptide, an OX40 polypeptide, an ICOS polypeptide, a CTLA-4 polypeptide, a PD-1 polypeptide, a LAG-3 polypeptide, a 2B4 polypeptide, a BTLA polypeptide, a synthetic peptide (not based on a protein associated with the immune response), or a combination thereof. In certain embodiments, the transmembrane domain comprises a CD8 polypeptide. In certain embodiments, the transmembrane domain comprises a CD28 polypeptide.
[0025] In accordance with the presently disclosed subject matter, the intracellular domain comprises a CD3.zeta. polypeptide. In certain embodiments, the intracellular domain further comprises at least one signaling region. In certain embodiments, the at least one signaling region comprises a CD28 polypeptide, a 4-1BB polypeptide, an OX40 polypeptide, an ICOS polypeptide, a DAP-10 polypeptide, a PD-1 polypeptide, a CTLA-4 polypeptide, a LAG-3 polypeptide, a 2B4 polypeptide, a BTLA polypeptide, a synthetic peptide (not based on a protein associated with the immune response), or a combination thereof. In certain embodiments, the signaling region is a co-stimulatory signaling region. In certain embodiments, the co-stimulatory signaling region comprises a CD28 polypeptide, a 4-1BB polypeptide, an OX40 polypeptide, an ICOS polypeptide, a DAP-10 polypeptide, or a combination thereof. In certain embodiments, the at least one co-stimulatory signaling region comprises a CD28 polypeptide. In certain embodiments, the at least one signaling region comprises a 4-1BB polypeptide. In one specific non-limiting embodiment, the transmembrane domain comprises a CD28 polypeptide, the intracellular domain comprises a CD3.zeta. polypeptide, and the at least one signaling domain comprises a CD28 polypeptide.
[0026] In certain embodiments, the CAR is recombinantly expressed. The CAR can be expressed from a vector. In certain embodiments, the vector is a .gamma.-retroviral rector.
[0027] The presently disclosed subject matter also provides isolated immunoresponsive cells comprising the above-described CARs. In certain embodiments, the isolated immunoresponsive cell is transduced with the CAR, for example, the CAR is constitutively expressed on the surface of the immunoresponsive cell. In certain embodiments, the isolated immunoresponsive cell is further transduced with at least one co-stimulatory ligand such that the immunoresponsive cell expresses the at least one co-stimulatory ligand. In certain embodiments, the at least one co-stimulatory ligand is selected from the group consisting of 4-1BBL, CD80, CD86, CD70, OX40L, CD48, TNFRSF14, and combinations thereof. In certain embodiments, the isolated immunoresponsive cell is further transduced with at least one cytokine such that the immunoresponsive cell secrets the at least one cytokine. In certain embodiments, the at least cytokine is selected from the group consisting of IL-2, IL-3, IL-6, IL-7, IL-11, IL-12, IL-15, IL-17, IL-21, and combinations thereof. In certain embodiments, the isolated immunoresponsive cell is selected from the group consisting of a T cell, a Natural Killer (NK) cell, a cytotoxic T lymphocyte (CTL), a regulatory T cell, a human embryonic stem cell, a lymphoid progenitor cell, a T cell-precursor cell, and a pluripotent stem cell from which lymphoid cells may be differentiated. In certain embodiments, the immunoresponsive cell is a T cell.
[0028] The presently disclosed subject matter further provides nucleic acid molecules encoding the presently disclosed CARs, vectors comprising the nucleic acid molecules, and host cells expressing such nucleic acid molecules. In certain embodiments, the nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:207 In certain embodiments, the nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:208. In certain embodiments, the nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:209. In certain embodiments, the nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:229. In certain embodiments, the nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:230. In certain embodiments, the nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:231. In certain embodiments, the nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:232. In certain embodiments, the nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:233. In certain embodiments, the nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:234. In certain embodiments, the nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:235. In certain embodiments, the vector is a .gamma.-retroviral vector. In certain embodiments, the host cell is a T cell.
[0029] Furthermore, the presently disclosed subject matter provides methods of using the above-described immunoresponsive cell for reducing tumor burden in a subject. For example, the presently disclosed subject matter provides methods of reducing tumor burden in a subject, where the method comprises administering an effective amount of the presently disclosed immunoresponsive cell to the subject, thereby inducing tumor cell death in the subject. In certain embodiments, the method reduces the number of tumor cells. In another embodiment, the method reduces the tumor size. In yet another embodiment, the method eradicates the tumor in the subject. In certain embodiments, the tumor is selected from the group consisting of multiple myeloma, Non-Hodgkin Lymphoma, Hodgkin Lymphoma, Chronic Lymphocytic Leukemia (CLL), glioblastoma, and Waldenstrom's Macroglobulinemia. In certain embodiments, the tumor is multiple myeloma. In certain embodiments, the subject is a human. In certain embodiments, the immunoresponsive cell is a T cell.
[0030] Furthermore, the presently disclosed subject matter provides methods of using the above-described immunoresponsive cell for increasing or lengthening survival of a subject having neoplasia. For example, the presently disclosed subject matter provides methods of increasing or lengthening survival of a subject having neoplasia, where the method comprises administering an effective amount of the presently disclosed immunoresponsive cell to the subject, thereby increasing or lengthening survival of the subject. In certain embodiments, the neoplasia is selected from the group consisting of multiple myeloma, Non-Hodgkin Lymphoma, Hodgkin Lymphoma, Chronic Lymphocytic Leukemia (CLL), glioblastoma, and Waldenstrom's Macroglobulinemia. In certain embodiments, the neoplasia is multiple myeloma. In certain embodiments, the method reduces or eradicates tumor burden in the subject.
[0031] The presently disclosed subject matter also provides methods for producing an immunoresponsive cell that binds to BCMA. In one non-limiting example, the method comprises introducing into the immunoresponsive cell a nucleic acid sequence that encodes a chimeric antigen receptor (CAR), which comprises an extracellular antigen-binding domain, a transmembrane domain and an intracellular domain, wherein the extracellular antigen-binding domain comprises a human scFv that specifically binds to BCMA.
[0032] The presently disclosed subject matter further provides pharmaceutical compositions comprising an effective amount of the presently disclosed immunoresponsive cells and a pharmaceutically acceptable excipient. In certain embodiments, the pharmaceutical compositions are for treating a neoplasia. In certain embodiments, the neoplasia is selected from the group consisting of multiple myeloma, Non-Hodgkin Lymphoma, Hodgkin Lymphoma, Chronic Lymphocytic Leukemia (CLL), glioblastoma, and Waldenstrom's Macroglobulinemia. In certain embodiments, the neoplasia is multiple myeloma.
[0033] The presently disclosed subject matter further provides kits for treating a neoplasia, comprising the presently disclosed immunoresponsive cells. In certain embodiments, the kit further include written instructions for using the immunoresponsive cell for treating a neoplasia. In certain embodiments, the neoplasia is selected from the group consisting of multiple myeloma, Non-Hodgkin Lymphoma, Hodgkin Lymphoma, Chronic Lymphocytic Leukemia (CLL), glioblastoma, and Waldenstrom's Macroglobulinemia. In certain embodiments, the neoplasia is multiple myeloma.
BRIEF DESCRIPTION OF THE FIGURES
[0034] The following Detailed Description, given by way of example, but not intended to limit the invention to specific embodiments described, may be understood in conjunction with the accompanying drawings.
[0035] FIG. 1 shows a chimeric antigen receptor targeting BCMA in accordance with one non-limiting embodiment of the presently disclosed subject matter.
[0036] FIGS. 2A-2D depict the human BCMA expression in normal tissues and human cancer cell lines.
[0037] FIG. 3 depicts expression of the presently disclosed BCMA CAR on human T cells.
[0038] FIG. 4 depicts the cross-reacting activity of human scFv's targeting human BCMA with mouse BCMA.
[0039] FIG. 5 depicts the killing activity of the presently disclosed BCMA for 3T3 cells overexpressing BCMA.
[0040] FIG. 6 depicts the killing activity of the presently disclosed BCMA for a human multiple myeloma cell line.
[0041] FIG. 7 shows a chimeric antigen receptor targeting BCMA in accordance with one non-limiting embodiment of the presently disclosed subject matter.
[0042] FIG. 8 depicts a nucleic acid molecule that encodes a BCMA-targeted CAR in accordance with one non-limiting embodiment of the presently disclosed subject matter.
[0043] FIG. 9 depicts a nucleic acid molecule that encodes a BCMA-targeted CAR in accordance with one non-limiting embodiment of the presently disclosed subject matter.
[0044] FIG. 10 depicts a nucleic acid molecule that encodes a BCMA-targeted CAR in accordance with one non-limiting embodiment of the presently disclosed subject matter.
[0045] FIG. 11 depicts a nucleic acid molecule that encodes a BCMA-targeted CAR in accordance with one non-limiting embodiment of the presently disclosed subject matter.
[0046] FIG. 12 depicts a nucleic acid molecule that encodes a BCMA-targeted CAR in accordance with one non-limiting embodiment of the presently disclosed subject matter.
[0047] FIG. 13 depicts the cytotoxicity of BCMA targeted CAR T cells for human multiple myeloma cell lines.
[0048] FIG. 14 depicts induction of cytokine secretion of BCMA targeted CAR T cells.
[0049] FIG. 15 depicts anti-tumor activity of BCMA targeted CAR T cells.
[0050] FIGS. 16A and 16B depict the killing activity of BCMA targeted CAR T cells. (A) Shows the percent of GFP.sup.+ tumor line at time 0. (B) Shows the killing the percent of GFP.sup.+ tumor line at time 36 hours.
[0051] FIG. 17 depicts epitope mapping of ET140-3.
[0052] FIG. 18 depicts epitope mapping of ET140-24.
[0053] FIG. 19 depicts epitope mapping of ET140-54.
[0054] FIG. 20 depicts epitope mapping of ET140-3, ET140-24, and ET140-54.
[0055] FIG. 21 depicts ELISA screening data of ET140-3, ET140-24, ET140-37, ET140-40, and ET140-54.
[0056] FIGS. 22A-22D depict FCAS screening data of ET140-3 (FIG. 22B), ET140-24 (FIG. 22C), ET140-37 (FIG. 22C), ET140-40 (FIG. 22D), and ET140-54 (FIG. 22D).
DETAILED DESCRIPTION OF THE INVENTION
[0057] The presently disclosed subject matter generally provides chimeric antigen receptors (CARs) targeting BCMA. In one non-limiting example, the CAR comprises an extracellular antigen-binding domain, a transmembrane domain and an intracellular domain, where the extracellular antigen-binding domain specifically binds to BCMA. The presently disclosed subject matter also provides immunoresponsive cells (e.g., T cell, a Natural Killer (NK) cell, a cytotoxic T lymphocyte (CTL), a regulatory T cell, a human embryonic stem cell, a lymphoid progenitor cell, a T cell-precursor cell, and a pluripotent stem cell from which lymphoid cells may be differentiated) expressing the BCMA-targeted CARs, and methods of using such immunoresponsive cells for treating a tumor, e.g., multiple myeloma.
I. Definitions
[0058] Unless defined otherwise, all technical and scientific terms used herein have the meaning commonly understood by a person skilled in the art to which this invention belongs. The following references provide one of skill with a general definition of many of the terms used in this invention: Singleton et al., Dictionary of Microbiology and Molecular Biology (2nd ed. 1994); The Cambridge Dictionary of Science and Technology (Walker ed., 1988); The Glossary of Genetics, 5th Ed., R. Rieger et al. (eds.), Springer Verlag (1991); and Hale & Marham, The Harper Collins Dictionary of Biology (1991). As used herein, the following terms have the meanings ascribed to them below, unless specified otherwise.
[0059] As used herein, the term "about" or "approximately" means within an acceptable error range for the particular value as determined by one of ordinary skill in the art, which will depend in part on how the value is measured or determined, i.e., the limitations of the measurement system. For example, "about" can mean within 3 or more than 3 standard deviations, per the practice in the art. Alternatively, "about" can mean a range of up to 20%, preferably up to 10%, more preferably up to 5%, and more preferably still up to 1% of a given value. Alternatively, particularly with respect to biological systems or processes, the term can mean within an order of magnitude, preferably within 5-fold, and more preferably within 2-fold, of a value.
[0060] As used herein, the term "cell population" refers to a group of at least two cells expressing similar or different phenotypes. In non-limiting examples, a cell population can include at least about 10, at least about 100, at least about 200, at least about 300, at least about 400, at least about 500, at least about 600, at least about 700, at least about 800, at least about 900, at least about 1000 cells expressing similar or different phenotypes.
[0061] As used herein, the term "antibody" means not only intact antibody molecules, but also fragments of antibody molecules that retain immunogen-binding ability. Such fragments are also well known in the art and are regularly employed both in vitro and in vivo. Accordingly, as used herein, the term "antibody" means not only intact immunoglobulin molecules but also the well-known active fragments F(ab').sub.2, and Fab. F(ab').sub.2, and Fab fragments that lack the Fe fragment of intact antibody, clear more rapidly from the circulation, and may have less non-specific tissue binding of an intact antibody (Wahl et al., J. Nucl. Med. 24:316-325 (1983). The antibodies of the invention comprise whole native antibodies, bispecific antibodies; chimeric antibodies; Fab, Fab', single chain V region fragments (scFv), fusion polypeptides, and unconventional antibodies.
[0062] As used herein, the term "single-chain variable fragment" or "scFv" is a fusion protein of the variable regions of the heavy (V.sub.H) and light chains (V.sub.L) of an immunoglobulin (e.g., mouse or human) covalently linked to form a V.sub.H::V.sub.L heterodimer. The heavy (V.sub.H) and light chains (V.sub.L) are either joined directly or joined by a peptide-encoding linker (e.g., 10, 15, 20, 25 amino acids), which connects the N-terminus of the V.sub.H with the C-terminus of the V.sub.L, or the C-terminus of the V.sub.H with the N-terminus of the V.sub.L. The linker is usually rich in glycine for flexibility, as well as serine or threonine for solubility. The linker can link the heavy chain variable region and the light chain variable region of the extracellular antigen-binding domain. Non-limiting examples of linkers are disclosed in Shen et al., Anal. Chem. 80(6):1910-1917 (2008) and WO 2014/087010, the contents of which are hereby incorporated by reference in their entireties. In certain embodiments, the linker is a G45 linker.
[0063] In a non-limiting example, the linker comprises amino acids having the sequence set forth in SEQ ID NO:210 as provided below.
TABLE-US-00001 [SEQ ID NO: 210] GGGGSGGGGSGGGGS.
[0064] In certain embodiments, the nucleic acid sequence encoding the amino acid sequence of SEQ ID NO:210 is set forth in SEQ ID NO:211, which is provided below:
TABLE-US-00002 [SEQ ID NO: 211] GGTGGAGGTGGATCAGGTGGAGGTGGATCTGGTGGAGGTGGATCT.
[0065] In one non-limiting example, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69 as provided below.
TABLE-US-00003 [SEQ ID NO: 69] SRGGGGSGGGGSGGGGSLEMA
[0066] In certain embodiments, the nucleic acid sequence encoding the amino acid sequence of SEQ ID NO:69 is set forth in SEQ ID NO:70, which is provided below:
TABLE-US-00004 [SEQ ID NO: 70] tctagaggtggtggtggtagcggcggcggcggctctggtggtggtggat ccctcgagatggcc
[0067] Despite removal of the constant regions and the introduction of a linker, scFv proteins retain the specificity of the original immunoglobulin. Single chain Fv polypeptide antibodies can be expressed from a nucleic acid comprising V.sub.H- and V.sub.L-encoding sequences as described by Huston, et al. (Proc. Nat. Acad. Sci. USA, 85:5879-5883, 1988). See, also, U.S. Pat. Nos. 5,091,513, 5,132,405 and 4,956,778; and U.S. Patent Publication Nos. 20050196754 and 20050196754. Antagonistic scFvs having inhibitory activity have been described (see, e.g., Zhao et al., Hyrbidoma (Larchmt) 2008 27(6):455-51; Peter et al., J Cachexia Sarcopenia Muscle 2012 Aug. 12; Shieh et al., J Imunol 2009 183(4):2277-85; Giomarelli et al., Thromb Haemost 2007 97(6):955-63; Fife et al., J Clin Invst 2006 116(8):2252-61; Brocks et al., Immunotechnology 1997 3(3):173-84; Moosmayer et al., Ther Immunol 1995 2(10:31-40). Agonistic scFvs having stimulatory activity have been described (see, e.g., Peter et al., J Bioi Chem 2003 25278(38):36740-7; Xie et al., Nat Biotech 1997 15(8):768-71; Ledbetter et al., Crit Rev Immunol 1997 17(5-6):427-55; Ho et al., BioChim Biophys Acta 2003 1638(3):257-66).
[0068] As used herein, "F(ab)" refers to a fragment of an antibody structure that binds to an antigen but is monovalent and does not have a Fc portion, for example, an antibody digested by the enzyme papain yields two F(ab) fragments and an Fc fragment (e.g., a heavy (H) chain constant region; Fc region that does not bind to an antigen).
[0069] As used herein, "F(ab').sub.2" refers to an antibody fragment generated by pepsin digestion of whole IgG antibodies, wherein this fragment has two antigen binding (ab') (bivalent) regions, wherein each (ab') region comprises two separate amino acid chains, a part of a H chain and a light (L) chain linked by an S--S bond for binding an antigen and where the remaining H chain portions are linked together. A "F(ab').sub.2" fragment can be split into two individual Fab' fragments.
[0070] As used herein, the term "vector" refers to any genetic element, such as a plasmid, phage, transposon, cosmid, chromosome, virus, virion, etc., which is capable of replication when associated with the proper control elements and which can transfer gene sequences into cells. Thus, the term includes cloning and expression vehicles, as well as viral vectors and plasmid vectors.
[0071] As used herein, the term "expression vector" refers to a recombinant nucleic acid sequence, i.e. recombinant DNA molecule, containing a desired coding sequence and appropriate nucleic acid sequences necessary for the expression of the operably linked coding sequence in a particular host organism. Nucleic acid sequences necessary for expression in prokaryotes usually include a promoter, an operator (optional), and a ribosome binding site, often along with other sequences. Eukaryotic cells are known to utilize promoters, enhancers, and termination and polyadenylation signals.
[0072] As used herein, "CDRs" are defined as the complementarity determining region amino acid sequences of an antibody which are the hypervariable regions of immunoglobulin heavy and light chains. See, e.g., Kabat et al., Sequences of Proteins of Immunological Interest, 4th U. S. Department of Health and Human Services, National Institutes of Health (1987). Generally, antibodies comprise three heavy chain and three light chain CDRs or CDR regions in the variable region. CDRs provide the majority of contact residues for the binding of the antibody to the antigen or epitope. In certain embodiments, the CDRs regions are delineated using the Kabat system (Kabat, E. A., et al. (1991) Sequences of Proteins of Immunological Interest, Fifth Edition, U.S. Department of Health and Human Services, NIH Publication No. 91-3242).
[0073] As used herein, the term "affinity" is meant a measure of binding strength. Without being bound to theory, affinity depends on the closeness of stereochemical fit between antibody combining sites and antigen determinants, on the size of the area of contact between them, and on the distribution of charged and hydrophobic groups. Affinity also includes the term "avidity," which refers to the strength of the antigen-antibody bond after formation of reversible complexes. Methods for calculating the affinity of an antibody for an antigen are known in the art, comprising use of binding experiments to calculate affinity. Antibody activity in functional assays (e.g., flow cytometry assay) is also reflective of antibody affinity. Antibodies and affinities can be phenotypically characterized and compared using functional assays (e.g., flow cytometry assay).
[0074] Nucleic acid molecules useful in the methods of the invention include any nucleic acid molecule that encodes a polypeptide of the invention or a fragment thereof. Such nucleic acid molecules need not be 100% identical with an endogenous nucleic acid sequence, but will typically exhibit substantial identity. Polynucleotides having "substantial identity" to an endogenous sequence are typically capable of hybridizing with at least one strand of a double-stranded nucleic acid molecule. By "hybridize" is meant pair to form a double-stranded molecule between complementary polynucleotide sequences (e.g., a gene described herein), or portions thereof, under various conditions of stringency. (See, e.g., Wahl, G. M. and S. L. Berger (1987) Methods Enzymol. 152:399; Kimmel, A. R. (1987) Methods Enzymol. 152:507).
[0075] For example, stringent salt concentration will ordinarily be less than about 750 mM NaCl and 75 mM trisodium citrate, preferably less than about 500 mM NaCl and 50 mM trisodium citrate, and more preferably less than about 250 mM NaCl and 25 mM trisodium citrate. Low stringency hybridization can be obtained in the absence of organic solvent, e.g., formamide, while high stringency hybridization can be obtained in the presence of at least about 35% formamide, and more preferably at least about 50% formamide. Stringent temperature conditions will ordinarily include temperatures of at least about 30.degree. C., more preferably of at least about 37.degree. C., and most preferably of at least about 42.degree. C. Varying additional parameters, such as hybridization time, the concentration of detergent, e.g., sodium dodecyl sulfate (SDS), and the inclusion or exclusion of carrier DNA, are well known to those skilled in the art. Various levels of stringency are accomplished by combining these various conditions as needed. In a preferred: embodiment, hybridization will occur at 30.degree. C. in 750 mM NaCl, 75 mM trisodium citrate, and 1% SDS. In a more preferred embodiment, hybridization will occur at 37.degree. C. in 500 mM NaCl, 50 mM trisodium citrate, 1% SDS, 35% formamide, and 100 .mu.g/ml denatured salmon sperm DNA (ssDNA). In a most preferred embodiment, hybridization will occur at 42.degree. C. in 250 mM NaCl, 25 mM trisodium citrate, 1% SDS, 50% formamide, and 200 .mu.g/ml ssDNA. Useful variations on these conditions will be readily apparent to those skilled in the art.
[0076] For most applications, washing steps that follow hybridization will also vary in stringency. Wash stringency conditions can be defined by salt concentration and by temperature. As above, wash stringency can be increased by decreasing salt concentration or by increasing temperature. For example, stringent salt concentration for the wash steps will preferably be less than about 30 mM NaCl and 3 mM trisodium citrate, and most preferably less than about 15 mM NaCl and 1.5 mM trisodium citrate. Stringent temperature conditions for the wash steps will ordinarily include a temperature of at least about 25.degree. C., more preferably of at least about 42.degree. C., and even more preferably of at least about 68.degree. C. In a preferred embodiment, wash steps will occur at 25.degree. C. in 30 mM NaCl, 3 mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps will occur at 42.degree. C. in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps will occur at 68.degree. C. in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. Additional variations on these conditions will be readily apparent to those skilled in the art. Hybridization techniques are well known to those skilled in the art and are described, for example, in Benton and Davis (Science 196:180, 1977); Grunstein and Rogness (Proc. Natl. Acad. Sci., USA 72:3961, 1975); Ausubel et al. (Current Protocols in Molecular Biology, Wiley Interscience, New York, 2001); Berger and Kimmel (Guide to Molecular Cloning Techniques, 1987, Academic Press, New York); and Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York.
[0077] By "substantially identical" is meant a polypeptide or nucleic acid molecule exhibiting at least 50% identity to a reference amino acid sequence (for example, any one of the amino acid sequences described herein) or nucleic acid sequence (for example, any one of the nucleic acid sequences described herein). Preferably, such a sequence is at least 60%, more preferably 80% or 85%, and more preferably 90%, 95% or even 99% identical at the amino acid level or nucleic acid to the sequence used for comparison.
[0078] Sequence identity is typically measured using sequence analysis software (for example, Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705, BLAST, BESTFIT, GAP, or PILEUP/PRETTYBOX programs). Such software matches identical or similar sequences by assigning degrees of homology to various substitutions, deletions, and/or other modifications. In an exemplary approach to determining the degree of identity, a BLAST program may be used, with a probability score between e-3 and e-100 indicating a closely related sequence.
[0079] As used herein, the term "analog" refers to a structurally related polypeptide or nucleic acid molecule having the function of a reference polypeptide or nucleic acid molecule.
[0080] As used herein, the term "ligand" refers to a molecule that binds to a receptor. In particular, the ligand binds a receptor on another cell, allowing for cell-to-cell recognition and/or interaction.
[0081] As used herein, the term "disease" refers to any condition or disorder that damages or interferes with the normal function of a cell, tissue, or organ. Examples of diseases include neoplasia or pathogen infection of cell.
[0082] As used herein, the term "effective amount" refers to an amount sufficient to have a therapeutic effect. In certain embodiments, an "effective amount" is an amount sufficient to arrest, ameliorate, or inhibit the continued proliferation, growth, or metastasis (e.g., invasion, or migration) of a neoplasia.
[0083] As used herein, the term "heterologous nucleic acid molecule or polypeptide" refers to a nucleic acid molecule (e.g., a cDNA, DNA or RNA molecule) or polypeptide that is not normally present in a cell or sample obtained from a cell. This nucleic acid may be from another organism, or it may be, for example, an mRNA molecule that is not normally expressed in a cell or sample.
[0084] As used herein, the term "immunoresponsive cell" refers to a cell that functions in an immune response or a progenitor, or progeny thereof.
[0085] As used herein, the term "modulate" refers positively or negatively alter. Exemplary modulations include an about 1%, about 2%, about 5%, about 10%, about 25%, about 50%, about 75%, or about 100% change.
[0086] As used herein, the term "increase" refers to alter positively by at least about 5%, including, but not limited to, alter positively by about 5%, by about 10%, by about 25%, by about 30%, by about 50%, by about 75%, or by about 100%.
[0087] As used herein, the term "reduce" refers to alter negatively by at least about 5% including, but not limited to, alter negatively by about 5%, by about 10%, by about 25%, by about 30%, by about 50%, by about 75%, or by about 100%.
[0088] As used herein, the term "isolated cell" refers to a cell that is separated from the molecular and/or cellular components that naturally accompany the cell.
[0089] As used herein, the term "isolated," "purified," or "biologically pure" refers to material that is free to varying degrees from components which normally accompany it as found in its native state. "Isolate" denotes a degree of separation from original source or surroundings. "Purify" denotes a degree of separation that is higher than isolation. A "purified" or "biologically pure" protein is sufficiently free of other materials such that any impurities do not materially affect the biological properties of the protein or cause other adverse consequences. That is, a nucleic acid or peptide of this invention is purified if it is substantially free of cellular material, viral material, or culture medium when produced by recombinant DNA techniques, or chemical precursors or other chemicals when chemically synthesized. Purity and homogeneity are typically determined using analytical chemistry techniques, for example, polyacrylamide gel electrophoresis or high performance liquid chromatography. The term "purified" can denote that a nucleic acid or protein gives rise to essentially one band in an electrophoretic gel. For a protein that can be subjected to modifications, for example, phosphorylation or glycosylation, different modifications may give rise to different isolated proteins, which can be separately purified.
[0090] As used herein, the term "secreted" is meant a polypeptide that is released from a cell via the secretory pathway through the endoplasmic reticulum, Golgi apparatus, and as a vesicle that transiently fuses at the cell plasma membrane, releasing the proteins outside of the cell.
[0091] As used herein, the term "specifically binds" or "specifically binds to" or "specifically target" is meant a polypeptide or fragment thereof that recognizes and binds a biological molecule of interest (e.g., a polypeptide), but which does not substantially recognize and bind other molecules in a sample, for example, a biological sample, which naturally includes a polypeptide of the invention.
[0092] As used herein, the term "treating" or "treatment" refers to clinical intervention in an attempt to alter the disease course of the individual or cell being treated, and can be performed either for prophylaxis or during the course of clinical pathology. Therapeutic effects of treatment include, without limitation, preventing occurrence or recurrence of disease, alleviation of symptoms, diminishment of any direct or indirect pathological consequences of the disease, preventing metastases, decreasing the rate of disease progression, amelioration or palliation of the disease state, and remission or improved prognosis. By preventing progression of a disease or disorder, a treatment can prevent deterioration due to a disorder in an affected or diagnosed subject or a subject suspected of having the disorder, but also a treatment may prevent the onset of the disorder or a symptom of the disorder in a subject at risk for the disorder or suspected of having the disorder.
[0093] As used herein, the term "subject" refers to any animal (e.g., a mammal), including, but not limited to, humans, non-human primates, rodents, and the like (e.g., which is to be the recipient of a particular treatment, or from whom cells are harvested).
II. B Cell Maturation Antigen (BCMA)
[0094] BCMA is an ideal target for Adoptive T cell therapy (e.g., CAR therapy) as BCMA is involved in B cell differentiation and signaling and is known to be expressed on non-malignant differentiated B cells and plasma cells. While there might be risk of inducing a B cell aplasia, B cell aplasias induced by the CD19 CAR have been remarkably well tolerated. Several groups have confirmed BCMA multiple myeloma (MM) surface expression, with one group finding it as an alternative to CD138 as a FACS marker for malignant plasma cells from fresh or frozen patient bone marrow samples with mean relative mean fluorescence intensity (MFI) between 9-16 (n=35)..sup.1,2
[0095] In certain non-limiting embodiments, BCMA is human BCMA having the amino acid sequence set forth in SEQ ID NO:71, or fragments thereof.
[0096] SEQ ID NO:71 is provided below:
TABLE-US-00005 [SEQ ID NO: 71] MLQMAGQCSQNEYFDSLLHACIPCQLRCSSNTPPLTCQRYCNASVTNSV KGTNAILWTCLGLSLIISLAVFVLMFLLRKINSEPLKDEFKNTGSGLLG MANIDLEKSRTGDEIILPRGLEYTVEECTCEDCIKSKPKVDSDHCFPLP AMEEGATILVTTKTNDYCKSLPAALSATEIEKSISAR
III. Chimeric Antigen Receptor (CAR)
[0097] Chimeric antigen receptors (CARs) are engineered receptors, which graft or confer a specificity of interest onto an immune effector cell. CARs can be used to graft the specificity of a monoclonal antibody onto a T cell; with transfer of their coding sequence facilitated by retroviral vectors.
[0098] There are three generations of CARs. "First generation" CARs are typically composed of an extracellular antigen binding domain (e.g., a single-chain variable fragments (scFv)) fused to a transmembrane domain, fused to cytoplasmic/intracellular domain of the T cell receptor chain. "First generation" CARs typically have the intracellular domain from the CD3.xi.- chain, which is the primary transmitter of signals from endogenous TCRs. "First generation" CARs can provide de novo antigen recognition and cause activation of both CD4.sup.+ and CD8.sup.+ T cells through their CD3.zeta. chain signaling domain in a single fusion molecule, independent of HLA-mediated antigen presentation. "Second generation" CARs add intracellular domains from various co-stimulatory molecules (e.g., CD28, 4-1BB, ICOS, OX40) to the cytoplasmic tail of the CAR to provide additional signals to the T cell. "Second generation" CARs comprise those that provide both co-stimulation (e.g., CD28 or 4-1BB) and activation (CD3.zeta.). Preclinical studies have indicated that "Second Generation" CARs can improve the anti-tumor activity of T cells. For example, robust efficacy of "Second Generation" CAR modified T cells was demonstrated in clinical trials targeting the CD19 molecule in patients with chronic lymphoblastic leukemia (CLL) and acute lymphoblastic leukemia (ALL). "Third generation" CARs comprise those that provide multiple co-stimulation (e.g., CD28 and 4-1BB) and activation (CD3.zeta.).
[0099] In accordance with the presently disclosed subject matter, the CARs comprise an extracellular antigen-binding domain, a transmembrane domain and an intracellular domain, where the extracellular antigen-binding domain binds to BCMA. In certain embodiments, the extracellular antigen-binding domain is a scFv. In certain embodiments, the extracellular antigen-binding domain is a Fab, which is optionally crosslinked. In a certain embodiments, the extracellular binding domain is a F(ab).sub.2. In certain embodiments, any of the foregoing molecules may be comprised in a fusion protein with a heterologous sequence to form the extracellular antigen-binding domain. In one specific non-limiting embodiment, the extracellular antigen-binding domain comprises a human scFv that binds specifically to human BCMA.
[0100] In certain non-limiting embodiments, the extracellular antigen-binding domain of a CAR has a high binding specificity as well as high binding affinity to the BCMA. For example, in such embodiments, the extracellular antigen-binding domain of the CAR (embodied, for example, in a human scFv or an analog thereof) binds to BCMA with a dissociation constant (K.sub.D) of about 3.times.10.sup.-6 M or less. In certain embodiments, the K.sub.D is about 1.times.10.sup.-6 M or less, about 1.times.10.sup.-7 M or less, about 1.times.10.sup.-8 M or less, or about 1.times.10.sup.-9M or less, about 1.times.10.sup.-10 M or less, or about 1.times.10.sup.-11 M or less. In certain embodiments, the K.sub.D is about 1.times.10.sup.-8 M or less. In certain embodiments, the K.sub.D is from about 1.times.10.sup.-11 M to about 3.times.10.sup.-6 M, such as from about 1.times.10.sup.-11 M to about 1.times.10.sup.-10 M, from about 1.times.10.sup.-10 M to about 1.times.10.sup.-9 M, from about 1.times.10.sup.-9M to about 1.times.10.sup.-8 M, from about 1.times.10.sup.-8 M to about 1.times.10.sup.-7 M, or from about 1.times.10.sup.-7 M to about 1.times.10.sup.-6 M, or from about 1.times.10.sup.-6 M to about 3.times.10.sup.-6 M. In certain embodiments, the K.sub.D is from about 1.times.10.sup.-9M to about 1.times.10.sup.-8 M. In certain embodiments, the K.sub.D is from about 1.times.10.sup.-9 M to about 1.5.times.10.sup.-9 M. In certain embodiments, the K.sub.D is about 1.2.times.10.sup.-9M. In certain embodiments, the K.sub.D is from about 4.times.10.sup.-9 M to about 5.times.10.sup.-9 M. In certain embodiments, the K.sub.D is about 5.times.10.sup.-9 M. In certain embodiments, the K.sub.D is about 4.8.times.10.sup.-9 M. In certain embodiments, the K.sub.D is from about 8.times.10.sup.-9M to about 9.times.10.sup.-9 M. In certain embodiments, the K.sub.D is about 8.times.10.sup.-9 M. In certain embodiments, the K.sub.D is about 8.1.times.10.sup.-9M.
[0101] Binding of the extracellular antigen-binding domain (embodiment, for example, in a human scFv or an analog thereof) of a presently disclosed CAR to BCMA can be confirmed by, for example, enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), FACS analysis, bioassay (e.g., growth inhibition), or Western Blot assay. Each of these assays generally detect the presence of protein-antibody complexes of particular interest by employing a labeled reagent (e.g., an antibody, or a scFv) specific for the complex of interest. For example, the scFv can be radioactively labeled and used in a radioimmunoassay (RIA) (see, for example, Weintraub, B., Principles of Radioimmunoassays, Seventh Training Course on Radioligand Assay Techniques, The Endocrine Society, March, 1986, which is incorporated by reference herein). The radioactive isotope can be detected by such means as the use of a .gamma. counter or a scintillation counter or by autoradiography. In certain embodiments, the BCMA-targeted extracellular antigen-binding domain is labeled with a fluorescent marker. Non-limiting examples of fluorescent markers include green fluorescent protein (GFP), blue fluorescent protein (e.g., EBFP, EBFP2, Azurite, and mKalama1), cyan fluorescent protein (e.g., ECFP, Cerulean, and CyPet), and yellow fluorescent protein (e.g., YFP, Citrine, Venus, and YPet). In certain embodiments, the BCMA-targeted human scFv is labeled with GFP.
[0102] In certain embodiments, the extracellular antigen-binding domain of a presently disclosed CAR comprises a single-chain variable fragment (scFv). In one specific non-limiting embodiment, the extracellular antigen-binding domain of a presently disclosed CAR comprises a human scFv that specifically binds to human BCMA. In certain embodiments, the scFv are identified by screening scFv phage library with BCMA-Fc fusion protein.
[0103] Extracellular Antigen Binding Domain of a CAR
[0104] In certain embodiments, the extracellular antigen-binding domain (e.g., human scFv) comprises a heavy chain variable region comprising amino acids having a sequence selected from the group consisting of: SEQ ID NOS:1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, and 65. The nucleic acid sequences encoding the amino acid sequence of SEQ ID NOS:1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, and 65 are 3, 7, 11, 15, 19, 23, 27, 31, 35, 39, 43, 47, 51, 55, 59, 63, and 67, respectively. In certain embodiments, the extracellular antigen-binding domain (e.g., human scFv) comprises a light chain variable region comprising amino acids having a sequence selected from the group consisting of SEQ ID NOS: 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54, 58, 62, and 66. The nucleic acid sequences encoding the amino acid sequence of SEQ ID NOS: 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54, 58, 62, and 66 are 4, 8, 12, 16, 20, 24, 28, 32, 36, 40, 44, 48, 52, 56, 60, 64, and 68, respectively. The sequences of SEQ ID NOS:1-68 are described in the following Tables 1-17.
[0105] In certain embodiments, the extracellular antigen-binding domain (e.g., scFv) comprises heavy and light chain variable regions comprising amino acid sequences that are homologous to the amino acid sequences described herein and as disclosed in Tables 1-17. For example, and not by way of limitation, the extracellular antigen-binding domain (e.g., scFv) comprises a heavy chain variable region comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to an amino acid sequence selected from the group consisting of SEQ ID NOS: 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, and 65.
[0106] In certain embodiments, the extracellular antigen-binding domain (e.g., scFv) comprises a light chain variable region comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to an amino acid sequence selected from the group consisting of SEQ ID NOS: 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54, 58, 62, and 66 are 4, 8, 12, 16, 20, 24, 28, 32, 36, 40, 44, 48, 52, 56, 60, 64, and 68.
[0107] In certain embodiments, the extracellular antigen-binding domain (e.g., scFv) comprises (a) a heavy chain variable region comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to an amino acid sequence selected from the group consisting of SEQ ID NOS: 1, 5, 9, 13, 17, 21, 25, 29, 33, 37, 41, 45, 49, 53, 57, 61, and 65; and (b) a light chain variable region comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to an amino acid sequence selected from the group consisting of SEQ ID NOS: 2, 6, 10, 14, 18, 22, 26, 30, 34, 38, 42, 46, 50, 54, 58, 62, and 66 are 4, 8, 12, 16, 20, 24, 28, 32, 36, 40, 44, 48, 52, 56, 60, 64, and 68.
[0108] The presently disclosed subject matter further provides extracellular antigen-binding domains (e.g., scFv) that comprise heavy chain variable region and light chain variable region CDRs, e.g., CDR1s, CDR2s and CDR3s, as disclosed herein in Tables 1-17. The CDR regions are delineated using the Kabat system (Kabat, E. A., et al. (1991) Sequences of Proteins of Immunological Interest, Fifth Edition, U.S. Department of Health and Human Services, NIH Publication No. 91-3242). The presently disclosed subject matter further provides extracellular antigen-binding domains (e.g., scFv) that comprise conservative modifications of the antibody sequences disclosed herein. For example, and not by way of limitation, an extracellular antigen-binding domains (e.g., scFv) of the presently disclosed subject matter comprises a heavy chain variable region comprising CDR1, CDR2 and CDR3 sequences and a light chain variable region comprising CDR1, CDR2 and CDR3 sequences, wherein one or more of these CDR sequences comprise specified amino acid sequences disclosed herein, or conservative modifications thereof, and wherein the extracellular antigen-binding domains retain the desired functional properties.
[0109] In certain embodiments, the presently disclosed subject matter provides an extracellular antigen-binding domain (e.g., scFv) comprising a heavy chain variable region, wherein the heavy chain variable region comprises: (a) a CDR1 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 89, 95, 101, 107, 113, 119, 125, 131, 137, 143, 149, 155, 161, 167, 173, 179, and 185, and conservative modifications thereof; (b) a CDR2 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 90, 96, 102, 108, 114, 120, 126, 132, 138, 144, 150, 156, 162, 168, 174, 180, and 186, and conservative modifications thereof; and (c) a CDR3 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 91, 97, 103, 109, 115, 121, 127, 133, 139, 145, 151, 157, 163, 169, 175, 181, and 187, and conservative modifications thereof.
[0110] In certain embodiments, the extracellular antigen-binding domain (e.g., scFv) comprises a light chain variable region, wherein the light chain variable region comprises: (a) a CDR1 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 92, 98, 104, 110, 116, 122, 128, 134, 140, 146, 152, 158, 164, 170, 176, 182, and 188, and conservative modifications thereof; (b) a CDR2 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 93, 99, 105, 111, 117, 123, 129, 135, 141, 147, 153, 159, 165, 171, 177, 183, and 189, and conservative modifications thereof; and (c) a CDR3 comprising an amino acid sequence selected from the group consisting of SEQ ID NOS: 94, 100, 106, 112, 118, 124, 130, 136, 142, 148, 154, 160, 166, 172, 178, 184, and 190, and conservative modifications thereof.
[0111] The presently disclosed subject matter provides an extracellular antigen-binding domain (e.g., scFv) comprising a heavy chain variable region comprising CDR1, CDR2, and CDR3 sequences and a light chain variable region comprising CDR1, CDR2, and CDR3 sequences, wherein: (a) the heavy chain variable region CDR3 comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 91, 97, 103, 109, 115, 121, 127, 133, 139, 145, 151, 157, 163, 169, 175, 181, and 187, and conservative modifications thereof; and (b) the light chain variable region CDR3 comprises an amino acid sequence selected from the group consisting of SEQ ID NOS: 94, 100, 106, 112, 118, 124, 130, 136, 142, 148, 154, 160, 166, 172, 178, 184, and 190, and conservative modifications thereof; wherein the extracellular antigen-binding domain specifically binds to a BCMA polypeptide (e.g., a human BCMA polypeptide). In certain embodiments, the heavy chain variable region CDR2 comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 125, 131, 137, 143, 149, 155, 161, 167, 173, 179, 185, 191, 197, 203, 209, 215, 221, 227, 233, 239, 245, 251, 257, 263, 305, 317, 329, 341, 353, 365, 377, and 389, and conservative modifications thereof; and (b) the light chain variable region CDR2 comprises an amino acid sequence selected from the group consisting of SEQ ID NOS: 93, 99, 105, 111, 117, 123, 129, 135, 141, 147, 153, 159, 165, 171, 177, 183, and 189, and conservative modifications thereof; wherein the extracellular antigen-binding domain specifically binds to a BCMA polypeptide (e.g., a human BCMA polypeptide). In certain embodiments, the heavy chain variable region CDR1 comprises an amino acid sequence selected from the group consisting of SEQ ID NOs: 89, 95, 101, 107, 113, 119, 125, 131, 137, 143, 149, 155, 161, 167, 173, 179, and 185, and conservative modifications thereof; and (b) the light chain variable region CDR1 comprises an amino acid sequence selected from the group consisting of SEQ ID NOS: 92, 98, 104, 110, 116, 122, 128, 134, 140, 146, 152, 158, 164, 170, 176, 182, and 188, and conservative modifications thereof; wherein the extracellular antigen-binding domain specifically binds to a BCMA polypeptide (e.g., a human BCMA polypeptide).
[0112] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO: 72 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-192 scFv (also referred to as "ET140-42 scFv").
[0113] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:1 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:2, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 1. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:1, as shown in Table 1. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:1, as shown in Table 1. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:2, as shown in Table 1. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:2, as shown in Table 1. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:1 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:2, as shown in Table 1. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:89 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:90 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:91 or conservative modifications thereof, as shown in Table 1. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:92 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:93 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:94 or conservative modifications thereof, as shown in Table 1. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:89 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:90 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:91 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:92 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:93 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:94 or conservative modifications thereof, as shown in Table 1. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:89, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:90, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:91, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:92, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:93, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:94.
TABLE-US-00006 TABLE 1 A BCMA polypeptide having the amino acid Antigen sequence of SEQ ID NO: 71 CDRs 1 2 3 V.sub.H VSSNSAAWN YRSKWYN [SEQ ARQGYSYYGYSDV [SEQ ID NO: ID NO: 90] [SEQ ID NO: 89] 91] V.sub.L SSNIGHND FDD [SEQ AAWDGSLNAFV [SEQ ID NO: ID NO: 93 [SEQ ID NO: 92] 94] Full V.sub.H QVQLQQSGPGLVKPSQTLSLTCAISGDSVSSNSAAWNWIR QSPSRGLEWLGRTYYRSKWYNDYAVSVKSRITINPDTSKN QFSLQLNSVTPEDTAVYYCARQGYSYYGYSDVWGQGTLVT VSS [SEQ ID NO: 1] DNA Caggtacagctgcagcagtcaggtccaggactggtgaagc cctcgcagaccctctcactcacctgtgccatctccgggga cagtgtctctagcaacagtgctgcttggaactggatcagg cagtccccatcgagaggccttgagtggctgggaaggacat actacaggtccaagtggtataatgattatgcagtatctgt gaaaagtcgaataaccatcaacccagacacatccaagaac cagttctccctgcagctgaactctgtgactcccgaggaca cggctgtgtattactgtgcgcgccagggttactcttacta cggttactctcgatgtttggggtcaaggtactctggtgac cgtctcctca [SEQ ID NO: 3] Full V.sub.L QSVLTQPPSVSVAPRQRVTISCSGSSSNIGHNDVSWYQHL PGKAPRLLIYFDDLLPSGVSDRFSASKSGTSASLAISGLQ SEDEADYYCAAWDGSLNAFVFGTGTKVTVLG [SEQ ID NO: 2] DNA Cagtctgtgctgactcagccaccctcggtgtctgtagccc ccaggcagagggtcaccatctcgtgttctggaagcagctc caacatcggacataatgatgtaagctggtaccagcatctc ccagggaaggctcccagactcctcatctattttgatgacc tgctgccgtcaggggtctctgaccgattctctgcctccaa gtctggcacctcagcctccctggccatcagtgggctccag tctgaggatgaggctgattattactgtgcagcatgggatg gcagcctgaatgctttgtcttcggaactgggaccaaggtc accgtcctaggt [SEQ ID NO: 4] scFv QSVLTQPPSVSVAPRQRVTISCSGSSSNIGHNDVSWYQHL PGKAPRLLIYFDDLLPSGVSDRFSASKSGTSASLAISGLQ SEDEADYYCAAWDGSLNAFVFGTGTKVTVLGSRGGGGSGG GGSGGGGSLEMAQVQLQQSGPGLVKPSQTLSLTCAISGDS VSSNSAAWNWIRQSPSRGLEWLGRTYYRSKWYNDYAVSVK SRITINPDTSKNQFSLQLNSVTPEDTAVYYCARQGYSYYG YSDVWGQGTLVTVSS [SEQ ID NO: 72]
[0114] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:73 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-197 scFv (also referred to as "ET140-47 scFv").
[0115] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:5 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:6, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 2. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:5, as shown in Table 2. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:5, as shown in Table 2. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:6, as shown in Table 2. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:6, as shown in Table 2. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:5 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:6, as shown in Table 2. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:95 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:96 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:97 or conservative modifications thereof, as shown in Table 2. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:98 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:99 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:100 or conservative modifications thereof, as shown in Table 2. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:95 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:96 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:97 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:98 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:99 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:100 or conservative modifications thereof, as shown in Table 2. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:95, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:96, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:97, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:98, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:99, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:100.
TABLE-US-00007 TABLE 2 A BCMA polypeptide having the amino Antigen acid sequence of SEQ ID NO: 71 CDRs 1 2 3 V.sub.H VSSNSAAWN YRSKWYN ARYGFSGSRFYDT [SEQ ID NO: [SEQ ID NO: [SEQ ID NO: 95] 96] 97] V.sub.L SSNIGNNA FDD [SEQ ID AAWDDSLNGYV [SEQ ID NO: NO: 99] [SEQ ID NO: 98] 100] Full V.sub.H QVQLQQSGPGLVKPSQTLSLTCAISGDSVSSNSAAWNWIR QSPSRGLEWLGRTYYRSKWYNDYAVSVKSRITINPDTSKN QFSLQLNSVTPEDTAVYYCARYGFSGSRFYDTWGQGTLVT VSS [SEQ ID NO: 5] DNA Caggtacagctgcagcagtcaggtccaggactggtgaagc cctcgcagaccctctcactcacctgtgccatctccgggga cagtgtctctagcaacagtgctgcttggaactggatcagg cagtccccatcgagaggccttgagtggctgggaaggacat actacaggtccaagtggtataatgattatgcagtatctgt gaaaagtcgaataaccatcaacccagacacatccaagaac cagttctccctgcagctgaactctgtgactcccgaggaca cggctgtgtattactgtgcgcgctacggtttctctggttc tcgtttctacgatacttggggtcaaggtactctggtgacc gtctcctca [SEQ ID NO: 7] Full V.sub.L QPVLTQPPSVSEAPRQRVTISCSGSSSNIGNNAVNWYQQL PGKAPKLLIYFDDLLSSGVSDRFSGSKSGTSASLAISGLQ SEDEADYYCAAWDDSLNGYVFGTGTKVTVLG [SEQ ID NO: 6] DNA Cagcctgtgctgactcagccaccctcggtgtctgaagccc ccaggcagagggtcaccatctcctgttctggaagcagctc caacatcggaaataatgctgtaaactggtaccagcagctc ccaggaaaggctcccaaactcctcatctattttgatgatc tgctgtcctcaggggtctctgaccgattctctggctccaa gtctggcacctcagcctccctggccatcagtgggctccag tctgaagatgaggctgattattactgtgcagcatgggatg acagcctgaatggttatgtcttcggaactgggaccaaggt caccgtcctaggt [SEQ ID NO: 8] scFv QPVLTQPPSVSEAPRQRVTISCSGSSSNIGNNAVNWYQQL PGKAPKLLIYFDDLLSSGVSDRFSGSKSGTSASLAISGLQ SEDEADYYCAAWDDSLNGYVFGTGTKVTVLGSRGGGGSGG GGSGGGGSLEMAQVQLQQSGPGLVKPSQTLSLTCAISGDS VSSNSAAWNWIRQSPSRGLEWLGRTYYRSKWYNDYAVSVK SRITINPDTSKNQFSLQLNSVTPEDTAVYYCARYGFSGSR FYDTWGQGTLVTVSS [SEQ ID NO: 73]
[0116] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:74 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-180 scFv (also referred to as "ET140-30 scFv").
[0117] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:9 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:10, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 3. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:9, as shown in Table 3. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:9, as shown in Table 3. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:10, as shown in Table 3. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:10, as shown in Table 3. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:9 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:10, as shown in Table 3. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:101 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:102 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:103 or conservative modifications thereof, as shown in Table 3. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:104 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:105 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:106 or conservative modifications thereof, as shown in Table 3. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:101 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:102 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:103 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:104 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:105 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:106 or conservative modifications thereof, as shown in Table 3. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:101, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:102, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:103, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:104, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:105, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:106.
TABLE-US-00008 TABLE 3 A BCMA polypeptide having the amino acid Antigen sequence of SEQ ID NO: 71 CDRs 1 2 3 V.sub.H GGTFSSYA IIPILGIA ARSGYSKSIVS [SEQ ID NO: [SEQ ID NO: YMDY [SEQ ID 101] 102] NO: 103] V.sub.L SSNIGSNV RNN [SEQ ID AAWDDSLSGYV [SEQ ID NO: NO: 105] [SEQ ID NO: 104] 106] Full V.sub.H EVQLVQSGAEVKKPGSSVKVSCKASGGTFSSYAISWVRQA PGQGLEWMGRIIPILGIANYAQKFQGRVTMTEDTSTDTAY MELSSLRSEDTAVYYCARSGYSKSIVSYMDYWGQGTLVTV SS [SEQ ID NO: 9] DNA Gaggtccagctggtgcagtctggagctgaggtgaagaagc ctgggtcctcggtgaaggtctcctgcaaggcttctggagg caccttcagcagctatgctatcagctgggtgcgacaggcc cctggacaagggcttgagtggatgggaaggatcatcccta tccttggtatagcaaactacgcacagaagttccagggcag agtcaccatgaccgaggacacatctacagacacagcctac atggagctgagcagcctgagatctgaggacacggccgtgt attactgtgcgcgctctggttactctaaatctatcgtttc ttacatggattactggggtcaaggtactctggtgaccgtc tcctca [SEQ ID NO: 11] Full V.sub.L LPVLTQPPSTSGTPGQRVTVSCSGSSSNIGSNVVFWYQQL PGTAPKLVIYRNNQRPSGVPDRFSVSKSGTSASLAISGLR SEDEADYYCAAWDDSLSGYVFGTGTKVTVLG [SEQ ID NO: 10] DNA Ctgcctgtgctgactcagcccccctccacgtctgggaccc ccgggcagagggtcaccgtctcttgttctggaagcagctc caacatcggaagtaatgttgtattctggtaccagcagctc ccaggcacggcccccaaacttgtcatctataggaataatc aacggccctcaggggtccctgaccgattctctgtctccaa gtctggcacctcagcctccctggccatcagtgggctccgg tccgaggacgaggctgattattattgtgcagcttgggatg acagcctgagtggttatgtcttcggaactgggaccaaggt caccgtcctaggt [SEQ ID NO: 12] scFv LPVLTQPPSTSGTPGQRVTVSCSGSSSNIGSNVVFWYQQL PGTAPKLVIYRNNQRPSGVPDRFSVSKSGTSASLAISGLR SEDEADYYCAAWDDSLSGYVFGTGTKVTVLGSRGGGGSGG GGSGGGGSLEMAEVQLVQSGAEVKKPGSSVKVSCKASGGT FSSYAISWVRQAPGQGLEWMGRIIPILGIANYAQKFQGRV TMTEDTSTDTAYMELSSLRSEDTAVYYCARSGYSKSIVSY MDYWGQGTLVTVSS [SEQ ID NO: 74]
[0118] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:75 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-172 scFv (also referred to as "ET140-22 scFv").
[0119] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:13 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:14, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 4. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:13, as shown in Table 4. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:13, as shown in Table 4. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:14, as shown in Table 4. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:14, as shown in Table 4. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:13 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:14, as shown in Table 4. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:107 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:108 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:109 or conservative modifications thereof, as shown in Table 4. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:110 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:111 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:112 or conservative modifications thereof, as shown in Table 4. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:107 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:108 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:109 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:110 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:111 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:112 or conservative modifications thereof, as shown in Table 4. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:107, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:108, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:109, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:110, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:111, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:112.
TABLE-US-00009 TABLE 4 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GYTFTSYY [SEQ INPSGGST [SEQ ARSQWGGVLDY ID NO: 107] ID NO: 108] [SEQ ID NO: 109] V.sub.L SSNIGARYD [SEQ GNN [SEQ ID QSYDSSLSASV ID NO: 110] NO: 111] [SEQ ID NO: 112] Full V.sub.H EVQLVQSGAEVKKPGASVKVSCKASGYTFTSYYMHWVRQAPGQ GLEWMGIINPSGGSTSYAQKFQGRVTMTRDTSTSTVYMELSSLRS EDTAVYYCARSQWGGVLDYWGQGTLVTVSS [SEQ ID NO: 13] DNA Gaggtccagctggtacagtctggggctgaggtgaagaagcctggggcctcagtgaaggtttcctgcaa ggcatctggatacaccttcaccagctactatatgcactgggtgcgacaggcccctggacaagggcttgag tggatgggaataatcaaccctagtggtggtagcacaagctacgcacagaagttccagggcagagtcacc atgaccagggacacgtccacgagcacagtctacatggagctgagcagcctgagatctgaggacacggc cgtgtattactgtgcgcgctctcagtggggtggtgttctggattactggggtcaaggtactctggtgaccgt ctcctca [SEQ ID NO: 15] Full V.sub.L QSVVTQPPSVSGAPGQRVTISCSGSSSNIGARYDVQWYQQLPGTAP KLLIFGNNNRPSGVPDRFSGSKSGTSASLAITGLQAEDEADYYCQS YDSSLSASVFGGGTKLTVLG [SEQ ID NO: 14] DNA Cagtctgtcgtgacgcagccgccctcagtgtctggggccccagggcagagggtcaccatctcctgcag tgggagcagctccaacatcggggcacgttatgatgttcagtggtaccagcagcttccaggaacagcccc caaactcctcatctttggtaacaacaatcggccctcaggggtccctgaccgattctctggctccaagtctgg cacgtcagcctccctggccatcactgggctccaggctgaggatgaggctgattattactgccagtcctatg acagcagcctgagtgcttcggtgttcggcggagggaccaagctgaccgtcctaggt [SEQ ID NO: 16] scFv QSVVTQPPSVSGAPGQRVTISCSGSSSNIGARYDVQWYQQLPGTAP KLLIFGNNNRPSGVPDRFSGSKSGTSASLAITGLQAEDEADYYCQS YDSSLSASVFGGGTKLTVLGSRGGGGSGGGGSGGGGSLEMAEVQ LVQSGAEVKKPGASVKVSCKASGYTFTSYYMHWVRQAPGQGLE WMGIINPSGGSTSYAQKFQGRVTMTRDTSTSTVYMELSSLRSEDT AVYYCARSQWGGVLDYWGQGTLVTVSS [SEQ ID NO: 75]
[0120] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:76 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-157 scFv (also referred to as "ET140-7 scFv").
[0121] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:17 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:18, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 5. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:17, as shown in Table 5. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:17, as shown in Table 5. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:18, as shown in Table 5. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:18, as shown in Table 5. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:17 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:18, as shown in Table 5. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:113 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:114 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:115 or conservative modifications thereof, as shown in Table 5. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:116 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:117 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:118 or conservative modifications thereof, as shown in Table 5. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:113 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:114 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:115 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:116 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:117 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:118 or conservative modifications thereof, as shown in Table 5. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:113, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:114, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:115, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:116, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:117, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:118.
TABLE-US-00010 TABLE 5 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GGTFSSYA [SEQ IIPILGIA [SEQ ID ARTGYESWGSYEVIDR ID NO: 113] NO: 114] [SEQ ID NO: 115] V.sub.L SSNIGSNT [SEQ SNN [SEQ ID NO: 117] AAWDDSLNGVV ID NO: 116] [SEQ ID NO: 118] Full V.sub.H QVQLVESGAEVKKPGSSVKVSCKASGGTFSSYAISWVRQAPGQGL EWMGRIIPILGIANYAQKFQGRVTITADESTSTAYMELSSLRSEDTA VYYCARTGYESWGSYEVIDRWGQGTLVTVSS [SEQ ID NO: 17] DNA Caggtgcagctggtggagtctggggctgaggtgaagaagcctgggtcctcggtgaaggtctcctgcaa ggcttctggaggcaccttcagcagctatgctatcagctgggtgcgacaggcccctggacaagggcttga gtggatgggaaggatcatccctatccttggtatagcaaactacgcacagaagttccagggcagagtcacg attaccgcggacgaatccacgagcacagcctacatggagctgagcagcctgagatctgaggacacggc cgtatattactgtgcgcgcactggttacgaatcttggggttcttacgaagttatcgatcgttggggtcaaggt actctggtgaccgtctcctca [SEQ ID NO: 19] Full V.sub.L QAVLTQPPSASGTPGQRVTISCSGSSSNIGSNTVNWYRQLPGTAPK LLIYSNNQRPSGVPDRFSGSKSGTSASLAISGLQSEDEADYYCAAW DDSLNGVVFGGGTKLTVLG [SEQ ID NO: 18] DNA Caggctgtgctgactcagccaccctcagcgtctgggacccccgggcagagggtcaccatctcttgttct ggaagcagctccaacatcggaagtaatactgtaaactggtaccggcagctcccaggaacggcccccaa actcctcatctatagtaataatcagcggccctcaggggtccctgaccgattctctggctccaagtctggcac ctcagcctccctggccatcagtgggctccagtctgaggatgaggctgattattactgtgcagcatgggatg acagcctgaatggtgtggtattcggcggagggaccaagctgaccgtcctaggt [SEQ ID NO: 20] scFv QAVLTQPPSASGTPGQRVTISCSGSSSNIGSNTVNWYRQLPGTAPK LLIYSNNQRPSGVPDRFSGSKSGTSASLAISGLQSEDEADYYCAAW DDSLNGVVFGGGTKLTVLGSRGGGGSGGGGSGGGGSLEMAQVQ LVESGAEVKKPGSSVKVSCKASGGTFSSYAISWVRQAPGQGLEW MGRIIPILGIANYAQKFQGRVTITADESTSTAYMELSSLRSEDTAVY YCARTGYESWGSYEVIDRWGQGTLVTVSS [SEQ ID NO: 76]
[0122] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:77 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-153 scFv (also referred to as "ET140-3 scFv").
[0123] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:21 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:22, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 6. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:21, as shown in Table 6. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:21, as shown in Table 6. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:22, as shown in Table 6. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:22, as shown in Table 6. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:21 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:22, as shown in Table 6. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:119 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:120 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:121 or conservative modifications thereof, as shown in Table 6. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:122 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:123 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:124 or conservative modifications thereof, as shown in Table 6. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:119 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:120 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:121 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:122 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:123 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:124 or conservative modifications thereof, as shown in Table 6. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:119, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:120, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:121, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:122, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:123, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:124.
TABLE-US-00011 TABLE 6 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GGTFSSYA [SEQ IIPILGIA [SEQ ID ARGGYYSHDMWSED ID NO: 119] NO: 120] [SEQ ID NO: 121] V.sub.L SSNIGSNS [SEQ SNN [SEQ ID NO: ATWDDNLNVHYV ID NO: 122] 123] [SEQ ID NO: 124] Full V.sub.H QVQLVQSGAEVKKPGSSVKVSCKASGGTFSSYAISWVRQAPGQGL EWMGRIIPILGIANYAQKFQGRVTITADKSTSTAYMELSSLRSEDTA VYYCARGGYYSHDMWSEDWGQGTLVTVSS [SEQ ID NO: 21] DNA Caggtgcagctggtgcagtctggggctgaggtgaagaagcctgggtcctcggtgaaggtctcctgcaag gcttctggaggcaccttcagcagctatgctatcagctgggtgcgacaggcccctggacaagggcttgagt ggatgggaaggatcatccctatccttggtatagcaaactacgcacagaagttccagggcagagtcacgatt accgcggacaaatccacgagcacagcctacatggagctgagcagcctgagatctgaggacacggccgt gtattactgtgcgcgcggtggttactactctcatgacatgtggtctgaagattggggtcaaggtactctggtg accgtctcctca [SEQ ID NO: 23] Full V.sub.L LPVLTQPPSASGTPGQRVTISCSGRSSNIGSNSVNWYRQLPGAAPKL LIYSNNQRPPGVPVRFSGSKSGTSASLAISGLQSEDEATYYCATWD DNLNVHYVFGTGTKVTVLG [SEQ ID NO: 22] DNA Ctgcctgtgctgactcagccaccctcagcgtctgggacccccgggcagagggtcaccatctcttgttctg gacgcagttccaacatcgggagtaattctgttaactggtatcgacaactcccaggagcggcccccaaactc ctcatctatagtaataatcagcggcccccaggggtccctgtgcgattctctggctccaagtctggcacctca gcctccctggccatcagtgggctccagtctgaagatgaggccacttattactgtgcaacatgggatgacaa tctgaatgttcactatgtcttcggaactgggaccaaggtcaccgtcctaggt [SEQ ID NO: 24] scFv LPVLTQPPSASGTPGQRVTISCSGRSSNIGSNSVNWYRQLPGAAPKL LIYSNNQRPPGVPVRFSGSKSGTSASLAISGLQSEDEATYYCATWD DNLNVHYVFGTGTKVTVLGSRGGGGSGGGGSGGGGSLEMAQVQ LVQSGAEVKKPGSSVKVSCKASGGTFSSYAISWVRQAPGQGLEW MGRIIPILGIANYAQKFQGRVTITADKSTSTAYMELSSLRSEDTAVY YCARGGYYSHDMWSEDWGQGTLVTVSS [SEQ ID NO: 77]
[0124] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:78 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-201 scFv (also referred to as "ET140-51 scFv").
[0125] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:25 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:26, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 7. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:25, as shown in Table 7. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:25, as shown in Table 7. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:26, as shown in Table 7. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:26, as shown in Table 7. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:25 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:26, as shown in Table 7. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:125 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:126 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:127 or conservative modifications thereof, as shown in Table 7. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:128 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:129 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:130 or conservative modifications thereof, as shown in Table 7. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:125 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:126 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:127 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:128 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:129 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:130 or conservative modifications thereof, as shown in Table 7. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:125, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:126, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:127, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:128, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:129, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:130.
TABLE-US-00012 TABLE 7 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GGSISNSNW [SEQ IYHSGST [SEQ ARRDNWKTPTTKID ID NO: 125] ID NO: 126] GFDI [SEQ ID NO: 127] V.sub.L SGYSNYK [SEQ VGTGGIVG [SEQ GADHGSGSNFVYV ID NO: 128] ID NO: 129] SEQ ID NO: 130] Full V.sub.H QVQLQESGPGLVKPSGTLSLTCGVSGGSISNSNWWSWVRQPPGKG LEWIGEIYHSGSTKYNPSLRSRVTISVDKSKNQFSLKLSSVTAADT AVYYCARRDNWKTPTTKIDGFDIWGQGTMVTVSS [SEQ ID NO: 25] DNA Caggtgcagctgcaggagtcgggcccaggactggtgaagccttcggggaccctgtccctcacctgcg gtgtctctggtggctccatcagcaatagtaactggtggagttgggtccgccagccccccgggaaggggc tggagtggattggggaaatctatcatagtgggagcaccaagtacaacccgtccctcaggagtcgagtcac catatcagtagacaagtccaagaaccagttctccctaaaattgagctctgtgaccgccgcggacacggcc gtatattactgtgcgagacgagataactggaagacccccactaccaaaattgatggttttgatatctggggc caagggacaatggtcaccgtctcttca [SEQ ID NO: 27] Full V.sub.L QPVLTQPPSASASLGASVTLTCTLSSGYSNYKVDWYQQRPGKGPR FVMRVGTGGIVGSKGDGIPDRFSVLGSGLNRYLTIKNIQEEDEGDY HCGADHGSGSNFVYVFGTGTKVTVLG [SEQ ID NO: 26] DNA Cagcctgtgctgactcagccaccttctgcatcagcctccctgggagcctcggtcacactcacctgcaccc tgagcagcggctacagtaattataaagtggactggtaccagcagagaccagggaagggcccccggfttg tgatgcgagtgggcactggtgggattgtgggatccaagggggatggcatccctgatcgcttctcagtcttg ggctcaggcctgaatcggtacctgaccatcaagaacatccaggaagaagatgagggtgactatcactgt ggggcagaccatggcagtgggagcaacttcgtgtatgtcttcggaactgggaccaaggtcaccgtccta ggt [SEQ ID NO: 28] scFv QPVLTQPPSASASLGASVTLTCTLSSGYSNYKVDWYQQRPGKGPR FVMRVGTGGIVGSKGDGIPDRFSVLGSGLNRYLTIKNIQEEDEGDY HCGADHGSGSNFVYVFGTGTKVTVLGSRGGGGSGGGGSGGGGSL EMAQVQLQESGPGLVKPSGTLSLTCGVSGGSISNSNWWSWVRQP PGKGLEWIGEIYHSGSTKYNPSLRSRVTISVDKSKNQFSLKLSSVTA ADTAVYYCARRDNWKTPTTKIDGFDIWGQGTMVTVSS [SEQ ID NO: 78]
[0126] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:79 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-167 scFv (also referred to as "ET140-17 scFv").
[0127] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:29 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:30, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 8. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:29, as shown in Table 8. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:29, as shown in Table 8. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:30, as shown in Table 8. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:30, as shown in Table 8. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:29 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:30, as shown in Table 8. In certain embodiments, the anti-BCMA comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:131 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:132 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:133 or conservative modifications thereof, as shown in Table 8. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:134 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:135 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:136 or conservative modifications thereof, as shown in Table 8. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:131 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:132 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:133 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:134 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:135 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:136 or conservative modifications thereof, as shown in Table 8. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:131, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:132, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:133, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:134, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:135, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:136.
TABLE-US-00013 TABLE 8 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GYTFTGYY [SEQ ID INPNSGGT [SEQ ARSQWGSSWDY NO: 131] ID NO: 132] [SEQ ID NO: 133] V.sub.L QSISSY [SEQ ID AAS [SEQ ID NO: QQSYSTPPT [SEQ NO: 134] 135] ID NO: 136] Full V.sub.H QVQLVQSGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQ GLEWMGWINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLR SDDTAVYYCARSQWGSSWDYWGQGTLVTVSS [SEQ ID NO: 29] DNA Caggtccagctggtacagtctggggctgaggtgaagaagcctggggcctcagtgaaggtctcctgcaa ggcttctggatacaccttcaccggctactatatgcactgggtgcgacaggcccctggacaagggcttgag tggatgggatggatcaaccctaacagtggtggcacaaactatgcacagaagtttcagggcagggtcacc atgaccagggacacgtccatcagcacagcctacatggagctgagcaggctgagatctgacgacacggc cgtgtattactgtgcgcgctctcagtggggttcttcttgggattactggggtcaaggtactctggtgaccgtc tcctca [SEQ ID NO: 31] Full V.sub.L DIQLTQSPSSLSASVGDRVTITCRASQSISSYLNWYQQKPGKAPKL LIYAASSLQSGVPSRFSGSGSGTDFTLTISSLQPEDFATYYCQQSYS TPPTFGQGTKVEIKR [SEQ ID NO: 30] DNA Gacatccagttgacccagtctccatcctccctgtctgcatctgtaggagacagagtcaccatcacttgcc- g ggcaagtcagagcattagcagctatttaaattggtatcagcagaaaccagggaaagcccctaagctcctg atctatgctgcatccagtttgcaaagtggggtcccatcaaggttcagtggcagtggatctgggacagatttc actctcaccatcagcagtctgcaacctgaagattttgcaacttactactgtcaacagagttacagtacccctc cgacgttcggccaagggaccaaggtggagatcaaacgt [SEQ ID NO: 32] scFv DIQLTQSPSSLSASVGDRVTITCRASQSISSYLNWYQQKPGKAPKL LIYAASSLQSGVPSRFSGSGSGTDFTLTISSLQPEDFATYYCQQSYS TPPTFGQGTKVEIKRSRGGGGSGGGGSGGGGSLEMAQVQLVQSG AEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQGLEWMGW INPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAVYY CARSQWGSSWDYWGQGTLVTVSS [SEQ ID NO: 79]
[0128] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:80 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-163 scFv (also referred to as "ET140-13 scFv").
[0129] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:33 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:34, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 9. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:33, as shown in Table 9. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:33, as shown in Table 9. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:34, as shown in Table 9. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:34, as shown in Table 9. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:33 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:34, as shown in Table 9. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:137 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:138 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:139 or conservative modifications thereof, as shown in Table 9. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:140 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:141 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:142 or conservative modifications thereof, as shown in Table 9. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:137 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:138 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:139 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:140 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:141 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:142 or conservative modifications thereof, as shown in Table 9. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:137, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:138, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:139, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:140, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:141, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:142.
TABLE-US-00014 TABLE 9 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GYTFTGYY [SEQ ID INPNSGGT [SEQ ID ARSSYHLYGYDS NO: 137] NO: 138] [SEQ ID NO: 139] V.sub.L NDYTNYK [SEQ ID VGPGGIVG [SEQ GADHGTGSNFVYV NO: 140] ID NO: 141] [SEQ ID NO: 142] Full V.sub.H EVQLVESGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQAPGQ GLEWMGWINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLR SDDTAVYYCARSSYHLYGYDSWGQGTLVTVSS [SEQ ID NO: 33] DNA Gaggtgcagctggtggagtccggggctgaggtgaagaagcctggggcctcagtgaaggtctcctgca aggcttctggatacaccttcaccggctactatatgcactgggtgcgacaggcccctggacaagggcttga gtggatgggatggatcaaccctaacagtggtggcacaaactatgcacagaagtttcagggcagggtcac catgaccagggacacgtccatcagcacagcctacatggagctgagcaggctgagatctgacgacacgg ccgtatattactgtgcgcgctatataccatctgtacggttacgattcttggggtcaaggtactctggtga ccgtctcctca [SEQ ID NO: 35] Full V.sub.L QPVLTQPPSASASLGASVTLTCTLSNDYTNYKVDWYQQRPGKGPR FVMRVGPGGIVGSKGDGIPDRFSVLGSGLNRYLTIKNIQEEDESDY HCGADHGTGSNFVYVFGGGTKLTVLG [SEQ ID NO: 34] DNA Cagcctgtgctgactcagccaccttctgcatcagcctccctgggagcctcggtcactctcacctgcaccc- t gagcaacgactacactaattataaagtggactggtaccagcagagaccagggaagggcccccggtttgt gatgcgagtgggccctggtgggattgtgggatccaagggggatggcatccctgatcgcttctcagtcttg ggctcaggcctgaatcgatacctgaccatcaagaacatccaggaggaggatgagagtgactaccactgt ggggcggaccatggcaccgggagcaacttcgtgtacgtgttcggcggagggaccaagctgaccgtcct aggt [SEQ ID NO: 36] scFv QPVLTQPPSASASLGASVTLTCTLSNDYTNYKVDWYQQRPGKGPR FVMRVGPGGIVGSKGDGIPDRFSVLGSGLNRYLTIKNIQEEDESDY HCGADHGTGSNFVYVFGGGTKLTVLGSRGGGGSGGGGSGGGGSL EMAEVQLVESGAEVKKPGASVKVSCKASGYTFTGYYMHWVRQA PGQGLEWMGWINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELS RLRSDDTAVYYCARSSYHLYGYDSWGQGTLVTVSS [SEQ ID NO: 80]
[0130] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:81 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-207 scFv (also referred to as "ET140-57 scFv").
[0131] In certain embodiment, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:37 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:38, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:98. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 10. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:37, as shown in Table 10. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:37, as shown in Table 10. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:38, as shown in Table 10. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:38, as shown in Table 10. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:37 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:38, as shown in Table 10. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:143 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:144 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:145 or conservative modifications thereof, as shown in Table 10. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:146 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:147 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:148 or conservative modifications thereof, as shown in Table 10. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:143 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:144 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:145 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:146 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:147 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:148 or conservative modifications thereof, as shown in Table 10. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:143, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:144, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:145, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:146, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:147, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:148.
TABLE-US-00015 TABLE 10 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GGTFSSYA [SEQ IIPIFSTA [SEQ ARQPWTWYSPYDQ ID NO: 143] ID NO: 144] [SEQ ID NO: 145] V.sub.L SGYSNYK [SEQ VDTGGIVG [SEQ GADHGSGSNFVWV ID NO: 146] ID NO: 147] [SEQ ID NO: 148] Full V.sub.H QVQLVQSGAEVKKPGSSVKVSCKASGGTFSSYAISWVRQAPGQG LEWMGGIIPIFSTANYAQKFQGRVTMTTDTSTSTAYMELRSLRSD DTAVYYCARQPWTWYSPYDQWGQGTLVTVSS [SEQ ID NO: 37] DNA Caggtgcagctggtgcagtctggggctgaggtgaagaagcctgggtcctcggtgaaggtctcctgcaa ggcttctggaggcaccttcagcagctatgctatcagctgggtgcgacaggcccctggacaagggcttga gtggatgggagggatcatccctatctttagtacagcaaactacgcacagaagttccagggcagagtcacc atgaccacagacacatccacgagcacagcctacatggagctgaggagcctgagatctgacgacacggc cgtgtattactgtgcgcgccagccgtggacttggtactctccgtacgatcagtggggtcaaggtactct ggtgaccgtctcctca [SEQ ID NO: 39] Full V.sub.L QPVLTQPPSASASLGASVTLTCTLSSGYSNYKVDWYQQRPGKGPR FLMRVDTGGIVGSKGDGIPDRFSVSGSGLNRYLTIKNIQEEDESDY HCGADHGSGSNFVWVFGGGTKLTVLG [SEQ ID NO: 38] DNA Cagcctgtgctgactcagccaccttctgcatcagcctccctgggagcctcggtcacactcacctgcaccc tgagcagcggctacagtaattataaagtggactggtatcaacagagaccagggaagggcccccggtttct gatgcgagtagacaccggtgggattgtgggatccaagggggatggcatccctgatcgcttctcagtctcg ggctcaggtctgaatcggtacctgaccatcaagaacattcaggaagaggatgagagtgactaccactgtg gggcagaccatggcagtgggagcaacttcgtgtgggtgttcggcggagggaccaagctgaccgtccta ggt [SEQ ID NO: 40] scFv QPVLTQPPSASASLGASVTLTCTLSSGYSNYKVDWYQQRPGKGPR FLMRVDTGGIVGSKGDGIPDRFSVSGSGLNRYLTIKNIQEEDESDY HCGADHGSGSNFVWVFGGGTKLTVLGSRGGGGSGGGGSGGGGS LEMAQVQLVQSGAEVKKPGSSVKVSCKASGGTFSSYAISWVRQA PGQGLEWMGGIIPIFSTANYAQKFQGRVTMTTDTSTSTAYMELRS LRSDDTAVYYCARQPWTWYSPYDQWGQGTLVTVSS [SEQ ID NO: 81]
[0132] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:82 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-165 scFv (also referred to as "ET140-15 scFv").
[0133] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:41 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:42, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 11. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:41, as shown in Table 11. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:41, as shown in Table 11. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:42, as shown in Table 11. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:42, as shown in Table 11. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:41 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:42, as shown in Table 11. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:149 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 150 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 151 or conservative modifications thereof, as shown in Table 11. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 152 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 153 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 154 or conservative modifications thereof, as shown in Table 11. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 147 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 150 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 151 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 152 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 153 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 154 or conservative modifications thereof, as shown in Table 11. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 149, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 150, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 151, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 152, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 153, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 154.
TABLE-US-00016 TABLE 11 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GFTFSTYA [SEQ ITPGGDRT [SEQ ARYYGYMIDM [SEQ ID NO: 149] ID NO: 150] ID NO: 151] V.sub.L QSLLHSNGYNY [SEQ LGS [SEQ ID MQALQTPLT [SEQ ID NO: 152] NO: 153] ID NO: 154] Full V.sub.H EVQLVETGGGLVQPGGSLRLSCAASGFTFSTYAMTWVRQAPGKGL EWVSAITPGGDRTYYADSVKGRFTISRDNSRNTLYLQMNSLRAEDT AVYYCARYYGYMIDMWGQGTLVTVSS [SEQ ID NO: 41] DNA Gaggtgcagctggtggagactgggggaggcctggtacagcctggggggtccctgagactctcctgtgct gcctctggattcacctttagcacctatgccatgacctgggtccgccaggctccagggaaggggctggagt gggtctcagctattactcctggtggtgatcgcacatactacgcagactccgtgaagggccgtttcactatctc cagagacaattccaggaacacgctgtatctgcaaatgaacagcctgagagccgaggacacggccgtatat tactgtgcgcgctactacggttacatgatcgatatgtggggtcaaggtactctggtgaccgtctcctca [SEQ ID NO: 43] Full V.sub.L DVVMTQSPLSLPVTPGEPASISCRSSQSLLHSNGYNYLDWYLQKPG QSPQLLIYLGSNRASGVPDRFSGSGSGTDFTLKISRVEAEDVGVYYC MQALQTPLTFGGGTKVEIKR [SEQ ID NO: 42] DNA Gatgttgtgatgactcagtctccactctccctgcccgtcacccctggagagccggcctccatctcctgca- g gtctagtcagagcctcctgcatagtaatggatacaactatttggattggtacctgcagaagccagggcagtc tccacagctcctgatctatttgggttctaatcgggcctccggggtccctgacaggttcagtggcagtggatca ggcacagattttacactgaaaatcagcagagtggaggctgaggatgttggggtttattactgcatgcaagct ctacaaactcctctcactttcggcggagggaccaaggtggaaatcaaacgt [SEQ ID NO: 44] scFv DVVMTQSPLSLPVTPGEPASISCRSSQSLLHSNGYNYLDWYLQKPG QSPQLLIYLGSNRASGVPDRFSGSGSGTDFTLKISRVEAEDVGVYYC MQALQTPLTFGGGTKVEIKRSRGGGGSGGGGSGGGGSLEMAEVQL VETGGGLVQPGGSLRLSCAASGFTFSTYAMTWVRQAPGKGLEWV SAITPGGDRTYYADSVKGRFTISRDNSRNTLYLQMNSLRAEDTAVY YCARYYGYMIDMWGQGTLVTVSS [SEQ ID NO: 82]
[0134] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:83 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-188 scFv (also referred to as "ET140-38 scFv").
[0135] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:45 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:46, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 12. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:45, as shown in Table 12. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:45, as shown in Table 12. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:46, as shown in Table 12. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:46, as shown in Table 12. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:45 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:46, as shown in Table 12. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:155 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 156 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 157 or conservative modifications thereof, as shown in Table 12. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 158 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 159 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 160 or conservative modifications thereof, as shown in Table 12. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 155 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 156 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 157 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 158 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 159 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 160 or conservative modifications thereof, as shown in Table 12. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 155, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 156, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 157, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 158, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 159, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 160.
TABLE-US-00017 TABLE 12 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GYTFTGYY [SEQ INPNSGGT [SEQ ARSQWGGTYDY ID NO: 155] ID NO: 156] [SEQ ID NO: 157] V.sub.L SSNIGSNT [SEQ SNN [SEQ ID AAWDDSLNGWV ID NO: 158] NO: 159] [SEQ ID NO: 160] Full V.sub.H QMQLVQSGAEVKKPGASVKVSCKASGYTFTGYYVHWLRQAPGQ GLEWMGWINPNSGGTNNAQEFQGRITMTRDTSINTAYMELSRLRS DDTAVYYCARSQWGGTYDYWGQGTLVTVSS [SEQ ID NO: 45] DNA Cagatgcagctggtgcagtctggggctgaggtgaagaagcctggggcctcagtgaaggtctcctgcaag gcttctggatacaccttcaccggctattatgtacactggttgcgacaggcccctggacaagggcttgagtgg atgggttggatcaaccctaacagtggcggcacaaacaatgcacaggagtttcaaggcaggatcaccatga ccagggacacgtccatcaacacagcctacatggagctgagcaggctgagatctgacgacacggccgtgt attactgtgcgcgctctcagtggggtggtacttacgattactggggtcaaggtactctggtgaccgtctcct ca [SEQ ID NO: 47] Full V.sub.L SYVLTQPPSASGTPGQRVTISCSGSSSNIGSNTVNWYQQVPGTAPKL LIYSNNQRPSGVPDRFSGSKSGASASLAISWLQSEDEADYYCAAWD DSLNGWVFGGGTKLTVLG [SEQ ID NO: 46] DNA Tcctatgtgctgactcagccaccctcagcgtctgggacccccgggcagagggtcaccatctcttgttctg- g aagcagctccaacatcggaagtaatactgtaaactggtaccagcaggtcccaggaacggcccccaaactc ctcatctatagtaataatcagcggccctcaggggtccctgaccgattctctggctccaagtctggcgcctca gcctccctggccatcagttggctccagtctgaggatgaggctgattattactgtgcagcatgggatgacagc ctgaatggttgggtgttcggcggagggaccaagctgaccgtcctaggt [SEQ ID NO: 48] scFv SYVLTQPPSASGTPGQRVTISCSGSSSNIGSNTVNWYQQVPGTAPKL LIYSNNQRPSGVPDRFSGSKSGASASLAISWLQSEDEADYYCAAWD DSLNGWVFGGGTKLTVLGSRGGGGSGGGGSGGGGSLEMAQMQL VQSGAEVKKPGASVKVSCKASGYTFTGYYVHWLRQAPGQGLEW MGWINPNSGGTNNAQEFQGRITMTRDTSINTAYMELSRLRSDDTA VYYCARSQWGGTYDYWGQGTLVTVSS [SEQ ID NO: 83]
[0136] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:84 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-196 scFv (also referred to as "ET140-46 scFv").
[0137] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:49 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:50, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 13. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:49, as shown in Table 13. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:49, as shown in Table 13. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:50, as shown in Table 13. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:50, as shown in Table 13. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:49 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:50, as shown in Table 13. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:161 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 162 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 163 or conservative modifications thereof, as shown in Table 13. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 164 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:165 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:166 or conservative modifications thereof, as shown in Table 13. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 161 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 162 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 163 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 164 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:165 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:166 or conservative modifications thereof, as shown in Table 13. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 161, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 162, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 163, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 164, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:165, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:166.
TABLE-US-00018 TABLE 13 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GYDFTTYW [SEQ IYPGDSDT [SEQ ID ARMWTFSQDG [SEQ ID NO: 161] NO: 162] ID NO: 163] V.sub.L SSNIGSYT [SEQ SNN [SEQ ID AAWDDSLNGYV ID NO: 164] NO: 165] [SEQ ID NO: 166] Full V.sub.H EVQLVQSGAEVKKPGESLKISCKGSGYDFTTYWIGWVRQMPGKG LEWMGIIYPGDSDTRYSPSVRGRVTISADKSINTAYLQWSSLEASD TAMYYCARMWTFSQDGWGQGTLVTVSS [SEQ ID NO: 49] DNA gaggtgcagctggtgcagtctggagcagaggtgaaaaagccgggggagtctctgaagatctcctgtaa gggttctggatatgactttaccacctactggatcgggtgggtgcgccagatgcccgggaagggcctgga gtggatggggatcatctatcctggtgactctgataccagatacagcccgtccgtccgaggccgggtcacc atctcagccgacaagtccatcaacaccgcctatttgcagtggagtagcctggaggcctccgacaccgcc atgtattactgtgcgcgcatgtggactttctctcaggatggttggggtcaaggtactctggtgaccgtct cctca [SEQ ID NO: 51] Full V.sub.L QAVLTQPPSASGTPGQRVTISCSGSSSNIGSYTVSWYQQLPGTAPK FLIYSNNQRPSGVPDRFSGSKSGTSASLAISGLQSEDEADYYCAAW DDSLNGYVFGTGTKVTVLG [SEQ ID NO: 50] DNA Caggctgtgctgactcagccaccctcagcgtctgggacccccgggcagagggtcaccatctcttgttct ggaagcagctccaacatcggaagttatactgtaagctggtaccagcaactcccaggaacggcccccaaa ttcctcatctattctaataatcagcggccctcaggggtccctgaccgattctctggctccaagtctggcacct cagcctccctggccatcagtgggctccagtctgaggatgaggctgattattactgtgctgcatgggatgac agcctgaatggttatgtcttcggaactgggaccaaggtcaccgtcctaggt [SEQ ID NO: 52] scFv QAVLTQPPSASGTPGQRVTISCSGSSSNIGSYTVSWYQQLPGTAPK FLIYSNNQRPSGVPDRFSGSKSGTSASLAISGLQSEDEADYYCAAW DDSLNGYVFGTGTKVTVLGSRGGGGSGGGGSGGGGSLEMAEVQL VQSGAEVKKPGESLKISCKGSGYDFTTYWIGWVRQMPGKGLEW MGIIYPGDSDTRYSPSVRGRVTISADKSINTAYLQWSSLEASDTAM YYCARMWTFSQDGWGQGTLVTVSS [SEQ ID NO: 84]
[0138] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:85 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-204 scFv (also referred to as "ET140-54 scFv").
[0139] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:53 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:54, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 14. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:53, as shown in Table 14. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:53, as shown in Table 14. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:54, as shown in Table 14. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:54, as shown in Table 14. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:53 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:54, as shown in Table 14. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:167 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:168 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:169 or conservative modifications thereof, as shown in Table 14. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 170 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 171 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 172 or conservative modifications thereof, as shown in Table 14. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 167 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 168 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 169 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 170 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 171 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 172 or conservative modifications thereof, as shown in Table 14. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 167, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 168, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 169, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 170, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 171, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 172.
TABLE-US-00019 TABLE 14 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GYTFIDYY [SEQ INPNSGGT [SEQ ID ARSQRDGYMDY ID NO: 167] NO: 168] [SEQ ID NO: 169] V.sub.L ISCTGTSSD [SEQ EDS [SEQ ID NO: SSNTRSSTLV [SEQ ID NO: 170] 171] ID NO: 172] Full V.sub.H EVQLVQSGAEMKKPGASLKLSCKASGYTFIDYYVYWMRQAPGQ GLESMGWINPNSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRS DDTAMYYCARSQRDGYMDYWGQGTLVTVSS [SEQ ID NO: 53] DNA Gaagtgcagctggtgcagtctggggctgagatgaagaagcctggggcctcactgaagctctcctgcaa ggcttctggatacaccttcatcgactactatgtatactggatgcgacaggcccctggacaagggcttgagt ccatgggatggatcaaccctaacagtggtggcacaaactatgcacagaagtttcagggcagggtcacca tgaccagggacacgtccatcagcacagcctacatggagctgagcaggctgagatctgacgacaccgcc atgtattactgtgcgcgctcccagcgtgacggttacatggattactggggtcaaggtactctggtgaccgt ctcctca [SEQ ID NO: 55] Full V.sub.L QSALTQPASVSASPGQSIAISCTGTSSDVGWYQQHPGKAPKLMIYE DSKRPSGVSNRFSGSKSGNTASLTISGLQAEDEADYYCSSNTRSST LVFGGGTKLTVLG [SEQ ID NO: 54] DNA Caatctgccctgactcagcctgcctccgtgtctgcgtctcctggacagtcgatcgccatctcctgcactg- g aaccagcagtgacgttggttggtatcaacagcacccaggcaaagcccccaaactcatgatttatgagga cagtaagcggccctcaggggtttctaatcgcttctctggctccaagtctggcaacacggcctccctgacc atctctgggctccaggctgaggacgaggctgattattactgcagctcaaatacaagaagcagcactttggt gttcggcggagggaccaagctgaccgtcctaggt [SEQ ID NO: 56] scFv QSALTQPASVSASPGQSIAISCTGTSSDVGWYQQHPGKAPKLMIYE DSKRPSGVSNRFSGSKSGNTASLTISGLQAEDEADYYCSSNTRSST LVFGGGTKLTVLGSRGGGGSGGGGSGGGGSLEMAEVQLVQSGAE MKKPGASLKLSCKASGYTFIDYYVYWMRQAPGQGLESMGWINP NSGGTNYAQKFQGRVTMTRDTSISTAYMELSRLRSDDTAMYYCA RSQRDGYMDYWGQGTLVTVSS [SEQ ID NO: 85]
[0140] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:86 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-190 scFv (also referred to as "ET140-40 scFv").
[0141] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:57 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:58, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 15. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:57, as shown in Table 15. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:57, as shown in Table 15. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:58, as shown in Table 15. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:58, as shown in Table 15. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:57 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:58, as shown in Table 15. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:173 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 174 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 175 or conservative modifications thereof, as shown in Table 15. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 176 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 177 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 178 or conservative modifications thereof, as shown in Table 15. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 173 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 174 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 175 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 176 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 177 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 178 or conservative modifications thereof, as shown in Table 15. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 173, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 174, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 175, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 176, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 177, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 178.
TABLE-US-00020 TABLE 15 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GYTFTDYY [SEQ INPNSGGT [SEQ ARSPYSGVLDK [SEQ ID NO: 173] ID NO: 174] ID NO: 175] V.sub.L SSNIGAGFD [SEQ GNS [SEQ ID NO: QSYDSSLSGYV [SEQ ID NO: 176] 177] ID NO: 178] Full V.sub.H QVQLVQSGAEVKKPGASVKVSCKASGYTFTDYYMHWVRQAPGQ RLEWMGWINPNSGGTNYAQKFQDRITVTRDTSSNTGYMELTRLRS DDTAVYYCARSPYSGVLDKWGQGTLVTVSS [SEQ ID NO: 57] DNA Caggtccagctggtacagtctggggctgaggtgaagaagcctggggcctcagtgaaggtctcctgcaag gcttctggatacaccttcaccgactactatatgcactgggtgcgacaggcccctggacaacggcttgagtg gatgggatggatcaaccctaacagtggtggcacaaactatgcacagaagtttcaggacaggatcaccgtg accagggacacctccagcaacacaggctacatggagctgaccaggctgagatctgacgacacggccgt gtattactgtgcgcgctctccgtactctggtgttctggataaatggggtcaaggtactctggtgaccgtct cctca [SEQ ID NO: 59] Full V.sub.L QSVLTQPPSVSGAPGQRVTISCTGSSSNIGAGFDVHWYQQLPGTAP KLLIYGNSNRPSGVPDRFSGSKSGTSASLAITGLQAEDEADYYCQS YDSSLSGYVFGTGTKVTVLG [SEQ ID NO: 58] DNA Cagtctgtgctgacgcagccgccctcagtgtctggggccccagggcagagggtcaccatctcctgcact gggagcagctccaacatcggggcaggttttgatgtacactggtaccagcagcttccaggaacagccccc aaactcctcatctatggtaacagcaatcggccctcaggggtccctgaccgattctctggctccaagtctggc acctcagcctccctggccatcactgggctccaggctgaggatgaggctgattattactgccagtcctatgac agcagcctgagtggttatgtcttcggaactgggaccaaggtcaccgtcctaggt [SEQ ID NO: 60] scFv QSVLTQPPSVSGAPGQRVTISCTGSSSNIGAGFDVHWYQQLPGTAP KLLIYGNSNRPSGVPDRFSGSKSGTSASLAITGLQAEDEADYYCQS YDSSLSGYVFGTGTKVTVLGSRGGGGSGGGGSGGGGSLEMAQVQ LVQSGAEVKKPGASVKVSCKASGYTFTDYYMEIWVRQAPGQRLE WMGWINPNSGGTNYAQKFQDRITVTRDTSSNTGYMELTRLRSDD TAVYYCARSPYSGVLDKWGQGTLVTVSS [SEQ ID NO: 86]
[0142] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO: 87 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-187 scFv (also referred to as "ET140-37 scFv").
[0143] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:61 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:62, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 16. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:61, as shown in Table 16. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:61, as shown in Table 16. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:62, as shown in Table 16. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:62, as shown in Table 16. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:61 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:62, as shown in Table 16. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:179 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 180 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 181 or conservative modifications thereof, as shown in Table 16. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 182 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 183 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 184 or conservative modifications thereof, as shown in Table 16. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 179 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 180 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 181 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 182 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 183 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 184 or conservative modifications thereof, as shown in Table 16. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 179, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 180, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 181, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 182, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 183 and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 184.
TABLE-US-00021 TABLE 16 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GGTFSSYA [SEQ IIPILGTA [SEQ ID ARSGYGSYRWEDS ID NO: 179] NO: 180] [SEQ ID NO: 181] V.sub.L SSNIGSNY [SEQ SNN [SEQ ID NO: AAWDDSLSASYV ID NO: 182] 183] [SEQ ID NO: 184] Full V.sub.H QVQLVQSGAEVKKPGSSVKVSCKASGGTFSSYAISWVRQAPGQG LEWMGRIIPILGTANYAQKFQGRVTITADESTSTAYMELSSLRSED TAVYYCARSGYGSYRWEDSWGQGTLVTVSS [SEQ ID NO: 61] DNA Caggtgcagctggtgcagtctggggctgaggtgaagaagcctgggtcctcggtgaaggtctcctgcaa ggcttctggaggcaccttcagcagctatgctatcagctgggtgcgacaggcccctggacaagggcttga gtggatgggaaggatcatccctatccttggtacagcaaactacgcacagaagttccagggcagagtcac gattaccgcggacgaatccacgagcacagcctacatggagctgagcagcctgagatctgaggacacgg ccgtgtattactgtgcgcgctctggttacggttcttaccgttgggaagattcttggggtcaaggtactc tggtgaccgtctcctca [SEQ ID NO: 63] Full V.sub.L QAVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVFWYQQLPGTAPK LLIYSNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAW DDSLSASYVFGTGTKVTVLG [SEQ ID NO: 62] DNA Caggctgtgctgactcagccaccctcagcgtctgggacccccgggcagagggtcaccatctcttgttct ggaagcagctccaacatcggaagtaattacgtattctggtaccagcagctcccaggaacggcccccaaa ctcctcatctatagtaataatcagcggccctcaggggtccctgaccgattctctggctccaagtctggcacc tcagcctccctggccatcagtgggctccggtccgaggatgaggctgattattactgtgcagcatgggatg acagcctgagtgcctcttatgttttcggaactgggaccaaggtcaccgtcctaggt [SEQ ID NO: 64] scFv QAVLTQPPSASGTPGQRVTISCSGSSSNIGSNYVFWYQQLPGTAPK LLIYSNNQRPSGVPDRFSGSKSGTSASLAISGLRSEDEADYYCAAW DDSLSASYVFGTGTKVTVLGSRGGGGSGGGGSGGGGSLEMAQVQ LVQSGAEVKKPGSSVKVSCKASGGTFSSYAISWVRQAPGQGLEW MGRIIPILGTANYAQKFQGRVTITADESTSTAYMELSSLRSEDTAV YYCARSGYGSYRWEDSWGQGTLVTVSS [SEQ ID NO: 87]
[0144] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises the amino acid sequence of SEQ ID NO:88 and specifically binds to a BCMA polypeptide (e.g., a BCMA polypeptide having the amino acid sequence SEQ ID NO:71, or fragments thereof), which is designated as ET140-174 scFv (also referred to as "ET140-24 scFv").
[0145] In certain embodiments, the extracellular antigen-binding domain is a human scFv, which comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:65 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:66, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 17. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:65, as shown in Table 17. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:65, as shown in Table 17. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:66, as shown in Table 17. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:66, as shown in Table 17. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:65 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:66, as shown in Table 17. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:185 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 186 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 187 or conservative modifications thereof, as shown in Table 17. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 188 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 189 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 190 or conservative modifications thereof, as shown in Table 17. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 185 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 186 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 187 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 188 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 189 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 190 or conservative modifications thereof, as shown in Table 17. In certain embodiments, the extracellular antigen-binding domain comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 185, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 186, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 187, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO: 188, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO: 189, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO: 190.
TABLE-US-00022 TABLE 17 A BCMA polypeptide having the amino acid sequence of SEQ ID NO: 71 Antigen CDRs 1 2 3 V.sub.H GYSFTSYW [SEQ IYPGDSDT [SEQ ID ARYSGSFDN [SEQ ID NO: 185] NO: 186] ID NO: 187] V.sub.L SSNIGSHS [SEQ TNN [SEQ ID NO: AAWDGSLNGLV ID NO: 188] 189] [SEQ ID NO: 190] Full V.sub.H EVQLVQSGAEVKKPGESLKISCKGSGYSFTSYWIGWVRQMPGKG LEWMGIIYPGDSDTRYSPSFQGHVTISADKSISTAYLQWSSLKASD TAMYYCARYSGSFDNWGQGTLVTVSS [SEQ ID NO: 65] DNA Gaggtgcagctggtgcagtctggagcagaggtgaaaaagcccggggagtctctgaagatctcctgta agggttctggatacagctttaccagctactggatcggctgggtgcgccagatgcccgggaaaggcctg gagtggatggggatcatctatcctggtgactctgataccagatacagcccgtccttccaaggccacgtca ccatctcagctgacaagtccatcagcactgcctacctgcagtggagcagcctgaaggcctcggacacc gccatgtattactgtgcgcgctactctggttctttcgataactggggtcaaggtactctggtgaccgtct cctca [SEQ ID NO: 67] Full V.sub.L SYELTQPPSASGTPGQRVTMSCSGTSSNIGSHSVNWYQQLPGTAP KLLIYTNNQRPSGVPDRFSGSKSGTSASLAISGLQSEDEADYYCA AWDGSLNGLVFGGGTKLTVLG [SEQ ID NO: 66] DNA Tcctatgagctgactcagccaccctcagcgtctgggacccccgggcagagggtcaccatgtcttgttct ggaaccagctccaacatcggaagtcactctgtaaactggtaccagcagctcccaggaacggcccccaa actcctcatctatactaataatcagcggccctcaggggtccctgaccgattctctggctccaagtctggca cctcagcctccctggccatcagtggcctccagtctgaggatgaggctgattattactgtgcagcatggga tggcagcctgaatggtctggtattcggcggagggaccaagctgaccgtcctaggt [SEQ ID NO: 68] scFv SYELTQPPSASGTPGQRVTMSCSGTSSNIGSHSVNWYQQLPGTAP KLLIYTNNQRPSGVPDRFSGSKSGTSASLAISGLQSEDEADYYCA AWDGSLNGLVFGGGTKLTVLGSRGGGGSGGGGSGGGGSLEMAE VQLVQSGAEVKKPGESLKISCKGSGYSFTSYWIGWVRQMPGKGL EWMGIIYPGDSDTRYSPSFQGHVTISADKSISTAYLQWSSLKASDT AMYYCARYSGSFDNWGQGTLVTVSS [SEQ ID NO: 88]
[0146] An extracellular antigen-binding domain (e.g., scFv) comprising V.sub.H and/or V.sub.L regions having high (i.e., 80% or greater) homology to the V.sub.H and V.sub.L regions of the sequences set forth above, can be obtained by mutagenesis (e.g., site-directed or PCR-mediated mutagenesis), followed by testing of the encoded altered scFv for retained function (i.e., the binding affinity) using the binding assays described herein. In certain embodiments, a V.sub.H sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity contains substitutions (e.g., conservative substitutions to generate conservative modifications of a sequence), insertions or deletions relative to the reference sequence, but an extracellular antigen-binding domain (e.g., scFv) comprising that sequence retains the ability to bind to a BCMA polypeptide. In certain embodiments, a V.sub.L sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity contains substitutions (e.g., conservative substitutions), insertions or deletions relative to the reference sequence, but an extracellular antigen-binding domain (e.g., scFv) comprising that sequence retains the ability to bind to a BCMA polypeptide. In certain embodiments, a total of about 1 to about 10 amino acids have been substituted, inserted and/or deleted in the disclosed sequences. For example, and not by way of limitation, a V.sub.H sequence or a V.sub.L sequence, can have up to about one, up to about two, up to about three, up to about four, up to about five, up to about six, up to about seven, up to about eight, up to about nine or up to about ten amino acid residues that are modified and/or substituted. Non-limiting examples of conservative modifications are provided below, e.g., within Table 18.
[0147] As used herein, the term "conservative sequence modifications" refers to amino acid modifications that do not significantly affect or alter the binding characteristics of the presently disclosed CAR (e.g., the extracellular antigen-binding domain) comprising the amino acid sequence. Such conservative modifications include amino acid substitutions, additions and deletions. Modifications can be introduced into the human scFv of the presently disclosed subject matter by standard techniques known in the art, such as site-directed mutagenesis and PCR-mediated mutagenesis. Amino acids can be classified into groups according to their physicochemical properties such as charge and polarity. Conservative amino acid substitutions are ones in which the amino acid residue is replaced with an amino acid within the same group. For example, amino acids can be classified by charge: positively-charged amino acids include lysine, arginine, histidine, negatively-charged amino acids include aspartic acid, glutamic acid, neutral charge amino acids include alanine, asparagine, cysteine, glutamine, glycine, isoleucine, leucine, methionine, phenylalanine, proline, serine, threonine, tryptophan, tyrosine, and valine. In addition, amino acids can be classified by polarity: polar amino acids include arginine (basic polar), asparagine, aspartic acid (acidic polar), glutamic acid (acidic polar), glutamine, histidine (basic polar), lysine (basic polar), serine, threonine, and tyrosine; non-polar amino acids include alanine, cysteine, glycine, isoleucine, leucine, methionine, phenylalanine, proline, tryptophan, and valine. Thus, one or more amino acid residues within a CDR region can be replaced with other amino acid residues from the same group and the altered antibody can be tested for retained function (i.e., the functions set forth in (c) through (1) above) using the functional assays described herein. In certain embodiments, no more than one, no more than two, no more than three, no more than four, no more than five residues within a specified sequence or a CDR region are altered. Exemplary conservative amino acid substitutions are shown in Table 18.
TABLE-US-00023 TABLE 18 Exemplary conservative amino acid Original Residue Substitutions Ala (A) Val; Leu; Ile Arg (R) Lys; Gln; Asn Asn (N) Gln; His; Asp, Lys; Arg Asp (D) Glu; Asn Cys (C) Ser; Ala Gln (Q) Asn; Glu Glu (E) Asp; Gln Gly (G) Ala His (H) Asn; Gln; Lys; Arg Ile (I) Leu; Val; Met; Ala; Phe Leu (L) Ile; Val; Met; Ala; Phe Lys (K) Arg; Gln; Asn Met (M) Leu; Phe; Ile Phe (F) Trp; Leu; Val; Ile; Ala; Tyr Pro (P) Ala Ser (S) Thr Thr (T) Val; Ser Trp (W) Tyr; Phe Tyr (Y) Trp; Phe; Thr; Ser Val (V) Ile; Leu; Met; Phe; Ala
[0148] In certain non-limiting embodiments, an extracellular antigen-binding domain of the CAR can comprise a linker connecting the heavy chain variable region and light chain variable region of the extracellular antigen-binding domain. As used herein, the term "linker" refers to a functional group (e.g., chemical or polypeptide) that covalently attaches two or more polypeptides or nucleic acids so that they are connected to one another. As used herein, a "peptide linker" refers to one or more amino acids used to couple two proteins together (e.g., to couple V.sub.H and V.sub.L domains). Non-limiting examples of peptide linkers are disclosed in Shen et al., Anal. Chem. 80(6):1910-1917 (2008) and WO 2014/087010.
[0149] In one non-limiting example, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the nucleotide sequence encoding the amino acid sequence of SEQ ID NO:69 is set forth in SEQ ID NO:70. In one non-limiting example, the linker is a G45 linker that comprises amino acids having the sequence set forth in SEQ ID NO:210. In certain embodiments, the nucleotide sequence encoding the amino acid sequence of SEQ ID NO:98 is set forth in SEQ ID NO:211.
[0150] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:212 as provided below.
TABLE-US-00024 [SEQ ID NO: 212] GGGGS.
[0151] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:213 as provided below.
TABLE-US-00025 [SEQ ID NO: 213] SGGSGGS.
[0152] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:214 as provided below.
TABLE-US-00026 [SEQ ID NO: 214] GGGGSGGGS.
[0153] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:215 as provided below.
TABLE-US-00027 [SEQ ID NO: 215] GGGGSGGGGS.
[0154] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:216 as provided below.
TABLE-US-00028 [SEQ ID NO: 216] GGGGSGGGGSGGGGGGGS.
[0155] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:217 as provided below.
TABLE-US-00029 [SEQ ID NO: 217] GGGGSGGGGSGGGGSGGGGS.
[0156] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:218 as provided below.
TABLE-US-00030 [SEQ ID NO: 218] GGGGSGGGGSGGGGSGGGGSGGGGS.
[0157] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:219 as provided below.
TABLE-US-00031 [SEQ ID NO: 219] GGGGSGGGGSGGGGSGGGGSGGGGSGGGGS.
[0158] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:220 as provided below.
TABLE-US-00032 [SEQ ID NO: 220] GGGGSGGGGSGGGGSGGGGSGGGGSGGGGSGGGGS.
[0159] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:220 as provided below.
TABLE-US-00033 [SEQ ID NO: 220] EPKSCDKTHTCPPCP.
[0160] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:221 as provided below.
TABLE-US-00034 [SEQ ID NO: 222] GGGGSGGGSEPKSCDKTHTCPPCP.
[0161] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:223 as provided below.
TABLE-US-00035 [SEQ ID NO: 223] ELKTPLGDTTHTCPRCPEPKSCDTPPPCPRCPEPKSCDTPPPCPRCPEPK SCDTPPPCPRCP.
[0162] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:224 as provided below.
TABLE-US-00036 [SEQ ID NO: 224] GSGSGS.
[0163] In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:225 as provided below.
TABLE-US-00037 [SEQ ID NO: 225] AAA.
[0164] In addition, the extracellular antigen-binding domain can comprise a leader or a signal peptide that directs the nascent protein into the endoplasmic reticulum. Signal peptide or leader can be essential if the CAR is to be glycosylated and anchored in the cell membrane. The signal sequence or leader can be a peptide sequence (about 5, about 10, about 15, about 20, about 25, or about 30 amino acids long) present at the N-terminus of newly synthesized proteins that directs their entry to the secretory pathway. In non-limiting examples, the signal peptide is covalently joined to the 5' terminus of the extracellular antigen-binding domain. In certain embodiments, the signal peptide comprises a CD8 polypeptide comprising amino acids having the sequence set forth in SEQ ID NO:191 as provided below.
TABLE-US-00038 [SEQ ID NO: 191] MALPVTALLLPLALLLHAAR
The nucleotide sequence encoding the amino acid sequence of SEQ ID NO:191 is set forth in SEQ ID NO:192, which is provided below:
TABLE-US-00039 [SEQ ID NO: 192] ATGGCTCTCCCAGTGACTGCCCTACTGCTTCCCCTAGCGCTTCTCCTGCA TGCAGCTCGT
[0165] In another embodiment, the signal peptide comprises amino acids having the sequence set forth in SEQ ID NO:205 as provided below.
TABLE-US-00040 METDTLLLWVLLLWVPGSTG [SEQ ID NO: 205]
The nucleotide sequence encoding the amino acid sequence of SEQ ID NO:205 is set forth in SEQ ID NO:206, which is provided below:
TABLE-US-00041 [SEQ ID NO: 206] ATGGAAACCGACACCCTGCTGCTGTGGGTGCTGCTGCTGTGGGTGCCAGG ATCCACAGGA
[0166] In certain embodiments, the human scFv comprises a heavy chain variable region, a light chain variable region, a linker peptide between the heavy chain variable region and the light chain variable region, and an His-tag and an HA-tag. In certain embodiments, the amino acid sequence of the His-tag and HA-tag comprises the amino acid sequence of SEQ ID NO:275, which is provided below:
TABLE-US-00042 TSGQAGQHHHHHHGAYPYDVPDYAS [SEQ ID NO: 275]
[0167] The nucleotide sequence encoding SEQ ID NO: 275 is SEQ ID NO: 276, which is provided below:
TABLE-US-00043 [SEQ ID NO: 276] ACTAGTGGCCAGGCCGGCCAGCACCATCACCATCACCATGGCGCATACCC GTACGACGTTCCGGACTACGCTTCT
[0168] In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) binds to a human BCMA polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 71. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) binds to one or more portion of the amino acid sequence set forth in SEQ ID NO: 71. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) binds to one, two, three, four, five, six, or seven epitope region selected from the group consisting of amino acids 8-22, 9-23, 10-24, 11-25, 12-26, 13-27, 14-28 and 8-28 of SEQ ID NO: 71. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:9 and a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:10, optionally with (iii) a linker sequence, for example a linker peptide, between the heavy chain variable region and the light chain variable region. In certain embodiments, the linker comprises amino acids having the sequence set forth in SEQ ID NO:69. In certain embodiments, the extracellular antigen-binding domain that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 is a human scFv with V.sub.H and V.sub.L regions or CDRs selected from Table 6. In certain embodiments, the extracellular antigen-binding domain that binds to amino acids 14-22 of SEQ ID NO: 71 comprises is a human scFv-Fc fusion protein or full length human IgG with V.sub.H and V.sub.L regions or CDRs selected from Table 6. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 comprises a V.sub.H comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:21. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:21. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 comprises a V.sub.L comprising an amino acid sequence that is at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% homologous to the amino acid sequence set forth in SEQ ID NO:22. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 comprises a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:22. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 comprises a V.sub.H comprising amino acids having the sequence set forth in SEQ ID NO:21 and a V.sub.L comprising amino acids having the sequence set forth in SEQ ID NO:22. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:119 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:120 or conservative modifications thereof, and a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:121 or conservative modifications thereof. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 comprises a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:122 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:123 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:124 or conservative modifications thereof. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:119 or conservative modifications thereof, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:120 or conservative modifications thereof, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:121 or conservative modifications thereof, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:122 or conservative modifications thereof, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:123 or conservative modifications thereof, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:124 or conservative modifications thereof. In certain embodiments, the extracellular antigen-binding domain (e.g., a human scFv) that binds to an epitope region comprising amino acids 14-22 of SEQ ID NO: 71 comprises a V.sub.H CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:119, a V.sub.H CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:120, a V.sub.H CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:121, a V.sub.L CDR1 comprising amino acids having the sequence set forth in SEQ ID NO:122, a V.sub.L CDR2 comprising amino acids having the sequence set forth in SEQ ID NO:123, and a V.sub.L CDR3 comprising amino acids having the sequence set forth in SEQ ID NO:124. In certain embodiments, the extracellular antigen-binding domain is ET140-3 (or "ET140-153") scFv.
[0169] Transmembrane Domain of a CAR
[0170] In certain non-limiting embodiments, the transmembrane domain of the CAR comprises a hydrophobic alpha helix that spans at least a portion of the membrane. Different transmembrane domains result in different receptor stability. After antigen recognition, receptors cluster and a signal is transmitted to the cell. In accordance with the presently disclosed subject matter, the transmembrane domain of the CAR can comprise a CD8 polypeptide, a CD28 polypeptide, a CD3.zeta. polypeptide, a CD4 polypeptide, a 4-1BB polypeptide, an OX40 polypeptide, an ICOS polypeptide, a CTLA-4 polypeptide, a PD-1 polypeptide, a LAG-3 polypeptide, a 2B4 polypeptide, a BTLA polypeptide, a synthetic peptide (not based on a protein associated with the immune response), or a combination thereof.
[0171] In certain embodiments, the transmembrane domain of a presently disclosed CAR comprises a CD28 polypeptide. The CD28 polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or 100% homologous to the sequence having a NCBI Reference No: P10747 or NP_006130 (SEQ ID No:193), or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions. In certain embodiments, the CD28 polypeptide can have an amino acid sequence that is a consecutive portion of SEQ ID NO: 193 which is at least 20, or at least 30, or at least 40, or at least 50, and up to 220 amino acids in length. Alternatively or additionally, in non-limiting various embodiments, the CD28 polypeptide has an amino acid sequence of amino acids 1 to 220, 1 to 50, 50 to 100, 100 to 150, 150 to 200, or 200 to 220 of SEQ ID NO: 193. In certain embodiments, the CAR of the presently disclosed comprises a transmembrane domain comprising a CD28 polypeptide, and an intracellular domain comprising a co-stimulatory signaling region that comprises a CD28 polypeptide. In certain embodiments, the CD28 polypeptide comprised in the transmembrane domain and the intracellular domain has an amino acid sequence of amino acids 114 to 220 of SEQ ID NO: 193.
[0172] SEQ ID NO: 193 is provided below:
TABLE-US-00044 [SEQ ID NO: 193] 1 MLRLLLALNL FPSIQVTGNK ILVKQSPMLV AYDNAVNLSC KYSYNLFSRE FRASLHKGLD 61 SAVEVCVVYG NYSQQLQVYS KTGFNCDGKL GNESVTFYLQ NLYVNQTDIY FCKIEVMYPP 121 PYLDNEKSNG TIIHVKGKHL CPSPLFPGPS KPFWVLVVVG GVLACYSLLV TVAFIIFWVR 181 SKRSRLLHSD YMNMTPRRPG PTRKHYQPYA PPRDFAAYRS
[0173] In accordance with the presently disclosed subject matter, a "CD28 nucleic acid molecule" refers to a polynucleotide encoding a CD28 polypeptide. In certain embodiments, the CD28 nucleic acid molecule encoding the CD28 polypeptide comprised in the transmembrane domain and the intracellular domain (e.g., the co-stimulatory signaling region) of the presently disclosed CAR (amino acids 114 to 220 of SEQ ID NO: 193) comprises nucleic acids having the sequence set forth in SEQ ID NO: 194 as provided below.
TABLE-US-00045 [SEQ ID NO: 194] ATTGAAGTTATGTATCCTCCTCCTTACCTAGACAATGAGAAGAGCAATGG AACCATTATCCATGTGAAAGGGAAACACCTTTGTCCAAGTCCCCTATTTC CCGGACCTTCTAAGCCCTTTTGGGTGCTGGTGGTGGTTGGTGGAGTCCTG GCTTGCTATAGCTTGCTAGTAACAGTGGCCTTTATTATTTTCTGGGTGAG GAGTAAGAGGAGCAGGCTCCTGCACAGTGACTACATGAACATGACTCCCC GCCGCCCCGGGCCCACCCGCAAGCATTACCAGCCCTATGCCCCACCACGC GACTTCGCAGCCTATCGCTCC
[0174] In certain embodiments, the transmembrane domain of a presently disclosed CAR comprises a CD8 polypeptide. The CD8 polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or 100% homologous to the sequence having a NCBI Reference No: AAH25715 (SEQ ID No:226), or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions. In non-limiting embodiments, the CD8 polypeptide can have an amino acid sequence that is a consecutive portion of SEQ ID NO: 226 which is at least 20, or at least 30, or at least 40, or at least 50, or at least 70, or at least 100, or at least 150, or at least 200 and up to 235 amino acids in length. Alternatively or additionally, in non-limiting various embodiments, the CD28 polypeptide has an amino acid sequence of amino acids 1 to 235, 1 to 50, 50 to 100, 100 to 150, 150 to 200, 130 to 210, or 200 to 235 of SEQ ID NO: 226. In certain embodiments, the CD8 polypeptide comprised in the transmembrane domain has an amino acid sequence of amino acids 137 to 207 of SEQ ID NO: 226.
[0175] SEQ ID NO: 226 is provided below:
TABLE-US-00046 [SEQ ID NO: 226] 1 MALPVTALLL PLALLLHAAR PSQFRVSPLD RTWNLGETVE LKCQVLLSNP TSGCSWLFQP 61 RGAAASPTFL LYLSQNKPKA AEGLDTQRFS GKRLGDTFVL TLSDFRRENE GCYFCSALSN 121 SIMYFSHFVP VFLPAKPTTT PAPRPPTPAP TIASQPLSLR PEACRPAAGG AVHTRGLDFA 181 CDIYIWAPLA GTCGVLLLSL VITLYCNHRN RRRVCKCPRP VVKSGDKPSL SARYV.
[0176] In accordance with the presently disclosed subject matter, a "CD8 nucleic acid molecule" refers to a polynucleotide encoding a CD8 polypeptide. In certain embodiments, the CD8 nucleic acid molecule encoding the CD8 polypeptide comprised in the transmembrane domain of the presently disclosed CAR (amino acids 137 to 207 of SEQ ID NO: 226) comprises nucleic acids having the sequence set forth in SEQ ID NO: 227 as provided below.
TABLE-US-00047 [SEQ ID NO: 227] CCCACCACGACGCCAGCGCCGCGACCACCAACCCCGGCGCCCACGATCGC GTCGCAGCCCCTGTCCCTGCGCCCAGAGGCGTGCCGGCCAGCGGCGGGGG GCGCAGTGCACACGAGGGGGCTGGACTTCGCCTGTGATATCTACATCTGG GCGCCCCTGGCCGGGACTTGTGGGGTCCTTCTCCTGTCACTGGTTATCAC CCTTTACTGCAAC
[0177] In certain non-limiting embodiments, a CAR can also comprise a spacer region that links the extracellular antigen-binding domain to the transmembrane domain. The spacer region can be flexible enough to allow the antigen binding domain to orient in different directions to facilitate antigen recognition. The spacer region can be the hinge region from IgG1, or the CH.sub.2CH.sub.3 region of immunoglobulin and portions of CD3.
[0178] Intracellular Domain of a CAR
[0179] In certain non-limiting embodiments, an intracellular domain of the CAR can comprise a CD3.zeta. polypeptide, which can activate or stimulate a cell (e.g., a cell of the lymphoid lineage, e.g., a T cell). CD3.zeta. comprises three ITAMs, and transmits an activation signal to the cell (e.g., a cell of the lymphoid lineage, e.g., a T cell) after antigen is bound. The CD3.zeta. polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or about 100% homologous to the sequence set forth in SEQ ID NO: 195, or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions. In non-limiting embodiments, the CD3.zeta. polypeptide can have an amino acid sequence that is a consecutive portion of SEQ ID NO: 195 which is at least 20, or at least 30, or at least 40, or at least 50, and up to 163 amino acids in length. Alternatively or additionally, in non-limiting various embodiments, the CD3.zeta. polypeptide has an amino acid sequence of amino acids 1 to 163, 1 to 50, 50 to 100, 100 to 150, or 150 to 163 of SEQ ID NO: 195. In certain embodiments, the CD3.zeta. polypeptide comprised in the intracellular domain of a presently disclosed CAR has an amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195.
[0180] SEQ ID NO: 195 is provided below:
TABLE-US-00048 [SEQ ID NO: 195] 1 MKWKALFTAA ILQAQLPITE AQSFGLLDPK LCYLLDGILF IYGVILTALF LRVKFSRSAD 61 APAYQQGQNQ LYNELNLGRR EEYDVLDKRR GRDPEMGGKP RRKNPQEGLY NELQKDKMAE 121 AYSEIGMKGE RRRGKGHDGL YQGLSTATKD TYDALHMQAL PPR
[0181] In accordance with the presently disclosed subject matter, a "CD3.zeta. nucleic acid molecule" refers to a polynucleotide encoding a CD3.zeta. polypeptide. In certain embodiments, the CD3.zeta. nucleic acid molecule encoding the CD3.zeta. polypeptide comprised in the intracellular domain of a presently disclosed CARs (amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195) comprises nucleic acids having the sequence set forth in SEQ ID NO:196 as provided below.
TABLE-US-00049 [SEQ ID NO: 196] AGAGTGAAGTTCAGCAGGAGCGCAGACGCCCCCGCGTACCAGCAGGGCCA GAACCAGCTCTATAACGAGCTCAATCTAGGACGAAGAGAGGAGTACGATG TTTTGGACAAGAGACGTGGCCGGGACCCTGAGATGGGGGGAAAGCCGAGA AGGAAGAACCCTCAGGAAGGCCTGTACAATGAACTGCAGAAAGATAAGAT GGCGGAGGCCTACAGTGAGATTGGGATGAAAGGCGAGCGCCGGAGGGGCA AGGGGCACGATGGCCTTTACCAGGGTCTCAGTACAGCCACCAAGGACACC TACGACGCCCTTCACATGCAGGCCCTGCCCCCTCGCTAA
[0182] In certain non-limiting embodiments, an intracellular domain of the CAR further comprises at least one signaling region. The at least one signaling region can include a CD28 polypeptide, a 4-1BB polypeptide, an OX40 polypeptide, an ICOS polypeptide, a DAP-10 polypeptide, a PD-1 polypeptide, a CTLA-4 polypeptide, a LAG-3 polypeptide, a 2B4 polypeptide, a BTLA polypeptide, a synthetic peptide (not based on a protein associated with the immune response), or a combination thereof.
[0183] In certain embodiments, the signaling region is a co-stimulatory signaling region. In certain embodiments, the co-stimulatory region comprises at least one co-stimulatory molecule, which can provide optimal lymphocyte activation. As used herein, "co-stimulatory molecules" refer to cell surface molecules other than antigen receptors or their ligands that are required for an efficient response of lymphocytes to antigen. The at least one co-stimulatory signaling region can include a CD28 polypeptide, a 4-1BB polypeptide, an OX40 polypeptide, an ICOS polypeptide, a DAP-10 polypeptide, or a combination thereof. The co-stimulatory molecule can bind to a co-stimulatory ligand, which is a protein expressed on cell surface that upon binding to its receptor produces a co-stimulatory response, i.e., an intracellular response that effects the stimulation provided when an antigen binds to its CAR molecule. Co-stimulatory ligands, include, but are not limited to CD80, CD86, CD70, OX40L, 4-1BBL, CD48, TNFRSF14, and PD-L1. As one example, a 4-1BB ligand (i.e., 4-1BBL) may bind to 4-1BB (also known as "CD137") for providing an intracellular signal that in combination with a CAR signal induces an effector cell function of the CAR' T cell. CARs comprising an intracellular domain that comprises a co-stimulatory signaling region comprising 4-1BB, ICOS or DAP-10 are disclosed in U.S. Pat. No. 7,446,190 (e.g., the nucleotide sequence encoding 4-1BB is set forth in SEQ ID NO:15, the nucleotide sequence encoding ICOS is set forth in SEQ ID NO:16, and the nucleotide sequence encoding DAP-10 is set forth in SEQ ID NO:17 in U.S. Pat. No. 7,446,190), which is herein incorporated by reference in its entirety. In certain embodiments, the intracellular domain of the CAR comprises a co-stimulatory signaling region that comprises a CD28 polypeptide. In certain embodiments, the intracellular domain of the CAR comprises a co-stimulatory signaling region that comprises two co-stimulatory molecules:CD28 and 4-1BB or CD28 and OX40.
[0184] 4-1BB can act as a tumor necrosis factor (TNF) ligand and have stimulatory activity. The 4-1BB polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or 100% homologous to the sequence having a NCBI Reference No: P41273 or NP_001552 (SEQ ID NO:197) or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions. In certain embodiments, the 4-1BB polypeptide comprised in the intracellular domain of a presently disclosed CAR has an amino acid sequence of amino acids 214-255 of SEQ ID NO: 197.
[0185] SEQ ID NO: 197 is provided below:
TABLE-US-00050 [SEQ ID NO: 197] 1 MGNSCYNIVA TLLLVLNFER TRSLQDPCSN CPAGTFCDNN RNQICSPCPP NSFSSAGGQR 61 TCDICRQCKG VFRTRKECSS TSNAECDCTP GFHCLGAGCS MCEQDCKQGQ ELTKKGCKDC 121 CFGTFNDQKR GICRPWTNCS LDGKSVLVNG TKERDVVCGP SPADLSPGAS SVTPPAPARE 181 PGHSPQIISF FLALTSTALL FLLFFLTLRF SVVKRGRKKL LYIFKQPFMR PVQTTQEEDG 241 CSCRFPEEEE GGCEL
[0186] In accordance with the presently disclosed subject matter, a "4-1BB nucleic acid molecule" refers to a polynucleotide encoding a 4-1BB polypeptide. In certain embodiments, the 4-1BB nucleic acid molecule encoding the 4-1BB polypeptide comprised in the intracellular domain of a presently disclosed CARs (amino acids 214-255 of SEQ ID NO: 197) comprises nucleic acids having the sequence set forth in SEQ ID NO: 228 as provided below.
TABLE-US-00051 [SEQ ID NO: 228] aaacggggcagaaagaagctcctgtatatattcaaacaaccatttatgag accagtacaaactactcaagaggaagatggctgtagctgccgatttccag aagaagaagaaggaggatgtgaactg
[0187] An OX40 polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or 100% homologous to the sequence having a NCBI Reference No: P43489 or NP_003318 (SEQ ID NO: 198), or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions.
[0188] SEQ ID NO: 198 is provided below:
TABLE-US-00052 [SEQ ID NO: 198] 1 MCVGARRLGR GPCAALLLLG LGLSTVTGLH CVGDTYPSND RCCHECRPGN GMVSRCSRSQ 61 NTVCRPCGPG FYNDVVSSKP CKPCTWCNLR SGSERKQLCT ATQDTVCRCR AGTQPLDSYK 121 PGVDCAPCPP GHFSPGDNQA CKPWTNCTLA GKHTLQPASN SSDAICEDRD PPATQPQETQ 181 GPPARPITVQ PTEAWPRTSQ GPSTRPVEVP GGRAVAAILG LGLVLGLLGP LAILLALYLL 241 RRDQRLPPDA HKPPGGGSFR TPIQEEQADA HSTLAKI
[0189] In accordance with the presently disclosed subject matter, an "OX40 nucleic acid molecule" refers to a polynucleotide encoding an OX40 polypeptide.
[0190] An ICOS polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or 100% homologous to the sequence having a NCBI Reference No: NP_036224 (SEQ ID NO: 199) or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions.
[0191] SEQ ID NO: 199 is provided below:
TABLE-US-00053 [SEQ ID NO: 199] 1 MKSGLWYFFL FCLRIKVLTG EINGSANYEM FIFHNGGVQI LCKYPDIVQQ FKMQLLKGGQ 61 ILCDLTKTKG SGNTVSIKSL KFCHSQLSNN SVSFFLYNLD HSHANYYFCN LSIFDPPPFK 121 VTLTGGYLHI YESQLCCQLK FWLPIGCAAF VVVCILGCIL ICWLTKKKYS SSVHDPNGEY 181 MFMRAVNTAK KSRLTDVTL
[0192] In accordance with the presently disclosed subject matter, an "ICOS nucleic acid molecule" refers to a polynucleotide encoding an ICOS polypeptide.
[0193] CTLA-4 is an inhibitory receptor expressed by activated T cells, which when engaged by its corresponding ligands (CD80 and CD86; B7-1 and B7-2, respectively), mediates activated T cell inhibition or anergy. In both preclinical and clinical studies, CTLA-4 blockade by systemic antibody infusion, enhanced the endogenous anti-tumor response albeit, in the clinical setting, with significant unforeseen toxicities.
[0194] CTLA-4 contains an extracellular V domain, a transmembrane domain, and a cytoplasmic tail. Alternate splice variants, encoding different isoforms, have been characterized. The membrane-bound isoform functions as a homodimer interconnected by a disulfide bond, while the soluble isoform functions as a monomer. The intracellular domain is similar to that of CD28, in that it has no intrinsic catalytic activity and contains one YVKM motif able to bind PI3K, PP2A and SHP-2 and one proline-rich motif able to bind SH3 containing proteins. One role of CTLA-4 in inhibiting T cell responses seem to be directly via SHP-2 and PP2A dephosphorylation of TCR-proximal signaling proteins such as CD3 and LAT. CTLA-4 can also affect signaling indirectly via competing with CD28 for CD80/86 binding. CTLA-4 has also been shown to bind and/or interact with PI3K, CD80, AP2M1, and PPP2R5A.
[0195] In accordance with the presently disclosed subject matter, a CTLA-4 polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or about 100% homologous to UniProtKB/Swiss-Prot Ref. No.: P16410.3 (SEQ ID NO: 200) (homology herein may be determined using standard software such as BLAST or FASTA) or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions.
[0196] SEQ ID NO: 200 is provided below:
TABLE-US-00054 [SEQ ID NO: 200] 1 MACLGFQRHK AQLNLATRTW PCTLLFFLLF IPVFCKAMHV AQPAVVLASS RGIASFVCEY 61 ASPGKATEVR VTVLRQADSQ VTEVCAATYM MGNELTFLDD SICTGTSSGN QVNLTIQGLR 121 AMDTGLYICK VELMYPPPYY LGIGNGTQIY VIDPEPCPDS DFLLWILAAV SSGLFFYSFL 181 LTAVSLSKML KKRSPLTTGV YVKMPPTEPE CEKQFQPYFI PIN
[0197] In accordance with the presently disclosed subject matter, a "CTLA-4 nucleic acid molecule" refers to a polynucleotide encoding a CTLA-4 polypeptide.
[0198] PD-1 is a negative immune regulator of activated T cells upon engagement with its corresponding ligands PD-L1 and PD-L2 expressed on endogenous macrophages and dendritic cells. PD-1 is a type I membrane protein of 268 amino acids. PD-1 has two ligands, PD-L1 and PD-L2, which are members of the B7 family. The protein's structure comprises an extracellular IgV domain followed by a transmembrane region and an intracellular tail. The intracellular tail contains two phosphorylation sites located in an immunoreceptor tyrosine-based inhibitory motif and an immunoreceptor tyrosine-based switch motif, that PD-1 negatively regulates TCR signals. SHP-I and SHP-2 phosphatases bind to the cytoplasmic tail of PD-1 upon ligand binding. Upregulation of PD-L1 is one mechanism tumor cells may evade the host immune system. In pre-clinical and clinical trials, PD-1 blockade by antagonistic antibodies induced anti-tumor responses mediated through the host endogenous immune system.
[0199] In accordance with the presently disclosed subject matter, a PD-1 polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or about 100% homologous to NCBI Reference No: NP_005009.2 (SEQ ID NO: 201) or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions.
[0200] SEQ ID NO: 201 is provided below:
TABLE-US-00055 [SEQ ID NO: 201] 1 MQIPQAPWPV VWAVLQLGWR PGWFLDSPDR PWNPPTFSPA LLVVTEGDNA TFTCSFSNTS 61 ESFVLNWYRM SPSNQTDKLA AFPEDRSQPG QDCRFRVTQL PNGRDFHMSV VRARRNDSGT 121 YLCGAISLAP KAQIKESLRA ELRVTERRAE VPTAHPSPSP RPAGQFQTLV VGVVGGLLGS 181 LVLLVWVLAV ICSRAARGTI GARRTGQPLK EDPSAVPVFS VDYGELDFQW REKTPEPPVP 241 CVPEQTEYAT IVFPSGMGTS SPARRGSADG PRSAQPLRPE DGHCSWPL
[0201] In accordance with the presently disclosed subject matter, a "PD-1 nucleic acid molecule" refers to a polynucleotide encoding a PD-1 polypeptide.
[0202] Lymphocyte-activation protein 3 (LAG-3) is a negative immune regulator of immune cells. LAG-3 belongs to the immunoglobulin (1 g) superfamily and contains 4 extracellular Ig-like domains. The LAG3 gene contains 8 exons. The sequence data, exon/intron organization, and chromosomal localization all indicate a close relationship of LAG3 to CD4. LAG3 has also been designated CD223 (cluster of differentiation 223).
[0203] In accordance with the the presently disclosed subject matter, a LAG-3 polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or about 100% homologous to UniProtKB/Swiss-Prot Ref. No.: P18627.5 (SEQ ID NO: 202) or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions.
[0204] SEQ ID NO: 202 is provided below:
TABLE-US-00056 [SEQ ID NO: 202] 1 MWEAQFLGLL FLQPLWVAPV KPLQPGAEVP VVWAQEGAPA QLPCSPTIPL QDLSLLRRAG 61 VTWQHQPDSG PPAAAPGHPL APGPHPAAPS SWGPRPRRYT VLSVGPGGLR SGRLPLQPRV 121 QLDERGRQRG DFSLWLRPAR RADAGEYRAA VHLRDRALSC RLRLRLGQAS MTASPPGSLR 181 ASDWVILNCS FSRPDRPASV HWFRNRGQGR VPVRESPHHH LAESFLFLPQ VSPMDSGPWG 241 CILTYRDGFN VSIMYNLTVL GLEPPTPLTV YAGAGSRVGL PCRLPAGVGT RSFLTAKWTP 301 PGGGPDLLVT GDNGDFTLRL EDVSQAQAGT YTCHIHLQEQ QLNATVTLAI ITVTPKSFGS 361 PGSLGKLLCE VTPVSGQERF VWSSLDTPSQ RSFSGPWLEA QEAQLLSQPW QCQLYQGERL 421 LGAAVYFTEL SSPGAQRSGR APGALPAGHL LLFLILGVLS LLLLVTGAFG FHLWRRQWRP 481 RRFSALEQGI HPPQAQSKIE ELEQEPEPEP EPEPEPEPEP EPEQL
[0205] In accordance with the presently disclosed subject matter, a "LAG-3 nucleic acid molecule" refers to a polynucleotide encoding a LAG-3 polypeptide.
[0206] Natural Killer Cell Receptor 2B4 (2B4) mediates non-MHC restricted cell killing on NK cells and subsets of T cells. To date, the function of 2B4 is still under investigation, with the 2B4-S isoform believed to be an activating receptor, and the 2B4-L isoform believed to be a negative immune regulator of immune cells. 2B4 becomes engaged upon binding its high-affinity ligand, CD48. 2B4 contains a tyrosine-based switch motif, a molecular switch that allows the protein to associate with various phosphatases. 2B4 has also been designated CD244 (cluster of differentiation 244).
[0207] In accordance with the presently disclosed subject matter, a 2B4 polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or about 100% homologous to UniProtKB/Swiss-Prot Ref. No.: Q9BZW8.2 (SEQ ID NO: 203) or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions.
[0208] SEQ ID NO: 203 is provided below:
TABLE-US-00057 [SEQ ID NO: 203] 1 MLGQVVTLIL LLLLKVYQGK GCQGSADHVV SISGVPLQLQ PNSIQTKVDS IAWKKLLPSQ 61 NGFHHILKWE NGSLPSNTSN DRFSFIVKNL SLLIKAAQQQ DSGLYCLEVT SISGKVQTAT 121 FQVFVFESLL PDKVEKPRLQ GQGKILDRGR CQVALSCLVS RDGNVSYAWY RGSKLIQTAG 181 NLTYLDEEVD INGTHTYTCN VSNPVSWESH TLNLTQDCQN AHQEFRFWPF LVIIVILSAL 241 FLGTLACFCV WRRKRKEKQS ETSPKEFLTI YEDVKDLKTR RNHEQEQTFP GGGSTIYSMI 301 QSQSSAPTSQ EPAYTLYSLI QPSRKSGSRK RNHSPSFNST IYEVIGKSQP KAQNPARLSR 361 KELENFDVYS
[0209] In accordance with the presently disclosed subject matter, a "2B4 nucleic acid molecule" refers to a polynucleotide encoding a 2B4 polypeptide.
[0210] B- and T-lymphocyte attenuator (BTLA) expression is induced during activation of T cells, and BTLA remains expressed on Th1 cells but not Th2 cells. Like PD1 and CTLA4, BTLA interacts with a B7 homolog, B7H4. However, unlike PD-1 and CTLA-4, BTLA displays T-Cell inhibition via interaction with tumor necrosis family receptors (TNF-R), not just the B7 family of cell surface receptors. BTLA is a ligand for tumor necrosis factor (receptor) superfamily, member 14 (TNFRSF14), also known as herpes virus entry mediator (HVEM). BTLA-HVEM complexes negatively regulate T-cell immune responses. BTLA activation has been shown to inhibit the function of human CD8.sup.+ cancer-specific T cells. BTLA has also been designated as CD272 (cluster of differentiation 272).
[0211] In accordance with the presently disclosed subject matter, a BTLA polypeptide can have an amino acid sequence that is at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99% or about 100% homologous to UniProtKB/Swiss-Prot Ref. No.: Q7Z6A9.3 (SEQ ID NO: 204) or fragments thereof, and/or may optionally comprise up to one or up to two or up to three conservative amino acid substitutions.
[0212] SEQ ID NO: 204 is provided below:
TABLE-US-00058 [SEQ ID NO: 204] 1 MKTLPAMLGT GKLFWVFFLI PYLDIWNIHG KESCDVQLYI KRQSEHSILA GDPFELECPV 61 KYCANRPHVT WCKLNGTTCV KLEDRQTSWK EEKNISFFIL HFEPVLPNDN GSYRCSANFQ 121 SNLIESHSTT LYVTDVKSAS ERPSKDEMAS RPWLLYRLLP LGGLPLLITT CFCLFCCLRR 181 HQGKQNELSD TAGREINLVD AHLKSEQTEA STRQNSQVLL SETGIYDNDP DLCFRMQEGS 241 EVYSNPCLEE NKPGIVYASL NHSVIGPNSR LARNVKEAPT EYASICVRS
[0213] In accordance with the presently disclosed subject matter, a "BTLA nucleic acid molecule" refers to a polynucleotide encoding a BTLA polypeptide.
[0214] In certain embodiments, the CAR comprises an extracellular antigen-binding region that comprises a human scFv that specifically binds to human BCMA, a transmembrane domain comprising a CD28 polypeptide, and an intracellular domain comprising a CD3.zeta. polypeptide and a co-stimulatory signaling region that comprises a CD28 polypeptide, as shown in FIG. 1. As shown in FIG. 1, the CAR also comprises a signal peptide or a leader covalently joined to the 5' terminus of the extracellular antigen-binding domain. In certain embodiments, the signal peptide comprises amino acids having the sequence set forth in SEQ ID NO:205.
[0215] In certain embodiments, the CAR comprises an extracellular antigen-binding region that comprises a human scFv that specifically binds to human BCMA, a transmembrane domain comprising a CD8 polypeptide, and an intracellular domain comprising a CD3.zeta. polypeptide and a co-stimulatory signaling region that comprises a 4-1BB polypeptide, as shown in FIG. 7. As shown in FIG. 7, the CAR also comprises a signal peptide or a leader covalently joined to the 5' terminus of the extracellular antigen-binding domain. In certain embodiments, the signal peptide comprises amino acids having the sequence set forth in SEQ ID NO:205.
[0216] In certain embodiments, the CAR of the presently disclosed subject matter can further comprise an inducible promoter, for expressing nucleic acid sequences in human cells. Promoters for use in expressing CAR genes can be a constitutive promoter, such as ubiquitin C (UbiC) promoter.
[0217] The presently disclosed subject matter also provides isolated nucleic acid molecule encoding the BCMA-targeted CAR described herein or a functional portion thereof. In certain embodiments, the isolated nucleic acid molecule encodes a presently disclosed BCMA-targeted CAR comprising a human scFv that specifically binds to human BCMA, a transmembrane domain comprising a CD28 polypeptide, and an intracellular domain comprising a CD3 polypeptide and a co-stimulatory signaling region comprising a CD28 polypeptide. In one specific non-limiting example, the isolated nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:207 provided below:
TABLE-US-00059 [SEQ ID NO: 207] caatctgccctgactcagcctgcctccgtgtctgcgtctcctggacagtcg atcgccatctcctgcactggaaccagcagtgacgttggttggtatcaacag cacccaggcaaagcccccaaactcatgatttatgaggacagtaagcggccc tcaggggtttctaatcgcttctctggctccaagtctggcaacacggcctcc ctgaccatctctgggctccaggctgaggacgaggctgattattactgcagc tcaaatacaagaagcagcactttggtgttcggcggagggaccaagctgacc gtcctaggttctagaggtggtggtggtagcggcggcggcggctctggtggt ggtggatccctcgagatggccgaagtgcagctggtgcagtctggggctgag atgaagaagcctggggcctcactgaagctctcctgcaaggcttctggatac accttcatcgactactatgtatactggatgcgacaggcccctggacaaggg cttgagtccatgggatggatcaaccctaacagtggtggcacaaactatgca cagaagtttcagggcagggtcaccatgaccagggacacgtccatcagcaca gcctacatggagctgagcaggctgagatctgacgacaccgccatgtattac tgtgcgcgctcccagcgtgacggttacatggattactggggtcaaggtact ctggtgaccgtctcctcagcggccgcaattgaagttatgtatcctcctcct tacctagacaatgagaagagcaatggaaccattatccatgtgaaagggaaa cacctttgtccaagtcccctatttcccggaccttctaagccdtttgggtgc tggtggtggttggtggagtcctggcttgctatagcttgctagtaacagtgg cctttattattttctgggtgaggagtaagaggagcaggctcctgcacagtg actacatgaacatgactccccgccgccccgggcccacccgcaagcattacc agccctatgccccaccacgcgacttcgcagcctatcgctccagagtgaagt tcagcaggagcgcagacgcccccgcgtaccagcagggccagaaccagctct ataacgagctcaatctaggacgaagagaggagtacgatgttttggacaaga gacgtggccgggaccctgagatggggggaaagccgagaaggaagaaccctc aggaaggcctgtacaatgaactgcagaaagataagatggcggaggcctaca gtgagattgggatgaaaggcgagcgccggaggggcaaggggcacgatggcc tttaccagggtctcagtacagccaccaaggacacctacgacgcccttcaca tgcaggccctgccccctcgc
[0218] In one specific non-limiting example, the isolated nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:208 provided below:
TABLE-US-00060 [SEQ ID NO: 208] cagtctgtgctgacgcagccgccctcagtgtctggggccccagggcagagg gtcaccatctcctgcactgggagcagctccaacatcggggcaggttttgat gtacactggtaccagcagcttccaggaacagcccccaaactcctcatctat ggtaacagcaatcggccctcaggggtccctgaccgattctctggctccaag tctggcacctcagcctccctggccatcactgggctccaggctgaggatgag gctgattattactgccagtcctatgacagcagcctgagtggttatgtcttc ggaactgggaccaaggtcaccgtcctaggttctagaggtggtggtggtagc ggcggcggcggctctggtggtggtggatccctcgagatggcccaggtccag ctggtacagtctggggctgaggtgaagaagcctggggcctcagtgaaggtc tcctgcaaggcttctggatacaccttcaccgactactatatgcactgggtg cgacaggcccctggacaacggcttgagtggatgggatggatcaaccctaac agtggtggcacaaactatgcacagaagtttcaggacaggatcaccgtgacc agggacacctccagcaacacaggctacatggagctgaccaggctgagatct gacgacacggccgtgtattactgtgcgcgctctccgtactctggtgttctg gataaatggggtcaaggtactctggtgaccgtctcctcagcggccgcaatt gaagttatgtatcctcctccttacctagacaatgagaagagcaatggaacc attatccatgtgaaagggaaacacctttgtccaagtcccctatttcccgga ccttctaagcccttttgggtgctggtggtggttggtggagtcctggcttgc tatagcttgctagtaacagtggcctttattattttctgggtgaggagtaag aggagcaggctcctgcacagtgactacatgaacatgactccccgccgcccc gggcccacccgcaagcattaccagccctatgccccaccacgcgacttcgca gcctatcgctccagagtgaagttcagcaggagcgcagacgcccccgcgtac cagcagggccagaaccagctctataacgagctcaatctaggacgaagagag gagtacgatgttttggacaagagacgtggccgggaccctgagatgggggga aagccgagaaggaagaaccctcaggaaggcctgtacaatgaactgcagaaa gataagatggcggaggcctacagtgagattgggatgaaaggcgagcgccgg aggggcaaggggcacgatggcctttaccagggtctcagtacagccaccaag gacacctacgacgcccttcacatgcaggccctgccccctcgc
[0219] In one specific non-limiting example, the isolated nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:209 provided below:
TABLE-US-00061 [SEQ ID NO: 209] tcctatgagctgactcagccaccctcagcgtctgggacccccgggcagagg gtcaccatgtcttgttctggaaccagctccaacatcggaagtcactctgta aactggtaccagcagctcccaggaacggcccccaaactcctcatctatact aataatcagcggccctcaggggtccctgaccgattctctggctccaagtct ggcacctcagcctccctggccatcagtggcctccagtctgaggatgaggct gattattactgtgcagcatgggatggcagcctgaatggtctggtattcggc ggagggaccaagctgaccgtcctaggttctagaggtggtggtggtagcggc ggcggcggctctggtggtggtggatccctcgagatggccgaggtgcagctg gtgcagtctggagcagaggtgaaaaagcccggggagtctctgaagatctcc tgtaagggttctggatacagctttaccagctactggatcggctgggtgcgc cagatgcccgggaaaggcctggagtggatggggatcatctatcctggtgac tctgataccagatacagcccgtccttccaaggccacgtcaccatctcagct gacaagtccatcagcactgcctacctgcagtggagcagcctgaaggcctcg gacaccgccatgtattactgtgcgcgctactctggttctttcgataactgg ggtcaaggtactctggtgaccgtctcctcagcggccgcaattgaagttatg tatcctcctccttacctagacaatgagaagagcaatggaaccattatccat gtgaaagggaaacacctttgtccaagtcccctatttcccggaccttctaag ccctttgggtgctggtggtggttggtggagtcctggcttgctatagcttgc tagtaacagtggcctttattattttctgggtgaggagtaagaggagcaggc tcctgcacagtgactacatgaacatgactccccgccgccccgggcccaccc gcaagcattaccagccctatgccccaccacgcgacttcgcagcctatcgct ccagagtgaagttcagcaggagcgcagacgcccccgcgtaccagcagggcc agaaccagctctataacgagctcaatctaggacgaagagaggagtacgatg ttttggacaagagacgtggccgggaccctgagatggggggaaagccgagaa ggaagaaccctcaggaaggcctgtacaatgaactgcagaaagataagatgg cggaggcctacagtgagattgggatgaaaggcgagcgccggaggggcaagg ggcacgatggcctttaccagggtctcagtacagccaccaaggacacctacg acgcccttcacatgcaggccctgccccctcgc
[0220] In one specific non-limiting example, the isolated nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:229 provided below:
TABLE-US-00062 [SEQ ID NO: 229] CCTTCTCTAGGCGCCCCCATATGGCCATATGAGATCTTATATGGGGCACCCCCGCCCCTTGTAAACTTCCCTGA- CCCTGA CATGACAAGAGTTACTAACAGCCCCTCTCTCCAAGCTCACTTACAGGCTCTCTACTTAGTCCAGCACGAAGTCT- GGAGAC CTCTGGCGGCAGCCTACCAAGAACAACTGGACCGACCGGTGCCGCCACCATGGAAACCGACACCCTGCTGCTGT- GGGTGC TGCTGCTGTGGGTGCCAGGATCCACAGGACTGCCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGG- CAGAGG GTCACCATCTCTTGTTCTGGACGCAGTTCCAACATCGGGAGTAATTCTGTTAACTGGTATCGACAACTCCCAGG- AGCGGC CCCCAAACTCCTCATCTATAGTAATAATCAGCGGCCCCCAGGGGTCCCTGTGCGATTCTCTGGCTCCAAGTCTG- GCACCT CAGCCTCCCTGGCCATCAGTGGGCTCCAGTCTGAAGATGAGGCCACTTATTACTGTGCAACATGGGATGACAAT- CTGAAT GTTCACTATGTCTTCGGAACTGGGACCAAGGTCACCGTCCTAGGTTCTAGAGGTGGTGGTGGTAGCGGCGGCGG- CGGCTC TGGTGGTGGTGGATCCCTCGAGATGGCCCAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGT- CCTCGG TGAAGGTCTCCTGCAAGGCTTCTGGAGGCACCTTCAGCAGCTATGCTATCAGCTGGGTGCGACAGGCCCCTGGA- CAAGGG CTTGAGTGGATGGGAAGGATCATCCCTATCCTTGGTATAGCAAACTACGCACAGAAGTTCCAGGGCAGAGTCAC- GATTAC CGCGGACAAATCCACGAGCACAGCCTACATGGAGCTGAGCAGCCTGAGATCTGAGGACACGGCCGTGTATTACT- GTGCGC GCGGTGGTTACTACTCTCATGACATGTGGTCTGAAGATTGGGGTCAAGGTACTCTGGTGACCGTCTCCTCAGCg- gccgca cccaccacgacgccagcgccgcgaccaccaaccccggcgcccacgatcgcgtcgcagcccctgtccctgcgccc- agaggc gtgccggccagcggcggggggcgcagtgcacacgagggggctggacttcgcctgtgatatctacatctgggcgc- ccctgg ccgggacttgtggggtccttctcctgtcactggttatcaccctttactgcaacaaacggggcagaaagaagctc- ctgtat atattcaaacaaccatttatgagaccagtacaaactactcaagaggaagatggctgtagctgccgatttccaga- agaaga agaaggaggatgtgaactgagagtgaagttgagcaggagcgcagagccccccgcgtaccagcagggccagaacc- agctct ataacgagctcaatctaggacgaagagaggagtacgatgttttggacaagagacgtggccgggaccctgagatg- ggggga aagccgagaaggaagaaccctcaggaaggcctgtacaatgaactgcagaaagataagatggcggaggcctacag- tgagat tgggatgaaaggcgagcgccggaggggcaaggggcacgatggcctttaccagggtctcagtacagccaccaagg- acacct acgacgcccttcacatgcaggccctgccccctcgctaacagccactcgaggatccggattagtccaatttgtta- aagaca ggatatcagtggtccaggctctagttttgactcaacaatatcaccagctgaagcctatagagtacgagccatag- ataaaa taaaagattttatttagtctccagaaaaaggggggaatgaaagaccccacctgtaggtttggcaagctagctta- agtaac gccattttgcaaggcatggaaaaatacataactgagaatagagaagttcagatcaaggtcaggaacagatggaa- cagctg aatatgggccaaacaggatatctgtggtaagcagttcctgccccggctcagggccaagaacagatggaacagct- gaatat gggccaaacaggatatctgtggtaagcagttcctgccccggctcagggccaagaacagatggtccccagatgcg- gtccag ccctgagcagtttctagagaaccatcagatgtttccagggtgccccaaggacctgaaatgaccctgtgccttat- ttgaac taaccaatcagttcgcttctcgcttctgttcgcgcgcttctgctccccgagctcaataaaagagcccacaaccc- ctcact cggggcgccagtcctccgattgactgagtcgcccgggtacccgtgtatccaataaaccctcttgcagttgcatc- cgactt gtggtctcgctgttccttgggagggtctcctctgagtgattgactacccgtcagcgggggtctttcacacatgc- agcatg tatcaaaattaatttggttttttttcttaagtatttacattaaatggccatagtacttaaagttacattggctt- ccttga aataaacatggagtattcagaatgtgtcataaatatttctaattttaagatagtatctccattggctttctact- ttttct tttatttttttttgtcctctgtcttccatttgttgttgttgttgtttgtttgtttgtttgttggttggttggtt- aatttt tttttaaagatcctacactatagttcaagctagactattagctactctgtaacccagggtgaccttgaagtcat- gggtag cctgctgttttagccttcccacatctaagattacaggtatgagctatcatttttggtatattgattgattgatt- gattga tgtgtgtgtgtgtgattgtgtttgtgtgtgtgactgtgaaaatgtgtgtatgggtgtgtgtgaatgtgtgtatg- tatgtg tgtgtgtgagtgtgtgtgtgtgtgtgtgcatgtgtgtgtgtgtgactgtgtctatgtgtatgactgtgtgtgtg- tgtgtg tgtgtgtgtgtgtgtgtgtgtgtgtgtgtgttgtgaaaaaatattctatggtagtgagagccaacgctccggct- caggtg tcaggttggtttttgagacagagtctttcacttagcttggAATTCACTGGCCGTCGTTTTACAACGTCGTGACT- GGGAAA ACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCC- CGCACC GATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCT- GTGCGG TATTTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACC- CGCCAA CACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGG- AGCTGC ATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGATGACGAAAGGGCCTCGTGATACGCCTATTTTTAT- AGGTTA ATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGT- TTATTT TTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAG- GAAGAG TATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACC- CAGAAA CGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGC- GGTAAG ATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGT- ATTATC CCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCAC- CAGTCA CAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACT- GCGGCC AACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAAC- TCGCCT TGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGG- CAACAA CGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCG- GATAAA GTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCG- TGGGTC TCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTC- AGGCAA CTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAA- GTTTAC TCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAA- TCTCAT GACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTT- GAGATC CTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGAT- CAAGAG CTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCC- GTAGTT AGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTG- CCAGTG GCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACG- GGGGGT TCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAG- CGCCAC GCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGC- TTCCAG GGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGC- TCGTCA GGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGC- TCACAT GTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCC- GCAGCC GAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCG- CGTTGG CCGATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGT- GAGTTA GCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGAT- AACAAT TTCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTTTGCTCTTAGGAGTTTCCTAATACATCCCAAACT- CAAATA TATAAAGCATTTGACTTGTTCTATGCCCTAGGGGGCGGGGGGAAGCTAAGCCAGCTTTTTTTAACATTTAAAAT- GTTAAT TCCATTTTAAATGCACAGATGTTTTTATTTCATAAGGGTTTCAATGTGCATGAATGCTGCAATATTCCTGTTAC- CAAAGC TAGTATAAATAAAAATAGATAAACGTGGAAATTACTTAGAGTTTCTGTCATTAACGTTTCCTTCCTCAGTTGAC- AACATA AATGCGCTGCTGAGCAAGCCAGTTTGCATCTGTCAGGATCAATTTCCCATTATGCCAGTCATATTAATTACTAG- TCAATT AGTTGATTTTTATTTTTGACATATACATGTGAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGCTTAAGTA- ACGCCA TTTTGCAAGGCATGGAAAAATACATAACTGAGAATAGAAAAGTTCAGATCAAGGTCAGGAACAGATGGAACAGC- TGAATA TGGGCCAAACAGGATATCTGTGGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGAACAGCTGAAT- ATGGGC CAAACAGGATATCTGTGGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGTCCCCAGATGCGGTCC- AGCCCT CAGCAGTTTCTAGAGAACCATCAGATGTTTCCAGGGTGCCCCAAGGACCTGAAATGACCCTGTGCCTTATTTGA- ACTAAC CAATCAGTTCGCTTCTCGCTTCTGTTCGCGCGCTTATGCTCCCCGAGCTCAATAAAAGAGCCCACAACCCCTCA- CTCGGG
GCGCCAGTCCTCCGATTGACTGAGTCGCCCGGGTACCCGTGTATCCAATAAACCCTCTTGCAGTTGCATCCGAC- TTGTGG TCTCGCTGTTCCTTGGGAGGGTCTCCTCTGAGTGATTGACTACCCGTCAGCGGGGGTCTTTCATTTGGGGGCTC- GTCCGG GATCGGGAGACCCCTGCCCAGGGACCACCGACCCACCACCGGGAGGTAAGCTGGCCAGCAACTTATCTGTGTCT- GTCCGA TTGTCTAGTGTCTATGACTGATTTTATGCGCCTGCGTCGGTACTAGTTAGCTAACTAGCTCTGTATCTGGCGGA- CCCGTG GTGGAACTGACGAGTTCGGAACACCCGGCCGCAACCCTGGGAGACGTCCCAGGGACTTCGGGGGCCGTTTTTGT- GGCCCG ACCTGAGTCCTAAAATCCCGATCGTTTAGGACTCTTTGGTGCACCCCCCTTAGAGGAGGGATATGTGGTTCTGG- TAGGAG ACGAGAACCTAAAACAGTTCCCGCCTCCGTCTGAATTTTTGCTTTCGGTTTGGGACCGAAGCCGCGCCGCGCGT- CTTGTC TGCTGCAGCATCGTTCTGTGTTGTCTCTGTCTGACTGTGTTTCTGTATTTGTCTGAAAATATGGGCCCGGGCTA- GACTGT TACCACTCCCTTAAGTTTGACCTTAGGTCACTGGAAAGATGTCGAGCGGATCGCTCACAACCAGTCGGTAGATG- TCAAGA AGAGACGTTGGGTTACCTTCTGCTCTGCAGAATGGCCAACCTTTAACGTCGGATGGCCGCGAGACGGCACCTTT- AACCGA GACCTCATCACCCAGGTTAAGATCAAGGTCTTTTCACCTGGCCCGCATGGACACCCAGACCAGGTCCCCTACAT- CGTGAC CTGGGAAGCCTTGGCTTTTGACCCCCCTCCCTGGGTCAAGCCCTTTGTACACCCTAAGCCTCCGCCTCCTCTTC- CTCCAT CCGCCCCGTCTCTCCCCCTTGAACCTCCTCGTTCGACCCCGCCTCGATCCTCCCTTTATCCAGCCCTCACT
[0221] In one specific non-limiting example, the isolated nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:230 provided below:
TABLE-US-00063 [SEQ ID NO: 230] GGCCCTCTAGGCGCCCCCATATGGCCATATGAGATCTTATATGGGGCACCCCCGCCCCTTGTAAACTTCCCTGA- CCCTGA CATGACAAGAGTTACTAACAGCCCCTCTCTCCAAGCTCACTTACAGGCTCTCTACTTAGTCCAGCACGAAGTCT- GGAGAC CTCTGGCGGCAGCCTACCAAGAACAACTGGACCGACCGGTGCCGCCACCATGGAAACCGACACCCTGCTGCTGT- GGGTGC TGCTGCTGTGGGTGCCAGGATCCACAGGAtcctatgagctgactcagccaccctcagcgtctgggacccccggg- cagagg gtcaccatgtcttgttctggaaccagctccaacatcggaagtcactctgtaaactggtaccagcagctcccagg- aacggc ccccaaactcctcatctatactaataatcagcggccctcaggggtccctgaccgattctctggctccaagtctg- gcacct cagcctccctggccatcagtggcctccagtctgaggatgaggctgattattactgtgcagcatgggatggcagc- ctgaat ggtctggtattcggcggagggaccaagctgaccgtcctaggttctagaggtggtggtggtagcggcggcggcgg- ctctgg tggtggtggatccctcgagatggccgaggtgcagctggtgcagtctggagcagaggtgaaaaagcccggggagt- ctctga agatctcctgtaagggttctggatacagctttaccagctactggatcggctgggtgcgccagatgcccgggaaa- ggcctg gagtggatggggatcatctatcctggtgactctgataccagatacagcccgtccttccaaggccacgtcaccat- ctcagc tgacaagtccatcagcactgcctacctgcagtggagcagcctgaaggcctcggacaccgccatgtattactgtg- cgcgct actctggttctttcgataactggggtcaaggtactctggtgaccgtctcctcagcggccgcacccaccacgacg- ccagcg ccgcgaccaccaaccccggcgcccacgatcgcgtcgcagcccctgtccctgcgcccagaggcgtgccggccagc- ggcggg gggcgcagtgcacacgagggggctggacttcgcctgtgatatctacatctgggcgcccctggccgggacttgtg- gggtcc ttctcctgtcactggttatcaccctttactgcaacaaacggggcagaaagaagctcctgtatatattcaaacaa- ccattt atgagaccagtacaaactactcaagaggaagatggctgtagctgccgatttccagaagaagaagaaggaggatg- tgaact gagagtgaagttcaggaggagcgcagagccccccgcgtaccaggagggccagaaccagctctataacgagctca- atctag gacgaagagaggagtacgatgttttggacaagagacgtggccgggaccctgagatggggggaaagccgagaagg- aagaac cctcaggaaggcctgtacaatgaactgcagaaagataagatggcggaggcctacagtgagattgggatgaaagg- cgagcg ccggaggggcaaggggcacgatggcctttaccagggtctcagtacagccaccaaggacacctacgacgcccttc- acatgc aggccctgccccctcgctaacagccactcgaggatccggattagtccaatttgttaaagacaggatatcagtgg- tccagg ctctagttttgactcaacaatatcaccagctgaagcctatagagtacgagccatagataaaataaaagatttta- tttagt ctccagaaaaaggggggaatgaaagaccccacctgtaggtttggcaagctagcttaagtaacgccattttgcaa- ggcatg gaaaaatacataactgagaatagagaagttcagatcaaggtcaggaacagatggaacagctgaatatgggccaa- acagga tatctgtggtaagcagttcctgccccggctcagggccaagaacagatggaacagctgaatatgggccaaacagg- atatct gtggtaaggagttcctgccccggctcagggccaagaacagatggtccccagatgcggtccagccctcaggagtt- tctaga gaaccatcagatgtttccagggtgccccaaggacctgaaatgaccctgtgccttatttgaactaaccaatcagt- tcgctt ctcgcttctgttcgcgcgcttctgctccccgagctcaataaaagagcccacaacccctcactcggggcgccagt- cctccg attgactgagtcgcccgggtacccgtgtatccaataaaccctcttgcagttgcatccgacttgtggtctcgctg- ttcctt gggagggtctcctctgagtgattgactacccgtcagcgggggtctttcacacatgcagcatgtatcaaaattaa- tttggt tttttttcttaagtatttacattaaatggccatagtacttaaagttacattggcttccttgaaataaacatgga- gtattc agaatgtgtcataaatatttctaattttaagatagtatctccattggctttctactttttcttttatttttttt- tgtcct ctgtcttccatttgttgttgttgttgtttgtttgtttgtttgttggttggttggttaatttttttttaaagatc- ctacac tatagttcaagctagactattagctactctgtaacccagggtgaccttgaagtcatgggtagcctgctgtttta- gccttc ccacatctaagattacaggtatgagctatcatttttggtatattgattgattgattgattgatgtgtgtgtgtg- tgattg tgtttgtgtgtgtgactgtgaaaatgtgtgtatgggtgtgtgtgaatgtgtgtatgtatgtgtgtgtgtgagtg- tgtgtg tgtgtgtgtgcatgtgtgtgtgtgtgactgtgtctatgtgtatgactgtgtgtgtgtgtgtgtgtgtgtgtgtg- tgtgtg tgtgtgtgtgtgttgtgaaaaaatattctatggtagtgagagccaacgctccggctcaggtgtcaggttggttt- ttgaga cagagtctttcacttagcttggAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTA- CCCAAC TTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCC- CAACAG TTGCGCAGCCTGAATGGCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCG- CATATG GTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACG- CGCCCT GACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGG- TTTTCA CCGTCATCACCGAAACGCGCGATGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAA- TAATGG TTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACAT- TCAAAT ATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCA- ACATTT CCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAG- TAAAAG ATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGT- TTTCGC CCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGC- CGGGCA AGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATC- TTACGG ATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTG- ACAACG ATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGA- ACCGGA GCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAAC- TATTAA CTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCA- CTTCTG CGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCAT- TGCAGC ACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAAC- GAAATA GACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTT- TAGATT GATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCC- TTAACG TGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGC- GCGTAA TCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTT- TTTCCG AAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTT- CAAGAA CTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGT- GTCTTA CCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAG- CCCAGC TTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGG- GAGAAA GGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCT- GGTATC TTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGC- CTATGG AAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGC- GTTATC CCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGC- GCAGCG AGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAA- TGCAGC TGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCACTCATTA- GGCACC CCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAA- ACAGCT ATGACCATGATTACGCCAAGCTTTGCTCTTAGGAGTTTCCTAATACATCCCAAACTCAAATATATAAAGCATTT- GACTTG TTCTATGCCCTAGGGGGCGGGGGGAAGCTAAGCCAGCTTTTTTTAACATTTAAAATGTTAATTCCATTTTAAAT- GCACAG ATGTTTTTATTTCATAAGGGTTTCAATGTGCATGAATGCTGCAATATTCCTGTTACCAAAGCTAGTATAAATAA- AAATAG ATAAACGTGGAAATTACTTAGAGTTTCTGTCATTAACGTTTCCTTCCTCAGTTGACAACATAAATGCGCTGCTG- AGCAAG CCAGTTTGCATCTGTCAGGATCAATTTCCCATTATGCCAGTCATATTAATTACTAGTCAATTAGTTGATTTTTA- TTTTTG ACATATACATGTGAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGCTTAAGTAACGCCATTTTGCAAGGCA- TGGAAA AATACATAACTGAGAATAGAAAAGTTCAGATCAAGGTCAGGAACAGATGGAACAGCTGAATATGGGCCAAACAG- GATATC TGTGGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGAACAGCTGAATATGGGCCAAACAGGATAT- CTGTGG TAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGTCCCCAGATGCGGTCCAGCCCTCAGCAGTTTCTA- GAGAAC CATCAGATGTTTCCAGGGTGCCCCAAGGACCTGAAATGACCCTGTGCCTTATTTGAACTAACCAATCAGTTCGC- TTCTCG CTTCTGTTCGCGCGCTTATGCTCCCCGAGCTCAATAAAAGAGCCCACAACCCCTCACTCGGGGCGCCAGTCCTC- CGATTG
ACTGAGTCGCCCGGGTACCCGTGTATCCAATAAACCCTCTTGCAGTTGCATCCGACTTGTGGTCTCGCTGTTCC- TTGGGA GGGTCTCCTCTGAGTGATTGACTACCCGTCAGCGGGGGTCTTTCATTTGGGGGCTCGTCCGGGATCGGGAGACC- CCTGCC CAGGGACCACCGACCCACCACCGGGAGGTAAGCTGGCCAGCAACTTATCTGTGTCTGTCCGATTGTCTAGTGTC- TATGAC TGATTTTATGCGCCTGCGTCGGTACTAGTTAGCTAACTAGCTCTGTATCTGGCGGACCCGTGGTGGAACTGACG- AGTTCG GAACACCCGGCCGCAACCCTGGGAGACGTCCCAGGGACTTCGGGGGCCGTTTTTGTGGCCCGACCTGAGTCCTA- AAATCC CGATCGTTTAGGACTCTTTGGTGCACCCCCCTTAGAGGAGGGATATGTGGTTCTGGTAGGAGACGAGAACCTAA- AACAGT TCCCGCCTCCGTCTGAATTTTTGCTTTCGGTTTGGGACCGAAGCCGCGCCGCGCGTCTTGTCTGCTGCAGCATC- GTTCTG TGTTGTCTCTGTCTGACTGTGTTTCTGTATTTGTCTGAAAATATGGGCCCGGGCTAGACTGTTACCACTCCCTT- AAGTTT GACCTTAGGTCACTGGAAAGATGTCGAGCGGATCGCTCACAACCAGTCGGTAGATGTCAAGAAGAGACGTTGGG- TTACCT TCTGCTCTGCAGAATGGCCAACCTTTAACGTCGGATGGCCGCGAGACGGCACCTTTAACCGAGACCTCATCACC- CAGGTT AAGATCAAGGTCTTTTCACCTGGCCCGCATGGACACCCAGACCAGGTCCCCTACATCGTGACCTGGGAAGCCTT- GGCTTT TGACCCCCCTCCCTGGGTCAAGCCCTTTGTACACCCTAAGCCTCCGCCTCCTCTTCCTCCATCCGCCCCGTCTC- TCCCCC TTGAACCTCCTCGTTCGACCCCGCCTCGATCCTCCCTTTATCCAGCCCTCACT
[0222] In one specific non-limiting example, the isolated nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:231 provided below:
TABLE-US-00064 [SEQ ID NO: 231] CCTTCTCTAGGCGCCCCCATATGGCCATATGAGATCTTATATGGGGCACCCCCGCCCCTTGTAAACTTCCCTGA- CCCTGA CATGACAAGAGTTACTAACAGCCCCTCTCTCCAAGCTCACTTACAGGCTCTCTACTTAGTCCAGCACGAAGTCT- GGAGAC CTCTGGCGGCAGCCTACCAAGAACAACTGGACCGACCGGTGCCGCCACCATGGAAACCGACACCCTGCTGCTGT- GGGTGC TGCTGCTGTGGGTGCCAGGATCCACAGGACAGGCTGTGCTGACTCAGCCACCCTCAGCGTCTGGGACCCCCGGG- CAGAGG GTCACCATCTCTTGTTCTGGAAGCAGCTCCAACATCGGAAGTAATTACGTATTCTGGTACCAGCAGCTCCCAGG- AACGGC CCCCAAACTCCTCATCTATAGTAATAATCAGCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTCCAAGTCTG- GCACCT CAGCCTCCCTGGCCATCAGTGGGCTCCGGTCCGAGGATGAGGCTGATTATTACTGTGCAGCATGGGATGACAGC- CTGAGT GCCTCTTATGTTTTCGGAACTGGGACCAAGGTCACCGTCCTAGGTTCTAGAGGTGGTGGTGGTAGCGGCGGCGG- CGGCTC TGGTGGTGGTGGATCCCTCGAGATGGCCCAGGTGCAGCTGGTGCAGTCTGGGGCTGAGGTGAAGAAGCCTGGGT- CCTCGG TGAAGGTCTCCTGCAAGGCTTCTGGAGGCACCTTCAGCAGCTATGCTATCAGCTGGGTGCGACAGGCCCCTGGA- CAAGGG CTTGAGTGGATGGGAAGGATCATCCCTATCCTTGGTACAGCAAACTACGCACAGAAGTTCCAGGGCAGAGTCAC- GATTAC CGCGGACGAATCCACGAGCACAGCCTACATGGAGCTGAGCAGCCTGAGATCTGAGGACACGGCCGTGTATTACT- GTGCGC GCTCTGGTTACGGTTCTTACCGTTGGGAAGATTCTTGGGGTCAAGGTACTCTGGTGACCGTCTCCTCAGCggcc- gcaccc accacgacgccagcgccgcgaccaccaaccccggcgcccacgatcgcgtcgcagcccctgtccctgcgcccaga- ggcgtg ccggccagcggcggggggcgcagtgcacacgagggggctggacttcgcctgtgatatctacatctgggcgcccc- tggccg ggacttgtggggtccttctcctgtcactggttatcaccctttactgcaacaaacggggcagaaagaagctcctg- tatata ttcaaacaaccatttatgagaccagtacaaactactcaagaggaagatggctgtagctgccgatttccagaaga- agaaga aggaggatgtgaactgagagtgaagttcagcaggagcgcagagccccccgcgtaccagcagggccagaaccagc- tctata acgagctcaatctaggacgaagagaggagtacgatgttttggacaagagacgtggccgggaccctgagatgggg- ggaaag ccgagaaggaagaaccctcaggaaggcctgtacaatgaactgcagaaagataagatggcggaggcctacagtga- gattgg gatgaaaggcgagcgccggaggggcaaggggcacgatggcctttaccagggtctcagtacagccaccaaggaca- cctacg acgcccttcacatgcaggccctgccccctcgctaacagccactcgaggatccggattagtccaatttgttaaag- acagga tatcagtggtccaggctctagttttgactcaacaatatcaccagctgaagcctatagagtacgagccatagata- aaataa aagattttatttagtctccagaaaaaggggggaatgaaagaccccacctgtaggtttggcaagctagcttaagt- aacgcc attttgcaaggcatggaaaaatacataactgagaatagagaagttcagatcaaggtcaggaacagatggaacag- ctgaat atgggccaaacaggatatctgtggtaagcagttcctgccccggctcagggccaagaacagatggaacagctgaa- tatggg ccaaacaggatatctgtggtaagcagttcctgccccggctcagggccaagaacagatggtccccagatgcggtc- cagccc tgagcagtttctagagaaccatcagatgtttccagggtgccccaaggacctgaaatgaccctgtgccttatttg- aactaa ccaatcagttcgcttctcgcttctgttcgcgcgcttctgctccccgagctcaataaaagagcccacaacccctc- actcgg ggcgccagtcctccgattgactgagtcgcccgggtacccgtgtatccaataaaccctcttgcagttgcatccga- cttgtg gtctcgctgttccttgggagggtctcctctgagtgattgactacccgtcagcgggggtctttcacacatgcagc- atgtat caaaattaatttggttttttttcttaagtatttacattaaatggccatagtacttaaagttacattggcttcct- tgaaat aaacatggagtattcagaatgtgtcataaatatttctaattttaagatagtatctccattggctttctactttt- tctttt atttttttttgtcctctgtcttccatttgttgttgttgttgtttgtttgtttgtttgttggttggttggttaat- tttttt ttaaagatcctacactatagttcaagctagactattagctactctgtaacccagggtgaccttgaagtcatggg- tagcct gctgttttagccttcccacatctaagattacaggtatgagctatcatttttggtatattgattgattgattgat- tgatgt gtgtgtgtgtgattgtgtttgtgtgtgtgactgtgaaaatgtgtgtatgggtgtgtgtgaatgtgtgtatgtat- gtgtgt gtgtgagtgtgtgtgtgtgtgtgtgcatgtgtgtgtgtgtgactgtgtctatgtgtatgactgtgtgtgtgtgt- gtgtgt gtgtgtgtgtgtgtgtgtgtgtgtgtgttgtgaaaaaatattctatggtagtgagagccaacgctccggctcag- gtgtca ggttggtttttgagacagagtctttcacttagcttggAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGG- AAAACC CTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGC- ACCGAT CGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTG- CGGTAT TTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGC- CAACAC CCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGC- TGCATG TGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGATGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGG- TTAATG TCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTA- TTTTTC TAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAA- GAGTAT GAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAG- AAACGC TGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGT- AAGATC CTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATT- ATCCCG TATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAG- TCACAG AAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCG- GCCAAC TTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCG- CCTTGA TCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAA- CAACGT TGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGAT- AAAGTT GCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGG- GTCTCG CGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGG- CAACTA TGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTT- TACTCA TATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCT- CATGAC CAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAG- ATCCTT TTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAA- GAGCTA CCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTA- GTTAGG CCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCA- GTGGCG ATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGG- GGTTCG TGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGC- CACGCT TCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTC- CAGGGG GAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCG- TCAGGG GGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCA- CATGTT CTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCA- GCCGAA CGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGT- TGGCCG ATTCATTAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAG- TTAGCT CACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAAC- AATTTC ACACAGGAAACAGCTATGACCATGATTACGCCAAGCTTTGCTCTTAGGAGTTTCCTAATACATCCCAAACTCAA- ATATAT AAAGCATTTGACTTGTTCTATGCCCTAGGGGGCGGGGGGAAGCTAAGCCAGCTTTTTTTAACATTTAAAATGTT- AATTCC ATTTTAAATGCACAGATGTTTTTATTTCATAAGGGTTTCAATGTGCATGAATGCTGCAATATTCCTGTTACCAA- AGCTAG TATAAATAAAAATAGATAAACGTGGAAATTACTTAGAGTTTCTGTCATTAACGTTTCCTTCCTCAGTTGACAAC- ATAAAT GCGCTGCTGAGCAAGCCAGTTTGCATCTGTCAGGATCAATTTCCCATTATGCCAGTCATATTAATTACTAGTCA- ATTAGT TGATTTTTATTTTTGACATATACATGTGAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGCTTAAGTAACG- CCATTT TGCAAGGCATGGAAAAATACATAACTGAGAATAGAAAAGTTCAGATCAAGGTCAGGAACAGATGGAACAGCTGA- ATATGG GCCAAACAGGATATCTGTGGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGAACAGCTGAATATG- GGCCAA ACAGGATATCTGTGGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGTCCCCAGATGCGGTCCAGC- CCTCAG CAGTTTCTAGAGAACCATCAGATGTTTCCAGGGTGCCCCAAGGACCTGAAATGACCCTGTGCCTTATTTGAACT- AACCAA TCAGTTCGCTTCTCGCTTCTGTTCGCGCGCTTATGCTCCCCGAGCTCAATAAAAGAGCCCACAACCCCTCACTC- GGGGCG
CCAGTCCTCCGATTGACTGAGTCGCCCGGGTACCCGTGTATCCAATAAACCCTCTTGCAGTTGCATCCGACTTG- TGGTCT CGCTGTTCCTTGGGAGGGTCTCCTCTGAGTGATTGACTACCCGTCAGCGGGGGTCTTTCATTTGGGGGCTCGTC- CGGGAT CGGGAGACCCCTGCCCAGGGACCACCGACCCACCACCGGGAGGTAAGCTGGCCAGCAACTTATCTGTGTCTGTC- CGATTG TCTAGTGTCTATGACTGATTTTATGCGCCTGCGTCGGTACTAGTTAGCTAACTAGCTCTGTATCTGGCGGACCC- GTGGTG GAACTGACGAGTTCGGAACACCCGGCCGCAACCCTGGGAGACGTCCCAGGGACTTCGGGGGCCGTTTTTGTGGC- CCGACC TGAGTCCTAAAATCCCGATCGTTTAGGACTCTTTGGTGCACCCCCCTTAGAGGAGGGATATGTGGTTCTGGTAG- GAGACG AGAACCTAAAACAGTTCCCGCCTCCGTCTGAATTTTTGCTTTCGGTTTGGGACCGAAGCCGCGCCGCGCGTCTT- GTCTGC TGCAGCATCGTTCTGTGTTGTCTCTGTCTGACTGTGTTTCTGTATTTGTCTGAAAATATGGGCCCGGGCTAGAC- TGTTAC CACTCCCTTAAGTTTGACCTTAGGTCACTGGAAAGATGTCGAGCGGATCGCTCACAACCAGTCGGTAGATGTCA- AGAAGA GACGTTGGGTTACCTTCTGCTCTGCAGAATGGCCAACCTTTAACGTCGGATGGCCGCGAGACGGCACCTTTAAC- CGAGAC CTCATCACCCAGGTTAAGATCAAGGTCTTTTCACCTGGCCCGCATGGACACCCAGACCAGGTCCCCTACATCGT- GACCTG GGAAGCCTTGGCTTTTGACCCCCCTCCCTGGGTCAAGCCCTTTGTACACCCTAAGCCTCCGCCTCCTCTTCCTC- CATCCG CCCCGTCTCTCCCCCTTGAACCTCCTCGTTCGACCCCGCCTCGATCCTCCCTTTATCCAGCCCTCACT
[0223] In one specific non-limiting example, the isolated nucleic acid molecule comprises nucleic acids having the sequence set forth in SEO ID NO:232 provided below:
TABLE-US-00065 [SEQ ID NO: 232] CCTTCTCTAGGCGCCCCCATATGGCCATATGAGATCTTATATGGGGCACCCCCGCCCCTTGTAAACTTCCCTGA- CCCTGA CATGACAAGAGTTACTAACAGCCCCTCTCTCCAAGCTCACTTACAGGCTCTCTACTTAGTCCAGCACGAAGTCT- GGAGAC CTCTGGCGGCAGCCTACCAAGAACAACTGGACCGACCGGTGCCGCCACCATGGAAACCGACACCCTGCTGCTGT- GGGTGC TGCTGCTGTGGGTGCCAGGATCCACAGGAcagtctgtgctgacgcagccgccctcagtgtctggggccccaggg- cagagg gtcaccatctcctgcactgggaggagctccaacatcggggcaggttttgatgtacactggtaccaggagcttcc- aggaac agcccccaaactcctcatctatggtaacagcaatcggccctcaggggtccctgaccgattctctggctccaagt- ctggca cctcagcctccctggccatcactgggctccaggctgaggatgaggctgattattactgccagtcctatgacagc- agcctg agtggttatgtcttcggaactgggaccaaggtcaccgtcctaggttctagaggtggtggtggtagcggcggcgg- cggctc tggtggtggtggatccctcgagatggcccaggtccagctggtacagtctggggctgaggtgaagaagcctgggg- cctcag tgaaggtctcctgcaaggcttctggatacaccttcaccgactactatatgcactgggtgcgacaggcccctgga- caacgg cttgagtggatgggatggatcaaccctaacagtggtggcacaaactatgcacagaagtttcaggacaggatcac- cgtgac cagggacacctccagcaacacaggctacatggagctgaccaggctgagatctgacgacacggccgtgtattact- gtgcgc gctctccgtactctggtgttctggataaatggggtcaaggtactctggtgaccgtctcctcagcggccgcaccc- accacg acgccagcgccgcgaccaccaaccccggcgcccacgatcgcgtcgcagcccctgtccctgcgcccagaggcgtg- ccggcc agcggcggggggcgcagtgcacacgagggggctggacttcgcctgtgatatctacatctgggcgcccctggccg- ggactt gtggggtccttctcctgtcactggttatcaccctttactgcaacaaacggggcagaaagaagctcctgtatata- ttcaaa caaccatttatgagaccagtacaaactactcaagaggaagatggctgtagctgccgatttccagaagaagaaga- aggagg atgtgaactgagagtgaagttcaggaggagcgcagagccccccgcgtaccaggagggccagaaccagctctata- acgagc tcaatctaggacgaagagaggagtacgatgttttggacaagagacgtggccgggaccctgagatggggggaaag- ccgaga aggaagaaccctcaggaaggcctgtacaatgaactgcagaaagataagatggcggaggcctacagtgagattgg- gatgaa aggcgagcgccggaggggcaaggggcacgatggcctttaccagggtctcagtacagccaccaaggacacctacg- acgccc ttcacatgcaggccctgccccctcgctaacagccactcgaggatccggattagtccaatttgttaaagacagga- tatcag tggtccaggctctagttttgactcaacaatatcaccagctgaagcctatagagtacgagccatagataaaataa- aagatt ttatttagtctccagaaaaaggggggaatgaaagaccccacctgtaggtttggcaagctagcttaagtaacgcc- attttg caaggcatggaaaaatacataactgagaatagagaagttcagatcaaggtcaggaacagatggaacagctgaat- atgggc caaacaggatatctgtggtaagcagttcctgccccggctcagggccaagaacagatggaacagctgaatatggg- ccaaac aggatatctgtggtaagcagttcctgccccggctcagggccaagaacagatggtccccagatgcggtccagccc- tcagca gtttctagagaaccatcagatgtttccagggtgccccaaggacctgaaatgaccctgtgccttatttgaactaa- ccaatc agttcgcttctcgcttctgttcgcgcgcttctgctccccgagctcaataaaagagcccacaacccctcactcgg- ggcgcc agtcctccgattgactgagtcgcccgggtacccgtgtatccaataaaccctcttgcagttgcatccgacttgtg- gtctcg ctgttccttgggagggtctcctctgagtgattgactacccgtcagcgggggtctttcacacatgcagcatgtat- caaaat taatttggttttttttcttaagtatttacattaaatggccatagtacttaaagttacattggcttccttgaaat- aaacat ggagtattcagaatgtgtcataaatatttctaattttaagatagtatctccattggctttctactttttctttt- attttt ttttgtcctctgtcttccatttgttgttgttgttgtttgtttgtttgtttgttggttggttggttaattttttt- ttaaag atcctacactatagttcaagctagactattagctactctgtaacccagggtgaccttgaagtcatgggtagcct- gctgtt ttagccttcccacatctaagattacaggtatgagctatcatttttggtatattgattgattgattgattgatgt- gtgtgt gtgtgattgtgtttgtgtgtgtgactgtgaaaatgtgtgtatgggtgtgtgtgaatgtgtgtatgtatgtgtgt- gtgtga gtgtgtgtgtgtgtgtgtgcatgtgtgtgtgtgtgactgtgtctatgtgtatgactgtgtgtgtgtgtgtgtgt- gtgtgt gtgtgtgtgtgtgtgtgtgtgttgtgaaaaaatattctatggtagtgagagccaacgctccggctcaggtgtca- ggttgg tttttgagacagagtctttcacttagcttggAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACC- CTGGCG TTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGAT- CGCCCT TCCCAACAGTTGCGCAGCCTGAATGGCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTAT- TTCACA CCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACAC- CCGCTG ACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATG- TGTCAG AGGTTTTCACCGTCATCACCGAAACGCGCGATGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATG- TCATGA TAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTC- TAAATA CATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTAT- GAGTAT TCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGC- TGGTGA AAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATC- CTTGAG AGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCG- TATTGA CGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAG- AAAAGC ATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAAC- TTACTT CTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGA- TCGTTG GGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGT- TGCGCA AACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTT- GCAGGA CCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCG- CGGTAT CATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTA- TGGATG AACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCA- TATATA CTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGAC- CAAAAT CCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTT- TTTTTC TGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTA- CCAACT CTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGG- CCACCA CTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCG- ATAAGT CGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCG- TGCACA CAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCT- TCCCGA AGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGG- GAAACG CCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGG- GGGCGG AGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTT- CTTTCC TGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAA- CGACCG AGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCG- ATTCAT TAATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCT- CACTCA TTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTC- ACACAG GAAACAGCTATGACCATGATTACGCCAAGCTTTGCTCTTAGGAGTTTCCTAATACATCCCAAACTCAAATATAT- AAAGCA TTTGACTTGTTCTATGCCCTAGGGGGCGGGGGGAAGCTAAGCCAGCTTTTTTTAACATTTAAAATGTTAATTCC- ATTTTA AATGCACAGATGTTTTTATTTCATAAGGGTTTCAATGTGCATGAATGCTGCAATATTCCTGTTACCAAAGCTAG- TATAAA TAAAAATAGATAAACGTGGAAATTACTTAGAGTTTCTGTCATTAACGTTTCCTTCCTCAGTTGACAACATAAAT- GCGCTG CTGAGCAAGCCAGTTTGCATCTGTCAGGATCAATTTCCCATTATGCCAGTCATATTAATTACTAGTCAATTAGT- TGATTT TTATTTTTGACATATACATGTGAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGCTTAAGTAACGCCATTT- TGCAAG GCATGGAAAAATACATAACTGAGAATAGAAAAGTTCAGATCAAGGTCAGGAACAGATGGAACAGCTGAATATGG- GCCAAA CAGGATATCTGTGGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGAACAGCTGAATATGGGCCAA- ACAGGA TATCTGTGGTAAGCAGTTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGTCCCCAGATGCGGTCCAGCCCTCAG- CAGTTT CTAGAGAACCATCAGATGTTTCCAGGGTGCCCCAAGGACCTGAAATGACCCTGTGCCTTATTTGAACTAACCAA- TCAGTT CGCTTCTCGCTTCTGTTCGCGCGCTTATGCTCCCCGAGCTCAATAAAAGAGCCCACAACCCCTCACTCGGGGCG- CCAGTC
CTCCGATTGACTGAGTCGCCCGGGTACCCGTGTATCCAATAAACCCTCTTGCAGTTGCATCCGACTTGTGGTCT- CGCTGT TCCTTGGGAGGGTCTCCTCTGAGTGATTGACTACCCGTCAGCGGGGGTCTTTCATTTGGGGGCTCGTCCGGGAT- CGGGAG ACCCCTGCCCAGGGACCACCGACCCACCACCGGGAGGTAAGCTGGCCAGCAACTTATCTGTGTCTGTCCGATTG- TCTAGT GTCTATGACTGATTTTATGCGCCTGCGTCGGTACTAGTTAGCTAACTAGCTCTGTATCTGGCGGACCCGTGGTG- GAACTG ACGAGTTCGGAACACCCGGCCGCAACCCTGGGAGACGTCCCAGGGACTTCGGGGGCCGTTTTTGTGGCCCGACC- TGAGTC CTAAAATCCCGATCGTTTAGGACTCTTTGGTGCACCCCCCTTAGAGGAGGGATATGTGGTTCTGGTAGGAGACG- AGAACC TAAAACAGTTCCCGCCTCCGTCTGAATTTTTGCTTTCGGTTTGGGACCGAAGCCGCGCCGCGCGTCTTGTCTGC- TGCAGC ATCGTTCTGTGTTGTCTCTGTCTGACTGTGTTTCTGTATTTGTCTGAAAATATGGGCCCGGGCTAGACTGTTAC- CACTCC CTTAAGTTTGACCTTAGGTCACTGGAAAGATGTCGAGCGGATCGCTCACAACCAGTCGGTAGATGTCAAGAAGA- GACGTT GGGTTACCTTCTGCTCTGCAGAATGGCCAACCTTTAACGTCGGATGGCCGCGAGACGGCACCTTTAACCGAGAC- CTCATC ACCCAGGTTAAGATCAAGGTCTTTTCACCTGGCCCGCATGGACACCCAGACCAGGTCCCCTACATCGTGACCTG- GGAAGC CTTGGCTTTTGACCCCCCTCCCTGGGTCAAGCCCTTTGTACACCCTAAGCCTCCGCCTCCTCTTCCTCCATCCG- CCCCGT CTCTCCCCCTTGAACCTCCTCGTTCGACCCCGCCTCGATCCTCCCTTTATCCAGCCCTCACT
[0224] In one specific non-limiting example, the isolated nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:233 provided below:
TABLE-US-00066 [SEQ ID NO: 233] CCTTCTCTAGGCGCCCCCATATGGCCATATGAGATCTTATATGGGGCACCCCCGCCCCTTGTAAACTTCCCTGA- CCCTGA CATGACAAGAGTTACTAACAGCCCCTCTCTCCAAGCTCACTTACAGGCTCTCTACTTAGTCCAGCACGAAGTCT- GGAGAC CTCTGGCGGCAGCCTACCAAGAACAACTGGACCGACCGGTGCCGCCACCATGGAAACCGACACCCTGCTGCTGT- GGGTGC TGCTGCTGTGGGTGCCAGGATCCACAGGAcaatctgccctgactcagcctgcctccgtgtctgcgtctcctgga- cagtcg atcgccatctcctgcactggaaccagcagtgacgttggttggtatcaacagcacccaggcaaagcccccaaact- catgat ttatgaggacagtaagcggccctcaggggtttctaatcgcttctctggctccaagtctggcaacacggcctccc- tgacca tctctgggctccaggctgaggacgaggctgattattactgcagctcaaatacaagaagcagcactttggtgttc- ggcgga gggaccaagctgaccgtcctaggttctagaggtggtggtggtagcggcggcggcggctctggtggtggtggatc- cctcga gatggccgaagtgcagctggtgcagtctggggctgagatgaagaagcctggggcctcactgaagctctcctgca- aggctt ctggatacaccttcatcgactactatgtatactggatgcgacaggcccctggacaagggcttgagtccatggga- tggatc aaccctaacagtggtggcacaaactatgcacagaagtttcagggcagggtcaccatgaccagggacacgtccat- cagcac agcctacatggagctgagcaggctgagatctgacgacaccgccatgtattactgtgcgcgctcccagcgtgacg- gttaca tggattactggggtcaaggtactctggtgaccgtctcctcagcggccgcacccaccacgacgccagcgccgcga- ccacca accccggcgcccacgatcgcgtcgcagcccctgtccctgcgcccagaggcgtgccggccagcggcggggggcgc- agtgca cacgagggggctggacttcgcctgtgatatctacatctgggcgcccctggccgggacttgtggggtccttctcc- tgtcac tggttatcaccctttactgcaacaaacggggcagaaagaagctcctgtatatattcaaacaaccatttatgaga- ccagta caaactactcaagaggaagatggctgtagctgccgatttccagaagaagaagaaggaggatgtgaactgagagt- gaagtt caggaggagcgcagagccccccgcgtaccaggagggccagaaccagctctataacgagctcaatctaggacgaa- gagagg agtacgatgttttggacaagagacgtggccgggaccctgagatggggggaaagccgagaaggaagaaccctcag- gaaggc ctgtacaatgaactgcagaaagataagatggcggaggcctacagtgagattgggatgaaaggcgagcgccggag- gggcaa ggggcacgatggcctttaccagggtctcagtacagccaccaaggacacctacgacgcccttcacatgcaggccc- tgcccc ctcgctaacagccactcgaggatccggattagtccaatttgttaaagacaggatatcagtggtccaggctctag- ttttga ctcaacaatatcaccagctgaagcctatagagtacgagccatagataaaataaaagattttatttagtctccag- aaaaag gggggaatgaaagaccccacctgtaggtttggcaagctagcttaagtaacgccattttgcaaggcatggaaaaa- tacata actgagaatagagaagttcagatcaaggtcaggaacagatggaacagctgaatatgggccaaacaggatatctg- tggtaa gcagttcctgccccggctcagggccaagaacagatggaacagctgaatatgggccaaacaggatatctgtggta- agcagt tcctgccccggctcagggccaagaacagatggtccccagatgcggtccagccctcagcagtttctagagaacca- tcagat gtttccagggtgccccaaggacctgaaatgaccctgtgccttatttgaactaaccaatcagttcgcttctcgct- tctgtt cgcgcgcttctgctccccgagctcaataaaagagcccacaacccctcactcggggcgccagtcctccgattgac- tgagtc gcccgggtacccgtgtatccaataaaccctcttgcagttgcatccgacttgtggtctcgctgttccttgggagg- gtctcc tctgagtgattgactacccgtcagcgggggtctttcacacatgcagcatgtatcaaaattaatttggttttttt- tcttaa gtatttacattaaatggccatagtacttaaagttacattggcttccttgaaataaacatggagtattcagaatg- tgtcat aaatatttctaattttaagatagtatctccattggctttctactttttcttttatttttttttgtcctctgtct- tccatt tgttgttgttgttgtttgtttgtttgtttgttggttggttggttaatttttttttaaagatcctacactatagt- tcaagc tagactattagctactctgtaacccagggtgaccttgaagtcatgggtagcctgctgttttagccttcccacat- ctaaga ttacaggtatgagctatcatttttggtatattgattgattgattgattgatgtgtgtgtgtgtgattgtgtttg- tgtgtg tgactgtgaaaatgtgtgtatgggtgtgtgtgaatgtgtgtatgtatgtgtgtgtgtgagtgtgtgtgtgtgtg- tgtgca tgtgtgtgtgtgtgactgtgtctatgtgtatgactgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtgtg- tgtgtg ttgtgaaaaaatattctatggtagtgagagccaacgctccggctcaggtgtcaggttggtttttgagacagagt- ctttca cttagcttggAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATC- GCCTTG CAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGC- AGCCTG AATGGCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATATGGTGCAC- TCTCAG TACAATCTGCTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGG- CTTGTC TGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCA- TCACCG AAACGCGCGATGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTT- AGACGT CAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTAT- CCGCTC ATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGT- CGCCCT TATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTG- AAGATC AGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAA- GAACGT TTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCA- ACTCGG TCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCA- TGACAG TAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGA- GGACCG AAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAA- TGAAGC CATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCG- AACTAC TTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCG- GCCCTT CCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGG- GCCAGA TGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGA- TCGCTG AGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTA- AAACTT CATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTT- TTCGTT CCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCT- GCTTGC AAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTA- ACTGGC TTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGT- AGCACC GCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGT- TGGACT CAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAG- CGAACG ACCTACACCGAACTGAGATACCTACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGA- CAGGTA TCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATA- GTCCTG TCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAAC- GCCAGC AACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGA- TTCTGT GGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAG- TGAGCG AGGAAGCGGAAGAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCAC- GACAGG TTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGC- TTTACA CTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACC- ATGATT ACGCCAAGCTTTGCTCTTAGGAGTTTCCTAATACATCCCAAACTCAAATATATAAAGCATTTGACTTGTTCTAT- GCCCTA GGGGGCGGGGGGAAGCTAAGCCAGCTTTTTTTAACATTTAAAATGTTAATTCCATTTTAAATGCACAGATGTTT- TTATTT CATAAGGGTTTCAATGTGCATGAATGCTGCAATATTCCTGTTACCAAAGCTAGTATAAATAAAAATAGATAAAC- GTGGAA ATTACTTAGAGTTTCTGTCATTAACGTTTCCTTCCTCAGTTGACAACATAAATGCGCTGCTGAGCAAGCCAGTT- TGCATC TGTCAGGATCAATTTCCCATTATGCCAGTCATATTAATTACTAGTCAATTAGTTGATTTTTATTTTTGACATAT- ACATGT GAATGAAAGACCCCACCTGTAGGTTTGGCAAGCTAGCTTAAGTAACGCCATTTTGCAAGGCATGGAAAAATACA- TAACTG AGAATAGAAAAGTTCAGATCAAGGTCAGGAACAGATGGAACAGCTGAATATGGGCCAAACAGGATATCTGTGGT- AAGCAG TTCCTGCCCCGGCTCAGGGCCAAGAACAGATGGAACAGCTGAATATGGGCCAAACAGGATATCTGTGGTAAGCA- GTTCCT GCCCCGGCTCAGGGCCAAGAACAGATGGTCCCCAGATGCGGTCCAGCCCTCAGCAGTTTCTAGAGAACCATCAG- ATGTTT CCAGGGTGCCCCAAGGACCTGAAATGACCCTGTGCCTTATTTGAACTAACCAATCAGTTCGCTTCTCGCTTCTG- TTCGCG CGCTTATGCTCCCCGAGCTCAATAAAAGAGCCCACAACCCCTCACTCGGGGCGCCAGTCCTCCGATTGACTGAG- TCGCCC
GGGTACCCGTGTATCCAATAAACCCTCTTGCAGTTGCATCCGACTTGTGGTCTCGCTGTTCCTTGGGAGGGTCT- CCTCTG AGTGATTGACTACCCGTCAGCGGGGGTCTTTCATTTGGGGGCTCGTCCGGGATCGGGAGACCCCTGCCCAGGGA- CCACCG ACCCACCACCGGGAGGTAAGCTGGCCAGCAACTTATCTGTGTCTGTCCGATTGTCTAGTGTCTATGACTGATTT- TATGCG CCTGCGTCGGTACTAGTTAGCTAACTAGCTCTGTATCTGGCGGACCCGTGGTGGAACTGACGAGTTCGGAACAC- CCGGCC GCAACCCTGGGAGACGTCCCAGGGACTTCGGGGGCCGTTTTTGTGGCCCGACCTGAGTCCTAAAATCCCGATCG- TTTAGG ACTCTTTGGTGCACCCCCCTTAGAGGAGGGATATGTGGTTCTGGTAGGAGACGAGAACCTAAAACAGTTCCCGC- CTCCGT CTGAATTTTTGCTTTCGGTTTGGGACCGAAGCCGCGCCGCGCGTCTTGTCTGCTGCAGCATCGTTCTGTGTTGT- CTCTGT CTGACTGTGTTTCTGTATTTGTCTGAAAATATGGGCCCGGGCTAGACTGTTACCACTCCCTTAAGTTTGACCTT- AGGTCA CTGGAAAGATGTCGAGCGGATCGCTCACAACCAGTCGGTAGATGTCAAGAAGAGACGTTGGGTTACCTTCTGCT- CTGCAG AATGGCCAACCTTTAACGTCGGATGGCCGCGAGACGGCACCTTTAACCGAGACCTCATCACCCAGGTTAAGATC- AAGGTC TTTTCACCTGGCCCGCATGGACACCCAGACCAGGTCCCCTACATCGTGACCTGGGAAGCCTTGGCTTTTGACCC- CCCTCC CTGGGTCAAGCCCTTTGTACACCCTAAGCCTCCGCCTCCTCTTCCTCCATCCGCCCCGTCTCTCCCCCTTGAAC- CTCCTC GTTCGACCCCGCCTCGATCCTCCCTTTATCCAGCCCTCACT
[0225] In one specific non-limiting example, the isolated nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:234 provided below:
TABLE-US-00067 [SEQ ID NO: 234] atggaaaccgacaccctgctgctgtgggtgctgctgctgtgggtgccagga tccacaggactgcctgtgctgactcagccaccctcagcgtctgggaccccc gggcagagggtcaccatctcttgttctggacgcagttccaacatcgggagt aattctgttaactggtatcgacaactcccaggagcggcccccaaactcctc atctatagtaataatcagcggcccccaggggtccctgtgcgattctctggc tccaagtctggcacctcagcctccctggccatcagtgggctccagtctgaa gatgaggccacttattactgtgcaacatgggatgacaatctgaatgttcac tatgtcttcggaactgggaccaaggtcaccgtcctaggttctagaggtggt ggtggtagcggcggcggcggctctggtggtggtggatccctcgagatggcc caggtgcagctggtgcagtctggggctgaggtgaagaagcctgggtcctcg gtgaaggtctcctgcaaggcttctggaggcaccttcagcagctatgctatc agctgggtgcgacaggcccctggacaagggcttgagtggatgggaaggatc atccctatccttggtatagcaaactacgcacagaagttccagggcagagtc acgattaccgcggacaaatccacgagcacagcctacatggagctgagcagc ctgagatctgaggacacggccgtgtattactgtgcgcgcggtggttactac tctcatgacatgtggtctgaagattggggtcaaggtactctggtgaccgtc tcctcagcggccgcaattgaagttatgtatcctcctccttacctagacaat gagaagagcaatggaaccattatccatgtgaaagggaaacacctttgtcca agtcccctatttcccggaccttctaagcccttttgggtgctggtggtggtt ggtggagtcctggcttgctatagcttgctagtaacagtggcctttattatt ttctgggtgaggagtaagaggagcaggctcctgcacagtgactacatgaac atgactccccgccgccccgggcccacccgcaagcattaccagccctatgcc ccaccacgcgacttcgcagcctatcgctccagagtgaagttcagcaggagc gcagacgcccccgcgtaccagcagggccagaaccagctctataacgagctc aatctaggacgaagagaggagtacgatgttttggacaagagacgtggccgg gaccctgagatggggggaaagccgagaaggaagaaccctcaggaaggcctg tacaatgaactgcagaaagataagatggcggaggcctacagtgagattggg atgaaaggcgagcgccggaggggcaaggggcacgatggcctttaccagggt ctcagtacagccaccaaggacacctacgacgcccttcacatgcaggccctg ccccctcgctaa
[0226] In one specific non-limiting example, the isolated nucleic acid molecule comprises nucleic acids having the sequence set forth in SEQ ID NO:235 provided below:
TABLE-US-00068 [SEQ ID NO: 235] atggaaaccgacaccctgctgctgtgggtgctgctgctgtgggtgccagga tccacaggacaggctgtgctgactcagccaccctcagcgtctgggaccccc gggcagagggtcaccatctcttgttctggaagcagctccaacatcggaagt aattacgtattctggtaccagcagctcccaggaacggcccccaaactcctc atctatagtaataatcagcggccctcaggggtccctgaccgattctctggc tccaagtctggcacctcagcctccctggccatcagtgggctccggtccgag gatgaggctgattattactgtgcagcatgggatgacagcctgagtgcctct tatgttttcggaactgggaccaaggtcaccgtcctaggttctagaggtggt ggtggtagcggcggcggcggctctggtggtggtggatccctcgagatggcc caggtgcagctggtgcagtctggggctgaggtgaagaagcctgggtcctcg gtgaaggtctcctgcaaggcttctggaggcaccttcagcagctatgctatc agctgggtgcgacaggcccctggacaagggcttgagtggatgggaaggatc atccctatccttggtacagcaaactacgcacagaagttccagggcagagtc acgattaccgcggacgaatccacgagcacagcctacatggagctgagcagc ctgagatctgaggacacggccgtgtattactgtgcgcgctctggttacggt tcttaccgttgggaagattcttggggtcaaggtactctggtgaccgtctcc tcagcggccgcaattgaagttatgtatcctcctccttacctagacaatgag aagagcaatggaaccattatccatgtgaaagggaaacacctttgtccaagt cccctatttcccggaccttctaagcccttttgggtgctggtggtggttggt ggagtcctggcttgctatagcttgctagtaacagtggcctttattattttc tgggtgaggagtaagaggagcaggctcctgcacagtgactacatgaacatg actccccgccgccccgggcccacccgcaagcattaccagccctatgcccca ccacgcgacttcgcagcctatcgctccagagtgaagttcagcaggagcgca gacgcccccgcgtaccagcagggccagaaccagctctataacgagctcaat ctaggacgaagagaggagtacgatgttttggacaagagacgtggccgggac cctgagatggggggaaagccgagaaggaagaaccctcaggaaggcctgtac aatgaactgcagaaagataagatggcggaggcctacagtgagattgggatg aaaggcgagcgccggaggggcaaggggcacgatggcctttaccagggtctc agtacagccaccaaggacacctacgacgcccttcacatgcaggccctgccc cctcgctaa
[0227] The isolated nucleic acid molecule having the nucleotide sequence of SEQ ID NO:207 encodes a BCMA-targeted CAR (designated as BCMA-targeted 28z CAR54) comprising a human scFv that comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:53, a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:54, and a linker having an amino acid sequence of SEQ ID NO:69 positioned between the heavy chain variable region and the light chain variable region, a transmembrane domain comprising a CD28 polypeptide, and an intracellular domain comprising a CD3.xi. polypeptide comprising amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195, and a co-stimulatory signaling region comprising a CD28 polypeptide, wherein the CD28 region comprising the transmembrance domain and the co-stimulatory signaling region comprises amino acids 114 to 220 of SEQ ID NO:193.
[0228] The isolated nucleic acid molecule having the nucleotide sequence of SEQ ID NO:208 encodes a BCMA-targeted CAR (designated as BCMA-targeted 28z CAR40) comprising a human scFv that comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:57, a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:58, and a linker having an amino acid sequence of SEQ ID NO:69 positioned between the heavy chain variable region and the light chain variable region, a transmembrane domain comprising a CD28 polypeptide, and an intracellular domain comprising a CD3.xi. polypeptide comprising amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195, and a co-stimulatory signaling region comprising a CD28 polypeptide, wherein the CD28 region comprising the transmembrance domain and the co-stimulatory signaling region comprises amino acids 114 to 220 of SEQ ID NO:193.
[0229] The isolated nucleic acid molecule having the nucleotide sequence of SEQ ID NO:209 encodes a BCMA-targeted CAR (designated as BCMA-targeted 28z CAR24) comprising a human scFv that comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:65, a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:66, and a linker having an amino acid sequence of SEQ ID NO:69 positioned between the heavy chain variable region and the light chain variable region, a transmembrane domain comprising a CD28 polypeptide, and an intracellular domain comprising a CD3.xi. polypeptide comprising amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195, and a co-stimulatory signaling region comprising a CD28 polypeptide, wherein the CD28 region comprising the transmembrance domain and the co-stimulatory signaling region comprises amino acids 114 to 220 of SEQ ID NO:193.
[0230] The isolated nucleic acid molecule having the nucleotide sequence of SEQ ID NO:234 encodes a BCMA-targeted CAR (designated as BCMA-targeted 28z CAR3) comprising a human scFv that comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:21, a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:22, and a linker having an amino acid sequence of SEQ ID NO:69 positioned between the heavy chain variable region and the light chain variable region, a transmembrane domain comprising a CD28 polypeptide, and an intracellular domain comprising a CD3.xi. polypeptide comprising amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195, and a co-stimulatory signaling region comprising a CD28 polypeptide, wherein the CD28 region comprising the transmembrance domain and the co-stimulatory signaling region comprises amino acids 114 to 220 of SEQ ID NO:193.
[0231] The isolated nucleic acid molecule having the nucleotide sequence of SEQ ID NO:235 encodes a BCMA-targeted CAR (designated as BCMA-targeted 28z CAR37) comprising a human scFv that comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:61, a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:62, and a linker having an amino acid sequence of SEQ ID NO:69 positioned between the heavy chain variable region and the light chain variable region, a transmembrane domain comprising a CD28 polypeptide, and an intracellular domain comprising a CD3.xi. polypeptide comprising amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195, and a co-stimulatory signaling region comprising a CD28 polypeptide, wherein the CD28 region comprising the transmembrance domain and the co-stimulatory signaling region comprises amino acids 114 to 220 of SEQ ID NO:193.
[0232] The isolated nucleic acid molecule having the nucleotide sequence of SEQ ID NO:229 encodes a BCMA-targeted CAR (designated as BCMA-targeted BBz CAR3) comprising a human scFv that comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:21, a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:22, and a linker having an amino acid sequence of SEQ ID NO:69 positioned between the heavy chain variable region and the light chain variable region, a transmembrane domain comprising a CD8 polypeptide having amino acids 137 to 207 of SEQ ID NO: 226, and an intracellular domain comprising a CD3.xi. polypeptide comprising amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195, and a co-stimulatory signaling region comprising a 4-1BB polypeptide having the amino acids 214-255 of SEQ ID NO: 197. Nucleotide sequences 270-1031 of SEQ ID NO: 229 encodes the human scFv. Nucleotide sequences 1041-1253 of SEQ ID NO: 229 encodes the CD8 polypeptide comprised in the transmembrane domain. Nucleotide sequences 1254-1379 of SEQ ID NO: 229 encodes the 4-1BB polypeptide comprised in the intracellular domain. Nucleotide sequences 1380-1718 of SEQ ID NO: 229 encodes the CD3zeta polypeptide comprised in the intracellular domain. Other portions of SEQ ID NO: 229 are shown in Table 19.
TABLE-US-00069 TABLE 19 nucleotide Sequence positions Portions of SEQ ID NO: 229 number of nucleotides Kappa sp 210 . . . 269 60 LTR 1998 . . . 2467 470 M13 fwd 3166 . . . 3182 17 AmpR promoter 3657 . . . 3761 105 AmpR 3762 . . . 4622 861 ori 4793 . . . 5381 589 CAP binding site 5669 . . . 5690 22 lac promoter 5705 . . . 5735 31 lac operator 5743 . . . 5759 17 M13 rev 5767 . . . 5783 17 LTR 6192 . . . 6785 594 MMLV Psi 6848 . . . 7205 358 gag (truncated) 7270 . . . 15 417
[0233] The isolated nucleic acid molecule having the nucleotide sequence of SEQ ID NO:230 encodes a BCMA-targeted CAR (designated as BCMA-targeted BBz CAR24) comprising a human scFv that comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:65, a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:66, and a linker having an amino acid sequence of SEQ ID NO:69 positioned between the heavy chain variable region and the light chain variable region, a transmembrane domain comprising a CD8 polypeptide having amino acids 137 to 207 of SEQ ID NO: 226, and an intracellular domain comprising a CD3.xi. polypeptide comprising amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195, and a co-stimulatory signaling region comprising a 4-1BB polypeptide having the amino acids 214-255 of SEQ ID NO: 197. Nucleotide sequences 270-1015 of SEQ ID NO: 230 encodes the human scFv. Nucleotide sequences 1023-1235 of SEQ ID NO: 230 encodes the CD8 polypeptide comprised in the transmembrane domain. Nucleotide sequences 1236-1361 of SEQ ID NO: 230 encodes the 4-1BB polypeptide comprised in the intracellular domain. Nucleotide sequences 1362-1700 of SEQ ID NO: 230 encodes the CD3zeta polypeptide comprised in the intracellular domain. Other portions of SEQ ID NO: 230 are shown in Table 20.
TABLE-US-00070 TABLE 20 nucleotide Sequence positions Portions of SEQ ID NO: 230 number of nucleotides Kappa sp 210 . . . 269 60 LTR 1980 . . . 2449 470 M13 fwd 3148 . . . 3164 17 AmpR promoter 3639 . . . 3743 105 AmpR 3744 . . . 4604 861 ori 4775 . . . 5363 589 CAP binding site 5651 . . . 5672 22 lac promoter 5687 . . . 5717 31 lac operator 5725 . . . 5741 17 M13 rev 5749 . . . 5765 17 LTR 6174 . . . 6767 594 MMLV Psi 6830 . . . 7187 358 gag (truncated) 7252 . . . 15 417
[0234] The isolated nucleic acid molecule having the nucleotide sequence of SEQ ID NO:231 encodes a BCMA-targeted CAR (designated as BCMA-targeted BBz CAR37) comprising a human scFv that comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:61, a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:62, and a linker having an amino acid sequence of SEQ ID NO:69 positioned between the heavy chain variable region and the light chain variable region, a transmembrane domain comprising a CD8 polypeptide having amino acids 137 to 207 of SEQ ID NO: 226, and an intracellular domain comprising a CD3.xi. polypeptide comprising amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195, and a co-stimulatory signaling region comprising a 4-1BB polypeptide having the amino acids 214-255 of SEQ ID NO: 197. Nucleotide sequences 270-1028 of SEQ ID NO: 231 encodes the human scFv. Nucleotide sequences 1038-1250 of SEQ ID NO: 231 encodes the CD8 polypeptide comprised in the transmembrane domain. Nucleotide sequences 1251-1376 of SEQ ID NO: 231 encodes the 4-1BB polypeptide comprised in the intracellular domain. Nucleotide sequences 1377-1715 of SEQ ID NO: 231 encodes the CD3zeta polypeptide comprised in the intracellular domain. Other portions of SEQ ID NO: 231 are shown in Table 21.
TABLE-US-00071 TABLE 21 nucleotide Sequence positions Portions of SEQ ID NO: 231 number of nucleotides Kappa sp 210 . . . 269 60 LTR 1995 . . . 2464 470 M13 fwd 3163 . . . 3179 17 AmpR promoter 3654 . . . 3758 105 AmpR 3759 . . . 4619 861 ori 4790 . . . 5378 589 CAP binding site 5666 . . . 5687 22 lac promoter 5702 . . . 5732 31 lac operator 5740 . . . 5756 17 M13 rev 5764 . . . 5780 17 LTR 6189 . . . 6782 594 MMLV Psi 6845 . . . 7202 358 gag (truncated) 7267 . . . 15 417
[0235] The isolated nucleic acid molecule having the nucleotide sequence of SEQ ID NO:232 encodes a BCMA-targeted CAR (designated as BCMA-targeted BBz CAR40) comprising a human scFv that comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:57, a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:58, and a linker having an amino acid sequence of SEQ ID NO:69 positioned between the heavy chain variable region and the light chain variable region, a transmembrane domain comprising a CD8 polypeptide having amino acids 137 to 207 of SEQ ID NO: 226, and an intracellular domain comprising a CD3.xi. polypeptide comprising amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195, and a co-stimulatory signaling region comprising a 4-1BB polypeptide having the amino acids 214-255 of SEQ ID NO: 197. Nucleotide sequences 270-1024 of SEQ ID NO: 232 encodes the human scFv. Nucleotide sequences 1032-1244 of SEQ ID NO: 232 encodes the CD8 polypeptide comprised in the transmembrane domain. Nucleotide sequences 1245-1370 of SEQ ID NO: 232 encodes the 4-1BB polypeptide comprised in the intracellular domain. Nucleotide sequences 1371-1709 of SEQ ID NO: 232 encodes the CD3zeta polypeptide comprised in the intracellular domain. Other portions of SEQ ID NO: 232 are shown in Table 22.
TABLE-US-00072 TABLE 22 nucleotide Sequence positions Portions of SEQ ID NO: 232 number of nucleotides Kappa sp 210 . . . 269 60 LTR 1989 . . . 2458 470 M13 fwd 3157 . . . 3173 17 AmpR promoter 3648 . . . 3752 105 AmpR 3753 . . . 4613 861 ori 4784 . . . 5372 589 CAP binding site 5660 . . . 5681 22 lac promoter 5696 . . . 5726 31 lac operator 5734 . . . 5750 17 M13 rev 5758 . . . 5774 17 LTR 6183 . . . 6776 594 MMLV Psi 6839 . . . 7196 358 gag (truncated) 7261 . . . 15 417
[0236] The isolated nucleic acid molecule having the nucleotide sequence of SEQ ID NO:233 encodes a BCMA-targeted CAR (designated as BCMA-targeted BBz CAR54) comprising a human scFv that comprises a heavy chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:53, a light chain variable region comprising amino acids having the sequence set forth in SEQ ID NO:54, and a linker having an amino acid sequence of SEQ ID NO:69 positioned between the heavy chain variable region and the light chain variable region, a transmembrane domain comprising a CD8 polypeptide having amino acids 137 to 207 of SEQ ID NO: 226, and an intracellular domain comprising a CD3.xi. polypeptide comprising amino acid sequence of amino acids 52 to 163 of SEQ ID NO: 195, and a co-stimulatory signaling region comprising a 4-1BB polypeptide having the amino acids 214-255 of SEQ ID NO: 197. Nucleotide sequences 270-1003 of SEQ ID NO: 233 encodes the human scFv. Nucleotide sequences 1011-1223 of SEQ ID NO: 233 encodes the CD8 polypeptide comprised in the transmembrane domain. Nucleotide sequences 1224-1349 of SEQ ID NO: 233 encodes the 4-1BB polypeptide comprised in the intracellular domain. Nucleotide sequences 1350-1688 of SEQ ID NO: 233 encodes the CD3zeta polypeptide comprised in the intracellular domain. Other portions of SEQ ID NO: 233 are shown in Table 23.
TABLE-US-00073 TABLE 23 nucleotide Sequence positions Portions of SEQ ID NO: 233 number of nucleotides Kappa sp 210 . . . 269 60 LTR 1968 . . . 2437 470 M13 fwd 3136 . . . 3152 17 AmpR promoter 3627 . . . 3731 105 AmpR 3732 . . . 4592 861 ori 4763 . . . 5351 589 CAP binding site 5639 . . . 5660 22 lac promoter 5675 . . . 5705 31 lac operator 5713 . . . 5729 17 M13 rev 5737 . . . 5753 17 LTR 6162 . . . 6755 594 MMLV Psi 6818 . . . 7175 358 gag (truncated) 7240 . . . 15 417
[0237] In certain embodiments, the isolated nucleic acid molecule encodes a functional portion of a presently disclosed BCMA-targeted CAR. As used herein, the term "functional portion" refers to any portion, part or fragment of a presently disclosed BCMA-targeted CAR, which portion, part or fragment retains the biological activity of the BCMA-targeted CAR (the parent CAR). For example, functional portions encompass the portions, parts or fragments of a presently disclosed BCMA-targeted CAR that retains the ability to recognize a target cell, to treat a disease, e.g., multiple myeloma, to a similar, same, or even a higher extent as the parent CAR. In certain embodiments, an isolated nucleic acid molecule encoding a functional portion of a presently disclosed BCMA-targeted CAR can encode a protein comprising, e.g., about 10%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, and 95%, or more of the parent CAR.
[0238] One Phase I clinical trial (NCT02215967) operated by National Cancer Institute (NCI) used anti-BCMA CAR-transduced T cells for treating multiple myeloma..sup.33,34 The anti-BCMA CAR applied in the NCI's clinical trial comprises a murine scFv binding to human BCMA. Using a mouse antibody or a mouse scFv for treating humans can lead to anti-mouse antibody (HAMA) response, which may be life-threatening. Unlike NCI clinical trial, in certain embodiments, the presently disclosed BCMA-targeted CAR comprises a human scFv, and thus, affords a much decreased risk of immunogenicity, compared with CARs comprising murine antibodies (see Maus et al., Cancer Immunol Res (2003); 1(1):26-31), which reports that the potential immunogenicity of CARs derived from murine antibodies may be a safety issue for mRNA CARs).
III. Immunoresponsive Cells
[0239] The presently disclosed subject matter provides immunoresponsive cells expressing a CAR that comprises an extracellular antigen-binding domain, a transmembrane domain and an intracellular domain, where the extracellular antigen-binding domain specifically binds to BCMA (e.g., human BCMA) as described above. The immunoresponsive cells can be transduced with a presently disclosed CAR such that the cells express the CAR. The presently disclosed subject matter also provides methods of using such cells for the treatment of a tumor, e.g., multiple myeloma (MM). The immunoresponsive cells of the presently disclosed subject matter can be cells of the lymphoid lineage. The lymphoid lineage, comprising B, T and natural killer (NK) cells, provides for the production of antibodies, regulation of the cellular immune system, detection of foreign agents in the blood, detection of cells foreign to the host, and the like. Non-limiting examples of immunoresponsive cells of the lymphoid lineage include T cells, Natural Killer (NK) cells, cytotoxic T lymphocytes (CTLs), regulatory T cells, embryonic stem cells, and pluripotent stem cells (e.g., those from which lymphoid cells may be differentiated). T cells can be lymphocytes that mature in the thymus and are chiefly responsible for cell-mediated immunity. T cells are involved in the adaptive immune system. The T cells of the presently disclosed subject matter can be any type of T cells, including, but not limited to, T helper cells, cytotoxic T cells, memory T cells (including central memory T cells, stem-cell-like memory T cells (or stem-like memory T cells), and two types of effector memory T cells: e.g., T.sub.EM cells and T.sub.EMRA cells), Regulatory T cells (also known as suppressor T cells), Natural killer T cells, Mucosal associated invariant T cells, and .gamma..delta. T cells. In certain embodiments, the CAR-expressing T cells express Foxp3 to achieve and maintain a T regulatory phenotype. Natural killer (NK) cells can be lymphocytes that are part of cell-mediated immunity and act during the innate immune response. NK cells do not require prior activation in order to perform their cytotoxic effect on target cells. Cytotoxic T cells (CTL or killer T cells) are a subset of T lymphocytes capable of inducing the death of infected somatic or tumor cells.
[0240] The immunoresponsive cells of the presently disclosed subject matter can express an extracellular antigen-binding domain (e.g., a human scFV, a Fab that is optionally crosslinked, or a F(ab).sub.2) that specifically binds to BCMA (e.g., human BCMA), for the treatment of multiple myeloma. Such immunoresponsive cells can be administered to a subject (e.g., a human subject) in need thereof for the treatment of multiple myeloma. In certain embodiments, the immunoresponsive cell is a T cell. The T cell can be a CD4.sup.+ T cell or a CD8.sup.+ T cell. In certain embodiments, the T cell is a CD4.sup.+ T cell. In another embodiment, the T cell is a CD8.sup.+ T cell.
[0241] A presently disclosed immunoresponsive cell can be further transduced with at least one co-stimulatory ligand, such that the immunoresponsive cell co-expresses or is induced to co-express the BCMA-targeted CAR and the at least one co-stimulatory ligand. The interaction between the BCMA-targeted CAR and at least one co-stimulatory ligand provides a non-antigen-specific signal important for full activation of an immunoresponsive cell (e.g., T cell). Co-stimulatory ligands include, but are not limited to, members of the tumor necrosis factor (TNF) superfamily, and immunoglobulin (Ig) superfamily ligands. TNF is a cytokine involved in systemic inflammation and stimulates the acute phase reaction. Its primary role is in the regulation of immune cells. Members of TNF superfamily share a number of common features. The majority of TNF superfamily members are synthesized as type II transmembrane proteins (extracellular C-terminus) containing a short cytoplasmic segment and a relatively long extracellular region. TNF superfamily members include, without limitation, nerve growth factor (NGF), CD40L (CD40L)/CD154, CD137L/4-1BBL, TNF-.alpha., CD134L/OX40L/CD252, CD27L/CD70, Fas ligand (FasL), CD30L/CD153, tumor necrosis factor beta (TNF.beta.)/lymphotoxin-alpha (LT.alpha.), lymphotoxin-beta (LT.beta.), CD257/B cell-activating factor (BAFF)/Blys/THANK/Tall-1, glucocorticoid-induced TNF Receptor ligand (GITRL), and TNF-related apoptosis-inducing ligand (TRAIL), LIGHT (TNFSF14). The immunoglobulin (Ig) superfamily is a large group of cell surface and soluble proteins that are involved in the recognition, binding, or adhesion processes of cells. These proteins share structural features with immunoglobulins--they possess an immunoglobulin domain (fold). Immunoglobulin superfamily ligands include, but are not limited to, CD80 and CD86, both ligands for CD28, PD-L1/(B7-H1) that ligands for PD-1. In certain embodiments, the at least one co-stimulatory ligand is selected from the group consisting of 4-1BBL, CD80, CD86, CD70, OX40L, CD48, TNFRSF14, PD-L1, and combinations thereof. In certain embodiments, the immunoresponsive cell is transduced with one co-stimulatory ligand that is 4-1BBL. In certain embodiments, the immunoresponsive cell is transduced with two co-stimulatory ligands that are 4-1BBL and CD80. CARs transduced with at least one co-stimulatory ligand are described in U.S. Pat. No. 8,389,282, which is incorporated by reference in its entirety.
[0242] Furthermore, a presently disclosed immunoresponsive cell can be further transduced with at least one cytokine, such that the immunoresponsive cell secretes the at least one cytokine as well as expresses the BCMA-targeted CAR. In certain embodiments, the at least one cytokine is selected from the group consisting of IL-2, IL-3, IL-6, IL-7, IL-11, IL-12, IL-15, IL-17, and IL-21. In certain embodiments, the cytokine is IL-12.
[0243] The BCMA-specific or BCMA-targeted human lymphocytes that can be used in peripheral donor lymphocytes, e.g., those disclosed in Sadelain, M., et al. 2003 Nat Rev Cancer 3:35-45 (disclosing peripheral donor lymphocytes genetically modified to express CARs), in Morgan, R. A., et al. 2006 Science 314:126-129 (disclosing peripheral donor lymphocytes genetically modified to express a full-length tumor antigen-recognizing T cell receptor complex comprising the .alpha. and .beta. heterodimer), in Panelli, M. C., et al. 2000 J Immunol 164:495-504; Panelli, M. C., et al. 2000 J Immunol 164:4382-4392 (disclosing lymphocyte cultures derived from tumor infiltrating lymphocytes (TILs) in tumor biopsies), and in Dupont, J., et al. 2005 Cancer Res 65:5417-5427; Papanicolaou, G. A., et al. 2003 Blood 102:2498-2505 (disclosing selectively in vitro-expanded antigen-specific peripheral blood leukocytes employing artificial antigen-presenting cells (AAPCs) or pulsed dendritic cells). The immunoresponsive cells (e.g., T cells) can be autologous, non-autologous (e.g., allogeneic), or derived in vitro from engineered progenitor or stem cells.
[0244] In certain embodiments, a presently disclosed immunoresponsive cell (e.g., T cell) expresses from about 1 to about 4, from about 2 to about 4, from about 3 to about 4, from about 1 to about 2, from about 1 to about 3, or from about 2 to about 3 vector copy numbers/cell of a presently disclosed BCMA-targeted CAR.
[0245] The unpurified source of CTLs may be any known in the art, such as the bone marrow, fetal, neonate or adult or other hematopoietic cell source, e.g., fetal liver, peripheral blood or umbilical cord blood. Various techniques can be employed to separate the cells. For instance, negative selection methods can remove non-CTLs initially. Monoclonal antibodies are particularly useful for identifying markers associated with particular cell lineages and/or stages of differentiation for both positive and negative selections.
[0246] A large proportion of terminally differentiated cells can be initially removed by a relatively crude separation. For example, magnetic bead separations can be used initially to remove large numbers of irrelevant cells. Preferably, at least about 80%, usually at least 70% of the total hematopoietic cells will be removed prior to cell isolation.
[0247] Procedures for separation include, but are not limited to, density gradient centrifugation; resetting; coupling to particles that modify cell density; magnetic separation with antibody-coated magnetic beads; affinity chromatography; cytotoxic agents joined to or used in conjunction with a mAb, including, but not limited to, complement and cytotoxins; and panning with antibody attached to a solid matrix, e.g. plate, chip, elutriation or any other convenient technique.
[0248] Techniques for separation and analysis include, but are not limited to, flow cytometry, which can have varying degrees of sophistication, e.g., a plurality of color channels, low angle and obtuse light scattering detecting channels, impedance channels.
[0249] The cells can be selected against dead cells, by employing dyes associated with dead cells such as propidium iodide (PI). Preferably, the cells are collected in a medium comprising 2% fetal calf serum (FCS) or 0.2% bovine serum albumin (BSA) or any other suitable, preferably sterile, isotonic medium.
IV. Vectors
[0250] Genetic modification of immunoresponsive cells (e.g., T cells, CTL cells, NK cells) can be accomplished by transducing a substantially homogeneous cell composition with a recombinant DNA or RNA construct. The vector can be a retroviral vector (e.g., gamma retroviral), which is employed for the introduction of the DNA or RNA construct into the host cell genome. For example, a polynucleotide encoding the BCMA-targeted CAR can be cloned into a retroviral vector and expression can be driven from its endogenous promoter, from the retroviral long terminal repeat, or from an alternative internal promoter.
[0251] Non-viral vectors or RNA may be used as well. Random chromosomal integration, or targeted integration (e.g., using a nuclease, transcription activator-like effector nucleases (TALENs), Zinc-finger nucleases (ZFNs), and/or clustered regularly interspaced short palindromic repeats (CRISPRs), or transgene expression (e.g., using a natural or chemically modified RNA) can be used.
[0252] For initial genetic modification of the cells to provide BCMA-targeted CAR expressing cells, a retroviral vector is generally employed for transduction, however any other suitable viral vector or non-viral delivery system can be used. For subsequent genetic modification of the cells to provide cells comprising an antigen presenting complex comprising at least two co-stimulatory ligands, retroviral gene transfer (transduction) likewise proves effective. Combinations of retroviral vector and an appropriate packaging line are also suitable, where the capsid proteins will be functional for infecting human cells. Various amphotropic virus-producing cell lines are known, including, but not limited to, PA12 (Miller, et al. (1985) Mol. Cell. Biol. 5:431-437); PA317 (Miller, et al. (1986) Mol. Cell. Biol. 6:2895-2902); and CRIP (Danos, et al. (1988) Proc. Natl. Acad. Sci. USA 85:6460-6464). Non-amphotropic particles are suitable too, e.g., particles pseudotyped with VSVG, RD114 or GALV envelope and any other known in the art.
[0253] Possible methods of transduction also include direct co-culture of the cells with producer cells, e.g., by the method of Bregni, et al. (1992) Blood 80:1418-1422, or culturing with viral supernatant alone or concentrated vector stocks with or without appropriate growth factors and polycations, e.g., by the method of Xu, et al. (1994) Exp. Hemat. 22:223-230; and Hughes, et al. (1992) J. Clin. Invest. 89:1817.
[0254] Transducing viral vectors can be used to express a co-stimulatory ligand (e.g., 4-1BBL and IL-12) in an immunoresponsive cell. Preferably, the chosen vector exhibits high efficiency of infection and stable integration and expression (see, e.g., Cayouette et al., Human Gene Therapy 8:423-430, 1997; Kido et al., Current Eye Research 15:833-844, 1996; Bloomer et al., Journal of Virology 71:6641-6649, 1997; Naldini et al., Science 272:263 267, 1996; and Miyoshi et al., Proc. Natl. Acad. Sci. U.S.A. 94:10319, 1997). Other viral vectors that can be used include, for example, adenoviral, lentiviral, and adeno-associated viral vectors, vaccinia virus, a bovine papilloma virus, or a herpes virus, such as Epstein-Barr Virus (also see, for example, the vectors of Miller, Human Gene Therapy 15-14, 1990; Friedman, Science 244:1275-1281, 1989; Eglitis et al., BioTechniques 6:608-614, 1988; Tolstoshev et al., Current Opinion in Biotechnology 1:55-61, 1990; Sharp, The Lancet 337:1277-1278, 1991; Cornetta et al., Nucleic Acid Research and Molecular Biology 36:311-322, 1987; Anderson, Science 226:401-409, 1984; Moen, Blood Cells 17:407-416, 1991; Miller et al., Biotechnology 7:980-990, 1989; Le Gal La Salle et al., Science 259:988-990, 1993; and Johnson, Chest 107:77S-- 83S, 1995). Retroviral vectors are particularly well developed and have been used in clinical settings (Rosenberg et al., N. Engl. J. Med 323:370, 1990; Anderson et al., U.S. Pat. No. 5,399,346).
[0255] In certain non-limiting embodiments, the vector expressing a presently disclosed BCMA-targeted CAR is a retroviral vector, e.g., a 293galv9 retroviral vector.
[0256] Non-viral approaches can also be employed for the expression of a protein in cell. For example, a nucleic acid molecule can be introduced into a cell by administering the nucleic acid in the presence of lipofection (Feigner et al., Proc. Nat'l. Acad. Sci. U.S.A. 84:7413, 1987; Ono et al., Neuroscience Letters 17:259, 1990; Brigham et al., Am. J. Med. Sci. 298:278, 1989; Staubinger et al., Methods in Enzymology 101:512, 1983), asialoorosomucoid-polylysine conjugation (Wu et al., Journal of Biological Chemistry 263:14621, 1988; Wu et al., Journal of Biological Chemistry 264:16985, 1989), or by micro-injection under surgical conditions (Wolff et al., Science 247:1465, 1990). Other non-viral means for gene transfer include transfection in vitro using calcium phosphate, DEAE dextran, electroporation, and protoplast fusion. Liposomes can also be potentially beneficial for delivery of DNA into a cell. Transplantation of normal genes into the affected tissues of a subject can also be accomplished by transferring a normal nucleic acid into a cultivatable cell type ex vivo (e.g., an autologous or heterologous primary cell or progeny thereof), after which the cell (or its descendants) are injected into a targeted tissue or are injected systemically. Recombinant receptors can also be derived or obtained using transposases or targeted nucleases (e.g. Zinc finger nucleases, meganucleases, or TALE nucleases). Transient expression may be obtained by RNA electroporation.
[0257] cDNA expression for use in polynucleotide therapy methods can be directed from any suitable promoter (e.g., the human cytomegalovirus (CMV), simian virus 40 (SV40), or metallothionein promoters), and regulated by any appropriate mammalian regulatory element or intron (e.g. the elongation factor 1.alpha. enhancer/promoter/intron structure). For example, if desired, enhancers known to preferentially direct gene expression in specific cell types can be used to direct the expression of a nucleic acid. The enhancers used can include, without limitation, those that are characterized as tissue- or cell-specific enhancers. Alternatively, if a genomic clone is used as a therapeutic construct, regulation can be mediated by the cognate regulatory sequences or, if desired, by regulatory sequences derived from a heterologous source, including any of the promoters or regulatory elements described above.
[0258] The resulting cells can be grown under conditions similar to those for unmodified cells, whereby the modified cells can be expanded and used for a variety of purposes.
V. Polypeptides and Analogs and Polynucleotides
[0259] Also included in the presently disclosed subject matter are extracellular antigen-binding domains that specifically binds to a BCMA (e.g., human BCMA) (e.g., an scFv (e.g., a human scFv), a Fab, or a (Fab).sub.2), CD3.zeta., CD8, CD28, etc. polypeptides or fragments thereof, and polynucleotides encoding thereof that are modified in ways that enhance their anti-tumor activity when expressed in an immunoresponsive cell. The presently disclosed subject matter provides methods for optimizing an amino acid sequence or a nucleic acid sequence by producing an alteration in the sequence. Such alterations may comprise certain mutations, deletions, insertions, or post-translational modifications. The presently disclosed subject matter further comprises analogs of any naturally-occurring polypeptide of the presently disclosed subject matter. Analogs can differ from a naturally-occurring polypeptide of the presently disclosed subject matter by amino acid sequence differences, by post-translational modifications, or by both. Analogs of the presently disclosed subject matter can generally exhibit at least about 85%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99% or more identity with all or part of a naturally-occurring amino, acid sequence of the presently disclosed subject matter. The length of sequence comparison is at least 5, 10, 15, 20, 25, 50, 75, 100 or more amino acid residues. Again, in an exemplary approach to determining the degree of identity, a BLAST program may be used, with a probability score between e.sup.-3 and e.sup.-100 indicating a closely related sequence. Modifications comprise in vivo and in vitro chemical derivatization of polypeptides, e.g., acetylation, carboxylation, phosphorylation, or glycosylation; such modifications may occur during polypeptide synthesis or processing or following treatment with isolated modifying enzymes. Analogs can also differ from the naturally-occurring polypeptides of the presently disclosed subject matter by alterations in primary sequence. These include genetic variants, both natural and induced (for example, resulting from random mutagenesis by irradiation or exposure to ethanemethylsulfate or by site-specific mutagenesis as described in Sambrook, Fritsch and Maniatis, Molecular Cloning: A Laboratory Manual (2d ed.), CSH Press, 1989, or Ausubel et al., supra). Also included are cyclized peptides, molecules, and analogs which contain residues other than L-amina acids, e.g., D-amino acids or non-naturally occurring or synthetic amino acids, e.g., beta (.beta.) or gamma (.gamma.) amino acids.
[0260] In addition to full-length polypeptides, the presently disclosed subject matter also provides fragments of any one of the polypeptides or peptide domains of the presently disclosed subject matter. A fragment can be at least 5, 10, 13, or 15 amino acids. In certain embodiments, a fragment is at least 20 contiguous amino acids, at least 30 contiguous amino acids, or at least 50 contiguous amino acids. In certain embodiments, a fragment is at least 60 to 80, 100, 200, 300 or more contiguous amino acids. Fragments of the presently disclosed subject matter can be generated by methods known to those of ordinary skill in the art or may result from normal protein processing (e.g., removal of amino acids from the nascent polypeptide that are not required for biological activity or removal of amino acids by alternative mRNA splicing or alternative protein processing events).
[0261] Non-protein analogs have a chemical structure designed to mimic the functional activity of a protein of the invention. Such analogs are administered according to methods of the presently disclosed subject matter. Such analogs may exceed the physiological activity of the original polypeptide. Methods of analog design are well known in the art, and synthesis of analogs can be carried out according to such methods by modifying the chemical structures such that the resultant analogs increase the anti-neoplastic activity of the original polypeptide when expressed in an immunoresponsive cell. These chemical modifications include, but are not limited to, substituting alternative R groups and varying the degree of saturation at specific carbon atoms of a reference polypeptide. The protein analogs can be relatively resistant to in vivo degradation, resulting in a more prolonged therapeutic effect upon administration. Assays for measuring functional activity include, but are not limited to, those described in the Examples below.
[0262] In accordance with the presently disclosed subject matter, the polynucleotides encoding an extracellular antigen-binding domain that specifically binds to BCMA (e.g., human BCMA) (e.g., an scFv (e.g., a human scFv), a Fab, or a (Fab).sub.2), CD3.zeta., CD8, CD28) can be modified by codon optimization. Codon optimization can alter both naturally occurring and recombinant gene sequences to achieve the highest possible levels of productivity in any given expression system. Factors that are involved in different stages of protein expression include codon adaptability, mRNA structure, and various cis-elements in transcription and translation. Any suitable codon optimization methods or technologies that are known to ones skilled in the art can be used to modify the polynucleotids of the presently disclosed subject matter, including, but not limited to, OptimumGene.TM., Encor optimization, and Blue Heron.
VI. Administration
[0263] BCMA-targeted CARs and immunoresponsive cells expressing thereof of the presently disclosed subject matter can be provided systemically or directly to a subject for treating or preventing a neoplasia. In certain embodiments, the BCMA-targeted CARs and immunoresponsive cells expressing thereof are directly injected into an organ of interest (e.g., an organ affected by a neoplasia). Alternatively or additionally, the BCMA-targeted CARs and immunoresponsive cells expressing thereof are provided indirectly to the organ of interest, for example, by administration into the circulatory system (e.g., the tumor vasculature). Expansion and differentiation agents can be provided prior to, during or after administration of cells and compositions to increase production of T cells in vitro or in vivo.
[0264] BCMA-targeted CARs and immunoresponsive cells expressing thereof of the presently disclosed subject matter can be administered in any physiologically acceptable vehicle, normally intravascularly, although they may also be introduced into bone or other convenient site where the cells may find an appropriate site for regeneration and differentiation (e.g., thymus). Usually, at least 1.times.10.sup.5 cells can be administered, eventually reaching 1.times.10.sup.10 or more. A cell population comprising immunoresponsive cells expressing a BCMA-targeted CAR can comprise a purified population of cells. Those skilled in the art can readily determine the percentage of immunoresponsive cells in a cell population using various well-known methods, such as fluorescence activated cell sorting (FACS). The ranges of purity in cell populations comprising genetically modified immunoresponsive cells expressing a BCMA-specific CAR can be from about 50% to about 55%, from about 55% to about 60%, from about 65% to about 70%, from about 70% to about 75%, from about 75% to about 80%, from about 80% to about 85%; from about 85% to about 90%, from about 90% to about 95%, or from about 95 to about 100%. Dosages can be readily adjusted by those skilled in the art (e.g., a decrease in purity may require an increase in dosage). The immunoresponsive cells can be introduced by injection, catheter, or the like. If desired, factors can also be included, including, but not limited to, interleukins, e.g. IL-2, IL-3, IL 6, IL-11, IL-7, IL-12, IL-15, IL-21, as well as the other interleukins, the colony stimulating factors, such as G-, M- and GM-CSF, interferons, e.g., .gamma.-interferon.
[0265] Compositions of the presently disclosed subject matter comprise pharmaceutical compositions comprising immunoresponsive cells expressing a BCMA-targeted CAR and a pharmaceutically acceptable carrier. Administration can be autologous or non-autologous. For example, immunoresponsive cells expressing a BCMA-targeted CAR and compositions comprising thereof can be obtained from one subject, and administered to the same subject or a different, compatible subject. Peripheral blood derived T cells of the presently disclosed subject matter or their progeny (e.g., in vivo, ex vivo or in vitro derived) can be administered via localized injection, including catheter administration, systemic injection, localized injection, intravenous injection, or parenteral administration. When administering a pharmaceutical composition of the presently disclosed subject matter (e.g., a pharmaceutical composition comprising immunoresponsive cells expressing a BCMA-targeted CAR), it can be formulated in a unit dosage injectable form (solution, suspension, emulsion).
VII. Formulations
[0266] Immunoresponsive cells expressing a generally BCMA-targeted CAR and compositions comprising thereof of the presently disclosed subject matter can be conveniently provided as sterile liquid preparations, e.g., isotonic aqueous solutions, suspensions, emulsions, dispersions, or viscous compositions, which may be buffered to a selected pH. Liquid preparations are normally easier to prepare than gels, other viscous compositions, and solid compositions. Additionally, liquid compositions are somewhat more convenient to administer, especially by injection. Viscous compositions, on the other hand, can be formulated within the appropriate viscosity range to provide longer contact periods with specific tissues. Liquid or viscous compositions can comprise carriers, which can be a solvent or dispersing medium containing, for example, water, saline, phosphate buffered saline, polyol (for example, glycerol, propylene glycol, liquid polyethylene glycol, and the like) and suitable mixtures thereof.
[0267] Sterile injectable solutions can be prepared by incorporating the compositions comprising immunoresponsive cells expressing a generally BCMA-targeted CAR of the presently disclosed subject matter in the required amount of the appropriate solvent with various amounts of the other ingredients, as desired. Such compositions may be in admixture with a suitable carrier, diluent, or excipient such as sterile water, physiological saline, glucose, dextrose, or the like. The compositions can also be lyophilized. The compositions can contain auxiliary substances such as wetting, dispersing, or emulsifying agents (e.g., methylcellulose), pH buffering agents, gelling or viscosity enhancing additives, preservatives, flavoring agents, colors, and the like, depending upon the route of administration and the preparation desired. Standard texts, such as "REMINGTON'S PHARMACEUTICAL SCIENCE", 17th edition, 1985, incorporated herein by reference, may be consulted to prepare suitable preparations, without undue experimentation.
[0268] Various additives which enhance the stability and sterility of the compositions, including antimicrobial preservatives, antioxidants, chelating agents, and buffers, can be added. Prevention of the action of microorganisms can be ensured by various antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, sorbic acid, and the like. Prolonged absorption of the injectable pharmaceutical form can be brought about by the use of agents delaying absorption, for example, alum inurn monostearate and gelatin. According to the present invention, however, any vehicle, diluent, or additive used would have to be compatible with the immunoresponsive cells expressing a generally BCMA-targeted CAR of the presently disclosed subject matter.
[0269] The compositions can be isotonic, i.e., they can have the same osmotic pressure as blood and lacrimal fluid. The desired isotonicity of the compositions of the presently disclosed subject matter may be accomplished using sodium chloride, or other pharmaceutically acceptable agents such as dextrose, boric acid, sodium tartrate, propylene glycol or other inorganic or organic solutes. Sodium chloride is preferred particularly for buffers containing sodium ions.
[0270] Viscosity of the compositions, if desired, can be maintained at the selected level using a pharmaceutically acceptable thickening agent. Methylcellulose can be used because it is readily and economically available and is easy to work with. Other suitable thickening agents include, for example, xanthan gum, carboxymethyl cellulose, hydroxypropyl cellulose, carbomer, and the like. The concentration of the thickener can depend upon the agent selected. The important point is to use an amount that will achieve the selected viscosity. Obviously, the choice of suitable carriers and other additives will depend on the exact route of administration and the nature of the particular dosage form, e.g., liquid dosage form (e.g., whether the composition is to be formulated into a solution, a suspension, gel or another liquid form, such as a time release form or liquid-filled form).
[0271] Those skilled in the art will recognize that the components of the compositions should be selected to be chemically inert and will not affect the viability or efficacy of the immunoresponsive cells as describe in the presently disclosed subject matter. This will present no problem to those skilled in chemical and pharmaceutical principles, or problems can be readily avoided by reference to standard texts or by simple experiments (not involving undue experimentation), from this disclosure and the documents cited herein.
[0272] One consideration concerning the therapeutic use of the immunoresponsive cells of the presently disclosed subject matter is the quantity of cells necessary to achieve an optimal effect. The quantity of cells to be administered will vary for the subject being treated. In certain embodiments, from about 10.sup.4 to about 10.sup.10, from about 10.sup.5 to about 10.sup.9, or from about 10.sup.6 to about 10.sup.8 immunoresponsive cells of the presently disclosed subject matter are administered to a subject. More effective cells may be administered in even smaller numbers. In certain embodiments, at least about 1.times.10.sup.8, about 2.times.10.sup.8, about 3.times.10.sup.8, about 4.times.10.sup.8, and about 5.times.10.sup.8 immunoresponsive cells of the presently disclosed subject matter are administered to a human subject. The precise determination of what would be considered an effective dose may be based on factors individual to each subject, including their size, age, sex, weight, and condition of the particular subject. Dosages can be readily ascertained by those skilled in the art from this disclosure and the knowledge in the art.
[0273] The skilled artisan can readily determine the amount of cells and optional additives, vehicles, and/or carrier in compositions and to be administered in methods of the presently disclosed subject matter. Typically, any additives (in addition to the active cell(s) and/or agent(s)) are present in an amount of from about 0.001% to about 50% by weight) solution in phosphate buffered saline, and the active ingredient is present in the order of micrograms to milligrams, such as from about 0.0001 wt % to about 5 wt %, from about 0.0001 wt % to about 1 wt %, from about 0.0001 wt % to about 0.05 wt %, from about 0.001 wt % to about 20 wt %, from about 0.01 wt % to about 10 wt %, or from about 0.05 wt % to about 5 wt %. For any composition to be administered to an animal or human, and for any particular method of administration, toxicity should be determined, such as by determining the lethal dose (LD) and LD50 in a suitable animal model e.g., rodent such as mouse; and, the dosage of the composition(s), concentration of components therein and timing of administering the composition(s), which elicit a suitable response. Such determinations do not require undue experimentation from the knowledge of the skilled artisan, this disclosure and the documents cited herein. And, the time for sequential administrations can be ascertained without undue experimentation.
VII. Methods of Treatment
[0274] Tumor Microenvironment. Tumors have a microenvironment that is hostile to the host immune response involving a series of mechanisms by malignant cells to protect themselves from immune recognition and elimination. This "hostile tumor microenvironment" comprises a variety of immune suppressive factors including infiltrating regulatory CD4.sup.+ T cells (Tregs), myeloid derived suppressor cells (MDSCs), tumor associated macrophages (TAMs), immune suppressive cytokines including IL-10 and TGF-.beta., and expression of ligands targeted to immune suppressive receptors expressed by activated T cells (CTLA-4 and PD-1). These mechanisms of immune suppression play a role in the maintenance of tolerance and suppressing inappropriate immune responses, however within the tumor microenvironment these mechanisms prevent an effective anti-tumor immune response. Collectively these immune suppressive factors can induce either marked anergy or apoptosis of adoptively transferred CAR modified T cells upon encounter with targeted tumor cells.
[0275] Challenges in tumor immunology. Effective tumor immunity requires recognition of tumor antigens and unopposed tumor elimination by immune effector cells. Tumor antigens must contain peptide epitopes that are presented by the tumor and can be recognized by specific cytotoxic T lymphocytes (CTLs). The primed CTLs must expand to a sufficient number and migrate to tumor sites, wherein they mature into effectors to perform their functions, which are enhanced by helper T cells and dampened by Tregs and inhibitory macrophages.
[0276] Targeted T cell therapy with engineered T lymphocytes. T cell engineering is a groundbreaking strategy to potentially resolve many previously observed shortcomings of earlier immunotherapeutic approaches. Within the past year, researchers have reported dramatic complete remissions in relapsed.sup.16,17, chemorefractory leukemia and metastatic melanoma.sup.18-20, obtained with autologous peripheral blood T cells targeted to a defined antigen (CD19 and NY-ESO-1, respectively).
[0277] Rationale for a genetic approach: Cell engineering can be used to redirect T cells toward tumor antigens and to enhance T cell function. One impetus for genetic T cell modification is the potential to enhance T cell survival and expansion and to offset T cell death, anergy, and immune suppression. The genetic targeting of T cells can also be refined to prevent undesired destruction of normal tissues.
[0278] Chimeric antigen receptors (CARs): Tumor-specific T cells can be generated by the transfer of genes that encode CARs.sup.21-26. Second-generation CARs comprise a tumor antigen-binding domain fused to an intracellular signaling domain capable of activating T cells and a co-stimulatory domain designed to augment T cell potency and persistence.sup.27. CAR design can therefore reconcile antigen recognition with signal transduction, two functions that are physiologically borne by two separate complexes, the TCR heterodimer and the CD3 complex. The CAR's extracellular antigen-binding domain is usually derived from a murine monoclonal antibody (mAb) or from receptors or their ligands. Antigen recognition is therefore not MHC-restricted.sup.28,29 and is therefore applicable to any patient expressing the target antigen, using the same CAR. Antigen binding by the CARs triggers phosphorylation of immunoreceptor tyrosine-based activation motifs (ITAMs) in the intracellular domain, initiating a signaling cascade required for cytolysis induction, cytokine secretion, and proliferation. Because MEW restriction of antigen recognition is bypassed, the function of CAR-targeted T cells is not affected by HLA downregulation or defects in the antigen-processing machinery.
[0279] T cell requirements for expansion and survival: Proliferation of tumor-specific T cells is needed ex vivo and is arguably desirable in vivo. T cell proliferation must be accompanied by T cell survival to permit absolute T cell expansion and persistence. To proliferate in response to antigen, T cells must receive two signals. One is provided by TCR recognition of antigenic peptide/WIC complexes displayed on the surface of antigen-presenting cells (APCs).sup.25. The other is provided by a T cell co-stimulatory receptor, such as the CD28 or 4-1BB receptors. Whereas the cytolytic activity of T cells does not require concomitant co-stimulation, there is a critical need for the provision of co-stimulatory signals to sustain the antitumor functions of adoptively transferred T cells, as previously demonstrated.sup.23,27,30-32.
[0280] Immune monitoring: Lymphocytes are multifunctional "drugs" that exhibit dynamically evolving effects after infusion. Upon antigen encounter, tumor-specific T cells activate and/or release a variety of proteins that can trigger tumor killing, T cell proliferation, and recruitment or immunomodulation of other immune cells. Thus, measuring which proteins are secreted from which cells, in what quantity, and at what time point yields profound insights into why a particular patient is or is not responding and provides critical feedback for designing more-effective trials. These assay systems will permit direct and meaningful comparisons of clinical approaches and thus help design rational, next-generation therapeutic strategies.
[0281] For treatment, the amount administered is an amount effective in producing the desired effect. An effective amount can be provided in one or a series of administrations. An effective amount can be provided in a bolus or by continuous perfusion.
[0282] An "effective amount" (or, "therapeutically effective amount") is an amount sufficient to affect a beneficial or desired clinical result upon treatment. An effective amount can be administered to a subject in one or more doses. In terms of treatment, an effective amount is an amount that is sufficient to palliate, ameliorate, stabilize, reverse or slow the progression of the disease, or otherwise reduce the pathological consequences of the disease. The effective amount is generally determined by the physician on a case-by-case basis and is within the skill of one in the art. Several factors are typically taken into account when determining an appropriate dosage to achieve an effective amount. These factors include age, sex and weight of the subject, the condition being treated, the severity of the condition and the form and effective concentration of the immunoresponsive cells administered.
[0283] For adoptive immunotherapy using antigen-specific T cells, cell doses in the range of about 10.sup.6 to about 10.sup.10 (e.g., about 10.sup.9) are typically infused. Upon administration of the immunoresponsive cells into the subject and subsequent differentiation, the immunoresponsive cells are induced that are specifically directed against one specific antigen (e.g., BCMA). "Induction" of T cells can include inactivation of antigen-specific T cells such as by deletion or anergy. Inactivation is particularly useful to establish or reestablish tolerance such as in autoimmune disorders. The immunoresponsive cells of the presently disclosed subject matter can be administered by any methods known in the art, including, but not limited to, pleural administration, intravenous administration, subcutaneous administration, intranodal administration, intratumoral administration, intrathecal administration, intrapleural administration, intraperitoneal administration, and direct administration to the thymus. In certain embodiments, the immunoresponsive cells and the compositions comprising thereof are intravenously administered to the subject in need.
[0284] The presently disclosed subject matter provides various methods of using the immunoresponsive cells (e.g., T cells) expressing a BCMA-targeted CAR. For example, the presently disclosed subject matter provides methods of reducing tumor burden in a subject. In one non-limiting example, the method of reducing tumor burden comprises administering an effective amount of the presently disclosed immunoresponsive cell to the subject, thereby inducing tumor cell death in the subject. The presently disclosed immunoresponsive cell can reduce the number of tumor cells, reduce tumor size, and/or eradicate the tumor in the subject. Non-limiting examples of suitable tumor include multiple myeloma, Non-Hodgkin Lymphoma, Hodgkin Lymphoma, Chronic Lymphocytic Leukemia (CLL), glioblastoma, and Waldenstrom's Macroglobulinemia. In certain embodiments, the tumor is multiple myeloma.
[0285] The presently disclosed subject matter also provides methods of increasing or lengthening survival of a subject having a neoplasia. In one non-limiting example, the method of increasing or lengthening survival of a subject having neoplasia comprises administering an effective amount of the presently disclosed immunoresponsive cell to the subject, thereby increasing or lengthening survival of the subject. The method can reduce or eradicate tumor burden in the subject. The presently disclosed subject matter further provides methods for treating or preventing a neoplasia in a subject, comprising administering the presently disclosed immunoresponsive cell to the subject.
[0286] As used herein, the term "neoplasia" refers to a disease characterized by the pathological proliferation of a cell or tissue and its subsequent migration to or invasion of other tissues or organs. Neoplasia growth is typically uncontrolled and progressive, and occurs under conditions that would not elicit, or would cause cessation of, multiplication of normal cells. Neoplasias can affect a variety of cell types, tissues, or organs, including but not limited to an organ selected from the group consisting of bladder, colon, bone, brain, breast, cartilage, glia, esophagus, fallopian tube, gallbladder, heart, intestines, kidney, liver, lung, lymph node, nervous tissue, ovaries, pleura, pancreas, prostate, skeletal muscle, skin, spinal cord, spleen, stomach, testes, thymus, thyroid, trachea, urogenital tract, ureter, urethra, uterus, and vagina, or a tissue or cell type thereof. Neoplasias include cancers, such as sarcomas, carcinomas, or plasmacytomas (malignant tumor of the plasma cells).
[0287] Cancers whose growth may be inhibited using the immunoresponsive cells of the presently disclosed subject matter comprise cancers typically responsive to immunotherapy. Non-limiting examples of cancers for treatment include multiple myeloma, Non-Hodgkin Lymphoma, Hodgkin Lymphoma, Chronic Lymphocytic Leukemia (CLL), glioblastoma, and Waldenstrom's Macroglobulinemia. In certain embodiments, the cancer is multiple myeloma.
[0288] Additionally, the presently disclosed subject matter provides methods of increasing immune-activating cytokine production in response to a cancer cell in a subject. In one non-limiting example, the method comprises administering the presently disclosed immunoresponsive cell to the subject. The immune-activating cytokine can be granulocyte macrophage colony stimulating factor (GM-CSF), IFN-.alpha., IFN-.beta., IFN-.gamma., TNF-.alpha., IL-2, IL-3, IL-6, IL-11, IL-7, IL-12, IL-15, IL-21, interferon regulatory factor 7 (IRF7), and combinations thereof. In certain embodiments, the immunoresponsive cells including a BCMA-specific CAR of the presently disclosed subject matter increase the production of GM-CSF, IFN-.gamma., and/or TNF-.alpha..
[0289] Suitable human subjects for therapy typically comprise two treatment groups that can be distinguished by clinical criteria. Subjects with "advanced disease" or "high tumor burden" are those who bear a clinically measurable tumor (e.g., multiple myeloma). A clinically measurable tumor is one that can be detected on the basis of tumor mass (e.g., by palpation, CAT scan, sonogram, mammogram or X-ray; positive biochemical or histopathologic markers on their own are insufficient to identify this population). A pharmaceutical composition embodied in the presently disclosed subject matter is administered to these subjects to elicit an anti-tumor response, with the objective of palliating their condition. Ideally, reduction in tumor mass occurs as a result, but any clinical improvement constitutes a benefit. Clinical improvement comprises decreased risk or rate of progression or reduction in pathological consequences of the tumor (e.g., multiple myeloma).
[0290] A second group of suitable subjects is known in the art as the "adjuvant group." These are individuals who have had a history of neoplasia (e.g., multiple myeloma), but have been responsive to another mode of therapy. The prior therapy can have included, but is not restricted to, surgical resection, radiotherapy, and traditional chemotherapy. As a result, these individuals have no clinically measurable tumor. However, they are suspected of being at risk for progression of the disease, either near the original tumor site, or by metastases. This group can be further subdivided into high-risk and low-risk individuals. The subdivision is made on the basis of features observed before or after the initial treatment. These features are known in the clinical arts, and are suitably defined for each different neoplasia. Features typical of high-risk subgroups are those in which the tumor (e.g., multiple myeloma) has invaded neighboring tissues, or who show involvement of lymph nodes. Another group has a genetic predisposition to neoplasia (e.g., multiple myeloma) but has not yet evidenced clinical signs of neoplasia (e.g., multiple myeloma). For instance, women testing positive for a genetic mutation associated with breast cancer, but still of childbearing age, can wish to receive one or more of the antigen-binding fragments described herein in treatment prophylactically to prevent the occurrence of neoplasia until it is suitable to perform preventive surgery.
[0291] The subjects can have an advanced form of disease (e.g., multiple myeloma), in which case the treatment objective can include mitigation or reversal of disease progression, and/or amelioration of side effects. The subjects can have a history of the condition, for which they have already been treated, in which case the therapeutic objective will typically include a decrease or delay in the risk of recurrence.
[0292] Further modification can be introduced to the BCMA-targeted CAR-expressing immunoresponsive cells (e.g., T cells) to avert or minimize the risks of immunological complications (known as "malignant T-cell transformation"), e.g., graft versus-host disease (GvHD), or when healthy tissues express the same target antigens as the tumor cells, leading to outcomes similar to GvHD. A potential solution to this problem is engineering a suicide gene into the CAR-expressing T cells. Suitable suicide genes include, but are not limited to, Herpes simplex virus thymidine kinase (hsv-tk), inducible Caspase 9 Suicide gene (iCasp-9), and a truncated human epidermal growth factor receptor (EGFRt) polypeptide. In certain embodiments, the suicide gene is an EGFRt polypeptide. The EGFRt polypeptide can enable T cell elimination by administering anti-EGFR monoclonal antibody (e.g., cetuximab). EGFRt can be covalently joined to the 3' terminus of the intracellular domain of the BCMA-targeted CAR. The suicide gene can be included within the vector comprising nucleic acids encoding the presently disclosed BCMA-targeted CARs. In this way, administration of a prodrug designed to activate the suicide gene (e.g., a prodrug (e.g., AP1903 that can activates iCasp-9) during malignant T-cell transformation (e.g., GVHD) triggers apoptosis in the suicide gene-activated CAR-expressing T cells.
IX. Kits
[0293] The presently disclosed subject matter provides kits for the treatment or prevention of a neoplasia (e.g., multiple myeloma). In certain embodiments, the kit comprises a therapeutic or prophylactic composition containing an effective amount of an immunoresponsive cell comprising a BCMA-targeted CAR in unit dosage form. In particular embodiments, the cells further expresses at least one co-stimulatory ligand. In certain embodiments, the kit comprises a sterile container which contains a therapeutic or prophylactic vaccine; such containers can be boxes, ampules, bottles, vials, tubes, bags, pouches, blister-packs, or other suitable container forms known in the art. Such containers can be made of plastic, glass, laminated paper, metal foil, or other materials suitable for holding medicaments.
[0294] If desired, the immunoresponsive cell is provided together with instructions for administering the cell to a subject having or at risk of developing a neoplasia (e.g., multiple myeloma). The instructions will generally include information about the use of the composition for the treatment or prevention of a neoplasia (e.g., multiple myeloma). In other embodiments, the instructions include at least one of the following: description of the therapeutic agent; dosage schedule and administration for treatment or prevention of a neoplasia (e.g., multiple myeloma) or symptoms thereof; precautions; warnings; indications; counter-indications; overdosage information; adverse reactions; animal pharmacology; clinical studies; and/or references. The instructions may be printed directly on the container (when present), or as a label applied to the container, or as a separate sheet, pamphlet, card, or folder supplied in or with the container.
EXAMPLES
[0295] The practice of the present invention employs, unless otherwise indicated, conventional techniques of molecular biology (including recombinant techniques), microbiology, cell biology, biochemistry and immunology, which are well within the purview of the skilled artisan. Such techniques are explained fully in the literature, such as, "Molecular Cloning: A Laboratory Manual", second edition (Sambrook, 1989); "Oligonucleotide Synthesis" (Gait, 1984); "Animal Cell Culture" (Freshney, 1987); "Methods in Enzymology" "Handbook of Experimental Immunology" (Weir, 1996); "Gene Transfer Vectors for Mammalian Cells" (Miller and Calos, 1987); "Current Protocols in Molecular Biology" (Ausubel, 1987); "PCR: The Polymerase Chain Reaction", (Mullis, 1994); "Current Protocols in Immunology" (Coligan, 1991). These techniques are applicable to the production of the polynucleotides and polypeptides of the invention, and, as such, may be considered in making and practicing the invention. Particularly useful techniques for particular embodiments will be discussed in the sections that follow.
[0296] The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the assay, screening, and therapeutic methods of the invention, and are not intended to limit the scope of what the inventors regard as their invention.
Example 1--BCMA Expression in Various Tissues
[0297] The Expression of human BCMA was evaluated in various malignant and normal tissues by investigating gene expression profiles in databases such as the cancer cell line encyclopedia and BioGPS. As shown in FIGS. 2A-2D, human BCMA was highly expressed in lymphoma and multiple myeloma, but not in other malignant tissues. Normal expression appeared limited to B-cells and plasma cells. Potential BCMA targeted CAR T cell eradication of these normal cell types may not have significant adverse effects based on inventors' patient experience with CD19 targeted CAR T cells. Any lack of physiologic antibody production can be addressed with intravenous immunoglobulin treatment.
Example 2--Construct of BCMA-Specific 28z CARs
[0298] Multiple unique fully human scFv's to BCMA were generated, and CARs based on these scFv's were generated. Multiple scFv's were identified by screening a fully human scFv phage library (>6.times.10.sup.10 scFv's) with BCMA-Fc fusion protein and then 3T3 cells expressing human BCMA. After sequencing, 57 unique and BCMA-Fc positive clones were found out of 79 sequenced positive clones; the unique clone rate was 72%. FACS analysis of phage antibody clones against BCMA-3T3 and parental 3T3 cell lines resulted in confirming 25 unique positive clones.
[0299] ET140-153 scFv (or "ET140-3 scFv"), ET140-174 scFv (or "ET140-24 scFv"), ET140-187 scFv (or "ET140-37 scFv"), ET140-190 scFv (or "ET140-40 scFv"), and ET140-204 scFv (or "ET140-54 scFv") were used to generate BCMA-targeted 28z CARs 3, 24, 37, 40, and 54, respectively. These BCMA-targeted 28z CARs have similar structure, e.g., each has a transmembrane domain comprising a CD28 polypeptide, and an intracellular domain comprising a CD3.xi. polypeptide and a co-stimulatory signaling region that comprises a CD28 polypeptide, as shown in FIG. 1. Each of these BCMA-targeted CARs were cloned into a retroviral vector. These viral vectors were then transduced into HEK 293galv9 viral packaging cells in order to generate a stable packaging line for generation of CAR' T cells.
[0300] Human T cells (unselected (CD4 and CD8) human T cells from a healthy donor) were transduced with retrovirus in order to express each of these BCMA-targeted CARs such that the T cells expressed these BCMA-targeted 28z CARs. The cell surface expression of BCMA-targeted CARs on human T cells was determined via binding A647 conjugated BCMA-Fc fusion protein. The cell surface expression of BCMA-targeted 28z CAR24 was assessed, and cell surface detection was valided by flow cytometry, as shown in FIG. 3.
[0301] The cross-reacting activity of seventeen human scFv's between human BCMA and mouse BCMA was assessed. As shown in FIG. 4, certain scFv's, e.g., ET140-153 scFv (or "ET140-3 scFv") and ET140-192 scFv (or "ET140-42 scFv") cross-reacted with mouse BCMA, thus, this scFv can be used for syngeneic mouse studies.
Example 3--Activity of BCMA-Specific CARs
[0302] The anti-tumor activity of the presently disclosed BCMA-specific 28z CARs was evaluated. The in vitro data showed that the BCMA-specific CARs specifically killed BCMA presenting cells, including MM cell lines. For example, as shown in FIG. 5, the T cells expressing the BCMA-specific 28z CAR24 killed 3T3 cells overexpressing BCMA (but not control 3T3s overexpressing an irrelevant antigen). As shown in FIG. 6, the T cells expressing the BCMA-specific 28z CARs 24, 40, and 54 killed human MM cell lines.
Example 4--Screening Data for Anti-BCMA Antibodies
[0303] ELISA Screening: FIG. 21 shows the representative results of protein ELISA screening against BCMA antigen using specific scFv phage antibody clones (ET140-3, ET140-24, ET140-37, ET140-40 and ET140-54). ELISA plates were coated with human BCMA ECD-Fc fusion protein, control-Fc fusion protein, or PBS alone as blank control, respectively. Individual phage clones from enriched phage display panning pools against BCMA ECD-Fc fusion protein were incubated in the coated plates. Binding of the phage clones was detected by HRP-conjugated anti-M13 antibodies and developed using TMB substrate. The absorbance was read at 450 nm.
[0304] FACS Screening: FIGS. 22A-22D show a representative figure of a FACS analysis of the BCMA-specific phage antibody clones ET140-3, ET140-24, ET140-37, ET140-40 and ET140-54. Phage clones were incubated with 3T3-BCMA cell line, then with anti-M13 mouse antibody. Finally APC-labeled anti-mouse IgG 2nd antibody was added to the reaction after washing again. The binding was measured by FACS and expressed as mean fluorescence intensity (MFI). Cells incubated with 2nd antibody alone, M13 K07 helper phage and cells only were used as negative controls.
Example 5--Construct of BCMA-Specific BBz CARs
[0305] Multiple unique fully human scFv's to BCMA were generated as described in Example 2. ET140-153 scFv (or "ET140-3 scFv"), ET140-174 scFv (or "ET140-24 scFv"), ET140-187 scFv (or "ET140-37 scFv"), ET140-190 scFv (or "ET140-40 scFv"), and ET140-204 scFv (or "ET140-54 scFv") were used to generate BCMA-targeted BBz CARs 3, 24, 37, 40, and 54, respectively. Each of these BCMA-targeted BBz CARs has a transmembrane domain comprising a CD8 polypeptide, and an intracellular domain comprising a CD3.xi. polypeptide and a co-stimulatory signaling region that comprises a 4-1BB polypeptide, as shown in FIG. 7. Each of these BCMA-targeted CARs were cloned into an SFG retroviral vector, as an example the 4-1BB containing CAR vectors are shown in FIGS. 8-12.
Example 6--Activity of BCMA-Targeted CAR T Cells
[0306] As shown in FIG. 13, BCMA-specific 28z CAR24 lysed human MM cell lines L363, NCL-H929, and U266, compared to irrelevantly targeted 4h11-28z MUC16 targeted CAR T cells. The cytotoxicity exhibited by observed BCMA-specific 28z CAR24 was specific to BCMA, as it did not lyse BCMA negative CD19 positive Raji Burkett lymphoma cell line, as shown in FIG. 13.
Example 7--Induction of Cytokine Secretion by BCMA-Targeted CAR T Cells
[0307] Co-culture of BCMA targeted 28z CAR24 T cells specifically with MM cell line induced cytokine secretion profile consistent with T cell activation. FIG. 14 shows the IL-2 secretion after 24h co-culture of CAR T cells with human tumor cell lines (E:T ratio 1:1). The lymphoplasmacytic lymphoma (CD19.sup.+) with CD19 targeted CAR T cells (positive control) and the MM cell line with the BCMA targeted 28z CAR24 T cells displayed increased cytokine production. IFNg, IL-6, TNFa, sCD40L, GM-CSF all had similar secretion profiles (data not shown).
Example 8--Anti-Tumor Activity of BCMA-Targeted CAR T Cells
[0308] BCMA targeted 28z CAR54 T cells mediated an anti-myeloma immune response. 1.times.10.sup.7 U266 human myeloma cell line cells were injected IV into NSG mice on day 0. On day 4 1.times.10.sup.6 BCMA targeted or CD19 targeted second generation CAR T cells were injected IV. Imaging on day 11 (day 7 s/p CAR T cell injection) shows that, unlike irrelevant (CD19) targeted CAR T cells; BCMA targeted 28z CAR54 T cells can mediate an anti-tumor response. See FIG. 15.
Example 9--Activity of BCMA-Targeted CAR T Cells
[0309] The ability of BCMA targeted CAR T cells to specifically lyse human myeloma cell line (HMCL) was tested. CD19 targeted CAR T cells or BCMA targeted 28z CAR24 T cells were incubated with GFP expressing tumor cell lines SET2 (Acute myeloid leukemia (AML), CD19.sup.- BCMA.sup.-); BCWM1 (Lymphoplasmacytic Lymphoma (LPL), CD19.sup.-BCMA.sup.-); L363 (Multiple Myeloma (MM), CD19.sup.-BCMA.sup.+). At time 0, the percent of GFP.sup.+ tumor line is shown in FIG. 16A. At 36h the positive control CD19 targeted CAR T cells have specifically killed the GFP.sup.+ LPL line, and similarly the BCMA targeted 28z CAR24 T cells have specifically killed the GFP.sup.+ MM line. See FIG. 16B.
Example 10--Epitope Mapping of Anti-BCMA Antibodies
[0310] BCMA peptides were ordered based on the ECD sequence with N-terminal Biotin+SGSG linker+15 amino acids with 1 amino acid space. The peptide library is shown in Table 24.
TABLE-US-00074 TABLE 24 ET140-pl SGSGLQMAGQCSQNEYFDS [SEQ ID NO: 236] ET140-p2 SGSGQMAGQCSQNEYFDSL [SEQ ID NO: 237] ET140-p3 SGSGMAGQCSQNEYFDSLL [SEQ ID NO: 238] ET140-p4 SGSGAGQCSQNEYFDSLLH [SEQ ID NO: 239] ET140-p5 SGSGGQCSQNEYFDSLLHA [SEQ ID NO: 240] ET140-p6 SGSGQCSQNEYFDSLLHAC [SEQ ID NO: 241] ET140-p7 SGSGCSQNEYFDSLLHACI [SEQ ID NO: 242] ET140-p8 SGSGSQNEYFDSLLHACIP [SEQ ID NO: 243] ET140-p9 SGSGQNEYFDSLLHACIPC [SEQ ID NO: 244] ET140-p10 SGSGNEYFDSLLHACIPCQ [SEQ ID NO: 245] ET140-pll SGSGEYFDSLLHACIPCQL [SEQ ID NO: 246] ET140-p12 SGSGYFDSLLHACIPCQLR [SEQ ID NO: 247] ET140-p13 SGSGFDSLLHACIPCQLRC [SEQ ID NO: 248] ET140-p14 SGSGDSLLHACIPCQLRCS [SEQ ID NO: 249] ET140-p15 SGSGSLLHACIPCQLRCSS [SEQ ID NO. 250] ET140-p16 SGSGLLHACIPCQLRCSSN [SEQ ID NO: 251] ET140-p17 SGSGLHACIPCQLRCSSNT [SEQ ID NO: 252] ET140-p18 SGSGHACIPCQLRCSSNTP [SEQ ID NO: 253] ET140-p19 SGSGACIPCQLRCSSNTPP [SEQ ID NO: 254] ET140-p20 SGSGCIPCQLRCSSNTPPL [SEQ ID NO: 255] ET140-p21 SGSGIPCQLRCSSNTPPLT [SEQ ID NO: 256] ET140-p22 SGSGPCQLRCSSNTPPLTC [SEQ ID NO: 257] ET140-p23 SGSGCQLRCSSNTPPLTCQ [SEQ ID NO: 258] ET140-p24 SGSGQLRCSSNTPPLTCQR [SEQ ID NO: 259] ET140-p25 SGSGLRCSSNTPPLTCQRY [SEQ ID NO: 260] ET140-p26 SGSGRCSSNTPPLTCQRYC [SEQ ID NO: 261] ET140-p27 SGSGCSSNTPPLTCQRYCN [SEQ ID NO: 262] ET140-p28 SGSGSSNTPPLTCQRYCNA [SEQ ID NO: 263] ET140-p29 SGSGSNTPPLTCQRYCNAS [SEQ ID NO: 264] ET140-p30 SGSGNTPPLTCQRYCNASV [SEQ ID NO: 265] ET140-p31 SGSGTPPLTCQRYCNASVT [SEQ ID NO: 266] ET140-p32 SGSGPPLTCQRYCNASVTN [SEQ ID NO: 267] ET140-p33 SGSGPLTCQRYCNASVTNS [SEQ ID NO: 268] ET140-p34 SGSGLTCQRYCNASVTNSV [SEQ ID NO: 269] ET140-p35 SGSGTCQRYCNASVTNSVK [SEQ ID NO: 270] ET140-p36 SGSGCQRYCNASVTNSVKG [SEQ ID NO: 271] ET140-p37 SGSGQRYCNASVTNSVKGT [SEQ ID NO: 272] ET140-p38 SGSGRYCNASVTNSVKGTN [SEQ ID NO: 273] ET140-p39 SGSGYCNASVTNSVKGTNA [SEQ ID NO: 274]
[0311] The peptides were coated onto Streptavidin plates at 2 ug/mL in PBST (PBS+0.05% Tween-20). After washing and blocking with 3% BSA. After washing, 1 ug/mL ET140-3, ET140-24, ET140-54 or ET901 mIgG1 was added to the wells, respectively. "mIgG1" used in all Examples represents that the variable region is fully human and the Fc part is mouse IgG1. Then HRP anti-mouse IgG detection antibody was added to each well. Finally, the color was developed using TMB substrate. A450 was recorded for data analysis. The results are shown in FIGS. 17-20. As shown in FIGS. 17 and 20, ET140-3 bound to peptides 7-13 (i.e., amino acids 8-22, 9-23, 10-24, 11-25, 12-26, 13-27, and 14-28) of SEQ ID NO:71. As shown in FIGS. 18 and 19, no linear epitopes found for ET140-24 or ET140-54.
[0312] Summary: 3 ET140 antibodies (mIgG1) were tested together with isotype control ET901 mIgG1 for their binding epitope towards BCMA-ECD. A peptide library consisting of 39 peptides (N-terminal biotin+SGSG linker+15 amino acids, with 1 amino acid offset) was used for epitope mapping ELISA. This allows to search for the linear binding epitope of BCMA-ECD. ET901 mIgG1 was used as background reference for each peptide. Only ET140-3 can be identified for its epitope region: a region comprising amino acids 14-22 of SEQ ID NO:71, e.g., amino acids 8-28 of SEQ ID NO: 71.
[0313] ET140-24 and ET140-54 did not show any significant binding towards peptide library. This indicated that these two antibodies may recognize conformational epitope rather than linear epitope of BCMA.
Example 11--Anti-BCMA Antibodies Recombinant Antigen by Surface Plasmon Resonance
[0314] Kinetics of interaction between ET140-153 mIgG1 (or "ET140-3 mIgG1"), ET140-174 mIgG1 (or "ET140-24 mIgG1"), ET140-204 mIgG1 (or "ET140-54 mIgG1") and BCMA recombinant antigen was measured using a BIAcore .times.100 instrument. In brief, 50 .mu.g/mL of modified streptavidin was immobilized onto a Sensor Chip CAP by flowing the Biotin CAPture Reagent through the flow cells at 2 .mu.L/min for 5 minutes. 10 ug/mL biotinylated BCMA-Fc protein was loaded onto the flow cell at a rate of 30 .mu.L/min for 3 minutes. Following the standard protocol for kinetics, a series of injection of ESK1 was performed between 0.6 and 15 .mu.g/mL, each step consisting of a 3 minute injection at 30 .mu.L/min and 3 minute disassociation. Afterwards, the surface was regenerated for 2 minutes with a solution consisting of 75% v/v of 8M guanidine-HCl and 25% v/v 1M NaOH. Kinetic constants were derived by the global fitting (1:1 Langmuir binding model) using BIAcore .times.100 Evaluation Software (Version 2.0.1). The binding affinity data are shown in Table 25.
TABLE-US-00075 TABLE 25 Protein KD ET140-24 mIgGl KD: 4.8 nM (BiaCore) ET140-54 mIgGl KD: 8.1 nM (BiaCore) ET140-3 mIgGl KD: 1.2 nM (BiaCore)
[0315] Although the foregoing presently disclosed subject matter has been described in some detail by way of illustration and example for purposes of clarity of understanding, the descriptions and examples should not be construed as limiting the scope of the presently disclosed subject matter. The disclosures of all patent and scientific literature cited herein are expressly incorporated in their entirety by reference.
REFERENCES
[0316] 1. Frigyesi, I., et al. Robust isolation of malignant plasma cells in multiple myeloma. Blood 123, 1336-1340 (2014).
[0317] 2. Tai, Y. T., et al. Novel afucosylated anti-B cell maturation antigen-monomethyl auristatin F antibody-drug conjugate (GSK2857916) induces potent and selective anti-multiple myeloma activity. Blood (2014).
[0318] 3. Carpenter, R. O., et al. B-cell maturation antigen is a promising target for adoptive T-cell therapy of multiple myeloma. Clinical cancer research: an official journal of the American Association for Cancer Research 19, 2048-2060 (2013).
[0319] 4. Brentjens, R. J., et al. Safety and persistence of adoptively transferred autologous CD19-targeted T cells in patients with relapsed or chemotherapy refractory B-cell leukemias. Blood 118, 4817-4828 (2011).
[0320] 5. Brentjens, R. J., et al. Eradication of systemic B-cell tumors by genetically targeted human T lymphocytes co-stimulated by CD80 and interleukin-15. Nature medicine 9, 279-286 (2003).
[0321] 6. Brentjens, R. J., et al. CD19-Targeted T Cells Rapidly Induce Molecular Remissions in Adults with Chemotherapy-Refractory Acute Lymphoblastic Leukemia. Science translational medicine 5, 177ra138 (2013).
[0322] 7. Davila, M. L., et al. Efficacy and Toxicity Management of 19-28z CAR T Cell Therapy in B Cell Acute Lymphoblastic Leukemia. Science translational medicine 6, 224ra225 (2014).
[0323] 8. Siegel, R., Naishadham, D. & Jemal, A. Cancer statistics, 2013. CA: a cancer journal for clinicians 63, 11-30 (2013).
[0324] 9. Boyd, K. D., et al. The clinical impact and molecular biology of del(17p) in multiple myeloma treated with conventional or thalidomide-based therapy. Genes, chromosomes & cancer 50, 765-774 (2011).
[0325] 10. Shaughnessy, J. D., Jr., et al. A validated gene expression model of high-risk multiple myeloma is defined by deregulated expression of genes mapping to chromosome 1. Blood 109, 2276-2284 (2007).
[0326] 11. Gahrton, G., et al. Allogeneic bone marrow transplantation in multiple myeloma. European Group for Bone Marrow Transplantation. The New England journal of medicine 325, 1267-1273 (1991).
[0327] 12. Pegram, H. J., et al. Tumor-targeted T cells modified to secrete IL-12 eradicate systemic tumors without need for prior conditioning. Blood 119, 4133-4141 (2012).
[0328] 13. Sabrina Bertilaccio, M. T., et al. Low-Dose Lenalidomide Improves CAR-Based Immunotherapy In CLL By Reverting T-Cell Defects In Vivo. Blood 122, 4171 (2013).
[0329] 14. Bataille, R., et al. The phenotype of normal, reactive and malignant plasma cells. Identification of "many and multiple myelomas" and of new targets for myeloma therapy. Haematologica 91, 1234-1240 (2006).
[0330] 15. Morgan, R. A., et al. Case report of a serious adverse event following the administration of T cells transduced with a chimeric antigen receptor recognizing ERBB2. Molecular therapy: the journal of the American Society of Gene Therapy 18, 843-851 (2010).
[0331] 16. Brentjens, R. J., et al. Safety and persistence of adoptively transferred autologous CD19-targeted T cells in patients with relapsed or chemotherapy refractory B-cell leukemias. Blood 118, 4817-4828 (2011).
[0332] 17. Brentjens, R. J., et al. CD19-targeted T cells rapidly induce molecular remissions in adults with chemotherapy-refractory acute lymphoblastic leukemia. Science translational medicine 5, 177ra138 (2013).
[0333] 18. Hunder, N. N., et al. Treatment of metastatic melanoma with autologous CD4+ T cells against NY-ESO-1. N. Eng J. Med. 358, 2698-2703 (2008).
[0334] 19. Rosenberg, S. A., Restifo, N. P., Yang, J. C., Morgan, R. A. & Dudley, M. E. Adoptive cell transfer: a clinical path to effective cancer immunotherapy. Nat. Rev. Cancer 8, 299-308 (2008).
[0335] 20. Dudley, M. E., et al. Adoptive cell therapy for patients with metastatic melanoma: evaluation of intensive myeloablative chemoradiation preparative regimens. J Clin Oncol 26, 5233-5239 (2008).
[0336] 21. Brentjens, R. J., et al. Genetically targeted T cells eradicate systemic acute lymphoblastic leukemia xenografts. Clin. Cancer Res. 13, 5426-5435 (2007).
[0337] 22. Gade, T. P., et al. Targeted elimination of prostate cancer by genetically directed human T lymphocytes. Cancer Res. 65, 9080-9088 (2005).
[0338] 23. Maher, J., Brentjens, R. J., Gunset, G., Riviere, I. & Sadelain, M. Human T-lymphocyte cytotoxicity and proliferation directed by a single chimeric TCRzeta/CD28 receptor. Nat. Biotechnol. 20, 70-75 (2002).
[0339] 24. Kershaw, M. H., et al. Gene-engineered T cells as a superior adjuvant therapy for metastatic cancer. J Immunol 173, 2143-2150 (2004).
[0340] 25. Sadelain, M., Brentjens, R. & Riviere, I. The promise and potential pitfalls of chimeric antigen receptors. Curr Opin Immunol (2009).
[0341] 26. Hollyman, D., et al. Manufacturing validation of biologically functional T cells targeted to CD19 antigen for autologous adoptive cell therapy. J Immunother 32, 169-180 (2009).
[0342] 27. Sadelain, M., Brentjens, R. & Riviere, I. The basic principles of chimeric antigen receptor design. Cancer discovery 3, 388-398 (2013).
[0343] 28. Riviere, I., Sadelain, M. & Brentjens, R. J. Novel strategies for cancer therapy: the potential of genetically modified T lymphocytes. Curr Hematol Rep 3, 290-297 (2004).
[0344] 29. Stephan, M. T., et al. T cell-encoded CD80 and 4-1BBL induce auto- and transco-stimulation, resulting in potent tumor rejection. Nat. Med. 13, 1440-1449 (2007).
[0345] 30. Krause, A., et al. Antigen-dependent CD28 signaling selectively enhances survival and proliferation in genetically modified activated human primary T lymphocytes. J Exp Med 188, 619-626 (1998).
[0346] 31. Gong, M. C., et al. Cancer patient T cells genetically targeted to prostate-specific membrane antigen specifically lyse prostate cancer cells and release cytokines in response to prostate-specific membrane antigen. Neoplasia. 1, 123-127 (1999).
[0347] 32. Lyddane, C., et al. Cutting Edge: CD28 controls dominant regulatory T cell activity during active immunization. J. Immunol. 176, 3306-3310 (2006).
[0348] 33. Carpenter et al., B-cell maturation antigen is a promising target for adoptive T-cell therapy of multiple myeloma. Clin Cancer Res. 2013 Apr. 15; 19(8):2048-60.
[0349] 34. WO2013/154760
[0350] 35. Maus et al., Cancer Immunol Res (2003); 1(1):26-31
[0351] From the foregoing description, it will be apparent that variations and modifications may be made to the invention described herein to adopt it to various usages and conditions. Such embodiments are also within the scope of the following claims.
[0352] All patents and publications and sequences referred to by accession or reference number mentioned in this specification are herein incorporated by reference to the same extent as if each independent patent and publication and sequence was specifically and individually indicated to be incorporated by reference.
Sequence CWU
1
1
2801123PRTArtificial SequenceDescription of Artificial Sequence Synthetic
polypeptide 1Gln Val Gln Leu Gln Gln Ser Gly Pro Gly Leu Val Lys Pro
Ser Gln1 5 10 15Thr Leu
Ser Leu Thr Cys Ala Ile Ser Gly Asp Ser Val Ser Ser Asn 20
25 30Ser Ala Ala Trp Asn Trp Ile Arg Gln
Ser Pro Ser Arg Gly Leu Glu 35 40
45Trp Leu Gly Arg Thr Tyr Tyr Arg Ser Lys Trp Tyr Asn Asp Tyr Ala 50
55 60Val Ser Val Lys Ser Arg Ile Thr Ile
Asn Pro Asp Thr Ser Lys Asn65 70 75
80Gln Phe Ser Leu Gln Leu Asn Ser Val Thr Pro Glu Asp Thr
Ala Val 85 90 95Tyr Tyr
Cys Ala Arg Gln Gly Tyr Ser Tyr Tyr Gly Tyr Ser Asp Val 100
105 110Trp Gly Gln Gly Thr Leu Val Thr Val
Ser Ser 115 1202111PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
2Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Val Ala Pro Arg Gln1
5 10 15Arg Val Thr Ile Ser Cys
Ser Gly Ser Ser Ser Asn Ile Gly His Asn 20 25
30Asp Val Ser Trp Tyr Gln His Leu Pro Gly Lys Ala Pro
Arg Leu Leu 35 40 45Ile Tyr Phe
Asp Asp Leu Leu Pro Ser Gly Val Ser Asp Arg Phe Ser 50
55 60Ala Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile
Ser Gly Leu Gln65 70 75
80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Gly Ser Leu
85 90 95Asn Ala Phe Val Phe Gly
Thr Gly Thr Lys Val Thr Val Leu Gly 100 105
1103369DNAArtificial SequenceDescription of Artificial
Sequence Synthetic polynucleotide 3caggtacagc tgcagcagtc aggtccagga
ctggtgaagc cctcgcagac cctctcactc 60acctgtgcca tctccgggga cagtgtctct
agcaacagtg ctgcttggaa ctggatcagg 120cagtccccat cgagaggcct tgagtggctg
ggaaggacat actacaggtc caagtggtat 180aatgattatg cagtatctgt gaaaagtcga
ataaccatca acccagacac atccaagaac 240cagttctccc tgcagctgaa ctctgtgact
cccgaggaca cggctgtgta ttactgtgcg 300cgccagggtt actcttacta cggttactct
gatgtttggg gtcaaggtac tctggtgacc 360gtctcctca
3694333DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
4cagtctgtgc tgactcagcc accctcggtg tctgtagccc ccaggcagag ggtcaccatc
60tcgtgttctg gaagcagctc caacatcgga cataatgatg taagctggta ccagcatctc
120ccagggaagg ctcccagact cctcatctat tttgatgacc tgctgccgtc aggggtctct
180gaccgattct ctgcctccaa gtctggcacc tcagcctccc tggccatcag tgggctccag
240tctgaggatg aggctgatta ttactgtgca gcatgggatg gcagcctgaa tgcctttgtc
300ttcggaactg ggaccaaggt caccgtccta ggt
3335123PRTArtificial SequenceDescription of Artificial Sequence Synthetic
polypeptide 5Gln Val Gln Leu Gln Gln Ser Gly Pro Gly Leu Val Lys Pro
Ser Gln1 5 10 15Thr Leu
Ser Leu Thr Cys Ala Ile Ser Gly Asp Ser Val Ser Ser Asn 20
25 30Ser Ala Ala Trp Asn Trp Ile Arg Gln
Ser Pro Ser Arg Gly Leu Glu 35 40
45Trp Leu Gly Arg Thr Tyr Tyr Arg Ser Lys Trp Tyr Asn Asp Tyr Ala 50
55 60Val Ser Val Lys Ser Arg Ile Thr Ile
Asn Pro Asp Thr Ser Lys Asn65 70 75
80Gln Phe Ser Leu Gln Leu Asn Ser Val Thr Pro Glu Asp Thr
Ala Val 85 90 95Tyr Tyr
Cys Ala Arg Tyr Gly Phe Ser Gly Ser Arg Phe Tyr Asp Thr 100
105 110Trp Gly Gln Gly Thr Leu Val Thr Val
Ser Ser 115 1206111PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
6Gln Pro Val Leu Thr Gln Pro Pro Ser Val Ser Glu Ala Pro Arg Gln1
5 10 15Arg Val Thr Ile Ser Cys
Ser Gly Ser Ser Ser Asn Ile Gly Asn Asn 20 25
30Ala Val Asn Trp Tyr Gln Gln Leu Pro Gly Lys Ala Pro
Lys Leu Leu 35 40 45Ile Tyr Phe
Asp Asp Leu Leu Ser Ser Gly Val Ser Asp Arg Phe Ser 50
55 60Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile
Ser Gly Leu Gln65 70 75
80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu
85 90 95Asn Gly Tyr Val Phe Gly
Thr Gly Thr Lys Val Thr Val Leu Gly 100 105
1107369DNAArtificial SequenceDescription of Artificial
Sequence Synthetic polynucleotide 7caggtacagc tgcagcagtc aggtccagga
ctggtgaagc cctcgcagac cctctcactc 60acctgtgcca tctccgggga cagtgtctct
agcaacagtg ctgcttggaa ctggatcagg 120cagtccccat cgagaggcct tgagtggctg
ggaaggacat actacaggtc caagtggtat 180aatgattatg cagtatctgt gaaaagtcga
ataaccatca acccagacac atccaagaac 240cagttctccc tgcagctgaa ctctgtgact
cccgaggaca cggctgtgta ttactgtgcg 300cgctacggtt tctctggttc tcgtttctac
gatacttggg gtcaaggtac tctggtgacc 360gtctcctca
3698333DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
8cagcctgtgc tgactcagcc accctcggtg tctgaagccc ccaggcagag ggtcaccatc
60tcctgttctg gaagcagctc caacatcgga aataatgctg taaactggta ccagcagctc
120ccaggaaagg ctcccaaact cctcatctat tttgatgatc tgctgtcctc aggggtctct
180gaccgattct ctggctccaa gtctggcacc tcagcctccc tggccatcag tgggctccag
240tctgaagatg aggctgatta ttactgtgca gcatgggatg acagcctgaa tggttatgtc
300ttcggaactg ggaccaaggt caccgtccta ggt
3339122PRTArtificial SequenceDescription of Artificial Sequence Synthetic
polypeptide 9Glu Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro
Gly Ser1 5 10 15Ser Val
Lys Val Ser Cys Lys Ala Ser Gly Gly Thr Phe Ser Ser Tyr 20
25 30Ala Ile Ser Trp Val Arg Gln Ala Pro
Gly Gln Gly Leu Glu Trp Met 35 40
45Gly Arg Ile Ile Pro Ile Leu Gly Ile Ala Asn Tyr Ala Gln Lys Phe 50
55 60Gln Gly Arg Val Thr Met Thr Glu Asp
Thr Ser Thr Asp Thr Ala Tyr65 70 75
80Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr
Tyr Cys 85 90 95Ala Arg
Ser Gly Tyr Ser Lys Ser Ile Val Ser Tyr Met Asp Tyr Trp 100
105 110Gly Gln Gly Thr Leu Val Thr Val Ser
Ser 115 12010111PRTArtificial SequenceDescription
of Artificial Sequence Synthetic polypeptide 10Leu Pro Val Leu Thr
Gln Pro Pro Ser Thr Ser Gly Thr Pro Gly Gln1 5
10 15Arg Val Thr Val Ser Cys Ser Gly Ser Ser Ser
Asn Ile Gly Ser Asn 20 25
30Val Val Phe Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Leu Val
35 40 45Ile Tyr Arg Asn Asn Gln Arg Pro
Ser Gly Val Pro Asp Arg Phe Ser 50 55
60Val Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile Ser Gly Leu Arg65
70 75 80Ser Glu Asp Glu Ala
Asp Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu 85
90 95Ser Gly Tyr Val Phe Gly Thr Gly Thr Lys Val
Thr Val Leu Gly 100 105
11011366DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 11gaggtccagc tggtgcagtc tggagctgag
gtgaagaagc ctgggtcctc ggtgaaggtc 60tcctgcaagg cttctggagg caccttcagc
agctatgcta tcagctgggt gcgacaggcc 120cctggacaag ggcttgagtg gatgggaagg
atcatcccta tccttggtat agcaaactac 180gcacagaagt tccagggcag agtcaccatg
accgaggaca catctacaga cacagcctac 240atggagctga gcagcctgag atctgaggac
acggccgtgt attactgtgc gcgctctggt 300tactctaaat ctatcgtttc ttacatggat
tactggggtc aaggtactct ggtgaccgtc 360tcctca
36612333DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
12ctgcctgtgc tgactcagcc cccctccacg tctgggaccc ccgggcagag ggtcaccgtc
60tcttgttctg gaagcagctc caacatcgga agtaatgttg tattctggta ccagcagctc
120ccaggcacgg cccccaaact tgtcatctat aggaataatc aacggccctc aggggtccct
180gaccgattct ctgtctccaa gtctggcacc tcagcctccc tggccatcag tgggctccgg
240tccgaggacg aggctgatta ttattgtgca gcttgggatg acagcctgag tggttatgtc
300ttcggaactg ggaccaaggt caccgtccta ggt
33313118PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 13Glu Val Gln Leu Val Gln Ser Gly Ala Glu Val
Lys Lys Pro Gly Ala1 5 10
15Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr
20 25 30Tyr Met His Trp Val Arg Gln
Ala Pro Gly Gln Gly Leu Glu Trp Met 35 40
45Gly Ile Ile Asn Pro Ser Gly Gly Ser Thr Ser Tyr Ala Gln Lys
Phe 50 55 60Gln Gly Arg Val Thr Met
Thr Arg Asp Thr Ser Thr Ser Thr Val Tyr65 70
75 80Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr
Ala Val Tyr Tyr Cys 85 90
95Ala Arg Ser Gln Trp Gly Gly Val Leu Asp Tyr Trp Gly Gln Gly Thr
100 105 110Leu Val Thr Val Ser Ser
11514112PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 14Gln Ser Val Val Thr Gln Pro Pro Ser Val Ser
Gly Ala Pro Gly Gln1 5 10
15Arg Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Ala Arg
20 25 30Tyr Asp Val Gln Trp Tyr Gln
Gln Leu Pro Gly Thr Ala Pro Lys Leu 35 40
45Leu Ile Phe Gly Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg
Phe 50 55 60Ser Gly Ser Lys Ser Gly
Thr Ser Ala Ser Leu Ala Ile Thr Gly Leu65 70
75 80Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln
Ser Tyr Asp Ser Ser 85 90
95Leu Ser Ala Ser Val Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly
100 105 11015354DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
15gaggtccagc tggtacagtc tggggctgag gtgaagaagc ctggggcctc agtgaaggtt
60tcctgcaagg catctggata caccttcacc agctactata tgcactgggt gcgacaggcc
120cctggacaag ggcttgagtg gatgggaata atcaacccta gtggtggtag cacaagctac
180gcacagaagt tccagggcag agtcaccatg accagggaca cgtccacgag cacagtctac
240atggagctga gcagcctgag atctgaggac acggccgtgt attactgtgc gcgctctcag
300tggggtggtg ttctggatta ctggggtcaa ggtactctgg tgaccgtctc ctca
35416336DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 16cagtctgtcg tgacgcagcc gccctcagtg
tctggggccc cagggcagag ggtcaccatc 60tcctgcagtg ggagcagctc caacatcggg
gcacgttatg atgttcagtg gtaccagcag 120cttccaggaa cagcccccaa actcctcatc
tttggtaaca acaatcggcc ctcaggggtc 180cctgaccgat tctctggctc caagtctggc
acgtcagcct ccctggccat cactgggctc 240caggctgagg atgaggctga ttattactgc
cagtcctatg acagcagcct gagtgcttcg 300gtgttcggcg gagggaccaa gctgaccgtc
ctaggt 33617123PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
17Gln Val Gln Leu Val Glu Ser Gly Ala Glu Val Lys Lys Pro Gly Ser1
5 10 15Ser Val Lys Val Ser Cys
Lys Ala Ser Gly Gly Thr Phe Ser Ser Tyr 20 25
30Ala Ile Ser Trp Val Arg Gln Ala Pro Gly Gln Gly Leu
Glu Trp Met 35 40 45Gly Arg Ile
Ile Pro Ile Leu Gly Ile Ala Asn Tyr Ala Gln Lys Phe 50
55 60Gln Gly Arg Val Thr Ile Thr Ala Asp Glu Ser Thr
Ser Thr Ala Tyr65 70 75
80Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95Ala Arg Thr Gly Tyr Glu
Ser Trp Gly Ser Tyr Glu Val Ile Asp Arg 100
105 110Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser
115 12018111PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 18Gln Ala Val Leu Thr Gln
Pro Pro Ser Ala Ser Gly Thr Pro Gly Gln1 5
10 15Arg Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn
Ile Gly Ser Asn 20 25 30Thr
Val Asn Trp Tyr Arg Gln Leu Pro Gly Thr Ala Pro Lys Leu Leu 35
40 45Ile Tyr Ser Asn Asn Gln Arg Pro Ser
Gly Val Pro Asp Arg Phe Ser 50 55
60Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile Ser Gly Leu Gln65
70 75 80Ser Glu Asp Glu Ala
Asp Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu 85
90 95Asn Gly Val Val Phe Gly Gly Gly Thr Lys Leu
Thr Val Leu Gly 100 105
11019369DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 19caggtgcagc tggtggagtc tggggctgag
gtgaagaagc ctgggtcctc ggtgaaggtc 60tcctgcaagg cttctggagg caccttcagc
agctatgcta tcagctgggt gcgacaggcc 120cctggacaag ggcttgagtg gatgggaagg
atcatcccta tccttggtat agcaaactac 180gcacagaagt tccagggcag agtcacgatt
accgcggacg aatccacgag cacagcctac 240atggagctga gcagcctgag atctgaggac
acggccgtat attactgtgc gcgcactggt 300tacgaatctt ggggttctta cgaagttatc
gatcgttggg gtcaaggtac tctggtgacc 360gtctcctca
36920333DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
20caggctgtgc tgactcagcc accctcagcg tctgggaccc ccgggcagag ggtcaccatc
60tcttgttctg gaagcagctc caacatcgga agtaatactg taaactggta ccggcagctc
120ccaggaacgg cccccaaact cctcatctat agtaataatc agcggccctc aggggtccct
180gaccgattct ctggctccaa gtctggcacc tcagcctccc tggccatcag tgggctccag
240tctgaggatg aggctgatta ttactgtgca gcatgggatg acagcctgaa tggtgtggta
300ttcggcggag ggaccaagct gaccgtccta ggt
33321121PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 21Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val
Lys Lys Pro Gly Ser1 5 10
15Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly Thr Phe Ser Ser Tyr
20 25 30Ala Ile Ser Trp Val Arg Gln
Ala Pro Gly Gln Gly Leu Glu Trp Met 35 40
45Gly Arg Ile Ile Pro Ile Leu Gly Ile Ala Asn Tyr Ala Gln Lys
Phe 50 55 60Gln Gly Arg Val Thr Ile
Thr Ala Asp Lys Ser Thr Ser Thr Ala Tyr65 70
75 80Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr
Ala Val Tyr Tyr Cys 85 90
95Ala Arg Gly Gly Tyr Tyr Ser His Asp Met Trp Ser Glu Asp Trp Gly
100 105 110Gln Gly Thr Leu Val Thr
Val Ser Ser 115 12022112PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
22Leu Pro Val Leu Thr Gln Pro Pro Ser Ala Ser Gly Thr Pro Gly Gln1
5 10 15Arg Val Thr Ile Ser Cys
Ser Gly Arg Ser Ser Asn Ile Gly Ser Asn 20 25
30Ser Val Asn Trp Tyr Arg Gln Leu Pro Gly Ala Ala Pro
Lys Leu Leu 35 40 45Ile Tyr Ser
Asn Asn Gln Arg Pro Pro Gly Val Pro Val Arg Phe Ser 50
55 60Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile
Ser Gly Leu Gln65 70 75
80Ser Glu Asp Glu Ala Thr Tyr Tyr Cys Ala Thr Trp Asp Asp Asn Leu
85 90 95Asn Val His Tyr Val Phe
Gly Thr Gly Thr Lys Val Thr Val Leu Gly 100
105 11023363DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 23caggtgcagc tggtgcagtc
tggggctgag gtgaagaagc ctgggtcctc ggtgaaggtc 60tcctgcaagg cttctggagg
caccttcagc agctatgcta tcagctgggt gcgacaggcc 120cctggacaag ggcttgagtg
gatgggaagg atcatcccta tccttggtat agcaaactac 180gcacagaagt tccagggcag
agtcacgatt accgcggaca aatccacgag cacagcctac 240atggagctga gcagcctgag
atctgaggac acggccgtgt attactgtgc gcgcggtggt 300tactactctc atgacatgtg
gtctgaagat tggggtcaag gtactctggt gaccgtctcc 360tca
36324336DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
24ctgcctgtgc tgactcagcc accctcagcg tctgggaccc ccgggcagag ggtcaccatc
60tcttgttctg gacgcagttc caacatcggg agtaattctg ttaactggta tcgacaactc
120ccaggagcgg cccccaaact cctcatctat agtaataatc agcggccccc aggggtccct
180gtgcgattct ctggctccaa gtctggcacc tcagcctccc tggccatcag tgggctccag
240tctgaagatg aggccactta ttactgtgca acatgggatg acaatctgaa tgttcactat
300gtcttcggaa ctgggaccaa ggtcaccgtc ctaggt
33625125PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 25Gln Val Gln Leu Gln Glu Ser Gly Pro Gly Leu
Val Lys Pro Ser Gly1 5 10
15Thr Leu Ser Leu Thr Cys Gly Val Ser Gly Gly Ser Ile Ser Asn Ser
20 25 30Asn Trp Trp Ser Trp Val Arg
Gln Pro Pro Gly Lys Gly Leu Glu Trp 35 40
45Ile Gly Glu Ile Tyr His Ser Gly Ser Thr Lys Tyr Asn Pro Ser
Leu 50 55 60Arg Ser Arg Val Thr Ile
Ser Val Asp Lys Ser Lys Asn Gln Phe Ser65 70
75 80Leu Lys Leu Ser Ser Val Thr Ala Ala Asp Thr
Ala Val Tyr Tyr Cys 85 90
95Ala Arg Arg Asp Asn Trp Lys Thr Pro Thr Thr Lys Ile Asp Gly Phe
100 105 110Asp Ile Trp Gly Gln Gly
Thr Met Val Thr Val Ser Ser 115 120
12526117PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 26Gln Pro Val Leu Thr Gln Pro Pro Ser Ala Ser
Ala Ser Leu Gly Ala1 5 10
15Ser Val Thr Leu Thr Cys Thr Leu Ser Ser Gly Tyr Ser Asn Tyr Lys
20 25 30Val Asp Trp Tyr Gln Gln Arg
Pro Gly Lys Gly Pro Arg Phe Val Met 35 40
45Arg Val Gly Thr Gly Gly Ile Val Gly Ser Lys Gly Asp Gly Ile
Pro 50 55 60Asp Arg Phe Ser Val Leu
Gly Ser Gly Leu Asn Arg Tyr Leu Thr Ile65 70
75 80Lys Asn Ile Gln Glu Glu Asp Glu Gly Asp Tyr
His Cys Gly Ala Asp 85 90
95His Gly Ser Gly Ser Asn Phe Val Tyr Val Phe Gly Thr Gly Thr Lys
100 105 110Val Thr Val Leu Gly
11527375DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 27caggtgcagc tgcaggagtc gggcccagga
ctggtgaagc cttcggggac cctgtccctc 60acctgcggtg tctctggtgg ctccatcagc
aatagtaact ggtggagttg ggtccgccag 120ccccccggga aggggctgga gtggattggg
gaaatctatc atagtgggag caccaagtac 180aacccgtccc tcaggagtcg agtcaccata
tcagtagaca agtccaagaa ccagttctcc 240ctaaaattga gctctgtgac cgccgcggac
acggccgtat attactgtgc gagacgagat 300aactggaaga cccccactac caaaattgat
ggttttgata tctggggcca agggacaatg 360gtcaccgtct cttca
37528351DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
28cagcctgtgc tgactcagcc accttctgca tcagcctccc tgggagcctc ggtcacactc
60acctgcaccc tgagcagcgg ctacagtaat tataaagtgg actggtacca gcagagacca
120gggaagggcc cccggtttgt gatgcgagtg ggcactggtg ggattgtggg atccaagggg
180gatggcatcc ctgatcgctt ctcagtcttg ggctcaggcc tgaatcggta cctgaccatc
240aagaacatcc aggaagaaga tgagggtgac tatcactgtg gggcagacca tggcagtggg
300agcaacttcg tgtatgtctt cggaactggg accaaggtca ccgtcctagg t
35129118PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 29Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val
Lys Lys Pro Gly Ala1 5 10
15Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Gly Tyr
20 25 30Tyr Met His Trp Val Arg Gln
Ala Pro Gly Gln Gly Leu Glu Trp Met 35 40
45Gly Trp Ile Asn Pro Asn Ser Gly Gly Thr Asn Tyr Ala Gln Lys
Phe 50 55 60Gln Gly Arg Val Thr Met
Thr Arg Asp Thr Ser Ile Ser Thr Ala Tyr65 70
75 80Met Glu Leu Ser Arg Leu Arg Ser Asp Asp Thr
Ala Val Tyr Tyr Cys 85 90
95Ala Arg Ser Gln Trp Gly Ser Ser Trp Asp Tyr Trp Gly Gln Gly Thr
100 105 110Leu Val Thr Val Ser Ser
11530108PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 30Asp Ile Gln Leu Thr Gln Ser Pro Ser Ser Leu
Ser Ala Ser Val Gly1 5 10
15Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Ser Ile Ser Ser Tyr
20 25 30Leu Asn Trp Tyr Gln Gln Lys
Pro Gly Lys Ala Pro Lys Leu Leu Ile 35 40
45Tyr Ala Ala Ser Ser Leu Gln Ser Gly Val Pro Ser Arg Phe Ser
Gly 50 55 60Ser Gly Ser Gly Thr Asp
Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro65 70
75 80Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln Ser
Tyr Ser Thr Pro Pro 85 90
95Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg 100
10531354DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 31caggtccagc tggtacagtc tggggctgag
gtgaagaagc ctggggcctc agtgaaggtc 60tcctgcaagg cttctggata caccttcacc
ggctactata tgcactgggt gcgacaggcc 120cctggacaag ggcttgagtg gatgggatgg
atcaacccta acagtggtgg cacaaactat 180gcacagaagt ttcagggcag ggtcaccatg
accagggaca cgtccatcag cacagcctac 240atggagctga gcaggctgag atctgacgac
acggccgtgt attactgtgc gcgctctcag 300tggggttctt cttgggatta ctggggtcaa
ggtactctgg tgaccgtctc ctca 35432324DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
32gacatccagt tgacccagtc tccatcctcc ctgtctgcat ctgtaggaga cagagtcacc
60atcacttgcc gggcaagtca gagcattagc agctatttaa attggtatca gcagaaacca
120gggaaagccc ctaagctcct gatctatgct gcatccagtt tgcaaagtgg ggtcccatca
180aggttcagtg gcagtggatc tgggacagat ttcactctca ccatcagcag tctgcaacct
240gaagattttg caacttacta ctgtcaacag agttacagta cccctccgac gttcggccaa
300gggaccaagg tggagatcaa acgt
32433119PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 33Glu Val Gln Leu Val Glu Ser Gly Ala Glu Val
Lys Lys Pro Gly Ala1 5 10
15Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Gly Tyr
20 25 30Tyr Met His Trp Val Arg Gln
Ala Pro Gly Gln Gly Leu Glu Trp Met 35 40
45Gly Trp Ile Asn Pro Asn Ser Gly Gly Thr Asn Tyr Ala Gln Lys
Phe 50 55 60Gln Gly Arg Val Thr Met
Thr Arg Asp Thr Ser Ile Ser Thr Ala Tyr65 70
75 80Met Glu Leu Ser Arg Leu Arg Ser Asp Asp Thr
Ala Val Tyr Tyr Cys 85 90
95Ala Arg Ser Ser Tyr His Leu Tyr Gly Tyr Asp Ser Trp Gly Gln Gly
100 105 110Thr Leu Val Thr Val Ser
Ser 11534117PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 34Gln Pro Val Leu Thr Gln Pro Pro Ser
Ala Ser Ala Ser Leu Gly Ala1 5 10
15Ser Val Thr Leu Thr Cys Thr Leu Ser Asn Asp Tyr Thr Asn Tyr
Lys 20 25 30Val Asp Trp Tyr
Gln Gln Arg Pro Gly Lys Gly Pro Arg Phe Val Met 35
40 45Arg Val Gly Pro Gly Gly Ile Val Gly Ser Lys Gly
Asp Gly Ile Pro 50 55 60Asp Arg Phe
Ser Val Leu Gly Ser Gly Leu Asn Arg Tyr Leu Thr Ile65 70
75 80Lys Asn Ile Gln Glu Glu Asp Glu
Ser Asp Tyr His Cys Gly Ala Asp 85 90
95His Gly Thr Gly Ser Asn Phe Val Tyr Val Phe Gly Gly Gly
Thr Lys 100 105 110Leu Thr Val
Leu Gly 11535357DNAArtificial SequenceDescription of Artificial
Sequence Synthetic polynucleotide 35gaggtgcagc tggtggagtc cggggctgag
gtgaagaagc ctggggcctc agtgaaggtc 60tcctgcaagg cttctggata caccttcacc
ggctactata tgcactgggt gcgacaggcc 120cctggacaag ggcttgagtg gatgggatgg
atcaacccta acagtggtgg cacaaactat 180gcacagaagt ttcagggcag ggtcaccatg
accagggaca cgtccatcag cacagcctac 240atggagctga gcaggctgag atctgacgac
acggccgtat attactgtgc gcgctcttct 300taccatctgt acggttacga ttcttggggt
caaggtactc tggtgaccgt ctcctca 35736351DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
36cagcctgtgc tgactcagcc accttctgca tcagcctccc tgggagcctc ggtcactctc
60acctgcaccc tgagcaacga ctacactaat tataaagtgg actggtacca gcagagacca
120gggaagggcc cccggtttgt gatgcgagtg ggccctggtg ggattgtggg atccaagggg
180gatggcatcc ctgatcgctt ctcagtcttg ggctcaggcc tgaatcgata cctgaccatc
240aagaacatcc aggaggagga tgagagtgac taccactgtg gggcggacca tggcaccggg
300agcaacttcg tgtacgtgtt cggcggaggg accaagctga ccgtcctagg t
35137120PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 37Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val
Lys Lys Pro Gly Ser1 5 10
15Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly Thr Phe Ser Ser Tyr
20 25 30Ala Ile Ser Trp Val Arg Gln
Ala Pro Gly Gln Gly Leu Glu Trp Met 35 40
45Gly Gly Ile Ile Pro Ile Phe Ser Thr Ala Asn Tyr Ala Gln Lys
Phe 50 55 60Gln Gly Arg Val Thr Met
Thr Thr Asp Thr Ser Thr Ser Thr Ala Tyr65 70
75 80Met Glu Leu Arg Ser Leu Arg Ser Asp Asp Thr
Ala Val Tyr Tyr Cys 85 90
95Ala Arg Gln Pro Trp Thr Trp Tyr Ser Pro Tyr Asp Gln Trp Gly Gln
100 105 110Gly Thr Leu Val Thr Val
Ser Ser 115 12038117PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
38Gln Pro Val Leu Thr Gln Pro Pro Ser Ala Ser Ala Ser Leu Gly Ala1
5 10 15Ser Val Thr Leu Thr Cys
Thr Leu Ser Ser Gly Tyr Ser Asn Tyr Lys 20 25
30Val Asp Trp Tyr Gln Gln Arg Pro Gly Lys Gly Pro Arg
Phe Leu Met 35 40 45Arg Val Asp
Thr Gly Gly Ile Val Gly Ser Lys Gly Asp Gly Ile Pro 50
55 60Asp Arg Phe Ser Val Ser Gly Ser Gly Leu Asn Arg
Tyr Leu Thr Ile65 70 75
80Lys Asn Ile Gln Glu Glu Asp Glu Ser Asp Tyr His Cys Gly Ala Asp
85 90 95His Gly Ser Gly Ser Asn
Phe Val Trp Val Phe Gly Gly Gly Thr Lys 100
105 110Leu Thr Val Leu Gly 11539360DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
39caggtgcagc tggtgcagtc tggggctgag gtgaagaagc ctgggtcctc ggtgaaggtc
60tcctgcaagg cttctggagg caccttcagc agctatgcta tcagctgggt gcgacaggcc
120cctggacaag ggcttgagtg gatgggaggg atcatcccta tctttagtac agcaaactac
180gcacagaagt tccagggcag agtcaccatg accacagaca catccacgag cacagcctac
240atggagctga ggagcctgag atctgacgac acggccgtgt attactgtgc gcgccagccg
300tggacttggt actctccgta cgatcagtgg ggtcaaggta ctctggtgac cgtctcctca
36040351DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 40cagcctgtgc tgactcagcc accttctgca
tcagcctccc tgggagcctc ggtcacactc 60acctgcaccc tgagcagcgg ctacagtaat
tataaagtgg actggtatca acagagacca 120gggaagggcc cccggtttct gatgcgagta
gacaccggtg ggattgtggg atccaagggg 180gatggcatcc ctgatcgctt ctcagtctcg
ggctcaggtc tgaatcggta cctgaccatc 240aagaacattc aggaagagga tgagagtgac
taccactgtg gggcagacca tggcagtggg 300agcaacttcg tgtgggtgtt cggcggaggg
accaagctga ccgtcctagg t 35141117PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
41Glu Val Gln Leu Val Glu Thr Gly Gly Gly Leu Val Gln Pro Gly Gly1
5 10 15Ser Leu Arg Leu Ser Cys
Ala Ala Ser Gly Phe Thr Phe Ser Thr Tyr 20 25
30Ala Met Thr Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
Glu Trp Val 35 40 45Ser Ala Ile
Thr Pro Gly Gly Asp Arg Thr Tyr Tyr Ala Asp Ser Val 50
55 60Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Arg
Asn Thr Leu Tyr65 70 75
80Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95Ala Arg Tyr Tyr Gly Tyr
Met Ile Asp Met Trp Gly Gln Gly Thr Leu 100
105 110Val Thr Val Ser Ser 11542113PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
42Asp Val Val Met Thr Gln Ser Pro Leu Ser Leu Pro Val Thr Pro Gly1
5 10 15Glu Pro Ala Ser Ile Ser
Cys Arg Ser Ser Gln Ser Leu Leu His Ser 20 25
30Asn Gly Tyr Asn Tyr Leu Asp Trp Tyr Leu Gln Lys Pro
Gly Gln Ser 35 40 45Pro Gln Leu
Leu Ile Tyr Leu Gly Ser Asn Arg Ala Ser Gly Val Pro 50
55 60Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe
Thr Leu Lys Ile65 70 75
80Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr Tyr Cys Met Gln Ala
85 90 95Leu Gln Thr Pro Leu Thr
Phe Gly Gly Gly Thr Lys Val Glu Ile Lys 100
105 110Arg43351DNAArtificial SequenceDescription of
Artificial Sequence Synthetic polynucleotide 43gaggtgcagc tggtggagac
tgggggaggc ctggtacagc ctggggggtc cctgagactc 60tcctgtgctg cctctggatt
cacctttagc acctatgcca tgacctgggt ccgccaggct 120ccagggaagg ggctggagtg
ggtctcagct attactcctg gtggtgatcg cacatactac 180gcagactccg tgaagggccg
tttcactatc tccagagaca attccaggaa cacgctgtat 240ctgcaaatga acagcctgag
agccgaggac acggccgtat attactgtgc gcgctactac 300ggttacatga tcgatatgtg
gggtcaaggt actctggtga ccgtctcctc a 35144339DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
44gatgttgtga tgactcagtc tccactctcc ctgcccgtca cccctggaga gccggcctcc
60atctcctgca ggtctagtca gagcctcctg catagtaatg gatacaacta tttggattgg
120tacctgcaga agccagggca gtctccacag ctcctgatct atttgggttc taatcgggcc
180tccggggtcc ctgacaggtt cagtggcagt ggatcaggca cagattttac actgaaaatc
240agcagagtgg aggctgagga tgttggggtt tattactgca tgcaagctct acaaactcct
300ctcactttcg gcggagggac caaggtggaa atcaaacgt
33945118PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 45Gln Met Gln Leu Val Gln Ser Gly Ala Glu Val
Lys Lys Pro Gly Ala1 5 10
15Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Gly Tyr
20 25 30Tyr Val His Trp Leu Arg Gln
Ala Pro Gly Gln Gly Leu Glu Trp Met 35 40
45Gly Trp Ile Asn Pro Asn Ser Gly Gly Thr Asn Asn Ala Gln Glu
Phe 50 55 60Gln Gly Arg Ile Thr Met
Thr Arg Asp Thr Ser Ile Asn Thr Ala Tyr65 70
75 80Met Glu Leu Ser Arg Leu Arg Ser Asp Asp Thr
Ala Val Tyr Tyr Cys 85 90
95Ala Arg Ser Gln Trp Gly Gly Thr Tyr Asp Tyr Trp Gly Gln Gly Thr
100 105 110Leu Val Thr Val Ser Ser
11546111PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 46Ser Tyr Val Leu Thr Gln Pro Pro Ser Ala Ser
Gly Thr Pro Gly Gln1 5 10
15Arg Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Ser Asn
20 25 30Thr Val Asn Trp Tyr Gln Gln
Val Pro Gly Thr Ala Pro Lys Leu Leu 35 40
45Ile Tyr Ser Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe
Ser 50 55 60Gly Ser Lys Ser Gly Ala
Ser Ala Ser Leu Ala Ile Ser Trp Leu Gln65 70
75 80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala
Trp Asp Asp Ser Leu 85 90
95Asn Gly Trp Val Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly
100 105 11047354DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
47cagatgcagc tggtgcagtc tggggctgag gtgaagaagc ctggggcctc agtgaaggtc
60tcctgcaagg cttctggata caccttcacc ggctattatg tacactggtt gcgacaggcc
120cctggacaag ggcttgagtg gatgggttgg atcaacccta acagtggcgg cacaaacaat
180gcacaggagt ttcaaggcag gatcaccatg accagggaca cgtccatcaa cacagcctac
240atggagctga gcaggctgag atctgacgac acggccgtgt attactgtgc gcgctctcag
300tggggtggta cttacgatta ctggggtcaa ggtactctgg tgaccgtctc ctca
35448333DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 48tcctatgtgc tgactcagcc accctcagcg
tctgggaccc ccgggcagag ggtcaccatc 60tcttgttctg gaagcagctc caacatcgga
agtaatactg taaactggta ccagcaggtc 120ccaggaacgg cccccaaact cctcatctat
agtaataatc agcggccctc aggggtccct 180gaccgattct ctggctccaa gtctggcgcc
tcagcctccc tggccatcag ttggctccag 240tctgaggatg aggctgatta ttactgtgca
gcatgggatg acagcctgaa tggttgggtg 300ttcggcggag ggaccaagct gaccgtccta
ggt 33349117PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
49Glu Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Glu1
5 10 15Ser Leu Lys Ile Ser Cys
Lys Gly Ser Gly Tyr Asp Phe Thr Thr Tyr 20 25
30Trp Ile Gly Trp Val Arg Gln Met Pro Gly Lys Gly Leu
Glu Trp Met 35 40 45Gly Ile Ile
Tyr Pro Gly Asp Ser Asp Thr Arg Tyr Ser Pro Ser Val 50
55 60Arg Gly Arg Val Thr Ile Ser Ala Asp Lys Ser Ile
Asn Thr Ala Tyr65 70 75
80Leu Gln Trp Ser Ser Leu Glu Ala Ser Asp Thr Ala Met Tyr Tyr Cys
85 90 95Ala Arg Met Trp Thr Phe
Ser Gln Asp Gly Trp Gly Gln Gly Thr Leu 100
105 110Val Thr Val Ser Ser 11550111PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
50Gln Ala Val Leu Thr Gln Pro Pro Ser Ala Ser Gly Thr Pro Gly Gln1
5 10 15Arg Val Thr Ile Ser Cys
Ser Gly Ser Ser Ser Asn Ile Gly Ser Tyr 20 25
30Thr Val Ser Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro
Lys Phe Leu 35 40 45Ile Tyr Ser
Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe Ser 50
55 60Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile
Ser Gly Leu Gln65 70 75
80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu
85 90 95Asn Gly Tyr Val Phe Gly
Thr Gly Thr Lys Val Thr Val Leu Gly 100 105
11051351DNAArtificial SequenceDescription of Artificial
Sequence Synthetic polynucleotide 51gaggtgcagc tggtgcagtc tggagcagag
gtgaaaaagc cgggggagtc tctgaagatc 60tcctgtaagg gttctggata tgactttacc
acctactgga tcgggtgggt gcgccagatg 120cccgggaagg gcctggagtg gatggggatc
atctatcctg gtgactctga taccagatac 180agcccgtccg tccgaggccg ggtcaccatc
tcagccgaca agtccatcaa caccgcctat 240ttgcagtgga gtagcctgga ggcctccgac
accgccatgt attactgtgc gcgcatgtgg 300actttctctc aggatggttg gggtcaaggt
actctggtga ccgtctcctc a 35152333DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
52caggctgtgc tgactcagcc accctcagcg tctgggaccc ccgggcagag ggtcaccatc
60tcttgttctg gaagcagctc caacatcgga agttatactg taagctggta ccagcaactc
120ccaggaacgg cccccaaatt cctcatctat tctaataatc agcggccctc aggggtccct
180gaccgattct ctggctccaa gtctggcacc tcagcctccc tggccatcag tgggctccag
240tctgaggatg aggctgatta ttactgtgct gcatgggatg acagcctgaa tggttatgtc
300ttcggaactg ggaccaaggt caccgtccta ggt
33353118PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 53Glu Val Gln Leu Val Gln Ser Gly Ala Glu Met
Lys Lys Pro Gly Ala1 5 10
15Ser Leu Lys Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe Ile Asp Tyr
20 25 30Tyr Val Tyr Trp Met Arg Gln
Ala Pro Gly Gln Gly Leu Glu Ser Met 35 40
45Gly Trp Ile Asn Pro Asn Ser Gly Gly Thr Asn Tyr Ala Gln Lys
Phe 50 55 60Gln Gly Arg Val Thr Met
Thr Arg Asp Thr Ser Ile Ser Thr Ala Tyr65 70
75 80Met Glu Leu Ser Arg Leu Arg Ser Asp Asp Thr
Ala Met Tyr Tyr Cys 85 90
95Ala Arg Ser Gln Arg Asp Gly Tyr Met Asp Tyr Trp Gly Gln Gly Thr
100 105 110Leu Val Thr Val Ser Ser
11554105PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 54Gln Ser Ala Leu Thr Gln Pro Ala Ser Val Ser
Ala Ser Pro Gly Gln1 5 10
15Ser Ile Ala Ile Ser Cys Thr Gly Thr Ser Ser Asp Val Gly Trp Tyr
20 25 30Gln Gln His Pro Gly Lys Ala
Pro Lys Leu Met Ile Tyr Glu Asp Ser 35 40
45Lys Arg Pro Ser Gly Val Ser Asn Arg Phe Ser Gly Ser Lys Ser
Gly 50 55 60Asn Thr Ala Ser Leu Thr
Ile Ser Gly Leu Gln Ala Glu Asp Glu Ala65 70
75 80Asp Tyr Tyr Cys Ser Ser Asn Thr Arg Ser Ser
Thr Leu Val Phe Gly 85 90
95Gly Gly Thr Lys Leu Thr Val Leu Gly 100
10555354DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 55gaagtgcagc tggtgcagtc tggggctgag
atgaagaagc ctggggcctc actgaagctc 60tcctgcaagg cttctggata caccttcatc
gactactatg tatactggat gcgacaggcc 120cctggacaag ggcttgagtc catgggatgg
atcaacccta acagtggtgg cacaaactat 180gcacagaagt ttcagggcag ggtcaccatg
accagggaca cgtccatcag cacagcctac 240atggagctga gcaggctgag atctgacgac
accgccatgt attactgtgc gcgctcccag 300cgtgacggtt acatggatta ctggggtcaa
ggtactctgg tgaccgtctc ctca 35456315DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
56caatctgccc tgactcagcc tgcctccgtg tctgcgtctc ctggacagtc gatcgccatc
60tcctgcactg gaaccagcag tgacgttggt tggtatcaac agcacccagg caaagccccc
120aaactcatga tttatgagga cagtaagcgg ccctcagggg tttctaatcg cttctctggc
180tccaagtctg gcaacacggc ctccctgacc atctctgggc tccaggctga ggacgaggct
240gattattact gcagctcaaa tacaagaagc agcactttgg tgttcggcgg agggaccaag
300ctgaccgtcc taggt
31557118PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 57Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val
Lys Lys Pro Gly Ala1 5 10
15Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Asp Tyr
20 25 30Tyr Met His Trp Val Arg Gln
Ala Pro Gly Gln Arg Leu Glu Trp Met 35 40
45Gly Trp Ile Asn Pro Asn Ser Gly Gly Thr Asn Tyr Ala Gln Lys
Phe 50 55 60Gln Asp Arg Ile Thr Val
Thr Arg Asp Thr Ser Ser Asn Thr Gly Tyr65 70
75 80Met Glu Leu Thr Arg Leu Arg Ser Asp Asp Thr
Ala Val Tyr Tyr Cys 85 90
95Ala Arg Ser Pro Tyr Ser Gly Val Leu Asp Lys Trp Gly Gln Gly Thr
100 105 110Leu Val Thr Val Ser Ser
11558112PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 58Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser
Gly Ala Pro Gly Gln1 5 10
15Arg Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Ile Gly Ala Gly
20 25 30Phe Asp Val His Trp Tyr Gln
Gln Leu Pro Gly Thr Ala Pro Lys Leu 35 40
45Leu Ile Tyr Gly Asn Ser Asn Arg Pro Ser Gly Val Pro Asp Arg
Phe 50 55 60Ser Gly Ser Lys Ser Gly
Thr Ser Ala Ser Leu Ala Ile Thr Gly Leu65 70
75 80Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln
Ser Tyr Asp Ser Ser 85 90
95Leu Ser Gly Tyr Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu Gly
100 105 11059354DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
59caggtccagc tggtacagtc tggggctgag gtgaagaagc ctggggcctc agtgaaggtc
60tcctgcaagg cttctggata caccttcacc gactactata tgcactgggt gcgacaggcc
120cctggacaac ggcttgagtg gatgggatgg atcaacccta acagtggtgg cacaaactat
180gcacagaagt ttcaggacag gatcaccgtg accagggaca cctccagcaa cacaggctac
240atggagctga ccaggctgag atctgacgac acggccgtgt attactgtgc gcgctctccg
300tactctggtg ttctggataa atggggtcaa ggtactctgg tgaccgtctc ctca
35460336DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 60cagtctgtgc tgacgcagcc gccctcagtg
tctggggccc cagggcagag ggtcaccatc 60tcctgcactg ggagcagctc caacatcggg
gcaggttttg atgtacactg gtaccagcag 120cttccaggaa cagcccccaa actcctcatc
tatggtaaca gcaatcggcc ctcaggggtc 180cctgaccgat tctctggctc caagtctggc
acctcagcct ccctggccat cactgggctc 240caggctgagg atgaggctga ttattactgc
cagtcctatg acagcagcct gagtggttat 300gtcttcggaa ctgggaccaa ggtcaccgtc
ctaggt 33661120PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
61Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Ser1
5 10 15Ser Val Lys Val Ser Cys
Lys Ala Ser Gly Gly Thr Phe Ser Ser Tyr 20 25
30Ala Ile Ser Trp Val Arg Gln Ala Pro Gly Gln Gly Leu
Glu Trp Met 35 40 45Gly Arg Ile
Ile Pro Ile Leu Gly Thr Ala Asn Tyr Ala Gln Lys Phe 50
55 60Gln Gly Arg Val Thr Ile Thr Ala Asp Glu Ser Thr
Ser Thr Ala Tyr65 70 75
80Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95Ala Arg Ser Gly Tyr Gly
Ser Tyr Arg Trp Glu Asp Ser Trp Gly Gln 100
105 110Gly Thr Leu Val Thr Val Ser Ser 115
12062112PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 62Gln Ala Val Leu Thr Gln Pro Pro Ser Ala Ser
Gly Thr Pro Gly Gln1 5 10
15Arg Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Ser Asn
20 25 30Tyr Val Phe Trp Tyr Gln Gln
Leu Pro Gly Thr Ala Pro Lys Leu Leu 35 40
45Ile Tyr Ser Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe
Ser 50 55 60Gly Ser Lys Ser Gly Thr
Ser Ala Ser Leu Ala Ile Ser Gly Leu Arg65 70
75 80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala
Trp Asp Asp Ser Leu 85 90
95Ser Ala Ser Tyr Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu Gly
100 105 11063360DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
63caggtgcagc tggtgcagtc tggggctgag gtgaagaagc ctgggtcctc ggtgaaggtc
60tcctgcaagg cttctggagg caccttcagc agctatgcta tcagctgggt gcgacaggcc
120cctggacaag ggcttgagtg gatgggaagg atcatcccta tccttggtac agcaaactac
180gcacagaagt tccagggcag agtcacgatt accgcggacg aatccacgag cacagcctac
240atggagctga gcagcctgag atctgaggac acggccgtgt attactgtgc gcgctctggt
300tacggttctt accgttggga agattcttgg ggtcaaggta ctctggtgac cgtctcctca
36064336DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 64caggctgtgc tgactcagcc accctcagcg
tctgggaccc ccgggcagag ggtcaccatc 60tcttgttctg gaagcagctc caacatcgga
agtaattacg tattctggta ccagcagctc 120ccaggaacgg cccccaaact cctcatctat
agtaataatc agcggccctc aggggtccct 180gaccgattct ctggctccaa gtctggcacc
tcagcctccc tggccatcag tgggctccgg 240tccgaggatg aggctgatta ttactgtgca
gcatgggatg acagcctgag tgcctcttat 300gttttcggaa ctgggaccaa ggtcaccgtc
ctaggt 33665116PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
65Glu Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Glu1
5 10 15Ser Leu Lys Ile Ser Cys
Lys Gly Ser Gly Tyr Ser Phe Thr Ser Tyr 20 25
30Trp Ile Gly Trp Val Arg Gln Met Pro Gly Lys Gly Leu
Glu Trp Met 35 40 45Gly Ile Ile
Tyr Pro Gly Asp Ser Asp Thr Arg Tyr Ser Pro Ser Phe 50
55 60Gln Gly His Val Thr Ile Ser Ala Asp Lys Ser Ile
Ser Thr Ala Tyr65 70 75
80Leu Gln Trp Ser Ser Leu Lys Ala Ser Asp Thr Ala Met Tyr Tyr Cys
85 90 95Ala Arg Tyr Ser Gly Ser
Phe Asp Asn Trp Gly Gln Gly Thr Leu Val 100
105 110Thr Val Ser Ser 11566111PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
66Ser Tyr Glu Leu Thr Gln Pro Pro Ser Ala Ser Gly Thr Pro Gly Gln1
5 10 15Arg Val Thr Met Ser Cys
Ser Gly Thr Ser Ser Asn Ile Gly Ser His 20 25
30Ser Val Asn Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro
Lys Leu Leu 35 40 45Ile Tyr Thr
Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe Ser 50
55 60Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile
Ser Gly Leu Gln65 70 75
80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Gly Ser Leu
85 90 95Asn Gly Leu Val Phe Gly
Gly Gly Thr Lys Leu Thr Val Leu Gly 100 105
11067348DNAArtificial SequenceDescription of Artificial
Sequence Synthetic polynucleotide 67gaggtgcagc tggtgcagtc tggagcagag
gtgaaaaagc ccggggagtc tctgaagatc 60tcctgtaagg gttctggata cagctttacc
agctactgga tcggctgggt gcgccagatg 120cccgggaaag gcctggagtg gatggggatc
atctatcctg gtgactctga taccagatac 180agcccgtcct tccaaggcca cgtcaccatc
tcagctgaca agtccatcag cactgcctac 240ctgcagtgga gcagcctgaa ggcctcggac
accgccatgt attactgtgc gcgctactct 300ggttctttcg ataactgggg tcaaggtact
ctggtgaccg tctcctca 34868333DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
68tcctatgagc tgactcagcc accctcagcg tctgggaccc ccgggcagag ggtcaccatg
60tcttgttctg gaaccagctc caacatcgga agtcactctg taaactggta ccagcagctc
120ccaggaacgg cccccaaact cctcatctat actaataatc agcggccctc aggggtccct
180gaccgattct ctggctccaa gtctggcacc tcagcctccc tggccatcag tggcctccag
240tctgaggatg aggctgatta ttactgtgca gcatgggatg gcagcctgaa tggtctggta
300ttcggcggag ggaccaagct gaccgtccta ggt
3336921PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 69Ser Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
Gly Gly1 5 10 15Ser Leu
Glu Met Ala 207063DNAArtificial SequenceDescription of
Artificial Sequence Synthetic oligonucleotide 70tctagaggtg
gtggtggtag cggcggcggc ggctctggtg gtggtggatc cctcgagatg 60gcc
6371184PRTHomo
sapiens 71Met Leu Gln Met Ala Gly Gln Cys Ser Gln Asn Glu Tyr Phe Asp
Ser1 5 10 15Leu Leu His
Ala Cys Ile Pro Cys Gln Leu Arg Cys Ser Ser Asn Thr 20
25 30Pro Pro Leu Thr Cys Gln Arg Tyr Cys Asn
Ala Ser Val Thr Asn Ser 35 40
45Val Lys Gly Thr Asn Ala Ile Leu Trp Thr Cys Leu Gly Leu Ser Leu 50
55 60Ile Ile Ser Leu Ala Val Phe Val Leu
Met Phe Leu Leu Arg Lys Ile65 70 75
80Asn Ser Glu Pro Leu Lys Asp Glu Phe Lys Asn Thr Gly Ser
Gly Leu 85 90 95Leu Gly
Met Ala Asn Ile Asp Leu Glu Lys Ser Arg Thr Gly Asp Glu 100
105 110Ile Ile Leu Pro Arg Gly Leu Glu Tyr
Thr Val Glu Glu Cys Thr Cys 115 120
125Glu Asp Cys Ile Lys Ser Lys Pro Lys Val Asp Ser Asp His Cys Phe
130 135 140Pro Leu Pro Ala Met Glu Glu
Gly Ala Thr Ile Leu Val Thr Thr Lys145 150
155 160Thr Asn Asp Tyr Cys Lys Ser Leu Pro Ala Ala Leu
Ser Ala Thr Glu 165 170
175Ile Glu Lys Ser Ile Ser Ala Arg 18072255PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
72Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Val Ala Pro Arg Gln1
5 10 15Arg Val Thr Ile Ser Cys
Ser Gly Ser Ser Ser Asn Ile Gly His Asn 20 25
30Asp Val Ser Trp Tyr Gln His Leu Pro Gly Lys Ala Pro
Arg Leu Leu 35 40 45Ile Tyr Phe
Asp Asp Leu Leu Pro Ser Gly Val Ser Asp Arg Phe Ser 50
55 60Ala Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile
Ser Gly Leu Gln65 70 75
80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Gly Ser Leu
85 90 95Asn Ala Phe Val Phe Gly
Thr Gly Thr Lys Val Thr Val Leu Gly Ser 100
105 110Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
Gly Gly Gly Ser 115 120 125Leu Glu
Met Ala Gln Val Gln Leu Gln Gln Ser Gly Pro Gly Leu Val 130
135 140Lys Pro Ser Gln Thr Leu Ser Leu Thr Cys Ala
Ile Ser Gly Asp Ser145 150 155
160Val Ser Ser Asn Ser Ala Ala Trp Asn Trp Ile Arg Gln Ser Pro Ser
165 170 175Arg Gly Leu Glu
Trp Leu Gly Arg Thr Tyr Tyr Arg Ser Lys Trp Tyr 180
185 190Asn Asp Tyr Ala Val Ser Val Lys Ser Arg Ile
Thr Ile Asn Pro Asp 195 200 205Thr
Ser Lys Asn Gln Phe Ser Leu Gln Leu Asn Ser Val Thr Pro Glu 210
215 220Asp Thr Ala Val Tyr Tyr Cys Ala Arg Gln
Gly Tyr Ser Tyr Tyr Gly225 230 235
240Tyr Ser Asp Val Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser
245 250
25573255PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 73Gln Pro Val Leu Thr Gln Pro Pro Ser Val Ser
Glu Ala Pro Arg Gln1 5 10
15Arg Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Asn Asn
20 25 30Ala Val Asn Trp Tyr Gln Gln
Leu Pro Gly Lys Ala Pro Lys Leu Leu 35 40
45Ile Tyr Phe Asp Asp Leu Leu Ser Ser Gly Val Ser Asp Arg Phe
Ser 50 55 60Gly Ser Lys Ser Gly Thr
Ser Ala Ser Leu Ala Ile Ser Gly Leu Gln65 70
75 80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala
Trp Asp Asp Ser Leu 85 90
95Asn Gly Tyr Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu Gly Ser
100 105 110Arg Gly Gly Gly Gly Ser
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 115 120
125Leu Glu Met Ala Gln Val Gln Leu Gln Gln Ser Gly Pro Gly
Leu Val 130 135 140Lys Pro Ser Gln Thr
Leu Ser Leu Thr Cys Ala Ile Ser Gly Asp Ser145 150
155 160Val Ser Ser Asn Ser Ala Ala Trp Asn Trp
Ile Arg Gln Ser Pro Ser 165 170
175Arg Gly Leu Glu Trp Leu Gly Arg Thr Tyr Tyr Arg Ser Lys Trp Tyr
180 185 190Asn Asp Tyr Ala Val
Ser Val Lys Ser Arg Ile Thr Ile Asn Pro Asp 195
200 205Thr Ser Lys Asn Gln Phe Ser Leu Gln Leu Asn Ser
Val Thr Pro Glu 210 215 220Asp Thr Ala
Val Tyr Tyr Cys Ala Arg Tyr Gly Phe Ser Gly Ser Arg225
230 235 240Phe Tyr Asp Thr Trp Gly Gln
Gly Thr Leu Val Thr Val Ser Ser 245 250
25574254PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 74Leu Pro Val Leu Thr Gln Pro Pro Ser
Thr Ser Gly Thr Pro Gly Gln1 5 10
15Arg Val Thr Val Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Ser
Asn 20 25 30Val Val Phe Trp
Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Leu Val 35
40 45Ile Tyr Arg Asn Asn Gln Arg Pro Ser Gly Val Pro
Asp Arg Phe Ser 50 55 60Val Ser Lys
Ser Gly Thr Ser Ala Ser Leu Ala Ile Ser Gly Leu Arg65 70
75 80Ser Glu Asp Glu Ala Asp Tyr Tyr
Cys Ala Ala Trp Asp Asp Ser Leu 85 90
95Ser Gly Tyr Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu
Gly Ser 100 105 110Arg Gly Gly
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 115
120 125Leu Glu Met Ala Glu Val Gln Leu Val Gln Ser
Gly Ala Glu Val Lys 130 135 140Lys Pro
Gly Ser Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly Thr145
150 155 160Phe Ser Ser Tyr Ala Ile Ser
Trp Val Arg Gln Ala Pro Gly Gln Gly 165
170 175Leu Glu Trp Met Gly Arg Ile Ile Pro Ile Leu Gly
Ile Ala Asn Tyr 180 185 190Ala
Gln Lys Phe Gln Gly Arg Val Thr Met Thr Glu Asp Thr Ser Thr 195
200 205Asp Thr Ala Tyr Met Glu Leu Ser Ser
Leu Arg Ser Glu Asp Thr Ala 210 215
220Val Tyr Tyr Cys Ala Arg Ser Gly Tyr Ser Lys Ser Ile Val Ser Tyr225
230 235 240Met Asp Tyr Trp
Gly Gln Gly Thr Leu Val Thr Val Ser Ser 245
25075251PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 75Gln Ser Val Val Thr Gln Pro Pro Ser Val Ser
Gly Ala Pro Gly Gln1 5 10
15Arg Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Ala Arg
20 25 30Tyr Asp Val Gln Trp Tyr Gln
Gln Leu Pro Gly Thr Ala Pro Lys Leu 35 40
45Leu Ile Phe Gly Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg
Phe 50 55 60Ser Gly Ser Lys Ser Gly
Thr Ser Ala Ser Leu Ala Ile Thr Gly Leu65 70
75 80Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln
Ser Tyr Asp Ser Ser 85 90
95Leu Ser Ala Ser Val Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly
100 105 110Ser Arg Gly Gly Gly Gly
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 115 120
125Ser Leu Glu Met Ala Glu Val Gln Leu Val Gln Ser Gly Ala
Glu Val 130 135 140Lys Lys Pro Gly Ala
Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr145 150
155 160Thr Phe Thr Ser Tyr Tyr Met His Trp Val
Arg Gln Ala Pro Gly Gln 165 170
175Gly Leu Glu Trp Met Gly Ile Ile Asn Pro Ser Gly Gly Ser Thr Ser
180 185 190Tyr Ala Gln Lys Phe
Gln Gly Arg Val Thr Met Thr Arg Asp Thr Ser 195
200 205Thr Ser Thr Val Tyr Met Glu Leu Ser Ser Leu Arg
Ser Glu Asp Thr 210 215 220Ala Val Tyr
Tyr Cys Ala Arg Ser Gln Trp Gly Gly Val Leu Asp Tyr225
230 235 240Trp Gly Gln Gly Thr Leu Val
Thr Val Ser Ser 245 25076255PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
76Gln Ala Val Leu Thr Gln Pro Pro Ser Ala Ser Gly Thr Pro Gly Gln1
5 10 15Arg Val Thr Ile Ser Cys
Ser Gly Ser Ser Ser Asn Ile Gly Ser Asn 20 25
30Thr Val Asn Trp Tyr Arg Gln Leu Pro Gly Thr Ala Pro
Lys Leu Leu 35 40 45Ile Tyr Ser
Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe Ser 50
55 60Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile
Ser Gly Leu Gln65 70 75
80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu
85 90 95Asn Gly Val Val Phe Gly
Gly Gly Thr Lys Leu Thr Val Leu Gly Ser 100
105 110Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
Gly Gly Gly Ser 115 120 125Leu Glu
Met Ala Gln Val Gln Leu Val Glu Ser Gly Ala Glu Val Lys 130
135 140Lys Pro Gly Ser Ser Val Lys Val Ser Cys Lys
Ala Ser Gly Gly Thr145 150 155
160Phe Ser Ser Tyr Ala Ile Ser Trp Val Arg Gln Ala Pro Gly Gln Gly
165 170 175Leu Glu Trp Met
Gly Arg Ile Ile Pro Ile Leu Gly Ile Ala Asn Tyr 180
185 190Ala Gln Lys Phe Gln Gly Arg Val Thr Ile Thr
Ala Asp Glu Ser Thr 195 200 205Ser
Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala 210
215 220Val Tyr Tyr Cys Ala Arg Thr Gly Tyr Glu
Ser Trp Gly Ser Tyr Glu225 230 235
240Val Ile Asp Arg Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser
245 250
25577254PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 77Leu Pro Val Leu Thr Gln Pro Pro Ser Ala Ser
Gly Thr Pro Gly Gln1 5 10
15Arg Val Thr Ile Ser Cys Ser Gly Arg Ser Ser Asn Ile Gly Ser Asn
20 25 30Ser Val Asn Trp Tyr Arg Gln
Leu Pro Gly Ala Ala Pro Lys Leu Leu 35 40
45Ile Tyr Ser Asn Asn Gln Arg Pro Pro Gly Val Pro Val Arg Phe
Ser 50 55 60Gly Ser Lys Ser Gly Thr
Ser Ala Ser Leu Ala Ile Ser Gly Leu Gln65 70
75 80Ser Glu Asp Glu Ala Thr Tyr Tyr Cys Ala Thr
Trp Asp Asp Asn Leu 85 90
95Asn Val His Tyr Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu Gly
100 105 110Ser Arg Gly Gly Gly Gly
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 115 120
125Ser Leu Glu Met Ala Gln Val Gln Leu Val Gln Ser Gly Ala
Glu Val 130 135 140Lys Lys Pro Gly Ser
Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly145 150
155 160Thr Phe Ser Ser Tyr Ala Ile Ser Trp Val
Arg Gln Ala Pro Gly Gln 165 170
175Gly Leu Glu Trp Met Gly Arg Ile Ile Pro Ile Leu Gly Ile Ala Asn
180 185 190Tyr Ala Gln Lys Phe
Gln Gly Arg Val Thr Ile Thr Ala Asp Lys Ser 195
200 205Thr Ser Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg
Ser Glu Asp Thr 210 215 220Ala Val Tyr
Tyr Cys Ala Arg Gly Gly Tyr Tyr Ser His Asp Met Trp225
230 235 240Ser Glu Asp Trp Gly Gln Gly
Thr Leu Val Thr Val Ser Ser 245
25078263PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 78Gln Pro Val Leu Thr Gln Pro Pro Ser Ala Ser
Ala Ser Leu Gly Ala1 5 10
15Ser Val Thr Leu Thr Cys Thr Leu Ser Ser Gly Tyr Ser Asn Tyr Lys
20 25 30Val Asp Trp Tyr Gln Gln Arg
Pro Gly Lys Gly Pro Arg Phe Val Met 35 40
45Arg Val Gly Thr Gly Gly Ile Val Gly Ser Lys Gly Asp Gly Ile
Pro 50 55 60Asp Arg Phe Ser Val Leu
Gly Ser Gly Leu Asn Arg Tyr Leu Thr Ile65 70
75 80Lys Asn Ile Gln Glu Glu Asp Glu Gly Asp Tyr
His Cys Gly Ala Asp 85 90
95His Gly Ser Gly Ser Asn Phe Val Tyr Val Phe Gly Thr Gly Thr Lys
100 105 110Val Thr Val Leu Gly Ser
Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly 115 120
125Ser Gly Gly Gly Gly Ser Leu Glu Met Ala Gln Val Gln Leu
Gln Glu 130 135 140Ser Gly Pro Gly Leu
Val Lys Pro Ser Gly Thr Leu Ser Leu Thr Cys145 150
155 160Gly Val Ser Gly Gly Ser Ile Ser Asn Ser
Asn Trp Trp Ser Trp Val 165 170
175Arg Gln Pro Pro Gly Lys Gly Leu Glu Trp Ile Gly Glu Ile Tyr His
180 185 190Ser Gly Ser Thr Lys
Tyr Asn Pro Ser Leu Arg Ser Arg Val Thr Ile 195
200 205Ser Val Asp Lys Ser Lys Asn Gln Phe Ser Leu Lys
Leu Ser Ser Val 210 215 220Thr Ala Ala
Asp Thr Ala Val Tyr Tyr Cys Ala Arg Arg Asp Asn Trp225
230 235 240Lys Thr Pro Thr Thr Lys Ile
Asp Gly Phe Asp Ile Trp Gly Gln Gly 245
250 255Thr Met Val Thr Val Ser Ser
26079247PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 79Asp Ile Gln Leu Thr Gln Ser Pro Ser Ser Leu
Ser Ala Ser Val Gly1 5 10
15Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Ser Ile Ser Ser Tyr
20 25 30Leu Asn Trp Tyr Gln Gln Lys
Pro Gly Lys Ala Pro Lys Leu Leu Ile 35 40
45Tyr Ala Ala Ser Ser Leu Gln Ser Gly Val Pro Ser Arg Phe Ser
Gly 50 55 60Ser Gly Ser Gly Thr Asp
Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro65 70
75 80Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln Ser
Tyr Ser Thr Pro Pro 85 90
95Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Ser Arg Gly Gly
100 105 110Gly Gly Ser Gly Gly Gly
Gly Ser Gly Gly Gly Gly Ser Leu Glu Met 115 120
125Ala Gln Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys
Pro Gly 130 135 140Ala Ser Val Lys Val
Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Gly145 150
155 160Tyr Tyr Met His Trp Val Arg Gln Ala Pro
Gly Gln Gly Leu Glu Trp 165 170
175Met Gly Trp Ile Asn Pro Asn Ser Gly Gly Thr Asn Tyr Ala Gln Lys
180 185 190Phe Gln Gly Arg Val
Thr Met Thr Arg Asp Thr Ser Ile Ser Thr Ala 195
200 205Tyr Met Glu Leu Ser Arg Leu Arg Ser Asp Asp Thr
Ala Val Tyr Tyr 210 215 220Cys Ala Arg
Ser Gln Trp Gly Ser Ser Trp Asp Tyr Trp Gly Gln Gly225
230 235 240Thr Leu Val Thr Val Ser Ser
24580257PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 80Gln Pro Val Leu Thr Gln Pro Pro Ser
Ala Ser Ala Ser Leu Gly Ala1 5 10
15Ser Val Thr Leu Thr Cys Thr Leu Ser Asn Asp Tyr Thr Asn Tyr
Lys 20 25 30Val Asp Trp Tyr
Gln Gln Arg Pro Gly Lys Gly Pro Arg Phe Val Met 35
40 45Arg Val Gly Pro Gly Gly Ile Val Gly Ser Lys Gly
Asp Gly Ile Pro 50 55 60Asp Arg Phe
Ser Val Leu Gly Ser Gly Leu Asn Arg Tyr Leu Thr Ile65 70
75 80Lys Asn Ile Gln Glu Glu Asp Glu
Ser Asp Tyr His Cys Gly Ala Asp 85 90
95His Gly Thr Gly Ser Asn Phe Val Tyr Val Phe Gly Gly Gly
Thr Lys 100 105 110Leu Thr Val
Leu Gly Ser Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly 115
120 125Ser Gly Gly Gly Gly Ser Leu Glu Met Ala Glu
Val Gln Leu Val Glu 130 135 140Ser Gly
Ala Glu Val Lys Lys Pro Gly Ala Ser Val Lys Val Ser Cys145
150 155 160Lys Ala Ser Gly Tyr Thr Phe
Thr Gly Tyr Tyr Met His Trp Val Arg 165
170 175Gln Ala Pro Gly Gln Gly Leu Glu Trp Met Gly Trp
Ile Asn Pro Asn 180 185 190Ser
Gly Gly Thr Asn Tyr Ala Gln Lys Phe Gln Gly Arg Val Thr Met 195
200 205Thr Arg Asp Thr Ser Ile Ser Thr Ala
Tyr Met Glu Leu Ser Arg Leu 210 215
220Arg Ser Asp Asp Thr Ala Val Tyr Tyr Cys Ala Arg Ser Ser Tyr His225
230 235 240Leu Tyr Gly Tyr
Asp Ser Trp Gly Gln Gly Thr Leu Val Thr Val Ser 245
250 255Ser81258PRTArtificial SequenceDescription
of Artificial Sequence Synthetic polypeptide 81Gln Pro Val Leu Thr
Gln Pro Pro Ser Ala Ser Ala Ser Leu Gly Ala1 5
10 15Ser Val Thr Leu Thr Cys Thr Leu Ser Ser Gly
Tyr Ser Asn Tyr Lys 20 25
30Val Asp Trp Tyr Gln Gln Arg Pro Gly Lys Gly Pro Arg Phe Leu Met
35 40 45Arg Val Asp Thr Gly Gly Ile Val
Gly Ser Lys Gly Asp Gly Ile Pro 50 55
60Asp Arg Phe Ser Val Ser Gly Ser Gly Leu Asn Arg Tyr Leu Thr Ile65
70 75 80Lys Asn Ile Gln Glu
Glu Asp Glu Ser Asp Tyr His Cys Gly Ala Asp 85
90 95His Gly Ser Gly Ser Asn Phe Val Trp Val Phe
Gly Gly Gly Thr Lys 100 105
110Leu Thr Val Leu Gly Ser Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125Ser Gly Gly Gly Gly Ser Leu
Glu Met Ala Gln Val Gln Leu Val Gln 130 135
140Ser Gly Ala Glu Val Lys Lys Pro Gly Ser Ser Val Lys Val Ser
Cys145 150 155 160Lys Ala
Ser Gly Gly Thr Phe Ser Ser Tyr Ala Ile Ser Trp Val Arg
165 170 175Gln Ala Pro Gly Gln Gly Leu
Glu Trp Met Gly Gly Ile Ile Pro Ile 180 185
190Phe Ser Thr Ala Asn Tyr Ala Gln Lys Phe Gln Gly Arg Val
Thr Met 195 200 205Thr Thr Asp Thr
Ser Thr Ser Thr Ala Tyr Met Glu Leu Arg Ser Leu 210
215 220Arg Ser Asp Asp Thr Ala Val Tyr Tyr Cys Ala Arg
Gln Pro Trp Thr225 230 235
240Trp Tyr Ser Pro Tyr Asp Gln Trp Gly Gln Gly Thr Leu Val Thr Val
245 250 255Ser
Ser82251PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 82Asp Val Val Met Thr Gln Ser Pro Leu Ser Leu
Pro Val Thr Pro Gly1 5 10
15Glu Pro Ala Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Leu His Ser
20 25 30Asn Gly Tyr Asn Tyr Leu Asp
Trp Tyr Leu Gln Lys Pro Gly Gln Ser 35 40
45Pro Gln Leu Leu Ile Tyr Leu Gly Ser Asn Arg Ala Ser Gly Val
Pro 50 55 60Asp Arg Phe Ser Gly Ser
Gly Ser Gly Thr Asp Phe Thr Leu Lys Ile65 70
75 80Ser Arg Val Glu Ala Glu Asp Val Gly Val Tyr
Tyr Cys Met Gln Ala 85 90
95Leu Gln Thr Pro Leu Thr Phe Gly Gly Gly Thr Lys Val Glu Ile Lys
100 105 110Arg Ser Arg Gly Gly Gly
Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 115 120
125Gly Ser Leu Glu Met Ala Glu Val Gln Leu Val Glu Thr Gly
Gly Gly 130 135 140Leu Val Gln Pro Gly
Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly145 150
155 160Phe Thr Phe Ser Thr Tyr Ala Met Thr Trp
Val Arg Gln Ala Pro Gly 165 170
175Lys Gly Leu Glu Trp Val Ser Ala Ile Thr Pro Gly Gly Asp Arg Thr
180 185 190Tyr Tyr Ala Asp Ser
Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn 195
200 205Ser Arg Asn Thr Leu Tyr Leu Gln Met Asn Ser Leu
Arg Ala Glu Asp 210 215 220Thr Ala Val
Tyr Tyr Cys Ala Arg Tyr Tyr Gly Tyr Met Ile Asp Met225
230 235 240Trp Gly Gln Gly Thr Leu Val
Thr Val Ser Ser 245 25083250PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
83Ser Tyr Val Leu Thr Gln Pro Pro Ser Ala Ser Gly Thr Pro Gly Gln1
5 10 15Arg Val Thr Ile Ser Cys
Ser Gly Ser Ser Ser Asn Ile Gly Ser Asn 20 25
30Thr Val Asn Trp Tyr Gln Gln Val Pro Gly Thr Ala Pro
Lys Leu Leu 35 40 45Ile Tyr Ser
Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe Ser 50
55 60Gly Ser Lys Ser Gly Ala Ser Ala Ser Leu Ala Ile
Ser Trp Leu Gln65 70 75
80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu
85 90 95Asn Gly Trp Val Phe Gly
Gly Gly Thr Lys Leu Thr Val Leu Gly Ser 100
105 110Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly
Gly Gly Gly Ser 115 120 125Leu Glu
Met Ala Gln Met Gln Leu Val Gln Ser Gly Ala Glu Val Lys 130
135 140Lys Pro Gly Ala Ser Val Lys Val Ser Cys Lys
Ala Ser Gly Tyr Thr145 150 155
160Phe Thr Gly Tyr Tyr Val His Trp Leu Arg Gln Ala Pro Gly Gln Gly
165 170 175Leu Glu Trp Met
Gly Trp Ile Asn Pro Asn Ser Gly Gly Thr Asn Asn 180
185 190Ala Gln Glu Phe Gln Gly Arg Ile Thr Met Thr
Arg Asp Thr Ser Ile 195 200 205Asn
Thr Ala Tyr Met Glu Leu Ser Arg Leu Arg Ser Asp Asp Thr Ala 210
215 220Val Tyr Tyr Cys Ala Arg Ser Gln Trp Gly
Gly Thr Tyr Asp Tyr Trp225 230 235
240Gly Gln Gly Thr Leu Val Thr Val Ser Ser 245
25084249PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 84Gln Ala Val Leu Thr Gln Pro Pro Ser
Ala Ser Gly Thr Pro Gly Gln1 5 10
15Arg Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Ser
Tyr 20 25 30Thr Val Ser Trp
Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Phe Leu 35
40 45Ile Tyr Ser Asn Asn Gln Arg Pro Ser Gly Val Pro
Asp Arg Phe Ser 50 55 60Gly Ser Lys
Ser Gly Thr Ser Ala Ser Leu Ala Ile Ser Gly Leu Gln65 70
75 80Ser Glu Asp Glu Ala Asp Tyr Tyr
Cys Ala Ala Trp Asp Asp Ser Leu 85 90
95Asn Gly Tyr Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu
Gly Ser 100 105 110Arg Gly Gly
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 115
120 125Leu Glu Met Ala Glu Val Gln Leu Val Gln Ser
Gly Ala Glu Val Lys 130 135 140Lys Pro
Gly Glu Ser Leu Lys Ile Ser Cys Lys Gly Ser Gly Tyr Asp145
150 155 160Phe Thr Thr Tyr Trp Ile Gly
Trp Val Arg Gln Met Pro Gly Lys Gly 165
170 175Leu Glu Trp Met Gly Ile Ile Tyr Pro Gly Asp Ser
Asp Thr Arg Tyr 180 185 190Ser
Pro Ser Val Arg Gly Arg Val Thr Ile Ser Ala Asp Lys Ser Ile 195
200 205Asn Thr Ala Tyr Leu Gln Trp Ser Ser
Leu Glu Ala Ser Asp Thr Ala 210 215
220Met Tyr Tyr Cys Ala Arg Met Trp Thr Phe Ser Gln Asp Gly Trp Gly225
230 235 240Gln Gly Thr Leu
Val Thr Val Ser Ser 24585244PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
85Gln Ser Ala Leu Thr Gln Pro Ala Ser Val Ser Ala Ser Pro Gly Gln1
5 10 15Ser Ile Ala Ile Ser Cys
Thr Gly Thr Ser Ser Asp Val Gly Trp Tyr 20 25
30Gln Gln His Pro Gly Lys Ala Pro Lys Leu Met Ile Tyr
Glu Asp Ser 35 40 45Lys Arg Pro
Ser Gly Val Ser Asn Arg Phe Ser Gly Ser Lys Ser Gly 50
55 60Asn Thr Ala Ser Leu Thr Ile Ser Gly Leu Gln Ala
Glu Asp Glu Ala65 70 75
80Asp Tyr Tyr Cys Ser Ser Asn Thr Arg Ser Ser Thr Leu Val Phe Gly
85 90 95Gly Gly Thr Lys Leu Thr
Val Leu Gly Ser Arg Gly Gly Gly Gly Ser 100
105 110Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Leu Glu
Met Ala Glu Val 115 120 125Gln Leu
Val Gln Ser Gly Ala Glu Met Lys Lys Pro Gly Ala Ser Leu 130
135 140Lys Leu Ser Cys Lys Ala Ser Gly Tyr Thr Phe
Ile Asp Tyr Tyr Val145 150 155
160Tyr Trp Met Arg Gln Ala Pro Gly Gln Gly Leu Glu Ser Met Gly Trp
165 170 175Ile Asn Pro Asn
Ser Gly Gly Thr Asn Tyr Ala Gln Lys Phe Gln Gly 180
185 190Arg Val Thr Met Thr Arg Asp Thr Ser Ile Ser
Thr Ala Tyr Met Glu 195 200 205Leu
Ser Arg Leu Arg Ser Asp Asp Thr Ala Met Tyr Tyr Cys Ala Arg 210
215 220Ser Gln Arg Asp Gly Tyr Met Asp Tyr Trp
Gly Gln Gly Thr Leu Val225 230 235
240Thr Val Ser Ser86251PRTArtificial SequenceDescription of
Artificial Sequence Synthetic polypeptide 86Gln Ser Val Leu Thr Gln
Pro Pro Ser Val Ser Gly Ala Pro Gly Gln1 5
10 15Arg Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn
Ile Gly Ala Gly 20 25 30Phe
Asp Val His Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Leu 35
40 45Leu Ile Tyr Gly Asn Ser Asn Arg Pro
Ser Gly Val Pro Asp Arg Phe 50 55
60Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile Thr Gly Leu65
70 75 80Gln Ala Glu Asp Glu
Ala Asp Tyr Tyr Cys Gln Ser Tyr Asp Ser Ser 85
90 95Leu Ser Gly Tyr Val Phe Gly Thr Gly Thr Lys
Val Thr Val Leu Gly 100 105
110Ser Arg Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
115 120 125Ser Leu Glu Met Ala Gln Val
Gln Leu Val Gln Ser Gly Ala Glu Val 130 135
140Lys Lys Pro Gly Ala Ser Val Lys Val Ser Cys Lys Ala Ser Gly
Tyr145 150 155 160Thr Phe
Thr Asp Tyr Tyr Met His Trp Val Arg Gln Ala Pro Gly Gln
165 170 175Arg Leu Glu Trp Met Gly Trp
Ile Asn Pro Asn Ser Gly Gly Thr Asn 180 185
190Tyr Ala Gln Lys Phe Gln Asp Arg Ile Thr Val Thr Arg Asp
Thr Ser 195 200 205Ser Asn Thr Gly
Tyr Met Glu Leu Thr Arg Leu Arg Ser Asp Asp Thr 210
215 220Ala Val Tyr Tyr Cys Ala Arg Ser Pro Tyr Ser Gly
Val Leu Asp Lys225 230 235
240Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser 245
25087253PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 87Gln Ala Val Leu Thr Gln Pro Pro Ser Ala Ser
Gly Thr Pro Gly Gln1 5 10
15Arg Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Ser Asn
20 25 30Tyr Val Phe Trp Tyr Gln Gln
Leu Pro Gly Thr Ala Pro Lys Leu Leu 35 40
45Ile Tyr Ser Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe
Ser 50 55 60Gly Ser Lys Ser Gly Thr
Ser Ala Ser Leu Ala Ile Ser Gly Leu Arg65 70
75 80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala
Trp Asp Asp Ser Leu 85 90
95Ser Ala Ser Tyr Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu Gly
100 105 110Ser Arg Gly Gly Gly Gly
Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 115 120
125Ser Leu Glu Met Ala Gln Val Gln Leu Val Gln Ser Gly Ala
Glu Val 130 135 140Lys Lys Pro Gly Ser
Ser Val Lys Val Ser Cys Lys Ala Ser Gly Gly145 150
155 160Thr Phe Ser Ser Tyr Ala Ile Ser Trp Val
Arg Gln Ala Pro Gly Gln 165 170
175Gly Leu Glu Trp Met Gly Arg Ile Ile Pro Ile Leu Gly Thr Ala Asn
180 185 190Tyr Ala Gln Lys Phe
Gln Gly Arg Val Thr Ile Thr Ala Asp Glu Ser 195
200 205Thr Ser Thr Ala Tyr Met Glu Leu Ser Ser Leu Arg
Ser Glu Asp Thr 210 215 220Ala Val Tyr
Tyr Cys Ala Arg Ser Gly Tyr Gly Ser Tyr Arg Trp Glu225
230 235 240Asp Ser Trp Gly Gln Gly Thr
Leu Val Thr Val Ser Ser 245
25088248PRTArtificial SequenceDescription of Artificial Sequence
Synthetic polypeptide 88Ser Tyr Glu Leu Thr Gln Pro Pro Ser Ala Ser
Gly Thr Pro Gly Gln1 5 10
15Arg Val Thr Met Ser Cys Ser Gly Thr Ser Ser Asn Ile Gly Ser His
20 25 30Ser Val Asn Trp Tyr Gln Gln
Leu Pro Gly Thr Ala Pro Lys Leu Leu 35 40
45Ile Tyr Thr Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe
Ser 50 55 60Gly Ser Lys Ser Gly Thr
Ser Ala Ser Leu Ala Ile Ser Gly Leu Gln65 70
75 80Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala
Trp Asp Gly Ser Leu 85 90
95Asn Gly Leu Val Phe Gly Gly Gly Thr Lys Leu Thr Val Leu Gly Ser
100 105 110Arg Gly Gly Gly Gly Ser
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 115 120
125Leu Glu Met Ala Glu Val Gln Leu Val Gln Ser Gly Ala Glu
Val Lys 130 135 140Lys Pro Gly Glu Ser
Leu Lys Ile Ser Cys Lys Gly Ser Gly Tyr Ser145 150
155 160Phe Thr Ser Tyr Trp Ile Gly Trp Val Arg
Gln Met Pro Gly Lys Gly 165 170
175Leu Glu Trp Met Gly Ile Ile Tyr Pro Gly Asp Ser Asp Thr Arg Tyr
180 185 190Ser Pro Ser Phe Gln
Gly His Val Thr Ile Ser Ala Asp Lys Ser Ile 195
200 205Ser Thr Ala Tyr Leu Gln Trp Ser Ser Leu Lys Ala
Ser Asp Thr Ala 210 215 220Met Tyr Tyr
Cys Ala Arg Tyr Ser Gly Ser Phe Asp Asn Trp Gly Gln225
230 235 240Gly Thr Leu Val Thr Val Ser
Ser 245899PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 89Val Ser Ser Asn Ser Ala Ala Trp Asn1
5907PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 90Tyr Arg Ser Lys Trp Tyr Asn1
59113PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 91Ala Arg Gln Gly Tyr Ser Tyr Tyr Gly Tyr Ser Asp Val1
5 10928PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 92Ser Ser Asn Ile Gly His Asn
Asp1 5933PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 93Phe Asp Asp19411PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 94Ala
Ala Trp Asp Gly Ser Leu Asn Ala Phe Val1 5
10959PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 95Val Ser Ser Asn Ser Ala Ala Trp Asn1
5967PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 96Tyr Arg Ser Lys Trp Tyr Asn1 59713PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 97Ala
Arg Tyr Gly Phe Ser Gly Ser Arg Phe Tyr Asp Thr1 5
10988PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 98Ser Ser Asn Ile Gly Asn Asn Ala1
5993PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 99Phe Asp Asp110011PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 100Ala Ala Trp Asp Asp Ser Leu
Asn Gly Tyr Val1 5 101018PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 101Gly
Gly Thr Phe Ser Ser Tyr Ala1 51028PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 102Ile
Ile Pro Ile Leu Gly Ile Ala1 510315PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 103Ala
Arg Ser Gly Tyr Ser Lys Ser Ile Val Ser Tyr Met Asp Tyr1 5
10 151048PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 104Ser
Ser Asn Ile Gly Ser Asn Val1 51053PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 105Arg
Asn Asn110611PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 106Ala Ala Trp Asp Asp Ser Leu Ser Gly Tyr Val1
5 101078PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 107Gly Tyr Thr Phe Thr Ser Tyr
Tyr1 51088PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 108Ile Asn Pro Ser Gly Gly Ser Thr1
510911PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 109Ala Arg Ser Gln Trp Gly Gly Val Leu Asp Tyr1
5 101109PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 110Ser Ser Asn Ile Gly Ala Arg
Tyr Asp1 51113PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 111Gly Asn
Asn111211PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 112Gln Ser Tyr Asp Ser Ser Leu Ser Ala Ser Val1
5 101138PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 113Gly Gly Thr Phe Ser Ser Tyr
Ala1 51148PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 114Ile Ile Pro Ile Leu Gly Ile Ala1
511516PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 115Ala Arg Thr Gly Tyr Glu Ser Trp Gly Ser Tyr Glu
Val Ile Asp Arg1 5 10
151168PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 116Ser Ser Asn Ile Gly Ser Asn Thr1
51173PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 117Ser Asn Asn111811PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 118Ala Ala Trp Asp Asp Ser Leu
Asn Gly Val Val1 5 101198PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 119Gly
Gly Thr Phe Ser Ser Tyr Ala1 51208PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 120Ile
Ile Pro Ile Leu Gly Ile Ala1 512114PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 121Ala
Arg Gly Gly Tyr Tyr Ser His Asp Met Trp Ser Glu Asp1 5
101228PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 122Ser Ser Asn Ile Gly Ser Asn Ser1
51233PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 123Ser Asn Asn112412PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 124Ala
Thr Trp Asp Asp Asn Leu Asn Val His Tyr Val1 5
101259PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 125Gly Gly Ser Ile Ser Asn Ser Asn Trp1
51267PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 126Ile Tyr His Ser Gly Ser Thr1
512718PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 127Ala Arg Arg Asp Asn Trp Lys Thr Pro Thr Thr Lys Ile Asp
Gly Phe1 5 10 15Asp
Ile1287PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 128Ser Gly Tyr Ser Asn Tyr Lys1
51298PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 129Val Gly Thr Gly Gly Ile Val Gly1
513013PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 130Gly Ala Asp His Gly Ser Gly Ser Asn Phe Val Tyr Val1
5 101318PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 131Gly Tyr Thr Phe Thr Gly Tyr
Tyr1 51328PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 132Ile Asn Pro Asn Ser Gly Gly Thr1
513311PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 133Ala Arg Ser Gln Trp Gly Ser Ser Trp Asp Tyr1
5 101346PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 134Gln Ser Ile Ser Ser Tyr1
51353PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 135Ala Ala Ser11369PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 136Gln
Gln Ser Tyr Ser Thr Pro Pro Thr1 51378PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 137Gly
Tyr Thr Phe Thr Gly Tyr Tyr1 51388PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 138Ile
Asn Pro Asn Ser Gly Gly Thr1 513912PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 139Ala
Arg Ser Ser Tyr His Leu Tyr Gly Tyr Asp Ser1 5
101407PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 140Asn Asp Tyr Thr Asn Tyr Lys1
51418PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 141Val Gly Pro Gly Gly Ile Val Gly1
514213PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 142Gly Ala Asp His Gly Thr Gly Ser Asn Phe Val Tyr Val1
5 101438PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 143Gly Gly Thr Phe Ser Ser Tyr
Ala1 51448PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 144Ile Ile Pro Ile Phe Ser Thr Ala1
514513PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 145Ala Arg Gln Pro Trp Thr Trp Tyr Ser Pro Tyr Asp
Gln1 5 101467PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 146Ser
Gly Tyr Ser Asn Tyr Lys1 51478PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 147Val
Asp Thr Gly Gly Ile Val Gly1 514813PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 148Gly
Ala Asp His Gly Ser Gly Ser Asn Phe Val Trp Val1 5
101498PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 149Gly Phe Thr Phe Ser Thr Tyr Ala1
51508PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 150Ile Thr Pro Gly Gly Asp Arg Thr1
515110PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 151Ala Arg Tyr Tyr Gly Tyr Met Ile Asp Met1 5
1015211PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 152Gln Ser Leu Leu His Ser Asn Gly Tyr
Asn Tyr1 5 101533PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 153Leu
Gly Ser11549PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 154Met Gln Ala Leu Gln Thr Pro Leu Thr1
51558PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 155Gly Tyr Thr Phe Thr Gly Tyr Tyr1
51568PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 156Ile Asn Pro Asn Ser Gly Gly Thr1
515711PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 157Ala Arg Ser Gln Trp Gly Gly Thr Tyr Asp Tyr1
5 101588PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 158Ser Ser Asn Ile Gly Ser Asn Thr1
51593PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 159Ser Asn Asn116011PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 160Ala
Ala Trp Asp Asp Ser Leu Asn Gly Trp Val1 5
101618PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 161Gly Tyr Asp Phe Thr Thr Tyr Trp1
51628PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 162Ile Tyr Pro Gly Asp Ser Asp Thr1
516310PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 163Ala Arg Met Trp Thr Phe Ser Gln Asp Gly1 5
101648PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 164Ser Ser Asn Ile Gly Ser Tyr Thr1
51653PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 165Ser Asn Asn116611PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 166Ala
Ala Trp Asp Asp Ser Leu Asn Gly Tyr Val1 5
101678PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 167Gly Tyr Thr Phe Ile Asp Tyr Tyr1
51688PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 168Ile Asn Pro Asn Ser Gly Gly Thr1
516911PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 169Ala Arg Ser Gln Arg Asp Gly Tyr Met Asp Tyr1
5 101709PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 170Ile Ser Cys Thr Gly Thr Ser Ser Asp1
51713PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 171Glu Asp Ser117210PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 172Ser
Ser Asn Thr Arg Ser Ser Thr Leu Val1 5
101738PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 173Gly Tyr Thr Phe Thr Asp Tyr Tyr1
51748PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 174Ile Asn Pro Asn Ser Gly Gly Thr1
517511PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 175Ala Arg Ser Pro Tyr Ser Gly Val Leu Asp Lys1
5 101769PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 176Ser Ser Asn Ile Gly Ala Gly Phe Asp1
51773PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 177Gly Asn Ser117811PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 178Gln
Ser Tyr Asp Ser Ser Leu Ser Gly Tyr Val1 5
101798PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 179Gly Gly Thr Phe Ser Ser Tyr Ala1
51808PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 180Ile Ile Pro Ile Leu Gly Thr Ala1
518113PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 181Ala Arg Ser Gly Tyr Gly Ser Tyr Arg Trp Glu Asp Ser1
5 101828PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 182Ser Ser Asn Ile Gly Ser Asn
Tyr1 51833PRTArtificial SequenceDescription of Artificial
Sequence Synthetic peptide 183Ser Asn Asn118412PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 184Ala
Ala Trp Asp Asp Ser Leu Ser Ala Ser Tyr Val1 5
101858PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 185Gly Tyr Ser Phe Thr Ser Tyr Trp1
51868PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 186Ile Tyr Pro Gly Asp Ser Asp Thr1
51879PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 187Ala Arg Tyr Ser Gly Ser Phe Asp Asn1
51888PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 188Ser Ser Asn Ile Gly Ser His Ser1
51893PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 189Thr Asn Asn119011PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 190Ala Ala Trp Asp Gly Ser Leu
Asn Gly Leu Val1 5 1019120PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 191Met
Ala Leu Pro Val Thr Ala Leu Leu Leu Pro Leu Ala Leu Leu Leu1
5 10 15His Ala Ala Arg
2019260DNAArtificial SequenceDescription of Artificial Sequence Synthetic
oligonucleotide 192atggctctcc cagtgactgc cctactgctt cccctagcgc
ttctcctgca tgcagctcgt 60193220PRTHomo sapiens 193Met Leu Arg Leu Leu
Leu Ala Leu Asn Leu Phe Pro Ser Ile Gln Val1 5
10 15Thr Gly Asn Lys Ile Leu Val Lys Gln Ser Pro
Met Leu Val Ala Tyr 20 25
30Asp Asn Ala Val Asn Leu Ser Cys Lys Tyr Ser Tyr Asn Leu Phe Ser
35 40 45Arg Glu Phe Arg Ala Ser Leu His
Lys Gly Leu Asp Ser Ala Val Glu 50 55
60Val Cys Val Val Tyr Gly Asn Tyr Ser Gln Gln Leu Gln Val Tyr Ser65
70 75 80Lys Thr Gly Phe Asn
Cys Asp Gly Lys Leu Gly Asn Glu Ser Val Thr 85
90 95Phe Tyr Leu Gln Asn Leu Tyr Val Asn Gln Thr
Asp Ile Tyr Phe Cys 100 105
110Lys Ile Glu Val Met Tyr Pro Pro Pro Tyr Leu Asp Asn Glu Lys Ser
115 120 125Asn Gly Thr Ile Ile His Val
Lys Gly Lys His Leu Cys Pro Ser Pro 130 135
140Leu Phe Pro Gly Pro Ser Lys Pro Phe Trp Val Leu Val Val Val
Gly145 150 155 160Gly Val
Leu Ala Cys Tyr Ser Leu Leu Val Thr Val Ala Phe Ile Ile
165 170 175Phe Trp Val Arg Ser Lys Arg
Ser Arg Leu Leu His Ser Asp Tyr Met 180 185
190Asn Met Thr Pro Arg Arg Pro Gly Pro Thr Arg Lys His Tyr
Gln Pro 195 200 205Tyr Ala Pro Pro
Arg Asp Phe Ala Ala Tyr Arg Ser 210 215
220194321DNAHomo sapiens 194attgaagtta tgtatcctcc tccttaccta gacaatgaga
agagcaatgg aaccattatc 60catgtgaaag ggaaacacct ttgtccaagt cccctatttc
ccggaccttc taagcccttt 120tgggtgctgg tggtggttgg tggagtcctg gcttgctata
gcttgctagt aacagtggcc 180tttattattt tctgggtgag gagtaagagg agcaggctcc
tgcacagtga ctacatgaac 240atgactcccc gccgccccgg gcccacccgc aagcattacc
agccctatgc cccaccacgc 300gacttcgcag cctatcgctc c
321195163PRTHomo sapiens 195Met Lys Trp Lys Ala
Leu Phe Thr Ala Ala Ile Leu Gln Ala Gln Leu1 5
10 15Pro Ile Thr Glu Ala Gln Ser Phe Gly Leu Leu
Asp Pro Lys Leu Cys 20 25
30Tyr Leu Leu Asp Gly Ile Leu Phe Ile Tyr Gly Val Ile Leu Thr Ala
35 40 45Leu Phe Leu Arg Val Lys Phe Ser
Arg Ser Ala Asp Ala Pro Ala Tyr 50 55
60Gln Gln Gly Gln Asn Gln Leu Tyr Asn Glu Leu Asn Leu Gly Arg Arg65
70 75 80Glu Glu Tyr Asp Val
Leu Asp Lys Arg Arg Gly Arg Asp Pro Glu Met 85
90 95Gly Gly Lys Pro Arg Arg Lys Asn Pro Gln Glu
Gly Leu Tyr Asn Glu 100 105
110Leu Gln Lys Asp Lys Met Ala Glu Ala Tyr Ser Glu Ile Gly Met Lys
115 120 125Gly Glu Arg Arg Arg Gly Lys
Gly His Asp Gly Leu Tyr Gln Gly Leu 130 135
140Ser Thr Ala Thr Lys Asp Thr Tyr Asp Ala Leu His Met Gln Ala
Leu145 150 155 160Pro Pro
Arg196339DNAHomo sapiens 196agagtgaagt tcagcaggag cgcagacgcc cccgcgtacc
agcagggcca gaaccagctc 60tataacgagc tcaatctagg acgaagagag gagtacgatg
ttttggacaa gagacgtggc 120cgggaccctg agatgggggg aaagccgaga aggaagaacc
ctcaggaagg cctgtacaat 180gaactgcaga aagataagat ggcggaggcc tacagtgaga
ttgggatgaa aggcgagcgc 240cggaggggca aggggcacga tggcctttac cagggtctca
gtacagccac caaggacacc 300tacgacgccc ttcacatgca ggccctgccc cctcgctaa
339197255PRTHomo sapiens 197Met Gly Asn Ser Cys
Tyr Asn Ile Val Ala Thr Leu Leu Leu Val Leu1 5
10 15Asn Phe Glu Arg Thr Arg Ser Leu Gln Asp Pro
Cys Ser Asn Cys Pro 20 25
30Ala Gly Thr Phe Cys Asp Asn Asn Arg Asn Gln Ile Cys Ser Pro Cys
35 40 45Pro Pro Asn Ser Phe Ser Ser Ala
Gly Gly Gln Arg Thr Cys Asp Ile 50 55
60Cys Arg Gln Cys Lys Gly Val Phe Arg Thr Arg Lys Glu Cys Ser Ser65
70 75 80Thr Ser Asn Ala Glu
Cys Asp Cys Thr Pro Gly Phe His Cys Leu Gly 85
90 95Ala Gly Cys Ser Met Cys Glu Gln Asp Cys Lys
Gln Gly Gln Glu Leu 100 105
110Thr Lys Lys Gly Cys Lys Asp Cys Cys Phe Gly Thr Phe Asn Asp Gln
115 120 125Lys Arg Gly Ile Cys Arg Pro
Trp Thr Asn Cys Ser Leu Asp Gly Lys 130 135
140Ser Val Leu Val Asn Gly Thr Lys Glu Arg Asp Val Val Cys Gly
Pro145 150 155 160Ser Pro
Ala Asp Leu Ser Pro Gly Ala Ser Ser Val Thr Pro Pro Ala
165 170 175Pro Ala Arg Glu Pro Gly His
Ser Pro Gln Ile Ile Ser Phe Phe Leu 180 185
190Ala Leu Thr Ser Thr Ala Leu Leu Phe Leu Leu Phe Phe Leu
Thr Leu 195 200 205Arg Phe Ser Val
Val Lys Arg Gly Arg Lys Lys Leu Leu Tyr Ile Phe 210
215 220Lys Gln Pro Phe Met Arg Pro Val Gln Thr Thr Gln
Glu Glu Asp Gly225 230 235
240Cys Ser Cys Arg Phe Pro Glu Glu Glu Glu Gly Gly Cys Glu Leu
245 250 255198277PRTHomo sapiens
198Met Cys Val Gly Ala Arg Arg Leu Gly Arg Gly Pro Cys Ala Ala Leu1
5 10 15Leu Leu Leu Gly Leu Gly
Leu Ser Thr Val Thr Gly Leu His Cys Val 20 25
30Gly Asp Thr Tyr Pro Ser Asn Asp Arg Cys Cys His Glu
Cys Arg Pro 35 40 45Gly Asn Gly
Met Val Ser Arg Cys Ser Arg Ser Gln Asn Thr Val Cys 50
55 60Arg Pro Cys Gly Pro Gly Phe Tyr Asn Asp Val Val
Ser Ser Lys Pro65 70 75
80Cys Lys Pro Cys Thr Trp Cys Asn Leu Arg Ser Gly Ser Glu Arg Lys
85 90 95Gln Leu Cys Thr Ala Thr
Gln Asp Thr Val Cys Arg Cys Arg Ala Gly 100
105 110Thr Gln Pro Leu Asp Ser Tyr Lys Pro Gly Val Asp
Cys Ala Pro Cys 115 120 125Pro Pro
Gly His Phe Ser Pro Gly Asp Asn Gln Ala Cys Lys Pro Trp 130
135 140Thr Asn Cys Thr Leu Ala Gly Lys His Thr Leu
Gln Pro Ala Ser Asn145 150 155
160Ser Ser Asp Ala Ile Cys Glu Asp Arg Asp Pro Pro Ala Thr Gln Pro
165 170 175Gln Glu Thr Gln
Gly Pro Pro Ala Arg Pro Ile Thr Val Gln Pro Thr 180
185 190Glu Ala Trp Pro Arg Thr Ser Gln Gly Pro Ser
Thr Arg Pro Val Glu 195 200 205Val
Pro Gly Gly Arg Ala Val Ala Ala Ile Leu Gly Leu Gly Leu Val 210
215 220Leu Gly Leu Leu Gly Pro Leu Ala Ile Leu
Leu Ala Leu Tyr Leu Leu225 230 235
240Arg Arg Asp Gln Arg Leu Pro Pro Asp Ala His Lys Pro Pro Gly
Gly 245 250 255Gly Ser Phe
Arg Thr Pro Ile Gln Glu Glu Gln Ala Asp Ala His Ser 260
265 270Thr Leu Ala Lys Ile
275199199PRTHomo sapiens 199Met Lys Ser Gly Leu Trp Tyr Phe Phe Leu Phe
Cys Leu Arg Ile Lys1 5 10
15Val Leu Thr Gly Glu Ile Asn Gly Ser Ala Asn Tyr Glu Met Phe Ile
20 25 30Phe His Asn Gly Gly Val Gln
Ile Leu Cys Lys Tyr Pro Asp Ile Val 35 40
45Gln Gln Phe Lys Met Gln Leu Leu Lys Gly Gly Gln Ile Leu Cys
Asp 50 55 60Leu Thr Lys Thr Lys Gly
Ser Gly Asn Thr Val Ser Ile Lys Ser Leu65 70
75 80Lys Phe Cys His Ser Gln Leu Ser Asn Asn Ser
Val Ser Phe Phe Leu 85 90
95Tyr Asn Leu Asp His Ser His Ala Asn Tyr Tyr Phe Cys Asn Leu Ser
100 105 110Ile Phe Asp Pro Pro Pro
Phe Lys Val Thr Leu Thr Gly Gly Tyr Leu 115 120
125His Ile Tyr Glu Ser Gln Leu Cys Cys Gln Leu Lys Phe Trp
Leu Pro 130 135 140Ile Gly Cys Ala Ala
Phe Val Val Val Cys Ile Leu Gly Cys Ile Leu145 150
155 160Ile Cys Trp Leu Thr Lys Lys Lys Tyr Ser
Ser Ser Val His Asp Pro 165 170
175Asn Gly Glu Tyr Met Phe Met Arg Ala Val Asn Thr Ala Lys Lys Ser
180 185 190Arg Leu Thr Asp Val
Thr Leu 195200223PRTHomo sapiens 200Met Ala Cys Leu Gly Phe Gln
Arg His Lys Ala Gln Leu Asn Leu Ala1 5 10
15Thr Arg Thr Trp Pro Cys Thr Leu Leu Phe Phe Leu Leu
Phe Ile Pro 20 25 30Val Phe
Cys Lys Ala Met His Val Ala Gln Pro Ala Val Val Leu Ala 35
40 45Ser Ser Arg Gly Ile Ala Ser Phe Val Cys
Glu Tyr Ala Ser Pro Gly 50 55 60Lys
Ala Thr Glu Val Arg Val Thr Val Leu Arg Gln Ala Asp Ser Gln65
70 75 80Val Thr Glu Val Cys Ala
Ala Thr Tyr Met Met Gly Asn Glu Leu Thr 85
90 95Phe Leu Asp Asp Ser Ile Cys Thr Gly Thr Ser Ser
Gly Asn Gln Val 100 105 110Asn
Leu Thr Ile Gln Gly Leu Arg Ala Met Asp Thr Gly Leu Tyr Ile 115
120 125Cys Lys Val Glu Leu Met Tyr Pro Pro
Pro Tyr Tyr Leu Gly Ile Gly 130 135
140Asn Gly Thr Gln Ile Tyr Val Ile Asp Pro Glu Pro Cys Pro Asp Ser145
150 155 160Asp Phe Leu Leu
Trp Ile Leu Ala Ala Val Ser Ser Gly Leu Phe Phe 165
170 175Tyr Ser Phe Leu Leu Thr Ala Val Ser Leu
Ser Lys Met Leu Lys Lys 180 185
190Arg Ser Pro Leu Thr Thr Gly Val Tyr Val Lys Met Pro Pro Thr Glu
195 200 205Pro Glu Cys Glu Lys Gln Phe
Gln Pro Tyr Phe Ile Pro Ile Asn 210 215
220201288PRTHomo sapiens 201Met Gln Ile Pro Gln Ala Pro Trp Pro Val Val
Trp Ala Val Leu Gln1 5 10
15Leu Gly Trp Arg Pro Gly Trp Phe Leu Asp Ser Pro Asp Arg Pro Trp
20 25 30Asn Pro Pro Thr Phe Ser Pro
Ala Leu Leu Val Val Thr Glu Gly Asp 35 40
45Asn Ala Thr Phe Thr Cys Ser Phe Ser Asn Thr Ser Glu Ser Phe
Val 50 55 60Leu Asn Trp Tyr Arg Met
Ser Pro Ser Asn Gln Thr Asp Lys Leu Ala65 70
75 80Ala Phe Pro Glu Asp Arg Ser Gln Pro Gly Gln
Asp Cys Arg Phe Arg 85 90
95Val Thr Gln Leu Pro Asn Gly Arg Asp Phe His Met Ser Val Val Arg
100 105 110Ala Arg Arg Asn Asp Ser
Gly Thr Tyr Leu Cys Gly Ala Ile Ser Leu 115 120
125Ala Pro Lys Ala Gln Ile Lys Glu Ser Leu Arg Ala Glu Leu
Arg Val 130 135 140Thr Glu Arg Arg Ala
Glu Val Pro Thr Ala His Pro Ser Pro Ser Pro145 150
155 160Arg Pro Ala Gly Gln Phe Gln Thr Leu Val
Val Gly Val Val Gly Gly 165 170
175Leu Leu Gly Ser Leu Val Leu Leu Val Trp Val Leu Ala Val Ile Cys
180 185 190Ser Arg Ala Ala Arg
Gly Thr Ile Gly Ala Arg Arg Thr Gly Gln Pro 195
200 205Leu Lys Glu Asp Pro Ser Ala Val Pro Val Phe Ser
Val Asp Tyr Gly 210 215 220Glu Leu Asp
Phe Gln Trp Arg Glu Lys Thr Pro Glu Pro Pro Val Pro225
230 235 240Cys Val Pro Glu Gln Thr Glu
Tyr Ala Thr Ile Val Phe Pro Ser Gly 245
250 255Met Gly Thr Ser Ser Pro Ala Arg Arg Gly Ser Ala
Asp Gly Pro Arg 260 265 270Ser
Ala Gln Pro Leu Arg Pro Glu Asp Gly His Cys Ser Trp Pro Leu 275
280 285202525PRTHomo sapiens 202Met Trp Glu
Ala Gln Phe Leu Gly Leu Leu Phe Leu Gln Pro Leu Trp1 5
10 15Val Ala Pro Val Lys Pro Leu Gln Pro
Gly Ala Glu Val Pro Val Val 20 25
30Trp Ala Gln Glu Gly Ala Pro Ala Gln Leu Pro Cys Ser Pro Thr Ile
35 40 45Pro Leu Gln Asp Leu Ser Leu
Leu Arg Arg Ala Gly Val Thr Trp Gln 50 55
60His Gln Pro Asp Ser Gly Pro Pro Ala Ala Ala Pro Gly His Pro Leu65
70 75 80Ala Pro Gly Pro
His Pro Ala Ala Pro Ser Ser Trp Gly Pro Arg Pro 85
90 95Arg Arg Tyr Thr Val Leu Ser Val Gly Pro
Gly Gly Leu Arg Ser Gly 100 105
110Arg Leu Pro Leu Gln Pro Arg Val Gln Leu Asp Glu Arg Gly Arg Gln
115 120 125Arg Gly Asp Phe Ser Leu Trp
Leu Arg Pro Ala Arg Arg Ala Asp Ala 130 135
140Gly Glu Tyr Arg Ala Ala Val His Leu Arg Asp Arg Ala Leu Ser
Cys145 150 155 160Arg Leu
Arg Leu Arg Leu Gly Gln Ala Ser Met Thr Ala Ser Pro Pro
165 170 175Gly Ser Leu Arg Ala Ser Asp
Trp Val Ile Leu Asn Cys Ser Phe Ser 180 185
190Arg Pro Asp Arg Pro Ala Ser Val His Trp Phe Arg Asn Arg
Gly Gln 195 200 205Gly Arg Val Pro
Val Arg Glu Ser Pro His His His Leu Ala Glu Ser 210
215 220Phe Leu Phe Leu Pro Gln Val Ser Pro Met Asp Ser
Gly Pro Trp Gly225 230 235
240Cys Ile Leu Thr Tyr Arg Asp Gly Phe Asn Val Ser Ile Met Tyr Asn
245 250 255Leu Thr Val Leu Gly
Leu Glu Pro Pro Thr Pro Leu Thr Val Tyr Ala 260
265 270Gly Ala Gly Ser Arg Val Gly Leu Pro Cys Arg Leu
Pro Ala Gly Val 275 280 285Gly Thr
Arg Ser Phe Leu Thr Ala Lys Trp Thr Pro Pro Gly Gly Gly 290
295 300Pro Asp Leu Leu Val Thr Gly Asp Asn Gly Asp
Phe Thr Leu Arg Leu305 310 315
320Glu Asp Val Ser Gln Ala Gln Ala Gly Thr Tyr Thr Cys His Ile His
325 330 335Leu Gln Glu Gln
Gln Leu Asn Ala Thr Val Thr Leu Ala Ile Ile Thr 340
345 350Val Thr Pro Lys Ser Phe Gly Ser Pro Gly Ser
Leu Gly Lys Leu Leu 355 360 365Cys
Glu Val Thr Pro Val Ser Gly Gln Glu Arg Phe Val Trp Ser Ser 370
375 380Leu Asp Thr Pro Ser Gln Arg Ser Phe Ser
Gly Pro Trp Leu Glu Ala385 390 395
400Gln Glu Ala Gln Leu Leu Ser Gln Pro Trp Gln Cys Gln Leu Tyr
Gln 405 410 415Gly Glu Arg
Leu Leu Gly Ala Ala Val Tyr Phe Thr Glu Leu Ser Ser 420
425 430Pro Gly Ala Gln Arg Ser Gly Arg Ala Pro
Gly Ala Leu Pro Ala Gly 435 440
445His Leu Leu Leu Phe Leu Ile Leu Gly Val Leu Ser Leu Leu Leu Leu 450
455 460Val Thr Gly Ala Phe Gly Phe His
Leu Trp Arg Arg Gln Trp Arg Pro465 470
475 480Arg Arg Phe Ser Ala Leu Glu Gln Gly Ile His Pro
Pro Gln Ala Gln 485 490
495Ser Lys Ile Glu Glu Leu Glu Gln Glu Pro Glu Pro Glu Pro Glu Pro
500 505 510Glu Pro Glu Pro Glu Pro
Glu Pro Glu Pro Glu Gln Leu 515 520
525203370PRTHomo sapiens 203Met Leu Gly Gln Val Val Thr Leu Ile Leu Leu
Leu Leu Leu Lys Val1 5 10
15Tyr Gln Gly Lys Gly Cys Gln Gly Ser Ala Asp His Val Val Ser Ile
20 25 30Ser Gly Val Pro Leu Gln Leu
Gln Pro Asn Ser Ile Gln Thr Lys Val 35 40
45Asp Ser Ile Ala Trp Lys Lys Leu Leu Pro Ser Gln Asn Gly Phe
His 50 55 60His Ile Leu Lys Trp Glu
Asn Gly Ser Leu Pro Ser Asn Thr Ser Asn65 70
75 80Asp Arg Phe Ser Phe Ile Val Lys Asn Leu Ser
Leu Leu Ile Lys Ala 85 90
95Ala Gln Gln Gln Asp Ser Gly Leu Tyr Cys Leu Glu Val Thr Ser Ile
100 105 110Ser Gly Lys Val Gln Thr
Ala Thr Phe Gln Val Phe Val Phe Glu Ser 115 120
125Leu Leu Pro Asp Lys Val Glu Lys Pro Arg Leu Gln Gly Gln
Gly Lys 130 135 140Ile Leu Asp Arg Gly
Arg Cys Gln Val Ala Leu Ser Cys Leu Val Ser145 150
155 160Arg Asp Gly Asn Val Ser Tyr Ala Trp Tyr
Arg Gly Ser Lys Leu Ile 165 170
175Gln Thr Ala Gly Asn Leu Thr Tyr Leu Asp Glu Glu Val Asp Ile Asn
180 185 190Gly Thr His Thr Tyr
Thr Cys Asn Val Ser Asn Pro Val Ser Trp Glu 195
200 205Ser His Thr Leu Asn Leu Thr Gln Asp Cys Gln Asn
Ala His Gln Glu 210 215 220Phe Arg Phe
Trp Pro Phe Leu Val Ile Ile Val Ile Leu Ser Ala Leu225
230 235 240Phe Leu Gly Thr Leu Ala Cys
Phe Cys Val Trp Arg Arg Lys Arg Lys 245
250 255Glu Lys Gln Ser Glu Thr Ser Pro Lys Glu Phe Leu
Thr Ile Tyr Glu 260 265 270Asp
Val Lys Asp Leu Lys Thr Arg Arg Asn His Glu Gln Glu Gln Thr 275
280 285Phe Pro Gly Gly Gly Ser Thr Ile Tyr
Ser Met Ile Gln Ser Gln Ser 290 295
300Ser Ala Pro Thr Ser Gln Glu Pro Ala Tyr Thr Leu Tyr Ser Leu Ile305
310 315 320Gln Pro Ser Arg
Lys Ser Gly Ser Arg Lys Arg Asn His Ser Pro Ser 325
330 335Phe Asn Ser Thr Ile Tyr Glu Val Ile Gly
Lys Ser Gln Pro Lys Ala 340 345
350Gln Asn Pro Ala Arg Leu Ser Arg Lys Glu Leu Glu Asn Phe Asp Val
355 360 365Tyr Ser 370204289PRTHomo
sapiens 204Met Lys Thr Leu Pro Ala Met Leu Gly Thr Gly Lys Leu Phe Trp
Val1 5 10 15Phe Phe Leu
Ile Pro Tyr Leu Asp Ile Trp Asn Ile His Gly Lys Glu 20
25 30Ser Cys Asp Val Gln Leu Tyr Ile Lys Arg
Gln Ser Glu His Ser Ile 35 40
45Leu Ala Gly Asp Pro Phe Glu Leu Glu Cys Pro Val Lys Tyr Cys Ala 50
55 60Asn Arg Pro His Val Thr Trp Cys Lys
Leu Asn Gly Thr Thr Cys Val65 70 75
80Lys Leu Glu Asp Arg Gln Thr Ser Trp Lys Glu Glu Lys Asn
Ile Ser 85 90 95Phe Phe
Ile Leu His Phe Glu Pro Val Leu Pro Asn Asp Asn Gly Ser 100
105 110Tyr Arg Cys Ser Ala Asn Phe Gln Ser
Asn Leu Ile Glu Ser His Ser 115 120
125Thr Thr Leu Tyr Val Thr Asp Val Lys Ser Ala Ser Glu Arg Pro Ser
130 135 140Lys Asp Glu Met Ala Ser Arg
Pro Trp Leu Leu Tyr Arg Leu Leu Pro145 150
155 160Leu Gly Gly Leu Pro Leu Leu Ile Thr Thr Cys Phe
Cys Leu Phe Cys 165 170
175Cys Leu Arg Arg His Gln Gly Lys Gln Asn Glu Leu Ser Asp Thr Ala
180 185 190Gly Arg Glu Ile Asn Leu
Val Asp Ala His Leu Lys Ser Glu Gln Thr 195 200
205Glu Ala Ser Thr Arg Gln Asn Ser Gln Val Leu Leu Ser Glu
Thr Gly 210 215 220Ile Tyr Asp Asn Asp
Pro Asp Leu Cys Phe Arg Met Gln Glu Gly Ser225 230
235 240Glu Val Tyr Ser Asn Pro Cys Leu Glu Glu
Asn Lys Pro Gly Ile Val 245 250
255Tyr Ala Ser Leu Asn His Ser Val Ile Gly Pro Asn Ser Arg Leu Ala
260 265 270Arg Asn Val Lys Glu
Ala Pro Thr Glu Tyr Ala Ser Ile Cys Val Arg 275
280 285Ser20520PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 205Met Glu Thr Asp Thr Leu Leu
Leu Trp Val Leu Leu Leu Trp Val Pro1 5 10
15Gly Ser Thr Gly 2020660DNAArtificial
SequenceDescription of Artificial Sequence Synthetic oligonucleotide
206atggaaaccg acaccctgct gctgtgggtg ctgctgctgt gggtgccagg atccacagga
602071398DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 207caatctgccc tgactcagcc tgcctccgtg
tctgcgtctc ctggacagtc gatcgccatc 60tcctgcactg gaaccagcag tgacgttggt
tggtatcaac agcacccagg caaagccccc 120aaactcatga tttatgagga cagtaagcgg
ccctcagggg tttctaatcg cttctctggc 180tccaagtctg gcaacacggc ctccctgacc
atctctgggc tccaggctga ggacgaggct 240gattattact gcagctcaaa tacaagaagc
agcactttgg tgttcggcgg agggaccaag 300ctgaccgtcc taggttctag aggtggtggt
ggtagcggcg gcggcggctc tggtggtggt 360ggatccctcg agatggccga agtgcagctg
gtgcagtctg gggctgagat gaagaagcct 420ggggcctcac tgaagctctc ctgcaaggct
tctggataca ccttcatcga ctactatgta 480tactggatgc gacaggcccc tggacaaggg
cttgagtcca tgggatggat caaccctaac 540agtggtggca caaactatgc acagaagttt
cagggcaggg tcaccatgac cagggacacg 600tccatcagca cagcctacat ggagctgagc
aggctgagat ctgacgacac cgccatgtat 660tactgtgcgc gctcccagcg tgacggttac
atggattact ggggtcaagg tactctggtg 720accgtctcct cagcggccgc aattgaagtt
atgtatcctc ctccttacct agacaatgag 780aagagcaatg gaaccattat ccatgtgaaa
gggaaacacc tttgtccaag tcccctattt 840cccggacctt ctaagccctt ttgggtgctg
gtggtggttg gtggagtcct ggcttgctat 900agcttgctag taacagtggc ctttattatt
ttctgggtga ggagtaagag gagcaggctc 960ctgcacagtg actacatgaa catgactccc
cgccgccccg ggcccacccg caagcattac 1020cagccctatg ccccaccacg cgacttcgca
gcctatcgct ccagagtgaa gttcagcagg 1080agcgcagacg cccccgcgta ccagcagggc
cagaaccagc tctataacga gctcaatcta 1140ggacgaagag aggagtacga tgttttggac
aagagacgtg gccgggaccc tgagatgggg 1200ggaaagccga gaaggaagaa ccctcaggaa
ggcctgtaca atgaactgca gaaagataag 1260atggcggagg cctacagtga gattgggatg
aaaggcgagc gccggagggg caaggggcac 1320gatggccttt accagggtct cagtacagcc
accaaggaca cctacgacgc ccttcacatg 1380caggccctgc cccctcgc
13982081419DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
208cagtctgtgc tgacgcagcc gccctcagtg tctggggccc cagggcagag ggtcaccatc
60tcctgcactg ggagcagctc caacatcggg gcaggttttg atgtacactg gtaccagcag
120cttccaggaa cagcccccaa actcctcatc tatggtaaca gcaatcggcc ctcaggggtc
180cctgaccgat tctctggctc caagtctggc acctcagcct ccctggccat cactgggctc
240caggctgagg atgaggctga ttattactgc cagtcctatg acagcagcct gagtggttat
300gtcttcggaa ctgggaccaa ggtcaccgtc ctaggttcta gaggtggtgg tggtagcggc
360ggcggcggct ctggtggtgg tggatccctc gagatggccc aggtccagct ggtacagtct
420ggggctgagg tgaagaagcc tggggcctca gtgaaggtct cctgcaaggc ttctggatac
480accttcaccg actactatat gcactgggtg cgacaggccc ctggacaacg gcttgagtgg
540atgggatgga tcaaccctaa cagtggtggc acaaactatg cacagaagtt tcaggacagg
600atcaccgtga ccagggacac ctccagcaac acaggctaca tggagctgac caggctgaga
660tctgacgaca cggccgtgta ttactgtgcg cgctctccgt actctggtgt tctggataaa
720tggggtcaag gtactctggt gaccgtctcc tcagcggccg caattgaagt tatgtatcct
780cctccttacc tagacaatga gaagagcaat ggaaccatta tccatgtgaa agggaaacac
840ctttgtccaa gtcccctatt tcccggacct tctaagccct tttgggtgct ggtggtggtt
900ggtggagtcc tggcttgcta tagcttgcta gtaacagtgg cctttattat tttctgggtg
960aggagtaaga ggagcaggct cctgcacagt gactacatga acatgactcc ccgccgcccc
1020gggcccaccc gcaagcatta ccagccctat gccccaccac gcgacttcgc agcctatcgc
1080tccagagtga agttcagcag gagcgcagac gcccccgcgt accagcaggg ccagaaccag
1140ctctataacg agctcaatct aggacgaaga gaggagtacg atgttttgga caagagacgt
1200ggccgggacc ctgagatggg gggaaagccg agaaggaaga accctcagga aggcctgtac
1260aatgaactgc agaaagataa gatggcggag gcctacagtg agattgggat gaaaggcgag
1320cgccggaggg gcaaggggca cgatggcctt taccagggtc tcagtacagc caccaaggac
1380acctacgacg cccttcacat gcaggccctg ccccctcgc
14192091410DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 209tcctatgagc tgactcagcc accctcagcg
tctgggaccc ccgggcagag ggtcaccatg 60tcttgttctg gaaccagctc caacatcgga
agtcactctg taaactggta ccagcagctc 120ccaggaacgg cccccaaact cctcatctat
actaataatc agcggccctc aggggtccct 180gaccgattct ctggctccaa gtctggcacc
tcagcctccc tggccatcag tggcctccag 240tctgaggatg aggctgatta ttactgtgca
gcatgggatg gcagcctgaa tggtctggta 300ttcggcggag ggaccaagct gaccgtccta
ggttctagag gtggtggtgg tagcggcggc 360ggcggctctg gtggtggtgg atccctcgag
atggccgagg tgcagctggt gcagtctgga 420gcagaggtga aaaagcccgg ggagtctctg
aagatctcct gtaagggttc tggatacagc 480tttaccagct actggatcgg ctgggtgcgc
cagatgcccg ggaaaggcct ggagtggatg 540gggatcatct atcctggtga ctctgatacc
agatacagcc cgtccttcca aggccacgtc 600accatctcag ctgacaagtc catcagcact
gcctacctgc agtggagcag cctgaaggcc 660tcggacaccg ccatgtatta ctgtgcgcgc
tactctggtt ctttcgataa ctggggtcaa 720ggtactctgg tgaccgtctc ctcagcggcc
gcaattgaag ttatgtatcc tcctccttac 780ctagacaatg agaagagcaa tggaaccatt
atccatgtga aagggaaaca cctttgtcca 840agtcccctat ttcccggacc ttctaagccc
ttttgggtgc tggtggtggt tggtggagtc 900ctggcttgct atagcttgct agtaacagtg
gcctttatta ttttctgggt gaggagtaag 960aggagcaggc tcctgcacag tgactacatg
aacatgactc cccgccgccc cgggcccacc 1020cgcaagcatt accagcccta tgccccacca
cgcgacttcg cagcctatcg ctccagagtg 1080aagttcagca ggagcgcaga cgcccccgcg
taccagcagg gccagaacca gctctataac 1140gagctcaatc taggacgaag agaggagtac
gatgttttgg acaagagacg tggccgggac 1200cctgagatgg ggggaaagcc gagaaggaag
aaccctcagg aaggcctgta caatgaactg 1260cagaaagata agatggcgga ggcctacagt
gagattggga tgaaaggcga gcgccggagg 1320ggcaaggggc acgatggcct ttaccagggt
ctcagtacag ccaccaagga cacctacgac 1380gcccttcaca tgcaggccct gccccctcgc
141021015PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 210Gly
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser1 5
10 1521145DNAArtificial
SequenceDescription of Artificial Sequence Synthetic oligonucleotide
211ggtggaggtg gatcaggtgg aggtggatct ggtggaggtg gatct
452125PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 212Gly Gly Gly Gly Ser1 52137PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 213Ser
Gly Gly Ser Gly Gly Ser1 52149PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 214Gly
Gly Gly Gly Ser Gly Gly Gly Ser1 521510PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 215Gly
Gly Gly Gly Ser Gly Gly Gly Gly Ser1 5
1021618PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 216Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly
Gly Gly1 5 10 15Gly
Ser21720PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 217Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly
Gly Gly Ser Gly1 5 10
15Gly Gly Gly Ser 2021825PRTArtificial SequenceDescription of
Artificial Sequence Synthetic peptide 218Gly Gly Gly Gly Ser Gly Gly
Gly Gly Ser Gly Gly Gly Gly Ser Gly1 5 10
15Gly Gly Gly Ser Gly Gly Gly Gly Ser 20
2521930PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 219Gly Gly Gly Gly Ser Gly Gly Gly
Gly Ser Gly Gly Gly Gly Ser Gly1 5 10
15Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
20 25 3022035PRTArtificial
SequenceDescription of Artificial Sequence Synthetic polypeptide
220Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly1
5 10 15Gly Gly Gly Ser Gly Gly
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 20 25
30Gly Gly Ser 3522115PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 221Glu
Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro1 5
10 1522224PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 222Gly
Gly Gly Gly Ser Gly Gly Gly Ser Glu Pro Lys Ser Cys Asp Lys1
5 10 15Thr His Thr Cys Pro Pro Cys
Pro 2022362PRTArtificial SequenceDescription of Artificial
Sequence Synthetic polypeptide 223Glu Leu Lys Thr Pro Leu Gly Asp
Thr Thr His Thr Cys Pro Arg Cys1 5 10
15Pro Glu Pro Lys Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg
Cys Pro 20 25 30Glu Pro Lys
Ser Cys Asp Thr Pro Pro Pro Cys Pro Arg Cys Pro Glu 35
40 45Pro Lys Ser Cys Asp Thr Pro Pro Pro Cys Pro
Arg Cys Pro 50 55
602246PRTArtificial SequenceDescription of Artificial Sequence Synthetic
peptide 224Gly Ser Gly Ser Gly Ser1 52253PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 225Ala
Ala Ala1226235PRTHomo sapiens 226Met Ala Leu Pro Val Thr Ala Leu Leu Leu
Pro Leu Ala Leu Leu Leu1 5 10
15His Ala Ala Arg Pro Ser Gln Phe Arg Val Ser Pro Leu Asp Arg Thr
20 25 30Trp Asn Leu Gly Glu Thr
Val Glu Leu Lys Cys Gln Val Leu Leu Ser 35 40
45Asn Pro Thr Ser Gly Cys Ser Trp Leu Phe Gln Pro Arg Gly
Ala Ala 50 55 60Ala Ser Pro Thr Phe
Leu Leu Tyr Leu Ser Gln Asn Lys Pro Lys Ala65 70
75 80Ala Glu Gly Leu Asp Thr Gln Arg Phe Ser
Gly Lys Arg Leu Gly Asp 85 90
95Thr Phe Val Leu Thr Leu Ser Asp Phe Arg Arg Glu Asn Glu Gly Cys
100 105 110Tyr Phe Cys Ser Ala
Leu Ser Asn Ser Ile Met Tyr Phe Ser His Phe 115
120 125Val Pro Val Phe Leu Pro Ala Lys Pro Thr Thr Thr
Pro Ala Pro Arg 130 135 140Pro Pro Thr
Pro Ala Pro Thr Ile Ala Ser Gln Pro Leu Ser Leu Arg145
150 155 160Pro Glu Ala Cys Arg Pro Ala
Ala Gly Gly Ala Val His Thr Arg Gly 165
170 175Leu Asp Phe Ala Cys Asp Ile Tyr Ile Trp Ala Pro
Leu Ala Gly Thr 180 185 190Cys
Gly Val Leu Leu Leu Ser Leu Val Ile Thr Leu Tyr Cys Asn His 195
200 205Arg Asn Arg Arg Arg Val Cys Lys Cys
Pro Arg Pro Val Val Lys Ser 210 215
220Gly Asp Lys Pro Ser Leu Ser Ala Arg Tyr Val225 230
235227213DNAHomo sapiens 227cccaccacga cgccagcgcc gcgaccacca
accccggcgc ccacgatcgc gtcgcagccc 60ctgtccctgc gcccagaggc gtgccggcca
gcggcggggg gcgcagtgca cacgaggggg 120ctggacttcg cctgtgatat ctacatctgg
gcgcccctgg ccgggacttg tggggtcctt 180ctcctgtcac tggttatcac cctttactgc
aac 213228126DNAHomo sapiens
228aaacggggca gaaagaagct cctgtatata ttcaaacaac catttatgag accagtacaa
60actactcaag aggaagatgg ctgtagctgc cgatttccag aagaagaaga aggaggatgt
120gaactg
1262297671DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 229ccttctctag gcgcccccat atggccatat
gagatcttat atggggcacc cccgcccctt 60gtaaacttcc ctgaccctga catgacaaga
gttactaaca gcccctctct ccaagctcac 120ttacaggctc tctacttagt ccagcacgaa
gtctggagac ctctggcggc agcctaccaa 180gaacaactgg accgaccggt gccgccacca
tggaaaccga caccctgctg ctgtgggtgc 240tgctgctgtg ggtgccagga tccacaggac
tgcctgtgct gactcagcca ccctcagcgt 300ctgggacccc cgggcagagg gtcaccatct
cttgttctgg acgcagttcc aacatcggga 360gtaattctgt taactggtat cgacaactcc
caggagcggc ccccaaactc ctcatctata 420gtaataatca gcggccccca ggggtccctg
tgcgattctc tggctccaag tctggcacct 480cagcctccct ggccatcagt gggctccagt
ctgaagatga ggccacttat tactgtgcaa 540catgggatga caatctgaat gttcactatg
tcttcggaac tgggaccaag gtcaccgtcc 600taggttctag aggtggtggt ggtagcggcg
gcggcggctc tggtggtggt ggatccctcg 660agatggccca ggtgcagctg gtgcagtctg
gggctgaggt gaagaagcct gggtcctcgg 720tgaaggtctc ctgcaaggct tctggaggca
ccttcagcag ctatgctatc agctgggtgc 780gacaggcccc tggacaaggg cttgagtgga
tgggaaggat catccctatc cttggtatag 840caaactacgc acagaagttc cagggcagag
tcacgattac cgcggacaaa tccacgagca 900cagcctacat ggagctgagc agcctgagat
ctgaggacac ggccgtgtat tactgtgcgc 960gcggtggtta ctactctcat gacatgtggt
ctgaagattg gggtcaaggt actctggtga 1020ccgtctcctc agcggccgca cccaccacga
cgccagcgcc gcgaccacca accccggcgc 1080ccacgatcgc gtcgcagccc ctgtccctgc
gcccagaggc gtgccggcca gcggcggggg 1140gcgcagtgca cacgaggggg ctggacttcg
cctgtgatat ctacatctgg gcgcccctgg 1200ccgggacttg tggggtcctt ctcctgtcac
tggttatcac cctttactgc aacaaacggg 1260gcagaaagaa gctcctgtat atattcaaac
aaccatttat gagaccagta caaactactc 1320aagaggaaga tggctgtagc tgccgatttc
cagaagaaga agaaggagga tgtgaactga 1380gagtgaagtt cagcaggagc gcagagcccc
ccgcgtacca gcagggccag aaccagctct 1440ataacgagct caatctagga cgaagagagg
agtacgatgt tttggacaag agacgtggcc 1500gggaccctga gatgggggga aagccgagaa
ggaagaaccc tcaggaaggc ctgtacaatg 1560aactgcagaa agataagatg gcggaggcct
acagtgagat tgggatgaaa ggcgagcgcc 1620ggaggggcaa ggggcacgat ggcctttacc
agggtctcag tacagccacc aaggacacct 1680acgacgccct tcacatgcag gccctgcccc
ctcgctaaca gccactcgag gatccggatt 1740agtccaattt gttaaagaca ggatatcagt
ggtccaggct ctagttttga ctcaacaata 1800tcaccagctg aagcctatag agtacgagcc
atagataaaa taaaagattt tatttagtct 1860ccagaaaaag gggggaatga aagaccccac
ctgtaggttt ggcaagctag cttaagtaac 1920gccattttgc aaggcatgga aaaatacata
actgagaata gagaagttca gatcaaggtc 1980aggaacagat ggaacagctg aatatgggcc
aaacaggata tctgtggtaa gcagttcctg 2040ccccggctca gggccaagaa cagatggaac
agctgaatat gggccaaaca ggatatctgt 2100ggtaagcagt tcctgccccg gctcagggcc
aagaacagat ggtccccaga tgcggtccag 2160ccctcagcag tttctagaga accatcagat
gtttccaggg tgccccaagg acctgaaatg 2220accctgtgcc ttatttgaac taaccaatca
gttcgcttct cgcttctgtt cgcgcgcttc 2280tgctccccga gctcaataaa agagcccaca
acccctcact cggggcgcca gtcctccgat 2340tgactgagtc gcccgggtac ccgtgtatcc
aataaaccct cttgcagttg catccgactt 2400gtggtctcgc tgttccttgg gagggtctcc
tctgagtgat tgactacccg tcagcggggg 2460tctttcacac atgcagcatg tatcaaaatt
aatttggttt tttttcttaa gtatttacat 2520taaatggcca tagtacttaa agttacattg
gcttccttga aataaacatg gagtattcag 2580aatgtgtcat aaatatttct aattttaaga
tagtatctcc attggctttc tactttttct 2640tttatttttt tttgtcctct gtcttccatt
tgttgttgtt gttgtttgtt tgtttgtttg 2700ttggttggtt ggttaatttt tttttaaaga
tcctacacta tagttcaagc tagactatta 2760gctactctgt aacccagggt gaccttgaag
tcatgggtag cctgctgttt tagccttccc 2820acatctaaga ttacaggtat gagctatcat
ttttggtata ttgattgatt gattgattga 2880tgtgtgtgtg tgtgattgtg tttgtgtgtg
tgactgtgaa aatgtgtgta tgggtgtgtg 2940tgaatgtgtg tatgtatgtg tgtgtgtgag
tgtgtgtgtg tgtgtgtgca tgtgtgtgtg 3000tgtgactgtg tctatgtgta tgactgtgtg
tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 3060tgtgtgtgtg ttgtgaaaaa atattctatg
gtagtgagag ccaacgctcc ggctcaggtg 3120tcaggttggt ttttgagaca gagtctttca
cttagcttgg aattcactgg ccgtcgtttt 3180acaacgtcgt gactgggaaa accctggcgt
tacccaactt aatcgccttg cagcacatcc 3240ccctttcgcc agctggcgta atagcgaaga
ggcccgcacc gatcgccctt cccaacagtt 3300gcgcagcctg aatggcgaat ggcgcctgat
gcggtatttt ctccttacgc atctgtgcgg 3360tatttcacac cgcatatggt gcactctcag
tacaatctgc tctgatgccg catagttaag 3420ccagccccga cacccgccaa cacccgctga
cgcgccctga cgggcttgtc tgctcccggc 3480atccgcttac agacaagctg tgaccgtctc
cgggagctgc atgtgtcaga ggttttcacc 3540gtcatcaccg aaacgcgcga tgacgaaagg
gcctcgtgat acgcctattt ttataggtta 3600atgtcatgat aataatggtt tcttagacgt
caggtggcac ttttcgggga aatgtgcgcg 3660gaacccctat ttgtttattt ttctaaatac
attcaaatat gtatccgctc atgagacaat 3720aaccctgata aatgcttcaa taatattgaa
aaaggaagag tatgagtatt caacatttcc 3780gtgtcgccct tattcccttt tttgcggcat
tttgccttcc tgtttttgct cacccagaaa 3840cgctggtgaa agtaaaagat gctgaagatc
agttgggtgc acgagtgggt tacatcgaac 3900tggatctcaa cagcggtaag atccttgaga
gttttcgccc cgaagaacgt tttccaatga 3960tgagcacttt taaagttctg ctatgtggcg
cggtattatc ccgtattgac gccgggcaag 4020agcaactcgg tcgccgcata cactattctc
agaatgactt ggttgagtac tcaccagtca 4080cagaaaagca tcttacggat ggcatgacag
taagagaatt atgcagtgct gccataacca 4140tgagtgataa cactgcggcc aacttacttc
tgacaacgat cggaggaccg aaggagctaa 4200ccgctttttt gcacaacatg ggggatcatg
taactcgcct tgatcgttgg gaaccggagc 4260tgaatgaagc cataccaaac gacgagcgtg
acaccacgat gcctgtagca atggcaacaa 4320cgttgcgcaa actattaact ggcgaactac
ttactctagc ttcccggcaa caattaatag 4380actggatgga ggcggataaa gttgcaggac
cacttctgcg ctcggccctt ccggctggct 4440ggtttattgc tgataaatct ggagccggtg
agcgtgggtc tcgcggtatc attgcagcac 4500tggggccaga tggtaagccc tcccgtatcg
tagttatcta cacgacgggg agtcaggcaa 4560ctatggatga acgaaataga cagatcgctg
agataggtgc ctcactgatt aagcattggt 4620aactgtcaga ccaagtttac tcatatatac
tttagattga tttaaaactt catttttaat 4680ttaaaaggat ctaggtgaag atcctttttg
ataatctcat gaccaaaatc ccttaacgtg 4740agttttcgtt ccactgagcg tcagaccccg
tagaaaagat caaaggatct tcttgagatc 4800ctttttttct gcgcgtaatc tgctgcttgc
aaacaaaaaa accaccgcta ccagcggtgg 4860tttgtttgcc ggatcaagag ctaccaactc
tttttccgaa ggtaactggc ttcagcagag 4920cgcagatacc aaatactgtc cttctagtgt
agccgtagtt aggccaccac ttcaagaact 4980ctgtagcacc gcctacatac ctcgctctgc
taatcctgtt accagtggct gctgccagtg 5040gcgataagtc gtgtcttacc gggttggact
caagacgata gttaccggat aaggcgcagc 5100ggtcgggctg aacggggggt tcgtgcacac
agcccagctt ggagcgaacg acctacaccg 5160aactgagata cctacagcgt gagcattgag
aaagcgccac gcttcccgaa gggagaaagg 5220cggacaggta tccggtaagc ggcagggtcg
gaacaggaga gcgcacgagg gagcttccag 5280ggggaaacgc ctggtatctt tatagtcctg
tcgggtttcg ccacctctga cttgagcgtc 5340gatttttgtg atgctcgtca ggggggcgga
gcctatggaa aaacgccagc aacgcggcct 5400ttttacggtt cctggccttt tgctggcctt
ttgctcacat gttctttcct gcgttatccc 5460ctgattctgt ggataaccgt attaccgcct
ttgagtgagc tgataccgct cgccgcagcc 5520gaacgaccga gcgcagcgag tcagtgagcg
aggaagcgga agagcgccca atacgcaaac 5580cgcctctccc cgcgcgttgg ccgattcatt
aatgcagctg gcacgacagg tttcccgact 5640ggaaagcggg cagtgagcgc aacgcaatta
atgtgagtta gctcactcat taggcacccc 5700aggctttaca ctttatgctt ccggctcgta
tgttgtgtgg aattgtgagc ggataacaat 5760ttcacacagg aaacagctat gaccatgatt
acgccaagct ttgctcttag gagtttccta 5820atacatccca aactcaaata tataaagcat
ttgacttgtt ctatgcccta gggggcgggg 5880ggaagctaag ccagcttttt ttaacattta
aaatgttaat tccattttaa atgcacagat 5940gtttttattt cataagggtt tcaatgtgca
tgaatgctgc aatattcctg ttaccaaagc 6000tagtataaat aaaaatagat aaacgtggaa
attacttaga gtttctgtca ttaacgtttc 6060cttcctcagt tgacaacata aatgcgctgc
tgagcaagcc agtttgcatc tgtcaggatc 6120aatttcccat tatgccagtc atattaatta
ctagtcaatt agttgatttt tatttttgac 6180atatacatgt gaatgaaaga ccccacctgt
aggtttggca agctagctta agtaacgcca 6240ttttgcaagg catggaaaaa tacataactg
agaatagaaa agttcagatc aaggtcagga 6300acagatggaa cagctgaata tgggccaaac
aggatatctg tggtaagcag ttcctgcccc 6360ggctcagggc caagaacaga tggaacagct
gaatatgggc caaacaggat atctgtggta 6420agcagttcct gccccggctc agggccaaga
acagatggtc cccagatgcg gtccagccct 6480cagcagtttc tagagaacca tcagatgttt
ccagggtgcc ccaaggacct gaaatgaccc 6540tgtgccttat ttgaactaac caatcagttc
gcttctcgct tctgttcgcg cgcttatgct 6600ccccgagctc aataaaagag cccacaaccc
ctcactcggg gcgccagtcc tccgattgac 6660tgagtcgccc gggtacccgt gtatccaata
aaccctcttg cagttgcatc cgacttgtgg 6720tctcgctgtt ccttgggagg gtctcctctg
agtgattgac tacccgtcag cgggggtctt 6780tcatttgggg gctcgtccgg gatcgggaga
cccctgccca gggaccaccg acccaccacc 6840gggaggtaag ctggccagca acttatctgt
gtctgtccga ttgtctagtg tctatgactg 6900attttatgcg cctgcgtcgg tactagttag
ctaactagct ctgtatctgg cggacccgtg 6960gtggaactga cgagttcgga acacccggcc
gcaaccctgg gagacgtccc agggacttcg 7020ggggccgttt ttgtggcccg acctgagtcc
taaaatcccg atcgtttagg actctttggt 7080gcacccccct tagaggaggg atatgtggtt
ctggtaggag acgagaacct aaaacagttc 7140ccgcctccgt ctgaattttt gctttcggtt
tgggaccgaa gccgcgccgc gcgtcttgtc 7200tgctgcagca tcgttctgtg ttgtctctgt
ctgactgtgt ttctgtattt gtctgaaaat 7260atgggcccgg gctagactgt taccactccc
ttaagtttga ccttaggtca ctggaaagat 7320gtcgagcgga tcgctcacaa ccagtcggta
gatgtcaaga agagacgttg ggttaccttc 7380tgctctgcag aatggccaac ctttaacgtc
ggatggccgc gagacggcac ctttaaccga 7440gacctcatca cccaggttaa gatcaaggtc
ttttcacctg gcccgcatgg acacccagac 7500caggtcccct acatcgtgac ctgggaagcc
ttggcttttg acccccctcc ctgggtcaag 7560ccctttgtac accctaagcc tccgcctcct
cttcctccat ccgccccgtc tctccccctt 7620gaacctcctc gttcgacccc gcctcgatcc
tccctttatc cagccctcac t 76712307653DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
230ggccctctag gcgcccccat atggccatat gagatcttat atggggcacc cccgcccctt
60gtaaacttcc ctgaccctga catgacaaga gttactaaca gcccctctct ccaagctcac
120ttacaggctc tctacttagt ccagcacgaa gtctggagac ctctggcggc agcctaccaa
180gaacaactgg accgaccggt gccgccacca tggaaaccga caccctgctg ctgtgggtgc
240tgctgctgtg ggtgccagga tccacaggat cctatgagct gactcagcca ccctcagcgt
300ctgggacccc cgggcagagg gtcaccatgt cttgttctgg aaccagctcc aacatcggaa
360gtcactctgt aaactggtac cagcagctcc caggaacggc ccccaaactc ctcatctata
420ctaataatca gcggccctca ggggtccctg accgattctc tggctccaag tctggcacct
480cagcctccct ggccatcagt ggcctccagt ctgaggatga ggctgattat tactgtgcag
540catgggatgg cagcctgaat ggtctggtat tcggcggagg gaccaagctg accgtcctag
600gttctagagg tggtggtggt agcggcggcg gcggctctgg tggtggtgga tccctcgaga
660tggccgaggt gcagctggtg cagtctggag cagaggtgaa aaagcccggg gagtctctga
720agatctcctg taagggttct ggatacagct ttaccagcta ctggatcggc tgggtgcgcc
780agatgcccgg gaaaggcctg gagtggatgg ggatcatcta tcctggtgac tctgatacca
840gatacagccc gtccttccaa ggccacgtca ccatctcagc tgacaagtcc atcagcactg
900cctacctgca gtggagcagc ctgaaggcct cggacaccgc catgtattac tgtgcgcgct
960actctggttc tttcgataac tggggtcaag gtactctggt gaccgtctcc tcagcggccg
1020cacccaccac gacgccagcg ccgcgaccac caaccccggc gcccacgatc gcgtcgcagc
1080ccctgtccct gcgcccagag gcgtgccggc cagcggcggg gggcgcagtg cacacgaggg
1140ggctggactt cgcctgtgat atctacatct gggcgcccct ggccgggact tgtggggtcc
1200ttctcctgtc actggttatc accctttact gcaacaaacg gggcagaaag aagctcctgt
1260atatattcaa acaaccattt atgagaccag tacaaactac tcaagaggaa gatggctgta
1320gctgccgatt tccagaagaa gaagaaggag gatgtgaact gagagtgaag ttcagcagga
1380gcgcagagcc ccccgcgtac cagcagggcc agaaccagct ctataacgag ctcaatctag
1440gacgaagaga ggagtacgat gttttggaca agagacgtgg ccgggaccct gagatggggg
1500gaaagccgag aaggaagaac cctcaggaag gcctgtacaa tgaactgcag aaagataaga
1560tggcggaggc ctacagtgag attgggatga aaggcgagcg ccggaggggc aaggggcacg
1620atggccttta ccagggtctc agtacagcca ccaaggacac ctacgacgcc cttcacatgc
1680aggccctgcc ccctcgctaa cagccactcg aggatccgga ttagtccaat ttgttaaaga
1740caggatatca gtggtccagg ctctagtttt gactcaacaa tatcaccagc tgaagcctat
1800agagtacgag ccatagataa aataaaagat tttatttagt ctccagaaaa aggggggaat
1860gaaagacccc acctgtaggt ttggcaagct agcttaagta acgccatttt gcaaggcatg
1920gaaaaataca taactgagaa tagagaagtt cagatcaagg tcaggaacag atggaacagc
1980tgaatatggg ccaaacagga tatctgtggt aagcagttcc tgccccggct cagggccaag
2040aacagatgga acagctgaat atgggccaaa caggatatct gtggtaagca gttcctgccc
2100cggctcaggg ccaagaacag atggtcccca gatgcggtcc agccctcagc agtttctaga
2160gaaccatcag atgtttccag ggtgccccaa ggacctgaaa tgaccctgtg ccttatttga
2220actaaccaat cagttcgctt ctcgcttctg ttcgcgcgct tctgctcccc gagctcaata
2280aaagagccca caacccctca ctcggggcgc cagtcctccg attgactgag tcgcccgggt
2340acccgtgtat ccaataaacc ctcttgcagt tgcatccgac ttgtggtctc gctgttcctt
2400gggagggtct cctctgagtg attgactacc cgtcagcggg ggtctttcac acatgcagca
2460tgtatcaaaa ttaatttggt tttttttctt aagtatttac attaaatggc catagtactt
2520aaagttacat tggcttcctt gaaataaaca tggagtattc agaatgtgtc ataaatattt
2580ctaattttaa gatagtatct ccattggctt tctacttttt cttttatttt tttttgtcct
2640ctgtcttcca tttgttgttg ttgttgtttg tttgtttgtt tgttggttgg ttggttaatt
2700tttttttaaa gatcctacac tatagttcaa gctagactat tagctactct gtaacccagg
2760gtgaccttga agtcatgggt agcctgctgt tttagccttc ccacatctaa gattacaggt
2820atgagctatc atttttggta tattgattga ttgattgatt gatgtgtgtg tgtgtgattg
2880tgtttgtgtg tgtgactgtg aaaatgtgtg tatgggtgtg tgtgaatgtg tgtatgtatg
2940tgtgtgtgtg agtgtgtgtg tgtgtgtgtg catgtgtgtg tgtgtgactg tgtctatgtg
3000tatgactgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgttgtgaaa
3060aaatattcta tggtagtgag agccaacgct ccggctcagg tgtcaggttg gtttttgaga
3120cagagtcttt cacttagctt ggaattcact ggccgtcgtt ttacaacgtc gtgactggga
3180aaaccctggc gttacccaac ttaatcgcct tgcagcacat ccccctttcg ccagctggcg
3240taatagcgaa gaggcccgca ccgatcgccc ttcccaacag ttgcgcagcc tgaatggcga
3300atggcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac accgcatatg
3360gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagcccc gacacccgcc
3420aacacccgct gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc
3480tgtgaccgtc tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc
3540gatgacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg
3600tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat
3660ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc
3720aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct
3780tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag
3840atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta
3900agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc
3960tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca
4020tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg
4080atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg
4140ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca
4200tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa
4260acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa
4320ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg gaggcggata
4380aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat
4440ctggagccgg tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc
4500cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata
4560gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt
4620actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga
4680agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag
4740cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa
4800tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag
4860agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg
4920tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat
4980acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta
5040ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg
5100gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc
5160gtgagcattg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa
5220gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc
5280tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt
5340caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct
5400tttgctggcc ttttgctcac atgttctttc ctgcgttatc ccctgattct gtggataacc
5460gtattaccgc ctttgagtga gctgataccg ctcgccgcag ccgaacgacc gagcgcagcg
5520agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt
5580ggccgattca ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc
5640gcaacgcaat taatgtgagt tagctcactc attaggcacc ccaggcttta cactttatgc
5700ttccggctcg tatgttgtgt ggaattgtga gcggataaca atttcacaca ggaaacagct
5760atgaccatga ttacgccaag ctttgctctt aggagtttcc taatacatcc caaactcaaa
5820tatataaagc atttgacttg ttctatgccc tagggggcgg ggggaagcta agccagcttt
5880ttttaacatt taaaatgtta attccatttt aaatgcacag atgtttttat ttcataaggg
5940tttcaatgtg catgaatgct gcaatattcc tgttaccaaa gctagtataa ataaaaatag
6000ataaacgtgg aaattactta gagtttctgt cattaacgtt tccttcctca gttgacaaca
6060taaatgcgct gctgagcaag ccagtttgca tctgtcagga tcaatttccc attatgccag
6120tcatattaat tactagtcaa ttagttgatt tttatttttg acatatacat gtgaatgaaa
6180gaccccacct gtaggtttgg caagctagct taagtaacgc cattttgcaa ggcatggaaa
6240aatacataac tgagaataga aaagttcaga tcaaggtcag gaacagatgg aacagctgaa
6300tatgggccaa acaggatatc tgtggtaagc agttcctgcc ccggctcagg gccaagaaca
6360gatggaacag ctgaatatgg gccaaacagg atatctgtgg taagcagttc ctgccccggc
6420tcagggccaa gaacagatgg tccccagatg cggtccagcc ctcagcagtt tctagagaac
6480catcagatgt ttccagggtg ccccaaggac ctgaaatgac cctgtgcctt atttgaacta
6540accaatcagt tcgcttctcg cttctgttcg cgcgcttatg ctccccgagc tcaataaaag
6600agcccacaac ccctcactcg gggcgccagt cctccgattg actgagtcgc ccgggtaccc
6660gtgtatccaa taaaccctct tgcagttgca tccgacttgt ggtctcgctg ttccttggga
6720gggtctcctc tgagtgattg actacccgtc agcgggggtc tttcatttgg gggctcgtcc
6780gggatcggga gacccctgcc cagggaccac cgacccacca ccgggaggta agctggccag
6840caacttatct gtgtctgtcc gattgtctag tgtctatgac tgattttatg cgcctgcgtc
6900ggtactagtt agctaactag ctctgtatct ggcggacccg tggtggaact gacgagttcg
6960gaacacccgg ccgcaaccct gggagacgtc ccagggactt cgggggccgt ttttgtggcc
7020cgacctgagt cctaaaatcc cgatcgttta ggactctttg gtgcaccccc cttagaggag
7080ggatatgtgg ttctggtagg agacgagaac ctaaaacagt tcccgcctcc gtctgaattt
7140ttgctttcgg tttgggaccg aagccgcgcc gcgcgtcttg tctgctgcag catcgttctg
7200tgttgtctct gtctgactgt gtttctgtat ttgtctgaaa atatgggccc gggctagact
7260gttaccactc ccttaagttt gaccttaggt cactggaaag atgtcgagcg gatcgctcac
7320aaccagtcgg tagatgtcaa gaagagacgt tgggttacct tctgctctgc agaatggcca
7380acctttaacg tcggatggcc gcgagacggc acctttaacc gagacctcat cacccaggtt
7440aagatcaagg tcttttcacc tggcccgcat ggacacccag accaggtccc ctacatcgtg
7500acctgggaag ccttggcttt tgacccccct ccctgggtca agccctttgt acaccctaag
7560cctccgcctc ctcttcctcc atccgccccg tctctccccc ttgaacctcc tcgttcgacc
7620ccgcctcgat cctcccttta tccagccctc act
76532317668DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 231ccttctctag gcgcccccat atggccatat
gagatcttat atggggcacc cccgcccctt 60gtaaacttcc ctgaccctga catgacaaga
gttactaaca gcccctctct ccaagctcac 120ttacaggctc tctacttagt ccagcacgaa
gtctggagac ctctggcggc agcctaccaa 180gaacaactgg accgaccggt gccgccacca
tggaaaccga caccctgctg ctgtgggtgc 240tgctgctgtg ggtgccagga tccacaggac
aggctgtgct gactcagcca ccctcagcgt 300ctgggacccc cgggcagagg gtcaccatct
cttgttctgg aagcagctcc aacatcggaa 360gtaattacgt attctggtac cagcagctcc
caggaacggc ccccaaactc ctcatctata 420gtaataatca gcggccctca ggggtccctg
accgattctc tggctccaag tctggcacct 480cagcctccct ggccatcagt gggctccggt
ccgaggatga ggctgattat tactgtgcag 540catgggatga cagcctgagt gcctcttatg
ttttcggaac tgggaccaag gtcaccgtcc 600taggttctag aggtggtggt ggtagcggcg
gcggcggctc tggtggtggt ggatccctcg 660agatggccca ggtgcagctg gtgcagtctg
gggctgaggt gaagaagcct gggtcctcgg 720tgaaggtctc ctgcaaggct tctggaggca
ccttcagcag ctatgctatc agctgggtgc 780gacaggcccc tggacaaggg cttgagtgga
tgggaaggat catccctatc cttggtacag 840caaactacgc acagaagttc cagggcagag
tcacgattac cgcggacgaa tccacgagca 900cagcctacat ggagctgagc agcctgagat
ctgaggacac ggccgtgtat tactgtgcgc 960gctctggtta cggttcttac cgttgggaag
attcttgggg tcaaggtact ctggtgaccg 1020tctcctcagc ggccgcaccc accacgacgc
cagcgccgcg accaccaacc ccggcgccca 1080cgatcgcgtc gcagcccctg tccctgcgcc
cagaggcgtg ccggccagcg gcggggggcg 1140cagtgcacac gagggggctg gacttcgcct
gtgatatcta catctgggcg cccctggccg 1200ggacttgtgg ggtccttctc ctgtcactgg
ttatcaccct ttactgcaac aaacggggca 1260gaaagaagct cctgtatata ttcaaacaac
catttatgag accagtacaa actactcaag 1320aggaagatgg ctgtagctgc cgatttccag
aagaagaaga aggaggatgt gaactgagag 1380tgaagttcag caggagcgca gagccccccg
cgtaccagca gggccagaac cagctctata 1440acgagctcaa tctaggacga agagaggagt
acgatgtttt ggacaagaga cgtggccggg 1500accctgagat ggggggaaag ccgagaagga
agaaccctca ggaaggcctg tacaatgaac 1560tgcagaaaga taagatggcg gaggcctaca
gtgagattgg gatgaaaggc gagcgccgga 1620ggggcaaggg gcacgatggc ctttaccagg
gtctcagtac agccaccaag gacacctacg 1680acgcccttca catgcaggcc ctgccccctc
gctaacagcc actcgaggat ccggattagt 1740ccaatttgtt aaagacagga tatcagtggt
ccaggctcta gttttgactc aacaatatca 1800ccagctgaag cctatagagt acgagccata
gataaaataa aagattttat ttagtctcca 1860gaaaaagggg ggaatgaaag accccacctg
taggtttggc aagctagctt aagtaacgcc 1920attttgcaag gcatggaaaa atacataact
gagaatagag aagttcagat caaggtcagg 1980aacagatgga acagctgaat atgggccaaa
caggatatct gtggtaagca gttcctgccc 2040cggctcaggg ccaagaacag atggaacagc
tgaatatggg ccaaacagga tatctgtggt 2100aagcagttcc tgccccggct cagggccaag
aacagatggt ccccagatgc ggtccagccc 2160tcagcagttt ctagagaacc atcagatgtt
tccagggtgc cccaaggacc tgaaatgacc 2220ctgtgcctta tttgaactaa ccaatcagtt
cgcttctcgc ttctgttcgc gcgcttctgc 2280tccccgagct caataaaaga gcccacaacc
cctcactcgg ggcgccagtc ctccgattga 2340ctgagtcgcc cgggtacccg tgtatccaat
aaaccctctt gcagttgcat ccgacttgtg 2400gtctcgctgt tccttgggag ggtctcctct
gagtgattga ctacccgtca gcgggggtct 2460ttcacacatg cagcatgtat caaaattaat
ttggtttttt ttcttaagta tttacattaa 2520atggccatag tacttaaagt tacattggct
tccttgaaat aaacatggag tattcagaat 2580gtgtcataaa tatttctaat tttaagatag
tatctccatt ggctttctac tttttctttt 2640attttttttt gtcctctgtc ttccatttgt
tgttgttgtt gtttgtttgt ttgtttgttg 2700gttggttggt taattttttt ttaaagatcc
tacactatag ttcaagctag actattagct 2760actctgtaac ccagggtgac cttgaagtca
tgggtagcct gctgttttag ccttcccaca 2820tctaagatta caggtatgag ctatcatttt
tggtatattg attgattgat tgattgatgt 2880gtgtgtgtgt gattgtgttt gtgtgtgtga
ctgtgaaaat gtgtgtatgg gtgtgtgtga 2940atgtgtgtat gtatgtgtgt gtgtgagtgt
gtgtgtgtgt gtgtgcatgt gtgtgtgtgt 3000gactgtgtct atgtgtatga ctgtgtgtgt
gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 3060gtgtgtgttg tgaaaaaata ttctatggta
gtgagagcca acgctccggc tcaggtgtca 3120ggttggtttt tgagacagag tctttcactt
agcttggaat tcactggccg tcgttttaca 3180acgtcgtgac tgggaaaacc ctggcgttac
ccaacttaat cgccttgcag cacatccccc 3240tttcgccagc tggcgtaata gcgaagaggc
ccgcaccgat cgcccttccc aacagttgcg 3300cagcctgaat ggcgaatggc gcctgatgcg
gtattttctc cttacgcatc tgtgcggtat 3360ttcacaccgc atatggtgca ctctcagtac
aatctgctct gatgccgcat agttaagcca 3420gccccgacac ccgccaacac ccgctgacgc
gccctgacgg gcttgtctgc tcccggcatc 3480cgcttacaga caagctgtga ccgtctccgg
gagctgcatg tgtcagaggt tttcaccgtc 3540atcaccgaaa cgcgcgatga cgaaagggcc
tcgtgatacg cctattttta taggttaatg 3600tcatgataat aatggtttct tagacgtcag
gtggcacttt tcggggaaat gtgcgcggaa 3660cccctatttg tttatttttc taaatacatt
caaatatgta tccgctcatg agacaataac 3720cctgataaat gcttcaataa tattgaaaaa
ggaagagtat gagtattcaa catttccgtg 3780tcgcccttat tccctttttt gcggcatttt
gccttcctgt ttttgctcac ccagaaacgc 3840tggtgaaagt aaaagatgct gaagatcagt
tgggtgcacg agtgggttac atcgaactgg 3900atctcaacag cggtaagatc cttgagagtt
ttcgccccga agaacgtttt ccaatgatga 3960gcacttttaa agttctgcta tgtggcgcgg
tattatcccg tattgacgcc gggcaagagc 4020aactcggtcg ccgcatacac tattctcaga
atgacttggt tgagtactca ccagtcacag 4080aaaagcatct tacggatggc atgacagtaa
gagaattatg cagtgctgcc ataaccatga 4140gtgataacac tgcggccaac ttacttctga
caacgatcgg aggaccgaag gagctaaccg 4200cttttttgca caacatgggg gatcatgtaa
ctcgccttga tcgttgggaa ccggagctga 4260atgaagccat accaaacgac gagcgtgaca
ccacgatgcc tgtagcaatg gcaacaacgt 4320tgcgcaaact attaactggc gaactactta
ctctagcttc ccggcaacaa ttaatagact 4380ggatggaggc ggataaagtt gcaggaccac
ttctgcgctc ggcccttccg gctggctggt 4440ttattgctga taaatctgga gccggtgagc
gtgggtctcg cggtatcatt gcagcactgg 4500ggccagatgg taagccctcc cgtatcgtag
ttatctacac gacggggagt caggcaacta 4560tggatgaacg aaatagacag atcgctgaga
taggtgcctc actgattaag cattggtaac 4620tgtcagacca agtttactca tatatacttt
agattgattt aaaacttcat ttttaattta 4680aaaggatcta ggtgaagatc ctttttgata
atctcatgac caaaatccct taacgtgagt 4740tttcgttcca ctgagcgtca gaccccgtag
aaaagatcaa aggatcttct tgagatcctt 4800tttttctgcg cgtaatctgc tgcttgcaaa
caaaaaaacc accgctacca gcggtggttt 4860gtttgccgga tcaagagcta ccaactcttt
ttccgaaggt aactggcttc agcagagcgc 4920agataccaaa tactgtcctt ctagtgtagc
cgtagttagg ccaccacttc aagaactctg 4980tagcaccgcc tacatacctc gctctgctaa
tcctgttacc agtggctgct gccagtggcg 5040ataagtcgtg tcttaccggg ttggactcaa
gacgatagtt accggataag gcgcagcggt 5100cgggctgaac ggggggttcg tgcacacagc
ccagcttgga gcgaacgacc tacaccgaac 5160tgagatacct acagcgtgag cattgagaaa
gcgccacgct tcccgaaggg agaaaggcgg 5220acaggtatcc ggtaagcggc agggtcggaa
caggagagcg cacgagggag cttccagggg 5280gaaacgcctg gtatctttat agtcctgtcg
ggtttcgcca cctctgactt gagcgtcgat 5340ttttgtgatg ctcgtcaggg gggcggagcc
tatggaaaaa cgccagcaac gcggcctttt 5400tacggttcct ggccttttgc tggccttttg
ctcacatgtt ctttcctgcg ttatcccctg 5460attctgtgga taaccgtatt accgcctttg
agtgagctga taccgctcgc cgcagccgaa 5520cgaccgagcg cagcgagtca gtgagcgagg
aagcggaaga gcgcccaata cgcaaaccgc 5580ctctccccgc gcgttggccg attcattaat
gcagctggca cgacaggttt cccgactgga 5640aagcgggcag tgagcgcaac gcaattaatg
tgagttagct cactcattag gcaccccagg 5700ctttacactt tatgcttccg gctcgtatgt
tgtgtggaat tgtgagcgga taacaatttc 5760acacaggaaa cagctatgac catgattacg
ccaagctttg ctcttaggag tttcctaata 5820catcccaaac tcaaatatat aaagcatttg
acttgttcta tgccctaggg ggcgggggga 5880agctaagcca gcttttttta acatttaaaa
tgttaattcc attttaaatg cacagatgtt 5940tttatttcat aagggtttca atgtgcatga
atgctgcaat attcctgtta ccaaagctag 6000tataaataaa aatagataaa cgtggaaatt
acttagagtt tctgtcatta acgtttcctt 6060cctcagttga caacataaat gcgctgctga
gcaagccagt ttgcatctgt caggatcaat 6120ttcccattat gccagtcata ttaattacta
gtcaattagt tgatttttat ttttgacata 6180tacatgtgaa tgaaagaccc cacctgtagg
tttggcaagc tagcttaagt aacgccattt 6240tgcaaggcat ggaaaaatac ataactgaga
atagaaaagt tcagatcaag gtcaggaaca 6300gatggaacag ctgaatatgg gccaaacagg
atatctgtgg taagcagttc ctgccccggc 6360tcagggccaa gaacagatgg aacagctgaa
tatgggccaa acaggatatc tgtggtaagc 6420agttcctgcc ccggctcagg gccaagaaca
gatggtcccc agatgcggtc cagccctcag 6480cagtttctag agaaccatca gatgtttcca
gggtgcccca aggacctgaa atgaccctgt 6540gccttatttg aactaaccaa tcagttcgct
tctcgcttct gttcgcgcgc ttatgctccc 6600cgagctcaat aaaagagccc acaacccctc
actcggggcg ccagtcctcc gattgactga 6660gtcgcccggg tacccgtgta tccaataaac
cctcttgcag ttgcatccga cttgtggtct 6720cgctgttcct tgggagggtc tcctctgagt
gattgactac ccgtcagcgg gggtctttca 6780tttgggggct cgtccgggat cgggagaccc
ctgcccaggg accaccgacc caccaccggg 6840aggtaagctg gccagcaact tatctgtgtc
tgtccgattg tctagtgtct atgactgatt 6900ttatgcgcct gcgtcggtac tagttagcta
actagctctg tatctggcgg acccgtggtg 6960gaactgacga gttcggaaca cccggccgca
accctgggag acgtcccagg gacttcgggg 7020gccgtttttg tggcccgacc tgagtcctaa
aatcccgatc gtttaggact ctttggtgca 7080ccccccttag aggagggata tgtggttctg
gtaggagacg agaacctaaa acagttcccg 7140cctccgtctg aatttttgct ttcggtttgg
gaccgaagcc gcgccgcgcg tcttgtctgc 7200tgcagcatcg ttctgtgttg tctctgtctg
actgtgtttc tgtatttgtc tgaaaatatg 7260ggcccgggct agactgttac cactccctta
agtttgacct taggtcactg gaaagatgtc 7320gagcggatcg ctcacaacca gtcggtagat
gtcaagaaga gacgttgggt taccttctgc 7380tctgcagaat ggccaacctt taacgtcgga
tggccgcgag acggcacctt taaccgagac 7440ctcatcaccc aggttaagat caaggtcttt
tcacctggcc cgcatggaca cccagaccag 7500gtcccctaca tcgtgacctg ggaagccttg
gcttttgacc cccctccctg ggtcaagccc 7560tttgtacacc ctaagcctcc gcctcctctt
cctccatccg ccccgtctct cccccttgaa 7620cctcctcgtt cgaccccgcc tcgatcctcc
ctttatccag ccctcact 76682327662DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
232ccttctctag gcgcccccat atggccatat gagatcttat atggggcacc cccgcccctt
60gtaaacttcc ctgaccctga catgacaaga gttactaaca gcccctctct ccaagctcac
120ttacaggctc tctacttagt ccagcacgaa gtctggagac ctctggcggc agcctaccaa
180gaacaactgg accgaccggt gccgccacca tggaaaccga caccctgctg ctgtgggtgc
240tgctgctgtg ggtgccagga tccacaggac agtctgtgct gacgcagccg ccctcagtgt
300ctggggcccc agggcagagg gtcaccatct cctgcactgg gagcagctcc aacatcgggg
360caggttttga tgtacactgg taccagcagc ttccaggaac agcccccaaa ctcctcatct
420atggtaacag caatcggccc tcaggggtcc ctgaccgatt ctctggctcc aagtctggca
480cctcagcctc cctggccatc actgggctcc aggctgagga tgaggctgat tattactgcc
540agtcctatga cagcagcctg agtggttatg tcttcggaac tgggaccaag gtcaccgtcc
600taggttctag aggtggtggt ggtagcggcg gcggcggctc tggtggtggt ggatccctcg
660agatggccca ggtccagctg gtacagtctg gggctgaggt gaagaagcct ggggcctcag
720tgaaggtctc ctgcaaggct tctggataca ccttcaccga ctactatatg cactgggtgc
780gacaggcccc tggacaacgg cttgagtgga tgggatggat caaccctaac agtggtggca
840caaactatgc acagaagttt caggacagga tcaccgtgac cagggacacc tccagcaaca
900caggctacat ggagctgacc aggctgagat ctgacgacac ggccgtgtat tactgtgcgc
960gctctccgta ctctggtgtt ctggataaat ggggtcaagg tactctggtg accgtctcct
1020cagcggccgc acccaccacg acgccagcgc cgcgaccacc aaccccggcg cccacgatcg
1080cgtcgcagcc cctgtccctg cgcccagagg cgtgccggcc agcggcgggg ggcgcagtgc
1140acacgagggg gctggacttc gcctgtgata tctacatctg ggcgcccctg gccgggactt
1200gtggggtcct tctcctgtca ctggttatca ccctttactg caacaaacgg ggcagaaaga
1260agctcctgta tatattcaaa caaccattta tgagaccagt acaaactact caagaggaag
1320atggctgtag ctgccgattt ccagaagaag aagaaggagg atgtgaactg agagtgaagt
1380tcagcaggag cgcagagccc cccgcgtacc agcagggcca gaaccagctc tataacgagc
1440tcaatctagg acgaagagag gagtacgatg ttttggacaa gagacgtggc cgggaccctg
1500agatgggggg aaagccgaga aggaagaacc ctcaggaagg cctgtacaat gaactgcaga
1560aagataagat ggcggaggcc tacagtgaga ttgggatgaa aggcgagcgc cggaggggca
1620aggggcacga tggcctttac cagggtctca gtacagccac caaggacacc tacgacgccc
1680ttcacatgca ggccctgccc cctcgctaac agccactcga ggatccggat tagtccaatt
1740tgttaaagac aggatatcag tggtccaggc tctagttttg actcaacaat atcaccagct
1800gaagcctata gagtacgagc catagataaa ataaaagatt ttatttagtc tccagaaaaa
1860ggggggaatg aaagacccca cctgtaggtt tggcaagcta gcttaagtaa cgccattttg
1920caaggcatgg aaaaatacat aactgagaat agagaagttc agatcaaggt caggaacaga
1980tggaacagct gaatatgggc caaacaggat atctgtggta agcagttcct gccccggctc
2040agggccaaga acagatggaa cagctgaata tgggccaaac aggatatctg tggtaagcag
2100ttcctgcccc ggctcagggc caagaacaga tggtccccag atgcggtcca gccctcagca
2160gtttctagag aaccatcaga tgtttccagg gtgccccaag gacctgaaat gaccctgtgc
2220cttatttgaa ctaaccaatc agttcgcttc tcgcttctgt tcgcgcgctt ctgctccccg
2280agctcaataa aagagcccac aacccctcac tcggggcgcc agtcctccga ttgactgagt
2340cgcccgggta cccgtgtatc caataaaccc tcttgcagtt gcatccgact tgtggtctcg
2400ctgttccttg ggagggtctc ctctgagtga ttgactaccc gtcagcgggg gtctttcaca
2460catgcagcat gtatcaaaat taatttggtt ttttttctta agtatttaca ttaaatggcc
2520atagtactta aagttacatt ggcttccttg aaataaacat ggagtattca gaatgtgtca
2580taaatatttc taattttaag atagtatctc cattggcttt ctactttttc ttttattttt
2640ttttgtcctc tgtcttccat ttgttgttgt tgttgtttgt ttgtttgttt gttggttggt
2700tggttaattt ttttttaaag atcctacact atagttcaag ctagactatt agctactctg
2760taacccaggg tgaccttgaa gtcatgggta gcctgctgtt ttagccttcc cacatctaag
2820attacaggta tgagctatca tttttggtat attgattgat tgattgattg atgtgtgtgt
2880gtgtgattgt gtttgtgtgt gtgactgtga aaatgtgtgt atgggtgtgt gtgaatgtgt
2940gtatgtatgt gtgtgtgtga gtgtgtgtgt gtgtgtgtgc atgtgtgtgt gtgtgactgt
3000gtctatgtgt atgactgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt
3060gttgtgaaaa aatattctat ggtagtgaga gccaacgctc cggctcaggt gtcaggttgg
3120tttttgagac agagtctttc acttagcttg gaattcactg gccgtcgttt tacaacgtcg
3180tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc
3240cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct
3300gaatggcgaa tggcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca
3360ccgcatatgg tgcactctca gtacaatctg ctctgatgcc gcatagttaa gccagccccg
3420acacccgcca acacccgctg acgcgccctg acgggcttgt ctgctcccgg catccgctta
3480cagacaagct gtgaccgtct ccgggagctg catgtgtcag aggttttcac cgtcatcacc
3540gaaacgcgcg atgacgaaag ggcctcgtga tacgcctatt tttataggtt aatgtcatga
3600taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta
3660tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat
3720aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc
3780ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga
3840aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca
3900acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt
3960ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa gagcaactcg
4020gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc
4080atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata
4140acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt
4200tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag
4260ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca acgttgcgca
4320aactattaac tggcgaacta cttactctag cttcccggca acaattaata gactggatgg
4380aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg
4440ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag
4500atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca actatggatg
4560aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag
4620accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga
4680tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt gagttttcgt
4740tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat cctttttttc
4800tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg gtttgtttgc
4860cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga gcgcagatac
4920caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac tctgtagcac
4980cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt ggcgataagt
5040cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag cggtcgggct
5100gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc gaactgagat
5160acctacagcg tgagcattga gaaagcgcca cgcttcccga agggagaaag gcggacaggt
5220atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg
5280cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt
5340gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc tttttacggt
5400tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc cctgattctg
5460tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg
5520agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa ccgcctctcc
5580ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac tggaaagcgg
5640gcagtgagcg caacgcaatt aatgtgagtt agctcactca ttaggcaccc caggctttac
5700actttatgct tccggctcgt atgttgtgtg gaattgtgag cggataacaa tttcacacag
5760gaaacagcta tgaccatgat tacgccaagc tttgctctta ggagtttcct aatacatccc
5820aaactcaaat atataaagca tttgacttgt tctatgccct agggggcggg gggaagctaa
5880gccagctttt tttaacattt aaaatgttaa ttccatttta aatgcacaga tgtttttatt
5940tcataagggt ttcaatgtgc atgaatgctg caatattcct gttaccaaag ctagtataaa
6000taaaaataga taaacgtgga aattacttag agtttctgtc attaacgttt ccttcctcag
6060ttgacaacat aaatgcgctg ctgagcaagc cagtttgcat ctgtcaggat caatttccca
6120ttatgccagt catattaatt actagtcaat tagttgattt ttatttttga catatacatg
6180tgaatgaaag accccacctg taggtttggc aagctagctt aagtaacgcc attttgcaag
6240gcatggaaaa atacataact gagaatagaa aagttcagat caaggtcagg aacagatgga
6300acagctgaat atgggccaaa caggatatct gtggtaagca gttcctgccc cggctcaggg
6360ccaagaacag atggaacagc tgaatatggg ccaaacagga tatctgtggt aagcagttcc
6420tgccccggct cagggccaag aacagatggt ccccagatgc ggtccagccc tcagcagttt
6480ctagagaacc atcagatgtt tccagggtgc cccaaggacc tgaaatgacc ctgtgcctta
6540tttgaactaa ccaatcagtt cgcttctcgc ttctgttcgc gcgcttatgc tccccgagct
6600caataaaaga gcccacaacc cctcactcgg ggcgccagtc ctccgattga ctgagtcgcc
6660cgggtacccg tgtatccaat aaaccctctt gcagttgcat ccgacttgtg gtctcgctgt
6720tccttgggag ggtctcctct gagtgattga ctacccgtca gcgggggtct ttcatttggg
6780ggctcgtccg ggatcgggag acccctgccc agggaccacc gacccaccac cgggaggtaa
6840gctggccagc aacttatctg tgtctgtccg attgtctagt gtctatgact gattttatgc
6900gcctgcgtcg gtactagtta gctaactagc tctgtatctg gcggacccgt ggtggaactg
6960acgagttcgg aacacccggc cgcaaccctg ggagacgtcc cagggacttc gggggccgtt
7020tttgtggccc gacctgagtc ctaaaatccc gatcgtttag gactctttgg tgcacccccc
7080ttagaggagg gatatgtggt tctggtagga gacgagaacc taaaacagtt cccgcctccg
7140tctgaatttt tgctttcggt ttgggaccga agccgcgccg cgcgtcttgt ctgctgcagc
7200atcgttctgt gttgtctctg tctgactgtg tttctgtatt tgtctgaaaa tatgggcccg
7260ggctagactg ttaccactcc cttaagtttg accttaggtc actggaaaga tgtcgagcgg
7320atcgctcaca accagtcggt agatgtcaag aagagacgtt gggttacctt ctgctctgca
7380gaatggccaa cctttaacgt cggatggccg cgagacggca cctttaaccg agacctcatc
7440acccaggtta agatcaaggt cttttcacct ggcccgcatg gacacccaga ccaggtcccc
7500tacatcgtga cctgggaagc cttggctttt gacccccctc cctgggtcaa gccctttgta
7560caccctaagc ctccgcctcc tcttcctcca tccgccccgt ctctccccct tgaacctcct
7620cgttcgaccc cgcctcgatc ctccctttat ccagccctca ct
76622337641DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 233ccttctctag gcgcccccat atggccatat
gagatcttat atggggcacc cccgcccctt 60gtaaacttcc ctgaccctga catgacaaga
gttactaaca gcccctctct ccaagctcac 120ttacaggctc tctacttagt ccagcacgaa
gtctggagac ctctggcggc agcctaccaa 180gaacaactgg accgaccggt gccgccacca
tggaaaccga caccctgctg ctgtgggtgc 240tgctgctgtg ggtgccagga tccacaggac
aatctgccct gactcagcct gcctccgtgt 300ctgcgtctcc tggacagtcg atcgccatct
cctgcactgg aaccagcagt gacgttggtt 360ggtatcaaca gcacccaggc aaagccccca
aactcatgat ttatgaggac agtaagcggc 420cctcaggggt ttctaatcgc ttctctggct
ccaagtctgg caacacggcc tccctgacca 480tctctgggct ccaggctgag gacgaggctg
attattactg cagctcaaat acaagaagca 540gcactttggt gttcggcgga gggaccaagc
tgaccgtcct aggttctaga ggtggtggtg 600gtagcggcgg cggcggctct ggtggtggtg
gatccctcga gatggccgaa gtgcagctgg 660tgcagtctgg ggctgagatg aagaagcctg
gggcctcact gaagctctcc tgcaaggctt 720ctggatacac cttcatcgac tactatgtat
actggatgcg acaggcccct ggacaagggc 780ttgagtccat gggatggatc aaccctaaca
gtggtggcac aaactatgca cagaagtttc 840agggcagggt caccatgacc agggacacgt
ccatcagcac agcctacatg gagctgagca 900ggctgagatc tgacgacacc gccatgtatt
actgtgcgcg ctcccagcgt gacggttaca 960tggattactg gggtcaaggt actctggtga
ccgtctcctc agcggccgca cccaccacga 1020cgccagcgcc gcgaccacca accccggcgc
ccacgatcgc gtcgcagccc ctgtccctgc 1080gcccagaggc gtgccggcca gcggcggggg
gcgcagtgca cacgaggggg ctggacttcg 1140cctgtgatat ctacatctgg gcgcccctgg
ccgggacttg tggggtcctt ctcctgtcac 1200tggttatcac cctttactgc aacaaacggg
gcagaaagaa gctcctgtat atattcaaac 1260aaccatttat gagaccagta caaactactc
aagaggaaga tggctgtagc tgccgatttc 1320cagaagaaga agaaggagga tgtgaactga
gagtgaagtt cagcaggagc gcagagcccc 1380ccgcgtacca gcagggccag aaccagctct
ataacgagct caatctagga cgaagagagg 1440agtacgatgt tttggacaag agacgtggcc
gggaccctga gatgggggga aagccgagaa 1500ggaagaaccc tcaggaaggc ctgtacaatg
aactgcagaa agataagatg gcggaggcct 1560acagtgagat tgggatgaaa ggcgagcgcc
ggaggggcaa ggggcacgat ggcctttacc 1620agggtctcag tacagccacc aaggacacct
acgacgccct tcacatgcag gccctgcccc 1680ctcgctaaca gccactcgag gatccggatt
agtccaattt gttaaagaca ggatatcagt 1740ggtccaggct ctagttttga ctcaacaata
tcaccagctg aagcctatag agtacgagcc 1800atagataaaa taaaagattt tatttagtct
ccagaaaaag gggggaatga aagaccccac 1860ctgtaggttt ggcaagctag cttaagtaac
gccattttgc aaggcatgga aaaatacata 1920actgagaata gagaagttca gatcaaggtc
aggaacagat ggaacagctg aatatgggcc 1980aaacaggata tctgtggtaa gcagttcctg
ccccggctca gggccaagaa cagatggaac 2040agctgaatat gggccaaaca ggatatctgt
ggtaagcagt tcctgccccg gctcagggcc 2100aagaacagat ggtccccaga tgcggtccag
ccctcagcag tttctagaga accatcagat 2160gtttccaggg tgccccaagg acctgaaatg
accctgtgcc ttatttgaac taaccaatca 2220gttcgcttct cgcttctgtt cgcgcgcttc
tgctccccga gctcaataaa agagcccaca 2280acccctcact cggggcgcca gtcctccgat
tgactgagtc gcccgggtac ccgtgtatcc 2340aataaaccct cttgcagttg catccgactt
gtggtctcgc tgttccttgg gagggtctcc 2400tctgagtgat tgactacccg tcagcggggg
tctttcacac atgcagcatg tatcaaaatt 2460aatttggttt tttttcttaa gtatttacat
taaatggcca tagtacttaa agttacattg 2520gcttccttga aataaacatg gagtattcag
aatgtgtcat aaatatttct aattttaaga 2580tagtatctcc attggctttc tactttttct
tttatttttt tttgtcctct gtcttccatt 2640tgttgttgtt gttgtttgtt tgtttgtttg
ttggttggtt ggttaatttt tttttaaaga 2700tcctacacta tagttcaagc tagactatta
gctactctgt aacccagggt gaccttgaag 2760tcatgggtag cctgctgttt tagccttccc
acatctaaga ttacaggtat gagctatcat 2820ttttggtata ttgattgatt gattgattga
tgtgtgtgtg tgtgattgtg tttgtgtgtg 2880tgactgtgaa aatgtgtgta tgggtgtgtg
tgaatgtgtg tatgtatgtg tgtgtgtgag 2940tgtgtgtgtg tgtgtgtgca tgtgtgtgtg
tgtgactgtg tctatgtgta tgactgtgtg 3000tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg
tgtgtgtgtg ttgtgaaaaa atattctatg 3060gtagtgagag ccaacgctcc ggctcaggtg
tcaggttggt ttttgagaca gagtctttca 3120cttagcttgg aattcactgg ccgtcgtttt
acaacgtcgt gactgggaaa accctggcgt 3180tacccaactt aatcgccttg cagcacatcc
ccctttcgcc agctggcgta atagcgaaga 3240ggcccgcacc gatcgccctt cccaacagtt
gcgcagcctg aatggcgaat ggcgcctgat 3300gcggtatttt ctccttacgc atctgtgcgg
tatttcacac cgcatatggt gcactctcag 3360tacaatctgc tctgatgccg catagttaag
ccagccccga cacccgccaa cacccgctga 3420cgcgccctga cgggcttgtc tgctcccggc
atccgcttac agacaagctg tgaccgtctc 3480cgggagctgc atgtgtcaga ggttttcacc
gtcatcaccg aaacgcgcga tgacgaaagg 3540gcctcgtgat acgcctattt ttataggtta
atgtcatgat aataatggtt tcttagacgt 3600caggtggcac ttttcgggga aatgtgcgcg
gaacccctat ttgtttattt ttctaaatac 3660attcaaatat gtatccgctc atgagacaat
aaccctgata aatgcttcaa taatattgaa 3720aaaggaagag tatgagtatt caacatttcc
gtgtcgccct tattcccttt tttgcggcat 3780tttgccttcc tgtttttgct cacccagaaa
cgctggtgaa agtaaaagat gctgaagatc 3840agttgggtgc acgagtgggt tacatcgaac
tggatctcaa cagcggtaag atccttgaga 3900gttttcgccc cgaagaacgt tttccaatga
tgagcacttt taaagttctg ctatgtggcg 3960cggtattatc ccgtattgac gccgggcaag
agcaactcgg tcgccgcata cactattctc 4020agaatgactt ggttgagtac tcaccagtca
cagaaaagca tcttacggat ggcatgacag 4080taagagaatt atgcagtgct gccataacca
tgagtgataa cactgcggcc aacttacttc 4140tgacaacgat cggaggaccg aaggagctaa
ccgctttttt gcacaacatg ggggatcatg 4200taactcgcct tgatcgttgg gaaccggagc
tgaatgaagc cataccaaac gacgagcgtg 4260acaccacgat gcctgtagca atggcaacaa
cgttgcgcaa actattaact ggcgaactac 4320ttactctagc ttcccggcaa caattaatag
actggatgga ggcggataaa gttgcaggac 4380cacttctgcg ctcggccctt ccggctggct
ggtttattgc tgataaatct ggagccggtg 4440agcgtgggtc tcgcggtatc attgcagcac
tggggccaga tggtaagccc tcccgtatcg 4500tagttatcta cacgacgggg agtcaggcaa
ctatggatga acgaaataga cagatcgctg 4560agataggtgc ctcactgatt aagcattggt
aactgtcaga ccaagtttac tcatatatac 4620tttagattga tttaaaactt catttttaat
ttaaaaggat ctaggtgaag atcctttttg 4680ataatctcat gaccaaaatc ccttaacgtg
agttttcgtt ccactgagcg tcagaccccg 4740tagaaaagat caaaggatct tcttgagatc
ctttttttct gcgcgtaatc tgctgcttgc 4800aaacaaaaaa accaccgcta ccagcggtgg
tttgtttgcc ggatcaagag ctaccaactc 4860tttttccgaa ggtaactggc ttcagcagag
cgcagatacc aaatactgtc cttctagtgt 4920agccgtagtt aggccaccac ttcaagaact
ctgtagcacc gcctacatac ctcgctctgc 4980taatcctgtt accagtggct gctgccagtg
gcgataagtc gtgtcttacc gggttggact 5040caagacgata gttaccggat aaggcgcagc
ggtcgggctg aacggggggt tcgtgcacac 5100agcccagctt ggagcgaacg acctacaccg
aactgagata cctacagcgt gagcattgag 5160aaagcgccac gcttcccgaa gggagaaagg
cggacaggta tccggtaagc ggcagggtcg 5220gaacaggaga gcgcacgagg gagcttccag
ggggaaacgc ctggtatctt tatagtcctg 5280tcgggtttcg ccacctctga cttgagcgtc
gatttttgtg atgctcgtca ggggggcgga 5340gcctatggaa aaacgccagc aacgcggcct
ttttacggtt cctggccttt tgctggcctt 5400ttgctcacat gttctttcct gcgttatccc
ctgattctgt ggataaccgt attaccgcct 5460ttgagtgagc tgataccgct cgccgcagcc
gaacgaccga gcgcagcgag tcagtgagcg 5520aggaagcgga agagcgccca atacgcaaac
cgcctctccc cgcgcgttgg ccgattcatt 5580aatgcagctg gcacgacagg tttcccgact
ggaaagcggg cagtgagcgc aacgcaatta 5640atgtgagtta gctcactcat taggcacccc
aggctttaca ctttatgctt ccggctcgta 5700tgttgtgtgg aattgtgagc ggataacaat
ttcacacagg aaacagctat gaccatgatt 5760acgccaagct ttgctcttag gagtttccta
atacatccca aactcaaata tataaagcat 5820ttgacttgtt ctatgcccta gggggcgggg
ggaagctaag ccagcttttt ttaacattta 5880aaatgttaat tccattttaa atgcacagat
gtttttattt cataagggtt tcaatgtgca 5940tgaatgctgc aatattcctg ttaccaaagc
tagtataaat aaaaatagat aaacgtggaa 6000attacttaga gtttctgtca ttaacgtttc
cttcctcagt tgacaacata aatgcgctgc 6060tgagcaagcc agtttgcatc tgtcaggatc
aatttcccat tatgccagtc atattaatta 6120ctagtcaatt agttgatttt tatttttgac
atatacatgt gaatgaaaga ccccacctgt 6180aggtttggca agctagctta agtaacgcca
ttttgcaagg catggaaaaa tacataactg 6240agaatagaaa agttcagatc aaggtcagga
acagatggaa cagctgaata tgggccaaac 6300aggatatctg tggtaagcag ttcctgcccc
ggctcagggc caagaacaga tggaacagct 6360gaatatgggc caaacaggat atctgtggta
agcagttcct gccccggctc agggccaaga 6420acagatggtc cccagatgcg gtccagccct
cagcagtttc tagagaacca tcagatgttt 6480ccagggtgcc ccaaggacct gaaatgaccc
tgtgccttat ttgaactaac caatcagttc 6540gcttctcgct tctgttcgcg cgcttatgct
ccccgagctc aataaaagag cccacaaccc 6600ctcactcggg gcgccagtcc tccgattgac
tgagtcgccc gggtacccgt gtatccaata 6660aaccctcttg cagttgcatc cgacttgtgg
tctcgctgtt ccttgggagg gtctcctctg 6720agtgattgac tacccgtcag cgggggtctt
tcatttgggg gctcgtccgg gatcgggaga 6780cccctgccca gggaccaccg acccaccacc
gggaggtaag ctggccagca acttatctgt 6840gtctgtccga ttgtctagtg tctatgactg
attttatgcg cctgcgtcgg tactagttag 6900ctaactagct ctgtatctgg cggacccgtg
gtggaactga cgagttcgga acacccggcc 6960gcaaccctgg gagacgtccc agggacttcg
ggggccgttt ttgtggcccg acctgagtcc 7020taaaatcccg atcgtttagg actctttggt
gcacccccct tagaggaggg atatgtggtt 7080ctggtaggag acgagaacct aaaacagttc
ccgcctccgt ctgaattttt gctttcggtt 7140tgggaccgaa gccgcgccgc gcgtcttgtc
tgctgcagca tcgttctgtg ttgtctctgt 7200ctgactgtgt ttctgtattt gtctgaaaat
atgggcccgg gctagactgt taccactccc 7260ttaagtttga ccttaggtca ctggaaagat
gtcgagcgga tcgctcacaa ccagtcggta 7320gatgtcaaga agagacgttg ggttaccttc
tgctctgcag aatggccaac ctttaacgtc 7380ggatggccgc gagacggcac ctttaaccga
gacctcatca cccaggttaa gatcaaggtc 7440ttttcacctg gcccgcatgg acacccagac
caggtcccct acatcgtgac ctgggaagcc 7500ttggcttttg acccccctcc ctgggtcaag
ccctttgtac accctaagcc tccgcctcct 7560cttcctccat ccgccccgtc tctccccctt
gaacctcctc gttcgacccc gcctcgatcc 7620tccctttatc cagccctcac t
76412341491DNAArtificial
SequenceDescription of Artificial Sequence Synthetic polynucleotide
234atggaaaccg acaccctgct gctgtgggtg ctgctgctgt gggtgccagg atccacagga
60ctgcctgtgc tgactcagcc accctcagcg tctgggaccc ccgggcagag ggtcaccatc
120tcttgttctg gacgcagttc caacatcggg agtaattctg ttaactggta tcgacaactc
180ccaggagcgg cccccaaact cctcatctat agtaataatc agcggccccc aggggtccct
240gtgcgattct ctggctccaa gtctggcacc tcagcctccc tggccatcag tgggctccag
300tctgaagatg aggccactta ttactgtgca acatgggatg acaatctgaa tgttcactat
360gtcttcggaa ctgggaccaa ggtcaccgtc ctaggttcta gaggtggtgg tggtagcggc
420ggcggcggct ctggtggtgg tggatccctc gagatggccc aggtgcagct ggtgcagtct
480ggggctgagg tgaagaagcc tgggtcctcg gtgaaggtct cctgcaaggc ttctggaggc
540accttcagca gctatgctat cagctgggtg cgacaggccc ctggacaagg gcttgagtgg
600atgggaagga tcatccctat ccttggtata gcaaactacg cacagaagtt ccagggcaga
660gtcacgatta ccgcggacaa atccacgagc acagcctaca tggagctgag cagcctgaga
720tctgaggaca cggccgtgta ttactgtgcg cgcggtggtt actactctca tgacatgtgg
780tctgaagatt ggggtcaagg tactctggtg accgtctcct cagcggccgc aattgaagtt
840atgtatcctc ctccttacct agacaatgag aagagcaatg gaaccattat ccatgtgaaa
900gggaaacacc tttgtccaag tcccctattt cccggacctt ctaagccctt ttgggtgctg
960gtggtggttg gtggagtcct ggcttgctat agcttgctag taacagtggc ctttattatt
1020ttctgggtga ggagtaagag gagcaggctc ctgcacagtg actacatgaa catgactccc
1080cgccgccccg ggcccacccg caagcattac cagccctatg ccccaccacg cgacttcgca
1140gcctatcgct ccagagtgaa gttcagcagg agcgcagacg cccccgcgta ccagcagggc
1200cagaaccagc tctataacga gctcaatcta ggacgaagag aggagtacga tgttttggac
1260aagagacgtg gccgggaccc tgagatgggg ggaaagccga gaaggaagaa ccctcaggaa
1320ggcctgtaca atgaactgca gaaagataag atggcggagg cctacagtga gattgggatg
1380aaaggcgagc gccggagggg caaggggcac gatggccttt accagggtct cagtacagcc
1440accaaggaca cctacgacgc ccttcacatg caggccctgc cccctcgcta a
14912351488DNAArtificial SequenceDescription of Artificial Sequence
Synthetic polynucleotide 235atggaaaccg acaccctgct gctgtgggtg
ctgctgctgt gggtgccagg atccacagga 60caggctgtgc tgactcagcc accctcagcg
tctgggaccc ccgggcagag ggtcaccatc 120tcttgttctg gaagcagctc caacatcgga
agtaattacg tattctggta ccagcagctc 180ccaggaacgg cccccaaact cctcatctat
agtaataatc agcggccctc aggggtccct 240gaccgattct ctggctccaa gtctggcacc
tcagcctccc tggccatcag tgggctccgg 300tccgaggatg aggctgatta ttactgtgca
gcatgggatg acagcctgag tgcctcttat 360gttttcggaa ctgggaccaa ggtcaccgtc
ctaggttcta gaggtggtgg tggtagcggc 420ggcggcggct ctggtggtgg tggatccctc
gagatggccc aggtgcagct ggtgcagtct 480ggggctgagg tgaagaagcc tgggtcctcg
gtgaaggtct cctgcaaggc ttctggaggc 540accttcagca gctatgctat cagctgggtg
cgacaggccc ctggacaagg gcttgagtgg 600atgggaagga tcatccctat ccttggtaca
gcaaactacg cacagaagtt ccagggcaga 660gtcacgatta ccgcggacga atccacgagc
acagcctaca tggagctgag cagcctgaga 720tctgaggaca cggccgtgta ttactgtgcg
cgctctggtt acggttctta ccgttgggaa 780gattcttggg gtcaaggtac tctggtgacc
gtctcctcag cggccgcaat tgaagttatg 840tatcctcctc cttacctaga caatgagaag
agcaatggaa ccattatcca tgtgaaaggg 900aaacaccttt gtccaagtcc cctatttccc
ggaccttcta agcccttttg ggtgctggtg 960gtggttggtg gagtcctggc ttgctatagc
ttgctagtaa cagtggcctt tattattttc 1020tgggtgagga gtaagaggag caggctcctg
cacagtgact acatgaacat gactccccgc 1080cgccccgggc ccacccgcaa gcattaccag
ccctatgccc caccacgcga cttcgcagcc 1140tatcgctcca gagtgaagtt cagcaggagc
gcagacgccc ccgcgtacca gcagggccag 1200aaccagctct ataacgagct caatctagga
cgaagagagg agtacgatgt tttggacaag 1260agacgtggcc gggaccctga gatgggggga
aagccgagaa ggaagaaccc tcaggaaggc 1320ctgtacaatg aactgcagaa agataagatg
gcggaggcct acagtgagat tgggatgaaa 1380ggcgagcgcc ggaggggcaa ggggcacgat
ggcctttacc agggtctcag tacagccacc 1440aaggacacct acgacgccct tcacatgcag
gccctgcccc ctcgctaa 148823619PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 236Ser
Gly Ser Gly Leu Gln Met Ala Gly Gln Cys Ser Gln Asn Glu Tyr1
5 10 15Phe Asp Ser23719PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 237Ser
Gly Ser Gly Gln Met Ala Gly Gln Cys Ser Gln Asn Glu Tyr Phe1
5 10 15Asp Ser Leu23819PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 238Ser
Gly Ser Gly Met Ala Gly Gln Cys Ser Gln Asn Glu Tyr Phe Asp1
5 10 15Ser Leu Leu23919PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 239Ser
Gly Ser Gly Ala Gly Gln Cys Ser Gln Asn Glu Tyr Phe Asp Ser1
5 10 15Leu Leu His24019PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 240Ser
Gly Ser Gly Gly Gln Cys Ser Gln Asn Glu Tyr Phe Asp Ser Leu1
5 10 15Leu His Ala24119PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 241Ser
Gly Ser Gly Gln Cys Ser Gln Asn Glu Tyr Phe Asp Ser Leu Leu1
5 10 15His Ala Cys24219PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 242Ser
Gly Ser Gly Cys Ser Gln Asn Glu Tyr Phe Asp Ser Leu Leu His1
5 10 15Ala Cys Ile24319PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 243Ser
Gly Ser Gly Ser Gln Asn Glu Tyr Phe Asp Ser Leu Leu His Ala1
5 10 15Cys Ile Pro24419PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 244Ser
Gly Ser Gly Gln Asn Glu Tyr Phe Asp Ser Leu Leu His Ala Cys1
5 10 15Ile Pro Cys24519PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 245Ser
Gly Ser Gly Asn Glu Tyr Phe Asp Ser Leu Leu His Ala Cys Ile1
5 10 15Pro Cys Gln24619PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 246Ser
Gly Ser Gly Glu Tyr Phe Asp Ser Leu Leu His Ala Cys Ile Pro1
5 10 15Cys Gln Leu24719PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 247Ser
Gly Ser Gly Tyr Phe Asp Ser Leu Leu His Ala Cys Ile Pro Cys1
5 10 15Gln Leu Arg24819PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 248Ser
Gly Ser Gly Phe Asp Ser Leu Leu His Ala Cys Ile Pro Cys Gln1
5 10 15Leu Arg Cys24919PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 249Ser
Gly Ser Gly Asp Ser Leu Leu His Ala Cys Ile Pro Cys Gln Leu1
5 10 15Arg Cys Ser25019PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 250Ser
Gly Ser Gly Ser Leu Leu His Ala Cys Ile Pro Cys Gln Leu Arg1
5 10 15Cys Ser Ser25119PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 251Ser
Gly Ser Gly Leu Leu His Ala Cys Ile Pro Cys Gln Leu Arg Cys1
5 10 15Ser Ser Asn25219PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 252Ser
Gly Ser Gly Leu His Ala Cys Ile Pro Cys Gln Leu Arg Cys Ser1
5 10 15Ser Asn Thr25319PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 253Ser
Gly Ser Gly His Ala Cys Ile Pro Cys Gln Leu Arg Cys Ser Ser1
5 10 15Asn Thr Pro25419PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 254Ser
Gly Ser Gly Ala Cys Ile Pro Cys Gln Leu Arg Cys Ser Ser Asn1
5 10 15Thr Pro Pro25519PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 255Ser
Gly Ser Gly Cys Ile Pro Cys Gln Leu Arg Cys Ser Ser Asn Thr1
5 10 15Pro Pro Leu25619PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 256Ser
Gly Ser Gly Ile Pro Cys Gln Leu Arg Cys Ser Ser Asn Thr Pro1
5 10 15Pro Leu Thr25719PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 257Ser
Gly Ser Gly Pro Cys Gln Leu Arg Cys Ser Ser Asn Thr Pro Pro1
5 10 15Leu Thr Cys25819PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 258Ser
Gly Ser Gly Cys Gln Leu Arg Cys Ser Ser Asn Thr Pro Pro Leu1
5 10 15Thr Cys Gln25919PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 259Ser
Gly Ser Gly Gln Leu Arg Cys Ser Ser Asn Thr Pro Pro Leu Thr1
5 10 15Cys Gln Arg26019PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 260Ser
Gly Ser Gly Leu Arg Cys Ser Ser Asn Thr Pro Pro Leu Thr Cys1
5 10 15Gln Arg Tyr26119PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 261Ser
Gly Ser Gly Arg Cys Ser Ser Asn Thr Pro Pro Leu Thr Cys Gln1
5 10 15Arg Tyr Cys26219PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 262Ser
Gly Ser Gly Cys Ser Ser Asn Thr Pro Pro Leu Thr Cys Gln Arg1
5 10 15Tyr Cys Asn26319PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 263Ser
Gly Ser Gly Ser Ser Asn Thr Pro Pro Leu Thr Cys Gln Arg Tyr1
5 10 15Cys Asn Ala26419PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 264Ser
Gly Ser Gly Ser Asn Thr Pro Pro Leu Thr Cys Gln Arg Tyr Cys1
5 10 15Asn Ala Ser26519PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 265Ser
Gly Ser Gly Asn Thr Pro Pro Leu Thr Cys Gln Arg Tyr Cys Asn1
5 10 15Ala Ser Val26619PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 266Ser
Gly Ser Gly Thr Pro Pro Leu Thr Cys Gln Arg Tyr Cys Asn Ala1
5 10 15Ser Val Thr26719PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 267Ser
Gly Ser Gly Pro Pro Leu Thr Cys Gln Arg Tyr Cys Asn Ala Ser1
5 10 15Val Thr Asn26819PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 268Ser
Gly Ser Gly Pro Leu Thr Cys Gln Arg Tyr Cys Asn Ala Ser Val1
5 10 15Thr Asn Ser26919PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 269Ser
Gly Ser Gly Leu Thr Cys Gln Arg Tyr Cys Asn Ala Ser Val Thr1
5 10 15Asn Ser Val27019PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 270Ser
Gly Ser Gly Thr Cys Gln Arg Tyr Cys Asn Ala Ser Val Thr Asn1
5 10 15Ser Val Lys27119PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 271Ser
Gly Ser Gly Cys Gln Arg Tyr Cys Asn Ala Ser Val Thr Asn Ser1
5 10 15Val Lys Gly27219PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 272Ser
Gly Ser Gly Gln Arg Tyr Cys Asn Ala Ser Val Thr Asn Ser Val1
5 10 15Lys Gly Thr27319PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 273Ser
Gly Ser Gly Arg Tyr Cys Asn Ala Ser Val Thr Asn Ser Val Lys1
5 10 15Gly Thr Asn27419PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 274Ser
Gly Ser Gly Tyr Cys Asn Ala Ser Val Thr Asn Ser Val Lys Gly1
5 10 15Thr Asn Ala27525PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 275Thr
Ser Gly Gln Ala Gly Gln His His His His His His Gly Ala Tyr1
5 10 15Pro Tyr Asp Val Pro Asp Tyr
Ala Ser 20 2527675DNAArtificial
SequenceDescription of Artificial Sequence Synthetic oligonucleotide
276actagtggcc aggccggcca gcaccatcac catcaccatg gcgcataccc gtacgacgtt
60ccggactacg cttct
752774PRTHomo sapiens 277Tyr Val Lys Met12784PRTArtificial
SequenceDescription of Artificial Sequence Synthetic peptide 278Ser
Gly Ser Gly12795PRTArtificial SequenceDescription of Artificial Sequence
Synthetic peptide 279Gly Ser Ser Ser Ser1
528053PRTArtificial SequenceDescription of Artificial Sequence Synthetic
polypeptide 280Leu Gln Met Ala Gly Gln Cys Ser Gln Asn Glu Tyr Phe
Asp Ser Leu1 5 10 15Leu
His Ala Cys Ile Pro Cys Gln Leu Arg Cys Ser Ser Asn Thr Pro 20
25 30Pro Leu Thr Cys Gln Arg Tyr Cys
Asn Ala Ser Val Thr Asn Ser Val 35 40
45Lys Gly Thr Asn Ala 50
User Contributions:
Comment about this patent or add new information about this topic:
People who visited this patent also read: | |
Patent application number | Title |
---|---|
20200083153 | ELECTRONIC DEVICE INCLUDING A LATERAL TRACE |
20200083152 | Package Structures and Methods for Forming the Same |
20200083151 | Packages with Si-Substrate-Free Interposer and Method Forming Same |
20200083150 | SEMICONDUCTOR DEVICE AND METHOD FOR MANUFACTURING SAME |
20200083149 | PACKAGED SEMICONDUCTOR DEVICE WITH A PARTICLE ROUGHENED SURFACE |