Patent application title: EUKARYOTIC CELL DISPLAY SYSTEMS
Inventors:
Kevin Caili Wang (Lansdale, PA, US)
Peter Peizhi Luo (Lansdale, PA, US)
Pingyu Zhong (Blue Bell, PA, US)
Pingyu Zhong (Blue Bell, PA, US)
Jian Wang (Lansdale, PA, US)
Assignees:
Merck Sharp & Dohme Corp.
IPC8 Class: AC40B5006FI
USPC Class:
506 26
Class name: Combinatorial chemistry technology: method, library, apparatus method of creating a library (e.g., combinatorial synthesis, etc.) biochemical method (e.g., using an enzyme or whole viable micro-organism, etc.)
Publication date: 2012-12-27
Patent application number: 20120329679
Abstract:
The present invention provides expression vectors and helper display
vectors which can be used in various combinations as vector sets for
display of polypeptides on the outer surface of eukaryotic host cells.
The expression vector of the invention can be used alone for soluble
expression without having to change or reengineer the display vectors.
The display systems of the invention are particularly useful for
displaying a genetically diverse repertoire or library of polypeptides on
the surface of yeast cells, and mammalian cells.Claims:
1. A method for displaying a repertoire of polypeptide sequences of
interest on eukaryotic host cells, comprising: a) introducing at least
one expression vector comprising an expression cassette encoding a
polypeptide of interest fused in frame to a first adapter sequence, into
host cells b) introducing a helper vector encoding a fusion protein
consisting of a second adapter sequence fused to an outer surface
anchoring protein expressed by the host cells of (a), and c) maintaining
the host cells under suitable conditions for expression of the proteins
encoded by the expression cassette and helper vector wherein the
polypeptides of interest are displayed on the surface of the host cell.
2. The method of claim 1 wherein the repertoire of polypeptide sequences of interest is an antibody library.
3. The method of claim 2 wherein the antibody library is scFv library.
4. The method of claim 2 wherein the antibody library is Fab library.
5. The method of claim 1 wherein the eukaryotic host cells are yeast cells.
6. The method of claim 5 wherein the yeast cells are selected from a group consisting of S. cerevisiae, P. pichia, H. polymorpha, and C. albicans.
7. The method of claim 5 wherein the yeast outer surface anchoring sequences are selected from the group consisting of Aga1 and Aga2, Cwp1, Cwp2, Gas1p, Yap3p, Flo1p, Crh2p, Pir1, Pir2, Pir4, Icwp in S.cerevisiae; HpSEDI, HpGASI, HpTIPI, HPWPI in H. polymorpha, and Hwp1p, Als3p, Rbt5p in C. albicans.
8. The method of claim 1 wherein the eukaryotic host cell is a mammalian cell selected from a group consisting of animal cells and insect cells.
9. The method of claim 9 wherein the outer surface anchoring sequences are selected from a group consisting of transmembrane domains of cell surface receptors, GPI anchor sequences, non-cleavable type II signal anchor sequences.
10. The method of claim 1 wherein the first adapter and second adapter are homodimer sequences.
11. The method of claim 1 wherein the first adapter and second adapter are heterodimer sequences.
12. The method of claim 1 wherein the first adapter and second adapter are dimeric sequences consist a pair of cysteine residues.
13. The method of claim 12 wherein the first and second adapter sequences are derived from coiled coil domains.
14. The method of claim 6 wherein the first adapter sequence consists of SEQ ID NO:1 and the second adapter sequence consists of SEQ ID NO: 2.
15. The method of claim 1 wherein the expression vector is selected from pMAT9, pMAT12, pMAT19, pMAG10 and the helper vector is selected from pMAT7, pMAT8, and pMAG2.
16. A yeast eukaryotic host cell transformed with a helper vector selected from the group pMAT7, or pMAT8.
17. A mammalian host cell transfected with a helper vector such as pMAG2.
18. A kit comprising: a) at least one expression vector comprising an expression cassette encoding a polypeptide of interest fused in frame to a first adapter sequence, and b) a helper vector encoding a fusion protein consisting of a second adapter sequence fused to an outer surface anchoring protein.
Description:
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application is a of U.S. application under 37 CFR 1.53(b) which claims the benefit of U.S. Provisional Application No. 61/003,413 filed Nov. 16, 2007.
TECHNICAL FIELD OF THE INVENTION
[0002] This invention relates to the field of protein display and provides display systems which facilitate the display of protein libraries on the surface of eukaryotic host cells, including yeast cells and mammalian cells. The compositions and methods of the invention are particularly useful for identifying proteins with desired properties from a vast repertoire of proteins. This system also provides methods for producing soluble protein for use in functional assays and for directing expressed proteins to different cellular organelles without any molecular manipulation of the display vector.
BACKGROUND OF THE INVENTION
[0003] Phage display systems are regarded as a core technology platform for the construction and screening of polypeptide libraries, particularly antibody libraries. This is attributed to numerous practical considerations including, the availability of various genetic tools, the convenience of manipulation, and the high transformation efficiency of E coli cells. Today, naive antibody libraries displayed on phage are routinely used for antibody discovery, thereby obviating the need for animal immunizations and the use of traditional hybridoma technology. However, despite the successful use of phage display in antibody discovery and engineering protocols, there are a number of drawbacks associated with the expression and display of eukaryotic proteins in prokaryotic systems.
[0004] For example, some eukaryotic proteins cannot be functionally expressed in prokaryotic cells. In addition, prokaryotic host cells are typically not able to accomplish the full range of post-translational modifications that are characteristic of eukaryotic host cells. Some of the limitations associated with the use of a prokaryotic display system can be overcome by the use of a eukaryotic display system. For example, a unique advantage associated with the use of a yeast display system is attributed to the fact that yeast cells can be cultivated to high densities using relatively simple and inexpensive culture medium. Generally speaking, eukaryotic host cells can accommodate the display of relatively large proteins, and are capable of post translation modifications including complex glycosylation. In addition, because eukaryotic cells are larger in size than prokaryotic cells, the members of a display libraries can be efficiently screened for single cells expressing proteins with desired properties (i.e., binding specificities) by flow cytometry.
[0005] The display of heterologous protein on the cell surface of Saccharornyces cerevisiae was first described in 1993 using a cell wall protein-basd fusion protein design in which alpha-galactosidase was fused to the C-terminal half of cell wall protein alpha-agglutinin AGA 1 (Schreuder M P et al, Yeast 9:399-409). Since then, numerous yeast display systems based on fusing a library of proteins of interest to cell wall proteins have been reported (Kondo M et al). Among all of the cell wall fusion protein-based display systems, the system created by Dane Wittrup based on a-agglutinin receptor has been widely used to display various proteins libraries including various formats of antibody libraries (U.S. Pat. Nos. 6,300,065, 6,423,538, 6,696,251, and 6,699,658).
[0006] Similarly, a number of approaches have been designed to achieve the display of proteins on the surface of mammalian cells using display vectors which comprise a membrane anchor proteins fused to the members of a protein library comprising a diverse repertoire of protein sequences. Typically, the anchor protein comprises a protein derived from the membrane domain of a cell surface receptor (Chestnut et al, 1996, J Immunological Methods; Ho et al, 2006, PNAS, 103:9637-9642), such as a GPI anchor sequences (U.S. Pat. No. 6,838,446), or a non-cleavable type 11 signal anchor sequence(U.S. Pat. No. 7,125,973). For example, the pDISPLAY vector (Invitrogen Life Technolgies.), is a commercially available vector which directs the cell surface display of proteins on mammalian cell utilizes the membrane domain of cell surface platelet derived growth factor receptor PDGFR. Proteins expressed from the pDISPLAY vector are anchored to the plasma membrane of the host cell and displayed on the extracellular side of the plasma membrane.
[0007] There are a number of drawbacks associated with the use of a cell surface display system based on fusing the protein library to a cell surface anchor protein. For example, because the proteins of interest are directly fused to the outer surface anchor protein, the protein of interest can only be expressed as a part of the membrane protein. In order to obtain soluble protein for evaluation in screening and/or functional assays, additional molecular cloning steps are required in order to transfer the coding sequences of interest to an expression vector which directs the expression of soluble protein. Use of a cell wall fusion protein-based design also eliminates the possibility of evaluating the functional properties of expressed proteins inside cellular organelles such as mitochondria, Golgi apparatus, endoplasmic reticulum etc.
[0008] Therefore, there is an unmet need for an alternative protein display system which facilitates the display of protein libraries on eukaryotic cell surfaces, using a vector design which can also be used to direct expression of library proteins as either soluble proteins or as intracellular proteins without any molecular manipulations to reengineer the display vector.
SUMMARY OF THE INVENTION
[0009] This invention provides protein display systems that are capable of displaying diverse libraries of polypeptides on the surface of eukaryotic cells. The compositions and methods of the invention can be used to display the protein products encoded by a diverse repertoire of coding sequences on the surface of yeast cells or mammalian cells. The compositions and methods of the invention are particularly useful for the display of antibody libraries for antibody discovery (i.e. screening) and/or optimization (e.g., molecular evolution) protocols. Notably, the displayed library members are not anchored to the cell membrane as a consequence of being directly fused to a coding sequence that encodes a cell wall outer membrane protein.
[0010] In one embodiment, the invention provides a method for displaying a repertoire of polypeptide sequences on the surface of eukaryotic cells. As depicted in FIG. 1, the disclosed display systems generally have two components: 1) an expression (or display) vector and 2) a corresponding helper vector. As disclosed herein a display system comprises a pair of vectors (i.e., a display vector and a corresponding helper vector) that are chosen based on the identity of the host cell. Accordingly, an alternative embodiment of the invention provides kits in suitable packaging comprising at least one display vector and a helper vector designed to direct the display of a collection of protein sequences on the surface of particular eukaryotic host cells.
[0011] The display vector comprises a fusion protein in which the members of the encoded protein library are fused to a first adapter sequence (referred to herein as "adapter1"). Introduction of a display vector, in the absence of a helper vector, into a eukaryotic host cells, such as yeast cells, or mammalian cells, leads to expression and secretion of polypeptides that are fused in-frame with adapter1.
[0012] More specifically, the invention provides the yeast expression vectors embodied by the library display vectors pMAT9, and pMAT12 (see FIGS. 2A & 2B). In pMAT9 and pMAT12 vectors, the expression of the adapter1 fusion protein is under control of a yeast promoter. Suitable strains of yeast host cells for use with the vectors and display systems of the invention include but are not limited to S. cerevisiae, Pichia pastoris, H. polymorphs, and C. albicans. Generally speaking, the disclosed yeast display vector comprises at least one expression cassette for an adapter1 fusion protein, which comprises the following functional elements: (1) a yeast promoter; (2) a yeast signal sequence; (3) a gene of interest; (4) adapter1 coding sequence.
[0013] In an alternative embodiment the invention provides the mammalian expression vector pMAG10 (see FIG. 7). In the pMAG10 vector, the expression of adapter1 fusion protein is under control of a mammalian promoter. Suitable mammalian host cells for use with the vectors and display systems of the invention include but are not limited to HEK293 and CHO cells.
[0014] Generally speaking, the disclosed helper vectors encode a fusion protein comprising a cell surface anchor protein in combination with a second adapter sequence (referred to herein as adapter2) that is capable of interacting in a pair-wise manner with a corresponding adapter sequence fused to the protein product of the library display vector. In specific embodiments the invention provides the yeast helper vectors pMAT7 (FIG. 3A) and pMAT8 (FIG. 3B). Each of the yeast helper display vectors directs the expression of a fusion protein comprising adapter 2 fused to different yeast cell outer wall proteins. More specifically, pMAT3 directs the expression of adapter 2 fused to the Aga2 proteins, and pMAT8 directs the expression of adapater2 Cwp2 fusion proteins.
[0015] The invention also provides the mammalian helper vectors pMAG2 (see FIG. 8). pMAG2 directs expression of adapter2 sequences fused to the transmembrane domain of human EGF receptor. An alternative embodiment, the invention provides eukaryotic (i.e., yeast and mammalian) host cells comprising a helper vectors of the invention. For example, a suitable host cell comprising a chromosomal integrant of one of the helper display vectors of the invention.
[0016] Co-expression of an expression vector of the invention in combination with a corresponding helper vector directs the display of the polypeptide product of the display library members anchored to the cell membrane or cell wall of the host cell. Surface display results from the pairwise interaction of the adapters (i.e., adapter1 fused to the protein product of the display library, and adapter2 fused to a host cell specific anchor protein) which has the effect of directing the display of the protein library on the surface of the host cell.
[0017] For example, co-expression of a helper vector comprising an adapter sequence fused to a yeast outer cell wall protein, in combination with a yeast display vector comprising a library of fusion proteins comprising a corresponding adapter sequence present in the fusion proteins expressed by the fusion proteins encoded by the display vector provides a yeast cell surface display system. Similarly, co-expression of a mammalian helper vector comprising an adapter a corresponding adapter sequence fused to a mammalian cell surface anchor protein in combination with a mammalian display vector comprising an adapter sequence that interacts with in a pairwise manner with the adapter present in the helper vector provides a mammalian cell surface display system.
[0018] The use of an expression vector of the invention in the absence of a helper vector, results in the expression of the encoded proteins as soluble proteins by the host cells. Therefore, the disclosed display vectors also facilitate the direct expression of library proteins as either soluble protein or as intracellular proteins without any molecular manipulation (i.e., DNA digestion and/or ligation) of the vector. Accordingly, the invention also provides an efficient method to evaluate the functional characteristics of the members of the display library proteins.
BRIEF DESCRIPTION OF THE DRAWINGS
[0019] FIGS. 1A and 1B provides schematic representations of the components of the yeast display system (FIG. 1A) and mammalian display system (FIG. 1B) of the invention.
[0020] FIGS. 2A and 2B provide schematic representations of the yeast display vectors pMAT9 (FIG. 2A) and pMAT12 (FIG. 2B).
[0021] FIGS. 3A and 3B provide schematic representations of the yeast helper display vectors pMAT7 (FIG. 3A) and pMAT8 (FIG. 3B).
[0022] FIGS. 4 provides a series of micrograph images of yeast cells transfected with a yeast helper vector comprising an adapter2 sequences fused to the outer cell membrane protein Cwp2. Yeast cells comprising a chromosomal integrant of the yeast helper vector pMAT8 were stained with mouse anti-Myc antibody and Alexa-488 conjugated anti-mouse antibody. The control micrograph presented in panel (a) shows a lack of fluorescence by cells that are not induced. The fluorescence micrograph presented in panel (b) shows a fluorescent signal on the surface of induced yeast cells. Panel (c) provides a phase contrast micrograph of the same cells depicted in the fluorescence micrograph of provided in panel (d). The surface fluorescence (green signal) detected in panel (d) illustrates the surface expression of Myc-tagged adapter2 fusion proteins anchored to the outer wall of induced yeast cells.
[0023] FIGS. 5A through 5C illustrates the functional surface display of scFv antibody on yeast cells comprising the pMAT9 and pMAT12 display vectors. FIG. 5A represents the fluorescent signal detected on the surface of yeast cells that are contransfected with the pMAT9 display vector and the pMAT8 helper vector. The control micrograph presented in panel (a) shows a lack of fluorescence by cells that are not induced. The fluorescence micrograph presented in panel (b) shows a fluorescent signal on the surface of induced yeast cells. Panel (c) provides a phase contrast micrograph of the same cells depicted in the fluorescence micrograph of provided in panel (d). The surface fluorescence (green signal) detected in panel (d) illustrates the surface expression of Myc-tagged adapter2 fusion proteins anchored to the outer wall of the induced yeast cells. FIG. 5B provides the results of a flow cytometry (FACScan) analysis of yeast cells displaying HA-tagged scFv antibodies on their surface resulting from the expression of vectors pMAT9 and pMAT12. FIG. 5C shows the co-localization of adapter1 fusion proteins and adapter2 fusion proteins on the outer membrane of yeast cells after galactose induction. HA-tagged adapter 1 fusion proteins (product of the display vector) expressed on the cell surface are detected by a green fluorescent signal illustrated by the cells panel a). Myc-tagged adapter2 fusion proteins (products of the helper vector) expressed on the cell surface are detected by a red fluorescent signal illustrated by the cells in panel b). The presence of a yellow fluorescent signal on the surface of the cell in panel c) establishes the co-localization of the two fusion proteins and results from the merger of the green and red fluorescence.
[0024] FIG. 6 provides a graphic representation of the yeast expression vector pMAT19 which comprises two expression cassettes, one for adapter1 fusion to heavy chain, and a second casette for a light chain, and is suitable for the display of a library comprising a repertoire of Fab-formatted antibodies.
[0025] FIG. 7 provides a schematic representation of the mammalian expression vector pMAG10. Expression of the adapter1 (GR1) fusion proteins encoded by the pMAG10 vector is under control of a mammalian promoter.
[0026] FIG. 8 provides a schematic representation of the mammalian helper vector pMAG2, which produces a fusion protein comprising an adapter2 (GR2) sequence fused to the transmembrane domain of the human EGF receptor.
[0027] FIG. 9 provides the fluorescence microscopy images of COS 6 cells displaying HA-tagged anti-VEGF scFv antibodies on their surface. The COS 6 cells depicted in the upper row of images (panels a-d) were transfected with the display vector pMAG10 and helper vector pMAG2. The surface expression of HA-tagged scFv-adapter1 fusion proteins on the plasma membrane of the co-transfected COS 6 cells depicted in panel (a) by a (green) fluorescent signal. The co-localization of the myc-tagged adapter2 fusion proteins on the surface of the COS 6 cells is depicted in panel (b) by a red fluorescent signal. The co-localization of the display vector Adapter 1 fusion protein and the helper vector plasma membrane Adapter2 fusion protein can be detected by the presence of a third color fluorescent signal, such as the yellow to orange fluorescent signal depicted in panel d) resulting from the merger of the green and red fluorescent signals contributed by the display vector and helper vector fusion proteins. The cells depicted in panel (c) were stained with a nuclear stain.
[0028] FIG. 10 sets forth the nucleotide and amino acid sequences of the adapter sequences used in the examples disclosed herein to practice the disclosed eukaryotic surface display systems. SEQ ID NO: 1 provides the nucleotide sequence encoding the adapter element referred to as adapted. SEQ ID NO:2 provides the nucleotide sequence of the adapter element referred to as adapter2. SEQ ID NOS:10 and 11 provides the amino acid sequences of adapter1 and adapter2, respectively.
[0029] FIG. 11 sets forth the nucleotide sequence of a polynucleotide encoding an anti-VEGF scFv antibody fused with adapter1 (GR1) (SEQ ID NO: 3).
[0030] FIG. 12 sets forth the nucleotide sequence of the yeast pMAT12 display vector (SEQ ID NO: 4).
[0031] FIG. 13 sets forth the nucleotide sequence of the yeast helper vector pMAT7 (SEQ ID NO: 5).
[0032] FIG. 14 sets forth the nucleotide sequence of the yeast helper vector pMAT8 (SEQ ID NO: 6).
[0033] FIG. 15 sets forth the nucleotide sequence of the yeast expression vector pMAT19 (SEQ ID NO: 7).
[0034] FIG. 16 sets forth the nucleotide sequence of the mammalian display vector pMAG10 (SEQ ID NO: 8).
[0035] FIG. 17 sets forth the nucleotide sequence of the mammalian helper vector pMAG2 (SEQ ID NO: 9).
DETAILED DESCRIPTION OF THE INVENTION
[0036] As used in this specification and claims, the singular form "a," "an," and "the" include plural references unless the context clearly dictates otherwise. As used herein the term "species" refers to a group of organisms which are very similar in morphology, anatomy, physiology and genetics due to having relatively recent common ancestors. Different species usually demonstrate common features in performing common function of life regardless their other differences. For example, human and mouse cells share certain molecular landmarks, and are 30 considered to be members of the same species (i.e., mammalian cells) while human cells and yeast cells are different species of eukaryotic host cells.
[0037] As used herein the term "genetic packages" refers to viruses or cells, in which polynucleotide sequences encoding proteins of interest are packaged for expression and/or surface display.
[0038] The terms "prokaryotic system" and "prokaryotic genetic packages" are used interchangeably herein to refer to prokaryotic cells such as bacterial cells or prokaryotic viruses such as phages or bacterial spores.
[0039] The term "eukaryotic system" and "eukaryotic host cells" are used interchangeably herein to refer to eukaryotic cells including cells of animal, plants, fungi and protists, and eukaryotic viruses such as retrovirus, adenovirus, beculovirus. As used herein the term "gene," is used to refer to a DNA sequence which codes for a protein. The term does not include untranslated flanking regions such as RNA transcription initiation signals, polyadenylation addition sites, promoters or enhancers.
[0040] The term "expression cassette" is used here to refer to a functional unit that is built in a vector for the purpose of expressing recombinant proteins/peptides. It usually consists of a promoter or promoters, a ribosome binding site or ribosome binding sites, and the cDNA of the expression target. Other accessory components can be added to construct an expression cassette.
[0041] As used herein the term "vector" refers to a nucleic acid molecule, preferably self replicating, which transfers an inserted nucleic acid molecule into and/or between host cells. Typically vectors are circular DNA comprising a replication origin, a selection marker, and or viral package signal, and other regulatory elements. Vector, vector DNA, plasmid DNA are interchangeable terms in description of this invention. The term includes vectors that function primarily for insertion of DNA or RNA into a cell, replication of vectors that function primarily for the replication of DNA or RNA, and expression vectors that function for transcription and/or translation of the DNA or RNA. Also included are vectors that provide more than one of the above functions.
[0042] As used herein the term "expression vector" is a polynucleotide which, when introduced into an appropriate host cell, can be transcribed and translated into a polypeptide(s). The terms "expression vector," multi-species expression vector" and "cross species expression vector" refer to vectors that direct the soluble expression of proteins of interest fused in frame with an adapter sequence which is characterized by an ability to associate in a pairwise fashion with an adapter sequence produced by a helper vector of the invention.
[0043] The term "helper vector" refers to a genetic package, or host cell-specific vector designed to produce fusion proteins comprising an anchor protein fused in frame with an adapter sequence which is characterized by an ability to associate in a pairwise fashion with an adapter sequence produced by an expression vector of the invention. Helper vectors can be introduced into recipient host cells, in combination with an expression vector, transiently by cotransformation, or permanently by integration into host genome.
[0044] As used herein the term "display vector set" refers to particular combinations of expression vectors and helper vectors which are designed to comprise complementary adapter sequences which function to display polypeptides on the surface of particular species of genetic packages or host cells. For example, a set of vectors pMAG10 (FIG. 7) and pMAG2 (FIG. 8) for mammalian display, a set of vectors pMAT9 (FIG. 2A) and pMAT8 (FIG. 3B) for yeast display.
[0045] As used herein the term "expression system" usually connotes a suitable host cell comprised of an expression vector that can function to yield a desired expression product.
[0046] As used herein, the term "surface antigen" refers to the plasma membrane components of a cell. It encompasses integral and peripheral membrane proteins, glycoproteins, polysaccharides and lipids that constitute the plasma membrane. An "integral membrane protein" is a transmembrane protein that extends across the lipid bilayer of the plasma membrane of a cell. A typical integral membrane protein consists of at least one "membrane spanning segment" that generally comprises hydrophobic amino acid residues. Peripheral membrane proteins do not extend into the hydrophobic interior of the lipid bilayer and they are bound to the membrane surface by noncovalent interaction with other membrane proteins.
[0047] The term "outer surface anchor" as used herein is to refer a polypeptide, or protein, or protein domain, which will be integrated into or attached on the out surface of a genetic package. It may be from the nature, or be artificially created by any means. The term as used interchangeably with the terms "surface anchor sequence" or "signal coat protein", "outer surface sequences", "outer membrane protein", "membrane anchor protein", "anchor protein", "cell wall protein", "GPI anchor signal," "GPI attachment signal," and signal anchor sequence.
[0048] The term "signal sequence" and "leader sequence" are used interchangeably herein to refer a DNA sequence encoding a secretory peptide that is a component of a larger peptide on DNA level. It may also refer the amino acide sequence of a secretory peptide. The function of secretory peptide is to direct the larger polypeptide through a secretory pathway of a cell.
[0049] As used herein the terms "polynucleotides", "nucleic acids", "nucleotides" and "oligonucleotides" are used interchangeably. They refer to a polymeric form of nucleotides of any length, either deoxyribonucleotides or ribonucleotides, or analogs thereof. Polynucleotides may have any three dimensional structure, and may perform any function, known or unknown. The following are non limiting examples of polynucleotides: coding or non-coding regions of a gene or gene fragment, loci (locus) defined from linkage analysis, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and primers. A polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The sequence of nucleotides may be interrupted by non nucleotide components. A polynucleotide may be further modified after polymerization, such as by conjugation with a labeling component.
[0050] As used herein the term "amino acid" refers to either natural and/or unnatural or synthetic amino acids, including glycine and both the D or L optical isomers, and amino acid analogs and peptidomimetics.
[0051] As used herein the terms "polypeptide", "peptide, "protein," and "protein of interest" are used interchangeably herein to refer to polymers of amino acids of any length. The polymer may be linear, cyclic, or branched, it may comprise modified amino acids, and it may be interrupted by non amino acids. The terms also encompass amino acid polymers that have been modified, for example, via sulfation, glycosylation, lipidation, acetylation, phosphorylation, iodination, methylation, oxidation, proteolytic processing, phosphorylation, prenylation, racemization, selenoylation, transfer-RNA mediated addition of amino acids to proteins such as arginylation, ubiquitination, or any other manipulation, such as conjugation with a labeling component.
[0052] As used herein the term "antibody" refers to immunoglobulin molecules and immunologically active portions of immunoglobulin molecules, i.e., molecules that contain an antigen-binding site which specifically binds ("immunoreacts with") an antigen. Structurally, the simplest naturally occurring antibody (e.g., IgG) comprises four polypeptide chains, two heavy (H) chains and two light (L) chains inter-connected by disulfide bonds. The immunoglobulins represent a large family of molecules that include several types of molecules, such as IgD, IgG, IgA, IgM and IgE. The term "immunoglobulin molecule" includes, for example, hybrid antibodies, chimeric antibodies, humanized antibodies and fragments thereof. Non-limiting examples of antibody fragments include a Fab fragment consisting of the VL, VH, CL and CH1 domains; (4) an Fd fragment consisting of the VH and CHI domains; (5) an Fv fragment consisting of the VL and VH domains of a single arm of an antibody; (6) an F(ab')2 fragment, a bivalent fragment comprising two Fab fragments linked by a disulfide bridge at the hinge region; (7) a diabody consisting of two identical single chain Fv with shorter linker; (8) a ccFv antibody consisting of Fv stabilized by a pair of coiled-coil domains interaction.
[0053] As used herein the term "pair-wise interaction" means that the two adapters can interact with and bind to each other to form a stable complex. The stable complex must be sufficiently long-lasting to permit packaging the polypeptide onto the outer surface of a genetic package. In practice, the resulting complex or dimer must be able to withstand whatever conditions exist or are introduced between the moment of formation and the moment of detecting the displayed polypeptide, these conditions being a function of the assay or reaction which is being performed.
[0054] As used herein the term "host cell" includes an individual cell or cell culture which can be, or has been, a recipient for the subject vectors. Host cells include progeny of a single host cell. The progeny may not necessarily be completely identical (in morphology or in genomic of total DNA complement) to the original parent cell due to natural, accidental, or deliberate mutation. A host cell includes cells transfected in vivo with a vector of this invention.
[0055] As used herein the term "repertoire" refers to the total collection of variant members of a functional or physical origin. A library is the total collection of homologous variant members. In general, a repertoire depicts much wider and larger functional and physical landscape, therefore, it can include libraries that are functionally defined. For example, the entire genetic capacity of immunoglobulin in a species is its immunoglobulin repertoire; for the purpose of protein engineering, a library usually refers to a collection of variant molecules that derived from one or defined number of parental (or ancestor) proteins. A repertoire created for a particular purpose, such as a collection of sequences generated during the optimization of a therapeutic antibody, includes all libraries generated for such a purpose.
[0056] As used herein the term "adapters" refer to complementary elements or components that are capable of a pair-wise interaction with each other to form a physical unity based on the physical and/or functional match between the two different interacting protein sequences. Adapters can be proteins, protein domains, peptides, compounds of non-polypeptide, etc. derived from natural or artificial origins. Typical examples for adapters include two interacting polypeptides that form a coiled-coil heterodimer such as GR1 and GR2 (SEQ ID NOS:10 and 11 respectively) polypeptide seqeunces, c-fos and c-jun, natural and artificial leucine zippers, specific protein domains derived from a ligand and its cognate receptor, sequences derived from specific binding domains of heterodimeric complexes which are known to interact with each other to form a functional unit, or protein seqeuences derived from two different non-polypeptide components such as biotin and strepavidin. Generally speaking adapters suitable for use in surface display systems described in this disclousre can be endogenous or exogenous to the host species, and/or artificially derived.
[0057] As used herein, a linear sequence of peptide is "essentially identical" to another linear sequence, if both sequences exhibit substantial amino acid or nucleotide sequence homology. Generally, essentially identical sequences are at least about 60% identical with each other, after alignment of the homologous regions. Preferably, the sequences are at least about 70% identical, more preferably, they are at least about 80% identical, more preferably, they are at least about 90% identical, of more preferably, the sequences are at least about 95% identical.
[0058] The display of polypeptides on the surface of genetic packages represents a powerful methodology for screening libraries of polypeptide sequences. The ability to construct libraries of enormous molecular diversity and to select for molecules with desired properties has made this technology broadly applicable to numerous applications, including screening/discovery protocols as well as molecular evolution protocols. The origins of phage display date to the mid1980s when George Smith first expressed an exogenous segment of a protein on the surface of bacteriophage M13 virus particles by fusing the exogenous sequence to a phage coat protein 30 (Science (1985) 228: 1315 1317). Since then, a range of display systems have been developed based on George Smith's findings. These systems can be broadly classified into two categories (U.S. Pat. Nos. 5,969,108 and 5,837,500). The first generation system is a one-vector system. The vector in this system contains the entire phage genome, insert therein an exogenous sequence in-frame with a coat protein gene. Because the resulting phage particles carry the entire phage genomes, they are relatively unstable and less infectious. The second generation system, commonly referred to as the phagemid system, has two components: (1) a phagemid vector carrying the exogenous sequence fused to phage coat protein, and a phage-derived origin of replication to allow packaging of the phagemid into a phage particle; and (2) a helper phage carrying all other sequences required for phage packaging.
[0059] The helper phage is typically replication-defective such as M13K07 helper phage manufactured by Amersham Pharmacia Biotech and its derivative VC SM13 that is produced by Stratagene. Upon superinfection of a bacterial cell with the helper phages, newly packaged phages carrying the phagemid vector and displaying the exogenous sequence are produced. As such, the prior phagemid system requires fusion of the exogenous sequence to at least part of a phage outer-surface sequence (i.e. the coat sequence). The fusion or display sites most commonly used are within genes III and VIII of M13 bacteriophage, although genes VI, VII and IX fusions have been reported.
[0060] Alternative to the coat protein fusion system, various modifications to the fusion phagemid system have been described. Crameri et al. devised a system to display eDNA products, in which Fos oncogene was inserted adjacent to the exogenous sequence to be displayed on a phagemid vector, and Jun oncogene was inserted adjacent to gene III on the same vector (see Crameri et al. (1993) Gene 137:69 75). The Crameri approach exploits the preferential interaction between fos and jun proteins: as the Fos-exogenous polypeptide is expressed and secreted into the periplasmic space, it forms a complex with pIII-Jun which is then packaged into the phage particles upon super infection with M13K07 helper phage.
[0061] Another variant similar to the Crameri system is the "cysteine-coupled" display system described in WO 01/05950, U.S. Pat. No. 6,753,136. The attachment and display of the exogenous polypeptide are mediated by the formation of disulfide bond between two cysteine residues in the bacterial periplasmic space, one of which is contained in the exogenous sequence, and the other is inserted in the outer-surface sequence. Although those two systems avoid the expression of a fusion comprising the exogenous protein linked to an outer-surface protein, the systems fails to minimize the toxicity of coat proteins to the host cells because of the constitutive expression of the coat protein pill in display vectors. In addition, the formation of disulfide bond between two cysteine residues requires high level expression of both of exogenous sequence and coat protein pill. Therefore, any lower expression member will lose the chance to display.
[0062] Recently, Wang et al described an alternative phage display system based on an adapter-directed display system (U.S. Pat. No. 7,175,983), which comprise: (a) an expression vector comprising a coding sequence that encodes the exogenous polypeptide fused in-frame to a first adapter sequence; (b) a helper vector comprising outer-surface sequences encoding outer-surface proteins necessary for packaging the phage particle, and one of the outer-surface proteins is fused in- frame to a second adapter. Therefore, displays of the exogenous polypeptides are achieved by pairwise interaction between the first and second adapters.
[0063] Display of polypeptides on the surface of E. coli was developed as an alternative to phage display technology. Similar to phage display, bacterial display is an attractive method due to the availability of various genetic tools and mutant strains, and its high transformation efficiency that makes it ideal for large size library construction and screening. In gram-negative bacteria, surface display systems based on fusion of protein to be display to various anchoring proteins have been reported, in which outer membrane proteins (Chang and Lo 2000, J Biotechnol 78:115-122; Lee et al. 2004, Appl Environ Microbiol 70:5074-5080), pili and flagella (Westerlund-Wikstrom et al. 1997, Protein Eng 10:1319-1326), modified lipoproteins (Georgiou et al. 1996, Protein Eng 9: 239-247), ice nucleation proteins (Jung et al. 1998, Nat Biotechnol 16:576-580), and autotransporters (Veiga et al. 2003, J Bacteriol 185:5585-5590) were used as the anchors for display.
[0064] The display of heterologous protein on the cell wall of the eukaryotic host cell Saccharornyces cerevisiae was first described in 1993 by fusion of alpha-galactosidase to C-terminal half of cell well protein alpha-agglutinin AGA1 (Schreuder M P et al, Yeast 9:399-409). Since then, various yeast display systems base on fusion of the protein of interest to various cell well proteins were reported (Kondo M et al). Almost all of the cell-surface display systems developed for yeast are glycosyl phosphatidylinositol (GPI) anchor-dependent. More than a dozen of yeast cell well proteins with a putative GPI attachment signal at the C-termini have been proven capable of displaying peptides and proteins, which includes a-agglutinin (Aga1 and Aga2), Cwp1, Cwp2, Gas1p, Yap3p, Flo1p, Crh2p, Pir1, Pir2, Pir4, and Icwp in S.cerevisiae; HpSEDI, HpGASI, HpTIPI, HPWPI in Hansenula polymorpha, and HwpIp, Als3p, Rbt5p in Candida albicans. To date, over twenty heterologous proteins have been successfully displayed on yeast cell surface.
[0065] Among all of the cell-surface display systems described above, the system created by Dane Wittrup base on a-agglutinin receptor has been widely used for display various peptides and proteins such as scFv antibody and antibody libraries (U.S. Pat. Nos. 6,300,065, 6,423,538, 6,696,251, and 6,699,658). In S. cerevisiae, the a-agglutinin receptor acts as an adhesion molecule to stabilize cell-cell interactions and facilitate fusion between mating "a" and a haploid yeast cells. The receptor consists of a core subunit Aga1 and small subunit Aga2. Aga1 is secreted from the cell and becomes covalently attached to E1-linked glucans in the extra cellular matrix of the yeast cell wall though its GPI anchor-attachment signal. Aga2 binds to Aga1 through two disulfide bonds, presumably in the golgi, and after secretion remains attached to the cell via Aga1. This yeast display system takes advantage of the association of Aga1 and Aga2 proteins to display a recombinant protein on the yeast cell surface through fusion of protein with the Aga2 subunit.
[0066] The Wittrup system has been adapted for multi-chain polypeptides such as immunoglobulin Fab fragments (Hufton et al, U.S. patent application 2003/0186374 A1). Hufton et al mention the possible use of the Fos/Jun interaction as the basis of a display system suitable for use in eukaryotic cells. However, Hufton et at did not provide an enabling description of how the Fos/Jun interaction can be utilized to direct protein display in eukaryotic host cells. However the reference only teaches how to use Ag2 fusion developed by Dane Wittrup (U.S. Pat. Nos. 6,300,065, 6,423,538, 6,696,251, and 6,699,658) for yeast display of Fab antibody, and how to transfer gene from phage display vector to yeast vector by molecular cloning.
[0067] A number of approaches have been used to achieve display of proteins on the surface of mammalian cells based on the use of display vectors that are designed to display a repertoire of proteins of interest directly fused to various membrane anchor proteins, which includes membrane domains of cell surface receptors (Chesnut et al, 1996, J Immunological Methods; Ho et al, 2006, PNAS, 103:9637-9642), GPI anchor sequences (U.S. Pat. No. 6,838,446), non-cleavable type 11 signal anchor sequences (U.S. Pat. No. 7,125,973). A typical example is the pDISPLAY vector,that is a commercially available vector to display protein on mammalian cell surface provided by Invitrogen Corp. In this vector, the protein of interest will be fused with a membrane domain of cell surface receptor PDGFR. An alternative approach was also reported in U.S. Pat. No. 6,919,183. In this system, a cell surface capture molecule such as protein G, protein A was used to capture the antibody molecules on mammalian cell surface.
Display Systems of the Invention
[0068] The invention provides a new display system that is capable of multi-species eukaryotic display. More specifically, using the same display vector, without any molecular manipulations such as DNA digestions and cloning, a protein of interest can be displayed on the surface of multi-species such as yeast cells and mammalian cells, or expressed as a soluble protein in a eukaryotic host cells. The display systems of the invention utilize particular pairs of display vectors and helper vectors for each species. The different vector sets of the invention comprise a multi-species expression vector, encoding a library of polypeptide sequences fused to a first adapter (i.e. adapted), in combination with a helper vectors that is specific for particular genetic packages or host cell. Each of the helper vectors comprise a cell surface anchor protein fused to a second adapter (i.e., adapter2). As shown herein, the co-expression of a multi-species display vector in combination with a helper vector which comprises a corresponding adapter produces a collection of genetic packages (or host cells) which has a repertoire of polypeptide sequences displayed on its surface via the pairwise interaction of the adapters (i.e. adapter1 and adapter2).
Components of the Vectors
Adapters
[0069] Adapter sequences applicable for constructing the display and helper vectors of the subject display system can be derived from a variety of sources. Generally, any protein sequences involved in the formation of stable multimers are candidate adapter sequences. As such, these sequences may be derived from any homomultimeric or heteromultimeric protein complexes. Representative homomultimeric proteins are homodimeric receptors (e.g. plateletderived growth factor homodimer BB (PDGF), homodimeric transcription factors (e.g. Max homodimer, NF-kappaB p65 (ReIA) homodimer), and growth factors (e.g. neurotrophin homodimers). Non-limiting examples of heteromultimeric proteins are complexes of protein kinases and SH2-domain-containing proteins (Cantley et al. (1993) Cell 72: 767 778; Cantley et al. (1995) J. Biol. Chem. 270(44): 26029 26032), heterodimeric transcription factors, and heterodimeric receptors. A vast number of heterodimeric receptors are known, including but not limited to receptors that bind to growth factors (e.g. heregulin), neurotransmitters (e.g. gamma.Aminobutyric acid), and other organic or inorganic small molecules (e.g. mineralocorticoid, glucocorticoid). Preferred heterodimeric receptors are nuclear hormone receptors (Belshaw et al (1996) Proc. Natl. Acad. Sci. U.S.A 93(10):4604 4607), erbB3 and erbB2 receptor complex, and G-protein-coupled receptors including but not limited to opioid (Gomes et al. (2000) J. Neuroscience 20(22): RC110); Jordan et al. (1999) Nature 399:697 700), muscarinic, dopamine, serotonin, adenosine/dopamine, and gamma-aminobutyric acid GABA families of receptors. Generally speaking, the majority of the known heterodimeric receptors, comprise C-terminal sequences that mediate heterodimer formation.
[0070] GABAB-R1/GABAB-R2 receptors exhibit the above-mentioned physical properties. These two receptors are essentially incapable of forming homodimers under physiological conditions (e.g. in vivo) and at physiological body temperatures, Research by Kuner et al. and White et al. (Science (1999) 283: 74 77); Nature (1998) 396: 679 682)) has demonstrated the heterodimerization specificity of GABAB-R1 and GABAB -R2 C-terminus in vivo. In fact, White et al. were able to clone GABAB-R2 from yeast cells based on the exclusive specificity of this heterodimeric receptor pair. In vitro studies by Kammerer et al. (Biochemistry, 1999, 38: 13263-13269) has shown that neither GABAB-R1 nor GABAB-R2 C-terminal sequence is capable of forming homodimers in physiological buffer conditions when assayed at physiological body temperatures. Specifically, Kammerer et al. have demonstrated by sedimentation experiments that the heterodimerization sequences of GABAB receptor 1 and 2, when tested alone, sediment at the molecular mass of the monomer under physiological conditions and at physiological body temperatures. When mixed in equimolar amounts, GABAB receptor 1 and 2 heterodimerization sequences sediment at the molecular mass corresponding to the heterodimer of the two sequences (see Table I of Kammerer et al.). However, when the GABAB-R1 and GABAB R2 C-terminal sequences are linked to a cysteine residue, homodimers may occur via formation of disulfide bond.
[0071] A diverse variety of coiled coils involved in multimer formation can be employed as the adapters in the subject display system. Preferred coiled coils are derived from heterodimeric receptors. Accordingly, the present invention encompasses coiled-coil adapters derived from GABAB receptors 1 and 2. In one aspect, the subject coiled coils adapters comprises a C-terminal sequences of GABAB receptor 1, referred to herein as GR1 EEKSRLLEKENRELEKIIAEKEERVSELRHQLQSVGGC (SEQ ID NO:10) and a sequence of GABAB receptor 2, referred to herein as GR2 TSRLEGLQSENHRLRMKITELDKDLEEVTMQLQDVGGC(SEQ ID NO:11).
[0072] It is to be understood that although the examples describe the use of vector sets which comprise the same pair of adapter sequences (referred to as adapter1 in the context of expression vectors and adapter2 in the context of helper display vectors), the vectors described herein can be prepared, and the methods of the invention can be practiced, using alternative adapters.
[0073] For example, based on the disclosure provided herein suitable adapter sequence can be derived from any of a number of coiled coil domains including for example Winzip-A2B 1(Katj a M Arndt et al, Structure, 2002,10:1235-1248); Winzip-A1B1(Katja M Arndt et al, JMB, 2000, 295:627-639); FNfnl O(Sanjib Dutta et al, Protein Science, 2005, 14:2838-2848), IAAL 15 E3/K3(Jennifer R. Litowski and Robert S Hodges, JBC, 2002,277(40)37272-37279), PcrV/PcrG (Max Nanao et al, BMC Microbiology, 2003:1-9), bZip and derivatives (Jumi A. Shin, Pure Appl. Chem., 2004, 76(7-8):1579-1590),ESCRT-I'II (David J. Gill et al, The EMBO Journal, 2007, 26:600-612), EE1234/RR1234 and derivatives (Johnthan R. Moll et al, Protein Science, 2001, 10:649-655), Laminin a, b, g.(Atsushi Utani et al, JBC 1995, 270(7):3292-3298), Peptides A/B and derivatives(Ilian Jelesarov and Hans Rudolf Bosshard, JMB, 1996, 263:344358), artificially designed peptides (Derek N. Woolfson and Tom Alber, Protein Science, 1995, 4:1596-1607), DcoH-HNF-p1 (Robert. B Rose et al, Nat. Struct. Biol., 2000, 7(9):744-748), and APC peptides (Catherine L. Day and Tom Alber, JMB, 2000, 301:147-156).
[0074] Depending upon the affinity of the adapter subunit interaction associated with a particular pair of adapter subunits it may be possible to eliminate the need for using a disulfide bond to stabilize the resulting coiled coil interaction. For example, the affinities reported in the literature for some of the coiled coil domains listed above range from 0.00001 nM to 70 nM (4.5 nM for Winzip-A2B1, 24 nM for Winzip-AIB1, 3 nM for FNfn10; 70 nM for 1 AL-E3/K3, 15.6 nM for PcrV/PcrG and 0.0001 nM for EE1234/RR1234 and derivatives).
[0075] Alternative heterodimeric transcription factors that are suitable for use as adapters include alpha-Pal/Max complexes and Hox/Pbx complexes Hox represents a large family of transcription factors involved in patterning the anterior-posterior axis during embryogenesis. Hox proteins bind DNA with a conserved three alpha helix homeodomain. In order to bind to specific DNA sequences, Hox proteins require the presence of hetero-partners such as the Pbx homeodomain. Wolberger et al. solved the 2.35 ANG. crystal structure of a Hox13I-Pbx1-DNA ternary complex in order to understand how Hox-Pbx complex formation occurs and how this complex binds to DNA. The structure shows that the homeodomain of each protein binds to adjacent recognition sequences on opposite sides of the DNA. Heterodimerization occurs through contacts formed between a six amino acid hexapeptide Nterminal to the homeodomain of HoxB1 and a pocket in Pbx1 formed between helix 3 and helices I and 2. A C-terminal extension of the Pbx1 homeodomain forms an alpha helix that packs against helix 1 to form a larger four helix homeodomain (Wolberger et al. (1999) Cell 96: 587 597; Wolberger et al. J Mol Biol. 291: 521 530).
[0076] For example, sequences from novel hetermultimeric proteins can be employed as adapters. In such situation, the identification of candidate sequences involved in formation of heteromultimers can be determined by any genetic or biochemical assays without undue experimentation. Additionally, computer modeling and searching technologies further facilitates detection of heteromultimeric sequences based on sequence homologies of common domains appeared in related and unrelated genes. Non-limiting examples of programs that allow homology searches are Blast, Fasta (Genetics Computing Group package, Madison, Wis.), DNA Star, Clustlaw, TOFFEE, COBLATH, Genthreader, and MegAlign. Any sequence databases that contains DNA sequences corresponding to a target receptor or a segment thereof can be used for sequence analysis. Commonly employed databases include but are not limited to GenBank, EMBL, DDBJ, PDB, SWISS-PROT, EST, STS, GSS, and HTGS.
[0077] Suitable adapters that are derived from heterodimerization sequences can be further characterized based on their physical properties. Preferred heterodimerization sequences exhibit pairwise affinity resulting in predominant formation of heterodimers to a substantial exclusion of homodimers. Preferably, the predominant formation yields a heteromultimeric pool that contains at least 60% heterodimers, more preferably at least 80% heterodimers, more preferably between 85 to 90% heterodimers, and more preferably between 90 to 95% heterodimers, and even more preferably between 96-99% heterodimers that are allowed to form under physiological buffer conditions and/or physiological body temperatures. In certain embodiments of the present invention, at least one of the heterodimerization sequences of the adapter pair is essentially incapable of forming a homodimer in a physiological buffer and/or at physiological body temperature. By "essentially incapable" is meant that the selected heterodimerization sequences when tested alone do not yield detectable amounts of homodimers in an in vitro sedimentation experiment as detailed in Kammerer et al. (1999) Biochemistry 38: 13263 13269), or in the in vivo two-hybrid yeast analysis (see e.g. White et al. Nature (1998) 396: 679 682). In addition, individual heterodimerization sequences can be expressed in a host cell and the absence of homodimers in the host cell can be demonstrated by a variety of protein analyses including but not limited to SDS-PAGE, Western blot, and immunoprecipitation. The in vitro assays must be conducted under a physiological buffer conditions, and/or preferably at physiological body temperatures. Generally, a physiological buffer contains a physiological concentration of salt and at adjusted to a neutral pH ranging from about 6.5 to about 7.8, and preferably from about 7.0 to about 7.5. A variety of physiological buffers is listed in Sambrook et al. (1989) supra and hence is not detailed herein. Preferred physiological conditions are described in Kammerer et al., (Biochemistry, 1999, 38: 13263-13269)
[0078] Adapters can be further characterized based on their secondary structures. Preferred adapters consist of amphiphilic peptides that adopt a coiled-coil helical structure. The helical coiled coil is one of the principal subunit oligomerization sequences in proteins. Primary sequence analysis reveals that approximately 23% of all protein residues form coiled coils (Wolf et al. (1997) Protein Sci. 6:1179 1189). Well-characterized coiled-coil-containing proteins include members of the cytoskeletal family (e.g. alpha.-keratin, vimentin), cytoskeletal motor family (e.g. myosine, kinesins, and dyneins), viral membrane proteins (e.g. membrane proteins of Ebola or HIV), DNA binding proteins, and cell surface receptors (e.g., GABAB receptors 1 and 2).
[0079] Coiled-coil adapters of the present invention can be broadly classified into two groups, namely the left-handed and right-handed coiled coils. The left-handed coiled coils are characterized by a heptad repeat denoted "abcdefg" with the occurrence of polar residues preferentially located at the first (a) and fourth (d) position. The residues at these two positions typically constitute a zig-zag pattern of "knobs and holes" that interlock with those of the other stand to form a tight-fitting hydrophobic core. In contrast, the second (b), third (c) and sixth (f) positions that cover the periphery of the coiled coil are preferably charged residues. Examples of charged amino acids include basic residues such as lysine, arginine, histidine, and acidic residues such as aspartate, glutamate, asparagine, and glutamine. Uncharged or polar amino acids suitable for designing a heterodimeric coiled coil include but are not limited to glycine, alanine, valine, leucine, isoleucine, serine and threonine. While the uncharged residues typically form the hydrophobic core, inter-helical and intra-helical salt-bridge including charged residues even at core positions may be employed to stabilize the overall helical coiled-coiled structure (Burkhard et al (2000) J. Biol. Chem. 275:11672 -11677). Whereas varying lengths of coiled coil may be employed, the subject coiled coil adapters preferably contain two to ten heptad repeats. More preferably, the adapters contain three to eight heptad repeats, even more preferably contain four to five heptad repeats.
[0080] In designing optimal coiled-coil adapters, a variety of existing computer software programs that predict the secondary structure of a peptide can be used. An illustrative computer analysis uses the COILS algorithm which compares an amino acid sequence with sequences in the database of known two-stranded coiled coils, and predicts the high probability coiled-coil stretches (Kammerer et al. (1999) Biochemistry 38:13263 13269). Base on design and selection, a variety of engineered coiled coil sequences were reported, with affinity of nanomole to fentomole region (Structure, 2002, 10(9):1235-48; J Mol Biol. 2000, 21;295(3):627-39; Protein Sci. 2005, 14(11):283848; J 13iol Chem. 2002,277(40):37272-9; BMC Microbiol. 2003, 18:3:21; Protein Science, 2001, 10:649-655). For Example, engineered heterodimeric coiled coil sequences derived from human B-ZIP give a fentomole dissociate constant, which is similar to that for Biotin/Streptavidin interaction.
[0081] Another class of preferred coiled coil adapters are leucine zippers. The leucine zipper have been defined in the art as a stretch of about 35 amino acids containing 45 leucine residues separated from each other by six amino acids (Maniatis and Abel, (1989) Nature 341:24). The leucine zipper has been found to occur in a variety of eukaryotic DNA-binding proteins, such as GCN4, C/EBP, c-fos gene product (Fos), c jun gene product (Jun), and c-Myc gene product. In these proteins, the leucine zipper creates a dimerization interface wherein proteins containing leucine zippers may form stable homodimers and/or heterodimers. Molecular analysis of the protein products encoded by two proto-oncogenes, c-fos and c-jun, has revealed such a case of preferential heterodimer formation (Gentz et al., (1989) Science 243:1695; Nakabeppu et al., (1988) Cell 55:907; Cohen et al., (1989) Genes Dev. 3:173). Synthetic peptides comprising the leucine zipper regions of Fos and Jun have also been shown to mediate heterodimer formation, and, where the amino-termini of the synthetic peptides each include a cysteine residue to permit intermolecular disulfide bonding, heterodimer formation occurs to the substantial exclusion of homodimerization.
[0082] The leucine-zipper adapters of the present invention have the general structural formula known as the heptad repeat (Leucine-X1-X2-X3-X4-X5-X.sub.6)n, where X may be any of the conventional 20 amino acids, but are most likely to be amino acids with alpha-helix forming potential, for example, alanine, valine, aspartic acid, glutamic acid, and lysine, and n may be 2 or greater, although typically n is 3 to 10, preferably 4 to 8, more preferably 4 to 5. Preferred sequences are the Fos or Jun leucine zippers.
[0083] Sequence of antibody chains that are involved in dimerizing the L and H chains can also be used as adapters for constructing the subject display systems. These sequences include but are not limited to constant region sequences of an L or H chain. Additionally, adapter sequences can be derived from antigen-binding site sequences and its binding antigen. In such case, one adapter of the pair contains antigen-binding site amino acid residues that is recognized (i.e. being able to stably associate with) by the other adapter containing the corresponding antigen residues.
[0084] The pairwise interaction between the first and second adapters may be covalent or non-covalent interactions. Non-covalent interactions encompass every exiting stable linkage that do not result in the formation of a covalent bond. Non-limiting examples of noncovalent interactions include electrostatic bonds, hydrogen bonding, Van der Waal's forces, steric interdigitation of amphiphilic peptides. By contrast, covalent interactions result in the formation of covalent bonds, including but not limited to disulfide bond between two cysteine residues, C--C bond between two carbon-containing molecules, C--O or C--H between a carbon and oxygen or hydrogen-containing molecules respectively, and O--P bond between an oxygen- and phosphate-containing molecule.
[0085] Based on the wealth of genetic and biochemical data on vast families of genes, 5 one of ordinary skill will be able to select and obtain suitable adapter sequences for constructing the subject display system without undue experimentation.
Outer Surface Anchor Protein
[0086] For yeast display, suitable outer surface anchor proteins can be any of the outer wall proteins, with or without, GPI signal, which includes a-agglutinin (Aga1 and Aga2) Cwp1, Cwp2, Gas1p, Yap3p, Flo1p, Crh2p, Pir1, Pir2, Pir4, and Icwp in S.cerevisiae; HpSED1, HpGASI, HpTIP1, HPWPI in Hansenula polymorpha, and Hwp1p, Als3p, Rbt5p in Candida albicans. Alternatively, the methods of the invention can be practiced in the context of yeast using a cell surface anchor which is an artificial sequence that can be assembled into, or attached to the outer wall of yeast. As shown herein, Example 7 shows yeast display of scFv antibody by using helper vector pMAT7, or pMAT8, in which the yeast outer surface anchor protein is from Cwp2, or Aga2 depicted in FIGS. 4A and 4B.
[0087] Mammalian cell surface display can be practiced using a transmembrane domain of any known cell membrane proteins, or a polypeptides with GPI anchor sequences, or a noncleavable type 1 signal anchor sequences as a surface anchor. Alternatively, the methods of the invention can be practiced in the context of mammalian cells using a cell surface anchor which is an artificial sequence that can be assembled into, or attached to the cell membrane of mammalian cells. As shown herein, Example 11 shows the display of scFv protein on the mammalian cells by using helper vector pMAG2 (FIG. 8), in which the transmembrane domain of human EGF receptor fused to adapter2 is used as surface anchor for display.
Signal Sequences
[0088] Signal sequences from both prokaryotes and eukaryotes are built along the same general lines. They are about 15-30 amino acids in length and consist of three regions: a positively charged N-terminal region, a central hydrophobic region, and a more polar C-terminal region. There is a large amount of functional and structural homology between the signal peptides of prokaryotic and eukaryotic systems. Therefore, it is expected that some native signal peptides will function in both prokaryotes and eukaryotes.
[0089] Consistent with this expectation, some eukaryotic signal peptides have been reported to be functional in prokaryotic cells. For example, the signal peptide from human 10 growth hormone (hGH) and rat proinsulin protein function in E. coil (Gene, 1985, 39:247-254); yeast signal peptide of Endo-beta-1,3-glucanase are also functional in E coli (Protein Exp. Puri, 2000, 20.252-264). In addition, the prokaryotic signal peptides of Staphlococcal protein A, bacterial b-lactamase protein, and bacterial OmpA are functional in mammalian cells (Humphreys et al, Protein Exp. Purif. 2000, 20:252-264). Examples of signal peptides that work cross between yeast and mammalian cells are the signal peptides for human pancreatic lipase protein 1 (HPLRPI), human interferon, Human bile salt-stimulated lipase, and yeast Saccharomyces cerevisiae invertase (SUC2) (Tohoku J Exp Med, 1996, 180: 297-308; Protein Exp. Puri, 2006, 47:415-421; Protein Exp. Purif, 1998, 14:425-433).
[0090] Any of the native signal peptides including those identified above for their ability to function in a specific species may be used as signal peptides for the expression vector of this invention. In addition, an artificial signal peptide sequence characterized by the ability to function in eukaryotic host cells may also be used to practice the methods disclosed herein. The artificial signal peptides may be isolated from the design signal peptide libraries.
[0091] The vectors of the present invention generally comprise transcriptional or translational control sequences required for expressing the exogenous polypeptide. Suitable transcription or translational control sequences include but are not limited to replication origin, promoter, enhancer, repressor binding regions, transcription initiation sites, ribosome binding sites, translation initiation sites, and termination sites for transcription and translation.
[0092] The origin of replication (generally referred to as an ori sequence) permits replication of the vector in a suitable host cell. The choice of ori will depend on the type of host cells and/or genetic packages that are employed. Where the host cells are prokaryotes, the expression vector typically comprises two ori sequences, one directing autonomous replication of the vector within the prokaryotic cells, and the other ori supports packaging of the phage particles. Preferred prokaryotic ori is capable of directing vector replication in bacterial cells. Non-limiting examples of this class of ori include pMB1, pUC, as well as other E. Coli origins. Preferred ori supporting packaging of the phage particles includes but is not limited to f1 ori, Pf3 phage replication ori. For example, the pUC ori and f1 on are built in the expression vectors in this invention for yeast and mammalian display.
[0093] In the eukaryotic system, higher eukaryotes contain multiple origins of DNA replication (estimated 10e4-10e6 ori/mammalian genome), but the ori sequences are not so clearly defined. The suitable origins for mammalian vectors are normally from eukaryotic viruses. Preferred eukaryotic ori includes but is not limited to SV40 ori, EBV ori, HSV oris. The suitable ori for yeast cells includes but is not limited to 2u ori CEN61ARS4 ori.
[0094] As used herein, a "promoter" is a DNA region capable under certain conditions of binding RNA polymerase and initiating transcription of a coding region located downstream (in the 3' direction) from the promoter. It can be constitutive or inducible. In general, the promoter sequence is bounded at its 3' terminus by the transcription initiation site and extends upstream (5' direction) to include the minimum number of bases or elements necessary to initiate transcription at levels detectable above background. Within the promoter sequence is a transcription initiation site, as well as protein binding domains responsible for the binding of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" boxes and "CAT" boxes.
[0095] The choice of promoters will largely depend on the host cells in which the vector is introduced. For prokaryotic cells, a variety of robust promoters are known in the art. Preferred promoters are lac promoter, Trc promoter, T7 promoter and pBAD promoter. Normally, to obtain expression of exogenous sequence in multiple species, the prokaryotic promoter can be placed immediately after the eukaryotic promoter, or inside an intron sequence downstream of the eukaryotic promoter.
[0096] Suitable promoter sequences for other eukaryotic cells include the promoters for 3-phosphoglycerate kinase, or other glycolytic enzymes, such as enolase, glyceraldehyde-3phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. Other promoters, which have the additional advantage of transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, and the aforementioned glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Preferred promoters for mammalian cells are SV40 promoter, CMV promoter, β-actin promoter and their hybrids. Preferred promoter for yeast cell includes but is not limited to GAL 10, GAL I, TEFI in S. cerevisia, and GAP, AOX1 in P. pastoris.
[0097] In constructing the subject vectors, the termination sequences associated with the exogenous sequence are also inserted into the 3' end of the sequence desired to be transcribed to provide polyadenylation of the mRNA and/or transcriptional termination signal. The terminator sequence preferably contains one or more transcriptional termination sequences (such as polyadenylation sequences) and may also be lengthened by the inclusion of additional DNA sequence so as to further disrupt transcriptional read-through. Preferred terminator sequences (or termination sites) of the present invention have a gene that is followed by a transcription termination sequence, either its own termination sequence or a heterologous termination sequence. Examples of such termination sequences include stop codons coupled to various yeast transcriptional termination sequences or mammalian polyadenylation sequences that are known in the art, widely available, and exemplified below. Where the terminator comprises a gene, it can be advantageous to use a gene which encodes a detectable or selectable marker; thereby providing a means by which the presence and/or absence of the terminator sequence (and therefore the corresponding inactivation and/or activation of the transcription unit) can be detected and/or selected.
[0098] In addition to the above-described elements, the vectors may contain a selectable marker (for example, a gene encoding a protein necessary for the survival or growth of a host cell transformed with the vector), although such a marker gene can be carried on another polynucleotide sequence co-introduced into the host cell. Only those host cells into which a selectable gene has been introduced will survive and/or grow under selective conditions. Typical selection genes encode protein(s) that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin, kanamycin, neomycin, zeocin, G418, methotrexate, etc.; (b) complement auxotrophic deficiencies; or (c) supply critical nutrients not available from complex media. The choice of the proper marker gene will depend on the host cell, and appropriate genes for different hosts are known in the art.
[0099] In one embodiment of the invention, the expression vector is a shuttle vector, capable of replicating in at least two unrelated host systems. In order to facilitate such replication, the vector generally contains at least two origins of replication, one effective in each host system. Typically, shuttle vectors are capable of replicating in a eukaryotic host system and a prokaryotic host system. This enables detection of protein expression in the eukaryotic host (the expression cell type) and amplification of the vector in the prokaryotic host (the amplification cell type). Preferably, one origin of replication is derived from SV40 or 2u and one is derived from pUC, although any suitable origin known in the art may be used provided it directs replication of the vector. Where the vector is a shuttle vector, the vector preferably contains at least two selectable markers, one for the expression cell type and one for the amplification cell type. Any selectable marker known in the art or those described herein may be used provided it functions in the expression system being utilized
[0100] In one embodiment of the invention, the expression vector comprises more than one expression cassettes for multi-chain protein complex. Each cassette comprises promoter, signal sequence, gene of interest, and transcription termination sequence. To display multi-chain complex on the eukaryotic cell surface, at lease one of the expression cassettes will express adapter1 fusion with one chain of the multi-chain complex. For example, to display full length antibody or antibody Fab fragment (heavy chain and light chain), at least one expression cassette will express adapter1 fusion with either heavy chain or light chain. Alternatively, yeast mating system can be used for display of multi-chain complex. The expression cassettes for multi-chains can be split into two expression vectors. The first expression vector can be introduced into one mating type MATa strain, second vector will be induced into another mating type MATa strain. The two vectors will be brought together in a single diploid by yeast mating. For display, at least one expression vector comprises at least one express cassette for adapter1 fusion with one chain of the multi-chain complex.
[0101] The vectors encompassed by the invention can be obtained using recombinant cloning methods and/or by chemical synthesis. A vast number of recombinant cloning techniques such as PCR, restriction endonuclease digestion and ligation are well known in the art, and need not be described in detail herein. One of skill in the art can also use the sequence data provided herein or that in the public or proprietary databases to obtain a desired vector by any synthetic means available in the art. Additionally, using well-known restriction and ligation techniques, appropriate sequences can be excised from various DNA sources and integrated in operative relationship with the exogenous sequences to be expressed in accordance with the present invention.
[0102] The examples and figures provided with this disclosure illustrate practice of the present invention in multi-species display of protein of interest on the eukaryotic systems. The following examples are meant to be illustrative of an embodiment of the present invention and should not limit the scope of the invention in any way. A number of modifications and variations will be apparent to the skilled artisan from reading this disclosure. Such modifications and variations constitute part of the invention.
[0103] The practice of the present invention employs, unless otherwise indicated, conventional techniques of cell biology, molecular biology, cell culture and the like which are in the skill of one in the art. All publications and patent applications cited in the specification are indicative of the level of skill of those skilled in the art to which this invention pertains and are hereby incorporated by reference in their entirety.
[0104] Although the various compositions and methods of the invention (multi-species and cross-species display strategies) of the invention are exemplified herein using a coding sequence for an anti-VEGF antibody, a skilled artisan will readily appreciate that libraries of expression cassettes encoding diverse libraries of antibody sequences can be used in the expression and display vector sets of the invention to accomplish antibody discovery and engineering protocols.
[0105] The practice of the present invention will employ, unless otherwise indicated, conventional techniques of immunology, biochemistry, chemistry, molecular biology, microbiology, cell biology, genomics and recombinant DNA, which are within the skill of the art. See, e.g., PHAGE DISPLAY OF PEPTIDES AND PROTEINS (B. K. Kay et al., 1996); PHAGE DISPLAY, A LABORATORY MANUAL (C. F. Barbas III et al., 2001) Sambrook, Fritsch and Maniatis, MOLECULAR CLONING: A LABORATORY MANUAL, 2nd edition (1989); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY (F. M. Ausubel, et ad. eds., (1987)); the series METHODS IN ENZYMOLOGY (Academic Press, Inc.): PCR 2: A PRACTICAL APPROACH (M. J. MacPherson, B. D. Hames and G. R. Taylor eds. (1995)), Harlow and Lane, eds. (1988) ANTIBODIES, A LABORATORY MANUAL, and ANIMAL CELL CULTURE (R. I. Freshney, ed. (1987)).
[0106] Further illustration of the development and use of subject vectors, display systems and host cells are provided in the EXAMPLES section below. The examples are provided as a guide to a practitioner of ordinary skill in the art, and are not meant to be limiting in any way.
EXAMPLES
Example 1
Construction Yeast Expression Vector pMAT9
[0107] pMAT9 vector (FIG. 2A) comprise a expression cassette for an anti-VEGF scFv antibody fused with adapter1 (GR1) (SEQ ID NO:1) and detectable HA and His6 tag (DH tag). It was constructed by insertion of a fully synthetic gene fragment into a commercial pESC-TRP vector (Stratagene) through cloning sites EcoRT and Pad. The scFv-GR1 fusion protein is under the control of a galactose-induced promoter GAL10 and a signal sequence of yeast endo-beta-1,3-glucanase (Bg12). The fusion gene sequence (SEQ ID NO:3) was confirmed by standard DNA sequencing method.
Example 2
Construction Yeast Expression Vector pMAT12
[0108] pMAT12 (SEQ ID NO: 4) provides an expression vectors that is suitable for expression in yeast cells. As shown in FIG. 2B, pMAT12 is created on the backbone of commercial vector pUC 19 by insertion at Aatll and Pcil sites with a fully synthetic DNA fragment (Codon Devices). This fully synthetic DNA fragment comprises (1) f1 ori; (2) a expression cassette for the adapter GR1 (SEQ ID NO:1) fusion, which is driven by a yeast pGAL1 promoter. The sequence of anti-VEGF scFv antibody is built in the downstream of a yeast signal sequence (yeast endo-B-1,3-glucanase protein Bg12p). The HA-His6 tag (DH-tag) sequences are upstream of GR1 sequence for protein detection and Ni-NTA purification. (3) yeast CEN/ARS ori for replication; and (4) a expression cassette for yeast TRP1 auxotrophic marker. The synthetic DNA sequence was confirmed by standard DNA sequencing method.
Example 3
Yeast Helper Vector pMAT7
[0109] pMAT7, which is graphically depicted in FIG. 3A, is another yeast display helper vector which expresses a fusion protein comprising the yeast outer surface protein Aga2 in frame with adapter 2 (GR2) (SEQ ID NO: 2). This vector was created from the yeast helper vector pMAT3 by replacing BamHI-HindIII fragment with a synthetic DNA fragment of 208 bp. This synthetic DNA comprises the sequence encoding yeast out well protein Aga2. Using BioFab platform technology of Codon Devices, the errors generated from oligo synthesis were corrected by oligo selection with sequences complementary to the synthetic genes, and affinity purification of Mut-S protein column. The nucleotide sequence of pMAT7 (SEQ ID NO: 5) was confirmed by standard DNA sequencing.
Example 4
Construction of Yeast Helper Vector pMAT8
[0110] The vector pMAT8 shown in FIG. 3B is yeast helper vector, which expresses a fusion protein of adapter 2 with yeast out surface GPI anchoring protein Cwp2. This vector was constructed on the backbone of commercial vector pUC19 by insertion at AatII and PciI sites with a fully synthetic DNA fragment. This fully synthetic DNA comprises sequences for three expression cassettes: (1) yeast URA3 selection marker; (2) Zeocin marker for yeast selection; (3) adapter GR2 fusion with yeast out well protein Cwp2 and Flo1 S/T rich region, under control of yeast pGAL1 promoter and the secretory signal sequence of Flo1. Using BioFab platform technology of Codon Devices at Boston, the errors in synthetic DNA was corrected by oligo selection with sequences complementary to the synthetic genes, and affinity purification of Mut-S protein column. The nucleotide sequence of pMAT8 (SEQ ID NO: 6) was confirmed by standard DNA sequencing.
Example 5
Yeast Strain With a Chromosomal Integrant of Yeast Helper Vector pMAT7
[0111] Vector DNA of pMAT7 was first linearizd with restriction enzyme ApaI, then transformed into yeast S. cerevisiae strain YPH499 (Stratagene) using Frozen-EZ Yeast Transformation II Kit according to Zymo Research's instruction. Clones with pMAT7 integration was selected and grown on the CM glucose minus URA plate (Teknova). In order to test the surface expression of adapter 2 (GR2) fusion protein in yeast cells carrying pMAT7 vector, the Galactose induction experiment was performed. Briefly, cells from single colony were grown in 50 ml of YDP medium at 30° C. overnight (OD600=15˜20), thus transferred to 50 ml SG-CAA-minus URA medium (20g/L galactose, 6.7 g/L yeast nitrogen base, 5 g/L Casamino Acids/-URA, 10.19 g/L Na2HPO4.7H2O, 8.56 g/L NaH2PO4.H2O) for 48 hrs growth at 25° C. to 10˜15 OD600. The cells were harvested, washed with PBS, and incubated with mouse monoclonal anti-myc antibody (Upstate Biotechnologies) for 1 hr at room temperature. The PBS-washed cells were then probed with Goat anti-mouse-Alexa 488 (invitrogen) for 30 min in the dark. After PBS wash, fluorescent labeled GR2 fusion protein on the yeast cell surface was visualize under a Zeiss Axiovert 135 fluorescent microscope with Plan-Neofluar x40/0.75 Ph2 and X100 oil objective lens. FIG. 4A showed clear surface localization of myc-tagged GR2-Aga2 fusion in green fluorescence, demonstrating the functional cell surface expression of adapter2 fusion by yeast helper vector pMAT7.
Example 6
Yeast Strain With a Chromosomal Integrant of Yeast Helper Vector pMAT8
[0112] The procedure of generation yeast stain with chromosomal integration of pMAT8 vector was similar as described Example 5, except only 20 hours induction for adapter 2 fusion expression. Briefly, PMAT8 vector DNA was linearizd with restriction enzyme ApaI, and transformed into yeast S. cerevisiae strain YPH499 (Stratagene). Clones with pMAT8 integration was selected and grown on the CM glucose minus URA plate (Teknova). To test the surface expression of adapter 2 (GR2)-Cwp2 fusion protein in yeast cells carrying pMAT8 vector, the Galactose induction experiment was performed. Briefly, cells from single colony were grown in 50 ml of YDP medium at 30° C. overnight. (OD600=15˜20), thus transferred to 50 ml SG-CAA-minus URA medium (20 g/L galactose, 6.7 g/L yeast nitrogen base, 5 g/L Casamino Acids/-URA, 10.19 g/L Na2HPO4.7H2O, 8.56 g/L NaH2PO4.H2O) for 20 hrs growth at 25° C. to 10-15 00600. The cells were harvested, washed with PBS, and incubated with mouse monoclonal anti-myc antibody (Upstate Biotechnologies) for 1 hr at room temperature. The PBS-washed cells were then probed with Goat anti-mouse-Alexa 488 (invitrogen) for 30 min in the dark. After PBS wash, fluorescent labeled GR2 fusion protein on the yeast cell surface was visualize under a Zeiss Axiovert 135 fluorescent microscope with Plan-Neofluar X40/0.75 Ph2 and X100 oil objective lens. FIG. 4B showed clear surface localization of myc-tagged GR2-Cwp2 fusion in green fluorescence, demonstrating the functional cell surface expression of adapter2 fusion by yeast helper vector pMAT8. In comparison with pMAT7, vector pMAT8 produced higher density of adapter 2 fusion on yeast surface after 20 hours induction.
Example 7
Yeast Surface Display of scFv Antibody Proteins by Using Expression Vector pMAT9 or pMAT12, With Helper Vector pMAT8
[0113] The yeast strain with chromosomal integration of pMAT8 vector was used for yeast surface display of antibody. The expression vector pMAT9 or pMAT12 was transformed into YPH499-pMAT8 strain created from Example 6 according to the protocol of Frozen-EZ Yeast Transformation II Kit (Zymo Research). Cells from a single colony on CM glucose minus TRP & URA plate (Teknova) were grown in the 50 ml SD-CAA-minus TRP & URA medium (20 g/L Dextrose, 6.7 g/L yeast nitrogen base, 5 g/L Casamino Acids/-URA, 10.19 g/L Na2HPO4.7H2O, 8.56 g/L NaH2PO4.H2O) overnight at 30° C. (OD600=15-20), thus transferred to 50 ml SG-CAA-minus TRP & URA medium for 20 hrs growth at 25° C., to induce the expression of scFv-DH-GR1 fusion from expression vector and expression of GR2-Myc-GR2 from helper vector. The cells were harvested, washed with PBS, and incubated with mouse monoclonal anti-HA antibody (Santa Cruz Biotechologies) for 1 hr at room temperature. The PBS-washed cells were then probed with Goat anti-mouse-Alexa 488 (invitrogen) for 30 min in the dark. After PBS wash, fluorescent labeled scFv-DH-GRI fusion protein on the yeast cell surface was visualize under a Zeiss Axiovert 135 fluorescent microscope. FIG. 5A showed clear surface localization of HA-tagged scFv in green fluorescence, demonstrating the scFv display on yeast cell surface by using pMAT9 vector. In addition, the fluorescence associated with scFv displayed on cell surface was also measured in FACS Cailibur flow cytometry. The FACScan results in FIG. 5B showed a peak of ˜70-80% positive cells with high fluorescence intensity, and there was no significant difference on the display level between vector pMAT9 (with high copy replication ori) and pMAT12 (with low copy replication ori).
[0114] Furthermore, in order to show the interaction of adapter1 and adapter2 on yeast cell surface, the post-induction cells (using pMAT9 vector) were incubated with rat anti-HA antibodies (Roche) plus mouse anti-myc antibodies (Upstate Biotechnologies) for 60 min to probe both adapters. Cells were washed three times with PBS and incubated with Alexa 488 conjugated chicken anti-rat antibody plus Alex 594 conjugated goat anti-mouse antibody (Invitrogen) in PBS for 60 min. After three times washing with PBS, cells on slides were observation under a Zeiss Axiovert 200M microscope with Plan-Neofluar x40/0.75 Ph2 and X100 oil objective lens. The results in FIG. 5C showed clear surface localization of HA-tagged adapter 1 (GR1) in green fluorescence, myc-tagged adapter 2 (GR2) on surface in red fluorescence. The yellow fluorescence merged from green and red fluorescence indicated the co-localization of both adapters on the cell surface, confirmed the surface display mechanism through adapter interaction.
Example 8
Yeast Expression Vector pMAT19 for Surface Display of Fab Antibody
[0115] Vector pMAT19 (FIG. 6, SEQ ID NO:7) was derived from pMAT12 vector. It was created on the backbone of pMAT12 by insertion after pGAL1 promoter at Aatll and SacII sites with a fully synthetic DNA fragment (Codon Devices). This fully synthetic DNA fragment comprises (1) yeast endo-B-1,3-glucanase signal sequence-heavy chain of anti-IL13R Fab antibody-HA tag-GR1 adapter-ADH terminater; (2) an expression cassette for the light chain of anti-IL13R Fab, which is driven by a yeast pGAL1 promoter, and yeast endo-B-1,3-glucanase signal. The synthetic DNA sequence was confirmed by standard DNA sequencing method.
Example 9
Construction of MammalianEexpressionVector pMAG10
[0116] The vector pMAG10 is a mammalian expression vector to produce soluble adapter1 fusion in mammalian cells. The elements of the pMAG10 vector as depicted in the schematic representation provided in FIG. 7. The vector was built on the backbone of commercial vector pUC19 by insertion at EcoRI and PciI sites with a fully synthetic DNA fragment. This fully synthetic DNA comprises the following elements: (1) f1 ori for phage package; (2) an expression cassette, in which the expression of the adapter GR1 fusion with a scFv is driven by a CMV enhancer/Chicken β-actin promoter. The HA-His6 tag (DH-tag) sequences are upstream of GR1 sequence for protein detection and Ni-NTA purification. (3) an expression cassette for mammalian selection marker neomycin. The synthetic DNA was generated by fully gene synthesis using Codon Devices BioFab platform technology at Boston. The errors generated from gene synthesis were corrected by oligo selection with sequences complementary to the synthetic genes, and affinity purification of Mut-S protein column. The nucleotide sequence of pMAG10 (SEQ ID NO: 8) was confirmed by standard DNA sequencing method.
Example 10
Construction of Mammalian Helper Vector pMAG2
[0117] The mammalian helper vector pMAG2 (FIG. 8) was created on the backbone of commercial vector pUC 19 by insertion at EcoRI and PciI sites with a fully synthetic DNA fragment of 3134 bp. This fully synthetic DNA comprises sequences for two expression cassettes: (1) Zeocin expression cassette, with SV40 ori/promoter and SV40 polyA; (2) adapter GR2 fusion with transmembrane domain of human epidermal growth factor receptor (hEGFR), driven by a CMV promoter and terminated by BGH polyA. The secretory signal sequence for adapter GR2 fusion is from hEGFR. The gene synthesis was briefly described below. The synthetic DNA was divided into 4 pieces of segments of 808, 790, 829, and 817 by for gene synthesis by using BioFab platform technology (Codon Devices). The errors generated from oligo synthesis were corrected by oligo selection with sequences complementary to the synthetic genes, and affinity purification of Mut-S protein column. These DNA with tag sequences containing type II restriction sites were digested and ligated into full DNA fragment, which was then cloned into pUC19 vector. The nucleotide sequence of pMAG2 (SEQ ID NO:9) was confirmed by standard DNA sequencing method.
Example 11
Adapter-Directed Mammalian Cell Surface Display
[0118] COS 6 cells were grown on coverslips in 6-well plates with Dulbecco's modified Eagle's medium supplemented with 10% fetal bovine serum, 100 units/ml penicillin G, 100 μg/ml streptomycin, pAMG10 expression vector and mammalian helper vector pMAG2 were co-transfected into COS 6 cells using FuGene 6 transfection reagent (Roche Applied Science) according to the manufacturer's instructions. Briefly, 800 ng of plamid DNA (400 ng of pMAG10+400 ng of pMAG2) was added to diluted FuGENE 6 reagent at 3:2 ratio of FuGene 6 reagent (ul):DNA complex (ug) in serum-free medium. The FuGENE reagent DNA complex was incubated for 15 min at room temperature and then added to the cells. After 48 hr of transfection, HA tagged scFv-GR1 fusion protein (from pMAG10 vector) and myc tagged GR2-EGFR-TM displayed on the cell surface were detected with anti-HA and anti-Myc antibody, then labeled with Alexa 488 and Alex 594.
[0119] Briefly, COS 6 cells were fixed with 4% formaldehyde for 20 min, blocked with 5% BSA in PBS for 30 min at 25° C. and then incubated with rat anti-HA antibodies (Roche) plus mouse anti-myc antibodies (Upstate Biotechnologies) for 60 min to probe both adapters. Cells were washed three times with PBS and incubated with Alexa 488 conjugated chicken anti-rat antibody plus Alex 594 conjugated goat anti-mouse antibody (Invitrogen) in PBS for 60 min. After three times washing with PBS, cells on slides were observation under a Zeiss Axiovert 200M microscope with Plan-Neofluar x 40/0.75 Ph2 and X100 oil objective lens. Panel 9 presents photomicrographs that illustrate the surface expression and co-localization of the fusion proteins on the surface of the host cells. As a negative control, wild-type COS 6 cells that were not transfected with either a display vector or a helper vector of the invention were stained with the same fluorochromes. No cell surface fluorescence was detected on any of the negative control samples.
[0120] The photomicrographs presented in panel (a) of FIG. 9 demonstrates the plasma membrane localization of HA-tagged adapter1 (GR1) fusion proteins as green fluorescence. Panel (b) of FIG. 9 demonstrates the plasma membrane localization of myc-tagged adapter2 (GR2) fusion protein which was detected as red fluorescence on the surface of the COS 6 cells. The cells in panel (c) of FIG. 9, which are devoid of fluorescence were stained with nuclear stain. Panel (d) of FIG. 9 demonstrates the co-localization of the two fusion proteins on the plasma membrane of the COS 6 cells. Detection of the co-localization of the two adapter tagged fusion proteins can be detected by the presence of cells displaying a third color fluorescent signal (using the fluorochromes described in this example the third fluorescent signal will be a yellowish-orange color). For example, the co-localization signal depicted in panel (d) results from the merger of the two other fluorescent signals used to detect the surface expression of each of the other fusion proteins.
REFERENCES
[0121] 1) George Smith, Science (1985) 228: 1315-1317 [0122] 2) U.S. Pat. No. 5,969,108 [0123] 3) U.S. Pat. No. 5,837,500 [0124] 4) Crameri, et al. (1993) Gene 137:69 75 [0125] 5) WO 01/05950, U.S. Pat. No. 6,753,136, (Cis-display) [0126] 6) U.S. Pat. No. 7,175,983, (Adapter-directed display system) [0127] 7) Chang H H, Lo S J (2000) Modification with a phosphorylation tag of PKA in the TraT-based display vector of Escherichia coli. J Biotechnol 78:115-122 [0128] 8) Lee S H, Choi J I, Park S J, Lee S Y, Park B C (2004) Display of bacterial lipase on the Escherichia coli cell surface by using FadL as an anchoring motif and use of the enzyme in enantioselective biocatalysis. Appl Environ Microbiol 70:5074-5080 [0129] 9) Westerlund-Wikstrom B et al (1997) Functional expression of adhesive peptides as fusions to Escherichia coli flagellin. Protein Eng 10:1319-1326 [0130] 10) Georgiou G, Stephens D L, Stathopoulos C, Poetschke H L, Mendenhall J, Earhart C F (1996) Display of β-lactamase on the Escherichia coli surface: outer membrane phenotypes conferred by Lpp'-OmpA'-β-lactamase fusions. Protein Eng 9: 239-247 [0131] 11) Jung H C et al (1998) Surface display of Zymomonas mobilis levansucrase by using the ice-nucleation protein of Pseudomonas syringae. Nat Biotechnol 16:576-580 [0132] 11) Veiga E et al (2003) Autotransporters as scaffolds for novel bacterial adhesins: surface properties of Escherichia coli cells displaying Jun/Fos dimerization domains. J Bacterial 185:5585-5590 [0133] 12) Schreuder M P, Brekelmans S, Van den Ende H, Klis F M (1993) Targeting of a heterologous protein to the cell wall of Saccharomyces cerevisiae. Yeast 9:399-409 [0134] 13) U.S. Pat. Nos. 6,300,065, 6,423,538, 6,696,251, and 6,699,658 [0135] 14) Georgiou G et al (1997) Display of heterologous proteins on the surface of microorganisms: from the screening of combinatorial libraries to live recombinant vaccines. Nat Biotechnol 15:29-34 [0136] 15) Ashiuchi M, Misono H (2002) Biochemistry and molecular genetics of poly-γ-glutamate synthesis. Appl Microbiol Biotechnol 59:9-14 [0137] 16) Ashiuchi M, Nawa C, Kamei T, Song J J, Hong S P, Sung M H, Soda. K, Yagi T, Misono H (2001) Physiological and biochemical characteristics of poly γ-glutamate synthetase complex of Bacillus subtilis. Eur J Biochem 268:5321-5328 [0138] 17) Benhar I (2001) Biotechnological applications of phage and cell display. Biotechnol Adv 19:1-33 [0139] 18) Dubois M, Gilles K A, Hamilton J K, Rebers P A, Smith F (1956) Colorimetric method for determination of sugars and related substances. Anal Chem 28:350-356 [0140] 19) Georgiou G, Stathopoulos C, Daugherty P S, Nayak A R, Iverson B L, Curtiss R III (1997) Display of heterologous proteins on the surface of microorganisms: from the screening of combinatorial libraries to live recombinant vaccines. Nat Biotechnol 15:29-34 [0141] 20) Jose J, von Schwichow S (2004) Autodisplay of active sorbitol dehydrogenase (SDH) yields a whole cell biocatalyst for the synthesis of rare sugars. Chembiochem 5:491-499 [0142] 21) Jose J, Bernhardt R, Hannemann F (2002) Cellular surface display of dimeric Adx and whole cell P450-mediated steroid synthesis on E. coli. J Biotechnol 95:257-268 [0143] 22) Jung H C, Lebeault J M, Pan J G (1998) Surface display of Zymomonas mobilis levansucrase by using the ice-nucleation protein of Pseudomonas syringae. Nat Biotechnol 16:576-580 [0144] 23) Kaieda M, Nagayoshi M, Hama S, Kondo A, Fukuda H (2004) Enantioselective transesterification using immobilized Aspergillus oryzae overexpressing lipase. Appl Microbiol Biotechnol 65:301-305 [0145] 24) Kano K, Negi S, Kawashima A, Nakamura K (1997) Optical resolution of 1-arylethanols using transesterification catalyzed by lipases. Enantiomer 2:261-266 [0146] 24) Lee S Y, Choi J H, Xu Z (2003) Microbial cell-surface display. Trends Biotechnol 21:45-52 [0147] 25) Lee S H, Choi J I, Park S J, Lee S Y, Park B C (2004) Display of bacterial lipase on the Escherichia coli cell surface by using FadL as an anchoring motif and use of the enzyme in enantioselective biocatalysis. Appl Environ Microbial 70:5074-5080 [0148] 26) Lee S H, Choi J I, Han M J, Choi J H, Lee S Y (2005) Display of lipase on the cell surface of Escherichia coli using OprF as an anchor and its application to enantioselective resolution in organic solvent. Biotechnol Bioeng 90:223-230 [0149] 27) Matsumoto T, Ito M, Fukuda H, Kondo A (2004) Enantioselective transesterification using lipase-displaying yeast whole-cell biocatalyst. Appl Microbiol Biotechnol 64:481-485 [0150] 28) Narita J, Nakahara S, Fukuda H, Kondo A (2004) Efficient production of L-(+)-lactic acid from raw starch by Streptococcus bovis 148. J Biosci Bioeng 97:423-425 [0151] 28) Poo H, Song J J, Hong S P, Choi Y H, Yun S W, Kim J H, Lee S C, Lee S G, Sung M H (2002) Novel high-level constitutive expression system, pHCE vector, for a convenient and cost-effective soluble production of human tumor necrosis factor-α. Biotechnol Lett 24:1185-1189 [0152] 29) Richins R D, Kaneva I, Mulchandani A, Chen W (1997) Biodegradation of organophosphorus pesticides by surface-expressed organophosphorus hydrolase. Nat Biotechnol 15:984-987 [0153] 30) Robyt J F, Whelan W J (1972) Reducing value methods for maltodextrins. I. Chain-length dependence of alkaline 3,5-dinitrosalicylate and chain-length independence of alkaline copper. Anal Biochem 45:510-516 [0154] 31) Satoh E, Niimura Y, Uchimura T, Kozaki M, Komagata K (1993) Molecular cloning and expression of two α-amylase genes from Streptococcus bovis 148 in Escherichia coli. Appl Environ Microbiol 59:3669-3673 [0155] 32) Shigechi H, Koh J, Fujita Y, Matsumoto T, Bito Y, Ueda M, Satoh E, Fukuda H, Kondo A (2004) Direct production of ethanol from raw corn starch via fermentation by use of a novel surface-engineered yeast strain codisplaying glucoamylase and α-amylase. Appl Environ Microbial 70:5037-5040 [0156] 33) Sung M H, Hong S P, Lee J S, Jung C M, Kim C J, Soda K, Ashiuchi M (2003) Surface expression vectors having pgsBCA, the gene coding poly-gamma-glutamate synthetase, and a method for expression of target protein at the surface of microorganism using the vector. International Patent WO 03/014360 [0157] 34) Uppenberg J, Hansen M T, Patkar S, Jones T A (1994) The sequence, crystal structure determination and refinement of two crystal forms of lipase B from Candida antarctica. Structure 2:293-308 [0158] 35) Veiga E, de Lorenzo V, Fernandez L A (2003) Autotransporters as scaffolds for novel bacterial adhesins: surface properties of Escherichia coli cells displaying Jun/Fos dimerization domains. J Bacteriol 185:5585-5590 [0159] 36) von Heijne G (1986) A new method for predicting signal sequence cleavage sites. Nucleic Acids Res 14:4683-4690 [0160] 37) Wan H M, Chang B Y, Lin S C (2002) Anchorage of cyclodextrin glucanotransferase on the outer membrane of Escherichia coli. Biotechnol Bioeng 79:457-464
Sequence CWU
1
111114DNAArtificial SequenceGABAB Receptor 1 1gaggagaagt cccggctgtt
ggagaaggag aaccgtgaac tggaaaagat cattgctgag 60aaagaggagc gtgtctctga
actgcgccat caactccagt ctgtaggagg ttgt 1142114DNAArtificial
SequenceGABAB 5 Receptor 2 2acatcccgct tggaaggttt gcaatctgaa aaccacagat
tgagaatgaa gattactgaa 60ttggacaagg acttggaaga agttactatg caattgcaag
acgttggtgg ttgt 11431005DNAArtificial SequenceAnti-VEGF
scFv-GABAB Receptor 1 3atgcggttta gtacgacact ggcgacagca gcaacagcac
ttttcttcac agcaagtcag 60gtaagcgcga gctccgaggt gcagctggtg cagagcggcg
gcggcgtggt gcagccgggc 120ggcagcctgc gtctgagctg cgccgcgagc ggctacacct
tcaccaacta cggcatgaac 180tggattcgtc aggcccccgg gaagggcctg gagtgggtgg
gctggatcaa cacctacacc 240ggcgagccga cctacgcagc tgacttcaag cgtcgtgtca
ccttcagcct cgacaccagc 300aagagcacgg cgtacctgca actgaacagc ctgagggccg
aggacactgc agtttactac 360tgcgcgaaat acccgtacta ctacggtcgt agccactggt
acttcgacgt ctggggccaa 420gggacccttg tcaccgtctc gagcggcggt ggcggttctg
gtggtggtgg ctctggtggc 480ggcggatccg atatcgtgat gacccagagc ccgagcaccc
tgagcgcgag tccgggtgag 540cgcgcgacca tcacctgcag tgcgagccag agcatcagca
cctacctggc gtggtatcag 600cagaaaccag gtcaagcgcc gcaagtgctg atctacgctg
cgagcaacct ggcgtccgga 660gtgccgaacc gtttcagcgg tagccgtagc gggaccgatt
tcaccctgac catcagcagc 720ttgcagccgg aagacttcgc ggtgtactac tgccagcagt
actacagcac cccgtggacc 780ttcggtggtg gtaccaaagt ggaaatcaaa gcggccgctt
atccatacga cgtaccagac 840tacgcaggag gtcatcacca tcatcaccat gtcgacggat
ctggaggagg tgaggagaag 900tcccggctgt tggagaagga gaaccgtgaa ctggaaaaga
tcattgctga gaaagaggag 960cgtgtctctg aactgcgcca tcaactccag tctgtaggag
gttgt 100545602DNAArtificial Sequenceconstruction yeast
expression vector pMAT12 4gacgtcaatt cgcgttaaat ttttgttaaa tcagctcatt
ttttaaccaa taggccgaaa 60tccccaaaat cccttataaa tcaaaagaat agaccgagat
agggttgagt gttgttccag 120tttggaacaa gagtccacta ttaaagaacg tggactccaa
cgtcaaaggg cgaaaaaccg 180tctatcaggg cgatggccca ctacgtgaac catcacccta
atcaagtttt ttggggtcga 240ggtgccgtaa agcactaaat cggaacccta aagggatgcc
ccgatttaga gcttgacggg 300gaaagccggc gaacgtggcg agaaaggaag ggaagaaagc
gaaaggagcg ggcgctaggg 360cgctggcaag tgtagcggtc acgctgcgcg taaccaccac
acccgccgcg cttaatgcgc 420cgctacaggg cgcgtttaat taaacggatt agaagccgcc
gagcgggtga cagccctccg 480aaggaagact ctcctccgtg cgtcctcgtc ttcaccggtc
gcgttcctga aacgcagatg 540tgcctcgcgc cgcactgctc cgaacaataa agattctaca
atactagctt ttatggttat 600gaagaggaaa aattggcagt aacctggccc cacaaacctt
caaatgaacg aatcaaatta 660acaaccatag gatgataatg cgattagttt tttagcctta
tttctggggt aattaatcag 720cgaagcgatg atttttgatc tattaacaga tatataaatg
caaaaactgc ataaccactt 780taactaatac tttcaacatt ttcggtttgt attacttctt
attcaaatgt aataaaagta 840tcaacaaaaa attgttaata tacctctata ctttaacgtc
aaggagaaaa aaccccggat 900cggactacta gcctaggtaa acatgcggtt tagtacgaca
ctggcgacag cagcaacagc 960acttttcttc acagcaagtc aggtaagcgc gagctccgag
gtgcagctgg tgcagagcgg 1020cggcggcgtg gtgcagccgg gcggcagcct gcgtctgagc
tgcgccgcga gcggctacac 1080cttcaccaac tacggcatga actggattcg tcaggccccc
gggaagggcc tggagtgggt 1140gggctggatc aacacctaca ccggcgagcc gacctacgca
gctgacttca agcgtcgtgt 1200caccttcagc ctcgacacca gcaagagcac ggcgtacctg
caactgaaca gcctgagggc 1260cgaggacact gcagtttact actgcgcgaa atacccgtac
tactacggtc gtagccactg 1320gtacttcgac gtctggggcc aagggaccct tgtcaccgtc
tcgagcggcg gtggcggttc 1380tggtggtggt ggctctggtg gcggcggatc cgatatcgtg
atgacccaga gcccgagcac 1440cctgagcgcg agtccgggtg agcgcgcgac catcacctgc
agtgcgagcc agagcatcag 1500cacctacctg gcgtggtatc agcagaaacc aggtcaagcg
ccgcaagtgc tgatctacgc 1560tgcgagcaac ctggcgtccg gagtgccgaa ccgtttcagc
ggtagccgta gcgggaccga 1620tttcaccctg accatcagca gcttgcagcc ggaagacttc
gcggtgtact actgccagca 1680gtactacagc accccgtgga ccttcggtgg tggtaccaaa
gtggaaatca aagcggccgc 1740ttatccatac gacgtaccag actacgcagg aggtcatcac
catcatcacc atgtcgacgg 1800atctggagga ggtgaggaga agtcccggct gttggagaag
gagaaccgtg aactggaaaa 1860gatcattgct gagaaagagg agcgtgtctc tgaactgcgc
catcaactcc agtctgtagg 1920aggttgttaa taagtcgact aatgaccgcg gatcatgtaa
ttagttatgt cacgcttaca 1980ttcacgccct ccccccacat ccgctctaac cgaaaaggaa
ggagttagac aacctgaagt 2040ctaggtccct atttattttt ttatagttat gttagtatta
agaacgttat ttatatttca 2100aatttttctt ttttttctgt acagacgcgt gtacgcatgt
aacattatac tgaaaacctt 2160gcttgagaag gttttgggac gctcgaaggc tttaatttgc
aagctgcgcg cgggtccttt 2220tcatcacgtg ctataaaaat aattataatt taaatttttt
aatataaata tataaattaa 2280aaatagaaag taaaaaaaga aattaaagaa aaaatagttt
ttgttttccg aagatgtaaa 2340agactctagg gggatcgcca acaaatacta ccttttatct
tgctcttcct gctctcaggt 2400attaatgccg aattgtttca tcttgtctgt gtagaagacc
acacacgaaa atcctgtgat 2460tttacatttt acttatcgtt aatcgaatgt atatctattt
aatctgcttt tcttgtctaa 2520taaatatata tgtaaagtac gctttttgtt gaaatttttt
aaacctttgt ttattttttt 2580ttcttcattc cgtaactctt ctaccttctt tatttacttt
ctaaaatcca aatacaaaac 2640ataaaaataa ataaacacag agtaaattcc caaattattc
catcattaaa agatacgagg 2700cgcgtgtaag ttacaggcaa gcgatccgtc ctaagaaacc
attattatca tgacattaac 2760ctataaaaat aggcgtatca cgaggccctt tcgtcttcaa
gaaattcggt cgaaaaaaga 2820aaaggagagg gccaagaggg agggcattgg tgactattga
gcacgtgagt atacgtgatt 2880aagcacacaa aggcagcttg gagtatgtct gttattaatt
tcacaggtag ttctggtcca 2940ttggtgaaag tttgcggctt gcagagcaca gaggccgcag
aatgtgctct agattccgat 3000gctgacttgc tgggtattat atgtgtgccc aatagaaaga
gaacaattga cccggttatt 3060gcaaggaaaa tttcaagtct tgtaaaagca tataaaaata
gttcaggcac tccgaaatac 3120ttggttggcg tgtttcgtaa tcaacctaag gaggatgttt
tggctctggt caatgattac 3180ggcattgata tcgtccaact gcacggagat gagtcgtggc
aagaatacca agagttcctc 3240ggtttgccag ttattaaaag actcgtattt ccaaaagact
gcaacatact actcagtgca 3300gcttcacaga aacctcattc gtttattccc ttgtttgatt
cagaagcagg tgggacaggt 3360gaacttttgg attggaactc gatttctgac tgggttggaa
ggcaagagag ccccgagagc 3420ttacatttta tgttagctgg tggactgacg ccagaaaatg
ttggtgatgc gcttagatta 3480aatggcgtta ttggtgttga tgtaagcgga ggtgtggaga
caaatggtgt aaaagactct 3540aacaaaatag caaatttcgt caaaaatgct aagaaatagg
ttattactga gtagtattta 3600tttaagtatt gtttgtgcac ttgccccgaa tttcttatga
tttatgattt ttattattaa 3660ataagttata aaaaaaataa gtgtatacaa attttaaagt
gactcttagg ttttaaaacg 3720aaaattctta ttcttgagta actctttcct gtaggtcagg
ttgctttctc aggtatagca 3780tgaggtcgct cacatgtgag caaaaggcca gcaaaaggcc
aggaaccgta aaaaggccgc 3840gttgctggcg tttttccata ggctccgccc ccctgacgag
catcacaaaa atcgacgctc 3900aagtcagagg tggcgaaacc cgacaggact ataaagatac
caggcgtttc cccctggaag 3960ctccctcgtg cgctctcctg ttccgaccct gccgcttacc
ggatacctgt ccgcctttct 4020cccttcggga agcgtggcgc tttctcatag ctcacgctgt
aggtatctca gttcggtgta 4080ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc
gttcagcccg accgctgcgc 4140cttatccggt aactatcgtc ttgagtccaa cccggtaaga
cacgacttat cgccactggc 4200agcagccact ggtaacagga ttagcagagc gaggtatgta
ggcggtgcta cagagttctt 4260gaagtggtgg cctaactacg gctacactag aaggacagta
tttggtatct gcgctctgct 4320gaagccagtt accttcggaa aaagagttgg tagctcttga
tccggcaaac aaaccaccgc 4380tggtagcggt ggtttttttg tttgcaagca gcagattacg
cgcagaaaaa aaggatctca 4440agaagatcct ttgatctttt ctacggggtc tgacgctcag
tggaacgaaa actcacgtta 4500agggattttg gtcatgagat tatcaaaaag gatcttcacc
tagatccttt taaattaaaa 4560atgaagtttt aaatcaatct aaagtatata tgagtaaact
tggtctgaca gttaccaatg 4620cttaatcagt gaggcaccta tctcagcgat ctgtctattt
cgttcatcca tagttgcctg 4680actccccgtc gtgtagataa ctacgatacg ggagggctta
ccatctggcc ccagtgctgc 4740aatgataccg cgagacccac gctcaccggc tccagattta
tcagcaataa accagccagc 4800cggaagggcc gagcgcagaa gtggtcctgc aactttatcc
gcctccatcc agtctattaa 4860ttgttgccgg gaagctagag taagtagttc gccagttaat
agtttgcgca acgttgttgc 4920cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt
atggcttcat tcagctccgg 4980ttcccaacga tcaaggcgag ttacatgatc ccccatgttg
tgcaaaaaag cggttagctc 5040cttcggtcct ccgatcgttg tcagaagtaa gttggccgca
gtgttatcac tcatggttat 5100ggcagcactg cataattctc ttactgtcat gccatccgta
agatgctttt ctgtgactgg 5160tgagtactca accaagtcat tctgagaata gtgtatgcgg
cgaccgagtt gctcttgccc 5220ggcgtcaata cgggataata ccgcgccaca tagcagaact
ttaaaagtgc tcatcattgg 5280aaaacgttct tcggggcgaa aactctcaag gatcttaccg
ctgttgagat ccagttcgat 5340gtaacccact cgtgcaccca actgatcttc agcatctttt
actttcacca gcgtttctgg 5400gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga
ataagggcga cacggaaatg 5460ttgaatactc atactcttcc tttttcaata ttattgaagc
atttatcagg gttattgtct 5520catgagcgga tacatatttg aatgtattta gaaaaataaa
caaatagggg ttccgcgcac 5580atttccccga aaagtgccac ct
560255749DNAArtificial Sequenceyeast helper vector
pMAT7 5gacgtccact caaccctatc tcggtctatt cttttgattt ataagggatt ttgccgattt
60cggcctattg gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa
120tattaacgtt tacaatttcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc
180acaccgcata gggtaataac tgatataatt aaattgaagc tctaatttgt gagtttagta
240tacatgcatt tacttataat acagtttttt agttttgctg gccgcatctt ctcaaatatg
300cttcccagcc tgcttttctg taacgttcac cctctacctt agcatccctt ccctttgcaa
360atagtcctct tccaacaata ataatgtcag atcctgtaga gaccacatca tccacggttc
420tatactgttg acccaatgcg tctcccttgt catctaaacc cacaccgggt gtcataatca
480accaatcgta accttcatct cttccaccca tgtctctttg agcaataaag ccgataacaa
540aatctttgtc gctcttcgca atgtcaacag tacccttagt atattctcca gtagataggg
600agcccttgca tgacaattct gctaacatca aaaggcctct aggttccttt gttacttctt
660ctgccgcctg cttcaaaccg ctaacaatac ctgggcccac cacaccgtgt gcattcgtaa
720tgtctgccca ttctgctatt ctgtatacac ccgcagagta ctgcaatttg actgtattac
780caatgtcagc aaattttctg tcttcgaaga gtaaaaaatt gtacttggcg gataatgcct
840ttagcggctt aactgtgccc tccatggaaa aatcagtcaa gatatccaca tgagttttta
900gtaaacaaat tttgggacct aatgcttcaa ctaactccag taattccttg gtggtacgaa
960catccaatga agcacacaag tttgtttgct tttcgtgcat gatattaaat agcttggcag
1020caacaggact aggatgagta gcagcacgtt ccttatatgt agctttcgac atgatttatc
1080ttcgtttcct gcaggttttt gttctgtgca gttgggttaa gaatactggg caatttcatg
1140tttcttcaac actacatatg cgtatatata ccaatctaag tctgtgctcc ttccttcgtt
1200cttccttctg ttcggagatt accgaatcaa aaaaatttca aagaaaccga aatcaaaaaa
1260aagaataaaa aaaaaatgat gaattgaatt gaaaagctgt ggtatggtgc actctcagta
1320caatctgctc tgatgccgca tagttaagcc agccccgaca cccgccaaca cccgctgacg
1380cgccctgacg ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg
1440ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa acgcgcgagg ctagccccac
1500acaccatagc ttcaaaatgt ttctactcct tttttactct tccagatttt ctcggactcc
1560gcgcatcgcc gtaccacttc aaaacaccca agcacagcat actaaatttt ccctctttct
1620tcctctaggg tgtcgttaat tacccgtact aaaggtttgg aaaagaaaaa agagaccgcc
1680tcgtttcttt ttcttcgtcg aaaaaggcaa taaaaatttt tatcacgttt ctttttcttg
1740aaattttttt ttttagtttt tttctctttc agtgacctcc attgatattt aagttaataa
1800acggtcttca atttctcaag tttcagtttc atttttcttg ttctattaca acttttttta
1860cttcttgttc attagaaaga aagcatagca atctaatcta aggggcggtg ttgacaatta
1920atcatcggca tagtatatcg gcatagtata atacgacaag gtgaggaact aaaccatggc
1980caagttgacc agtgccgttc cggtgctcac cgcgcgcgat gtcgccggag cggtcgagtt
2040ctggaccgac cggctcgggt tctcccggga cttcgtggag gacgacttcg ccggtgtggt
2100ccgggacgac gtgaccctgt tcatcagcgc ggtccaggac caggtggtgc cggacaacac
2160cctggcctgg gtgtgggtgc gcggcctgga cgagctgtac gccgagtggt cggaggtcgt
2220gtccacgaac ttccgggacg cctccgggcc ggccatgacc gagatcggcg agcagccgtg
2280ggggcgggag ttcgccctgc gcgacccggc cggcaactgc gtgcacttcg tggccgagga
2340gcaggactga cacgtccgac ggcggcccac gggtcccagg cctcggagat ccgtccccct
2400tttcctttgt cgatatcatg taattagtta tgtcacgctt acattcacgc cctcccccca
2460catccgctct aaccgaaaag gaaggagtta gacaacctga agtctaggtc cctatttatt
2520tttttatagt tatgttagta ttaagaacgt tatttatatt tcaaattttt cttttttttc
2580tgtacagacg cgtgtacgca tgtaacatta tactgaaaac cttgcttgag aaggttttgg
2640gacgctcgaa ggctttaatt tgcaagctga attcacggat tagaagccgc cgagcgggtg
2700acagccctcc gaaggaagac tctcctccgt gcgtcctcgt cttcaccggt cgcgttcctg
2760aaacgcagat gtgcctcgcg ccgcactgct ccgaacaata aagattctac aatactagct
2820tttatggtta tgaagaggaa aaattggcag taacctggcc ccacaaacct tcaaatgaac
2880gaatcaaatt aacaaccata ggatgataat gcgattagtt ttttagcctt atttctgggg
2940taattaatca gcgaagcgat gatttttgat ctattaacag atatataaat gcaaaaactg
3000cataaccact ttaactaata ctttcaacat tttcggtttg tattacttct tattcaaatg
3060taataaaagt atcaacaaaa aattgttaat atacctctat actttaacgt caaggagaaa
3120aaaccccgga tcggactact agcagctgta atacgactca ctatagggaa tattaagcta
3180attctacttc atacattttc aattaagttt aaaccatgac aatgcctcat cgctatatgt
3240ttttggcagt ctttacactt ctggcactaa ctagtgtggc ctcaggagcc acttctagat
3300tggaaggttt gcaatctgaa aaccacagat tgagaatgaa gattactgaa ttggacaagg
3360acttggaaga agttactatg caattgcaag acgttggtgg ttgtgcggcc gctgaacaaa
3420agttgatttc tgaagaagac ttgagctccg gtggtggttc tggtggtggt tccggttctg
3480gtggtggtgg ttccggtggt ggttccggat cccaggaact gacaactata tgcgagcaaa
3540tcccctcacc aactttagaa tcgacgccgt actctttgtc aacgactact attttggcca
3600acgggaaggc aatgcaagga gtttttgaat attacaaatc agtaacgttt gtcagtaatt
3660gcggttctca cccctcaaca actagcaaag gcagccccat aaacacacag tatgtttttt
3720aagcttgtta ttactgagta gtatttattt aagtattgtt tgtgcacttg ccccgaattt
3780cttatgattt atgattttta ttattaaata agttataaaa aaaataagtg tatacaaatt
3840ttaaagtgac tcttaggttt taaaacgaaa attcttattc ttgagtaact ctttcctgta
3900ggtcaggttg ctttctcagg tatagcatga ggtcgctcac atgtgagcaa aaggccagca
3960aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc
4020tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata
4080aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc
4140gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc
4200acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga
4260accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc
4320ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag
4380gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag
4440gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag
4500ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca
4560gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga
4620cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat
4680cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga
4740gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg
4800tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga
4860gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc
4920agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac
4980tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc
5040agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc
5100gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc
5160catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt
5220ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc
5280atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg
5340tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag
5400cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat
5460cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc
5520atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa
5580aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta
5640ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa
5700aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacct
574966793DNAArtificial Sequenceyeast helper vector pMAT8 6gacgtccact
caaccctatc tcggtctatt cttttgattt ataagggatt ttgccgattt 60cggcctattg
gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa 120tattaacgtt
tacaatttcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc 180acaccgcata
gggtaataac tgatataatt aaattgaagc tctaatttgt gagtttagta 240tacatgcatt
tacttataat acagtttttt agttttgctg gccgcatctt ctcaaatatg 300cttcccagcc
tgcttttctg taacgttcac cctctacctt agcatccctt ccctttgcaa 360atagtcctct
tccaacaata ataatgtcag atcctgtaga gaccacatca tccacggttc 420tatactgttg
acccaatgcg tctcccttgt catctaaacc cacaccgggt gtcataatca 480accaatcgta
accttcatct cttccaccca tgtctctttg agcaataaag ccgataacaa 540aatctttgtc
gctcttcgca atgtcaacag tacccttagt atattctcca gtagataggg 600agcccttgca
tgacaattct gctaacatca aaaggcctct aggttccttt gttacttctt 660ctgccgcctg
cttcaaaccg ctaacaatac ctgggcccac cacaccgtgt gcattcgtaa 720tgtctgccca
ttctgctatt ctgtatacac ccgcagagta ctgcaatttg actgtattac 780caatgtcagc
aaattttctg tcttcgaaga gtaaaaaatt gtacttggcg gataatgcct 840ttagcggctt
aactgtgccc tccatggaaa aatcagtcaa gatatccaca tgagttttta 900gtaaacaaat
tttgggacct aatgcttcaa ctaactccag taattccttg gtggtacgaa 960catccaatga
agcacacaag tttgtttgct tttcgtgcat gatattaaat agcttggcag 1020caacaggact
aggatgagta gcagcacgtt ccttatatgt agctttcgac atgatttatc 1080ttcgtttcct
gcaggttttt gttctgtgca gttgggttaa gaatactggg caatttcatg 1140tttcttcaac
actacatatg cgtatatata ccaatctaag tctgtgctcc ttccttcgtt 1200cttccttctg
ttcggagatt accgaatcaa aaaaatttca aagaaaccga aatcaaaaaa 1260aagaataaaa
aaaaaatgat gaattgaatt gaaaagctgt ggtatggtgc actctcagta 1320caatctgctc
tgatgccgca tagttaagcc agccccgaca cccgccaaca cccgctgacg 1380cgccctgacg
ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg 1440ggagctgcat
gtgtcagagg ttttcaccgt catcaccgaa acgcgcgagg ctagccccac 1500acaccatagc
ttcaaaatgt ttctactcct tttttactct tccagatttt ctcggactcc 1560gcgcatcgcc
gtaccacttc aaaacaccca agcacagcat actaaatttt ccctctttct 1620tcctctaggg
tgtcgttaat tacccgtact aaaggtttgg aaaagaaaaa agagaccgcc 1680tcgtttcttt
ttcttcgtcg aaaaaggcaa taaaaatttt tatcacgttt ctttttcttg 1740aaattttttt
ttttagtttt tttctctttc agtgacctcc attgatattt aagttaataa 1800acggtcttca
atttctcaag tttcagtttc atttttcttg ttctattaca acttttttta 1860cttcttgttc
attagaaaga aagcatagca atctaatcta aggggcggtg ttgacaatta 1920atcatcggca
tagtatatcg gcatagtata atacgacaag gtgaggaact aaaccatggc 1980caagttgacc
agtgccgttc cggtgctcac cgcgcgcgat gtcgccggag cggtcgagtt 2040ctggaccgac
cggctcgggt tctcccggga cttcgtggag gacgacttcg ccggtgtggt 2100ccgggacgac
gtgaccctgt tcatcagcgc ggtccaggac caggtggtgc cggacaacac 2160cctggcctgg
gtgtgggtgc gcggcctgga cgagctgtac gccgagtggt cggaggtcgt 2220gtccacgaac
ttccgggacg cctccgggcc ggccatgacc gagatcggcg agcagccgtg 2280ggggcgggag
ttcgccctgc gcgacccggc cggcaactgc gtgcacttcg tggccgagga 2340gcaggactga
cacgtccgac ggcggcccac gggtcccagg cctcggagat ccgtccccct 2400tttcctttgt
cgatatcatg taattagtta tgtcacgctt acattcacgc cctcccccca 2460catccgctct
aaccgaaaag gaaggagtta gacaacctga agtctaggtc cctatttatt 2520tttttatagt
tatgttagta ttaagaacgt tatttatatt tcaaattttt cttttttttc 2580tgtacagacg
cgtgtacgca tgtaacatta tactgaaaac cttgcttgag aaggttttgg 2640gacgctcgaa
ggctttaatt tgcaagctga attcacggat tagaagccgc cgagcgggtg 2700acagccctcc
gaaggaagac tctcctccgt gcgtcctcgt cttcaccggt cgcgttcctg 2760aaacgcagat
gtgcctcgcg ccgcactgct ccgaacaata aagattctac aatactagct 2820tttatggtta
tgaagaggaa aaattggcag taacctggcc ccacaaacct tcaaatgaac 2880gaatcaaatt
aacaaccata ggatgataat gcgattagtt ttttagcctt atttctgggg 2940taattaatca
gcgaagcgat gatttttgat ctattaacag atatataaat gcaaaaactg 3000cataaccact
ttaactaata ctttcaacat tttcggtttg tattacttct tattcaaatg 3060taataaaagt
atcaacaaaa aattgttaat atacctctat actttaacgt caaggagaaa 3120aaaccccgga
tcggactact agcagctgta atacgactca ctatagggaa tattaagcta 3180attctacttc
atacattttc aattaagttt aaaccatgac aatgcctcat cgctatatgt 3240ttttggcagt
ctttacactt ctggcactaa ctagtgtggc ctcaggagcc acttctagat 3300tggaaggttt
gcaatctgaa aaccacagat tgagaatgaa gattactgaa ttggacaagg 3360acttggaaga
agttactatg caattgcaag acgttggtgg ttgtgcggcc gctgaacaaa 3420agttgatttc
tgaagaagac ttgagctccg gtggtggttc tggtggtggt tccggttctg 3480gtggtggtgg
ttccggtggt ggttccggat cctcaagttt gtcatcatca tcttcaggac 3540aaatcaccag
ctctatcacg tcttcgcgtc caattattac cccattctat cctagcaatg 3600gaacttctgt
gatttcttcc tcagtaattt cttcctcagt cacttcttct ctattcactt 3660cttctccagt
catttcttcc tcagtcattt cttcttctac aacaacctcc acttctatat 3720tttctgaatc
atctaaatca tccgtcattc caaccagtag ttccacctct ggttcttctg 3780agagcgaaac
gagttcagct ggttctgtct cttcttcctc ttttatctct tctgaatcat 3840caaaatctcc
tacatattct tcttcatcat taccacttgt taccagtgcg acaacaagcc 3900aggaaactgc
ttcttcatta ccacctgcta ccactacaaa aacgagcgaa caaaccactt 3960tggttaccgt
gacatcctgc gagtctcatg tgtgcactga atccatctcc cctgcgattg 4020tttccacagc
tactgttact gttagcggcg tcacaacaga gtataccaca tggtgcccta 4080tttctactac
agagacaaca aagcaaacca aagggacaac agagcaaacc acagaaacaa 4140caaaacaaac
cacggtagtt acaatttctt cttgtgaatc tgacgtatgc tctaagactg 4200cttctccagc
cattgtatct acaagcactg ctactattaa cggcgttact acagaataca 4260caacatggtg
tcctatttcc accacagaat cgaggcaaca aacaacgcta gttactgtta 4320cttcctgcga
atctggtgtg tgttccgaaa ctgcttcacc tgccattgtt tcgacggcca 4380cggctactgt
gaatgatgtt gttacggtct atcctacatg gaggccacag actgcgaatg 4440aagagtctgt
cagctctaaa atgaacagtg ctaccggtga gacaacaacc aatactttag 4500ctgctgaaac
gactaccaat actgtagctg ctgagacgat taccaatact ggagctgctg 4560ccatttctca
aatcactgac ggtcaaatcc aagctactac cactgctacc accgaagcta 4620ccaccactgc
tgccccatct tccaccgttg aaactgtttc tccatccagc accgaaacta 4680tctctcaaca
aactgaaaat ggtgctgcta aggccgctgt cggtatgggt gccggtgctc 4740tagctgctgc
tgctatgttg ttataagctt gttattactg agtagtattt atttaagtat 4800tgtttgtgca
cttgccccga atttcttatg atttatgatt tttattatta aataagttat 4860aaaaaaaata
agtgtataca aattttaaag tgactcttag gttttaaaac gaaaattctt 4920attcttgagt
aactctttcc tgtaggtcag gttgctttct caggtatagc atgaggtcgc 4980tcacatgtga
gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc 5040gtttttccat
aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag 5100gtggcgaaac
ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt 5160gcgctctcct
gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg 5220aagcgtggcg
ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg 5280ctccaagctg
ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg 5340taactatcgt
cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac 5400tggtaacagg
attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg 5460gcctaactac
ggctacacta gaaggacagt atttggtatc tgcgctctgc tgaagccagt 5520taccttcgga
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg 5580tggttttttt
gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc 5640tttgatcttt
tctacggggt ctgacgctca gtggaacgaa aactcacgtt aagggatttt 5700ggtcatgaga
ttatcaaaaa ggatcttcac ctagatcctt ttaaattaaa aatgaagttt 5760taaatcaatc
taaagtatat atgagtaaac ttggtctgac agttaccaat gcttaatcag 5820tgaggcacct
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt 5880cgtgtagata
actacgatac gggagggctt accatctggc cccagtgctg caatgatacc 5940gcgagaccca
cgctcaccgg ctccagattt atcagcaata aaccagccag ccggaagggc 6000cgagcgcaga
agtggtcctg caactttatc cgcctccatc cagtctatta attgttgccg 6060ggaagctaga
gtaagtagtt cgccagttaa tagtttgcgc aacgttgttg ccattgctac 6120aggcatcgtg
gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg 6180atcaaggcga
gttacatgat cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc 6240tccgatcgtt
gtcagaagta agttggccgc agtgttatca ctcatggtta tggcagcact 6300gcataattct
cttactgtca tgccatccgt aagatgcttt tctgtgactg gtgagtactc 6360aaccaagtca
ttctgagaat agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat 6420acgggataat
accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc 6480ttcggggcga
aaactctcaa ggatcttacc gctgttgaga tccagttcga tgtaacccac 6540tcgtgcaccc
aactgatctt cagcatcttt tactttcacc agcgtttctg ggtgagcaaa 6600aacaggaagg
caaaatgccg caaaaaaggg aataagggcg acacggaaat gttgaatact 6660catactcttc
ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg 6720atacatattt
gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg 6780aaaagtgcca
cct
679377146DNAArtificial Sequenceyeast expression vector pMAT19 7gacgtcaatt
cgcgttaaat ttttgttaaa tcagctcatt ttttaaccaa taggccgaaa 60tccccaaaat
cccttataaa tcaaaagaat agaccgagat agggttgagt gttgttccag 120tttggaacaa
gagtccacta ttaaagaacg tggactccaa cgtcaaaggg cgaaaaaccg 180tctatcaggg
cgatggccca ctacgtgaac catcacccta atcaagtttt ttggggtcga 240ggtgccgtaa
agcactaaat cggaacccta aagggatgcc ccgatttaga gcttgacggg 300gaaagccggc
gaacgtggcg agaaaggaag ggaagaaagc gaaaggagcg ggcgctaggg 360cgctggcaag
tgtagcggtc acgctgcgcg taaccaccac acccgccgcg cttaatgcgc 420cgctacaggg
cgcgtttaat taaacggatt agaagccgcc gagcgggtga cagccctccg 480aaggaagact
ctcctccgtg cgtcctcgtc ttcaccggtc gcgttcctga aacgcagatg 540tgcctcgcgc
cgcactgctc cgaacaataa agattctaca atactagctt ttatggttat 600gaagaggaaa
aattggcagt aacctggccc cacaaacctt caaatgaacg aatcaaatta 660acaaccatag
gatgataatg cgattagttt tttagcctta tttctggggt aattaatcag 720cgaagcgatg
atttttgatc tattaacaga tatataaatg caaaaactgc ataaccactt 780taactaatac
tttcaacatt ttcggtttgt attacttctt attcaaatgt aataaaagta 840tcaacaaaaa
attgttaata tacctctata ctttaacgtc aaggagaaaa aaccccggat 900cggactacta
gcctaggtat gtagcgcaac gcaattaatg tgagttagct cactcattac 960taaccccagg
ctttacactt tatgcttcca gctcgtatgt tgtgtggaat tgtgagcgga 1020taacaattta
gtaaggagat ctaaaaaatg cggtttagta cgacactggc gacagcagca 1080acagcacttt
tcttcacagc aagtcaggta agcgcgagct ccgaagtgca gctggtgcag 1140agcggtgcgg
aagtgaaaaa accgggtgaa agcctgaaaa tcagctgcaa aggttccgga 1200tacaccttca
gccgctactg ggttggctgg gtgcgtcaga tgcccgggaa aggtctggaa 1260tggatgggtg
ggatctatcc gggtgacggt tatacccact acaacccgaa attccagggt 1320caggtgacca
tctctgcaga taaaagcatc agcaccgcgt acttgcagtg gagcagcctg 1380aaagctagcg
ataccgcgat gtactactgt gcgcgcttcc cgaactgggg tagcttcgat 1440tactggggcc
aaggcaccct ggtgaccgtc tcgagcgcaa gcaccaaagg cccatcggta 1500ttccccctgg
caccctcctc caagagcacc tctgggggca cagcggccct gggctgcctg 1560gtcaaggact
acttccccga gccggtgacg gtgtcgtgga actcaggcgc tctgaccagc 1620ggcgtgcaca
ccttcccggc tgtcctacag tcctcaggac tctactccct cagcagcgtg 1680gtgactgtgc
cctccagcag cttgggcacc cagacctaca tctgcaacgt gaatcacaag 1740cccagcaaca
ctaaggtgga caagaaagtt gagcccaaat cttgtgacaa aactcacaca 1800gcggccgctt
atccatacga cgtaccagac tacgcaggag gtcatcacca tcatcaccat 1860gtcgacggat
ctggaggagg tgaggagaag tcccggctgt tggagaagga gaaccgtgaa 1920ctggaaaaga
tcattgctga gaaagaggag cgtgtctctg aactgcgcca tcaactccag 1980tctgtaggag
gttgttgagt cgactaatag gcctcgaatt tcttatgatt tatgattttt 2040attattaaat
aagttataaa aaaaataagt gtatacaaat tttaaagtga ctcttaggtt 2100ttaaaacgaa
aattcttatt cttgagtaac tctttcctgt aggtcaggtt gctttctcag 2160gtatagcatg
aggtcgctcg gcgcgccacg gattagaagc cgccgagcgg gtgacagccc 2220tccgaaggaa
gactctcctc cgtgcgtcct cgtcttcacc ggtcgcgttc ctgaaacgca 2280gatgtgcctc
gcgccgcact gctccgaaca ataaagattc tacaatacta gcttttatgg 2340ttatgaagag
gaaaaattgg cagtaacctg gccccacaaa ccttcaaatg aacgaatcaa 2400attaacaacc
ataggatgat aatgcgatta gttttttagc cttatttctg gggtaattaa 2460tcagcgaagc
gatgattttt gatctattaa cagatatata aatgcaaaaa ctgcataacc 2520actttaacta
atactttcaa cattttcggt ttgtattact tcttattcaa atgtaataaa 2580agtatcaaca
aaaaattgtt aatatacctc tatactttaa cgtcaaggag aaaaaaccct 2640cagcgtatgt
agcgcaacgc aattaatgtg agttagctca ctcattacta accccaggct 2700ttacacttta
tgcttccagc tcgtatgttg tgtggaattg tgagcggata acaatttagt 2760aaggagatcg
ataaaatgcg gtttagtacg acactggcga cagcagcaac agcacttttc 2820ttcacagcaa
gtcaggtaag cgctggatcc gaaatcgtgc tgacccagtc tccgggcacc 2880ctgagcctgt
caccaggtga acgtgcgacc ctgtcttgca aagcctctca gtctctttct 2940cctacttacc
tgcactggta tcagcagaaa ccgggtcagg cgccgcgtct gctgatctac 3000ggtgcgagca
gccgtgcgac cggtatcccg gaccgtttca gcggtagcgg tagcggcacc 3060gatttcaccc
tgaccatcag ccgtctggaa ccggaagact tcgcggtgta ctactgccag 3120cactacgaga
ccttcggtca gggtaccaaa gtggagatca aacgtacggt ggctgcacca 3180tctgtcttca
tcttcccgcc atctgatgag cagttgaaat ctggaactgc ctctgttgtg 3240tgcctgctga
ataacttcta tcccagagag gccaaagtac agtggaaggt ggataacgcc 3300ctccaatcgg
gtaactccca ggagagtgtc acagagcagg acagcaagga cagcacctac 3360agcctcagca
gcaccctgac gctgagcaaa gcagactacg agaaacacaa agtctacgcc 3420tgcgaagtca
cccatcaggg cctgagttcg cccgtcacaa agagcttcaa caggggagag 3480tgttaatgac
cgcggatcat gtaattagtt atgtcacgct tacattcacg ccctcccccc 3540acatccgctc
taaccgaaaa ggaaggagtt agacaacctg aagtctaggt ccctatttat 3600ttttttatag
ttatgttagt attaagaacg ttatttatat ttcaaatttt tctttttttt 3660ctgtacagac
gcgtgtacgc atgtaacatt atactgaaaa ccttgcttga gaaggttttg 3720ggacgctcga
aggctttaat ttgcaagctg cgcgcgggtc cttttcatca cgtgctataa 3780aaataattat
aatttaaatt ttttaatata aatatataaa ttaaaaatag aaagtaaaaa 3840aagaaattaa
agaaaaaata gtttttgttt tccgaagatg taaaagactc tagggggatc 3900gccaacaaat
actacctttt atcttgctct tcctgctctc aggtattaat gccgaattgt 3960ttcatcttgt
ctgtgtagaa gaccacacac gaaaatcctg tgattttaca ttttacttat 4020cgttaatcga
atgtatatct atttaatctg cttttcttgt ctaataaata tatatgtaaa 4080gtacgctttt
tgttgaaatt ttttaaacct ttgtttattt ttttttcttc attccgtaac 4140tcttctacct
tctttattta ctttctaaaa tccaaataca aaacataaaa ataaataaac 4200acagagtaaa
ttcccaaatt attccatcat taaaagatac gaggcgcgtg taagttacag 4260gcaagcgatc
cgtcctaaga aaccattatt atcatgacat taacctataa aaataggcgt 4320atcacgaggc
cctttcgtct tcaagaaatt cggtcgaaaa aagaaaagga gagggccaag 4380agggagggca
ttggtgacta ttgagcacgt gagtatacgt gattaagcac acaaaggcag 4440cttggagtat
gtctgttatt aatttcacag gtagttctgg tccattggtg aaagtttgcg 4500gcttgcagag
cacagaggcc gcagaatgtg ctctagattc cgatgctgac ttgctgggta 4560ttatatgtgt
gcccaataga aagagaacaa ttgacccggt tattgcaagg aaaatttcaa 4620gtcttgtaaa
agcatataaa aatagttcag gcactccgaa atacttggtt ggcgtgtttc 4680gtaatcaacc
taaggaggat gttttggctc tggtcaatga ttacggcatt gatatcgtcc 4740aactgcacgg
agatgagtcg tggcaagaat accaagagtt cctcggtttg ccagttatta 4800aaagactcgt
atttccaaaa gactgcaaca tactactcag tgcagcttca cagaaacctc 4860attcgtttat
tcccttgttt gattcagaag caggtgggac aggtgaactt ttggattgga 4920actcgatttc
tgactgggtt ggaaggcaag agagccccga gagcttacat tttatgttag 4980ctggtggact
gacgccagaa aatgttggtg atgcgcttag attaaatggc gttattggtg 5040ttgatgtaag
cggaggtgtg gagacaaatg gtgtaaaaga ctctaacaaa atagcaaatt 5100tcgtcaaaaa
tgctaagaaa taggttatta ctgagtagta tttatttaag tattgtttgt 5160gcacttgccc
cgaatttctt atgatttatg atttttatta ttaaataagt tataaaaaaa 5220ataagtgtat
acaaatttta aagtgactct taggttttaa aacgaaaatt cttattcttg 5280agtaactctt
tcctgtaggt caggttgctt tctcaggtat agcatgaggt cgctcacatg 5340tgagcaaaag
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 5400cataggctcc
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 5460aacccgacag
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 5520cctgttccga
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 5580gcgctttctc
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 5640ctgggctgtg
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 5700cgtcttgagt
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 5760aggattagca
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 5820tacggctaca
ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 5880ggaaaaagag
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 5940tttgtttgca
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 6000ttttctacgg
ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 6060agattatcaa
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 6120atctaaagta
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 6180cctatctcag
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag 6240ataactacga
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 6300ccacgctcac
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc 6360agaagtggtc
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct 6420agagtaagta
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc 6480gtggtgtcac
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg 6540cgagttacat
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc 6600gttgtcagaa
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat 6660tctcttactg
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag 6720tcattctgag
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat 6780aataccgcgc
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg 6840cgaaaactct
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca 6900cccaactgat
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga 6960aggcaaaatg
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc 7020ttcctttttc
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata 7080tttgaatgta
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg 7140ccacct
714686804DNAArtificial Sequencemammalian expression vector pMAG10
8tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca
60cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg
120ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc
180accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc
240attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat
300tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt
360tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgcgttaaat ttttgttaaa
420tcagctcatt ttttaaccaa taggccgaaa tccccaaaat cccttataaa tcaaaagaat
480agaccgagat agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg
540tggactccaa cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctacgtgaac
600catcacccta atcaagtttt ttggggtcga ggtgccgtaa agcactaaat cggaacccta
660aagggatgcc ccgatttaga gcttgacggg gaaagccggc gaacgtggcg agaaaggaag
720ggaagaaagc gaaaggagcg ggcgctaggg cgctggcaag tgtagcggtc acgctgcgcg
780taaccaccac acccgccgcg cttaatgcgc cgctacaggg cgcgtttaat taactctagt
840tattaatagt aatcaattac ggggtcatta gttcatagcc catatatgga gttccgcgtt
900acataactta cggtaaatgg cccgcctggc tgaccgccca acgacccccg cccattgacg
960tcaataatga cgtatgttcc catagtaacg ccaataggga ctttccattg acgtcaatgg
1020gtggagtatt tacggtaaac tgcccacttg gcagtacatc aagtgtatca tatgccaagt
1080acgcccccta ttgacgtcaa tgacggtaaa tggcccgcct ggcattatgc ccagtacatg
1140accttatggg actttcctac ttggcagtac atctacgtat tagtcatcgc tattaccatg
1200catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc
1260cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg
1320gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag
1380aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg
1440gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcgggagtcg tgcgcgctgc
1500cttcgccccg tgccccgctc cgccgccgcc tcgcgccgcc cgccccggct ctgactgacc
1560gcgttactcc cacaggtgag cgggcgggac ggcccttctc cttcgggctg taattagcgc
1620ttggtttaat gacggcttgt ttcttttctg tggctgcgtg aaagccttga ggggctccgg
1680gagggccctt tgtgcggggg gagcggctcg gggctgtccg cggggggacg gctgccttcg
1740ggggggacgg ggcagggcgg ggttcggctt ctggcgtgtg accggcggct ctagagcctc
1800tgctaaccat gttcatgcct tcttcttttt cctacagctc ctgggcaacg tgctggttat
1860tgtgctgtct catcattttg gcaaagaatt ggatcggacc gaagcttgcg caacgcaatt
1920aatgtgagtt agctcactca ttaggcaccc caggctttac actttatgct tccggctcgt
1980atgttgtgtg gaattgtgag cggataacaa tttcacacta aggaggttta aaccatggct
2040acaggctccc ggacgagtct gctcctggct tttggcctgc tctgcctgcc ctggcttcaa
2100gagggatccg cgagctccga ggtgcagctg gtgcagagcg gcggcggcgt ggtgcagccg
2160ggcggcagcc tgcgtctgag ctgcgccgcg agcggctaca ccttcaccaa ctacggcatg
2220aactggattc gtcaggcccc cgggaagggc ctggagtggg tgggctggat caacacctac
2280accggcgagc cgacctacgc agctgacttc aagcgtcgtg tcaccttcag cctcgacacc
2340agcaagagca cggcgtacct gcaactgaac agcctgaggg ccgaggacac tgcagtttac
2400tactgcgcga aatacccgta ctactacggt cgtagccact ggtacttcga cgtctggggc
2460caagggaccc ttgtcaccgt ctcgagcggc ggtggcggtt ctggtggtgg tggctctggt
2520ggcggcggat ccgatatcgt gatgacccag agcccgagca ccctgagcgc gagtccgggt
2580gagcgcgcga ccatcacctg cagtgcgagc cagagcatca gcacctacct ggcgtggtat
2640cagcagaaac caggtcaagc gccgcaagtg ctgatctacg ctgcgagcaa cctggcgtcc
2700ggagtgccga accgtttcag cggtagccgt agcgggaccg atttcaccct gaccatcagc
2760agcttgcagc cggaagactt cgcggtgtac tactgccagc agtactacag caccccgtgg
2820accttcggtg gtggtaccaa agtggaaatc aaagcggccg cttatccata cgacgtacca
2880gactacgcag gaggtcatca ccatcatcac catgtcgacg gatctggagg aggtgaggag
2940aagtcccggc tgttggagaa ggagaaccgt gaactggaaa agatcattgc tgagaaagag
3000gagcgtgtct ctgaactgcg ccatcaactc cagtctgtag gaggttgtta ataagtcgac
3060taatgaagat ctattaacct caggtgcagg ctgcctatca gaaggtggtg gctggtgtgg
3120ccaatgccct ggctcacaaa taccactgag atcgatcttt ttccctctgc caaaaattat
3180ggggacatca tgaagcccct tgagcatctg acttctggct aataaaggaa atttattttc
3240attgcaatag tgtgttggaa ttttttgtgt ctctcactcg gaaggacata tgggagggca
3300aatcatttaa aacatcagaa tgagtttttg gtttagagtt tggcaacata tgcccatatg
3360taactagcat aaccccttgg ggcctctaaa cgggtcttga ggggtttttt gatatccaga
3420catgataaga tacattgatg agtttggaca aaccacaact agaatgcagt gaaaaaaatg
3480ctttatttgt gaaatttgtg atgctattgc tttatttgta accattataa gctgcaataa
3540acaagttggg gtgggcgaag aactccagca tgagatcccc gcgctggagg atcatccagc
3600cggcgtcccg gaaaacgatt ccgaagccca acctttcata gaaggcggcg gtggaatcga
3660aatctcgtga tggcaggttg ggcgtcgctt ggtcggtcat ttcgcgaacc ccagagtccc
3720gctcagaaga actcgtcaag aaggcgatag aaggcgatgc gctgcgaatc gggagcggcg
3780ataccgtaaa gcacgaggaa gcggtcagcc cattcgccgc caagctcttc agcaatatca
3840cgggtagcca acgctatgtc ctgatagcgg tccgccacac ccagccggcc acagtcgatg
3900aatccagaaa agcggccatt ttccaccatg atattcggca agcaggcatc gccatgggtc
3960acgacgagat cctcgccgtc gggcatgcgc gccttgagcc tggcgaacag ttcggctggc
4020gcgagcccct gatgctcttc gtccagatca tcctgatcga caagaccggc ttccatccga
4080gtacgtgctc gctcgatgcg atgtttcgct tggtggtcga atgggcaggt agccggatca
4140agcgtatgca gccgccgcat tgcatcagcc atgatggata ctttctcggc aggagcaagg
4200tgagatgaca ggagatcctg ccccggcact tcgcccaata gcagccagtc ccttcccgct
4260tcagtgacaa cgtcgagcac agctgcgcaa ggaacgcccg tcgtggccag ccacgatagc
4320cgcgctgcct cgtcctgcag ttcattcagg gcaccggaca ggtcggtctt gacaaaaaga
4380accgggcgcc cctgcgctga cagccggaac acggcggcat cagagcagcc gattgtctgt
4440tgtgcccagt catagccgaa tagcctctcc acccaagcgg ccggagaacc tgcgtgcaat
4500ccatcttgtt caatcatgcg aaacgatcct catcctgtct cttgatcaga tccgaaaatg
4560gatatacaag ctcccgggag ctttttgcaa aagcctaggc ctccaaaaaa gcctcctcac
4620tacttctgga atagctcaga ggcagaggcg gcctcggcct ctgcataaat aaaaaaaatt
4680agtcagccat ggggcggaga atgggcggaa ctgggcggag ttaggggcgg gatgggcgga
4740gttaggggcg ggactatggt tgctgactaa ttgagatgca tgctttgcat acttctgcct
4800gctggggagc ctggggactt tccacacctg gttgctgact aattgagatg catgctttgc
4860atacttctgc ctgcctgggg agcctgggga ctttccacac cctaactgac acacattcca
4920cagacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg
4980cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga
5040ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg
5100tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg
5160gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc
5220gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg
5280gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca
5340ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt
5400ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag
5460ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg
5520gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc
5580ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt
5640tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt
5700ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca
5760gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg
5820tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac
5880cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg
5940ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc
6000gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta
6060caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac
6120gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc
6180ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac
6240tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact
6300caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa
6360tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt
6420cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca
6480ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa
6540aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac
6600tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg
6660gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc
6720gaaaagtgcc acctgacgtc taagaaacca ttattatcat gacattaacc tataaaaata
6780ggcgtatcac gaggcccttt cgtc
680495416DNAArtificial Sequencemammalian helper vector pMAG2 9tcgcgcgttt
cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60cagcttgtct
gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120ttggcgggtg
tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180accatatgcg
gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240attcgccatt
caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300tacgccagct
ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360tttcccagtc
acgacgttgt aaaacgacgg ccagtgaatt cggatcggga gatctcccga 420tcccctatgg
tcgactctca gtacaatctg ctctgatgcc gcatagttaa gccagtatct 480gctccctgct
tgtgtgttgg aggtcgctga gtagtgcgcg agcaaaattt aagctacaac 540aaggcaaggc
ttgaccgaca attgcatgaa gaatctgctt agggttaggc gttttgcgct 600gcttcgcgat
gtacgggcca gatatacgcg ttgacattga ttattgacta gttattaata 660gtaatcaatt
acggggtcat tagttcatag cccatatatg gagttccgcg ttacataact 720tacggtaaat
ggcccgcctg gctgaccgcc caacgacccc cgcccattga cgtcaataat 780gacgtatgtt
cccatagtaa cgccaatagg gactttccat tgacgtcaat gggtggacta 840tttacggtaa
actgcccact tggcagtaca tcaagtgtat catatgccaa gtacgccccc 900tattgacgtc
aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 960ggactttcct
acttggcagt acatctacgt attagtcatc gctattacca tggtgatgcg 1020gttttggcag
tacatcaatg ggcgtggata gcggtttgac tcacggggat ttccaagtct 1080ccaccccatt
gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg actttccaaa 1140atgtcgtaac
aactccgccc cattgacgca aatgggcggt aggcgtgtac ggtgggaggt 1200ctatataagc
agagctctct ggctaactag agaacccact gcttactggc ttatcgaaat 1260taatacgact
cactataggg agacccaagc tggctagcac catgcgaccc tccgggacgg 1320ccggggcagc
gctcctggcg ctgctggctg cgctctgccc ggcgtctaga gctaccagcc 1380gcctggaggg
cctgcagagc gagaaccacc gcctgcgcat gaagatcacc gagctggaca 1440aggacctgga
ggaggtgacc atgcagctgc aggacgtggg cggctgcgcg gccgccgagc 1500agaagctgat
cagcgaggag gacctgaccg gtggaggctc cggaggaggt agcggatccg 1560gtacgaatgg
gcctaagatc ccgtccatcg ccactgggat ggtgggggcc ctcctcttgc 1620tgctggtggt
ggccctgggg atcggcctct tcatgcgaag gcgccacatc gttcggaagc 1680gcacgctgcg
gaggctgctg caggagaggg agcttgtgga gcctcttaca cccagttgat 1740aagcttgttt
aaacccgctg atcagcctcg actgtgcctt ctagttgcca gccatctgtt 1800gtttgcccct
cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc 1860taataaaatg
aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt 1920ggggtggggc
aggacagcaa gggggaggat tgggaagaca atagcaggca tgctggggat 1980gcggtgggct
ctatggcttc tgaggcggaa agaaccagct ggggctctag ggggtatccc 2040cggcgcgcca
atttaacgcg aattaattct gtggaatgtg tgtcagttag ggtgtggaaa 2100gtccccaggc
tccccagcag gcagaagtat gcaaagcatg catctcaatt agtcagcaac 2160caggtgtgga
aagtccccag gctccccagc aggcagaagt atgcaaagca tgcatctcaa 2220ttagtcagca
accatagtcc cgcccctaac tccgcccatc ccgcccctaa ctccgcccag 2280ttccgcccat
tctccgcccc atggctgact aatttttttt atttatgcag aggccgaggc 2340cgcctctgcc
tctgagctat tccagaagta gtgaggaggc ttttttggag gcctaggctt 2400ttgcaaaaag
ctcccgggag cttgtatatc cattttcgga tctgatcagc acgtgttgac 2460aattaatcat
cggcatagta tatcggcata gtataatacg acaaggtgag gaactaaacc 2520atggccaagt
tgaccagtgc cgttccggtg ctcaccgcgc gcgacgtcgc cggagcggtc 2580gagttctgga
ccgaccggct cgggttctcc cgggacttcg tggaggacga cttcgccggt 2640gtggtccggg
acgacgtgac cctgttcatc agcgcggtcc aggaccaggt ggtgccggac 2700aacaccctgg
cctgggtgtg ggtgcgcggc ctggacgagc tgtacgccga gtggtcggag 2760gtcgtgtcca
cgaacttccg ggacgcctcc gggccggcca tgaccgagat cggcgagcag 2820ccgtgggggc
gggagttcgc cctgcgcgac ccggccggca actgcgtgca cttcgtggcc 2880gaggagcagg
actgacacgt gctacgagat ttcgattcca ccgccgcctt ctatgaaagg 2940ttgggcttcg
gaatcgtttt ccgggacgcc ggctggatga tcctccagcg cggggatctc 3000atgctggagt
tcttcgccca ccccaacttg tttattgcag cttataatgg ttacaaataa 3060agcaatagca
tcacaaattt cacaaataaa gcattttttt cactgcattc tagttgtggt 3120ttgtccaaac
tcatcaatgt atcttatcat gtctgtatac cgtcgacctc tagctagagc 3180ttggcgtaat
catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca 3240cacaacatac
gagccggaag cataaagtgt aaagcctggg gtgcctaatg agtgagctaa 3300ctcacattaa
ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag 3360ctgcattaat
gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc 3420gcttcctcgc
tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 3480cactcaaagg
cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 3540tgagcaaaag
gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 3600cataggctcc
gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 3660aacccgacag
gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 3720cctgttccga
ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 3780gcgctttctc
atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 3840ctgggctgtg
tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 3900cgtcttgagt
ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 3960aggattagca
gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 4020tacggctaca
ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 4080ggaaaaagag
ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 4140tttgtttgca
agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 4200ttttctacgg
ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 4260agattatcaa
aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 4320atctaaagta
tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 4380cctatctcag
cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag 4440ataactacga
tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 4500ccacgctcac
cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc 4560agaagtggtc
ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct 4620agagtaagta
gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc 4680gtggtgtcac
gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg 4740cgagttacat
gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc 4800gttgtcagaa
gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat 4860tctcttactg
tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag 4920tcattctgag
aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat 4980aataccgcgc
cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg 5040cgaaaactct
caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca 5100cccaactgat
cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga 5160aggcaaaatg
ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc 5220ttcctttttc
aatattattg aagcatttat cagggttatt gtctcatgag cggatacata 5280tttgaatgta
tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg 5340ccacctgacg
tctaagaaac cattattatc atgacattaa cctataaaaa taggcgtatc 5400acgaggccct
ttcgtc
54161038PRTArtificial SequenceC-terminal sequence of GABAB receptor 1
10Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu Leu Glu Lys1
5 10 15 Ile Ile Ala Glu
Lys Glu Glu Arg Val Ser Glu Leu Arg His Gln Leu 20
25 30 Gln Ser Val Gly Gly Cys 35
1138PRTArtificial SequenceC-terminal sequence of GABAB 5
receptor 2 11Thr Ser Arg Leu Glu Gly Leu Gln Ser Glu Asn His Arg Leu Arg
Met1 5 10 15 Lys
Ile Thr Glu Leu Asp Lys Asp Leu Glu Glu Val Thr Met Gln Leu 20
25 30 Gln Asp Val Gly Gly Cys
35
User Contributions:
Comment about this patent or add new information about this topic:
People who visited this patent also read: | |
Patent application number | Title |
---|---|
20220266974 | SYSTEM FOR AND METHOD OF CONTROLLING WATERCRAFT |
20220266973 | SYSTEM FOR AND METHOD OF CONTROLLING WATERCRAFT |
20220266972 | STOWABLE PROPULSION DEVICES FOR MARINE VESSELS AND METHODS FOR MAKING STOWABLE PROPULSION DEVICES FOR MARINE VESSELS |
20220266971 | STOWABLE MARINE PROPULSION SYSTEMS |
20220266970 | PROPULSION DEVICES WITH LOCK DEVICES AND METHODS OF MAKING PROPULSION DEVICES WITH LOCK DEVICES FOR MARINE VESSELS |