Patent application title: GENETICALLY-MODIFIED POULTRY EGG
Inventors:
IPC8 Class: AC12N1585FI
USPC Class:
1 1
Class name:
Publication date: 2021-09-02
Patent application number: 20210269823
Abstract:
Provided are a poultry knock-in egg and knock-out egg. The present
invention pertains to a knock-out poultry egg in which at least one
oviduct-specific gene has been knocked out, said gene being selected from
the group consisting of ovalbumin, ovomucoid, ovomucin, ovotransferrin,
ovoinhibitor, and lysozyme, and at least one egg allergen protein has
been reduced or eliminated, said protein being selected from the group
consisting of ovalbumin, ovomucoid, ovomucin, ovotransferrin,
ovoinhibitor, and lysozyme.Claims:
1-22. (canceled)
23. A genetically modified chicken egg whose genome comprises a deletion, a substitution, or an insertion of a base or bases in a coding region of an endogenous ovomucoid gene, wherein the chicken egg that does not functionally express endogenous ovomucoid.
24. The chicken egg of claim 23, wherein the coding region is exon 3 of ovomucoid gene.
25. The chicken egg of claim 24, wherein the coding region is OVMtg2 represented by the nucleotide sequence of SEQ ID NO: 6.
26. A genetically modified chicken whose genome comprises a deletion, a substitution, or an insertion of a base or bases in a coding region of an endogenous ovomucoid gene, wherein the chicken is capable of laying an egg that does not functionally express endogenous ovomucoid.
27. The chicken of claim 26, wherein the coding region is exon 3 of ovomucoid gene.
28. The chicken egg of claim 27, wherein the coding region is OVMtg2 represented by the nucleotide sequence of SEQ ID NO: 6.
29. A method of producing a genetically modified knock-out chicken egg, the method comprising: a) deleting, substituting, or inserting a base or bases in a coding region of an endogenous ovomucoid gene of an isolated chicken primordial germ cell (PGC), wherein: the endogenous ovomucoid gene is inactivated; b) transplanting the PGC obtained in step a) into a recipient chicken embryo; c) producing a genetically modified knock-out male chicken and a genetically modified knock-out female chicken from the chicken embryo in step b); d) mating the genetically modified knock-out male chicken and the genetically modified knock-out female chicken; e) obtaining an offspring knock-out chicken, wherein the offspring knock-out chicken is capable of laying an egg; and f) obtaining a genetically modified knock-out chicken egg from the knock-out offspring chicken of step e), wherein the chicken egg does not functionally express endogenous ovomucoid.
30. The method of claim 29, wherein the coding region is exon 3 of ovomucoid gene.
31. The method of claim 30, wherein the coding region is OVMtg2 represented by the nucleotide sequence of SEQ ID NO: 6.
32. The method of claim 29, wherein the base or bases in the coding region of the endogenous ovomucoid gene in step a) is deleted, substituted, or inserted using CRISPR and guide RNA.
Description:
TECHNICAL FIELD
[0001] The present invention relates to an egg of knock-out poultry and an individual derived therefrom, and an egg and a thick albumen of knock-in poultry.
[0002] Further, the present invention relates to a method of preparing an expression product of an exogenous gene.
BACKGROUND
[0003] It has been attempted to induce the expression of human interferon .beta. in an egg using a promoter containing a 2.8 kbp upstream region of a translation starting site of ovalbumin, or a promoter to which a further upstream estrogen-responsive enhancer element is attached to the 2.8 kbp region (Non-Patent Literature 1). In this example, a gene transfection is carried out using a lentiviral vector instead of using a knock-in method, so that a vector gene is inserted in a various location in the genome, and several vector genes are inserted. Consistent with such a gene transfection form, a concentration of human interferon .beta. secreted in the egg significantly varies, and an average concentration in 6 chickens is 3.5 to 426 .mu.g/ml. Further, data show that the concentration significantly varies among eggs derived from the same individual, indicating that the expression of interferon .beta. is very unstable. Further, because the genes inserted in various locations of chromosomes are subjected to a gene silencing effect or the like, in general offspring (G2) of G1 chickens expressing interferon .beta. at a relatively high level tend to reduce the expression level of interferon .beta..
[0004] In Non-Patent Literature 2, transgenic chimeric chickens (G0) expressing a Fv-Fc protein in the whole body are created using an actin promoter causing an expression in the whole body and a retroviral vector. It was confirmed that some G0 chickens express the Fv-Fc protein at a high concentration of 5 mg/ml. However, because the gene transfection is performed by viral infection in a chicken embryo, the gene transfection is achieved in a mosaic manner in which the presence/absence of gene insertion, the copy number of insertion, and a location of insertion differ between cells in the same individual. As a result, the G0 chimeric individual expressing the protein at a high concentration produces offsprings of transgenic individuals having the inserted genes varied in numbers and positions, making it difficult to completely transmit the character of the G0 chimeric individual to the offsprings. In fact, the expression level is reduced to 2 mg/ml or less in the G1 generation and 0.8 mg/ml or less in the G1 generation. Further, in many cases, an exogenous gene is not introduced in germline cells of the chimeric chicken infected by viruses. Thus, although one or several chickens that express the protein at a high level can be occasionally obtained in the G0 generation, it is difficult to propagate individuals having the same character and genetic information from such chickens. Such non-uniformity among G0 individuals or between G0 and G1 generations causes a fatal disadvantage for building so-called an "animal factory" in which a large number of chickens expressing an exogenous protein are grown to obtain a large number of eggs for mass production of the proteins.
[0005] Non-Patent Literature 4 shows an example in which an oviduct-specific gene, ovalbumin, is disrupted using a TALEN method. However, this literature only shows that a chick having a heterozygous deletion (+/-) in ovalbumin is obtained, and it is difficult to predict whether such poultry can produce an egg in the future, whether an egg having a null (-/-) genotype or an individual derived therefrom can be obtained, whether a homozygous knockout female (-/-) produces an egg, or whether an individual can hatch from an egg lacking the ovalbumin protein.
CITATION LIST
Non-Patent Literature
[0006] Non-Patent Literature 1: Lillico, S. G. et al. Oviduct-specific expression of two therapeutic proteins in transgenic hens. Proc Natl Acad Sci USA 104, 1771-1776 (2007)
[0007] Non-Patent Literature 2: Kamihira et al. High-Level Expression of Single-Chain Fv-Fc Fusion Protein in Serum and Egg White of Genetically Manipulated Chickens by Using a Retroviral Vector. Journal of Virology, p. 10864-10874 (2005)
[0008] Non-Patent Literature 3: van de Lavoir M C, Diamond J H, Leighton P A, Mather-Love C, Heyer B S, Bradshaw R, Kerchner A, Hooi L T, Gessaro T M, Swanberg S E et al: Germline transmission of genetically modified primordial germ cells. Nature 2006, 441(7094): 766-769
[0009] Non-Patent Literature 4: Park, T. S., Lee, H. J., Kim, K. H., Kim, J. S. & Han, J. Y. Targeted gene knockout in chickens mediated by TALENs. Proc Natl Acad Sci USA. 111, 12716-12721 (2014)
SUMMARY
Technical Problem
[0010] An object of the present invention is to provide a poultry egg in which an expression level of a protein encoded by an oviduct-specific gene is reduced or eliminated.
[0011] Further, another object of the present invention is to provide a poultry egg in which an exogenous gene is stably expressed, and a gene product thereof is highly expressed.
[0012] Further, another object of the present invention is to provide a technique for efficiently recovering an exogenous gene product, which is expressed in a chicken egg using a knock-in technique.
Solution to Problem
[0013] The present invention provides the following knock-out poultry egg and knock-in poultry egg. Further, the present invention provides a method of efficiently preparing an exogenous gene product from the knock-in poultry egg.
[0014] The present invention, in one aspect, relates to: [1] a knock-out poultry egg in which at least one oviduct-specific gene is knocked out, the gene being selected from the group consisting of ovalbumin, ovomucoid, ovomucin, ovotransferrin, ovoinhibitor, and lysozyme, and at least one egg allergen protein is reduced or eliminated, the protein being selected from the group consisting of ovalbumin, ovomucoid, ovomucin, ovotransferrin, ovoinhibitor, and lysozyme.
[0015] Further, in one embodiment of the present invention, [2] the knock-out poultry egg according to the item [1] above is characterized in that a base sequence encoding the knocked-out oviduct-specific gene includes deletion, substitution, or insertion of a base or bases in a region near a 5' side or 3' side of a PAM sequence.
[0016] Further, in one embodiment of the present invention, [3] the knock-out poultry egg according to the item [1] or [2] above is characterized in that the oviduct-specific gene is homozygously knocked-out and a genotype of the oviduct-specific gene is null (-/-).
[0017] Further, in one embodiment of the present invention, [4] the knock-out poultry egg according to any of the items [1] to [3] above is characterized in that the oviduct-specific gene is ovalbumin.
[0018] Further, in one embodiment of the present invention, [5] the knock-out poultry egg according to the item [4] above is characterized in that a base sequence encoding ovalbumin includes deletion, substitution, or insertion of a base or bases in a region corresponding to a base sequence represented by SEQ ID NO: 1 (OVATg1) and a vicinity thereof.
[0019] Further, in one embodiment of the present invention, [6] the knock-out poultry egg according to any of the items [1] to [3] above is characterized in that the oviduct-specific gene is an ovomucoid gene.
[0020] Further, in one embodiment of the present invention, [7] the knock-out poultry egg according to the item [6] above is characterized in that a base sequence encoding ovomucoid includes deletion, substitution, or insertion of a base or bases in a region corresponding to a base sequence represented by SEQ ID NO: 6 (OVMTg2) and a vicinity thereof.
[0021] Further, in one embodiment of the present invention, [8] the knock-out poultry egg according to the item [6] or [7] above is characterized by being substantially free of endogenous ovomucoid.
[0022] Further, the present invention, in another aspect, relates to:
[9] a knock-out poultry derived from the knock-out poultry egg according to any of the items [1] to [8] above.
[0023] Further, the present invention, in another aspect, relates to:
[10] a knock-in poultry egg, in which:
[0024] an exogenous gene under control of an oviduct-specific gene promoter is knocked-in as homozygous or heterozygous and an expression product of the exogenous gene is stably and highly expressed in an egg; and
[0025] the oviduct-specific gene promoter is at least one of promoters of oviduct-specific genes selected from the group consisting of ovalbumin, ovomucoid, ovomucin, ovotransferrin, ovoinhibitor, and lysozyme.
[0026] Further, in one embodiment of the present invention, [11] the knock-in poultry egg according to the item [10] above is characterized in that the oviduct-specific gene promoter is an ovalbumin gene promoter, and the exogenous gene and a drug-resistant gene are both inserted in an exon 2 of an ovalbumin gene.
[0027] Further, in one embodiment of the present invention, [12] the knock-in poultry egg according to the item [10] or [11] above is characterized in that the exogenous gene is inserted in a region corresponding to a base sequence represented by SEQ ID NO: 1 (OVATg1) or a vicinity thereof or a region corresponding to a base sequence represented by SEQ ID NO: 24 (OVATg2) or a vicinity thereof in a base sequence encoding ovalbumin.
[0028] Further, in one embodiment of the present invention, [13] the knock-in poultry egg according to any of the items [10] to [12] above is characterized in that a protein encoded by the exogenous gene is contained in an amount of 1 mg or more per egg.
[0029] Further, in one embodiment of the present invention, [14] the knock-in poultry egg according to any of the items [10] to [13] above is characterized in that an expression product of the exogenous gene is dominantly expressed in a thick albumen.
[0030] Further, in one embodiment of the present invention, [15] the knock-in poultry egg according to any of the items [10] to [14] above is characterized in that the exogenous gene is a gene encoding interferon .beta., immunoglobulin, or collagen.
[0031] Further, in one embodiment of the present invention, [16] the knock-in poultry egg according to any of the items [10] to [15] above is characterized in that the exogenous gene is a gene derived from a human being.
[0032] Further, the present invention, in another aspect, relates to:
[17] a thick albumen derived from a knock-in poultry egg in which an exogenous gene under control of an ovalbumin gene promoter is knocked-in homozygously or heterozygously, the thick albumen dominantly containing an expression product of the exogenous gene that is stably and highly expressed.
[0033] Further, the present invention, in another aspect, relates to:
[18] a method of producing a knock-in poultry egg containing an expression product of an exogenous gene that is stably and highly expressed, the method comprising:
[0034] a step (a) of knocking-in the exogenous gene under control of an oviduct-specific gene promoter in a poultry primordial germ cell;
[0035] a step (b) of producing female poultry in which the exogenous gene under control of the oviduct-specific gene promoter is knocked-in as homozygous or heterozygous using the poultry germ cell; and
[0036] a step (c) obtaining a poultry egg expressing the exogenous gene from the female poultry.
[0037] Further, in one embodiment of the present invention, [19] the method of producing the knock-in poultry egg containing the expression product of the exogenous gene that is stably and highly expressed according to the item [18] above is characterized in that the step (a) is a step of introducing the exogenous gene by genome editing using (i) a donor construct that includes a 5' side region of a translation starting site under control of the oviduct-specific gene promoter, the exogenous gene, a drug-resistant gene unit, and a 3' side region of the translation starting site, and (ii) a vector that includes a target sequence and a different drug-resistant gene unit.
[0038] Further, in one embodiment of the present invention, [20] the method of producing the knock-in poultry egg containing the expression product of the exogenous gene that is stably and highly expressed according to the item [19] above is characterized in that:
[0039] the oviduct-specific gene promoter in the step (a) is an ovalbumin gene promoter; and
[0040] the step (d) is a step of recovering the expression product of the exogenous gene from a thick albumen of the poultry egg.
[0041] Further, in one embodiment of the present invention, [21] the method of producing the knock-in poultry egg containing the expression product of the exogenous gene that is stably and highly expressed according to the item [19] or [20] above is characterized in that the step (a) is a step of introducing the exogenous gene by CRISPR using (i) a donor construct that includes a 2.8 kb 5' side region of a translation starting site of ovalbumin, the exogenous gene, a neomycin-resistant gene unit, and a 3.0 kb 3' side region of the translation starting site of ovalbumin, and (ii) a vector that includes a base sequence represented by SEQ ID NO: 24 as the target sequence and a neomycin-resistant gene unit.
[0042] Further, the present invention, in another aspect, relates to:
[22] a method of preparing an expression product of an exogenous gene from a knock-in poultry egg, further comprising a step (d) of recovering the expression product of the exogenous gene from the poultry egg after the step (c) in the method according to any of items [18] to [21] above.
Effects of Invention
[0043] The knock-in poultry egg of the present invention is produced from a knock-in poultry individual in which an identical exogenous gene (a gene not derived from the poultry) is inserted in an identical location in the whole body, thus the difference in a protein expression level between individuals is small and genetic information and character of the exogenous gene can be properly transmitted over generations. Further, the expression of the exogenous gene can be restricted to an oviduct by performing gene knock-in at a location of the oviduct-specific gene. In this manner, a possibility of affecting a development process and the health of chicken is clearly lower as compared to a case where the exogenous gene is expressed in the whole body, which produces excellent effects. In addition, it is further preferable that the exogenous gene is expressed under control of a gene that is highly expressing in the albumen, such as ovomucoid, to increase an expression efficiency of the exogenous gene. Moreover, the knock-in chicken can be efficiently established by the gene knock-in using genome editing. It is confirmed that using such a new technique also allows the expression of the exogenous gene in the oviduct and the accumulation of the exogenous gene expression product in the albumen. Further, it is found, for the first time, that the exogenous gene-derived product mainly localizes to the thick albumen in the albumen. Based on this observation, the exogenous gene-derived product can be efficiently recovered by recovering a portion including the thick albumen from the egg containing the exogenous gene product, in a preferred embodiment, from the egg in which the exogenous gene is knocked-in at an albumen gene locus.
[0044] Because the oviduct-specific gene is knocked-out in the knock-out poultry egg of the present invention, the impact on the development is a matter of concern. However, it is confirmed by the inventor that such knock-out poultry can produce an egg.
BRIEF DESCRIPTION OF DRAWINGS
[0045] FIG. 1 shows a target sequence of a chicken ovalbumin gene (a target sequence of OVATg1). An sgRNA recognition site is indicated in capitals and a PAM sequence is indicated by an adjacent underline.
[0046] FIG. 2 shows a target sequence of a chicken ovomucoid gene (a target sequence of OVMTg2). An sgRNA recognition site is indicated in capitals and a PAM sequence is indicated by an adjacent underline.
[0047] FIG. 3 shows an example of disruption of the ovalbumin gene by CRISPR. An sgRNA recognition site is indicated in capitals (underlined) and a PAM sequence is indicated by an adjacent box. A deletion site of mutated sequence is indicated by a hyphen (-) and mutated sites are indicated in capitals. A translation starting site of OVATg1 is indicated by Met.
[0048] FIG. 4A shows an example of disruption of the ovomucoid gene by CRISPR. An sgRNA recognition site is indicated in capitals (underlined) and a PAM sequence is indicated by an adjacent box.
[0049] FIG. 4B (upper panel): An example of a chicken in which the ovomucoid gene is disrupted. A chicken (black) in a photograph is derived from a transplanted primordial germ cell and one allele of the ovomucoid gene has a 5-base deletion in a Tg2 region shown in FIG. 2. A base sequence of the Tg2 region in the chicken genome is analyzed both on a sense side and an antisense side. (lower panel): An example of ovomucoid gene mutations found in chicken individuals (F1 chickens). Deletions of 1 to 31 bases are found.
[0050] FIG. 5A shows knock-in of a human interferon .beta. gene at an ovalbumin gene locus and demonstration of the knock-in by genome PCR. Primer 1 (P1) to primer 8 (P8) correspond to the following sequences. P1: SEQ ID NO: 15, P2: SEQ ID NO: 17, P3: SEQ ID NO: 16, P4: SEQ ID NO: 14, P5: SEQ ID NO: 18, P6: SEQ ID NO: 20, P7: SEQ ID NO: 21, P8: SEQ ID NO: 19. As a result of nested PCR, amplification products in predicted sizes are detected only in genome derived from the knock-in primordial germ cells (PGCs) (indicated by arrows in an image).
[0051] FIG. 5B shows knock-in of the human interferon .beta. gene at the ovalbumin gene locus in a chimeric chicken sperm. Semen genomes from 4 chimeric chickens (411 to 414) and 1 negative control chicken (416, NC), and genome of knock-in cells (PCIFNKI #4) are amplified using primer sets of SEQ ID NO: 18 and 19 (3' UTR), SEQ ID NO: 15 and 14 (5' OVAp_out-IFN), and SEQ ID NO: 15 and 22 (5' OVAp_out-OVA(ATG)). Detected bands in predicted sizes are indicated by "*". In 411 and 412, knock-in signals having almost the same relative intensities as that of the positive control are detected.
[0052] FIG. 5C shows chickens in which the human interferon .beta. gene is knocked-in at the ovalbumin gene locus. Photographs show chickens (female) which are offsprings of 411 and 412 in FIG. 5B. A PCR analysis of genomes derived from blood of the offsprings demonstrates that an IFN donor construct is knocked-in at the ovalbumin gene locus. A genome derived from blood of a wild-type (WT: a negative control) chicken, the genomes derived from blood of the knock-in chicken offsprings (KI), and a genome derived from a knock-in cell (KI PGC: a positive control) are amplified using primer sets of SEQ ID NO: 18 and 19 (a knock-in 3' region), SEQ ID NO: 15 and 14 (a knock-in 5' region), and SEQ ID NO: 15 and 22 (endogenous ovalbumin). Detected bands in predicted sizes are indicated by "*". In the offsprings of 411 and 412, signals having the same patterns as that of the positive control are detected, indicating that the IFN donor construct is knocked-in at the ovalbumin gene locus in the chicken offsprings.
[0053] FIG. 6A shows target sequences (2 locations, the target sequences of OVATg1 and OVATg2) of the chicken ovalbumin gene. sgRNA recognition sites are indicated in capitals and PAM sequences are indicated by adjacent underlines.
[0054] FIG. 6B shows an efficiency of gene knock-in at the ovalbumin gene locus of the chicken primordial germ cell. The knock-in efficiency is the same between a transfection group 1 and a transfection group 2. Since the transfection group 2 has more cells, a transfection method of the transfection group 2 is more preferable. The knock-in efficiency seems to be higher in a transfection group 3 than that in the transfection group 2, thus a transfection method of the transfection group 3 is more preferable.
[0055] FIG. 7 shows knock-in of a human immunoglobulin gene at the ovalbumin gene locus and demonstration of the knock-in by genome PCR. Primer 1 (P1) to primer 8 (P8) correspond to the following sequences. P1: SEQ ID NO: 15, P2: SEQ ID NO: 17, P3: SEQ ID NO: 29, P4: SEQ ID NO: 28, P5: SEQ ID NO: 18, P6: SEQ ID NO: 20, P7: SEQ ID NO: 21, P8: SEQ ID NO: 19. As a result of nested PCR, amplification products in predicted sizes are detected only in the genome derived from the knock-in primordial germ cells (PGCs) (indicated by arrows in an image).
[0056] FIG. 8 shows knock-in of a human collagen gene at the ovalbumin gene locus and demonstration of the knock-in by genome PCR. Primer 1 (P1) to primer 8 (P8) correspond to the following sequences. P1: SEQ ID NO: 15, P2: SEQ ID NO: 17, P3: SEQ ID NO: 29, P4: SEQ ID NO: 28, P5: SEQ ID NO: 18, P6: SEQ ID NO: 20, P7: SEQ ID NO: 21, P8: SEQ ID NO: 31. As a result of nested PCR, amplification products in predicted sizes are detected only in the genome derived from the knock-in primordial germ cells (PGCs) (indicated by arrows in an image).
[0057] FIG. 9 shows an image of an egg produced from an interferon .beta. knock-in female chicken. It is found that a thick albumen surrounding an egg yolk is cloudy.
[0058] FIG. 10 shows an image of western blotting of albumen components using an anti-human interferon .beta. antibody. Recombinant human interferon .beta. (indicated by an arrow) is expressed in an egg derived from a knock-in chicken. A thick albumen contains more human interferon .beta. proteins than a thin albumen per unit volume. It is found that the thick albumen contains 100 times more human interferon .beta. proteins than the thin albumen in terms of a relative concentration.
[0059] FIG. 11 shows a distribution of human interferon .beta. in the albumen produced by the human interferon .beta. knock-in chicken. In eggs derived from multiple chickens (KI egg 1 and 2), the thick albumen contains more interferon .beta. proteins than the thin albumen (indicated by boxes). A concentration of recombinant human interferon .beta. in the thick albumen is one tenth of the ovalbumin proteins present in a concentration of about 50 mg/ml (indicated by black arrows), thus the concentration of recombinant human interferon .beta. is estimated to be about 5 mg/ml.
[0060] FIG. 12 shows how stably the interferon .beta. protein is expressed in an egg produced from the interferon .beta. knock-in chicken. Eggs were collected for a week (d1 to d7) and human interferon .beta. contained in the thick albumen was identified by CBB staining. It is found that interferon .beta. is stably expressed during the test period.
[0061] FIG. 13 shows an image of white precipitates obtained after centrifugation of the thick albumen. The thick albumen of the egg produced from the interferon .beta. knock-in chicken is centrifuged without any treatment (1) or after various treatments (2 to 10) to compare amounts of white precipitate. The amounts of the white precipitates are reduced in 2 to 10 as compared to 1. In FIG. 13, a thick albumen liquid in an amount of 200 .mu.l is added in each tube and subjected to the following treatments. The thick albumen liquid is added and mixed by inversion with 4 times volume (800 .mu.l) of a 3M saturated arginine solution (tube 2), added with 4 times volume (800 .mu.l) of the 3M saturated arginine solution and subjected to ultrasonic crushing (tube 3), added and mixed by inversion with a small amount of arginine (20 mg) and filled up with PBS to 1 ml (tube 4), added and mixed by inversion with a small amount of arginine hydrochloride (20 mg) and filled up with PBS to 1 ml (tube 5), filled up with PBS to 1 ml and subjected to the ultrasonic crushing (tube 6), added and mixed by inversion with arginine hydrochloride in a saturating amount or more (200 mg) (tube 7), added with twice volume (400 .mu.l) of the 3M saturated arginine solution and subjected to the ultrasonic crushing (tube 8), added with a small amount of arginine hydrochloride (20 mg) and subjected to the ultrasonic crushing (tube 9), or added with a small amount of sodium chloride (40 mg) and subjected to the ultrasonic crushing (tube 10). Although the white precipitates were still observed after centrifugation at 20 k.times.g for 15 minutes, the amounts of the white precipitates were reduced by all of these treatments as compared to the case where no treatment was performed (tube 1). In particular, the amounts of the white precipitates were markedly reduced in the tubes 3, 6, 8, and 9, which were subjected to the ultrasonic crushing.
[0062] FIG. 14 shows an image of electrophoresis of supernatants of the thick albumen after various treatments. The thick albumen without the centrifugation treatment (lane 0) and the supernatants of the tubes 1 to 10 in FIG. 13 (lanes 1 to 10) were subjected to electrophoresis after adjusting their loading amounts to be equal on the basis of the original amounts of the thick albumen. Bands of human interferon .beta. are indicated by a black arrow.
[0063] FIG. 15 shows activity of human interferon .beta. contained in an egg derived from the human interferon .beta. knock-in chicken. The activity of human interferon .beta. is detected in all of a thick albumen rough purification product, a centrifugation supernatant of the thick albumen, and the thin albumen.
[0064] FIG. 16 shows genomes of ovomucoid heterozygous knock-out (in a G1 generation: 5-base deletion in ORF) male and female, and genomes of ovomucoid heterozygous and homozygous knock-out and wild-type chickens obtained from the heterozygous knock-out chickens in the next generation.
[0065] FIG. 17 shows an image of electrophoresis of the thick albumen obtained from different G1 individuals. The thick albumen derived from a wild-type chicken (ctrl) and the thick albumen derived from 5 human interferon .beta. knock-in chickens (#584, #766, #714, #645, and #640) are subjected to electrophoresis. Bands of human interferon .beta. are indicated by an arrow.
[0066] FIG. 18 shows a comparison of activity between human interferon .beta. contained in the egg derived from the human interferon .beta. knock-in chicken (lower stage) and commercially available recombinant human interferon .beta. (upper stage). In the image, culture supernatants of bioassay cells are added to a QUANTI-Blue substrate solution. The albumen supernatant and commercially available interferon .beta. are serially diluted by 5-fold and added to the bioassay cells. Judging from a color change of the substrate solution, the albumen supernatant contains interferon .beta. at a concentration of 625 or more times greater than a 0.01 .mu.g/.mu.l concentration of commercially available interferon .beta..
[0067] FIG. 19 shows the interferon .beta. proteins (left panel) in the eggs derived from the human interferon .beta. knock-in chickens in the G1 generation and G2 generation. Concentrations of human interferon .beta. in the thick albumen of the eggs derived from G1 and G2 (3 female chickens) are approximately equal to each other. Further, the egg derived from G2 is cloudy as is the case for the egg derived from G1.
[0068] FIG. 20 shows an image of western blotting of the albumen derived from a chicken in which a human antibody gene is knocked-in at the ovalbumin gene locus using an anti-human immunoglobulin antibody. A recombinant human antibody (indicated by an arrow with a sign of hIgG) is expressed in an egg derived from the knock-in chicken (hIgG KI) but not in the albumen derived from a wild type (ctrl) (left panel). Further, the recombinant human antibody has the same electrophoretic mobility as a commercially available human antibody (Herceptin) under a non-reduced condition, indicating that the recombinant human antibody can form an antibody complex (right panel). A dilution ratio of the albumen is indicated by 1/2 k (1/2000), 1/200, and 1/20. Judging from band, intensities, a concentration of the antibody complex is 1 mg/ml or more.
[0069] FIG. 21 shows an egg derived from an ovomucoid homozygous knock-out (OVM-/- in a G2 generation). The egg is not markedly different from a wild-type egg in appearance (left panel) and the albumen and egg yolk of the egg are coagulated by heating (right panel), thus the egg can be processed similarly to the wild-type egg for cooking or other purposes.
[0070] FIG. 22 shows an ovomucoid homozygous knock-out individual in a G3 generation obtained by incubating an egg derived from the ovomucoid homozygous knock-out (OVM-/- in a G2 generation) (upper panel). The G3 generation was obtained by mating an OVM-/- female and OVM-/- male. A chicken can be developed without an endogenous ovomucoid gene or an ovomucoid gene product in the egg. It is found that the ovomucoid gene has a homozygous 5 bp deletion by a fragment analysis (lower panel).
[0071] FIG. 23 shows an image of eggs derived from 4 lines of interferon .beta. knock-in female chickens. The thick albumen is cloudy in all 4 eggs as seen in FIG. 9.
DESCRIPTION OF EMBODIMENTS
[0072] In the present invention, a gene in a poultry primordial germ cell is modified by genome editing to obtain knock-in or knock-out female poultry that is derived from the genetically modified primordial germ cell, thereby obtaining a knock-in or knock-out poultry egg of the present invention from the knock-in or knock-out female.
[0073] In the present specification, the "knock-out poultry egg" includes eggs produced from female poultry having both heterozygous (+/-) and homozygous (-/-; null) genotypes of a knock-out gene. In a case where a knock-out gene is expressed in an oviduct and encodes an egg allergen accumulated in the albumen, an egg of the heterozygous knock-out poultry has a reduced amount of the egg allergen protein. On the other hand, an egg produced from the homozygous knock-out female poultry lacks the egg allergen protein.
[0074] In the present specification, the "knock-in poultry egg" includes eggs produced from female poultry having both heterozygous (+/-) genotype of, and fertilized eggs produced from poultry having homozygous (+/+) genotype of a knock-in gene. An egg produced from the female poultry having the homozygous (+/+) genotype of the knock-in gene contains more expression products of an exogenous gene than an egg produced from the female poultry having the heterozygous (+/-) genotype of the knock-in gene.
[0075] The genome editing is a technique for gene modification using a cleavage of double-stranded DNA and an error in repairing the cleavage and includes a nuclease capable of cleaving the target double-stranded DNA and a DNA recognition component that binds to or forms a complex with the nuclease. Examples of the genome editing technique include ZFN (zinc finger nuclease), TALEN, and CRISPR. For example, ZFN uses FokI (a nuclease) and a zinc finger motif (a DNA recognition component), TALEN uses FokI (a nuclease) and a TAL effector (a DNA recognition component), and CRISPR uses Cas9 (a nuclease) and a guide RNA (gRNA, a DNA recognition component). The nuclease used in the genome editing is only required to have nuclease activity, and a DNA polymerase, a recombinase, and the like may be used other than the nuclease.
[0076] Examples of the poultry include a chicken, a quail, a turkey, a duck, a goose, a long-tailed cock, a Japanese bantam, a pigeon, an ostrich, a green pheasant, a helmeted guineafowl, and the like. Preferable examples of the poultry include the chicken, the quail, and the like.
[0077] The primordial germ cell may be from male or female. The primordial germ cell of the poultry such as a chicken is a floating cell and cultured in the presence of a feeder cell, such as a BRL cell and STO cell. Alternatively, the primordial germ cell may be cultured in the absence of the feeder cell by adding an appropriate cytokine in a medium.
[0078] A gene modified by the genome editing is an oviduct-specific gene, and specific examples thereof include ovalbumin, ovomucoid, ovomucin, ovotransferrin, ovoinhibitor, lysozyme, and the like.
[0079] A gene function is eliminated by knock-out performed by the gene editing. In a case where at least one base is deleted or inserted in a gene by the gene editing, the gene function may be eliminated by a frame shift. The gene function may be eliminated without a frame shift by missing a part of amino acids. Further, the gene function may be eliminated by generating a stop codon by deletion or substitution.
[0080] In a case where an exogenous gene is knocked-in by the genome editing, the exogenous gene is preferably knocked-in at an oviduct-specific gene locus to obtain an egg containing an expression product of the exogenous gene instead of an expression product of the oviduct-specific gene. Examples of a protein as the expression product of the exogenous gene include various secreted proteins and peptides, and specific examples thereof include a functional peptide, such as an antibody (a monoclonal antibody) or a fragment thereof (e.g., scFv, Fab, Fab', F(ab')2, Fv, a single-chain antibody, scFv, dsFv, etc.), an enzyme, a hormone, a growth factor, a cytokine, an interferon, a collagen, an extracellular matrix molecule, and a vaccine, an agonistic protein, an antagonistic protein, and the like. In a case where the protein encoded by the exogenous gene is a biologically active protein that can be administered to human as a medicine, such a protein is derived from a mammal, preferably from human. Further, in a case where the protein encoded by the exogenous gene is an industrially applicable protein, such as a protein A and a protein constituting a spider thread, examples of the exogenous gene include a gene that encodes a protein derived from any organisms including a microorganism (bacteria, yeast, etc.), a plant, and an animal, or an artificial protein.
[0081] As the exogenous gene, a single gene or a plurality of genes may be used. In a case where a plurality of genes are used, the plurality of genes are expressed under control of the oviduct-specific gene. For example, the plurality of genes may be expressed by interposing a sequence such as IRES between the plurality of genes. Alternatively, the plurality of genes may be expressed by interposing a sequence encoding a 2A peptide or the like between the plurality of genes. In such a case, the plurality of genes are simultaneously expressed as a single peptide under control of an ovalbumin promoter and the peptide is cleaved to produce a plurality of proteins.
[0082] The exogenous protein may include an appropriate signal peptide. Codon usage of the exogenous protein may be changed to facilitate its expression in the poultry.
[0083] In a preferred embodiment of the knock-in poultry egg of the present invention, the expression product of the exogenous gene is dominantly expressed in the thick albumen. The term "dominant" herein refers to (a) a state in which an expression amount of the exogenous gene in the thick albumen is 50% or more, 60% or more, 65% or more, 70% or more, 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, or 98% or more by mass with respect to an expression amount of the exogenous gene in a whole knock-in egg or (b) a state in which an expression amount of the exogenous gene in the thick albumen is 1.1 times or more, preferably 2 times or more, more preferably 10 times or more of an expression amount of the exogenous gene in an egg other than the thick albumen in a relative concentration. The expression product of the knock-in exogenous gene is concentrated in the thick albumen and can thus be easily purified. Further, the expression product of the exogenous gene can be expressed in an active form. The thick albumen may become cloudy due to the expression product of the exogenous gene. However, a cloudy protein can be easily solubilized by an ultrasonic treatment, adding a solubilizing agent such as arginine hydrochloride, or the like.
[0084] In a preferred embodiment of the present invention, the expression product of the knock-in gene expressed in the thick albumen may be in a soluble form or in an insoluble form. The expression product of the knock-in gene in the insoluble form can be purified as an active protein. The expression product of the knock-in gene is preferably purified after solubilized. For purification, a conventional purification method, such as a column and dialysis, may be used. As the genome editing, a zinc finger, TALEN, and CRISPR can be mentioned. Of these, TALEN and CRISPR are preferable and CRISPR is more preferable. As a new genome editing method has been continuously developed, the present invention is not limited to the existing methods and any genome editing methods to be developed in the future may be used in the present invention.
[0085] When performing knock-in by the genome editing, it is preferable that a drug-resistant gene is stably integrated in the genome together with the useful exogenous gene to select a knock-in primordial germ cell by the drug-resistant gene. Examples of the drug-resistant gene include a neomycin-resistant gene (Neor), a hydromycin-resistant gene (Hygr), a puromycin-resistant gene (Puror), a blasticidin-resistant gene (blastr), a zeocin-resistant gene (Zeor), and the like. Of these, the neomycin-resistant gene (Neor) or the puromycin-resistant gene (Puror) is preferable.
[0086] When performing knock-in of the exogenous gene at the oviduct-specific gene locus, a translation starting site of the exogenous gene is preferably coincided with a translation starting site of the oviduct-specific gene. The exogenous gene may be introduced into the primordial germ cell as single- or double-stranded nucleic acids. The double-stranded nucleic acids may be introduced in a form of a plasmid vector, a BAC (bacterial artificial chromosome) vector, or the like. A gene sequence around the translation starting site of the oviduct-specific gene may be inserted immediately before the translation starting site of the exogenous gene to match the translation starting site of the exogenous gene with the translation starting site of the oviduct-specific gene by the knock-in.
[0087] In a case where the ovalbumin gene is selected as the oviduct-specific gene and the exogenous gene is introduced under control of the ovalbumin gene promoter, a 5' end of the exogenous gene is preferably inserted in a base sequence encoding ovalbumin in a region corresponding to a base sequence represented by SEQ ID NO: 1 (OVATg1) or in a region corresponding to a base sequence represented by SEQ ID NO: 24 (OVATg2). More preferably, the translation starting site of the exogenous gene is inserted in the translation starting site of the ovalbumin gene.
[0088] In a case where the exogenous gene is knocked-in at the oviduct-specific gene locus to obtain a knock-in chicken individual and its egg, it is desirable that the albumen of the egg is recovered to recover the exogenous gene product. More desirably, a portion including the thick albumen surrounding the egg yolk is recovered to efficiently recover the exogenous gene product.
[0089] In a case where a gene function is eliminated by knock-out using the genome editing, the above-mentioned drug-resistant gene is preferably introduced into a primordial germ cell during the gene transfection by the gene editing to perform a selection on the basis of the drug-resistant gene. For the introduction of the drug-resistant gene and drug-based selection, the drug-resistant gene may be stably or transiently introduced. The drug-resistant gene is preferably transiently introduced in the case of the knock-out. Examples of the drug-resistant gene include the ones described above. Of these, the puromycin-resistant gene (Puror) or the zeocin-resistant gene (Zeor) is preferable. The drug-resistant gene may be introduced independently from a zinc finger, TALEN, or CRISPR plasmid or integrated into these plasmids. The drug-resistant gene is preferably integrated into the plasmid used for the genome editing.
[0090] In the present invention, performing knock-out of the oviduct-specific gene can induce deletion, substitution, or insertion of a base or bases in a base sequence encoding the oviduct-specific gene to be knocked-out and thus causes a frame shift or a nonsense mutation in the oviduct-specific gene, whereby a protein expression can be eliminated. For example, performing the knock-out by using CRISPR can induce deletion, substitution, or insertion of a base or bases in a region near a 5' side or 3' side of a PAM sequence. The region near the 5' side or 3' side of the PAM sequence is within, for example, about 1 to 50 bases, preferably about 1 to 15 bases, from the PAM sequence.
[0091] One embodiment of the present invention can include the knock-out poultry egg in which ovalbumin or ovomucoid is knocked-out as the oviduct-specific gene, although the present invention is not limited thereto. In a preferable embodiment of the knock-out of ovalbumin as the oviduct-specific gene, deletion, substitution, or insertion of a base or bases can be induced in a region corresponding to a base sequence represented by SEQ ID NO: 1 (OVATg1) and its vicinity. Further, in a preferable embodiment of the knock-out of ovomucoid as the oviduct-specific gene, deletion, substitution, or insertion of a base or bases can be induced in a region corresponding to a base sequence represented by SEQ ID NO: 6 (OVMTg2) and its vicinity.
[0092] In this description, for example, the "region corresponding to a base sequence represented by SEQ ID NO: 1 (OVATg1)" includes a corresponding region in a homolog of the ovomucoid gene, and a person skilled in the art can recognize the region corresponding to the base sequence in the poultry of interest.
[0093] The ovomucoid protein before secretion contains 210 amino acids (210aa) (an initiation methionine is counted as the first amino acid) and has a signal peptide from 1 to 24th aa. Further, the PAM sequence of SEQ ID NO: 1 (OVATg1) corresponds to 38 and 39th aa. Thus, in one embodiment of the present invention, the ovomucoid gene knock-out poultry egg expresses an ovomucoid mutant protein that lacks at least 160th aa and aa thereafter, preferably 100th aa and aa thereafter, more preferably 38th aa and aa thereafter.
[0094] Further, in a preferable embodiment of the present invention, the ovomucoid gene knock-out poultry egg is substantially free from endogenous ovomucoid. Being substantially free from endogenous ovomucoid means that endogenous ovomucoid is eliminated in an egg produced from an ovomucoid knock-out female poultry in which the ovomucoid gene is homozygously knocked-out.
[0095] Genetically modified poultry can be produced by a conventional method from the genetically modified poultry primordial germ cell that is obtained by the gene modification method in one embodiment of the present invention. Further, a (knock-in and knock-out) egg can be obtained from the genetically modified poultry. Specific procedures will be described below.
[0096] The genetically modified primordial germ cell is transplanted in a blastoderm, blood stream, or gonadal region of a recipient early embryo. Several hundreds to several thousands of the cells are transplanted by microinjection into the blood stream around the time of starting a blood circulation, preferably on the second or third day after the start of egg incubation. Further, the endogenous primordial germ cells of the recipient may be inactivated or reduced in number in advance by a drug or ionizing radiation before performing transplantation. The egg incubation is continued for the transplanted embryo according to a conventional method to obtain a transplanted individual. The transplantation and egg incubation may be performed in an ex-ovo culture system in which an eggshell is changed or a windowing method in which an eggshell is not changed. The hatched individual can be raised under a normal condition to sexual maturity to obtain a living individual (a chimeric individual). The chimeric individual is mated with a wild-type or genetically modified individual, or the genetically modified chimeric individual to produce a poultry offspring having the genetic modification derived from the transplanted cell. The primordial germ cell of the present invention obtained by the genome editing has high proliferation ability and differentiates into a large number of sperms or eggs having high fertilizability in the chimeric individual. In order to increase an efficiency of this process, a mating test may be performed after examining a frequency of gene modification in genomes of gametes or evaluating a contribution ratio of the transferred cells, or the offspring may be selected by a feather color. The genetically modified homozygous poultry can be obtained by mating the female chimeric poultry in which the genetically modified female primordial germ cells are transplanted with the male chimeric poultry in which the genetically modified male primordial germ cells are transplanted. Further, the present invention is not limited to internal fertilization of the poultry. In a case where a technique such as differentiating a primordial germ cell into a germ cell in vitro is developed in the future, the genetically modified poultry can be produced by artificial insemination or intracytoplasmic sperm injection using such a technique.
[0097] In FIG. 2, FIG. 4A, FIG. 4B, and FIG. 16, the PAM sequence of OVMTg2 is "agg", however, NCBI databases include two kinds of sequences corresponding to chicken ovomucoid OVMTg2. Thus, the OVMTg2 sequence can be represented by TTTCCCAACGCTACAGACA(t or a)gg. The present invention includes all kinds of polymorphisms such as above.
[0098] In another embodiment of the present invention, the genome editing may be performed without culturing the primordial germ cell. In such a case, the endogenous primordial germ cell is genetically manipulated by infecting the early embryo with various viral vectors or injecting a plasmid vector as a liposome complex into a blood stream of the early embryo to establish a chimeric individual and a recombinant offspring. The primordial germ cell obtained by the genome editing includes the gene modification at a high frequency and has sufficiently high fertility to produce a recombinant poultry offspring or a genetically modified poultry offspring, and is thus useful in the present embodiment. In the present embodiment, the (endogenous) primordial germ cell can be genetically modified without culturing the primordial germ cell.
[0099] Examples of the viral vector used for gene manipulation by the genome editing include a retroviral vector, an adenoviral vector, an adeno-associated viral vector, a lentiviral vector, and the like. These viral vectors may be used for the genome editing both in the cultured primordial germ cell and the endogenous primordial germ cell.
[0100] For example, in a case where the endogenous primordial germ cell is modified by the genome editing, the genome editing in the primordial germ cell may be performed by constructing a viral vector expressing a nuclease that recognizes and cleaves any target sequences and an sgRNA using a genome editing viral vector commercially available from various companies, processing the viral vector into an infectious form by packaging, and administering a resulting material into a place where the primordial germ cell exits, such as a blastoderm, blood stream, or gonadal region of the poultry early embryo. In this manner, a genetically modified individual and a genetically modified product can be obtained in the following generation. The commercially available genome editing viral vectors are offered by a number of companies worldwide, and examples thereof include an "AAVpro (registered trademark) CRISPR/Cas9 Helper Free System (AAV2)" available from Takara Bio Inc., which uses the adeno-associated viral vector, a "Lentiviral CRISPR/Cas9 System" available from System Biosciences, LLC, which uses the lentiviral vector, and the like. In a case where the gene modification involves knock-in, the viral vector required for the gene editing may be used with, for example, a viral vector, plasmid, Bac vector, or single- or double-stranded DNA including a gene to be knocked-in.
[0101] Further, a genome editing plasmid and a donor construct, either without or in combination with the viral vector, may be prepared in a cell membrane permeable form, such as a liposome complex, and administered into a place where the primordial germ cell exits, such as a blastoderm, blood stream, or gonadal region of the poultry early embryo, to perform the genome editing in the primordial germ cell and obtain a genetically modified individual and a genetically modified product in the following generation.
[0102] In the knock-in poultry egg of the present invention obtained by the above method, an expression product of the exogenous gene is stably and highly expressed in the egg. Herein, "the expression product of the exogenous gene being stably and highly expressed in the egg" means that a protein encoded by the exogenous gene is expressed in an amount of about 1 mg or more per egg in each egg derived from different individuals. The expression amount of the exogenous protein is preferably about 10 mg or more, more preferably 100 mg or more, per egg. Further, for example, in a case where the exogenous gene is knocked-in at the chicken oviduct-specific gene locus, the exogenous gene product (protein) is expressed in the thick albumen of the egg produced from the knock-in female chicken in a concentration of 5 mg/ml. This concentration is much higher than that obtained by a conventional gene transfection method that does not rely on knock-in and thus causes random gene insertions. Since the exogenous gene is inserted in an identical location, variation in expression level is small between individuals and in the same individual. Further, because the present invention uses a technique to perform knock-in at a translation starting site of a gene that is actually expressing in a chicken individual, the gene expression is not reduced by an effect of gene silencing or the like in a G2 generation or later.
[0103] In the case where the exogenous gene is knocked-in at the oviduct-specific gene locus by the present method, the exogenous gene product (protein) in the egg produced from the knock-in female chicken is distributed to the thick albumen at a higher concentration than that distributed to the thin albumen. Thus, the exogenous gene product can be efficiently recovered by recovering a portion including the thick albumen.
[0104] An egg deprived of an albumen allergen protein can be obtained by raising an albumen allergen gene homozygous knock-out chicken generated by the present method and obtaining an egg thereof. Such an egg is expected to have low allergenicity. Non-Patent Literature 4 discloses an example of a heterozygous ovalbumin knock-out chicken. However, it is impossible to predict whether a homozygous knock-out chicken can be obtained or whether such a homozygous knock-out chicken can produce an egg in light of the overall common technical knowledge at that time or from the literature.
EXAMPLES
[0105] Hereinafter, the present invention will be described in detail by way of examples.
Production Example 1
Genome Editing Using Chicken Male Primordial Germ Cell
Production Example 1-1
Gene Constructions for Knock-In at Ovalbumin (OVA) Gene Locus and Knock-Out of Ovomucoid (OVM)
[0106] A CRISPR method was applied to a chicken male primordial germ cell line for targeting an ovalbumin and ovomucoid genes. As shown in FIG. 1 (ovalbumin) and FIG. 2 (ovomucoid), OVATg1 and OVMTg2 were tested respectively if they were suitable as targets.
[0107] A CRISPR plasmid for targeting the target sequence of the ovalbumin gene shown in FIG. 1 was constructed.
[0108] First, for targeting SEQ ID NO: 1 (OVATg1), oligo DNAs represented by SEQ ID NO: 2 and SEQ ID NO: 3 were synthesized, subjected to 5' phosphorylation by T4 polynucleotide kinase, and then annealed by heating a mixture of both oligo DNAs to 98.degree. C. and slowly cooling the mixture to the room temperature. This DNA fragment was inserted into a BbsI cleavage site of a plasmid px330-Puro.sup.r in which a puromycin-resistant gene unit represented by SEQ ID NO: 4 was inserted into a NotI site of a plasmid px330 (AddGENE, USA) to obtain px330-Puro.sup.r-OVATg1. Further, the puromycin-resistant gene unit in px330-Puro.sup.r-OVATg1 was replaced with a neomycin-resistant gene unit represented by SEQ ID NO: 5 to construct px330-Neo.sup.r-OVATg1.
[0109] A CRISPR plasmid for targeting the target sequence of the ovomucoid gene shown in FIG. 2 was constructed. For targeting SEQ ID NO: 6 (OVMTg2), oligo DNAs represented by SEQ ID NO: 7 and SEQ ID NO: 8 were synthesized, phosphorylated, annealed, and inserted into a BbsI cleavage site of the plasmid px330-Puro.sup.r to obtain px330-Puro.sup.r-OVMTg2.
Production Example 1-2
Knock-Out of Ovalbumin and Ovomucoid Genes
[0110] A chicken male primordial germ cell was collected from the blood stream of a male embryo of Barred Plymouth Rock to establish a cell line (the cell line was prepared according to Non-Patent Literature 3). The cell line was transiently transfected with the above genes (plasmids). After 1.times.10.sup.5 to 5.times.10.sup.5 male primordial germ line cells were rinsed with PBS and suspended into OPTI-MEM, 1.6 .mu.g of px330-Neo.sup.r-OVATg1 was transfected into the cells using 3 .mu.l of Lipofectamine 2000 (Life Technologies, USA). Specifically, Lipofectamine 2000 and the plasmid were mixed in the 80 .mu.l of OPTI-MEM, and the resulting mixture was added to the male primordial germ line cells. The primordial germ line cells mixed with the mixture were left still for about 5 minutes at the room temperature and then added with 500 .mu.l of medium containing no antibiotics. After being left still at 37.degree. C. for about 1 to 4 hours, the primordial germ line cells were seeded onto feeder cells. The cell culture was added with neomycin (G418 disulfate salt, Nacalai, Japan) at a final concentration of 0.5 mg/ml from day 2 to day 4 after gene transfection. Then, the primordial germ line cells were rinsed to remove neomycin and cultured for another 1 to 2 weeks. After the cultured cells were collected and their genomic DNA was extracted, a PCR method was performed to amplify a part of the ovalbumin gene with oligo DNA primers represented by SEQ ID NO: 9 and SEQ ID NO: 10. The amplified DNA was sub-cloned into a TA vector (pGEM-T Easy, Promega, USA) to analyze a genome base sequence of a region including SEQ ID NO: 1 (OVATg1). As shown in FIG. 3, mutations including a deletion or substitution of the start codon were confirmed.
[0111] Next, a similar analysis was performed by targeting the ovomucoid gene. As described above, 1.times.10.sup.5 to 5.times.10.sup.5 male primordial germ line cells were transfected with 1.6 .mu.g of px330-Puro.sup.r-OVMTg2 using Lipofectamine 2000. The cell culture was added with puromycin at a final concentration of 1 .mu.g/ml from day 2 to day 4 after gene transfection. The cells were rinsed to remove puromycin and cultured for another 1 to 2 weeks. After the cells were collected and their genomic DNA was extracted, a PCR method was performed to amplify a part of the ovomucoid gene with oligo DNA primers represented by SEQ ID NO: 11 and SEQ ID NO: 12. The amplified DNA was sub-cloned into a TA vector to analyze a genome base sequence of a region including SEQ ID NO: 6 (OVMTg2). Gene deletion was observed in 21 out of 23 clones analyzed (91%) in the region including SEQ ID NO: 6 (OVMTg2) of the ovomucoid gene. On the other hand, gene deletion was observed in 0 out of 24 clones (0%) in a control group not selected by a drug. An example of gene mutations found in the region including OVMTg2 are shown in FIG. 4A. These results show that a mutation efficiency can be markedly increased in the genome editing of a gene of the poultry primordial germ cell by introducing a drug-resistant gene and transiently performing a drug selection, in particular, a high mutation efficiency can be obtained by the puromycin-resistant gene and the drug selection using puromycin.
Production Example 1-3
Establishing Genome Edited Chicken
[0112] The px330-Puro.sup.r-OVMTg2 plasmid was transfected into Barred Plymouth Rock primordial germ cells in a manner described in Production example 1-2 and the drug selected cells were cultured. The cultured cells were transplanted into a blood stream of a 2.5-day-old White Leghorn embryo (a recipient embryo) by microinjection. Prior to transplantation, a fertilized egg was irradiated with ionizing radiation at 5 Gy or 6 Gy before incubation to reduce the number of endogenous primordial germ cells in the recipient embryo. The ionizing radiation was performed by gamma irradiation using a Gammacell 40 irradiator (Atomic Energy of Canada Ltd.).
[0113] After 2.5 days of incubation, a window having a diameter of about 2 cm was cut in the egg shell on a protruding side to expose an embryo. About 1,000 to 5,000 drug selected cells (suspended in 1 to 2 .mu.l PBS) were transplanted in the blood stream of the recipient embryo at a hamburger-hamilton stage of 13 to 15 using a glass micropipette. After the window was sealed by a cellophane tape, the egg was incubated and hatched at a temperature of 38.5.degree. C. and a humidity of 60 to 80% (a chimeric chick (G0)). Eight male chimeric chicks were raised to sexual maturity and their sperms were collected. After the genomic DNA was extracted from the sperms, a PCR method was performed to amplify a part of the ovomucoid gene with oligo DNA primers represented by SEQ ID NO: 11 and SEQ ID NO: 12. The amplified DNA was sub-cloned into a TA vector to analyze a genome base sequence of the region including SEQ ID NO: 6 (OVMTg2). Chimeric chickens #372 and #376 having a high mutation frequency (in both cases, the ovomucoid gene was mutated in 10 out of 11 clones after sub-cloning) were mated with wild-type Barred Plymouth Rock females to find 11 and 6 ovomucoid mutated chickens (chicks) out of 19 and 14 offsprings, respectively. One example of mutation in the ovomucoid gene is shown in an upper panel of FIG. 4B. This individual has a 5-base deletion immediately after a signal peptide of the ovomucoid protein, which causes a frame shift mutation in one allele of the ovomucoid gene. Further, representative examples of mutations (gene deletions) found around the target region of ovomucoid genome are shown in a lower panel of FIG. 4B. Several female and male individuals of ovomucoid heterozygous knock-out having frame shift mutations immediately after the signal peptide of the ovomucoid protein as represented above were obtained, allowing the production of a homozygous ovomucoid knock-out chicken by mating these individuals after sexual maturity.
Production Example 2
Human Interferon Gene Knock-In at Ovalbumin Gene Locus
Production Example 2-1
Knock-In for Establishing Primordial Germ Cell and Establishment of Knock-In Chimeric Chicken
[0114] In order to insert an exogenous gene (human interferon .beta.; IFN.beta.) at a translation starting site of the ovalbumin gene, a donor construct (an IFN.beta. donor construct) including a human interferon .beta. gene represented by SEQ ID NO: 13 was created. This donor construct contains an about 2.8 kb 5' side of a translation starting site of ovalbumin, the human interferon .beta. gene, a drug-resistant gene unit (PGK-puro.sup.r), and an about 3.0 kb 3' side of the translation starting site of ovalbumin. This donor construct was inserted in a plasmid pBlue ScriptII (SK+) (Stratagene, USA, current Agilent Technologies) to create a pBS-IFN.beta. donor. As with production example 1, 1.times.10.sup.5 to 5.times.10.sup.5 primordial germ line cells were simultaneously transfected with 0.8 .mu.g of px330-Neo.sup.r-OVATg1 and 0.8 .mu.g of the pBS-IFN.beta. donor using Lipofectamine 2000. The cell culture was added with puromycin at a final concentration of 1 .mu.g/ml on the third day after gene transfection. After the medium was replaced as needed, cells capable of growing in the presence of puromycin at a final concentration of 1 .mu.g/ml were recovered to prepare their genomic DNA. The genome PCR was performed to confirm that the donor construct was knocked-in at the ovalbumin gene locus. PCR in the 5' region was conducted as described below using a primer recognizing the exogenous gene on the donor construct and a primer recognizing a 5' region of ovalbumin not included in the donor construct. The PCR was conducted using an antisense primer recognizing interferon .beta., represented by SEQ ID NO: 14, and a sense primer recognizing a region of an about 3.0 kb 5' side of the translation starting site of ovalbumin, represented by SEQ ID NO: 15. Another round of PCR (nested PCR) was conducted with an amplification product using an antisense primer recognizing interferon .beta., represented by SEQ ID NO: 16, and a sense primer recognizing an about 2.85 kb 5' side of the translation starting site of ovalbumin not included in the donor construct, represented by SEQ ID NO: 17. As shown in FIG. 5A, when px330-Neo.sup.r-OVATg1 and the donor construct were transfected and genome derived from the drug selected primordial germ cells (knock-in PGCs) was used as a template, an amplification product was detected at a position of about 2.9 k that was expected when the donor construct was inserted. In contrast, when genome derived from the control primordial germ cells not subjected to gene transfection (control PGCs) was used as a template, such an amplification product was not detected.
[0115] Similarly, a 3' region was examined by the genome PCR using a primer recognizing the exogenous gene on the donor construct and a primer recognizing a 3' region of ovalbumin not included in the donor construct. The PCR was conducted using a sense primer recognizing the drug-resistant gene unit, represented by SEQ ID NO: 18, and an antisense primer recognizing a region of an about 3.4 kb 3' side of the translation starting site of ovalbumin, represented by SEQ ID NO: 19. Another round of PCR (nested PCR) was conducted with an amplification product using a sense primer recognizing the drug-resistant gene unit, represented by SEQ ID NO: 20, and a sense primer recognizing an about 3.2 kb 3' side of the translation starting site of ovalbumin not included in the donor construct, represented by SEQ ID NO: 21. As shown in FIG. 5A, when px330-Neo.sup.r-OVATg1 and the donor construct were transfected and genome derived from the drug selected primordial germ cells (knock-in PGCs) was used as a template, an amplification product was detected at a position of about 3.4 k that was expected when the donor construct was inserted. In contrast, when genome derived from the control primordial germ cells not subjected to gene transfection (control PGCs) was used as a template, such an amplification product was not detected. These results suggest that the drug selected cell group includes a cell in which the donor construct including the exogenous gene portion is knocked-in at the ovalbumin gene locus.
[0116] The primordial germ cells containing the cell in which the IFN.beta. donor construct was knocked-in were transplanted into recipient embryos by the same method as described in Production example 1-3 and the embryos were incubated to obtain 4 chimeric male chickens (#411 to #414). After semen was collected from each chicken to isolate genomic DNA, PCR was conducted using primers represented by SEQ ID NO: 18 and SEQ ID NO: 19 (for amplifying a 3' side of interferon knocked-in at the ovalbumin gene), primers represented by SEQ ID NO: 15 and SEQ ID NO: 14 (for amplifying a 5' side of interferon knocked-in at the ovalbumin gene), and primers represented by SEQ ID NO: 15 and SEQ ID NO: 22 (for amplifying ovalbumin without a knock-in event) (FIG. 5B). The chimeric chickens #411 and #412 show signals that clearly prove the interferon knock-in at the ovalbumin gene locus at both the 3' side and 5' side. In particular, signal intensities of #411 are comparable to that of the transplanted parental cell line, suggesting that the sperms contain the interferon knock-in cells to the same extent as the parental cell line.
[0117] The chimeric chickens #411 and #412 were mated with wild-type female chickens (Barred Plymouth Rock) to obtain 28 and 19 offsprings, respectively. After wing shafts were collected from the offsprings to isolate their genomic DNA, PCR was conducted as described above using primers represented by SEQ ID NO: 18 and SEQ ID NO: 19 (for amplifying the 3' side of interferon knocked-in at the ovalbumin gene), primers represented by SEQ ID NO: 15 and SEQ ID NO: 14 (for amplifying the 5' side of interferon knocked-in at the ovalbumin gene), and primers represented by SEQ ID NO: 15 and SEQ ID NO: 22 (for amplifying ovalbumin without a knock-in event). Further, the same PCR was conducted with genome derived from a wild-type wing shaft (a negative control (NC)) and genome derived from the transplanted interferon donor vector knock-in primordial germ cells (a positive control (PC)). Eight out of 28 offsprings derived from #411 and 5 out of 19 offsprings derived from #412 showed signals that clearly proved the interferon knock-in at the ovalbumin gene locus at both the 3' side and 5' side, as were seen in the positive control. FIG. 5C shows an image of electrophoresis of PCR products from the offspring (female) derived from #411 and the offspring (female) derived from #412. These results indicate that the interferon donor vector is knocked-in at the ovalbumin gene locus in these female chicken offsprings.
Production Example 2-2
Improvement of Knock-In Efficiency
[0118] Studies were conducted to improve a gene knock-in efficiency. First, the drug resistance unit of the interferon .beta. donor construct in the above production example 2-1 was changed from PGK-Puro.sup.r to SV40Pe-Neo.sup.r (SEQ ID NO: 23) to create an IFN.beta.-Neo donor construct <SEQ ID NO: 33>. This donor construct contains an about 2.8 kb 5' side of the translation starting site of ovalbumin, the human interferon .beta. gene, the drug resistant gene unit (SV40Pe-Neo.sup.r), and an about 3.0 kb 3' side of the translation starting site of ovalbumin. This donor construct was inserted in the plasmid pBlue ScriptII (SK+) to create a pBS-IFN.beta.-Neo donor. Further, as shown in FIG. 6A, a CRISPR plasmid for targeting the target sequence OVATg2 (SEQ ID NO: 24) of ovalbumin partially overlapping with OVATg1 was constructed. Oligo DNAs represented by SEQ ID NO: 25 and SEQ ID NO: 26 were synthesized and, as described in production example 1-1, they were phosphorylated, annealed, and inserted, as a DNA fragment, into the BbsI cleavage site of px330 to construct a plasmid, px330-Puro.sup.r-OVATg2, which also has the puromycin-resistant unit represented by SEQ ID NO: 4 inserted into a NotI site. After about 5.times.10.sup.5 primordial germ line cells were prepared and divided into 3 groups, as described in production example 2-1, the cells were simultaneously transfected with 0.8 .mu.g of px330-Neo.sup.r-OVATg1 and 0.8 .mu.g of the pBS-IFN donor (including the puromycin-resistant gene unit) (transfection group 1), or 0.8 .mu.g of px330-Puro.sup.r-OVATg1 and 0.8 .mu.g of the pBS-IFN.beta.-Neo donor (transfection group 2), or 0.8 .mu.g of px330-Puro.sup.r-OVATg2 and 0.8 .mu.g of the pBS-IFN.beta.-Neo donor (transfection group 3) using Lipofectamine 2000. The transfected group 1 was added with puromycin at a final concentration of 1 .mu.g/ml on the third day after gene transfection as was the case in production example 2-1. On the other hand, the transfection group 2 and transfection group 3 were cultured in the presence of puromycin at a final concentration of 1 .mu.g/ml from day 2 to day 4 after gene transfection as was the case in production example 1-2. After rinsed, the cells were added with neomycin at a final concentration of 0.5 mg/ml and cultured. The number of cells was counted in each transfection group on day 24 after gene transfection. As a result, the transfection group 1 had 2.times.10.sup.4 drug-resistant cells while the transfection groups 2 and 3 had 1.times.10.sup.5 drug-resistant cells. Further, the cells in each transfection group were recovered to prepare their genomic DNA. Then, as described in production example 2-1, PCR was conducted using primers represented by SEQ ID NO: 18 and SEQ ID NO: 19 (for amplifying a 3' side of interferon knocked-in at the ovalbumin gene), primers represented by SEQ ID NO: 15 and SEQ ID NO: 14 (for amplifying a 3' side of interferon knocked-in at the ovalbumin gene), and primers represented by SEQ ID NO: 15 and SEQ ID NO: 22 (for amplifying ovalbumin without a knock-in event). Note that a primer represented by SEQ ID NO: 71 was used instead of the primer represented by SEQ ID NO: 18 in the transfection groups 2 and 3 (FIG. 6B). There is no considerable difference between the transfection groups 1 and 2 in terms of a PCR signal intensity ratio, suggesting that preparation of desired cells can be quicker in the method of the transfection group 2, in which the cells are briefly selected by puromycin and then selected by neomycin. Further, as compared to the transfection group 2, the transfection group 3 rarely contains ovalbumin not having a knock-in event, suggesting a better knock-in efficiency in the transfection group 3. These results suggest that the exogenous gene can be quickly and highly efficiently knocked-in at the ovalbumin gene locus by transfecting the primordial germ cells with the CRISPR construct targeting the OVATg2 sequence, inserting the puromycin-resistant gene and the neomycin-resistant gene into the CRISPR construct and the donor construct, respectively, and temporarily selecting with puromycin, and then selecting with neomycin.
Production Example 3
Human Antibody Gene Knock-In at Ovalbumin Gene Locus
[0119] Human interferon .beta. in the interferon .beta. donor construct in the above production example 2-1 was replaced with a human immunoglobulin gene represented by SEQ ID NO: 27 to create a donor construct (an immunoglobulin donor construct). In this donor construct, genes encoding an albumen lysozyme signal peptide, a human immunoglobulin heavy chain, a cleavage target sequence of furin protein, a 2A self-processing peptide, an albumen lysozyme signal peptide, and a human immunoglobulin light chain gene are arranged in tandem at a downstream of an about 2.8 kb 5' side of the translation starting site of ovalbumin, which are followed by the drug-resistant gene unit (PGK-Puro.sup.r) and an about 3.0 kb 3' side of the translation starting site of ovalbumin. This donor construct is transcribed and translated to express an antibody protein composed of immunoglobulin heavy chains and light chains.
[0120] The immunoglobulin donor construct was inserted into the plasmid pBlue ScriptII (SK+) to create a pBS-immunoglobulin donor (a pBS-IgG(Hc+Lc) donor). After the donor construct was knocked-in into the male chicken primordial germ cells by the same method used for the pBS-IFN.beta. donor described above, the primordial germ cells were selected by puromycin and PCR was performed using genome of the selected cells as a template. The PCR in a 5' side was conducted using a primer represented by SEQ ID NO: 15 and an antisense primer recognizing the albumen lysozyme signal peptide, represented by SEQ ID NO: 28. Another round of PCR (nested PCR) was conducted with an amplification product using a primer represented by SEQ ID NO: 17 and an antisense primer recognizing the albumen lysozyme signal peptide, represented by SEQ ID NO: 29. The PCR in a 3' side was conducted in the same manner as for the knock-in of the EGFP donor and pBS-IFN.beta. donor described above. That is, the PCR was conducted using the primers represented by SEQ ID NO: 18 and SEQ ID NO: 19 and nested PCR was conducted with an amplification product using the primers represented by SEQ ID NO: 20 and SEQ ID NO: 21. As shown in FIG. 7, knock-in at the ovalbumin gene in the primordial germ cells was also observed using the immunoglobulin donor.
[0121] Further, similarly to production example 2-2, the drug resistance unit of the immunoglobulin donor construct was changed from PGK-Puro.sup.r to SV40Pe-Neo.sup.r (SEQ ID NO: 23) to create an immunoglobulin-Neo donor construct (SEQ ID NO: 30). In this donor construct, genes encoding an albumen lysozyme signal peptide, a human immunoglobulin heavy chain, a cleavage target sequence of furin protein, a 2A self-processing peptide, an albumen lysozyme signal peptide, and a human immunoglobulin light chain gene are arranged in tandem at a downstream of an about 2.8 kb 5' side of the translation starting site of ovalbumin, which are followed by the drug-resistant gene unit (SV40Pe-Neo.sup.r) and an about 3.0 kb 3' side of the translation starting site of ovalbumin. This donor construct was inserted into the plasmid pBlue ScriptII (SK+) to create a pBS-immunoglobulin-Neo donor. With the same method used in production example 1-2, about 2.times.10.sup.5 primordial germ line cells were transfected with 0.8 .mu.g of px330-Puro.sup.r-OVATg2 and 0.8 .mu.g of the pBS-immunoglobulin-Neo donor using 3 .mu.l of Lipofectamine 2000. The cells were cultured in the presence of puromycin at a final concentration of 1 .mu.g/ml from day 2 to day 4 after gene transfection. After rinsed, the cells were added with neomycin at a final concentration of 0.5 mg/ml and cultured. A cell group containing immunoglobulin knock-in cells was obtained after about 3 weeks of culturing. Using the same method as described in production example 1-3, the cell group was transplanted into a recipient embryo and the embryo was incubated to establish an immunoglobulin knock-in germline chimeric chicken. A chicken in which the human immunoglobulin gene is knocked-in at the ovalbumin gene locus is obtained in the following generation and such a chicken expresses an antibody protein composed of the human immunoglobulin heavy chains and light chains in the albumen.
Production Example 4
[0122] Human Collagen Gene Knock-In at Ovalbumin Gene Locus
[0123] Human interferon .beta. in the interferon .beta. donor construct in the above production example 2-1 was replaced with a human type I collagen gene represented by SEQ ID NO: 31 to create a donor construct (a collagen donor construct). In this donor construct, genes encoding an albumen lysozyme signal peptide, a human type I collagen .alpha.1 chain (COLLAGEN1A1), a cleavage target sequence of a furin protein, a 2A self-processing peptide, an albumen lysozyme signal peptide, and a human type I collagen .alpha.2 chain (COLLAGEN1A2) gene are arranged in tandem at a downstream of an about 2.8 kb 5' side of the translation starting site of ovalbumin, which are followed by the drug-resistant gene unit (PGK-Puro.sup.r) and an about 3.0 kb 3' side of the translation starting site of ovalbumin. This donor construct is transcribed and translated to express a type I collagen protein composed of the human type I collagen .alpha.1 chains and .alpha.2 chain.
[0124] The collagen donor construct was inserted into the plasmid pBlue ScriptII (SK+) to create a pBS-COL1(A1+A2) donor. The pBS-COL1(A1+A2) donor was knocked-in into the male chicken primordial germ cells by the same method used for the ppBS-IFN.beta. donor and pBS-IgG(Hc+Lc) donor described above. After knock-in, the primordial germ cells were selected by puromycin and PCR was conducted using genome of the selected cells as a template. The PCR in a 5' side was conducted in the same manner as for the pBS-IgG(Hc+Lc) donor. That is, the PCR was conducted using the primer represented by SEQ ID NO: 15 and the antisense primer recognizing the albumen lysozyme signal peptide, represented by SEQ ID NO: 28. Then, another round of PCR (nested PCR) was conducted with an amplification product using the primer represented by SEQ ID NO: 17 and the antisense primer recognizing the albumen lysozyme signal peptide, represented by SEQ ID NO: 29. The PCR in a 3' side was conducted in the same manner as for the knock-in of the pBS-IFN.beta. donor and pBS-IgG(Hc+Lc) donor described above. That is, the PCR was conducted using the primers represented by SEQ ID NO: 18 and SEQ ID NO: 19 and then nested PCR was conducted with an amplification product using the primers represented by SEQ ID NO: 20 and SEQ ID NO: 21. As shown in FIG. 8, knock-in at the ovalbumin gene in the primordial germ cells was also observed using the collagen donor.
[0125] Further, similarly to production example 2-2, the drug resistance unit of the collagen donor construct was changed from PGK-Puro.sup.r to SV40Pe-Neo.sup.r (SEQ ID NO: 23) to create a collagen-Neo donor construct (SEQ ID NO: 32). In this donor construct, genes encoding an albumen lysozyme signal peptide, the human type I collagen .alpha.1 chain (COLLAGEN1A1), the cleavage target sequence of furin protein, the 2A self-processing peptide, the albumen lysozyme signal peptide, and the human type I collagen .alpha.2 chain (COLLAGEN1A2) gene are arranged in tandem at a downstream of an about 2.8 kb 5' side of the translation starting site of ovalbumin, which are followed by the drug-resistant gene unit (SV40Pe-Neo.sup.r) and an about 3.0 kb 3' side of the translation starting site of ovalbumin. This donor construct was inserted into the plasmid pBlue ScriptII (SK+) to create a pBS-collagen-Neo donor. Using the same method as described in production example 1-2, about 2.times.10.sup.5 primordial germ line cells were transfected with 0.8 .mu.g of px330-Puro.sup.r-OVATg2 and 0.8 .mu.g of the pBS-collagen-Neo donor using 3 .mu.l of Lipofectamine 2000. The cells were cultured in the presence of puromycin at a final concentration of 1 .mu.g/ml from day 2 to day 4 after gene transfection. After rinsed, the cells were added with neomycin at a final concentration of 0.5 mg/ml and cultured. A cell group containing collagen knock-in cells was obtained after about 3 weeks of culturing. Using the same method as described in production example 1-3, the cell group was transplanted into a recipient embryo and the embryo was incubated to establish an immunoglobulin knock-in germline chimeric chicken. A chicken in which the human immunoglobulin gene is knocked-in at the ovalbumin gene locus is obtained in the following generation and such a chicken expresses a protein complex composed of the human type I collagen .alpha.1 and .alpha.2 in the albumen.
Example 1
[0126] As described in above production example 2-1, the female and male chickens in which the human interferon .beta. donor vector was knocked-in at the translation starting site of the ovalbumin gene locus were established.
(1) Characteristics of Knock-In Chicken and Knock-In Egg
[0127] The established knock-in chickens reached sexual maturity without showing a developmental abnormality or significant disease condition. The female knock-in chicken laid eggs. A content of the egg was examined by opening an eggshell. As a result, a cloudy thick albumen was found around an egg yolk. On the other hand, similar to a wild-type egg, a thin albumen having a low viscosity was observed. A typical image of opened egg is shown in FIG. 9.
(2) Identification of Interferon and Possible Enrichment in Thick Albumen
[0128] Next, the presence of human interferon .beta. in the albumen was examined. The thick albumen and the thin albumen were recovered by a dropper and added with an equal volume of a sample buffer (0.125M Tris pH6.8, 10% 2-ME, 4% SDS, 10% glycerol, 0.1% BPB). These samples were serially diluted 10 folds 3 times, separated by electrophoresis using a 5-20% acrylamide gel, and transferred to a PVDF membrane. After the membrane was blocked from a non-specific binding of an antibody molecule by skim milk, the membrane was subjected to western blotting using an anti-human interferon .beta. antibody (abcam ab85803, a rabbit polyclonal antibody) diluted 1,000 times as a primary antibody and an anti-rabbit HRP conjugated antibody (GE Healthcare NA934V) diluted 1,000 times as a secondary antibody. A result is shown in FIG. 10.
[0129] The antibodies detected bands of about 30 kDa at the same position as that of purified recombinant human interferon .beta. (WAKO rhIFN-.beta.). Further, this band was not detected in a wild-type egg at all. In FIG. 10, numbers of 1, 1/10, and 1/100 in each lane indicate relative amounts of samples subjected to electrophoresis and the lanes having the same number include the same amount of albumen liquid. Interestingly, only a small amount of interferon was identified in the thin albumen.
[0130] These results suggest the possibility that a relatively large amount of recombinant proteins expressed by gene knock-in are accumulated in the thick albumen.
(3) Large Amount of Interferon Proteins Detected in Thick Albumen (Estimated to be about 5 mg/ml)
[0131] Next, the thin albumen and thick albumen were collected from a wild-type egg (NC: negative control) and eggs (KI egg 1 and 2) derived from 2 human interferon .beta. knock-in chickens. After each sample was diluted twice, an equal amount of sample was subjected to electrophoresis using a 5-20% acrylamide gel to visualize proteins contained in the albumen by Coomassie Brilliant Blue staining (CBB Stain One, Nacalai). A result is shown in FIG. 11. Similar to the previous western blotting result, clear bands are detected at a position of about 30 kDa in 2 knock-in eggs but not in the wild-type egg, indicating that these bands are human interferon .beta.. Further, similar to the western blotting result, these human interferon .beta. bands are hardly detected in the thin albumen subjected to electrophoresis. A comparison of CBB stained band signals revealed that amounts of human interferon .beta. were significantly different between the thin albumen and the thick albumen although amounts of other albumen components such as ovotransferrin and ovalbumin were almost the same, demonstrating that human interferon .beta. expressed by knock-in at the ovalbumin gene locus was dominantly accumulated in the thick albumen.
[0132] Further, a concentration of human interferon in the albumen can be estimated by analyzing the CBB staining image. An intensity of blue color caused by CBB staining is substantially proportional to an amount of proteins. A signal concentration of human interferon .beta. having a relative amount of 1 is compared to that of ovalbumin having a relative amount of 1/10 through a quantification analysis of NIH image to obtain a ratio between them of 1.01:1. A concentration of ovalbumin that accounts for nearly half of albumen proteins is about 50 mg/ml, thus a concentration of human interferon .beta. is estimated to be about 5 mg/ml.
[0133] The present method achieves the expression of the exogenous gene at a very high concentration of 5 mg/ml. Further, the exogenous gene is inserted in an identical location, thus variation in an expression level is small between individuals and in the same individual. Further, the present method uses a technique to perform knock-in at a translation starting site of a gene that is actually expressing in a chicken individual, thus gene expression is not reduced by an effect of gene silencing or the like in a G2 generation or later. FIG. 12 compares amounts of interferon in the albumen of eggs that have been collected 3 times (day1, day4 and day7) over a week. The amounts of interferon in 3 eggs are compared through a quantification analysis of NIH image to obtain a ratio between them of 1:0.92:0.96, thus there is almost no variation in the concentration of interferon in the thick albumen. Further, these knock-in eggs were kept at 18.degree. C. after collection and cracked open at the same time. This shows that the exogenous protein interferon can stably exist in the albumen over a week without undergoing significant degradation or the like.
(4) Expression of Interferon in Eggs Derived from Different Individuals (Consistency of Interferon Expression)
[0134] Eggs were obtained from 4 interferon knock-in females 3 months after they laid the first eggs and then cracked open (FIG. 23). Every egg had cloudy thick albumen. Further, eggs were obtained from 5 chickens (#584, #766, #714, #645, and #640) and the thick albumen of each egg was subjected to electrophoresis and CBB staining in the same manner as described in above (3). Interferon bands were detected in all eggs (FIG. 17). The concentrations of interferon of #584, #766, #714, #645, and #640 are compared through a quantification analysis of NIH image to obtain a ratio between them of 1.0:1.0:0.94:0.91:0.89. In this ratio, a difference between the maximum concentration and minimum concentration is within 11%, demonstrating a very stable expression as compared to a variation in secretion concentrations observed between individuals (5 .mu.g/ml to 100 .mu.g/ml) in Non-Patent Literature 1. Having little individual difference is advantageous for obtaining a large amount of recombinant proteins using a plurality of recombinant chickens.
Example 2
[0135] (1) Attempt of Efficiently Extracting Interferon from Thick Albumen (Applicable Solubilization Treatment)
[0136] It is found that interferon .beta. is in the thick albumen at a high concentration. This interferon .beta. is preferably extracted and purified from the albumen for a general use. A purification technique includes various column treatments on the basis of molecular weight and chemical properties, however proteins are preferably solubilized in an aqueous solution before being subjected to the column treatment. An aqueous solution and an insoluble matter can be separated by a centrifugal operation. Thus, studies were conducted to examine whether interferon .beta. in the thick albumen was collected in an aqueous solution or included in an insoluble matter.
[0137] The thick albumen in an amount of 200 .mu.l was collected and subjected to centrifugation at 20,000.times.g for 15 minutes to separate a white precipitate fraction from a liquid fraction (a tube 1 in FIG. 13). Electrophoresis using an acrylamide gel shows that the liquid fraction contains human interferon .beta. (lane 1 in FIG. 14), however an amount of human interferon .beta. is clearly less than that included in an equivalent amount of the thick albumen before separation (lane 0 in FIG. 14). Thus, it was speculated that a majority of interferon .beta. was included in the white precipitate caused by centrifugation. In order to solve this, several attempts have been made to reduce the white precipitate and increase a yield of interferon. In FIG. 13, a thick albumen liquid in an amount of 200 .mu.l was added in each tube and subjected to the following treatments. The thick albumen liquid is added and mixed by inversion with a 4 times volume (800 .mu.l) of a 3M saturated arginine solution (tube 2), added with a 4 times volume (800 .mu.l) of the 3M saturated arginine solution and subjected to ultrasonic crushing (tube 3), added and mixed by inversion with a small amount of arginine (20 mg) and filled up with PBS to 1 ml (tube 4), added and mixed by inversion with a small amount of arginine hydrochloride (20 mg) and filled up with PBS to 1 ml (tube 5), filled up with PBS to 1 ml and subjected to the ultrasonic crushing (tube 6), added and mixed by inversion with arginine hydrochloride in a saturating amount or more (200 mg) (tube 7), added with a twice volume (400 .mu.l) of the 3M saturated arginine solution and subjected to the ultrasonic crushing (tube 8), added with a small amount of arginine hydrochloride (20 mg) and subjected to the ultrasonic crushing (tube 9), or added with a small amount of sodium chloride (40 mg) and subjected to the ultrasonic crushing (tube 10). Although the white precipitates were still observed after centrifugation at 20,000.times.g for 15 minutes, the amounts of the white precipitates were reduced by all of these treatments as compared to that without a treatment (tube 1). In particular, the amounts of the white precipitates were markedly reduced in the tubes 3, 6, 8, and 9, which were subjected to the ultrasonic crushing.
[0138] After supernatants were recovered, samples were prepared by adjusting their loading amounts to be equal on the basis of the original amounts of the thick albumen and subjected to electrophoresis using an acrylamide gel (FIG. 14). The lane number in FIG. 14 corresponds to the tube number in FIG. 13 except lane 0, in which the thick albumen in an equivalent amount was applied to electrophoresis without separation. Although there were some differences between lanes, lane 2 to lane 10 contained more interferon .beta. than lane 1 of the non-treatment sample. In particular, the sample prepared by adding a 4 times volume of the 3M saturated arginine solution and performing the ultrasonic crushing (tube 3) contained a significant amount of interferon .beta.. These results show that an amount of interferon .beta. extracted in an aqueous solution from the thick albumen increases by a physical treatment such as the ultrasonic crushing and a chemical treatment such as adding arginine or an arginine buffer (more restrictively, a solubilization treatment of insoluble protein). The sample in tube 3 prepared by adding a 4 times volume of the 3M saturated arginine solution and performing the ultrasonic crushing was subjected to centrifugation at 20 k.times.g for 15 minutes to recover a supernatant. The supernatant was subjected to dialysis in PBS for 24 hours (hereinafter referred to as a thick albumen rough purification product). The thick albumen rough purification product is transparent without any precipitates, suggesting that some of the white precipitates were solubilized and transferred to the supernatant by a series of treatments.
(2) Activity of Interferon Produced in Chicken Egg.
[0139] Activity of interferon in the thin albumen, thick albumen, and thick albumen rough purification product was examined by a bioassay. HEK-blue IFN-.alpha./.beta. (Invivogen) is a cultured cell that secretes alkaline phosphatase by human interferon .beta. added in a medium. Activity of human interferon .beta. can be detected by adding a medium after reaction to an alkaline phosphatase substrate solution (Quanti-Blue; Invivogen) and examining a change in the substrate solution (a color change from red to blue for Quanti-Blue). The thin albumen, the thick albumen (the supernatant after centrifugation at 20 k.times.g for 15 minutes), and the thick albumen rough purification product (derived from tube 3) derived from the human interferon knock-in eggs were added to the culture media of HEK-blue IFN-.alpha./.beta.. Further, the thin albumen derived form a wild-type chicken egg and PBS were added to the media as a negative control and recombinant human interferon was added to the media as a positive control. The cells were cultured for 20 hours and supernatants of the culture media were added to the Quanti-Blue substrate solution to perform reactions at 37.degree. C. for 1 hour. A result is shown in FIG. 15.
[0140] The human interferon .beta. activity is detected in any of the thin albumen, the thick albumen centrifugation supernatant, and the thick albumen rough purification product derived from the interferon knock-in (IFN-KI) eggs, demonstrating that an unpurified knock-in egg product exhibits the interferon activity and such an activity remains after the solubilization treatment by the ultrasonic crushing or the arginine buffer. Thus, interferon derived from the chicken egg can be used as it is in the egg without a processing, after a simple processing such as a centrifugation fractionation, or after a processing such as solubilization and purification.
(3) Activity Quantification of Interferon Produced in Chicken Egg
[0141] Activity of interferon in the thick albumen was measured by a bioassay. As described in (2), the thick albumen (a supernatant obtained by the ultrasonic crushing followed by centrifugation at 20,000.times.g for 15 minutes) derived from a human interferon knock-in egg (collected 3 months after the first egg) was serially diluted 5 folds and 10 .mu.l of each dilution was added to the HEK-blue IFN-.alpha./.beta. culture medium. As a comparison, commercially available recombinant human interferon .beta. (Wako Pure Chemical Industries, Ltd.) in a concentration of 10 .mu./ml was serially diluted 5 folds in the same manner and 10 .mu.l of each dilution was added to the culture medium. The cells were cultured for 20 hours and supernatants of the culture media were added to the Quanti-Blue substrate solution to perform reactions at 37.degree. C. for 1 hour. A result is shown in FIG. 18. In an upper series of reactions using supernatants of the culture media to which commercially available human interferon was added, a fourth well from the left has a color in which red and blue is mixed, whereas, in a lower series of reactions using supernatants of the media to which the thick albumen was added, an eighth well from the left has a color in which red and blue is mixed (in both dilution series, concentration decreases from left to right). Thus, the interferon activity in the thick albumen is 625 or more times higher than that of the 10 .mu./ml commercially available interferon and a concentration of interferon in the thick albumen is estimated to be 6.25 mg/ml or more. About 16 ml of the thick albumen was recovered at this point, meaning that human interferon having activity equivalent to about 100 mg of the commercially available interferon could be obtained from one egg. As for price, 20 .mu.g of the recombinant human interferon .beta. available from Wako Pure Chemical Industries, Ltd. costs 39,000 Japanese Yen, thus 100 mg of interferon is worthy of 195,000,000 Japanese Yen (about 200 million Japanese Yen). Using the present method allows the production of a very large amount of human recombinant proteins in terms of activity.
(4) Analysis of Egg of Knock-In Chicken in G2 Generation
[0142] A G1 knock-in chicken (male) was mated with a wild-type female to establish G2 knock-in chickens (male and female). The G2 knock-in chickens were raised to sexual maturity to obtain eggs and the eggs were cracked open (right in FIG. 19). All obtained eggs had cloudy thick albumen as was the case for the egg derived from G1. Further, eggs were obtained from 3 G2 knock-in chickens and their thick albumen was subjected to electrophoresis. The thick albumen of the G1 derived egg was also subjected to electrophoresis for comparison. In a CBB staining image, interferon signals were detected in the G2 derived eggs as in the G1 derived egg. These results showed that the exogenous gene knocked-in at the oviduct gene could be stably expressed in the chicken egg over generations. This observation can guarantee a large-scale and long-term stable operation of the production of recombinant proteins using a knock-in chicken.
Example 3
[0143] An egg was obtained from the chicken in which the human antibody gene was knocked-in at the human ovalbumin gene locus in a manner as described in production example 3. The presence of a human antibody protein in the albumen was examined. The albumen was added with an equal volume of a sample buffer (0.125M Tris pH6.8, 4% SDS, 10% glycerol, 0.1% BPB, note that 2-ME is not included). These samples were serially diluted 10 folds 3 times, separated by electrophoresis using a 5-20% acrylamide gel, and transferred to a PVDF membrane. After the membrane was blocked from a non-specific binding of an antibody molecule by skim milk, the membrane was subjected to western blotting using an anti-human immunoglobulin antibody (Jackson Immuno Research, Anti-Human IgG F(ab)) diluted 1,000 times as a primary antibody and an anti-rabbit HRP conjugated antibody (Jackson Immuno Research, Peroxidase-conjugated AffiniPure Goat Anti-Rabbit IgG (H+L)) diluted 1,000 times as a secondary antibody. A result is shown in FIG. 20.
[0144] The antibodies detected bands of about 200 kDa at the same position as that of a purified recombinant human antibody (trade name: Herceptin, Roche Ltd.) (right in FIG. 20). Further, this band was not detected in a wild-type egg at all (left in FIG. 20). These results suggest that a human antibody complex maintains a normal subunit structure in the chicken egg. In FIG. 20, numbers of 1/20, 1/200, and 1/2 k (=1/2000) in each lane indicate relative amounts of the samples subjected to electrophoresis when the undiluted albumen is taken as 1. A concentration of the antibody complex is estimated to be 1 mg/ml or more by comparison with a loaded amount of Herceptin of a known concentration (right in FIG. 20).
Example 4
(1) Ovomucoid Homozygous Gene Knock-Out Chicken
[0145] Ovomucoid is an albumen protein, however an expression dynamic of ovomucoid in an early developmental process or a function of ovomucoid in development has not been studied. An effect of loss of function of ovomucoid is completely unknown. As shown in production example 1-3, the inventors created the heterozygous ovomucoid knock-out chickens (male and female) using the genome editing technique. Further, the inventors established the chicken having a complete deletion of ovomucoid (the homozygous ovomucoid knock-out chicken) by mating the heterozygous chickens. As shown in FIG. 16, the male and female heterozygous knock-out chickens having the same 5-base deletion in an exon 3 encoding the ovomucoid protein were mated with each other after sexual maturity to obtain offsprings. As shown in FIG. 16, the offsprings included a homozygous knock-out chicken in addition to a wild-type and heterozygous knock-out chickens. Further, both male and female ovomucoid homozygous knock-out chickens were obtained and they have been growing healthily like a wild-type chicken without any morphological abnormalities. These results showed, for the first time, that loss of function of ovomucoid did not cause any effect on an initial development such as lethality or morphological abnormalities.
(2) Usefulness of Ovomucoid Knock-Out Chicken Egg
[0146] The homozygous ovomucoid knock-out chicken does not secrete ovomucoid and thus produces an egg having no ovomucoid. Ovomucoid is a very strong allergen substance and it is known that allergenicity of ovomucoid is not lost by heating or enzymatic degradation. Needless to say, an ovomucoid deficient egg does not exhibit strong allergenicity caused by ovomucoid, thus it is clearly understood that the ovomucoid deficient egg is useful, as a low allergenic egg, for significantly reducing allergenicity in all products that use an egg, such as a raw food, a processed food, a vaccine produced in an egg, and a cosmetic raw material.
(3) Characteristics of Ovomucoid Knock-Out Chicken Egg
[0147] The homozygous knock-out female can produce an egg as it produced an egg almost every day at least for 6 months in a manner similar to that of a wild type. An image of a cracked open egg is shown in a left panel of FIG. 21. The knock-out egg is not visually different from a wild-type egg. Further, the knock-out egg is not markedly different from a normal chicken egg in processability as, for example, it is coagulated by heating (right in FIG. 21).
[0148] Further, a chicken individual can be produced by mating the homozygous knock-out chickens with each other. An egg, which was obtained by mating 5 bp-deletion homozygous male and female individuals with each other, was incubated to obtain an ovomucoid 5 bp-deletion homozygous individual in the G3 generation (FIG. 23). This result shows that a chicken can develop without the ovomucoid gene or the ovomucoid protein, that is, ovomucoid is not essential for the development of chicken.
INDUSTRIAL APPLICABILITY
[0149] Further Improvement in Expression Level of Knock-In Gene
[0150] Interferon .beta. obtained in the present example had a very high concentration of 5 mg/ml, however a concentration of a recombinant protein in an egg can be further increased.
1. Homozygousing Knock-In Gene
[0151] The analyzed chicken egg was produced from the parental chickens having a genotype in which human interferon .beta. was inserted into one allele of the ovalbumin gene locus (heterozygous gene knock-in). It is possible to obtain an individual that expresses a recombinant protein at a higher concentration by mating heterozygous gene knock-in parents with each other or mating germ line chimeric individuals having knock-in primordial germ cells with each other to create an individual in which human interferon .beta. is inserted into both alleles (homozygous gene knock-in).
2. Improving Signal Peptide and Codon Usage
[0152] In the present example, human interferon .beta. cDNA was knocked-in at the translation starting site of the ovalbumin gene. In this case, a signal peptide in use was from human interferon .beta., and it was not optimized for a chicken oviduct cell. Examples of the signal peptide that allows secretion in the chicken oviduct at a high efficiency include a signal peptide of a protein that is actually secreted from the oviduct. Specific examples of such a signal peptide include "MRSLLILVLCFLPLAALG" of albumen lysozyme and "MKLILCTVLSLGIAAVCFA" of ovotransferrin. Further, the present invention is not limited thereto and an artificial or natural signal peptide that allows protein secretion in a chicken cell at a high efficiency may be used. Knock-in of a desired protein having these signal peptides at its N terminus at the ovalbumin gene locus allows the production of the desired protein at a higher concentration. Further, protein production can be improved by optimizing a base sequence of cDNA to be knocked-in in accordance with a codon usage used in a chicken.
3. Increasing Copy Number of Inserted Gene
[0153] In the present example, only one gene was knocked-in, however a plurality of genes can be knocked-in in a tandem form that allows transcription and translation to simultaneously express the plurality of genes, thereby increasing their expression levels. The plurality of genes may be composed of a single gene or a plurality of kinds of genes and the number of genes may be any number. Specifically, when performing gene knock-in, a plurality of genes may be inserted by interposing a sequence such as IRES between them to facilitate transcription and translation and increase gene products. Further, a plurality of proteins may be arranged by interposing a 2A peptide or the like between them and simultaneously expressed under control of the ovalbumin promoter. In this manner, a larger amount of proteins can be produced by cleaving the peptides.
[0154] As a preferable embodiment, the light chain gene and heavy chain gene of the human antibody gene and .alpha.1 and .alpha.2 of the human type I collagen gene are each tandemly connected via the 2A peptide gene and knocked-in into a chicken.
4. Usefulness of Knock-Out Chicken Egg
[0155] The homozygous ovomucoid knock-out chicken does not secrete ovomucoid and thus produce an egg having no ovomucoid. Ovomucoid is a very strong allergen substance and it is known that allergenicity of ovomucoid is not lost by heating or enzymatic degradation. Needless to say, an ovomucoid deficient egg does not have strong allergenicity caused by ovomucoid, thus it is obvious that the ovomucoid deficient egg is useful, as a low allergenic egg, for significantly reducing allergenicity in all products that use an egg, such as a raw food, a processed food, a vaccine produced in an egg, and a cosmetic raw material. Further, an exogenous gene can be expressed in an albumin allergen knock-out chicken egg by breeding or using a genome edited primordial germ cell, and it is clearly understood that purification of the exogenous gene product produced in this manner can simplify an allergen removing process.
[0156] A poultry egg in which an oviduct-specific gene other than ovomucoid is knocked-out is useful for significantly reducing allergenicity as is the case for ovomucoid.
Sequence CWU
1
1
71122DNAArtificial Sequenceprimer 1acaactcaga gttcaccatg gg
22224DNAArtificial Sequenceprimer
2caccgacaac tcagagttca ccat
24324DNAArtificial Sequenceprimer 3aaacatggtg aactctgagt tgtc
2441383DNAArtificial Sequencepuromycin
resistant gene unit 4gcggccgcgg gaattcgatt gtcgaccctc gacctcgaaa
ttctaccggg taggggaggc 60gcttttccca aggcagtctg gagcatgcgc tttagcagcc
ccgctgggca cttggcgcta 120cacaagtggc ctctggcctc gcacacattc cacatccacc
ggtaggcgcc aaccggctcc 180gttctttggt ggccccttcg cgccaccttc tactcctccc
ctagtcagga agttcccccc 240cgccccgcag ctcgcgtcgt gcaggacgtg acaaatggaa
gtagcacgtc tcactagtct 300cgtgcagatg gacagcaccg ctgagcaatg gaagcgggta
ggcctttggg gcagcggcca 360atagcagctt tgctccttcg ctttctgggc tcagaggctg
ggaaggggtg ggtccggggg 420cgggctcagg ggcgggctca ggggcggggc gggcgcccga
aggtcctccg gaggcccggc 480attctgcacg cttcaaaagc gcacgtctgc cgcgctgttc
tcctcttcct catctccggg 540cctttcgacc tgcatccatc tagatctaga tcagcttacc
atgaccgagt acaagcccac 600ggtgcgcctc gccacccgcg acgacgtccc cagggccgta
cgcaccctcg ccgccgcgtt 660cgccgactac cccgccacgc gccacaccgt cgatccggac
cgccacatcg agcgggtcac 720cgagctgcaa gaactcttcc tcacgcgcgt cgggctcgac
atcggcaagg tgtgggtcgc 780ggacgacggc gccgcggtgg cggtctggac cacgccggag
agcgtcgaag cgggggcggt 840gttcgccgag atcggcccgc gcatggccga gttgagcggt
tcccggctgg ccgcgcagca 900acagatggaa ggcctcctgg cgccgcaccg gcccaaggag
cccgcgtggt tcctggccac 960cgtcggcgtc tcgcccgacc accagggcaa gggtctgggc
agcgccgtcg tgctccccgg 1020agtggaggcg gccgagcgcg ccggggtgcc cgccttcctg
gagacctccg cgccccgcaa 1080cctccccttc tacgagcggc tcggcttcac cgtcaccgcc
gacgtcgagg tgcccgaagg 1140accgcgcacc tggtgcatga cccgcagccc ggtgcctgac
gcccgcccca cgacccgcag 1200cgcccgaccg aaaggagcgc acgaccccat gcatcgatga
tgccaataaa gatatcattg 1260atgagtttgg acaaaccaca actagaatgc agtgaaaaaa
atgctttatt tgtgaaattt 1320gtgatgctat tgctttattt gtaaccatta taagctgcaa
atcactagtg aattcgcggc 1380cgc
138351834DNAArtificial Sequenceneomycin resistant
gene unit 5gcggccgcgg gaattcgatt aagtcgactc aggtggcact tttcggggaa
atgtgcgcgg 60aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca
tgagacaata 120accctgataa atgcttcaat aatattgaaa aaggaagagt cctgaggcgg
aaagaaccag 180ctgtggaatg tgtgtcagtt agggtgtgga aagtccccag gctccccagc
aggcagaagt 240atgcaaagca tgcatctcaa ttagtcagca accaggtgtg gaaagtcccc
aggctcccca 300gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccatagt
cccgccccta 360actccgccca tcccgcccct aactccgccc agttccgccc attctccgcc
ccatggctga 420ctaatttttt ttatttatgc agaggccgag gccgcctcgg cctctgagct
attccagaag 480tagtgaggag gcttttttgg aggcctaggc ttttgcaaag atcgatcaag
agacaggatg 540aggatcgttt cgcatgattg aacaagatgg attgcacgca ggttctccgg
ccgcttgggt 600ggagaggcta ttcggctatg actgggcaca acagacaatc ggctgctctg
atgccgccgt 660gttccggctg tcagcgcagg ggcgcccggt tctttttgtc aagaccgacc
tgtccggtgc 720cctgaatgaa ctgcaagacg aggcagcgcg gctatcgtgg ctggccacga
cgggcgttcc 780ttgcgcagct gtgctcgacg ttgtcactga agcgggaagg gactggctgc
tattgggcga 840agtgccgggg caggatctcc tgtcatctca ccttgctcct gccgagaaag
tatccatcat 900ggctgatgca atgcggcggc tgcatacgct tgatccggct acctgcccat
tcgaccacca 960agcgaaacat cgcatcgagc gagcacgtac tcggatggaa gccggtcttg
tcgatcagga 1020tgatctggac gaagagcatc aggggctcgc gccagccgaa ctgttcgcca
ggctcaaggc 1080gagcatgccc gacggcgagg atctcgtcgt gacccatggc gatgcctgct
tgccgaatat 1140catggtggaa aatggccgct tttctggatt catcgactgt ggccggctgg
gtgtggcgga 1200ccgctatcag gacatagcgt tggctacccg tgatattgct gaagagcttg
gcggcgaatg 1260ggctgaccgc ttcctcgtgc tttacggtat cgccgctccc gattcgcagc
gcatcgcctt 1320ctatcgcctt cttgacgagt tcttctgagc gggactctgg ggttcgaaat
gaccgaccaa 1380gcgacgccca acctgccatc acgagatttc gattccaccg ccgccttcta
tgaaaggttg 1440ggcttcggaa tcgttttccg ggacgccggc tggatgatcc tccagcgcgg
ggatctcatg 1500ctggagttct tcgcccaccc tagggggagg ctaactgaaa cacggaagga
gacaataccg 1560gaaggaaccc gcgctatgac ggcaataaaa agacagaata aaacgcacgg
tgttgggtcg 1620tttgttcata aacgcggggt tcggtcccag ggctggcact ctgtcgatac
cccaccgaga 1680ccccattggg gccaatacgc ccgcgtttct tccttttccc caccccaccc
cccaagttcg 1740ggtgaaggcc cagggctcgc agccaacgtc ggggcggcag gccctgccat
agcctcaggt 1800tactcgagtt aatcactagt gaattcgcgg ccgc
1834622DNAchiken 6tttcccaacg ctacagacat gg
22724DNAArtificial Sequenceprimer 7caccgtttcc
caacgctaca gaca
24824DNAArtificial Sequenceprimer 8aaactgtctg tagcgttggg aaac
24924DNAArtificial Sequenceprimer
9ctatgtacag cattccatcc ttac
241023DNAArtificial Sequenceprimer 10ttcagcctct gagctatgca gtt
231121DNAArtificial Sequenceprimer
11ctacaaaatg tcactttgtc c
211222DNAArtificial Sequenceprimer 12tgatgtctag gcaaccgagt gt
22138136DNAhuman 13gtcgacctta agtcctcaga
cttggcaagg agaatgtaga tttctacagt atatatgttt 60tcacaaaagg aaggagagaa
acaaaagaaa atggcactga ctaaacttca gctagtggta 120taggaaagta attctgctta
acagagattg cagtgatctc tatgtatgtc ctgaagaatt 180atgttgtact tttttccctc
atttttaaat caaacagtgc tttacagagg tcagaatggt 240ttctttactg tttgtcaatt
ctattatttc aatacagaac aatagcttct ataactgaaa 300tatatttgct attgtatatt
atgattgtcc ctcgaaccat gaacactcct ccagctgaat 360ttcacaattc ctctgtcatc
tgccaggcca ttaagttatt catggaagat ctttgaggaa 420cactgcaagt tcatatcata
aacacatttg aaattgagta ttgttttgca ttgtatggag 480ctatgttttg ctgtatcctc
agaaaaaaag tttgttataa agcattcaca cccataaaaa 540gatagattta aatattccag
ctataggaaa gaaagtgcgt ctgctcttca ctctagtctc 600agttggctcc ttcacatgca
cgcttcttta tttctcctat tttgtcaaga aaataatagg 660tcacgtcttg ttctcactta
tgtcctgcct agcatggctc agatgcacgt tgtacataca 720agaaggatca aatgaaacag
acttctggtc tgttactaca accatagtaa taagcacact 780aactaataat tgctaattat
gttttccatc tccaaggttc ccacattttt ctgttttctt 840aaagatccca ttatctggtt
gtaactgaag ctcaatggaa catgagcaat atttcccagt 900cttctctccc atccaacagt
cctgatggat tagcagaaca ggcagaaaac acattgttac 960ccagaattaa aaactaatat
ttgctctcca ttcaatccaa aatggaccta ttgaaactaa 1020aatctaaccc aatcccatta
aatgatttct atggcgtcaa aggtcaaact tctgaaggga 1080acctgtgggt gggtcacaat
tcaggctata tattccccag ggctcagcca gtgtctgtac 1140atacagctag aaagctgtat
tgcctttagc actcaagctc aaaaggtaag caactctctg 1200gaattacctt ctctctatat
tagctcttac ttgcacctaa actttaaaaa attaacaatt 1260attgtgttat gtgttgtatc
tttaagggtg aagtacctgc gtgatacccc ctataaaaac 1320ttctcacctg tgtatgcatt
ctgcactatt ttattatgtg taaaagcttt gtgtttgttt 1380tcaggaggct tattctttgt
gcttaaaata tgtttttaat ttcagaacat cttatcctgt 1440cgttcactat ctgatatgct
ttgcagtttg cctgattaac ttctagccct acagagtgca 1500cagagagcaa aatcatggtg
ttcagtgaat tctggggagt tattttaatg tgaaaattct 1560ctagaagttt aattcctgca
aagtgcagct gctgatcact acacaagata aaaatgtggg 1620gggtgcataa acgtatattc
ttacaataat agatacatgt gaacttgtat acagaaaaga 1680aaatgagaaa aatgtgtgtg
cgtatactca cacacgtggt cagtaaaaac ttttgagggg 1740tttaatacag aaaatccaat
cctgaggccc cagcactcag tacgcatata aagggctggg 1800ctctgaagga cttctgactt
tcacagatta tataaatctc aggaaagcaa ctagattcat 1860gctggctcca aaagctgtgc
tttatataag cacactggct atacaatagt tgtacagttc 1920agctctttat aatagaaaca
gacagaacaa gtataaatct tctattggtc tatgtcatga 1980acaagaattc attcagtggc
tctgttttat agtaaacatt gctattttat catgtctgca 2040tttctcttct gtctgaatgt
caccactaaa atttaactcc acagaaagtt tatactacag 2100tacacatgca tatctttgag
caaagcaaac catacctgaa agtgcaatag agcagaatat 2160gaattacatg cgtgtctttc
tcctagacta catgacccca tataaattac attccttatc 2220tattctgcca tcaccaaaac
aaaggtaaaa atacttttga agatctactc atagcaagta 2280gtgtgcaaca aacagatatt
tctctacatt tatttttagg gaataaaaat aagaaataaa 2340atagtcagca agcctctgct
ttctcatata tctgtccaaa cctaaagttt actgaaattt 2400gctctttgaa tttccagttt
tgcaagccta tcagattgtg ttttaatcag aggtactgaa 2460aagtatcaat gaattctagc
tttcactgaa caaaaatatg tagaggcaac tggcttctgg 2520gacagtttgc tacccaaaag
acaactgaat gcaaatacat aaatagattt atgaatatgg 2580ttttgaacat gcacatgaga
ggtggatata gcaacagaca cattaccaca gaattacttt 2640aaaactactt gttaacattt
aattgcctaa aaactgctcg taatttactg ttgtagccta 2700ccatagagta ccctgcatgg
tactatgtac agcattccat ccttacattt tcactgttct 2760gctgtttgct ctagacaact
cagagttcac catgaccaac aagtgtctcc tccaaattgc 2820tctcctgttg tgcttctcca
ctacagctct ttccatgagc tacaacttgc ttggattcct 2880acaaagaagc agcaattttc
agtgtcagaa gctcctgtgg caattgaatg ggaggcttga 2940atattgcctc aaggacagga
tgaactttga catccctgag gagattaagc agctgcagca 3000gttccagaag gaggacgccg
cattgaccat ctatgagatg ctccagaaca tctttgctat 3060tttcagacaa gattcatcta
gcactggctg gaatgagact attgttgaga acctcctggc 3120taatgtctat catcagataa
accatctgaa gacagtcctg gaagaaaaac tggagaaaga 3180agatttcacc aggggaaaac
tcatgagcag tctgcacctg aaaagatatt atgggaggat 3240tctgcattac ctgaaggcca
aggagtacag tcactgtgcc tggaccatag tcagagtgga 3300aatcctaagg aacttttact
tcattaacag acttacaggt tacctccgaa actgaagatc 3360tcctagcctg tgcctctggt
cgactcgctg atcagcctcg actttgcctt ctagttgcca 3420gccatctgtt gtttgcccct
cccccgtgcc ttccttgacc ctggaaggtg ccactcccac 3480tgtcctttcc taataaaatg
aggaaattgc atcgcattgt ctgagtaggt gtcattctat 3540tctggggggt ggggtggggc
aggacagcaa gggggaggat tgggaagaca atagcaggca 3600ctcgaccctc gacctcgaaa
ttctaccggg taggggaggc gcttttccca aggcagtctg 3660gagcatgcgc tttagcagcc
ccgctgggca cttggcgcta cacaagtggc ctctggcctc 3720gcacacattc cacatccacc
ggtaggcgcc aaccggctcc gttctttggt ggccccttcg 3780cgccaccttc tactcctccc
ctagtcagga agttcccccc cgccccgcag ctcgcgtcgt 3840gcaggacgtg acaaatggaa
gtagcacgtc tcactagtct cgtgcagatg gacagcaccg 3900ctgagcaatg gaagcgggta
ggcctttggg gcagcggcca atagcagctt tgctccttcg 3960ctttctgggc tcagaggctg
ggaaggggtg ggtccggggg cgggctcagg ggcgggctca 4020ggggcggggc gggcgcccga
aggtcctccg gaggcccggc attctgcacg cttcaaaagc 4080gcacgtctgc cgcgctgttc
tcctcttcct catctccggg cctttcgacc tgcatccatc 4140tagatctaga tcagcttacc
atgaccgagt acaagcccac ggtgcgcctc gccacccgcg 4200acgacgtccc cagggccgta
cgcaccctcg ccgccgcgtt cgccgactac cccgccacgc 4260gccacaccgt cgatccggac
cgccacatcg agcgggtcac cgagctgcaa gaactcttcc 4320tcacgcgcgt cgggctcgac
atcggcaagg tgtgggtcgc ggacgacggc gccgcggtgg 4380cggtctggac cacgccggag
agcgtcgaag cgggggcggt gttcgccgag atcggcccgc 4440gcatggccga gttgagcggt
tcccggctgg ccgcgcagca acagatggaa ggcctcctgg 4500cgccgcaccg gcccaaggag
cccgcgtggt tcctggccac cgtcggcgtc tcgcccgacc 4560accagggcaa gggtctgggc
agcgccgtcg tgctccccgg agtggaggcg gccgagcgcg 4620ccggggtgcc cgccttcctg
gagacctccg cgccccgcaa cctccccttc tacgagcggc 4680tcggcttcac cgtcaccgcc
gacgtcgagg tgcccgaagg accgcgcacc tggtgcatga 4740cccgcaagcc cggtgcctga
cgcccgcccc acgacccgca gcgcccgacc gaaaggagcg 4800cacgacccca tgcatcgatg
atgccaataa agatatcatt gatgagtttg gacaaaccac 4860aactagaatg cagtgaaaaa
aatgctttat ttgtgaaatt tgtgatgcta ttgctttatt 4920tgtaaccatt ataagctgca
tatcgaattc ccgcggccgc gggaattcga ttccatcggt 4980gcagcaagca tggaattttg
ttttgatgta ttcaaggagc tcaaagtcca ccatgccaat 5040gagaacatct tctactgccc
cattgccatc atgtcagctc tagccatggt atacctgggt 5100gcaaaagaca gcaccaggac
acagataaat aaggtgagcc tacagttaaa gattaaaacc 5160tttgccctgc tcaatggagc
cacagcactt aattgtatga taatgtccct tggaaactgc 5220atagctcaga ggctgaaaat
ctgaaaccag agttatctaa aagtgtggcc acctccaact 5280cccagagtgt tacccaaatg
cactagctag aaatcttgaa actggattgc ataacttctt 5340tttgtcataa ccattatttc
agctactatt attttcaatt acaggttgtt cgctttgata 5400aacttccagg attcggagac
agtattgaag ctcaggtaca gaaataattt cacctccttc 5460tctatgtccc tttcctctgg
aagcaaaata cagcagatga agcaatctct tagctgttcc 5520aagccctctc tgatgagcag
ctagtgctct gcatccagca gttgggagaa cactgttcat 5580aagaacagag aaaaagaagg
aagtaacagg ggattcagaa caaacagaag ataaaactca 5640ggacaaaaat accgtgtgaa
tgaggaaact tgtggatatt tgtacgctta agcaagacag 5700ctagatgatt ctggataaat
gggtctggtt ggaaaagaag gaaagcctgg ctgatctgct 5760ggagctagat tattgcagca
ggtaggcagg agttccctag agaaaagtat gagggaatta 5820cagaagaaaa acagcacaaa
attgtaaata ttggaaaagg accacatcag tgtagttact 5880agcagtaaga cagacaggat
gaaaaatagt tttgtaaaca gaagtatcta actactttac 5940tctgttcata cactatgtaa
aacctactaa gtaataaaac tagaataaca acatctttct 6000ttctctttgt attcagtgtg
gcacatctgt aaacgttcac tcttcactta gagacatcct 6060caaccaaatc accaaaccaa
atgatgttta ttcgttcagc cttgccagta gactttatgc 6120tgaagagaga tacccaatcc
tgccagtaag ttgctctaaa atctgatctg agtgtatttc 6180catgccaaag ctctaccatt
ctgtaatgca aaaacagtca gagttccaca tgtttcacta 6240agaaaatttc tttttctctt
gtttttacaa atgaaagaga ggacaaataa catttctcta 6300tcaccgacct gaaactctac
agtcttcaga gaatgaatgg cttgctaaaa gaatgtcaaa 6360tcttaccata cagctatttc
atattacact actaaataca ctataaggca tagcatgtag 6420taatacactg taaaatagct
ttttacacta ctatattatt aatatctgtt aattccagtc 6480ttgcatttca catttgcaaa
acgttttgaa attcgtatct gaaagctgaa tactcttgct 6540ttacaggaat acttgcagtg
tgtgaaggaa ctgtatagag gaggcttgga acctatcaac 6600tttcaaacag ctgcagatca
agccagagag ctcatcaatt cctgggtaga aagtcagaca 6660aatggtaagg tagaacatgc
tttgtacata gtgagagttg gttcacccta atactgagaa 6720cctggatata gctcagccag
cgtgctttgc gttcaagctt accagagctg ttgtatgcct 6780gttaagcagg gcatacagtc
atgaggctct tgaaaaatct taacagacaa agggcaatgg 6840aaaatcggag ttaagggatg
gtagggataa aatgcataga aagaggtacc acgattttga 6900tttttgccct aatgcctctc
tgcgtggttc ctcaattttt ctacttcatt cctcatctcc 6960tcagagcatt cctttccctc
atgcttgaaa cacagatgaa agactgtgaa ttctaactga 7020gatgaaaaca tccacaacca
cacaacctct ggtgtggagt cacattctgt gaaggcaaaa 7080actaggccac gtaatctatg
tgtgcaagct acgtgtaagc tatgtgtgtg acaggacaat 7140gtgaggaaca tactatgtgc
acaaggactg cagaataaac aggagcaaag tttttgaaga 7200aaacagagta aaatcccgtt
ttcctctttt gttacattct ttacatatat ctcaaatttc 7260ctctttggtt agaagcaagt
aatatttatg tttcttggta ctgtttgggt tgaagaccat 7320tctgggataa gagaaattcc
agtggttctt cccctaatca taaaatgtac aggtttagtt 7380tttttgtaac acagaaatct
cttcatcttt tatcttttgt tgtgattctt tatagagaga 7440gaaacaagac ttactgacaa
tagcagcaag aaaatcaatc ttggaagaac aagattgcag 7500ttgcaaaaac aaaccaatgt
ccttgcccct acatcctctt ccccataaat tctacattct 7560ctatctacct tgtgcttgcc
aacatgatat acgtaaactc tcttttccta ttcattctta 7620aaggaattat cagaaatgtc
cttcagccaa gctccgtgga ttctcaaact gcaatggttc 7680tggttaatgc cattgtcttc
aaaggactgt gggagaaagc atttaaggat gaagacacac 7740aagcaatgcc tttcagagtg
actgaggtat atgggcatac cttagagatg taatctagaa 7800tttatgaaga gagtagacat
gttgttatat gaacactgca ttagcgtatc tgctcatttg 7860tctgcatctc tttcagacac
tgtgttaaaa gcagggaatt ttccttatgt ctctttcatc 7920acaatattcc tgacattgca
aagctcctga gaaataactt cagattccca cttttcctag 7980ggaggtcttc ctggatgaga
acaatcaatc atcttaactg taactagata tttctgcatc 8040taagaataat ctttgttaaa
actatattct ctctctcttt tttttttttt ttggttctcc 8100agcaagaaag caaacctgtg
cagatgatgt ggatcc 81361427DNAArtificial
Sequenceprimer 14gctgtagtgg agaagcacaa caggaga
271525DNAArtificial Sequenceprimer 15ctctttctgc agactgacat
gcatt 251626DNAArtificial
Sequenceprimer 16atttggagga gacacttgtt ggtcat
261723DNAArtificial Sequenceprimer 17acctgtggtg tagacatcca
gca 231824DNAArtificial
Sequenceprimer 18caacctcccc ttctacgagc ggct
241922DNAArtificial Sequenceprimer 19aggacccagt gggacaaatc
ta 222024DNAArtificial
Sequenceprimer 20accgaaagga gcgcacgacc ccat
242123DNAArtificial Sequenceprimer 21caacttctag ggccatacct
gct 232225DNAArtificial
Sequenceprimer 22aaaattccat gcttgctgca ccgat
25231693DNAArtificial SequenceSV40Pe-Neor 23ttcaaatatg
tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 60aaggaagagt
cctgaggcgg aaagaaccag ctgtggaatg tgtgtcagtt agggtgtgga 120aagtccccag
gctccccagc aggcagaagt atgcaaagca tgcatctcaa ttagtcagca 180accaggtgtg
gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag catgcatctc 240aattagtcag
caaccatagt cccgccccta actccgccca tcccgcccct aactccgccc 300agttccgccc
attctccgcc ccatggctga ctaatttttt ttatttatgc agaggccgag 360gccgcctcgg
cctctgagct attccagaag tagtgaggag gcttttttgg aggcctaggc 420ttttgcaaag
atcgatcaag agacaggatg aggatcgttt cgcatgattg aacaagatgg 480attgcacgca
ggttctccgg ccgcttgggt ggagaggcta ttcggctatg actgggcaca 540acagacaatc
ggctgctctg atgccgccgt gttccggctg tcagcgcagg ggcgcccggt 600tctttttgtc
aagaccgacc tgtccggtgc cctgaatgaa ctgcaagacg aggcagcgcg 660gctatcgtgg
ctggccacga cgggcgttcc ttgcgcagct gtgctcgacg ttgtcactga 720agcgggaagg
gactggctgc tattgggcga agtgccgggg caggatctcc tgtcatctca 780ccttgctcct
gccgagaaag tatccatcat ggctgatgca atgcggcggc tgcatacgct 840tgatccggct
acctgcccat tcgaccacca agcgaaacat cgcatcgagc gagcacgtac 900tcggatggaa
gccggtcttg tcgatcagga tgatctggac gaagagcatc aggggctcgc 960gccagccgaa
ctgttcgcca ggctcaaggc gagcatgccc gacggcgagg atctcgtcgt 1020gacccatggc
gatgcctgct tgccgaatat catggtggaa aatggccgct tttctggatt 1080catcgactgt
ggccggctgg gtgtggcgga ccgctatcag gacatagcgt tggctacccg 1140tgatattgct
gaagagcttg gcggcgaatg ggctgaccgc ttcctcgtgc tttacggtat 1200cgccgctccc
gattcgcagc gcatcgcctt ctatcgcctt cttgacgagt tcttctgagc 1260gggactctgg
ggttcgaaat gaccgaccaa gcgacgccca acctgccatc acgagatttc 1320gattccaccg
ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc 1380tggatgatcc
tccagcgcgg ggatctcatg ctggagttct tcgcccaccc tagggggagg 1440ctaactgaaa
cacggaagga gacaataccg gaaggaaccc gcgctatgac ggcaataaaa 1500agacagaata
aaacgcacgg tgttgggtcg tttgttcata aacgcggggt tcggtcccag 1560ggctggcact
ctgtcgatac cccaccgaga ccccattggg gccaatacgc ccgcgtttct 1620tccttttccc
caccccaccc cccaagttcg ggtgaaggcc cagggctcgc agccaacgtc 1680ggggcggcag
gcc
16932422DNAArtificial Sequenceprimer 24agttcaccat gggctccatc gg
222524DNAArtificial Sequenceprimer
25caccgagttc accatgggct ccat
242624DNAArtificial Sequenceprimer 26aaacatggag cccatggtga actc
24279735DNAhuman 27gtcgacctta agtcctcaga
cttggcaagg agaatgtaga tttctacagt atatatgttt 60tcacaaaagg aaggagagaa
acaaaagaaa atggcactga ctaaacttca gctagtggta 120taggaaagta attctgctta
acagagattg cagtgatctc tatgtatgtc ctgaagaatt 180atgttgtact tttttccctc
atttttaaat caaacagtgc tttacagagg tcagaatggt 240ttctttactg tttgtcaatt
ctattatttc aatacagaac aatagcttct ataactgaaa 300tatatttgct attgtatatt
atgattgtcc ctcgaaccat gaacactcct ccagctgaat 360ttcacaattc ctctgtcatc
tgccaggcca ttaagttatt catggaagat ctttgaggaa 420cactgcaagt tcatatcata
aacacatttg aaattgagta ttgttttgca ttgtatggag 480ctatgttttg ctgtatcctc
agaaaaaaag tttgttataa agcattcaca cccataaaaa 540gatagattta aatattccag
ctataggaaa gaaagtgcgt ctgctcttca ctctagtctc 600agttggctcc ttcacatgca
cgcttcttta tttctcctat tttgtcaaga aaataatagg 660tcacgtcttg ttctcactta
tgtcctgcct agcatggctc agatgcacgt tgtacataca 720agaaggatca aatgaaacag
acttctggtc tgttactaca accatagtaa taagcacact 780aactaataat tgctaattat
gttttccatc tccaaggttc ccacattttt ctgttttctt 840aaagatccca ttatctggtt
gtaactgaag ctcaatggaa catgagcaat atttcccagt 900cttctctccc atccaacagt
cctgatggat tagcagaaca ggcagaaaac acattgttac 960ccagaattaa aaactaatat
ttgctctcca ttcaatccaa aatggaccta ttgaaactaa 1020aatctaaccc aatcccatta
aatgatttct atggcgtcaa aggtcaaact tctgaaggga 1080acctgtgggt gggtcacaat
tcaggctata tattccccag ggctcagcca gtgtctgtac 1140atacagctag aaagctgtat
tgcctttagc actcaagctc aaaaggtaag caactctctg 1200gaattacctt ctctctatat
tagctcttac ttgcacctaa actttaaaaa attaacaatt 1260attgtgttat gtgttgtatc
tttaagggtg aagtacctgc gtgatacccc ctataaaaac 1320ttctcacctg tgtatgcatt
ctgcactatt ttattatgtg taaaagcttt gtgtttgttt 1380tcaggaggct tattctttgt
gcttaaaata tgtttttaat ttcagaacat cttatcctgt 1440cgttcactat ctgatatgct
ttgcagtttg cctgattaac ttctagccct acagagtgca 1500cagagagcaa aatcatggtg
ttcagtgaat tctggggagt tattttaatg tgaaaattct 1560ctagaagttt aattcctgca
aagtgcagct gctgatcact acacaagata aaaatgtggg 1620gggtgcataa acgtatattc
ttacaataat agatacatgt gaacttgtat acagaaaaga 1680aaatgagaaa aatgtgtgtg
cgtatactca cacacgtggt cagtaaaaac ttttgagggg 1740tttaatacag aaaatccaat
cctgaggccc cagcactcag tacgcatata aagggctggg 1800ctctgaagga cttctgactt
tcacagatta tataaatctc aggaaagcaa ctagattcat 1860gctggctcca aaagctgtgc
tttatataag cacactggct atacaatagt tgtacagttc 1920agctctttat aatagaaaca
gacagaacaa gtataaatct tctattggtc tatgtcatga 1980acaagaattc attcagtggc
tctgttttat agtaaacatt gctattttat catgtctgca 2040tttctcttct gtctgaatgt
caccactaaa atttaactcc acagaaagtt tatactacag 2100tacacatgca tatctttgag
caaagcaaac catacctgaa agtgcaatag agcagaatat 2160gaattacatg cgtgtctttc
tcctagacta catgacccca tataaattac attccttatc 2220tattctgcca tcaccaaaac
aaaggtaaaa atacttttga agatctactc atagcaagta 2280gtgtgcaaca aacagatatt
tctctacatt tatttttagg gaataaaaat aagaaataaa 2340atagtcagca agcctctgct
ttctcatata tctgtccaaa cctaaagttt actgaaattt 2400gctctttgaa tttccagttt
tgcaagccta tcagattgtg ttttaatcag aggtactgaa 2460aagtatcaat gaattctagc
tttcactgaa caaaaatatg tagaggcaac tggcttctgg 2520gacagtttgc tacccaaaag
acaactgaat gcaaatacat aaatagattt atgaatatgg 2580ttttgaacat gcacatgaga
ggtggatata gcaacagaca cattaccaca gaattacttt 2640aaaactactt gttaacattt
aattgcctaa aaactgctcg taatttactg ttgtagccta 2700ccatagagta ccctgcatgg
tactatgtac agcattccat ccttacattt tcactgttct 2760gctgtttgct ctagacaact
cagagttcac catgaggtct ttgctaatct tggtgctttg 2820cttcctgccc ctggctgctc
tgggggaggt ccagctggtg gagtccggcg gagggctggt 2880ccagcctgga ggcagcctga
gactgagctg tgctgccagc gggttcaata tcaaggatac 2940ctacatccac tgggtgaggc
aggcccccgg aaagggcctg gaatgggtgg ccaggattta 3000cccaactaat gggtatactc
ggtacgccga ttccgtcaaa ggcagattta ccattagcgc 3060agacaccagc aaaaacacag
catacctgca gatgaactcc ctgagagctg aagacacagc 3120tgtgtattat tgctcccggt
gggggggcga cggcttttat gccatggact actggggcca 3180gggaaccctg gtcaccgtct
cctcagcctc caccaagggc ccatcggtct tccccctggc 3240accctcctcc aagagcacct
ctgggggcac agcagccctg ggctgcctgg tcaaggacta 3300cttccccgaa ccggtgacgg
tgtcgtggaa ctcaggcgcc ctgaccagcg gcgtgcacac 3360cttcccggct gtcctacagt
cctcaggact ctactccctc agcagcgtgg tgaccgtgcc 3420ctccagcagc ttgggcaccc
agacctacat ctgcaacgtg aatcacaagc ccagcaacac 3480caaggtggac aagaaagttg
agcccaaatc ttgtgacaaa actcacacat gcccaccgtg 3540cccagcacct gaactcctgg
ggggaccgtc agtcttcctc ttccccccaa aacccaagga 3600caccctcatg atctcccgga
cccctgaggt cacatgcgtg gtggtggacg tgagccacga 3660agaccctgag gtcaagttca
actggtacgt ggacggcgtg gaggtgcata atgccaagac 3720aaagccgcgg gaggagcagt
acaacagcac gtaccgtgtg gtcagcgtcc tcaccgtcct 3780gcaccaggac tggctgaatg
gcaaggagta caagtgcaag gtctccaaca aagccctccc 3840agcccccatc gagaaaacca
tctccaaagc caaagggcag ccccgagaac cacaggtgta 3900caccctgccc ccatcccggg
atgagctgac caagaaccag gtcagcctga cctgcctggt 3960caaaggcttc tatcccagcg
acatcgccgt ggagtgggag agcaatgggc agccggagaa 4020caactacaag accacgcctc
ccgtgctgga ctccgacggc tccttcttcc tctacagcaa 4080gctcaccgtg gacaagagca
ggtggcagca ggggaacgtc ttctcatgct ccgtgatgca 4140tgaggctctg cacaaccact
acacgcagaa gagcctctcc ctgtctccgg gtaaacgggc 4200caagagagcc cccgtgaagc
agaccctgaa cttcgacctg ctgaagctgg ccggcgacgt 4260ggagtccaac cccggcccca
tgcgctccct cttgattctc gtcttgtgtt ttttgccact 4320ggccgctctc ggcgacatac
agatgaccca gagcccctct tctctctcag catcagtcgg 4380cgacagggtc acaattacct
gccgggctag ccaagatgtg aacacagccg tggcttggta 4440tcagcagaaa cctgggaagg
ccccaaaact gctgatttat tctgctagct tcctgtattc 4500tggggtgcct tccagattct
ccggatccag atccggcact gatttcacac tgaccatcag 4560cagcctccag cccgaggatt
ttgcaacata ctactgtcag caacactaca ctactcctcc 4620aacctttggc caaggcacca
aggttgaaat caagcgtacg gtggctgcac catctgtctt 4680catcttcccg ccatctgatg
agcagttgaa atctggaact gcctctgttg tgtgcctgct 4740gaataacttc tatcccagag
aggccaaagt acagtggaag gtggataacg ccctccaatc 4800gggtaactcc caggagagtg
tcacagagca ggacagcaag gacagcacct acagcctcag 4860cagcaccctg acgctgagca
aagcagacta cgagaaacac aaagtctacg cctgcgaagt 4920cacccatcag ggcctgagct
cgcccgtcac aaagagcttc aacaggggag agtgttagtc 4980gactcgctga tcagcctcga
ctttgccttc tagttgccag ccatctgttg tttgcccctc 5040ccccgtgcct tccttgaccc
tggaaggtgc cactcccact gtcctttcct aataaaatga 5100ggaaattgca tcgcattgtc
tgagtaggtg tcattctatt ctggggggtg gggtggggca 5160ggacagcaag ggggaggatt
gggaagacaa tagcaggcac tcgaccctcg acctcgaaat 5220tctaccgggt aggggaggcg
cttttcccaa ggcagtctgg agcatgcgct ttagcagccc 5280cgctgggcac ttggcgctac
acaagtggcc tctggcctcg cacacattcc acatccaccg 5340gtaggcgcca accggctccg
ttctttggtg gccccttcgc gccaccttct actcctcccc 5400tagtcaggaa gttccccccc
gccccgcagc tcgcgtcgtg caggacgtga caaatggaag 5460tagcacgtct cactagtctc
gtgcagatgg acagcaccgc tgagcaatgg aagcgggtag 5520gcctttgggg cagcggccaa
tagcagcttt gctccttcgc tttctgggct cagaggctgg 5580gaaggggtgg gtccgggggc
gggctcaggg gcgggctcag gggcggggcg ggcgcccgaa 5640ggtcctccgg aggcccggca
ttctgcacgc ttcaaaagcg cacgtctgcc gcgctgttct 5700cctcttcctc atctccgggc
ctttcgacct gcatccatct agatctagat cagcttacca 5760tgaccgagta caagcccacg
gtgcgcctcg ccacccgcga cgacgtcccc agggccgtac 5820gcaccctcgc cgccgcgttc
gccgactacc ccgccacgcg ccacaccgtc gatccggacc 5880gccacatcga gcgggtcacc
gagctgcaag aactcttcct cacgcgcgtc gggctcgaca 5940tcggcaaggt gtgggtcgcg
gacgacggcg ccgcggtggc ggtctggacc acgccggaga 6000gcgtcgaagc gggggcggtg
ttcgccgaga tcggcccgcg catggccgag ttgagcggtt 6060cccggctggc cgcgcagcaa
cagatggaag gcctcctggc gccgcaccgg cccaaggagc 6120ccgcgtggtt cctggccacc
gtcggcgtct cgcccgacca ccagggcaag ggtctgggca 6180gcgccgtcgt gctccccgga
gtggaggcgg ccgagcgcgc cggggtgccc gccttcctgg 6240agacctccgc gccccgcaac
ctccccttct acgagcggct cggcttcacc gtcaccgccg 6300acgtcgaggt gcccgaagga
ccgcgcacct ggtgcatgac ccgcaagccc ggtgcctgac 6360gcccgcccca cgacccgcag
cgcccgaccg aaaggagcgc acgaccccat gcatcgatga 6420tgccaataaa gatatcattg
atgagtttgg acaaaccaca actagaatgc agtgaaaaaa 6480atgctttatt tgtgaaattt
gtgatgctat tgctttattt gtaaccatta taagctgcat 6540atcgaattcc cgcggccgcg
ggaattcgat tccatcggtg cagcaagcat ggaattttgt 6600tttgatgtat tcaaggagct
caaagtccac catgccaatg agaacatctt ctactgcccc 6660attgccatca tgtcagctct
agccatggta tacctgggtg caaaagacag caccaggaca 6720cagataaata aggtgagcct
acagttaaag attaaaacct ttgccctgct caatggagcc 6780acagcactta attgtatgat
aatgtccctt ggaaactgca tagctcagag gctgaaaatc 6840tgaaaccaga gttatctaaa
agtgtggcca cctccaactc ccagagtgtt acccaaatgc 6900actagctaga aatcttgaaa
ctggattgca taacttcttt ttgtcataac cattatttca 6960gctactatta ttttcaatta
caggttgttc gctttgataa acttccagga ttcggagaca 7020gtattgaagc tcaggtacag
aaataatttc acctccttct ctatgtccct ttcctctgga 7080agcaaaatac agcagatgaa
gcaatctctt agctgttcca agccctctct gatgagcagc 7140tagtgctctg catccagcag
ttgggagaac actgttcata agaacagaga aaaagaagga 7200agtaacaggg gattcagaac
aaacagaaga taaaactcag gacaaaaata ccgtgtgaat 7260gaggaaactt gtggatattt
gtacgcttaa gcaagacagc tagatgattc tggataaatg 7320ggtctggttg gaaaagaagg
aaagcctggc tgatctgctg gagctagatt attgcagcag 7380gtaggcagga gttccctaga
gaaaagtatg agggaattac agaagaaaaa cagcacaaaa 7440ttgtaaatat tggaaaagga
ccacatcagt gtagttacta gcagtaagac agacaggatg 7500aaaaatagtt ttgtaaacag
aagtatctaa ctactttact ctgttcatac actatgtaaa 7560acctactaag taataaaact
agaataacaa catctttctt tctctttgta ttcagtgtgg 7620cacatctgta aacgttcact
cttcacttag agacatcctc aaccaaatca ccaaaccaaa 7680tgatgtttat tcgttcagcc
ttgccagtag actttatgct gaagagagat acccaatcct 7740gccagtaagt tgctctaaaa
tctgatctga gtgtatttcc atgccaaagc tctaccattc 7800tgtaatgcaa aaacagtcag
agttccacat gtttcactaa gaaaatttct ttttctcttg 7860tttttacaaa tgaaagagag
gacaaataac atttctctat caccgacctg aaactctaca 7920gtcttcagag aatgaatggc
ttgctaaaag aatgtcaaat cttaccatac agctatttca 7980tattacacta ctaaatacac
tataaggcat agcatgtagt aatacactgt aaaatagctt 8040tttacactac tatattatta
atatctgtta attccagtct tgcatttcac atttgcaaaa 8100cgttttgaaa ttcgtatctg
aaagctgaat actcttgctt tacaggaata cttgcagtgt 8160gtgaaggaac tgtatagagg
aggcttggaa cctatcaact ttcaaacagc tgcagatcaa 8220gccagagagc tcatcaattc
ctgggtagaa agtcagacaa atggtaaggt agaacatgct 8280ttgtacatag tgagagttgg
ttcaccctaa tactgagaac ctggatatag ctcagccagc 8340gtgctttgcg ttcaagctta
ccagagctgt tgtatgcctg ttaagcaggg catacagtca 8400tgaggctctt gaaaaatctt
aacagacaaa gggcaatgga aaatcggagt taagggatgg 8460tagggataaa atgcatagaa
agaggtacca cgattttgat ttttgcccta atgcctctct 8520gcgtggttcc tcaatttttc
tacttcattc ctcatctcct cagagcattc ctttccctca 8580tgcttgaaac acagatgaaa
gactgtgaat tctaactgag atgaaaacat ccacaaccac 8640acaacctctg gtgtggagtc
acattctgtg aaggcaaaaa ctaggccacg taatctatgt 8700gtgcaagcta cgtgtaagct
atgtgtgtga caggacaatg tgaggaacat actatgtgca 8760caaggactgc agaataaaca
ggagcaaagt ttttgaagaa aacagagtaa aatcccgttt 8820tcctcttttg ttacattctt
tacatatatc tcaaatttcc tctttggtta gaagcaagta 8880atatttatgt ttcttggtac
tgtttgggtt gaagaccatt ctgggataag agaaattcca 8940gtggttcttc ccctaatcat
aaaatgtaca ggtttagttt ttttgtaaca cagaaatctc 9000ttcatctttt atcttttgtt
gtgattcttt atagagagag aaacaagact tactgacaat 9060agcagcaaga aaatcaatct
tggaagaaca agattgcagt tgcaaaaaca aaccaatgtc 9120cttgccccta catcctcttc
cccataaatt ctacattctc tatctacctt gtgcttgcca 9180acatgatata cgtaaactct
cttttcctat tcattcttaa aggaattatc agaaatgtcc 9240ttcagccaag ctccgtggat
tctcaaactg caatggttct ggttaatgcc attgtcttca 9300aaggactgtg ggagaaagca
tttaaggatg aagacacaca agcaatgcct ttcagagtga 9360ctgaggtata tgggcatacc
ttagagatgt aatctagaat ttatgaagag agtagacatg 9420ttgttatatg aacactgcat
tagcgtatct gctcatttgt ctgcatctct ttcagacact 9480gtgttaaaag cagggaattt
tccttatgtc tctttcatca caatattcct gacattgcaa 9540agctcctgag aaataacttc
agattcccac ttttcctagg gaggtcttcc tggatgagaa 9600caatcaatca tcttaactgt
aactagatat ttctgcatct aagaataatc tttgttaaaa 9660ctatattctc tctctctttt
tttttttttt tggttctcca gcaagaaagc aaacctgtgc 9720agatgatgtg gatcc
97352830DNAArtificial
Sequenceprimer 28ccccagagca gccaggggca ggaagcaaag
302928DNAArtificial Sequenceprimer 29aaagcaccaa gattagcaaa
gacctcat 283010104DNAhuman
30gtcgacctta agtcctcaga cttggcaagg agaatgtaga tttctacagt atatatgttt
60tcacaaaagg aaggagagaa acaaaagaaa atggcactga ctaaacttca gctagtggta
120taggaaagta attctgctta acagagattg cagtgatctc tatgtatgtc ctgaagaatt
180atgttgtact tttttccctc atttttaaat caaacagtgc tttacagagg tcagaatggt
240ttctttactg tttgtcaatt ctattatttc aatacagaac aatagcttct ataactgaaa
300tatatttgct attgtatatt atgattgtcc ctcgaaccat gaacactcct ccagctgaat
360ttcacaattc ctctgtcatc tgccaggcca ttaagttatt catggaagat ctttgaggaa
420cactgcaagt tcatatcata aacacatttg aaattgagta ttgttttgca ttgtatggag
480ctatgttttg ctgtatcctc agaaaaaaag tttgttataa agcattcaca cccataaaaa
540gatagattta aatattccag ctataggaaa gaaagtgcgt ctgctcttca ctctagtctc
600agttggctcc ttcacatgca cgcttcttta tttctcctat tttgtcaaga aaataatagg
660tcacgtcttg ttctcactta tgtcctgcct agcatggctc agatgcacgt tgtacataca
720agaaggatca aatgaaacag acttctggtc tgttactaca accatagtaa taagcacact
780aactaataat tgctaattat gttttccatc tccaaggttc ccacattttt ctgttttctt
840aaagatccca ttatctggtt gtaactgaag ctcaatggaa catgagcaat atttcccagt
900cttctctccc atccaacagt cctgatggat tagcagaaca ggcagaaaac acattgttac
960ccagaattaa aaactaatat ttgctctcca ttcaatccaa aatggaccta ttgaaactaa
1020aatctaaccc aatcccatta aatgatttct atggcgtcaa aggtcaaact tctgaaggga
1080acctgtgggt gggtcacaat tcaggctata tattccccag ggctcagcca gtgtctgtac
1140atacagctag aaagctgtat tgcctttagc actcaagctc aaaaggtaag caactctctg
1200gaattacctt ctctctatat tagctcttac ttgcacctaa actttaaaaa attaacaatt
1260attgtgttat gtgttgtatc tttaagggtg aagtacctgc gtgatacccc ctataaaaac
1320ttctcacctg tgtatgcatt ctgcactatt ttattatgtg taaaagcttt gtgtttgttt
1380tcaggaggct tattctttgt gcttaaaata tgtttttaat ttcagaacat cttatcctgt
1440cgttcactat ctgatatgct ttgcagtttg cctgattaac ttctagccct acagagtgca
1500cagagagcaa aatcatggtg ttcagtgaat tctggggagt tattttaatg tgaaaattct
1560ctagaagttt aattcctgca aagtgcagct gctgatcact acacaagata aaaatgtggg
1620gggtgcataa acgtatattc ttacaataat agatacatgt gaacttgtat acagaaaaga
1680aaatgagaaa aatgtgtgtg cgtatactca cacacgtggt cagtaaaaac ttttgagggg
1740tttaatacag aaaatccaat cctgaggccc cagcactcag tacgcatata aagggctggg
1800ctctgaagga cttctgactt tcacagatta tataaatctc aggaaagcaa ctagattcat
1860gctggctcca aaagctgtgc tttatataag cacactggct atacaatagt tgtacagttc
1920agctctttat aatagaaaca gacagaacaa gtataaatct tctattggtc tatgtcatga
1980acaagaattc attcagtggc tctgttttat agtaaacatt gctattttat catgtctgca
2040tttctcttct gtctgaatgt caccactaaa atttaactcc acagaaagtt tatactacag
2100tacacatgca tatctttgag caaagcaaac catacctgaa agtgcaatag agcagaatat
2160gaattacatg cgtgtctttc tcctagacta catgacccca tataaattac attccttatc
2220tattctgcca tcaccaaaac aaaggtaaaa atacttttga agatctactc atagcaagta
2280gtgtgcaaca aacagatatt tctctacatt tatttttagg gaataaaaat aagaaataaa
2340atagtcagca agcctctgct ttctcatata tctgtccaaa cctaaagttt actgaaattt
2400gctctttgaa tttccagttt tgcaagccta tcagattgtg ttttaatcag aggtactgaa
2460aagtatcaat gaattctagc tttcactgaa caaaaatatg tagaggcaac tggcttctgg
2520gacagtttgc tacccaaaag acaactgaat gcaaatacat aaatagattt atgaatatgg
2580ttttgaacat gcacatgaga ggtggatata gcaacagaca cattaccaca gaattacttt
2640aaaactactt gttaacattt aattgcctaa aaactgctcg taatttactg ttgtagccta
2700ccatagagta ccctgcatgg tactatgtac agcattccat ccttacattt tcactgttct
2760gctgtttgct ctagacaact cagagttcac catgaggtct ttgctaatct tggtgctttg
2820cttcctgccc ctggctgctc tgggggaggt ccagctggtg gagtccggcg gagggctggt
2880ccagcctgga ggcagcctga gactgagctg tgctgccagc gggttcaata tcaaggatac
2940ctacatccac tgggtgaggc aggcccccgg aaagggcctg gaatgggtgg ccaggattta
3000cccaactaat gggtatactc ggtacgccga ttccgtcaaa ggcagattta ccattagcgc
3060agacaccagc aaaaacacag catacctgca gatgaactcc ctgagagctg aagacacagc
3120tgtgtattat tgctcccggt gggggggcga cggcttttat gccatggact actggggcca
3180gggaaccctg gtcaccgtct cctcagcctc caccaagggc ccatcggtct tccccctggc
3240accctcctcc aagagcacct ctgggggcac agcagccctg ggctgcctgg tcaaggacta
3300cttccccgaa ccggtgacgg tgtcgtggaa ctcaggcgcc ctgaccagcg gcgtgcacac
3360cttcccggct gtcctacagt cctcaggact ctactccctc agcagcgtgg tgaccgtgcc
3420ctccagcagc ttgggcaccc agacctacat ctgcaacgtg aatcacaagc ccagcaacac
3480caaggtggac aagaaagttg agcccaaatc ttgtgacaaa actcacacat gcccaccgtg
3540cccagcacct gaactcctgg ggggaccgtc agtcttcctc ttccccccaa aacccaagga
3600caccctcatg atctcccgga cccctgaggt cacatgcgtg gtggtggacg tgagccacga
3660agaccctgag gtcaagttca actggtacgt ggacggcgtg gaggtgcata atgccaagac
3720aaagccgcgg gaggagcagt acaacagcac gtaccgtgtg gtcagcgtcc tcaccgtcct
3780gcaccaggac tggctgaatg gcaaggagta caagtgcaag gtctccaaca aagccctccc
3840agcccccatc gagaaaacca tctccaaagc caaagggcag ccccgagaac cacaggtgta
3900caccctgccc ccatcccggg atgagctgac caagaaccag gtcagcctga cctgcctggt
3960caaaggcttc tatcccagcg acatcgccgt ggagtgggag agcaatgggc agccggagaa
4020caactacaag accacgcctc ccgtgctgga ctccgacggc tccttcttcc tctacagcaa
4080gctcaccgtg gacaagagca ggtggcagca ggggaacgtc ttctcatgct ccgtgatgca
4140tgaggctctg cacaaccact acacgcagaa gagcctctcc ctgtctccgg gtaaacgggc
4200caagagagcc cccgtgaagc agaccctgaa cttcgacctg ctgaagctgg ccggcgacgt
4260ggagtccaac cccggcccca tgcgctccct cttgattctc gtcttgtgtt ttttgccact
4320ggccgctctc ggcgacatac agatgaccca gagcccctct tctctctcag catcagtcgg
4380cgacagggtc acaattacct gccgggctag ccaagatgtg aacacagccg tggcttggta
4440tcagcagaaa cctgggaagg ccccaaaact gctgatttat tctgctagct tcctgtattc
4500tggggtgcct tccagattct ccggatccag atccggcact gatttcacac tgaccatcag
4560cagcctccag cccgaggatt ttgcaacata ctactgtcag caacactaca ctactcctcc
4620aacctttggc caaggcacca aggttgaaat caagcgtacg gtggctgcac catctgtctt
4680catcttcccg ccatctgatg agcagttgaa atctggaact gcctctgttg tgtgcctgct
4740gaataacttc tatcccagag aggccaaagt acagtggaag gtggataacg ccctccaatc
4800gggtaactcc caggagagtg tcacagagca ggacagcaag gacagcacct acagcctcag
4860cagcaccctg acgctgagca aagcagacta cgagaaacac aaagtctacg cctgcgaagt
4920cacccatcag ggcctgagct cgcccgtcac aaagagcttc aacaggggag agtgttagtc
4980gactcgctga tcagcctcga ctttgccttc tagttgccag ccatctgttg tttgcccctc
5040ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga
5100ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca
5160ggacagcaag ggggaggatt gggaagacaa tagcaggcac tcgaccctcg acttcaaata
5220tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga
5280gtcctgaggc ggaaagaacc agctgtggaa tgtgtgtcag ttagggtgtg gaaagtcccc
5340aggctcccca gcaggcagaa gtatgcaaag catgcatctc aattagtcag caaccaggtg
5400tggaaagtcc ccaggctccc cagcaggcag aagtatgcaa agcatgcatc tcaattagtc
5460agcaaccata gtcccgcccc taactccgcc catcccgccc ctaactccgc ccagttccgc
5520ccattctccg ccccatggct gactaatttt ttttatttat gcagaggccg aggccgcctc
5580ggcctctgag ctattccaga agtagtgagg aggctttttt ggaggcctag gcttttgcaa
5640agatcgatca agagacagga tgaggatcgt ttcgcatgat tgaacaagat ggattgcacg
5700caggttctcc ggccgcttgg gtggagaggc tattcggcta tgactgggca caacagacaa
5760tcggctgctc tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg gttctttttg
5820tcaagaccga cctgtccggt gccctgaatg aactgcaaga cgaggcagcg cggctatcgt
5880ggctggccac gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact gaagcgggaa
5940gggactggct gctattgggc gaagtgccgg ggcaggatct cctgtcatct caccttgctc
6000ctgccgagaa agtatccatc atggctgatg caatgcggcg gctgcatacg cttgatccgg
6060ctacctgccc attcgaccac caagcgaaac atcgcatcga gcgagcacgt actcggatgg
6120aagccggtct tgtcgatcag gatgatctgg acgaagagca tcaggggctc gcgccagccg
6180aactgttcgc caggctcaag gcgagcatgc ccgacggcga ggatctcgtc gtgacccatg
6240gcgatgcctg cttgccgaat atcatggtgg aaaatggccg cttttctgga ttcatcgact
6300gtggccggct gggtgtggcg gaccgctatc aggacatagc gttggctacc cgtgatattg
6360ctgaagagct tggcggcgaa tgggctgacc gcttcctcgt gctttacggt atcgccgctc
6420ccgattcgca gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga gcgggactct
6480ggggttcgaa atgaccgacc aagcgacgcc caacctgcca tcacgagatt tcgattccac
6540cgccgccttc tatgaaaggt tgggcttcgg aatcgttttc cgggacgccg gctggatgat
6600cctccagcgc ggggatctca tgctggagtt cttcgcccac cctaggggga ggctaactga
6660aacacggaag gagacaatac cggaaggaac ccgcgctatg acggcaataa aaagacagaa
6720taaaacgcac ggtgttgggt cgtttgttca taaacgcggg gttcggtccc agggctggca
6780ctctgtcgat accccaccga gaccccattg gggccaatac gcccgcgttt cttccttttc
6840cccaccccac cccccaagtt cgggtgaagg cccagggctc gcagccaacg tcggggcggc
6900aggctgcata tcgaattccc gcggccgcgg gaattcgatt ccatcggtgc agcaagcatg
6960gaattttgtt ttgatgtatt caaggagctc aaagtccacc atgccaatga gaacatcttc
7020tactgcccca ttgccatcat gtcagctcta gccatggtat acctgggtgc aaaagacagc
7080accaggacac agataaataa ggtgagccta cagttaaaga ttaaaacctt tgccctgctc
7140aatggagcca cagcacttaa ttgtatgata atgtcccttg gaaactgcat agctcagagg
7200ctgaaaatct gaaaccagag ttatctaaaa gtgtggccac ctccaactcc cagagtgtta
7260cccaaatgca ctagctagaa atcttgaaac tggattgcat aacttctttt tgtcataacc
7320attatttcag ctactattat tttcaattac aggttgttcg ctttgataaa cttccaggat
7380tcggagacag tattgaagct caggtacaga aataatttca cctccttctc tatgtccctt
7440tcctctggaa gcaaaataca gcagatgaag caatctctta gctgttccaa gccctctctg
7500atgagcagct agtgctctgc atccagcagt tgggagaaca ctgttcataa gaacagagaa
7560aaagaaggaa gtaacagggg attcagaaca aacagaagat aaaactcagg acaaaaatac
7620cgtgtgaatg aggaaacttg tggatatttg tacgcttaag caagacagct agatgattct
7680ggataaatgg gtctggttgg aaaagaagga aagcctggct gatctgctgg agctagatta
7740ttgcagcagg taggcaggag ttccctagag aaaagtatga gggaattaca gaagaaaaac
7800agcacaaaat tgtaaatatt ggaaaaggac cacatcagtg tagttactag cagtaagaca
7860gacaggatga aaaatagttt tgtaaacaga agtatctaac tactttactc tgttcataca
7920ctatgtaaaa cctactaagt aataaaacta gaataacaac atctttcttt ctctttgtat
7980tcagtgtggc acatctgtaa acgttcactc ttcacttaga gacatcctca accaaatcac
8040caaaccaaat gatgtttatt cgttcagcct tgccagtaga ctttatgctg aagagagata
8100cccaatcctg ccagtaagtt gctctaaaat ctgatctgag tgtatttcca tgccaaagct
8160ctaccattct gtaatgcaaa aacagtcaga gttccacatg tttcactaag aaaatttctt
8220tttctcttgt ttttacaaat gaaagagagg acaaataaca tttctctatc accgacctga
8280aactctacag tcttcagaga atgaatggct tgctaaaaga atgtcaaatc ttaccataca
8340gctatttcat attacactac taaatacact ataaggcata gcatgtagta atacactgta
8400aaatagcttt ttacactact atattattaa tatctgttaa ttccagtctt gcatttcaca
8460tttgcaaaac gttttgaaat tcgtatctga aagctgaata ctcttgcttt acaggaatac
8520ttgcagtgtg tgaaggaact gtatagagga ggcttggaac ctatcaactt tcaaacagct
8580gcagatcaag ccagagagct catcaattcc tgggtagaaa gtcagacaaa tggtaaggta
8640gaacatgctt tgtacatagt gagagttggt tcaccctaat actgagaacc tggatatagc
8700tcagccagcg tgctttgcgt tcaagcttac cagagctgtt gtatgcctgt taagcagggc
8760atacagtcat gaggctcttg aaaaatctta acagacaaag ggcaatggaa aatcggagtt
8820aagggatggt agggataaaa tgcatagaaa gaggtaccac gattttgatt tttgccctaa
8880tgcctctctg cgtggttcct caatttttct acttcattcc tcatctcctc agagcattcc
8940tttccctcat gcttgaaaca cagatgaaag actgtgaatt ctaactgaga tgaaaacatc
9000cacaaccaca caacctctgg tgtggagtca cattctgtga aggcaaaaac taggccacgt
9060aatctatgtg tgcaagctac gtgtaagcta tgtgtgtgac aggacaatgt gaggaacata
9120ctatgtgcac aaggactgca gaataaacag gagcaaagtt tttgaagaaa acagagtaaa
9180atcccgtttt cctcttttgt tacattcttt acatatatct caaatttcct ctttggttag
9240aagcaagtaa tatttatgtt tcttggtact gtttgggttg aagaccattc tgggataaga
9300gaaattccag tggttcttcc cctaatcata aaatgtacag gtttagtttt tttgtaacac
9360agaaatctct tcatctttta tcttttgttg tgattcttta tagagagaga aacaagactt
9420actgacaata gcagcaagaa aatcaatctt ggaagaacaa gattgcagtt gcaaaaacaa
9480accaatgtcc ttgcccctac atcctcttcc ccataaattc tacattctct atctaccttg
9540tgcttgccaa catgatatac gtaaactctc ttttcctatt cattcttaaa ggaattatca
9600gaaatgtcct tcagccaagc tccgtggatt ctcaaactgc aatggttctg gttaatgcca
9660ttgtcttcaa aggactgtgg gagaaagcat ttaaggatga agacacacaa gcaatgcctt
9720tcagagtgac tgaggtatat gggcatacct tagagatgta atctagaatt tatgaagaga
9780gtagacatgt tgttatatga acactgcatt agcgtatctg ctcatttgtc tgcatctctt
9840tcagacactg tgttaaaagc agggaatttt ccttatgtct ctttcatcac aatattcctg
9900acattgcaaa gctcctgaga aataacttca gattcccact tttcctaggg aggtcttcct
9960ggatgagaac aatcaatcat cttaactgta actagatatt tctgcatcta agaataatct
10020ttgttaaaac tatattctct ctctcttttt tttttttttt ggttctccag caagaaagca
10080aacctgtgca gatgatgtgg atcc
101043116102DNAhuman 31gtcgacctta agtcctcaga cttggcaagg agaatgtaga
tttctacagt atatatgttt 60tcacaaaagg aaggagagaa acaaaagaaa atggcactga
ctaaacttca gctagtggta 120taggaaagta attctgctta acagagattg cagtgatctc
tatgtatgtc ctgaagaatt 180atgttgtact tttttccctc atttttaaat caaacagtgc
tttacagagg tcagaatggt 240ttctttactg tttgtcaatt ctattatttc aatacagaac
aatagcttct ataactgaaa 300tatatttgct attgtatatt atgattgtcc ctcgaaccat
gaacactcct ccagctgaat 360ttcacaattc ctctgtcatc tgccaggcca ttaagttatt
catggaagat ctttgaggaa 420cactgcaagt tcatatcata aacacatttg aaattgagta
ttgttttgca ttgtatggag 480ctatgttttg ctgtatcctc agaaaaaaag tttgttataa
agcattcaca cccataaaaa 540gatagattta aatattccag ctataggaaa gaaagtgcgt
ctgctcttca ctctagtctc 600agttggctcc ttcacatgca cgcttcttta tttctcctat
tttgtcaaga aaataatagg 660tcacgtcttg ttctcactta tgtcctgcct agcatggctc
agatgcacgt tgtacataca 720agaaggatca aatgaaacag acttctggtc tgttactaca
accatagtaa taagcacact 780aactaataat tgctaattat gttttccatc tccaaggttc
ccacattttt ctgttttctt 840aaagatccca ttatctggtt gtaactgaag ctcaatggaa
catgagcaat atttcccagt 900cttctctccc atccaacagt cctgatggat tagcagaaca
ggcagaaaac acattgttac 960ccagaattaa aaactaatat ttgctctcca ttcaatccaa
aatggaccta ttgaaactaa 1020aatctaaccc aatcccatta aatgatttct atggcgtcaa
aggtcaaact tctgaaggga 1080acctgtgggt gggtcacaat tcaggctata tattccccag
ggctcagcca gtgtctgtac 1140atacagctag aaagctgtat tgcctttagc actcaagctc
aaaaggtaag caactctctg 1200gaattacctt ctctctatat tagctcttac ttgcacctaa
actttaaaaa attaacaatt 1260attgtgttat gtgttgtatc tttaagggtg aagtacctgc
gtgatacccc ctataaaaac 1320ttctcacctg tgtatgcatt ctgcactatt ttattatgtg
taaaagcttt gtgtttgttt 1380tcaggaggct tattctttgt gcttaaaata tgtttttaat
ttcagaacat cttatcctgt 1440cgttcactat ctgatatgct ttgcagtttg cctgattaac
ttctagccct acagagtgca 1500cagagagcaa aatcatggtg ttcagtgaat tctggggagt
tattttaatg tgaaaattct 1560ctagaagttt aattcctgca aagtgcagct gctgatcact
acacaagata aaaatgtggg 1620gggtgcataa acgtatattc ttacaataat agatacatgt
gaacttgtat acagaaaaga 1680aaatgagaaa aatgtgtgtg cgtatactca cacacgtggt
cagtaaaaac ttttgagggg 1740tttaatacag aaaatccaat cctgaggccc cagcactcag
tacgcatata aagggctggg 1800ctctgaagga cttctgactt tcacagatta tataaatctc
aggaaagcaa ctagattcat 1860gctggctcca aaagctgtgc tttatataag cacactggct
atacaatagt tgtacagttc 1920agctctttat aatagaaaca gacagaacaa gtataaatct
tctattggtc tatgtcatga 1980acaagaattc attcagtggc tctgttttat agtaaacatt
gctattttat catgtctgca 2040tttctcttct gtctgaatgt caccactaaa atttaactcc
acagaaagtt tatactacag 2100tacacatgca tatctttgag caaagcaaac catacctgaa
agtgcaatag agcagaatat 2160gaattacatg cgtgtctttc tcctagacta catgacccca
tataaattac attccttatc 2220tattctgcca tcaccaaaac aaaggtaaaa atacttttga
agatctactc atagcaagta 2280gtgtgcaaca aacagatatt tctctacatt tatttttagg
gaataaaaat aagaaataaa 2340atagtcagca agcctctgct ttctcatata tctgtccaaa
cctaaagttt actgaaattt 2400gctctttgaa tttccagttt tgcaagccta tcagattgtg
ttttaatcag aggtactgaa 2460aagtatcaat gaattctagc tttcactgaa caaaaatatg
tagaggcaac tggcttctgg 2520gacagtttgc tacccaaaag acaactgaat gcaaatacat
aaatagattt atgaatatgg 2580ttttgaacat gcacatgaga ggtggatata gcaacagaca
cattaccaca gaattacttt 2640aaaactactt gttaacattt aattgcctaa aaactgctcg
taatttactg ttgtagccta 2700ccatagagta ccctgcatgg tactatgtac agcattccat
ccttacattt tcactgttct 2760gctgtttgct ctagacaact cagagttcac catgaggtct
ttgctaatct tggtgctttg 2820cttcctgccc ctggctgctc tggggcaaga ggaaggccaa
gtcgagggcc aagacgaaga 2880catcccacca atcacctgcg tacagaacgg cctcaggtac
catgaccgag acgtgtggaa 2940acccgagccc tgccggatct gcgtctgcga caacggcaag
gtgttgtgcg atgacgtgat 3000ctgtgacgag accaagaact gccccggcgc cgaagtcccc
gagggcgagt gctgtcccgt 3060ctgccccgac ggctcagagt cacccaccga ccaagaaacc
accggcgtcg agggacccaa 3120gggagacact ggcccccgag gcccaagggg acccgcaggc
ccccctggcc gagatggcat 3180ccctggacag cctggacttc ccggaccccc cggacccccc
ggacctcccg gaccccctgg 3240cctcggagga aactttgctc cccagctgtc ttatggctat
gatgagaaat caaccggagg 3300aatttccgtg cctggcccca tgggtccctc tggtcctcgt
ggtctccctg gcccccctgg 3360tgcacctggt ccccaaggct tccaaggtcc ccctggtgag
cctggcgagc ctggagcttc 3420aggtcccatg ggtccccgag gtcccccagg tccccctgga
aagaatggag atgatgggga 3480agctggaaaa cctggtcgtc ctggtgagcg tgggcctcct
gggcctcagg gtgctcgagg 3540attgcccgga acagctggcc tccctggaat gaagggacac
agaggtttca gtggtttgga 3600tggtgccaag ggagatgctg gtcctgctgg tcctaagggt
gagcctggca gccctggtga 3660aaatggagct cctggtcaga tgggcccccg tggcctgcct
ggtgagagag gtcgccctgg 3720agcccctggc cctgctggtg ctcgtggaaa tgatggtgct
actggtgctg ccgggccccc 3780tggtcccacc ggccccgctg gtcctcctgg cttccctggt
gctgttggtg ctaagggtga 3840agctggtccc caagggcccc gaggctctga aggtccccag
ggtgtgcgtg gtgagcctgg 3900cccccctggc cctgctggtg ctgctggccc tgctggaaac
cctggtgctg atggacagcc 3960tggtgctaaa ggtgccaatg gtgctcctgg tattgctggt
gctcctggct tccctggtgc 4020ccgaggcccc tctggacccc agggccccgg cggccctcct
ggtcccaagg gtaacagcgg 4080tgaacctggt gctcctggca gcaaaggaga cactggtgct
aagggagagc ctggccctgt 4140tggtgttcaa ggaccccctg gccctgctgg agaggaagga
aagcgaggag ctcgaggtga 4200acccggaccc actggcctgc ccggaccccc tggcgagcgt
ggtggacctg gtagccgtgg 4260tttccctggc gcagatggtg ttgctggtcc caagggtccc
gctggtgaac gtggttctcc 4320tggccctgct ggccccaaag gatctcctgg tgaagctggt
cgtcccggtg aagctggtct 4380gcctggtgcc aagggtctga ctggaagccc tggcagccct
ggtcctgatg gcaaaactgg 4440cccccctggt cccgccggtc aagatggtcg ccccggaccc
ccaggcccac ctggtgcccg 4500tggtcaggct ggtgtgatgg gattccctgg acctaaaggt
gctgctggag agcccggcaa 4560ggctggagag cgaggtgttc ccggaccccc tggcgctgtc
ggtcctgctg gcaaagatgg 4620agaggctgga gctcagggac cccctggccc tgctggtccc
gctggcgaga gaggtgaaca 4680aggccctgct ggctcccccg gattccaggg tctccctggt
cctgctggtc ctccaggtga 4740agcaggcaaa cctggtgaac agggtgttcc tggagacctt
ggcgcccctg gcccctctgg 4800agcaagaggc gagagaggtt tccctggcga gcgtggtgtg
caaggtcccc ctggtcctgc 4860tggtccccga ggggccaacg gtgctcccgg caacgatggt
gctaagggtg atgctggtgc 4920ccctggagct cccggtagcc agggcgcccc tggccttcag
ggaatgcctg gtgaacgtgg 4980tgcagctggt cttccagggc ctaagggtga cagaggtgat
gctggtccca aaggtgctga 5040tggctctcct ggcaaagatg gcgtccgtgg tctgactggc
cccattggtc ctcctggccc 5100tgctggtgcc cctggtgaca agggtgaaag tggtcccagc
ggccctgctg gtcccactgg 5160agctcgtggt gcccccggag accgtggtga gcctggtccc
cccggccctg ctggctttgc 5220tggcccccct ggtgctgacg gccaacctgg tgctaaaggc
gaacctggtg atgctggtgc 5280taaaggcgat gctggtcccc ctggccctgc cggacccgct
ggaccccctg gccccattgg 5340taatgttggt gctcctggag ccaaaggtgc tcgcggcagc
gctggtcccc ctggtgctac 5400tggtttccct ggtgctgctg gccgagtcgg tcctcctggc
ccctctggaa atgctggacc 5460ccctggccct cctggtcctg ctggcaaaga aggcggcaaa
ggtccccgtg gtgagactgg 5520ccctgctgga cgtcctggtg aagttggtcc ccctggtccc
cctggccctg ctggcgagaa 5580aggatcccct ggtgctgatg gtcctgctgg tgctcctggt
actcccgggc ctcaaggtat 5640tgctggacag cgtggtgtgg tcggcctgcc tggtcagaga
ggagagagag gcttccctgg 5700tcttcctggc ccctctggtg aacctggcaa acaaggtccc
tctggagcaa gtggtgaacg 5760tggtccccct ggtcccatgg gcccccctgg attggctgga
ccccctggtg aatctggacg 5820tgagggggct cctggtgccg aaggttcccc tggacgagac
ggttctcctg gcgccaaggg 5880tgaccgtggt gagaccggcc ccgctggacc ccctggtgct
cctggtgctc ctggtgcccc 5940tggccccgtt ggccctgctg gcaagagtgg tgatcgtggt
gagactggtc ctgctggtcc 6000cgccggtcct gtcggccctg ttggcgcccg tggccccgcc
ggaccccaag gcccccgtgg 6060tgacaagggt gagacaggcg aacagggcga cagaggcata
aagggtcacc gtggcttctc 6120tggcctccag ggtccccctg gccctcctgg ctctcctggt
gaacaaggtc cctctggagc 6180ctctggtcct gctggtcccc gaggtccccc tggctctgct
ggtgctcctg gcaaagatgg 6240actcaacggt ctccctggcc ccattgggcc ccctggtcct
cgcggtcgca ctggtgatgc 6300tggtcctgtt ggtccccccg gccctcctgg acctcctggt
ccccctggtc ctcccagcgc 6360tggtttcgac ttcagcttcc tgccccagcc acctcaagag
aaggctcacg atggtggccg 6420ctactaccgg gctgatgatg ccaatgtggt tcgtgaccgt
gacctcgagg tggacaccac 6480cctcaagagc ctgagccagc agatcgagaa catccggagc
ccagagggca gccgcaagaa 6540ccccgcccgc acctgccgtg acctcaagat gtgccactct
gactggaaga gtggagagta 6600ctggattgac cccaaccaag gctgcaacct ggatgccatc
aaagtcttct gcaacatgga 6660gactggtgag acctgcgtgt accccactca gcccagtgtg
gcccagaaga actggtacat 6720cagcaagaac cccaaggaca agaggcatgt ctggttcggc
gagagcatga ccgatggatt 6780ccagttcgag tatggcggcc agggctccga ccctgccgat
gtggccatcc agctgacctt 6840cctgcgcctg atgtccaccg aggcctccca gaacatcacc
taccactgca agaacagcgt 6900ggcctacatg gaccagcaga ctggcaacct caagaaggcc
ctgctcctcc agggctccaa 6960cgagatcgag atccgcgccg agggcaacag ccgcttcacc
tacagcgtca ctgtcgatgg 7020ctgcacgagt cacaccggag cctggggcaa gacagtgatt
gaatacaaaa ccaccaagac 7080ctcccgcctg cccatcatcg atgtggcccc cttggacgtt
ggtgccccag accaggaatt 7140cggcttcgac gttggccctg tctgcttcct gcgggccaag
agagcccccg tgaagcagac 7200cctgaacttc gacctgctga agctggccgg cgacgtggag
tccaaccccg gccccatgcg 7260ctccctcttg attctcgtct tgtgtttttt gccactggcc
gctctcggcc aatctttaca 7320agaggaaact gtaagaaagg gcccagccgg agatagagga
ccacgtggag aaaggggtcc 7380accaggcccc ccaggcagag atggtgaaga tggtcccaca
ggccctcctg gtccacctgg 7440tcctcctggc ccccctggtc tcggtgggaa ctttgctgct
cagtatgatg gaaaaggagt 7500tggacttggc cctggaccaa tgggcttaat gggacctaga
ggcccacctg gtgcagctgg 7560agccccaggc cctcaaggtt tccaaggacc tgctggtgag
cctggtgaac ctggtcaaac 7620tggtcctgca ggtgctcgtg gtccagctgg ccctcctggc
aaggctggtg aagatggtca 7680ccctggaaaa cccggacgac ctggtgagag aggagttgtt
ggaccacagg gtgctcgtgg 7740tttccctgga actcctggac ttcctggctt caaaggcatt
aggggacaca atggtctgga 7800tggattgaag ggacagcccg gtgctcctgg tgtgaagggt
gaacctggtg cccctggtga 7860aaatggaact ccaggtcaaa caggagcccg tgggcttcct
ggtgagagag gacgtgttgg 7920tgcccctggc ccagctggtg cccgtggcag tgatggaagt
gtgggtcccg tgggtcctgc 7980tggtcccatt gggtctgctg gccctccagg cttcccaggt
gcccctggcc ccaagggtga 8040aattggagct gttggtaacg ctggtcctgc tggtcccgcc
ggtccccgtg gtgaagtggg 8100tcttccaggc ctctccggcc ccgttggacc tcctggtaat
cctggagcaa acggccttac 8160tggtgccaag ggtgctgctg gccttcccgg cgttgctggg
gctcccggcc tccctggacc 8220ccgcggtatt cctggccctg ttggtgctgc cggtgctact
ggtgccagag gacttgttgg 8280tgagcctggt ccagctggct ccaaaggaga gagcggtaac
aagggtgagc ccggctctgc 8340tgggccccaa ggtcctcctg gtcccagtgg tgaagaagga
aagagaggcc ctaatgggga 8400agctggatct gccggccctc caggacctcc tgggctgaga
ggtagtcctg gttctcgtgg 8460tcttcctgga gctgatggca gagctggcgt catgggccct
cctggtagtc gtggtgcaag 8520tggccctgct ggagtccgag gacctaatgg agatgctggt
cgccctgggg agcctggtct 8580catgggaccc agaggtcttc ctggttcccc tggaaatatc
ggccccgctg gaaaagaagg 8640tcctgtcggc ctccctggca tcgacggcag gcctggccca
attggcccag ctggagcaag 8700aggagagcct ggcaacattg gattccctgg acccaaaggc
cccactggtg atcctggcaa 8760aaacggtgat aaaggtcatg ctggtcttgc tggtgctcgg
ggtgctccag gtcctgatgg 8820aaacaatggt gctcagggac ctcctggacc acagggtgtt
caaggtggaa aaggtgaaca 8880gggtcccgct ggtcctccag gcttccaggg tctgcctggc
ccctcaggtc ccgctggtga 8940agttggcaaa ccaggagaaa ggggtctcca tggtgagttt
ggtctccctg gtcctgctgg 9000tccaagaggg gaacgcggtc ccccaggtga gagtggtgct
gccggtccta ctggtcctat 9060tggaagccga ggtccttctg gacccccagg gcctgatgga
aacaagggtg aacctggtgt 9120ggttggtgct gtgggcactg ctggtccatc tggtcctagt
ggactcccag gagagagggg 9180tgctgctggc atacctggag gcaagggaga aaagggtgaa
cctggtctca gaggtgaaat 9240tggtaaccct ggcagagatg gtgctcgtgg tgctcctggt
gctgtaggtg cccctggtcc 9300tgctggagcc acaggtgacc ggggcgaagc tggggctgct
ggtcctgctg gtcctgctgg 9360tcctcgggga agccctggtg aacgtggtga ggtcggtcct
gctggcccca atggatttgc 9420tggtcctgct ggtgctgctg gtcaacctgg tgctaaagga
gaaagaggag ccaaagggcc 9480taagggtgaa aacggtgttg ttggtcccac aggccccgtt
ggagctgctg gcccagctgg 9540tccaaatggt ccccccggtc ctgctggaag tcgtggtgat
ggaggccccc ctggtatgac 9600tggtttccct ggtgctgctg gacggactgg tcccccagga
ccctctggta tttctggccc 9660tcctggtccc cctggtcctg ctgggaaaga agggcttcgt
ggtcctcgtg gtgaccaagg 9720tccagttggc cgaactggag aagtaggtgc agttggtccc
cctggcttcg ctggtgagaa 9780gggtccctct ggagaggctg gtactgctgg acctcctggc
actccaggtc ctcagggtct 9840tcttggtgct cctggtattc tgggtctccc tggctcgaga
ggtgaacgtg gtctaccagg 9900tgttgctggt gctgtgggtg aacctggtcc tcttggcatt
gccggccctc ctggggcccg 9960tggtcctcct ggtgctgtgg gtagtcctgg agtcaacggt
gctcctggtg aagctggtcg 10020tgatggcaac cctgggaacg atggtccccc aggtcgcgat
ggtcaacccg gacacaaggg 10080agagcgcggt taccctggca atattggtcc cgttggtgct
gcaggtgcac ctggtcctca 10140tggccccgtg ggtcctgctg gcaaacatgg aaaccgtggt
gaaactggtc cttctggtcc 10200tgttggtcct gctggtgctg ttggcccaag aggtcctagt
ggcccacaag gcattcgtgg 10260cgataaggga gagcccggtg aaaaggggcc cagaggtctt
cctggcttaa agggacacaa 10320tggattgcaa ggtctgcctg gtatcgctgg tcaccatggt
gatcaaggtg ctcctggctc 10380cgtgggtcct gctggtccta ggggccctgc tggtccttct
ggccctgctg gaaaagatgg 10440tcgcactgga catcctggta cagttggacc tgctggcatt
cgaggccctc agggtcacca 10500aggccctgct ggcccccctg gtccccctgg ccctcctgga
cctccaggtg taagcggtgg 10560tggttatgac tttggttacg atggagactt ctacagggct
gaccagcctc gctcagcacc 10620ttctctcaga cccaaggact atgaagttga tgctactctg
aagtctctca acaaccagat 10680tgagaccctt cttactcctg aaggctctag aaagaaccca
gctcgcacat gccgtgactt 10740gagactcagc cacccagagt ggagcagtgg ttactactgg
attgacccta accaaggatg 10800cactatggat gctatcaaag tatactgtga tttctctact
ggcgaaacct gtatccgggc 10860ccaacctgaa aacatcccag ccaagaactg gtataggagc
tccaaggaca agaaacacgt 10920ctggctagga gaaactatca atgctggcag ccagtttgaa
tataatgtag aaggagtgac 10980ttccaaggaa atggctaccc aacttgcctt catgcgcctg
ctggccaact atgcctctca 11040gaacatcacc taccactgca agaacagcat tgcatacatg
gatgaggaga ctggcaacct 11100gaaaaaggct gtcattctac agggctctaa tgatgttgaa
cttgttgctg agggcaacag 11160caggttcact tacactgttc ttgtagatgg ctgctctaaa
aagacaaatg aatggggaaa 11220gacaatcatt gaatacaaaa caaataagcc atcacgcctg
cccttccttg atattgcacc 11280tttggacatc ggtggtgctg accaggaatt ctttgtggac
attggcccag tctgtttcaa 11340ataagtcgac tcgctgatca gcctcgactt tgccttctag
ttgccagcca tctgttgttt 11400gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac
tcccactgtc ctttcctaat 11460aaaatgagga aattgcatcg cattgtctga gtaggtgtca
ttctattctg gggggtgggg 11520tggggcagga cagcaagggg gaggattggg aagacaatag
caggcactcg accctcgacc 11580tcgaaattct accgggtagg ggaggcgctt ttcccaaggc
agtctggagc atgcgcttta 11640gcagccccgc tgggcacttg gcgctacaca agtggcctct
ggcctcgcac acattccaca 11700tccaccggta ggcgccaacc ggctccgttc tttggtggcc
ccttcgcgcc accttctact 11760cctcccctag tcaggaagtt cccccccgcc ccgcagctcg
cgtcgtgcag gacgtgacaa 11820atggaagtag cacgtctcac tagtctcgtg cagatggaca
gcaccgctga gcaatggaag 11880cgggtaggcc tttggggcag cggccaatag cagctttgct
ccttcgcttt ctgggctcag 11940aggctgggaa ggggtgggtc cgggggcggg ctcaggggcg
ggctcagggg cggggcgggc 12000gcccgaaggt cctccggagg cccggcattc tgcacgcttc
aaaagcgcac gtctgccgcg 12060ctgttctcct cttcctcatc tccgggcctt tcgacctgca
tccatctaga tctagatcag 12120cttaccatga ccgagtacaa gcccacggtg cgcctcgcca
cccgcgacga cgtccccagg 12180gccgtacgca ccctcgccgc cgcgttcgcc gactaccccg
ccacgcgcca caccgtcgat 12240ccggaccgcc acatcgagcg ggtcaccgag ctgcaagaac
tcttcctcac gcgcgtcggg 12300ctcgacatcg gcaaggtgtg ggtcgcggac gacggcgccg
cggtggcggt ctggaccacg 12360ccggagagcg tcgaagcggg ggcggtgttc gccgagatcg
gcccgcgcat ggccgagttg 12420agcggttccc ggctggccgc gcagcaacag atggaaggcc
tcctggcgcc gcaccggccc 12480aaggagcccg cgtggttcct ggccaccgtc ggcgtctcgc
ccgaccacca gggcaagggt 12540ctgggcagcg ccgtcgtgct ccccggagtg gaggcggccg
agcgcgccgg ggtgcccgcc 12600ttcctggaga cctccgcgcc ccgcaacctc cccttctacg
agcggctcgg cttcaccgtc 12660accgccgacg tcgaggtgcc cgaaggaccg cgcacctggt
gcatgacccg caagcccggt 12720gcctgacgcc cgccccacga cccgcagcgc ccgaccgaaa
ggagcgcacg accccatgca 12780tcgatgatgc caataaagat atcattgatg agtttggaca
aaccacaact agaatgcagt 12840gaaaaaaatg ctttatttgt gaaatttgtg atgctattgc
tttatttgta accattataa 12900gctgcatatc gaattcccgc ggccgcggga attcgattcc
atcggtgcag caagcatgga 12960attttgtttt gatgtattca aggagctcaa agtccaccat
gccaatgaga acatcttcta 13020ctgccccatt gccatcatgt cagctctagc catggtatac
ctgggtgcaa aagacagcac 13080caggacacag ataaataagg tgagcctaca gttaaagatt
aaaacctttg ccctgctcaa 13140tggagccaca gcacttaatt gtatgataat gtcccttgga
aactgcatag ctcagaggct 13200gaaaatctga aaccagagtt atctaaaagt gtggccacct
ccaactccca gagtgttacc 13260caaatgcact agctagaaat cttgaaactg gattgcataa
cttctttttg tcataaccat 13320tatttcagct actattattt tcaattacag gttgttcgct
ttgataaact tccaggattc 13380ggagacagta ttgaagctca ggtacagaaa taatttcacc
tccttctcta tgtccctttc 13440ctctggaagc aaaatacagc agatgaagca atctcttagc
tgttccaagc cctctctgat 13500gagcagctag tgctctgcat ccagcagttg ggagaacact
gttcataaga acagagaaaa 13560agaaggaagt aacaggggat tcagaacaaa cagaagataa
aactcaggac aaaaataccg 13620tgtgaatgag gaaacttgtg gatatttgta cgcttaagca
agacagctag atgattctgg 13680ataaatgggt ctggttggaa aagaaggaaa gcctggctga
tctgctggag ctagattatt 13740gcagcaggta ggcaggagtt ccctagagaa aagtatgagg
gaattacaga agaaaaacag 13800cacaaaattg taaatattgg aaaaggacca catcagtgta
gttactagca gtaagacaga 13860caggatgaaa aatagttttg taaacagaag tatctaacta
ctttactctg ttcatacact 13920atgtaaaacc tactaagtaa taaaactaga ataacaacat
ctttctttct ctttgtattc 13980agtgtggcac atctgtaaac gttcactctt cacttagaga
catcctcaac caaatcacca 14040aaccaaatga tgtttattcg ttcagccttg ccagtagact
ttatgctgaa gagagatacc 14100caatcctgcc agtaagttgc tctaaaatct gatctgagtg
tatttccatg ccaaagctct 14160accattctgt aatgcaaaaa cagtcagagt tccacatgtt
tcactaagaa aatttctttt 14220tctcttgttt ttacaaatga aagagaggac aaataacatt
tctctatcac cgacctgaaa 14280ctctacagtc ttcagagaat gaatggcttg ctaaaagaat
gtcaaatctt accatacagc 14340tatttcatat tacactacta aatacactat aaggcatagc
atgtagtaat acactgtaaa 14400atagcttttt acactactat attattaata tctgttaatt
ccagtcttgc atttcacatt 14460tgcaaaacgt tttgaaattc gtatctgaaa gctgaatact
cttgctttac aggaatactt 14520gcagtgtgtg aaggaactgt atagaggagg cttggaacct
atcaactttc aaacagctgc 14580agatcaagcc agagagctca tcaattcctg ggtagaaagt
cagacaaatg gtaaggtaga 14640acatgctttg tacatagtga gagttggttc accctaatac
tgagaacctg gatatagctc 14700agccagcgtg ctttgcgttc aagcttacca gagctgttgt
atgcctgtta agcagggcat 14760acagtcatga ggctcttgaa aaatcttaac agacaaaggg
caatggaaaa tcggagttaa 14820gggatggtag ggataaaatg catagaaaga ggtaccacga
ttttgatttt tgccctaatg 14880cctctctgcg tggttcctca atttttctac ttcattcctc
atctcctcag agcattcctt 14940tccctcatgc ttgaaacaca gatgaaagac tgtgaattct
aactgagatg aaaacatcca 15000caaccacaca acctctggtg tggagtcaca ttctgtgaag
gcaaaaacta ggccacgtaa 15060tctatgtgtg caagctacgt gtaagctatg tgtgtgacag
gacaatgtga ggaacatact 15120atgtgcacaa ggactgcaga ataaacagga gcaaagtttt
tgaagaaaac agagtaaaat 15180cccgttttcc tcttttgtta cattctttac atatatctca
aatttcctct ttggttagaa 15240gcaagtaata tttatgtttc ttggtactgt ttgggttgaa
gaccattctg ggataagaga 15300aattccagtg gttcttcccc taatcataaa atgtacaggt
ttagtttttt tgtaacacag 15360aaatctcttc atcttttatc ttttgttgtg attctttata
gagagagaaa caagacttac 15420tgacaatagc agcaagaaaa tcaatcttgg aagaacaaga
ttgcagttgc aaaaacaaac 15480caatgtcctt gcccctacat cctcttcccc ataaattcta
cattctctat ctaccttgtg 15540cttgccaaca tgatatacgt aaactctctt ttcctattca
ttcttaaagg aattatcaga 15600aatgtccttc agccaagctc cgtggattct caaactgcaa
tggttctggt taatgccatt 15660gtcttcaaag gactgtggga gaaagcattt aaggatgaag
acacacaagc aatgcctttc 15720agagtgactg aggtatatgg gcatacctta gagatgtaat
ctagaattta tgaagagagt 15780agacatgttg ttatatgaac actgcattag cgtatctgct
catttgtctg catctctttc 15840agacactgtg ttaaaagcag ggaattttcc ttatgtctct
ttcatcacaa tattcctgac 15900attgcaaagc tcctgagaaa taacttcaga ttcccacttt
tcctagggag gtcttcctgg 15960atgagaacaa tcaatcatct taactgtaac tagatatttc
tgcatctaag aataatcttt 16020gttaaaacta tattctctct ctcttttttt ttttttttgg
ttctccagca agaaagcaaa 16080cctgtgcaga tgatgtggat cc
161023216474DNAhumanmisc_feature(13276)..(13278)n is
a, c, g, or t 32gtcgacctta agtcctcaga cttggcaagg agaatgtaga tttctacagt
atatatgttt 60tcacaaaagg aaggagagaa acaaaagaaa atggcactga ctaaacttca
gctagtggta 120taggaaagta attctgctta acagagattg cagtgatctc tatgtatgtc
ctgaagaatt 180atgttgtact tttttccctc atttttaaat caaacagtgc tttacagagg
tcagaatggt 240ttctttactg tttgtcaatt ctattatttc aatacagaac aatagcttct
ataactgaaa 300tatatttgct attgtatatt atgattgtcc ctcgaaccat gaacactcct
ccagctgaat 360ttcacaattc ctctgtcatc tgccaggcca ttaagttatt catggaagat
ctttgaggaa 420cactgcaagt tcatatcata aacacatttg aaattgagta ttgttttgca
ttgtatggag 480ctatgttttg ctgtatcctc agaaaaaaag tttgttataa agcattcaca
cccataaaaa 540gatagattta aatattccag ctataggaaa gaaagtgcgt ctgctcttca
ctctagtctc 600agttggctcc ttcacatgca cgcttcttta tttctcctat tttgtcaaga
aaataatagg 660tcacgtcttg ttctcactta tgtcctgcct agcatggctc agatgcacgt
tgtacataca 720agaaggatca aatgaaacag acttctggtc tgttactaca accatagtaa
taagcacact 780aactaataat tgctaattat gttttccatc tccaaggttc ccacattttt
ctgttttctt 840aaagatccca ttatctggtt gtaactgaag ctcaatggaa catgagcaat
atttcccagt 900cttctctccc atccaacagt cctgatggat tagcagaaca ggcagaaaac
acattgttac 960ccagaattaa aaactaatat ttgctctcca ttcaatccaa aatggaccta
ttgaaactaa 1020aatctaaccc aatcccatta aatgatttct atggcgtcaa aggtcaaact
tctgaaggga 1080acctgtgggt gggtcacaat tcaggctata tattccccag ggctcagcca
gtgtctgtac 1140atacagctag aaagctgtat tgcctttagc actcaagctc aaaaggtaag
caactctctg 1200gaattacctt ctctctatat tagctcttac ttgcacctaa actttaaaaa
attaacaatt 1260attgtgttat gtgttgtatc tttaagggtg aagtacctgc gtgatacccc
ctataaaaac 1320ttctcacctg tgtatgcatt ctgcactatt ttattatgtg taaaagcttt
gtgtttgttt 1380tcaggaggct tattctttgt gcttaaaata tgtttttaat ttcagaacat
cttatcctgt 1440cgttcactat ctgatatgct ttgcagtttg cctgattaac ttctagccct
acagagtgca 1500cagagagcaa aatcatggtg ttcagtgaat tctggggagt tattttaatg
tgaaaattct 1560ctagaagttt aattcctgca aagtgcagct gctgatcact acacaagata
aaaatgtggg 1620gggtgcataa acgtatattc ttacaataat agatacatgt gaacttgtat
acagaaaaga 1680aaatgagaaa aatgtgtgtg cgtatactca cacacgtggt cagtaaaaac
ttttgagggg 1740tttaatacag aaaatccaat cctgaggccc cagcactcag tacgcatata
aagggctggg 1800ctctgaagga cttctgactt tcacagatta tataaatctc aggaaagcaa
ctagattcat 1860gctggctcca aaagctgtgc tttatataag cacactggct atacaatagt
tgtacagttc 1920agctctttat aatagaaaca gacagaacaa gtataaatct tctattggtc
tatgtcatga 1980acaagaattc attcagtggc tctgttttat agtaaacatt gctattttat
catgtctgca 2040tttctcttct gtctgaatgt caccactaaa atttaactcc acagaaagtt
tatactacag 2100tacacatgca tatctttgag caaagcaaac catacctgaa agtgcaatag
agcagaatat 2160gaattacatg cgtgtctttc tcctagacta catgacccca tataaattac
attccttatc 2220tattctgcca tcaccaaaac aaaggtaaaa atacttttga agatctactc
atagcaagta 2280gtgtgcaaca aacagatatt tctctacatt tatttttagg gaataaaaat
aagaaataaa 2340atagtcagca agcctctgct ttctcatata tctgtccaaa cctaaagttt
actgaaattt 2400gctctttgaa tttccagttt tgcaagccta tcagattgtg ttttaatcag
aggtactgaa 2460aagtatcaat gaattctagc tttcactgaa caaaaatatg tagaggcaac
tggcttctgg 2520gacagtttgc tacccaaaag acaactgaat gcaaatacat aaatagattt
atgaatatgg 2580ttttgaacat gcacatgaga ggtggatata gcaacagaca cattaccaca
gaattacttt 2640aaaactactt gttaacattt aattgcctaa aaactgctcg taatttactg
ttgtagccta 2700ccatagagta ccctgcatgg tactatgtac agcattccat ccttacattt
tcactgttct 2760gctgtttgct ctagacaact cagagttcac catgaggtct ttgctaatct
tggtgctttg 2820cttcctgccc ctggctgctc tggggcaaga ggaaggccaa gtcgagggcc
aagacgaaga 2880catcccacca atcacctgcg tacagaacgg cctcaggtac catgaccgag
acgtgtggaa 2940acccgagccc tgccggatct gcgtctgcga caacggcaag gtgttgtgcg
atgacgtgat 3000ctgtgacgag accaagaact gccccggcgc cgaagtcccc gagggcgagt
gctgtcccgt 3060ctgccccgac ggctcagagt cacccaccga ccaagaaacc accggcgtcg
agggacccaa 3120gggagacact ggcccccgag gcccaagggg acccgcaggc ccccctggcc
gagatggcat 3180ccctggacag cctggacttc ccggaccccc cggacccccc ggacctcccg
gaccccctgg 3240cctcggagga aactttgctc cccagctgtc ttatggctat gatgagaaat
caaccggagg 3300aatttccgtg cctggcccca tgggtccctc tggtcctcgt ggtctccctg
gcccccctgg 3360tgcacctggt ccccaaggct tccaaggtcc ccctggtgag cctggcgagc
ctggagcttc 3420aggtcccatg ggtccccgag gtcccccagg tccccctgga aagaatggag
atgatgggga 3480agctggaaaa cctggtcgtc ctggtgagcg tgggcctcct gggcctcagg
gtgctcgagg 3540attgcccgga acagctggcc tccctggaat gaagggacac agaggtttca
gtggtttgga 3600tggtgccaag ggagatgctg gtcctgctgg tcctaagggt gagcctggca
gccctggtga 3660aaatggagct cctggtcaga tgggcccccg tggcctgcct ggtgagagag
gtcgccctgg 3720agcccctggc cctgctggtg ctcgtggaaa tgatggtgct actggtgctg
ccgggccccc 3780tggtcccacc ggccccgctg gtcctcctgg cttccctggt gctgttggtg
ctaagggtga 3840agctggtccc caagggcccc gaggctctga aggtccccag ggtgtgcgtg
gtgagcctgg 3900cccccctggc cctgctggtg ctgctggccc tgctggaaac cctggtgctg
atggacagcc 3960tggtgctaaa ggtgccaatg gtgctcctgg tattgctggt gctcctggct
tccctggtgc 4020ccgaggcccc tctggacccc agggccccgg cggccctcct ggtcccaagg
gtaacagcgg 4080tgaacctggt gctcctggca gcaaaggaga cactggtgct aagggagagc
ctggccctgt 4140tggtgttcaa ggaccccctg gccctgctgg agaggaagga aagcgaggag
ctcgaggtga 4200acccggaccc actggcctgc ccggaccccc tggcgagcgt ggtggacctg
gtagccgtgg 4260tttccctggc gcagatggtg ttgctggtcc caagggtccc gctggtgaac
gtggttctcc 4320tggccctgct ggccccaaag gatctcctgg tgaagctggt cgtcccggtg
aagctggtct 4380gcctggtgcc aagggtctga ctggaagccc tggcagccct ggtcctgatg
gcaaaactgg 4440cccccctggt cccgccggtc aagatggtcg ccccggaccc ccaggcccac
ctggtgcccg 4500tggtcaggct ggtgtgatgg gattccctgg acctaaaggt gctgctggag
agcccggcaa 4560ggctggagag cgaggtgttc ccggaccccc tggcgctgtc ggtcctgctg
gcaaagatgg 4620agaggctgga gctcagggac cccctggccc tgctggtccc gctggcgaga
gaggtgaaca 4680aggccctgct ggctcccccg gattccaggg tctccctggt cctgctggtc
ctccaggtga 4740agcaggcaaa cctggtgaac agggtgttcc tggagacctt ggcgcccctg
gcccctctgg 4800agcaagaggc gagagaggtt tccctggcga gcgtggtgtg caaggtcccc
ctggtcctgc 4860tggtccccga ggggccaacg gtgctcccgg caacgatggt gctaagggtg
atgctggtgc 4920ccctggagct cccggtagcc agggcgcccc tggccttcag ggaatgcctg
gtgaacgtgg 4980tgcagctggt cttccagggc ctaagggtga cagaggtgat gctggtccca
aaggtgctga 5040tggctctcct ggcaaagatg gcgtccgtgg tctgactggc cccattggtc
ctcctggccc 5100tgctggtgcc cctggtgaca agggtgaaag tggtcccagc ggccctgctg
gtcccactgg 5160agctcgtggt gcccccggag accgtggtga gcctggtccc cccggccctg
ctggctttgc 5220tggcccccct ggtgctgacg gccaacctgg tgctaaaggc gaacctggtg
atgctggtgc 5280taaaggcgat gctggtcccc ctggccctgc cggacccgct ggaccccctg
gccccattgg 5340taatgttggt gctcctggag ccaaaggtgc tcgcggcagc gctggtcccc
ctggtgctac 5400tggtttccct ggtgctgctg gccgagtcgg tcctcctggc ccctctggaa
atgctggacc 5460ccctggccct cctggtcctg ctggcaaaga aggcggcaaa ggtccccgtg
gtgagactgg 5520ccctgctgga cgtcctggtg aagttggtcc ccctggtccc cctggccctg
ctggcgagaa 5580aggatcccct ggtgctgatg gtcctgctgg tgctcctggt actcccgggc
ctcaaggtat 5640tgctggacag cgtggtgtgg tcggcctgcc tggtcagaga ggagagagag
gcttccctgg 5700tcttcctggc ccctctggtg aacctggcaa acaaggtccc tctggagcaa
gtggtgaacg 5760tggtccccct ggtcccatgg gcccccctgg attggctgga ccccctggtg
aatctggacg 5820tgagggggct cctggtgccg aaggttcccc tggacgagac ggttctcctg
gcgccaaggg 5880tgaccgtggt gagaccggcc ccgctggacc ccctggtgct cctggtgctc
ctggtgcccc 5940tggccccgtt ggccctgctg gcaagagtgg tgatcgtggt gagactggtc
ctgctggtcc 6000cgccggtcct gtcggccctg ttggcgcccg tggccccgcc ggaccccaag
gcccccgtgg 6060tgacaagggt gagacaggcg aacagggcga cagaggcata aagggtcacc
gtggcttctc 6120tggcctccag ggtccccctg gccctcctgg ctctcctggt gaacaaggtc
cctctggagc 6180ctctggtcct gctggtcccc gaggtccccc tggctctgct ggtgctcctg
gcaaagatgg 6240actcaacggt ctccctggcc ccattgggcc ccctggtcct cgcggtcgca
ctggtgatgc 6300tggtcctgtt ggtccccccg gccctcctgg acctcctggt ccccctggtc
ctcccagcgc 6360tggtttcgac ttcagcttcc tgccccagcc acctcaagag aaggctcacg
atggtggccg 6420ctactaccgg gctgatgatg ccaatgtggt tcgtgaccgt gacctcgagg
tggacaccac 6480cctcaagagc ctgagccagc agatcgagaa catccggagc ccagagggca
gccgcaagaa 6540ccccgcccgc acctgccgtg acctcaagat gtgccactct gactggaaga
gtggagagta 6600ctggattgac cccaaccaag gctgcaacct ggatgccatc aaagtcttct
gcaacatgga 6660gactggtgag acctgcgtgt accccactca gcccagtgtg gcccagaaga
actggtacat 6720cagcaagaac cccaaggaca agaggcatgt ctggttcggc gagagcatga
ccgatggatt 6780ccagttcgag tatggcggcc agggctccga ccctgccgat gtggccatcc
agctgacctt 6840cctgcgcctg atgtccaccg aggcctccca gaacatcacc taccactgca
agaacagcgt 6900ggcctacatg gaccagcaga ctggcaacct caagaaggcc ctgctcctcc
agggctccaa 6960cgagatcgag atccgcgccg agggcaacag ccgcttcacc tacagcgtca
ctgtcgatgg 7020ctgcacgagt cacaccggag cctggggcaa gacagtgatt gaatacaaaa
ccaccaagac 7080ctcccgcctg cccatcatcg atgtggcccc cttggacgtt ggtgccccag
accaggaatt 7140cggcttcgac gttggccctg tctgcttcct gcgggccaag agagcccccg
tgaagcagac 7200cctgaacttc gacctgctga agctggccgg cgacgtggag tccaaccccg
gccccatgcg 7260ctccctcttg attctcgtct tgtgtttttt gccactggcc gctctcggcc
aatctttaca 7320agaggaaact gtaagaaagg gcccagccgg agatagagga ccacgtggag
aaaggggtcc 7380accaggcccc ccaggcagag atggtgaaga tggtcccaca ggccctcctg
gtccacctgg 7440tcctcctggc ccccctggtc tcggtgggaa ctttgctgct cagtatgatg
gaaaaggagt 7500tggacttggc cctggaccaa tgggcttaat gggacctaga ggcccacctg
gtgcagctgg 7560agccccaggc cctcaaggtt tccaaggacc tgctggtgag cctggtgaac
ctggtcaaac 7620tggtcctgca ggtgctcgtg gtccagctgg ccctcctggc aaggctggtg
aagatggtca 7680ccctggaaaa cccggacgac ctggtgagag aggagttgtt ggaccacagg
gtgctcgtgg 7740tttccctgga actcctggac ttcctggctt caaaggcatt aggggacaca
atggtctgga 7800tggattgaag ggacagcccg gtgctcctgg tgtgaagggt gaacctggtg
cccctggtga 7860aaatggaact ccaggtcaaa caggagcccg tgggcttcct ggtgagagag
gacgtgttgg 7920tgcccctggc ccagctggtg cccgtggcag tgatggaagt gtgggtcccg
tgggtcctgc 7980tggtcccatt gggtctgctg gccctccagg cttcccaggt gcccctggcc
ccaagggtga 8040aattggagct gttggtaacg ctggtcctgc tggtcccgcc ggtccccgtg
gtgaagtggg 8100tcttccaggc ctctccggcc ccgttggacc tcctggtaat cctggagcaa
acggccttac 8160tggtgccaag ggtgctgctg gccttcccgg cgttgctggg gctcccggcc
tccctggacc 8220ccgcggtatt cctggccctg ttggtgctgc cggtgctact ggtgccagag
gacttgttgg 8280tgagcctggt ccagctggct ccaaaggaga gagcggtaac aagggtgagc
ccggctctgc 8340tgggccccaa ggtcctcctg gtcccagtgg tgaagaagga aagagaggcc
ctaatgggga 8400agctggatct gccggccctc caggacctcc tgggctgaga ggtagtcctg
gttctcgtgg 8460tcttcctgga gctgatggca gagctggcgt catgggccct cctggtagtc
gtggtgcaag 8520tggccctgct ggagtccgag gacctaatgg agatgctggt cgccctgggg
agcctggtct 8580catgggaccc agaggtcttc ctggttcccc tggaaatatc ggccccgctg
gaaaagaagg 8640tcctgtcggc ctccctggca tcgacggcag gcctggccca attggcccag
ctggagcaag 8700aggagagcct ggcaacattg gattccctgg acccaaaggc cccactggtg
atcctggcaa 8760aaacggtgat aaaggtcatg ctggtcttgc tggtgctcgg ggtgctccag
gtcctgatgg 8820aaacaatggt gctcagggac ctcctggacc acagggtgtt caaggtggaa
aaggtgaaca 8880gggtcccgct ggtcctccag gcttccaggg tctgcctggc ccctcaggtc
ccgctggtga 8940agttggcaaa ccaggagaaa ggggtctcca tggtgagttt ggtctccctg
gtcctgctgg 9000tccaagaggg gaacgcggtc ccccaggtga gagtggtgct gccggtccta
ctggtcctat 9060tggaagccga ggtccttctg gacccccagg gcctgatgga aacaagggtg
aacctggtgt 9120ggttggtgct gtgggcactg ctggtccatc tggtcctagt ggactcccag
gagagagggg 9180tgctgctggc atacctggag gcaagggaga aaagggtgaa cctggtctca
gaggtgaaat 9240tggtaaccct ggcagagatg gtgctcgtgg tgctcctggt gctgtaggtg
cccctggtcc 9300tgctggagcc acaggtgacc ggggcgaagc tggggctgct ggtcctgctg
gtcctgctgg 9360tcctcgggga agccctggtg aacgtggtga ggtcggtcct gctggcccca
atggatttgc 9420tggtcctgct ggtgctgctg gtcaacctgg tgctaaagga gaaagaggag
ccaaagggcc 9480taagggtgaa aacggtgttg ttggtcccac aggccccgtt ggagctgctg
gcccagctgg 9540tccaaatggt ccccccggtc ctgctggaag tcgtggtgat ggaggccccc
ctggtatgac 9600tggtttccct ggtgctgctg gacggactgg tcccccagga ccctctggta
tttctggccc 9660tcctggtccc cctggtcctg ctgggaaaga agggcttcgt ggtcctcgtg
gtgaccaagg 9720tccagttggc cgaactggag aagtaggtgc agttggtccc cctggcttcg
ctggtgagaa 9780gggtccctct ggagaggctg gtactgctgg acctcctggc actccaggtc
ctcagggtct 9840tcttggtgct cctggtattc tgggtctccc tggctcgaga ggtgaacgtg
gtctaccagg 9900tgttgctggt gctgtgggtg aacctggtcc tcttggcatt gccggccctc
ctggggcccg 9960tggtcctcct ggtgctgtgg gtagtcctgg agtcaacggt gctcctggtg
aagctggtcg 10020tgatggcaac cctgggaacg atggtccccc aggtcgcgat ggtcaacccg
gacacaaggg 10080agagcgcggt taccctggca atattggtcc cgttggtgct gcaggtgcac
ctggtcctca 10140tggccccgtg ggtcctgctg gcaaacatgg aaaccgtggt gaaactggtc
cttctggtcc 10200tgttggtcct gctggtgctg ttggcccaag aggtcctagt ggcccacaag
gcattcgtgg 10260cgataaggga gagcccggtg aaaaggggcc cagaggtctt cctggcttaa
agggacacaa 10320tggattgcaa ggtctgcctg gtatcgctgg tcaccatggt gatcaaggtg
ctcctggctc 10380cgtgggtcct gctggtccta ggggccctgc tggtccttct ggccctgctg
gaaaagatgg 10440tcgcactgga catcctggta cagttggacc tgctggcatt cgaggccctc
agggtcacca 10500aggccctgct ggcccccctg gtccccctgg ccctcctgga cctccaggtg
taagcggtgg 10560tggttatgac tttggttacg atggagactt ctacagggct gaccagcctc
gctcagcacc 10620ttctctcaga cccaaggact atgaagttga tgctactctg aagtctctca
acaaccagat 10680tgagaccctt cttactcctg aaggctctag aaagaaccca gctcgcacat
gccgtgactt 10740gagactcagc cacccagagt ggagcagtgg ttactactgg attgacccta
accaaggatg 10800cactatggat gctatcaaag tatactgtga tttctctact ggcgaaacct
gtatccgggc 10860ccaacctgaa aacatcccag ccaagaactg gtataggagc tccaaggaca
agaaacacgt 10920ctggctagga gaaactatca atgctggcag ccagtttgaa tataatgtag
aaggagtgac 10980ttccaaggaa atggctaccc aacttgcctt catgcgcctg ctggccaact
atgcctctca 11040gaacatcacc taccactgca agaacagcat tgcatacatg gatgaggaga
ctggcaacct 11100gaaaaaggct gtcattctac agggctctaa tgatgttgaa cttgttgctg
agggcaacag 11160caggttcact tacactgttc ttgtagatgg ctgctctaaa aagacaaatg
aatggggaaa 11220gacaatcatt gaatacaaaa caaataagcc atcacgcctg cccttccttg
atattgcacc 11280tttggacatc ggtggtgctg accaggaatt ctttgtggac attggcccag
tctgtttcaa 11340ataagtcgac tcgctgatca gcctcgactt tgccttctag ttgccagcca
tctgttgttt 11400gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc
ctttcctaat 11460aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg
gggggtgggg 11520tggggcagga cagcaagggg gaggattggg aagacaatag caggcactcg
accctcgact 11580tcaaatatgt atccgctcat gagacaataa ccctgataaa tgcttcaata
atattgaaaa 11640aggaagagtc ctgaggcgga aagaaccagc tgtggaatgt gtgtcagtta
gggtgtggaa 11700agtccccagg ctccccagca ggcagaagta tgcaaagcat gcatctcaat
tagtcagcaa 11760ccaggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc
atgcatctca 11820attagtcagc aaccatagtc ccgcccctaa ctccgcccat cccgccccta
actccgccca 11880gttccgccca ttctccgccc catggctgac taattttttt tatttatgca
gaggccgagg 11940ccgcctcggc ctctgagcta ttccagaagt agtgaggagg cttttttgga
ggcctaggct 12000tttgcaaaga tcgatcaaga gacaggatga ggatcgtttc gcatgattga
acaagatgga 12060ttgcacgcag gttctccggc cgcttgggtg gagaggctat tcggctatga
ctgggcacaa 12120cagacaatcg gctgctctga tgccgccgtg ttccggctgt cagcgcaggg
gcgcccggtt 12180ctttttgtca agaccgacct gtccggtgcc ctgaatgaac tgcaagacga
ggcagcgcgg 12240ctatcgtggc tggccacgac gggcgttcct tgcgcagctg tgctcgacgt
tgtcactgaa 12300gcgggaaggg actggctgct attgggcgaa gtgccggggc aggatctcct
gtcatctcac 12360cttgctcctg ccgagaaagt atccatcatg gctgatgcaa tgcggcggct
gcatacgctt 12420gatccggcta cctgcccatt cgaccaccaa gcgaaacatc gcatcgagcg
agcacgtact 12480cggatggaag ccggtcttgt cgatcaggat gatctggacg aagagcatca
ggggctcgcg 12540ccagccgaac tgttcgccag gctcaaggcg agcatgcccg acggcgagga
tctcgtcgtg 12600acccatggcg atgcctgctt gccgaatatc atggtggaaa atggccgctt
ttctggattc 12660atcgactgtg gccggctggg tgtggcggac cgctatcagg acatagcgtt
ggctacccgt 12720gatattgctg aagagcttgg cggcgaatgg gctgaccgct tcctcgtgct
ttacggtatc 12780gccgctcccg attcgcagcg catcgccttc tatcgccttc ttgacgagtt
cttctgagcg 12840ggactctggg gttcgaaatg accgaccaag cgacgcccaa cctgccatca
cgagatttcg 12900attccaccgc cgccttctat gaaaggttgg gcttcggaat cgttttccgg
gacgccggct 12960ggatgatcct ccagcgcggg gatctcatgc tggagttctt cgcccaccct
agggggaggc 13020taactgaaac acggaaggag acaataccgg aaggaacccg cgctatgacg
gcaataaaaa 13080gacagaataa aacgcacggt gttgggtcgt ttgttcataa acgcggggtt
cggtcccagg 13140gctggcactc tgtcgatacc ccaccgagac cccattgggg ccaatacgcc
cgcgtttctt 13200ccttttcccc accccacccc ccaagttcgg gtgaaggccc agggctcgca
gccaacgtcg 13260gggcggcagg ctgcannnta tcgaattccc gcggccgcgg gaattcgatt
ccatcggtgc 13320agcaagcatg gaattttgtt ttgatgtatt caaggagctc aaagtccacc
atgccaatga 13380gaacatcttc tactgcccca ttgccatcat gtcagctcta gccatggtat
acctgggtgc 13440aaaagacagc accaggacac agataaataa ggtgagccta cagttaaaga
ttaaaacctt 13500tgccctgctc aatggagcca cagcacttaa ttgtatgata atgtcccttg
gaaactgcat 13560agctcagagg ctgaaaatct gaaaccagag ttatctaaaa gtgtggccac
ctccaactcc 13620cagagtgtta cccaaatgca ctagctagaa atcttgaaac tggattgcat
aacttctttt 13680tgtcataacc attatttcag ctactattat tttcaattac aggttgttcg
ctttgataaa 13740cttccaggat tcggagacag tattgaagct caggtacaga aataatttca
cctccttctc 13800tatgtccctt tcctctggaa gcaaaataca gcagatgaag caatctctta
gctgttccaa 13860gccctctctg atgagcagct agtgctctgc atccagcagt tgggagaaca
ctgttcataa 13920gaacagagaa aaagaaggaa gtaacagggg attcagaaca aacagaagat
aaaactcagg 13980acaaaaatac cgtgtgaatg aggaaacttg tggatatttg tacgcttaag
caagacagct 14040agatgattct ggataaatgg gtctggttgg aaaagaagga aagcctggct
gatctgctgg 14100agctagatta ttgcagcagg taggcaggag ttccctagag aaaagtatga
gggaattaca 14160gaagaaaaac agcacaaaat tgtaaatatt ggaaaaggac cacatcagtg
tagttactag 14220cagtaagaca gacaggatga aaaatagttt tgtaaacaga agtatctaac
tactttactc 14280tgttcataca ctatgtaaaa cctactaagt aataaaacta gaataacaac
atctttcttt 14340ctctttgtat tcagtgtggc acatctgtaa acgttcactc ttcacttaga
gacatcctca 14400accaaatcac caaaccaaat gatgtttatt cgttcagcct tgccagtaga
ctttatgctg 14460aagagagata cccaatcctg ccagtaagtt gctctaaaat ctgatctgag
tgtatttcca 14520tgccaaagct ctaccattct gtaatgcaaa aacagtcaga gttccacatg
tttcactaag 14580aaaatttctt tttctcttgt ttttacaaat gaaagagagg acaaataaca
tttctctatc 14640accgacctga aactctacag tcttcagaga atgaatggct tgctaaaaga
atgtcaaatc 14700ttaccataca gctatttcat attacactac taaatacact ataaggcata
gcatgtagta 14760atacactgta aaatagcttt ttacactact atattattaa tatctgttaa
ttccagtctt 14820gcatttcaca tttgcaaaac gttttgaaat tcgtatctga aagctgaata
ctcttgcttt 14880acaggaatac ttgcagtgtg tgaaggaact gtatagagga ggcttggaac
ctatcaactt 14940tcaaacagct gcagatcaag ccagagagct catcaattcc tgggtagaaa
gtcagacaaa 15000tggtaaggta gaacatgctt tgtacatagt gagagttggt tcaccctaat
actgagaacc 15060tggatatagc tcagccagcg tgctttgcgt tcaagcttac cagagctgtt
gtatgcctgt 15120taagcagggc atacagtcat gaggctcttg aaaaatctta acagacaaag
ggcaatggaa 15180aatcggagtt aagggatggt agggataaaa tgcatagaaa gaggtaccac
gattttgatt 15240tttgccctaa tgcctctctg cgtggttcct caatttttct acttcattcc
tcatctcctc 15300agagcattcc tttccctcat gcttgaaaca cagatgaaag actgtgaatt
ctaactgaga 15360tgaaaacatc cacaaccaca caacctctgg tgtggagtca cattctgtga
aggcaaaaac 15420taggccacgt aatctatgtg tgcaagctac gtgtaagcta tgtgtgtgac
aggacaatgt 15480gaggaacata ctatgtgcac aaggactgca gaataaacag gagcaaagtt
tttgaagaaa 15540acagagtaaa atcccgtttt cctcttttgt tacattcttt acatatatct
caaatttcct 15600ctttggttag aagcaagtaa tatttatgtt tcttggtact gtttgggttg
aagaccattc 15660tgggataaga gaaattccag tggttcttcc cctaatcata aaatgtacag
gtttagtttt 15720tttgtaacac agaaatctct tcatctttta tcttttgttg tgattcttta
tagagagaga 15780aacaagactt actgacaata gcagcaagaa aatcaatctt ggaagaacaa
gattgcagtt 15840gcaaaaacaa accaatgtcc ttgcccctac atcctcttcc ccataaattc
tacattctct 15900atctaccttg tgcttgccaa catgatatac gtaaactctc ttttcctatt
cattcttaaa 15960ggaattatca gaaatgtcct tcagccaagc tccgtggatt ctcaaactgc
aatggttctg 16020gttaatgcca ttgtcttcaa aggactgtgg gagaaagcat ttaaggatga
agacacacaa 16080gcaatgcctt tcagagtgac tgaggtatat gggcatacct tagagatgta
atctagaatt 16140tatgaagaga gtagacatgt tgttatatga acactgcatt agcgtatctg
ctcatttgtc 16200tgcatctctt tcagacactg tgttaaaagc agggaatttt ccttatgtct
ctttcatcac 16260aatattcctg acattgcaaa gctcctgaga aataacttca gattcccact
tttcctaggg 16320aggtcttcct ggatgagaac aatcaatcat cttaactgta actagatatt
tctgcatcta 16380agaataatct ttgttaaaac tatattctct ctctcttttt tttttttttt
ggttctccag 16440caagaaagca aacctgtgca gatgatgtgg atcc
16474338505DNAhuman 33gtcgacctta agtcctcaga cttggcaagg
agaatgtaga tttctacagt atatatgttt 60tcacaaaagg aaggagagaa acaaaagaaa
atggcactga ctaaacttca gctagtggta 120taggaaagta attctgctta acagagattg
cagtgatctc tatgtatgtc ctgaagaatt 180atgttgtact tttttccctc atttttaaat
caaacagtgc tttacagagg tcagaatggt 240ttctttactg tttgtcaatt ctattatttc
aatacagaac aatagcttct ataactgaaa 300tatatttgct attgtatatt atgattgtcc
ctcgaaccat gaacactcct ccagctgaat 360ttcacaattc ctctgtcatc tgccaggcca
ttaagttatt catggaagat ctttgaggaa 420cactgcaagt tcatatcata aacacatttg
aaattgagta ttgttttgca ttgtatggag 480ctatgttttg ctgtatcctc agaaaaaaag
tttgttataa agcattcaca cccataaaaa 540gatagattta aatattccag ctataggaaa
gaaagtgcgt ctgctcttca ctctagtctc 600agttggctcc ttcacatgca cgcttcttta
tttctcctat tttgtcaaga aaataatagg 660tcacgtcttg ttctcactta tgtcctgcct
agcatggctc agatgcacgt tgtacataca 720agaaggatca aatgaaacag acttctggtc
tgttactaca accatagtaa taagcacact 780aactaataat tgctaattat gttttccatc
tccaaggttc ccacattttt ctgttttctt 840aaagatccca ttatctggtt gtaactgaag
ctcaatggaa catgagcaat atttcccagt 900cttctctccc atccaacagt cctgatggat
tagcagaaca ggcagaaaac acattgttac 960ccagaattaa aaactaatat ttgctctcca
ttcaatccaa aatggaccta ttgaaactaa 1020aatctaaccc aatcccatta aatgatttct
atggcgtcaa aggtcaaact tctgaaggga 1080acctgtgggt gggtcacaat tcaggctata
tattccccag ggctcagcca gtgtctgtac 1140atacagctag aaagctgtat tgcctttagc
actcaagctc aaaaggtaag caactctctg 1200gaattacctt ctctctatat tagctcttac
ttgcacctaa actttaaaaa attaacaatt 1260attgtgttat gtgttgtatc tttaagggtg
aagtacctgc gtgatacccc ctataaaaac 1320ttctcacctg tgtatgcatt ctgcactatt
ttattatgtg taaaagcttt gtgtttgttt 1380tcaggaggct tattctttgt gcttaaaata
tgtttttaat ttcagaacat cttatcctgt 1440cgttcactat ctgatatgct ttgcagtttg
cctgattaac ttctagccct acagagtgca 1500cagagagcaa aatcatggtg ttcagtgaat
tctggggagt tattttaatg tgaaaattct 1560ctagaagttt aattcctgca aagtgcagct
gctgatcact acacaagata aaaatgtggg 1620gggtgcataa acgtatattc ttacaataat
agatacatgt gaacttgtat acagaaaaga 1680aaatgagaaa aatgtgtgtg cgtatactca
cacacgtggt cagtaaaaac ttttgagggg 1740tttaatacag aaaatccaat cctgaggccc
cagcactcag tacgcatata aagggctggg 1800ctctgaagga cttctgactt tcacagatta
tataaatctc aggaaagcaa ctagattcat 1860gctggctcca aaagctgtgc tttatataag
cacactggct atacaatagt tgtacagttc 1920agctctttat aatagaaaca gacagaacaa
gtataaatct tctattggtc tatgtcatga 1980acaagaattc attcagtggc tctgttttat
agtaaacatt gctattttat catgtctgca 2040tttctcttct gtctgaatgt caccactaaa
atttaactcc acagaaagtt tatactacag 2100tacacatgca tatctttgag caaagcaaac
catacctgaa agtgcaatag agcagaatat 2160gaattacatg cgtgtctttc tcctagacta
catgacccca tataaattac attccttatc 2220tattctgcca tcaccaaaac aaaggtaaaa
atacttttga agatctactc atagcaagta 2280gtgtgcaaca aacagatatt tctctacatt
tatttttagg gaataaaaat aagaaataaa 2340atagtcagca agcctctgct ttctcatata
tctgtccaaa cctaaagttt actgaaattt 2400gctctttgaa tttccagttt tgcaagccta
tcagattgtg ttttaatcag aggtactgaa 2460aagtatcaat gaattctagc tttcactgaa
caaaaatatg tagaggcaac tggcttctgg 2520gacagtttgc tacccaaaag acaactgaat
gcaaatacat aaatagattt atgaatatgg 2580ttttgaacat gcacatgaga ggtggatata
gcaacagaca cattaccaca gaattacttt 2640aaaactactt gttaacattt aattgcctaa
aaactgctcg taatttactg ttgtagccta 2700ccatagagta ccctgcatgg tactatgtac
agcattccat ccttacattt tcactgttct 2760gctgtttgct ctagacaact cagagttcac
catgaccaac aagtgtctcc tccaaattgc 2820tctcctgttg tgcttctcca ctacagctct
ttccatgagc tacaacttgc ttggattcct 2880acaaagaagc agcaattttc agtgtcagaa
gctcctgtgg caattgaatg ggaggcttga 2940atattgcctc aaggacagga tgaactttga
catccctgag gagattaagc agctgcagca 3000gttccagaag gaggacgccg cattgaccat
ctatgagatg ctccagaaca tctttgctat 3060tttcagacaa gattcatcta gcactggctg
gaatgagact attgttgaga acctcctggc 3120taatgtctat catcagataa accatctgaa
gacagtcctg gaagaaaaac tggagaaaga 3180agatttcacc aggggaaaac tcatgagcag
tctgcacctg aaaagatatt atgggaggat 3240tctgcattac ctgaaggcca aggagtacag
tcactgtgcc tggaccatag tcagagtgga 3300aatcctaagg aacttttact tcattaacag
acttacaggt tacctccgaa actgaagatc 3360tcctagcctg tgcctctggt cgactcgctg
atcagcctcg actttgcctt ctagttgcca 3420gccatctgtt gtttgcccct cccccgtgcc
ttccttgacc ctggaaggtg ccactcccac 3480tgtcctttcc taataaaatg aggaaattgc
atcgcattgt ctgagtaggt gtcattctat 3540tctggggggt ggggtggggc aggacagcaa
gggggaggat tgggaagaca atagcaggca 3600ctcgaccctc gacttcaaat atgtatccgc
tcatgagaca ataaccctga taaatgcttc 3660aataatattg aaaaaggaag agtcctgagg
cggaaagaac cagctgtgga atgtgtgtca 3720gttagggtgt ggaaagtccc caggctcccc
agcaggcaga agtatgcaaa gcatgcatct 3780caattagtca gcaaccaggt gtggaaagtc
cccaggctcc ccagcaggca gaagtatgca 3840aagcatgcat ctcaattagt cagcaaccat
agtcccgccc ctaactccgc ccatcccgcc 3900cctaactccg cccagttccg cccattctcc
gccccatggc tgactaattt tttttattta 3960tgcagaggcc gaggccgcct cggcctctga
gctattccag aagtagtgag gaggcttttt 4020tggaggccta ggcttttgca aagatcgatc
aagagacagg atgaggatcg tttcgcatga 4080ttgaacaaga tggattgcac gcaggttctc
cggccgcttg ggtggagagg ctattcggct 4140atgactgggc acaacagaca atcggctgct
ctgatgccgc cgtgttccgg ctgtcagcgc 4200aggggcgccc ggttcttttt gtcaagaccg
acctgtccgg tgccctgaat gaactgcaag 4260acgaggcagc gcggctatcg tggctggcca
cgacgggcgt tccttgcgca gctgtgctcg 4320acgttgtcac tgaagcggga agggactggc
tgctattggg cgaagtgccg gggcaggatc 4380tcctgtcatc tcaccttgct cctgccgaga
aagtatccat catggctgat gcaatgcggc 4440ggctgcatac gcttgatccg gctacctgcc
cattcgacca ccaagcgaaa catcgcatcg 4500agcgagcacg tactcggatg gaagccggtc
ttgtcgatca ggatgatctg gacgaagagc 4560atcaggggct cgcgccagcc gaactgttcg
ccaggctcaa ggcgagcatg cccgacggcg 4620aggatctcgt cgtgacccat ggcgatgcct
gcttgccgaa tatcatggtg gaaaatggcc 4680gcttttctgg attcatcgac tgtggccggc
tgggtgtggc ggaccgctat caggacatag 4740cgttggctac ccgtgatatt gctgaagagc
ttggcggcga atgggctgac cgcttcctcg 4800tgctttacgg tatcgccgct cccgattcgc
agcgcatcgc cttctatcgc cttcttgacg 4860agttcttctg agcgggactc tggggttcga
aatgaccgac caagcgacgc ccaacctgcc 4920atcacgagat ttcgattcca ccgccgcctt
ctatgaaagg ttgggcttcg gaatcgtttt 4980ccgggacgcc ggctggatga tcctccagcg
cggggatctc atgctggagt tcttcgccca 5040ccctaggggg aggctaactg aaacacggaa
ggagacaata ccggaaggaa cccgcgctat 5100gacggcaata aaaagacaga ataaaacgca
cggtgttggg tcgtttgttc ataaacgcgg 5160ggttcggtcc cagggctggc actctgtcga
taccccaccg agaccccatt ggggccaata 5220cgcccgcgtt tcttcctttt ccccacccca
ccccccaagt tcgggtgaag gcccagggct 5280cgcagccaac gtcggggcgg caggctgcat
atcgaattcc cgcggccgcg ggaattcgat 5340tccatcggtg cagcaagcat ggaattttgt
tttgatgtat tcaaggagct caaagtccac 5400catgccaatg agaacatctt ctactgcccc
attgccatca tgtcagctct agccatggta 5460tacctgggtg caaaagacag caccaggaca
cagataaata aggtgagcct acagttaaag 5520attaaaacct ttgccctgct caatggagcc
acagcactta attgtatgat aatgtccctt 5580ggaaactgca tagctcagag gctgaaaatc
tgaaaccaga gttatctaaa agtgtggcca 5640cctccaactc ccagagtgtt acccaaatgc
actagctaga aatcttgaaa ctggattgca 5700taacttcttt ttgtcataac cattatttca
gctactatta ttttcaatta caggttgttc 5760gctttgataa acttccagga ttcggagaca
gtattgaagc tcaggtacag aaataatttc 5820acctccttct ctatgtccct ttcctctgga
agcaaaatac agcagatgaa gcaatctctt 5880agctgttcca agccctctct gatgagcagc
tagtgctctg catccagcag ttgggagaac 5940actgttcata agaacagaga aaaagaagga
agtaacaggg gattcagaac aaacagaaga 6000taaaactcag gacaaaaata ccgtgtgaat
gaggaaactt gtggatattt gtacgcttaa 6060gcaagacagc tagatgattc tggataaatg
ggtctggttg gaaaagaagg aaagcctggc 6120tgatctgctg gagctagatt attgcagcag
gtaggcagga gttccctaga gaaaagtatg 6180agggaattac agaagaaaaa cagcacaaaa
ttgtaaatat tggaaaagga ccacatcagt 6240gtagttacta gcagtaagac agacaggatg
aaaaatagtt ttgtaaacag aagtatctaa 6300ctactttact ctgttcatac actatgtaaa
acctactaag taataaaact agaataacaa 6360catctttctt tctctttgta ttcagtgtgg
cacatctgta aacgttcact cttcacttag 6420agacatcctc aaccaaatca ccaaaccaaa
tgatgtttat tcgttcagcc ttgccagtag 6480actttatgct gaagagagat acccaatcct
gccagtaagt tgctctaaaa tctgatctga 6540gtgtatttcc atgccaaagc tctaccattc
tgtaatgcaa aaacagtcag agttccacat 6600gtttcactaa gaaaatttct ttttctcttg
tttttacaaa tgaaagagag gacaaataac 6660atttctctat caccgacctg aaactctaca
gtcttcagag aatgaatggc ttgctaaaag 6720aatgtcaaat cttaccatac agctatttca
tattacacta ctaaatacac tataaggcat 6780agcatgtagt aatacactgt aaaatagctt
tttacactac tatattatta atatctgtta 6840attccagtct tgcatttcac atttgcaaaa
cgttttgaaa ttcgtatctg aaagctgaat 6900actcttgctt tacaggaata cttgcagtgt
gtgaaggaac tgtatagagg aggcttggaa 6960cctatcaact ttcaaacagc tgcagatcaa
gccagagagc tcatcaattc ctgggtagaa 7020agtcagacaa atggtaaggt agaacatgct
ttgtacatag tgagagttgg ttcaccctaa 7080tactgagaac ctggatatag ctcagccagc
gtgctttgcg ttcaagctta ccagagctgt 7140tgtatgcctg ttaagcaggg catacagtca
tgaggctctt gaaaaatctt aacagacaaa 7200gggcaatgga aaatcggagt taagggatgg
tagggataaa atgcatagaa agaggtacca 7260cgattttgat ttttgcccta atgcctctct
gcgtggttcc tcaatttttc tacttcattc 7320ctcatctcct cagagcattc ctttccctca
tgcttgaaac acagatgaaa gactgtgaat 7380tctaactgag atgaaaacat ccacaaccac
acaacctctg gtgtggagtc acattctgtg 7440aaggcaaaaa ctaggccacg taatctatgt
gtgcaagcta cgtgtaagct atgtgtgtga 7500caggacaatg tgaggaacat actatgtgca
caaggactgc agaataaaca ggagcaaagt 7560ttttgaagaa aacagagtaa aatcccgttt
tcctcttttg ttacattctt tacatatatc 7620tcaaatttcc tctttggtta gaagcaagta
atatttatgt ttcttggtac tgtttgggtt 7680gaagaccatt ctgggataag agaaattcca
gtggttcttc ccctaatcat aaaatgtaca 7740ggtttagttt ttttgtaaca cagaaatctc
ttcatctttt atcttttgtt gtgattcttt 7800atagagagag aaacaagact tactgacaat
agcagcaaga aaatcaatct tggaagaaca 7860agattgcagt tgcaaaaaca aaccaatgtc
cttgccccta catcctcttc cccataaatt 7920ctacattctc tatctacctt gtgcttgcca
acatgatata cgtaaactct cttttcctat 7980tcattcttaa aggaattatc agaaatgtcc
ttcagccaag ctccgtggat tctcaaactg 8040caatggttct ggttaatgcc attgtcttca
aaggactgtg ggagaaagca tttaaggatg 8100aagacacaca agcaatgcct ttcagagtga
ctgaggtata tgggcatacc ttagagatgt 8160aatctagaat ttatgaagag agtagacatg
ttgttatatg aacactgcat tagcgtatct 8220gctcatttgt ctgcatctct ttcagacact
gtgttaaaag cagggaattt tccttatgtc 8280tctttcatca caatattcct gacattgcaa
agctcctgag aaataacttc agattcccac 8340ttttcctagg gaggtcttcc tggatgagaa
caatcaatca tcttaactgt aactagatat 8400ttctgcatct aagaataatc tttgttaaaa
ctatattctc tctctctttt tttttttttt 8460tggttctcca gcaagaaagc aaacctgtgc
agatgatgtg gatcc 85053418PRTchicken 34Met Arg Ser Leu
Leu Ile Leu Val Leu Cys Phe Leu Pro Leu Ala Ala1 5
10 15Leu Gly3519PRTchicken 35Met Lys Leu Ile
Leu Cys Thr Val Leu Ser Leu Gly Ile Ala Ala Val1 5
10 15Cys Phe Ala36240DNAchicken 36gacatacagc
tagaaagctg tattgccttt agcactcaag ctcaaaagac aactcagagt 60tcaccatggg
ctccatcggc gcagcaagca tggaattttg ttttgatgta ttcaaggagc 120tcaaagtcca
ccatgccaat gagaacatct tctactgccc cattgccatc atgtcagctc 180tagccatggt
atacctgggt gcaaaagaca gcaccaggac acagataaat aaggttgttc
24037180DNAchicken 37atggccatgg caggcgtctt cgtgctgttc tctttcgtgc
tttgtggctt cctcccagat 60gctgtctttg gggctgaggt ggactgcagt aggtttccca
acgctacaga caaggaaggc 120aaagatgtat tggtttgcaa caaggacctc cgccccatct
gtggtaccga tggagtcact 1803846DNAchicken 38gctgtttgct ctagacaact
cagagttcac catgggctcc atcggt 463942DNAchicken
39gctgtttgct ctagacaact cagagtcatg ggctccatcg gt
424045DNAchicken 40gctgtttgct ctagacaact cagagttcac atgggctcca tcggt
454146DNAchicken 41gctgtttgct ctagacaact cagagttcac
attgggctcc atcggt 464244DNAchicken 42gctgtttgct
ctagacaact cagagttcac tgggctccat cggt
444337DNAchicken 43gctgtttgct ctagacaact caatgggctc catcggt
374420DNAchicken 44gctgcatggg ctccatcggt
204556DNAchicken 45tgcagtaggt ttcccaacgc
tacagacaag gaaggcaaag atgtattggt ttgcaa 564654DNAchicken
46tgcagtaggt ttcccaacgc tacagaaagg aggcaaagat gtattggttt gcaa
544754DNAchicken 47tgcagtaggt ttcccaacgc tacaacaagg aggcaaagat gtattggttt
gcaa 544853DNAchicken 48tgcagtaggt ttcccaacgc tacacaagga
ggcaaagatg tattggtttg caa 534952DNAchicken 49tgcagtaggt
ttcccaacgc taacaaggag gcaaagatgt attggtttgc aa
525052DNAchicken 50tgcagtaggt ttcccaacgc tacaaggaag gcaaagatgt attggtttgc
aa 525151DNAchicken 51tgcagtaggt ttcccaacgc taaaggaagg
caaagatgta ttggtttgca a 515251DNAchicken 52tgcagtaggt
ttcccaacgg acaaggaagg caaagatgta ttggtttgca a
515349DNAchicken 53tgcagtaggt ttcccaacgc aaggaaggca aagatgtatt ggtttgcaa
495449DNAchicken 54tgcagtaggt ttcccaacgc aaggaaggca
aagatgtatt ggtttgcaa 495534DNAchicken 55tgcagtaggt
ttcccaacgc tgtattggtt gcaa
345660DNAchicken 56agtaggtttc ccaacgctac agacaaggaa ggcaaagatg tattggtttg
caacaaggac 605755DNAchicken 57agtaggtttc ccaacgcaca aggaaggcaa
agatgtattg gtttgcaaca aggac 555856DNAchicken 58tgcagtaggt
ttcccaacgc tacagacaag gaaggcaaag atgtattggt ttgcaa
565955DNAchicken 59tgcagtaggt ttcccaacgc tacaacaagg aaggcaaaga tgtattggtt
tgcaa 556054DNAchicken 60tgcagtaggt ttcccaacgc tacacaagga
aggcaaagat gtattggttt gcaa 546152DNAchicken 61tgcagtaggt
ttcccaacgc tacaaggaag gcaaagatgt attggtttgc aa
526251DNAchicken 62tgcagtaggt ttcccaacgc acaaggaagg caaagatgta ttggtttgca
a 516347DNAchicken 63tgcagtaggt ttcccaacgc ggaaggcaaa
gatgtattgg tttgcaa 476444DNAchicken 64tgcagtaggt
ttcccaacga aggcaaagat gtattggttt gcaa
446535DNAchicken 65tgcagtaggt ttcccaacgc tgtattggtt tgcaa
356625DNAchicken 66tgcagtagga tgtattggtt tgcaa
2567120DNAchicken 67gacatacagc tagaaagctg
tattgccttt agcactcaag ctcaaaagac aactcagagt 60tcaccatggg ctccatcggc
gcagcaagca tggaattttg ttttgatgta ttcaaggagc 12068120DNAchicken
68gacatacagc tagaaagctg tattgccttt agcactcaag ctcaaaagac aactcagagt
60tcaccatggg ctccatcggc gcagcaagca tggaattttg ttttgatgta ttcaaggagc
1206960DNAchicken 69agtaggtttc ccaacgctac agacaaggaa ggcaaagatg
tattggtttg caacaaggac 607055DNAchicken 70agtaggtttc ccaacgcaca
aggaaggcaa agatgtattg gtttgcaaca aggac 557119DNAArtificial
Sequenceprimer 71tctggggttc gaaatgacc
19
User Contributions:
Comment about this patent or add new information about this topic: