Drug delivery compositions

文档序号:310047 发布日期:2021-11-26 浏览:13次 中文

阅读说明:本技术 药物递送组合物 (Drug delivery compositions ) 是由 藤原崇之 广冈俊亮 宫城岛进也 大松勉 于 2020-03-27 设计创作,主要内容包括:一种药物递送组合物,其包含在细胞中内包药物的耐酸性细胞。另外,一种耐酸性细胞,其是在细胞中内包药物的耐酸性细胞,所述药物局部存在于所述耐酸性细胞具有的袋状膜结构中。(A drug delivery composition comprising an acid-resistant cell encapsulating a drug within the cell. Also disclosed is an acid-resistant cell wherein a drug is encapsulated in the cell, wherein the drug is locally present in a pouch-like membrane structure of the acid-resistant cell.)

1. A drug delivery composition comprising an acid-resistant cell encapsulating a drug.

2. The drug delivery composition of claim 1, wherein the drug is locally present in a pouch-like membrane structure possessed by the acid-resistant cells.

3. The drug delivery composition of claim 2, wherein the pouch-like membrane structure is at least one selected from the group consisting of an exogenous liposome, a cell membrane, and an organelle.

4. The drug delivery composition of claim 3, wherein the organelle is at least one selected from the group consisting of mitochondria, chloroplasts, endoplasmic reticulum, vacuole, nucleus, peroxisome, and Golgi body.

5. The drug delivery composition according to any one of claims 1 to 4, wherein the drug is at least one selected from the group consisting of a low-molecular compound, a peptide, a protein, and a nucleic acid.

6. The drug delivery composition of any one of claims 1 to 5, wherein the drug is a drug that acts in the intestine.

7. The drug delivery composition of any one of claims 1 to 6, wherein the drug is an immunogenic drug.

8. The drug delivery composition of any one of claims 1 to 7, wherein the acid-resistant cell is a cell that undergoes cell disruption at a pH of 7 or above.

9. The drug delivery composition of any one of claims 1 to 8, wherein the acid-resistant cell is a cell that is resistant to acidic conditions of pH 1-3.

10. The drug delivery composition of any one of claims 1 to 9, wherein the acid-resistant cells are cells of an alga belonging to the class Rhodophyceae, spa (Cyanidiophyceae).

11. A feed containing the drug delivery composition of any one of claims 1 to 10.

12. A medicament containing a drug delivery composition according to any one of claims 1 to 10.

13. A food product comprising the drug delivery composition of any one of claims 1 to 10.

14. An acid-resistant cell that encapsulates a drug within the cell.

15. An acid-resistant cell in which a drug is encapsulated in the cell, wherein the drug is locally present in a pouch membrane structure of the acid-resistant cell.

16. A method for producing an acid-resistant cell according to claim 15, which comprises a step of introducing a gene encoding a fusion protein into the acid-resistant cell, wherein the fusion protein contains a peptide or protein as a drug and a peptide or protein that is localized to a cell membrane or an organelle.

17. A drug carrier comprising acid-resistant cells.

18. The drug carrier of claim 17, wherein the acid-resistant cell is a cell that undergoes cell rupture at a pH of 7 or above.

19. The drug carrier according to claim 17 or 18, wherein the acid-resistant cell is a cell resistant to acidic conditions of pH 1-3.

20. The pharmaceutical carrier of any one of claims 17 to 19, wherein the acid-resistant cells are cells of an alga belonging to the class rhodophyceae spa (Cyanidiophyceae).

21. An acid-resistant cell comprising an exogenous substance.

22. The acid-resistant cell of claim 21, wherein the exogenous substance is locally present in a pouch-like membrane structure that the acid-resistant cell has.

Technical Field

The present invention relates to a drug delivery composition; in addition, it relates to an acid-resistant cell, a drug carrier, and a method for producing the acid-resistant cell, which can be used for the drug delivery composition.

The present application claims priority based on the application No. 2019-069029, filed on 29.3.2019, the content of which is hereby incorporated by reference.

Background

In animals with an alimentary canal, first of all, food ingested from the mouth is fed into the stomach through the esophagus. For example, in the case of oral administration of a drug, particularly in the case where the drug contains peptides or proteins as main components, the possibility that the drug is enzymatically decomposed in the stomach is high. Further, since the interior of the stomach is strongly acidic, even a drug of a low molecular compound may be decomposed in the stomach non-enzymatically. In addition, even when an acidic compound is absorbed from the intestine, it may be absorbed from the stomach. Thus, oral administration using capsules that do not dissolve in the stomach but dissolve in the intestine is useful.

As a method for achieving drug delivery to the intestine, there are known a technique called bilosome (non-patent document 1) utilizing the property that a substance having protein B introduced into a lipid is stable in the stomach, a rice vaccine (non-patent document 2) utilizing a proteosome which is an organelle of rice to exhibit resistance to digestive enzymes, a spore vaccine (non-patent document 3) utilizing spores resistant to digestive enzymes, temperature changes, and pH changes, and the like.

As a vaccine targeting an industrial oral vaccine, a vaccine using yeast is known. For example, patent document 1 describes an oral vaccine for expressing an antigenic protein in yeast cells. Patent document 1 discloses that yeast cells are not digested in the stomach and jejunum but digested and decomposed in the ileum by freeze-drying, but release of antigenic proteins from yeast depends on the function of digestive enzymes in the small intestine. Patent document 2 suggests that a yeast strain containing a foreign gene incorporated therein is administered mucosally or orally to induce immunity, but also describes that proteins derived from the yeast used are antigenic.

Documents of the prior art

Patent document

Patent document 1: international publication No. 2006/028214

Patent document 2: japanese Kohyo publication No. 2012-508697

Non-patent document

Non-patent document 1: man JF et al, Lipid derived volumetric size of an inorganic influenza vacuum delivery methods the Th1/Th2 bias in the immune response and protection against infection. vaccine.2009Jun 2; 27(27): 3643-9.

Non-patent document 2: nochi T et al, Rice-based microbial vaccine as aglobal strain for cold-chain-and needle-free vaccine, Proc Natl Acad Sci U S A.2007Jun 26; 104(26): 10986-91.

Non-patent document 3: huang JM et al, Mucosal delivery of anti-using adaptation to bacterial spheres, vaccine.2010Jan 22; 28(4): 1021-30.

Disclosure of Invention

Problems to be solved by the invention

In animal husbandry, it is difficult to suppress the spread of infectious diseases when infectious diseases occur, and a large amount of livestock may be killed. Among infectious diseases, there are infectious diseases that are thought to be preventable by intestinal immunity, and it is an urgent subject to develop a technique for establishing immunity against pathogenic bacteria in the intestinal tract of livestock animals. Further, by imparting intestinal immunity, there is a possibility that other mucosal immunity and systemic immunity can be imparted. Therefore, development of an enteric composition that is orally administered with a vaccine and can be directly delivered into the intestine is required. However, the techniques described in non-patent documents 1 to 3 have a problem in terms of cost when used in animal husbandry.

Accordingly, an object of the present invention is to provide a novel drug delivery composition capable of delivering a drug to the intestine, an acid-resistant cell and a drug carrier that can be used in the drug delivery composition, and a method for producing the acid-resistant cell.

Means for solving the problems

The present invention includes the following aspects.

(1) A drug delivery composition comprising an acid-resistant cell encapsulating a drug.

(2) The drug delivery composition according to (1), wherein the drug is locally present in a pouch-like membrane structure possessed by the acid-resistant cells.

(3) The drug delivery composition according to (2), wherein the pouch-like membrane structure is at least one selected from the group consisting of an exogenous liposome and an organelle.

(4) The drug delivery composition according to (3), wherein the organelle is at least one selected from the group consisting of mitochondria, chloroplasts, endoplasmic reticulum, vacuole, nucleus, peroxisome and Golgi body.

(5) The drug delivery composition according to any one of (1) to (4), wherein the drug is at least one selected from the group consisting of a low-molecular compound, a peptide, a protein, and a nucleic acid.

(6) The drug delivery composition according to any one of (1) to (5), wherein the drug is a drug that acts in the intestine.

(7) The drug delivery composition according to any one of (1) to (6), which is an immunogenic drug.

(8) The drug delivery composition according to any one of (1) to (7), wherein the acid-resistant cell is a cell in which cell rupture occurs under a condition of pH7 or more.

(9) The drug delivery composition according to any one of (1) to (8), wherein the acid-resistant cell is a cell resistant to acidic conditions at pH 1-3.

(10) The drug delivery composition according to any one of (1) to (9), wherein the acid-resistant cells are cells of an alga belonging to the class Rhodophyceae in the spa (Cyanidiophyceae).

(11) A feed containing the drug delivery composition of any one of (1) to (10).

(12) A pharmaceutical product containing the drug delivery composition according to any one of (1) to (10).

(13) A food product containing the drug delivery composition of any one of (1) to (10).

(14) An acid-resistant cell comprises a drug.

(15) The acid-resistant cell according to (14), wherein the drug is locally present in a sac-like membrane structure of the acid-resistant cell.

(16) The acid-resistant cell according to (14), wherein the drug is locally present in a sac-like membrane structure of the acid-resistant cell.

(17) The acid-resistant cell according to any one of (14) to (16), wherein the drug is at least one selected from the group consisting of a low-molecular compound, a peptide, a protein, and a nucleic acid.

(18) The method for producing an acid-resistant cell according to (15), which comprises a step of introducing a gene encoding a fusion protein into the acid-resistant cell, wherein the fusion protein comprises a peptide or protein as a drug and a peptide or protein localized to a cell membrane or an organelle.

The present invention also includes the following aspects.

(19) A drug carrier comprising acid-resistant cells.

(20) The drug carrier according to (19), wherein the acid-resistant cells are cells which undergo cell disruption at a pH of 7 or higher.

(21) The drug carrier according to (19) or (20), wherein the acid-resistant cells are cells resistant to acidic conditions at pH 1-3.

(22) The pharmaceutical carrier according to any one of (19) to (21), wherein the acid-resistant cells are cells of an alga belonging to the class Rhodophyceae in the spa (Cyanidiophyceae).

(23) A pharmaceutical capsule containing a drug in the pharmaceutical carrier of any one of (19) to (22).

(24) The drug carrier according to (23), wherein the drug is locally present in a pouch-like membrane structure of the acid-resistant cells.

The present invention also includes the following aspects.

(25) An acid-resistant cell comprising an exogenous substance.

(26) The acid-resistant cell of (25), wherein the exogenous substance is locally present in a sac-like membrane structure of the acid-resistant cell.

(27) The acid-resistant cell according to (25) or (26), wherein the exogenous substance is at least one selected from the group consisting of a low-molecular compound, a peptide, a protein, a nucleic acid, and a synthetic high-molecular compound.

(28) The acid-resistant cell of any one of (25) to (27), wherein the exogenous substance is a substance that acts in the intestine.

(29) The acid-resistant cell of any one of (25) to (28), wherein the exogenous substance is an immunogenic substance.

(30) The acid-resistant cell of any one of (26) to (29), wherein the pouch-like membrane structure is at least one selected from the group consisting of an exogenous liposome, a cell membrane, and an organelle.

(31) The acid-resistant cell of (30), wherein the organelle is at least one selected from the group consisting of mitochondria, chloroplasts, endoplasmic reticulum, vacuole, nucleus, peroxisomes and golgi apparatus.

(32) The acid-resistant cell according to any one of (25) to (31), wherein the acid-resistant cell is a cell in which cell rupture occurs under a condition of pH7 or more.

(33) The acid-resistant cell according to any one of (25) to (32), which is resistant to acidic conditions at pH 1-3.

(34) The acid-resistant cell of any one of (25) to (33), which is a cell of an alga belonging to the class Rhodophyceae in spa (Cyanidiophyceae).

(35) A feed comprising the acid-resistant cell of any one of (25) to (34).

(36) A pharmaceutical agent comprising the acid-resistant cell according to any one of (25) to (34).

(37) A food product comprising the acid-resistant cell of any one of (25) to (34).

(38) A method for administering the exogenous substance, comprising orally administering (25) the acid-resistant cell of any one of (34) to a subject.

(39) A method for feeding an animal, comprising feeding the animal with the acid-resistant cell of any one of (25) to (34).

(40) A method for imparting intestinal immunity, which comprises orally administering the acid-resistant cell described in any one of (25) to (34).

Effects of the invention

The present invention can provide a novel drug delivery composition capable of delivering a drug to the intestine, an acid-resistant cell and a drug carrier that can be used in the drug delivery composition, and a method for producing the acid-resistant cell.

Drawings

FIG. 1 is a graph showing the results of immunoblotting using an anti-GFP antibody against a GAPDH-GP-sfGFP-expressing strain cultured in the presence and absence of MG-132. In the figure, the arrow shows the band of the GAPDH-GP-sfGFP protein.

FIG. 2 is a fluorescence microscope image of the GAPDH-GP-sfGFP expressing strain. (A) PC: a phase difference microscope image representing the outline of the cell; (B) and (5) Chl: autofluorescence images of chloroplasts; (C) sfGFP: fluorescence image of sfGFP.

FIG. 3 is a diagram showing the structure of a DNA fragment used for preparing the Chl-TP-3HA-GP-Co 1-expressing strain in example 2.

FIG. 4 is a graph showing the results of immunoblotting using an anti-HA antibody against a Chl-TP-3HA-GP-Co 1-expressing strain cultured in the presence and absence of MG-132. In the figure, the arrow shows the band of Chl-TP-3HA-GP-Co1 protein.

FIG. 5 is a fluorescent microscope image of the Chl-TP-3HA-GP-Co 1-expressing strain. (A) PC: a phase difference microscope image representing the outline of the cell; (B) and (5) Chl: autofluorescence images of chloroplasts; (C) sfGFP: immunofluorescence staining images with anti-HA antibodies.

FIG. 6 is a graph showing the results of evaluating the production of anti-GP protein antibodies by immunoblotting on mice administered with a suspension of an sfGFP-expressing strain (control suspension administration group), a suspension of a Chl-TP-3 HA-GP-Col-expressing strain (suspension administration group), or an alginic acid-immobilized feed of a Chl-TP-3 HA-GP-Col-expressing strain (alginic acid-immobilized feed administration group). (A) The method comprises the following steps Alginic acid solidified feed administration group; (B) the suspension was administered to the group; (C) control suspensions were administered to groups. The numbers 1 to 4 represent the individual numbers of the mice.

FIG. 7 shows a molecular phylogenetic tree of algae belonging to the class Rhodophyceae in the hot spring based on the chloroplast ribulose 1, 5-bisphosphate carboxylase/oxygenase large subunit gene. In the vicinity of each branch, a local bootstrap value by the maximum likelihood method (only 50 or more is described, left side) and a posterior probability by the bayesian method (only 0.95 or more is described, right side) are shown. Known Cyanidioschyzon merole (a type of red spa alga) is enclosed by a dotted line, and YFU3 strains and HKN1 strains are enclosed by a solid line.

Detailed Description

[ definitions ]

In the present specification, the terms "peptide" and "protein" are used interchangeably to refer to a polymer of amino acids bound by amide bonds. The "peptide" or "protein" may be a polymer of natural amino acids, a polymer of natural amino acids and unnatural amino acids (chemical analogs, modified derivatives, etc. of natural amino acids), or a polymer of unnatural amino acids. Unless otherwise specified, the amino acid sequence is described from the N-terminal side to the C-terminal side.

The number of amino acid residues constituting the "peptide" or the "protein" is not particularly limited, and an amino acid polymer having two or more amino acid residues is also included in the "peptide" or the "protein". In the present specification, unless otherwise specified, a polymer having a large number of amino acid residues (for example, 100 amino acid residues or more) is referred to as "protein", and a polymer having a small number of amino acid residues (for example, less than 100 amino acids) is referred to as "peptide".

In the present specification, the terms "polynucleotide" and "nucleic acid" are used interchangeably to refer to a polymer of nucleotides to which the nucleotides are bound by phosphodiester bonds. "Polynucleotide" and "nucleic acid" may be DNA, RNA, or a combination of DNA and RNA. The "polynucleotide" and the "nucleic acid" may be a polymer of natural nucleotides, a polymer of natural nucleotides and non-natural nucleotides (e.g., a nucleotide in which at least one of the analog, base portion, sugar portion, and phosphate portion of natural nucleotides is modified (e.g., phosphorothioate skeleton)), or a polymer of non-natural nucleotides. Unless otherwise specified, the nucleotide sequence is described from the 5 'side to the 3' side.

In the present specification, the term "gene" refers to a polynucleotide comprising at least one Open Reading Frame (ORF) encoding a specific protein. The gene may include both exons and introns.

In the present specification, the term "operably linked" as used in connection with a polynucleotide means that a first base sequence is disposed sufficiently close to a second base sequence so that the first base sequence can affect the second base sequence or a region under the control of the second base sequence. For example, a polynucleotide "operably linked to" a promoter means that the polynucleotide is linked in such a way that it is expressed under the control of the promoter.

In the present specification, "a promoter is capable of functioning" means that the promoter is capable of expressing a polynucleotide operably linked to the promoter in a cell of a subject.

In the present specification, the expression "expressible state" means that the polynucleotide or gene is in a state in which it can be transcribed in a cell into which the polynucleotide has been introduced.

In the present specification, the term "expression vector" refers to a vector containing a target polypeptide and having a system for allowing expression of a target polynucleotide in a cell into which the vector has been introduced.

In the present specification, the term "drug delivery composition" means a composition for delivering a drug to an arbitrary site (organ, tissue, disease site, etc.) in a living body.

In the present specification, "drug carrier" means a carrier for delivery of a drug. The drug carrier may be any of organic and inorganic. In the case where the drug carrier is composed of an organic substance, the drug carrier may be a cell.

In the present specification, the term "drug-encapsulated" in a cell means that the drug is present in the cell and/or the drug is present in the cell membrane. In the case where the drug is present inside the cell, the drug may be present inside the organelle.

In the present specification, the phrase "the drug is locally present in the pouch-shaped film structure" means that most of the drug is present in the inside of the pouch-shaped film structure (inside of the pouch) of the subject or a film forming the pouch-shaped film structure (hereinafter referred to as "pouch film"). In the case where the drug is locally present in the pouch-like membrane structure of the cell, the entire drug contained in the cell does not need to be present inside the pouch-like membrane structure or in the pouch-like membrane, and a part of the drug may be present outside the pouch-like membrane structure. In the case where the drug is "locally present in the pouch-like membrane structure", the proportion of the drug present in the pouch-like structure may be, for example, 50% or more, preferably 60% or more, more preferably 70% or more, and still more preferably 80% or more of the total amount of the drug encapsulated in the cells.

In the present specification, "low molecular compound" means a compound having a molecular weight of about 2000 or less. However, peptides and nucleic acids having a molecular weight of 2000 or less are not included in the "low molecular compound".

In the present specification, "synthetic polymer compound" means a non-natural compound having a molecular weight of 2000 or more. By "non-natural compound" is meant a compound that does not occur in nature. Examples of the synthetic polymer compound include various synthetic polymers (e.g., polyolefin, polyester, polyamide, polyethylene glycol, poly (2-oxazoline)). Artificially chemically synthesized peptides, proteins and nucleic acids are not included in the "synthetic high molecular compound".

In the present specification, the term "exogenous substance" means a substance introduced from outside of a cell or a substance produced in a cell by a substance introduced from outside of a cell. Specific examples of the substance produced in the cell based on the substance introduced from the outside of the cell include: transcription products (mRNA) and translation products (protein) of the foreign gene in cells into which the foreign gene has been introduced, active metabolites of the prodrug (agents exhibiting the desired pharmacological effect) in cells into which the prodrug has been introduced, and the like. The exogenous substance is a substance different from a substance originally possessed by the cell (endogenous substance).

In the present specification, "drug" means a substance that shows a beneficial activity in an organism. The beneficial activity exhibited by the drug is not particularly limited, and includes physiological activity, pharmacological activity, biological activity, chemical activity useful for diagnosis and the like, and the like. For example, the activity may include: the pharmacological activity of a compound known as an active ingredient of a pharmaceutical product, and the chemical activity or physiological activity of a diagnostic drug administered into the body. Examples of the activity include, but are not limited to, an immune-inducing activity, an immune-enhancing activity, an anticancer activity, a signal transduction-inhibiting activity, a signal transduction-promoting activity, a metabolic antagonistic activity, an analgesic activity, an anti-inflammatory activity, a bactericidal activity, an antiviral activity, an antiallergic activity, an enzyme-inhibiting activity, a contrast action, a fluorescent activity, and the like. Drugs may also be those compounds which release a compound exhibiting a beneficial activity in the organism (so-called prodrugs).

In the present specification, the term "pharmaceutical" includes medical drugs and drugs that are administered for the treatment, prevention, or health promotion of diseases in a broad sense. The "pharmaceutical product" may be a publicly registered drug or a drug for medical use or a drug other than medical use.

In the present specification, "food" is used in a concept including general foods, health foods, nutritional supplements, health supplements, functional foods, beauty supplements, health products, and the like.

In the present specification, the term "variant" means a cell line in which a mutation has been naturally or artificially produced in the genome (including nuclear genome, chloroplast genome, and mitochondrial genome, and the same applies hereinafter) of the original cell line. The method for artificially mutating the genome is not particularly limited. Examples of the artificial method include: ultraviolet irradiation, radiation irradiation, chemical treatment with nitrous acid, etc., gene transfer, genome editing, etc.

In the present specification, the "YFU 3 mutant strain" refers to an algal strain in which a mutation has been generated in the genome of YFU3 strain and which has a diploid cell morphology and a haploid cell morphology. The "HKN 1 mutant strain" refers to an algal strain in which a mutation has been generated in the genome of HKN1 strain and which has a diploid cell morphology and a haploid cell morphology.

In the present specification, the "closely related species" is, for example, a cell line in which the nucleotide sequence of the rbcL gene, 18SrRNA gene or 16sRNA gene has 90% or more identity to the nucleotide sequence of the gene of the original biological species. In the case where the biological species is algae, the gene to be compared is an rbcL gene or an 18SrRNA gene, and preferably an rbcL gene. The identity between the nucleotide sequence of the rbcL gene of the original alga and the nucleotide sequence of the rbcL gene of the alga of the related species is preferably 95% or more, more preferably 97% or more, still more preferably 98% or more, and particularly preferably 99% or more. The nucleotide sequence of the rbcL gene possessed by the algae can be obtained by a known method. For example, the nucleotide sequence of the rbcL gene of the target alga can be obtained by extracting DNA from cells of the target alga by a known method, amplifying DNA fragments of the rbcL gene by a PCR method or the like, and analyzing the nucleotide sequence of the amplified DNA fragments by a DNA sequencer.

[ drug delivery composition ]

In one embodiment, the present invention provides a drug delivery composition comprising acid-resistant cells encapsulating a drug. In a preferred embodiment, the drug is locally present in a pouch-like membrane structure that the acid-resistant cells have.

< acid-resistant cell >

In the present specification, "acid-resistant cell" means a cell having resistance to acidic conditions. Specific examples of the acidic conditions include pH conditions of 1 to 3. The acid-resistant cell is preferably resistant to a pH condition of 1-4, and more preferably resistant to a pH condition of 1-5.

By "resistant" to acidic conditions is meant that no cell disruption or dissolution of the cell contents occurs under acidic conditions.

The acid-resistant cells may be living cells or dead cells, but are preferably cells maintaining the morphology of the cells. The acid-resistant cell is preferably a cell whose cell membrane and/or its outer membrane is not damaged and whose cell content is not eluted. In the case where the acid-resistant cell is a living cell, the cell can be grown under acidic conditions.

The cell type of the acid-resistant cell is not particularly limited. Examples of the acid-resistant cells include acid-resistant algal cells. Examples of such algal cells include cells of microalgae isolated from an acidic environment such as an acidic hot spring. Specific examples of such microalgae include algae belonging to the class Rhodophyceae (Cyaniidophyceae) in the hot spring.

The Rhodophyceae is classified by Rhodophyta (Rhodophyta) and Rhodophyceae (Cyaniidiophyceae). Within the class of the spa class are currently classified three genera, cyanidiischizon, Cyanidium and galdiia. The acid-fast cell may belong to any of these genera. For example, it can be judged whether or not an alga belongs to the class Rhodophyceae in the Hot spring by systematic analysis using the base sequence of the 18S rRNA gene or the ribulose-1, 5-bisphosphate carboxylase/oxygenase large subunit (rbcL) gene of chloroplast. The system analysis may be performed by a known method. A molecular phylogenetic tree based on the base sequence of the rbcL gene of algae belonging to the class rhodophyceae is shown in fig. 7.

Among algae belonging to the class rhodophyta, there are algae having both diploid and haploid cell morphologies. Diploid cell morphology can produce haploid cell morphology through meiosis. Further, it is considered that a haploid cell generates a diploid cell by the joining of two cells.

Haploid cells are easier to produce than diploid cells using genetic recombination techniques. Therefore, as described later, when a gene encoding a peptide as a drug is introduced into acid-resistant cells, haploid cells can be preferably used. In addition, a plurality of transformants into which an arbitrary drug-encoding gene has been introduced are prepared using haploid cells, and by crossing these transformants, a diploid containing a plurality of drug-encoding genes and a plurality of drugs therein can be prepared.

The judgment of whether an alga is diploid or haploid can be made by confirming the copy number of the same locus. That is, when the copy number of the same locus is 1, it is judged to be haploid. Further, the algae may be judged to be haploid by a next-generation sequencer or the like. For example, sequence reads of all genomes are obtained by a next generation sequencer or the like, these sequence reads are assembled, and mapping (mapping) is performed on the assembled sequences. In diploids, differences in the bases of the respective alleles can be found in the respective regions on the genome, but in haploids, such regions cannot be found because only one allele is present.

Alternatively, cells may be stained with a nuclear staining reagent such as DAPI, and cells showing the same fluorescence intensity may be judged to be haploid, and cells showing about 2-fold fluorescence intensity may be judged to be diploid, by comparing the stained cells with known haploid cells. Alternatively, cells may be stained with a nuclear staining reagent such as DAPI, and cells showing equivalent fluorescence intensity may be judged to be diploid and cells showing fluorescence intensity of about 1/2 times may be judged to be haploid, as compared with known diploid cells.

In order to release the drug rapidly in the intestine, the acid-resistant cells preferably do not have a strong cell wall. In the present specification, "does not have a strong cell wall" means that cell rupture occurs in any of the cell rupture treatments (a) to (C) described below.

(A) Cells were suspended in an isotonic solution above pH7 for more than 1 week.

(B) The cells were suspended in distilled water for more than 1 minute.

(C) The cells were dried and suspended in an isotonic solution above pH7.

In the above (a) to (C), when the cells are cultured cells, the culture medium may be removed by centrifugation or the like and the algal cells may be washed with an isotonic solution or the like before the respective treatments.

In the above (A) and (C), the isotonic solution may be a buffer solution of pH7 containing 10% sucrose and 20mM HEPES.

In the above (C), examples of the drying treatment include drying in a refrigerator (at 4 ℃ C.), freeze drying and the like. The precipitate of the algal cells recovered by the centrifugal separation is used in the drying treatment. In the case of drying in a refrigerator, the drying treatment time varies depending on the amount of cells, and may be exemplified by 3 days or more.

Further, the cell suspensions after the cell disruption treatment of the above (a) to (C) were centrifuged (1500 × g, 3 minutes), and the ratio of the mass of protein in the centrifugal supernatant to the mass of total protein in the cell suspension was determined, whereby it was possible to determine whether or not cell disruption occurred. Specifically, when the cell rupture rate obtained by the following equation is 20% or more, it can be determined that cell rupture has occurred.

Alternatively, the cells in the cell suspension may be observed with an optical microscope (for example, 600 times magnification), and when the ratio of the cells in which cell disruption has occurred is about 10% or more, and preferably about 20% or more of the total cells, it may be determined that cell disruption has occurred.

In the case where the cell does not have a strong cell wall, the cell wall is not observed in observation (for example, 600 times magnification) with an optical microscope. Further, it was judged whether or not the mild hypotonic treatment at a pH of 6 or less causes cell rupture without affecting whether or not the algae do not have a strong cell wall.

In the cell disruption treatment of the above (A) and (C), since an isotonic solution having a pH of 7 or more can be used, the cell disrupted in the cell disruption treatment of any one of the above (A) and (C) can be said to be a cell disrupted under the condition of a pH of 7 or more. In order to rapidly release the drug in the intestine, the acid-resistant cells are preferably cells in which cell rupture occurs under conditions of pH7 or more.

Further, by immersing the cells in a buffer solution of pH7 or more and observing for about 10 to 30 minutes, it is possible to determine whether or not the algal cells are disrupted under the condition of pH7 or more by confirming whether or not the algal cells are disrupted.

As the acid-resistant cells having such characteristics, among algae belonging to the class rhodophyta, for example, haploid of algae belonging to the genus Cyanidioschyzon merole or galdiiria, haploid of algae belonging to the genus Cyanidium, and the like can be given. These algae can be isolated from acidic environments such as acidic hot springs, and can also be obtained from strain preservation institutions. Examples of such a strain Collection include national institute of research and development (national institute of environmental sciences) facility for preservation of microbial systems (Xiaoyachu, Togawa, Ichwa, Japan 16-2), American Type Culture Collection (ATCC: American Type Culture Collection; 10801University Boulevard Manassas, VA 20110, USA), and the like.

Haploid of algae belonging to the genus Galdieria includes haploid of Galdieria subpluraria and Galdieria partita, and haploid of sibling species, variant strains, and progeny thereof. For example, diploid algae belonging to the genus Galdieria obtained from culture collection or the like are cultured until the culture reaches a stationary phase, and thereafter, culture is continued for an arbitrary period of time, whereby haploid cells appear in the culture solution. The haploid cells can be recovered and used as acid-resistant cells.

Examples of the haploid of algae belonging to the genus Cyanidium include a haploid of a Cyanidium sp.yfu3 strain (FERM BP-22334) (hereinafter, referred to as "YFU 3 strain"), a haploid of a Cyanidium sp.hkn1 strain (FERM BP-22333) (hereinafter, referred to as "HKN 1 strain"), and related species, mutants, and progeny thereof.

YFU3 strain (haploid) is a unicellular red alga separated from the high-temperature acidic water of a hot spring in Buzhou prefecture, Japan. YFU3 strain was deposited at 30/5.2017 at the agency for patent deposit of independent administrative statutes of human evaluation technology (Collection of microorganisms under authorization) (2-5-8 of Sickle foot on Toujin city, Qianye county, Japan) as accession number FERM P-22334, and was handed over as international deposit at 20/4.2018 as accession number FERM BP-22334.

HKN1 is a unicellular red alga separated from the high-temperature acidic water of a hot spring of kakomachi under Zhangzu, Kanagawa, Japan. HKN1 strain (haploid) was deposited at 30.5.2017 as accession number FERM P-22333 at the patent organism depositary center of independent administrative statutes human article evaluation technology base disk organization (Collection of licensed microorganisms), and was handed over as international deposit at 20.4.2018 as accession number FERM BP-22333.

Algae belonging to the class Rhodophyceae in the hot spring can be cultured using a culture medium for microalgae culture. The medium is not particularly limited, and an inorganic salt medium containing a nitrogen source, a phosphorus source, trace elements (zinc, boron, cobalt, copper, manganese, molybdenum, iron, and the like) and the like can be exemplified. For example, the nitrogen source includes ammonium salts, nitrates, nitrites, urea, and ammonia, and the phosphorus source includes phosphates. Examples of such a medium include a2 × Allen medium (Allen MB. Arch. Microbiol.195932: 270- & 277.), an M-Allen medium (Minoda A et al. plant Cell physiology.200445: 667-71.), an MA2 medium (Ohnuma M et al. plant Cell physiology.2008Jan; 49(1):117-20.), and the like.

Algae belonging to the class Rhodophyceae can also be cultured using a medium containing acidic hot spring drainage water. By "acidic spa drainage" is meant acidic drainage from a spa installation. The acidic hot spring drainage water is not particularly limited, but preferably has a pH of 1.0 to 4.0, and more preferably a pH of 1.0 to 3.0. The term "medium using acidic hot spring drainage" means a medium prepared by adding a nitrogen source, a phosphorus source, trace elements, and the like to acidic hot spring drainage. The medium using the Acidic Hot spring drainage water is preferably a medium obtained by adding a nitrogen source to the Acidic Hot spring drainage water, and more preferably a medium obtained by adding a nitrogen source and a phosphorus source (for example, refer to Hirooka S and Miyagishima S.Y (2016) digestion of Acidophilic Algae Galdieria subpluria and Pseudomonas sp. YKT1 in Media Derived from Acidic Hot springs. front microorganism. Dec 20; 7: 2022.). The nitrogen source includes ammonium salts (e.g., ammonium sulfate), urea, nitrates (e.g., sodium nitrate), and the like, and preferably ammonium salts and urea, and more preferably ammonium salts. The amount of the nitrogen source added is, for example, 1 to 50 mM. The amount of the nitrogen source added is preferably 5 to 40mM, more preferably 10 to 30mM, based on the amount of nitrogen added. Examples of the phosphorus source include phosphate (e.g., potassium dihydrogen phosphate). The amount of the phosphorus source added is preferably 0.1 to 10mM in terms of phosphorus addition amount, and the amount of the phosphorus source added is preferably 0.5 to 5mM, more preferably 1 to 3mM in terms of phosphorus addition amount. Algae belonging to the class Rhodophyceae in the hot spring can be cultured in a medium using acidic hot spring drainage water, so that the acidic hot spring drainage water can be effectively utilized and the culture can be performed at low cost.

When the algae belonging to the class rhodophyceae is an algae belonging to the genus Galdieria, the nitrogen source is preferably an ammonium salt or urea, and more preferably an ammonium salt. When the algae belonging to the class rhodophyceae are algae belonging to the genus Cyanidium, the nitrogen source is preferably an ammonium salt or a nitrate salt, and more preferably an ammonium salt.

As described above, algae belonging to the class Rhodophyceae in the hot spring can be proliferated at a high density under relatively mild culture conditions. The pH condition is exemplified by pH1.0 to 6.0, preferably pH1.0 to 5.0. In the case of outdoor culture, it is preferable to culture under conditions of high acidity in order to prevent the proliferation of other organisms, and such conditions include pH1.0 to 3.0.

The temperature condition may be, for example, 15 to 50 ℃ and preferably 30 to 50 ℃. When the culture is carried out outdoors, the culture is preferably carried out at a high temperature in order to prevent the proliferation of other organisms, and such conditions include 35 to 50 ℃.

The light intensity may be 5 to 2000. mu. mol/m2s is preferably 5 to 1500. mu. mol/m2And s. When the culture is carried out outdoors, the culture may be carried out in sunlight. In the case of indoor cultivation, the cultivation may be carried out under continuous light, or a light-dark period (10L:14D, etc.) may be set.

< medicine >

The drug encapsulated in the acid-resistant cell is not particularly limited, and may be any drug. Examples of the drug include, but are not limited to, low molecular weight compounds, peptides, proteins, nucleic acids, lipids, saccharides, vitamins, hormones, and synthetic high molecular weight compounds. Among them, the drug is preferably at least one drug selected from the group consisting of low molecular compounds, peptides, proteins, and nucleic acids.

As the low-molecular compound, a low-molecular compound known as an active ingredient of a pharmaceutical can be used without particular limitation. The low molecular compound may be a contrast agent, a fluorescent dye, or the like for a diagnostic drug. Examples of the low-molecular weight compound include, but are not limited to, an immunopotentiator, an anticancer agent, a signal transduction inhibitor, a metabolic antagonist, an analgesic, an anti-inflammatory agent, an antibiotic, an antiallergic agent, a therapeutic agent for a central nervous system disease, a therapeutic agent for a circulatory organ disease, a therapeutic agent for a respiratory organ disease, a therapeutic agent for a digestive organ disease, a therapeutic agent for a urogenital organ disease, a contrast medium, and a fluorescent dye. The low-molecular compound is not limited to an active ingredient of a pharmaceutical product, and may be a component in a food (for example, a nutritional component such as an amino acid or a vitamin) or a food additive (for example, a flavor).

Examples of the nucleic acid include nucleic acid molecules used as nucleic acid medicines (e.g., siRNA, miRNA, antisense RNA, aptamer, decoy (decoy), CpG oligonucleotide, etc.).

Examples of the synthetic polymer compound include industrially produced polymer compounds such as polyolefin, polyester, and polyamide, and granular or spherical polymer compounds. Among them, there are also polymer compounds which can be expected to have an immunopotentiating effect.

The drug may be a microcapsule containing a low molecular compound, a sustained-release microcapsule, or a microcapsule releasing the drug depending on the environment such as temperature, pH, pressure, or the like.

As the peptide or protein (hereinafter, also collectively referred to as "pharmaceutical peptide"), a peptide or protein known as an active ingredient of a pharmaceutical can be used without particular limitation. Examples of the drug peptide include, but are not limited to, antigens, cytokines, growth factors, hormones, enzymes, antibodies, antibody fragments, ligands, blood component proteins, and the like.

Among them, the drug peptide is preferably an immunogenic drug peptide. The term "immunogenic" as used herein means that the pharmaceutical peptide is capable of inducing immunity to the pharmaceutical peptide in an organism to which the pharmaceutical peptide is administered. The immunity induced by the drug peptide may be cellular immunity, humoral immunity, or both.

The drug peptide is more preferably a drug peptide that contributes to intestinal immunity. "intestinal immunity" means a defense system of a living body for preventing the invasion of foreign substances from the intestinal tract into the body. The intestinal immune system is composed of lymphoid tissues such as Peyer's patches (Peyer's patches), immunocompetent cells of the mucosal lamina propria, intestinal epithelial cells, lymphocytes present therebetween, and the like. The pharmaceutical peptide that contributes to gut immunity may be one that acts on any one or more of these gut immune systems and strengthens the gut immune system.

Examples of the pharmaceutical peptide contributing to intestinal immunity include immunogenic peptides or immunogenic proteins of pathogenic microorganisms or pathogenic viruses (hereinafter, collectively referred to as "pathogenic bacteria"). The immunogenic pharmaceutical peptide can be appropriately selected according to the infectious disease infected by the subject to which the drug delivery composition of the present embodiment is applied. Immunogenic peptides or immunogenic proteins are also referred to as antigenic peptides or antigenic proteins.

For example, in the case of applying the drug delivery composition of the present embodiment to a human, an immunogenic peptide or an immunogenic protein of a human pathogenic bacterium may be used as the drug peptide. Examples of human pathogenic bacteria include, but are not limited to, rabies virus, rotavirus, influenza virus, aids virus, poliovirus, hepatitis a virus, hepatitis B virus, human papilloma virus, vibrio cholerae, salmonella, tubercle bacillus, streptococcus pneumoniae, anthrax, and typhoid bacillus.

For example, in the case of applying the drug delivery composition of the present embodiment to livestock, an immunogenic peptide or an immunogenic protein of a pathogen of livestock can be used as the pharmaceutical peptide. Examples of the livestock pathogenic bacteria include, but are not limited to, rabies virus, bovine rotavirus, bovine coronavirus, akabane virus, bovine adenovirus, bovine parainfluenza virus, bovine salmonella, tubercle bacillus, porcine circovirus, swine influenza virus, swine parvovirus, swine cholera virus, and streptococcus suis.

For example, an immunogenic peptide or an immunogenic protein can be designed using a full-length protein of a protein constituting the outer membrane or capsid of a pathogenic virus or a full-length protein of a cell membrane protein of a pathogenic bacterium or a partial peptide thereof. For example, when the pathogenic bacterium is rabies virus, the immunogenic protein may be the full length or a partial peptide of glycoprotein (the base sequence is seq id No. 1, and the amino acid sequence is seq id No. 2).

(localization of drug to bag-like film Structure)

In the acid-resistant cell, the drug is preferably localized in the pouch membrane structure of the acid-resistant cell. In the present specification, the term "bag-like membrane structure" means a structure divided into bags by a biological membrane or a biological membrane-mimicking structure, and specific examples thereof include cell membranes, organelles, and exogenous liposomes. Examples of the organelles include mitochondria, chloroplasts, endoplasmic reticulum, vacuoles, nuclei, peroxisomes, and golgi apparatus, but are not limited thereto. The term "exogenous liposome" means a liposome introduced into a cell from the outside.

The drug is locally present in the pouch membrane structure of acid-resistant cells, and thus degradation of the drug by a degrading enzyme in the cytoplasm can be suppressed. Therefore, the drug can be protected from decomposition by intracellular enzymes until the acid-resistant cells are delivered to a predetermined site (for example, intestine) in the living body and the acid-resistant cells are broken.

The method for locally allowing a drug to exist in a sac-like membrane structure is not particularly limited, and examples thereof include a method using a signal peptide (hereinafter referred to as "migration signal") that instructs migration to an arbitrary sac-like structure or a protein (hereinafter referred to as "migration protein") that migrates to the sac-like structure. For example, a migration signal or a migration protein targeting an arbitrary sac-like structure is combined with a drug and introduced into acid-resistant cells, whereby the drug can be localized in the sac-like structure. For example, in the case where a drug is localized at any site in mitochondria, vacuole, peroxisome, endoplasmic reticulum, cell membrane, golgi apparatus, and cell nucleus, the drug may be bound to a movement signal (signal peptide) or a movement protein for mitochondria, vacuole, peroxisome, endoplasmic reticulum, cell membrane, golgi apparatus, or cell nucleus. These movement signals and movement proteins can be selected from various well-known movement signals and movement proteins according to the type of acid-resistant cells. Alternatively, a moving signal or a moving protein can be obtained with respect to a sac-like membrane structure in which a drug is desired to be locally present by separating the sac-like membrane structure from acid-resistant cells by a cell fractionation method such as density gradient centrifugation, and analyzing the protein in the sac-like membrane structure.

For example, in the case of using cyanidiiscyzon merole as the acid-resistant cell, as the movement signal or movement protein, for example, the following movement signal or movement protein can be used.

As a mobile protein against chloroplasts, a protein or the like composed of 130 residues (SEQ ID NO: 5, SEQ ID NO: 6) on the N-terminal side of chloroplast proprotein translocase SecA subunit (CMQ 393C; SEQ ID NO: 3, SEQ ID NO: 4 in base sequence) of the chloroplast is used (Sumiya et al2016, Proc Natl Acad Sci U S A.113(47): E7629-E7638; PMID: 27837024).

As the signal for moving to the mitochondrial matrix, a peptide or the like composed of 78 residues (SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 300(6735): 1316-8; PMID:2369666) on the N-terminal side of EF-TU (CMS502C) (SEQ ID NO: 7, SEQ ID NO: 8 in the base sequence) can be used.

As a mobile protein against vacuoles, a prenylated Rab receptor PRA1 (methylated Rab aceptorpA 1) (CMJ260C) (SEQ ID NO: 7 in base sequence and SEQ ID NO: 8 in amino acid sequence), ABC transporter (ABC transporter) (CMS401C) (SEQ ID NO: 13 in base sequence and SEQ ID NO: 14 in amino acid sequence) or O-methyltransferase (O-methyltransferase) (CMT369C) (SEQ ID NO: 15 in base sequence and SEQ ID NO: 16) and the like (Yagisawa et al 2009, Plant J.60(5): 882-93; PMID:19709388) can be used.

As a mobile protein against peroxisome, Catalase (Catalase) (CMI050C) (SEQ ID NO: 17 in base sequence and SEQ ID NO: 18 in amino acid sequence) was used (Moriyama et al2014, plant No. 240(3): 585-98; PMID: 25009310).

As the mobile protein to the endoplasmic reticulum, ACC1(CMM188C) (SEQ ID NO: 19 in base sequence and SEQ ID NO: 20 in amino acid sequence), PAP (CMT239C) (SEQ ID NO: 21 in base sequence and SEQ ID NO: 22 in amino acid sequence) or ALA1(CMR396C) (SEQ ID NO: 23 in base sequence and SEQ ID NO: 24 in amino acid sequence) and the like (Mori et al2016, Front Plant Sci.7: 958; PMID:27446184) can be used.

As a mobile protein against a cell membrane, ALA1(CMR396C) and the like (Mori et al2016, Front Plant Sci.7: 958; PMID:27446184) can be used. Since ALA1(CMR396C) is also a mobilization protein for the endoplasmic reticulum, the use of ALA1(CMR396C) enables the drug to be localized in both the cell membrane and the endoplasmic reticulum.

As a mobile protein to Golgi apparatus, Got1(CMI302C) (SEQ ID NO: 25 in base sequence, amino acid sequence: 286) and the like can be used (Yagisawa et al 2013, Protoplasma.250 (4): 943-8; PMID: 23197134).

As the movement protein for cell nucleus, topoisomerase I type IB (Toposisomerase I type IB) (CMM263C) (base sequence: SEQ ID NO. 27, amino acid sequence: SEQ ID NO. 28) and the like can be used (Moriyama et al2014, Genome Biol Evol.6 (1): 228-37; PMID: 24407855).

In the case where the drug is a drug peptide, the drug peptide may be included in the acid-resistant cell as a fusion protein with a movement signal or a movement protein. By forming the drug peptide as a fusion protein with a migration signal or a migration protein, the drug can be localized in a pocket membrane structure targeted by the migration signal or the migration protein.

For example, by introducing a gene encoding a fusion protein of a drug peptide and a movement signal or a movement protein (hereinafter also referred to as "fusion protein gene") into an acid-resistant cell and expressing the fusion protein in the acid-resistant cell, the fusion protein moves to a pouch-like membrane structure to which the movement signal or the movement protein is targeted. As a result thereof, the drug peptide contained in the fusion protein is locally present in the pouch membrane structure. Therefore, in a preferred embodiment, the acid-resistant cell is a cell into which a fusion protein gene including a movement signal or a fusion protein of a movement protein and a drug peptide is introduced in an expressible state, and has the fusion protein gene. In a preferred embodiment, the acid-resistant cell is a cell expressing the fusion protein gene.

The fusion protein gene may include a sequence encoding a peptide that enhances recognition by intestinal cells, in addition to a sequence encoding a drug peptide and a sequence encoding a movement signal or a movement protein. Examples of the peptide that enhances recognition by intestinal cells include Co1 peptide (seq id No. 43).

The fusion protein gene of the drug peptide and the movement signal or the movement protein is preferably operably linked to a promoter capable of functioning in acid-resistant cells. The promoter is not particularly limited as long as it is a promoter capable of functioning in acid-resistant cells, and a promoter of a housekeeping gene having a large expression amount is preferable from the viewpoint of maintaining the amount of a drug in cells. For example, when the acid-resistant cell is cyanidiiscyzon merole, a promoter such as a promoter of APCC (CMO250C) (e.g., -600 to-1; -1 "represents the nucleotide immediately preceding the initiation codon), a promoter of CPCC (CMP166C), or a promoter of catalase (CMI050C) can be suitably used. The promoter sequence of the APCC of CyanidioSchyzon merole is shown in SEQ ID NO. 29, the promoter sequence of the CPCC (CMP166C) of CyanidioSchyzon merole is shown in SEQ ID NO. 30, and the promoter sequence of the catalase (CMI050C) of CyanidioSchyzon merole is shown in SEQ ID NO. 31. These promoters of Cyanidioschyzon merole can also be used in other algae belonging to the class Rhodophyceae.

The gene encoding the fusion protein is introduced into the acid-resistant cell in an expressible state, for example, in the form of an expression vector. The expression vector may contain, in addition to the fusion protein and the promoter, a marker gene such as an enhancer, a poly-A addition signal, a terminator, a control sequence such as 3' UTR, and a drug resistance gene. Examples of the terminator and 3 'UTR include 3' UTR of β -tubulin.

The type of vector is not particularly limited, and a commonly used expression vector can be appropriately selected and used according to the type of acid-resistant cell. The vector may be linear or circular, and may be a non-viral vector such as a plasmid, a viral vector (e.g., a retroviral vector such as a lentiviral vector), or a transposon-based vector.

In the case where the acid-resistant cell is cyanidiiscyzon merole, the URA5.3 gene (CMK046C) can be used as a selection marker. Among the Cyanidioschyzon merones, there is a Cyanidioschyzon merone strain M4(Minoda et al, Plant Cell Physiol.2004 Jun; 45 (6): 667-71.) as a variant of uracil auxotrophy. Cyanidioschyzon merone M4 strain had a mutation in the URA5.3 gene, and uracil could not be synthesized. Therefore, the strain Cyanidioschyzon merone M4 could not grow in a medium containing no uracil. Therefore, a transformant into which a fusion gene has been introduced can be selected by using the cyanidiuschyzon merole M4 strain as a parent strain and the URA5.3 gene of the wild strain as a selection marker. More specifically, the fusion protein gene operably linked to a promoter is ligated to the URA5.3 gene set of a wild strain (for example, 10D strain) of Cyanidioschyzon merole, and introduced into Cyanidioschyzon merole M4 strain. Thereafter, the cells into which the fusion protein gene has been introduced can be obtained by culturing the cells in a medium containing no uracil.

The method for introducing an arbitrary fusion protein gene into acid-resistant cells is not particularly limited, and a known method can be used. Examples of the gene transfer method include a polyethylene glycol method, a lipofection method, a microinjection method, a DEAE dextran method, a particle gun method, an electroporation method, and a calcium phosphate method.

The fusion protein gene may be present as a plasmid or the like in acid-resistant cells, or may be inserted into any of the nuclear genome, chloroplast genome, and mitochondrial genome. When the fusion protein gene is inserted into the genome, it may be inserted into a specific position of the genome, or may be randomly inserted into the genome.

As a method for inserting a fusion protein gene into a specific position of a genome, homologous recombination can be used. For example, in the case of Cyanidioscheyzon merone, since the whole genome sequence interpretation has been completed (Matsuzaki M et al, Nature.2004Apr 8; 428 (6983): 653-7.), it is possible to insert the fusion protein gene into a desired position on the genome. The insertion position of the fusion protein gene of Cyanidioscheyzon merole is not particularly limited, and for example, a region between CMD184C and CMD185C can be exemplified.

In the fusion protein gene, the order of disposing the drug peptide and the movement signal or movement protein may be appropriately selected depending on the kind of the movement signal or movement protein. Typically, the movement signal or the coding sequence for the movement protein is disposed more 5' to the coding sequence for the drug peptide.

In the case where a gene encoding a drug peptide (hereinafter referred to as "drug peptide gene") is inserted into the chloroplast genome or the mitochondrial genome, the drug peptide need not necessarily be a fusion protein with a movement signal or a movement protein. For example, a drug peptide gene is operably linked to a promoter capable of functioning in chloroplasts, inserted into the chloroplast genome in an expressible state, and expressed in chloroplasts, thereby allowing the drug peptide to locally exist in the chloroplasts. Similarly, the drug peptide gene is inserted into the mitochondrial genome in an expressible state by operably linking the drug peptide gene to a promoter capable of functioning in mitochondria, so that the drug peptide gene is expressed in the mitochondria, thereby enabling the drug peptide to be locally present in the mitochondria.

In the drug delivery composition of the present embodiment, the drug is preferably present locally in an organelle, more preferably a chloroplast. In addition, the drug is preferably a drug peptide, and is preferably present locally in an organelle targeted by a movement signal or movement protein in the form of a fusion protein with the movement signal or movement protein. The movement signal or movement protein is more preferably a chloroplast movement signal or chloroplast movement protein.

< optional component >

The drug delivery composition of the present embodiment may further include other components in addition to the acid-resistant cells. The other component is not particularly limited, and examples thereof include pharmaceutically acceptable carriers. The term "pharmaceutically acceptable carrier" means a carrier that does not interfere with the function of acid-resistant intracellularly encapsulating a drug and does not substantially exhibit toxicity to the subject to which the drug is administered. The phrase "does not exhibit substantial toxicity" means that the component does not exhibit toxicity to the subject at the dose usually used. The pharmaceutically acceptable carrier is not particularly limited, and examples thereof include excipients, binders, disintegrants, lubricants, emulsifiers, stabilizers, diluents, oily bases, thickeners, antioxidants, reducing agents, oxidizing agents, chelating agents, solvents, and the like. One kind of the pharmaceutically acceptable carrier may be used alone, or two or more kinds may be used in combination. The pharmaceutically acceptable carrier is preferably a carrier that does not damage acid-resistant cells.

The drug delivery composition of the present embodiment can be appropriately mixed with other ingredients and formed into a form of granules, tablets, gels, liquid preparations, capsules, and the like according to a general method. Among these pharmaceutical forms, the pharmaceutical form that does not damage acid-resistant cells is preferable, and for example, a gel-like agent, a liquid preparation, a capsule, and the like are preferable. For example, as shown in examples described later, the acid-resistant cells can be included in the form of a solidified product of alginic acid. In addition, instead of alginic acid, a suspension containing acid-resistant cells may be solidified using a thickener or gelling agent such as gelatin, agar, carrageenan, locust bean gum, guar gum, xanthan gum, pectin, gellan gum, tamarind gum, or gum arabic, and used as the drug delivery composition of the present embodiment. The medium used for suspending the acid-resistant cells is not particularly limited, but is preferably a medium that does not rupture the acid-resistant cells, and is preferably an isotonic solution having a pH of about 1 to 6. Examples of the isotonic solution include a culture medium for culturing acid-resistant cells, a glucose isotonic solution adjusted to a pH of about 1 to 6, a sucrose isotonic solution, and various buffers (phosphate buffered saline, HEPES buffer, citric acid buffer, Tris buffer (Tris buffer), and the like). In one embodiment, the drug delivery composition is a solidified body of acid-resistant cells utilizing a viscosity increasing agent or a gelling agent. By making the drug delivery composition a solidified body using a thickening agent and/or a gelling agent, drying of acid-resistant cells can be prevented. The "solidified body of acid-resistant cells using a thickening agent or a gelling agent" refers to a solidified body obtained by gelling and solidifying a suspension of acid-resistant cells using a thickening agent or a gelling agent. In other words, the "solidified body of acid-resistant cells using a thickening agent or a gelling agent" is a gel composition containing acid-resistant cells and at least one selected from the group consisting of thickening agents and gelling agents.

The route of administration of the drug delivery composition of the present embodiment is not particularly limited, and oral administration or non-oral administration may be used, and oral administration is preferred. In the drug delivery composition of the present embodiment, since the drug is contained in the acid-resistant cell, the drug decomposition by gastric acid can be suppressed. Thus, the drug delivery composition of this embodiment is suitable for oral administration.

The drug delivery target of the drug delivery composition of the present embodiment is preferably the intestine (intestinal tract), more preferably the small intestine. If the drug delivery composition of the present embodiment is administered orally, the drug is protected inside the cells of the acid-resistant cells and passes through the stomach. When the drug reaches the intestine, the acid-resistant cells are broken due to the neutral to weakly alkaline pH condition (pH7 or more) in the intestinal tract, and the drug is released into the intestinal tract. The drug released into the intestinal tract acts in the intestinal tract, and contributes to enhancement of intestinal immunity and the like. Further, it is expected that other mucosal immunity and systemic immunity are activated by the enhancement of intestinal immunity.

As described above, according to the drug delivery composition of the present embodiment, since the drug is contained in the acid-resistant cell, the drug can be inhibited from being decomposed in the stomach, and the drug can be delivered to the intestine. In addition, in acid-resistant cells, since the drug is locally present in a bag-like membrane structure, the drug is protected from decomposition by a lytic enzyme in the cytoplasm.

Furthermore, by using acid-resistant cells into which a drug peptide gene or a fusion protein gene containing a coding sequence of a drug peptide is introduced, acid-resistant cells containing a drug can be easily proliferated. In particular, algae belonging to the class Rhodophyceae in the hot spring can proliferate under conditions where acidity is high and other organisms cannot survive, and therefore, can be cultured outdoors in large quantities. Therefore, reduction in manufacturing cost can be expected.

[ feed ]

In one embodiment, the present invention provides a feed comprising the drug delivery composition of the above embodiments.

The kind of animal to which the feed of the present embodiment is administered is not particularly limited. Examples thereof include, but are not limited to, domestic animals (e.g., cattle, pigs, chickens, horses, sheep, and goats), pets (e.g., dogs, cats, hamsters, rabbits, parrots, tropical fishes, reptiles, amphibians, and insects), aquatic animals (e.g., fishes and shellfishes), and test animals (e.g., mice, rats, and guinea pigs).

The feed of this embodiment may contain other ingredients in addition to the drug delivery composition of the above embodiment. Examples of the other ingredients include commonly used feeds (including livestock feeds, aquatic feeds, and pet foods). For example, the drug delivery composition of the above embodiments may be added to existing feed as a feed additive. The feed to which the drug delivery composition of the above embodiment is added is not particularly limited, and may be appropriately selected depending on the subject animal. The drug delivery composition of the embodiment can be administered to an animal by adding it to a usual feed, and the animal can be caused to take in the drug by a usual feeding action.

The drug delivery composition used in the feed of the present embodiment may be in any form, but in order to prevent the drug from leaking out of the acid-resistant cells, it is preferably in a form that does not damage the cells of the acid-resistant cells. Examples thereof include the gel-like agents and capsules described above, and the forms cured by a gelling agent and/or a thickener. When the drug delivery composition is added to a feed as a feed additive, for example, a solidified body of the drug delivery composition using a thickener and/or a gelling agent may be adjusted to an appropriate size, added to the feed, and mixed. Alternatively, the mixture may be cured using a gelling agent and/or a viscosity increasing agent after the drug delivery composition is added to the feed and mixed. The solidified body can be prepared to an appropriate size as appropriate according to the size of an animal. By forming the cured product using a thickening agent and/or a gelling agent, drying of acid-resistant cells can be prevented.

The content of the drug delivery composition of the above embodiment in the feed of the present embodiment is not particularly limited, and may be appropriately set according to the kind of feed. For example, the content of the drug delivery composition in the feed may be, for example, 0.01 to 80% by mass, preferably 0.1 to 70% by mass, more preferably 0.1 to 60% by mass, and particularly preferably 0.1 to 50% by mass. Examples of the content of acid-resistant cells in the feed include 0.1 to 100mg (wet weight)/g, 0.5 to 80mg (wet weight)/g, and 1 to 60mg (wet weight)/g.

According to the feed of the present embodiment, since the drug delivery composition of the above embodiment is contained, any drug can be taken into an animal as a feed. As described above, the drug delivery composition is capable of protecting any drug from decomposition in the stomach and delivery to the intestine. Therefore, by using a drug acting in the intestine for the drug delivery composition, the drug can be effectively made to act on the intestine of an animal. In addition, in the case where the drug is a drug peptide having immunogenicity, intestinal immunity can be effectively activated for animals taking the drug delivery composition. Furthermore, by activating intestinal immunity, it is also expected to activate other mucosal immunity and systemic immunity.

In another aspect, the invention provides a method of feeding an animal comprising feeding the animal a feed comprising the drug delivery composition of the above embodiments.

Furthermore, in another aspect, the invention provides a method of conferring intestinal immunity to an animal comprising feeding the animal a feed comprising the drug delivery composition of the above embodiments.

[ medicinal products ]

In one embodiment, the present invention provides a pharmaceutical comprising the drug delivery composition of the above embodiment.

The pharmaceutical product according to the present embodiment may be a human pharmaceutical product or an animal pharmaceutical product. In the case of animal drugs, the type of animal to which the drugs are applied is not particularly limited. Examples thereof include, but are not limited to, domestic animals (e.g., cattle, pigs, chickens, horses, sheep, and goats), pets (e.g., dogs, cats, hamsters, rabbits, parrots, tropical fishes, reptiles, amphibians, and insects), aquatic animals (e.g., fishes and shellfishes), and test animals (e.g., mice, rats, and guinea pigs).

The pharmaceutical product of the present embodiment may contain other components in addition to the drug delivery composition of the above embodiment. The other component is not particularly limited, and may be a pharmaceutically acceptable carrier. By "pharmaceutically acceptable carrier" is meant a carrier that does not interfere with the function of the drug and does not exhibit substantial toxicity to the subject to which it is administered. The phrase "does not exhibit substantial toxicity" means that the component does not exhibit toxicity to the subject within the dose usually used. The pharmaceutically acceptable carrier is not particularly limited, and examples thereof include excipients, binders, disintegrants, lubricants, emulsifiers, stabilizers, diluents, oily bases, thickeners, antioxidants, reducing agents, oxidizing agents, chelating agents, solvents, and the like. One kind of the pharmaceutically acceptable carrier may be used alone, or two or more kinds may be used in combination. The other components may be other than those described above, and for example, pharmaceutical additives generally used in pharmaceuticals may be used without particular limitation. In addition, the other ingredient may also be an active ingredient other than the drug contained in the above-mentioned drug delivery composition. The active substance is not particularly limited, and examples thereof include an intestinal tract regulating agent, an anti-inflammatory agent, an antibiotic substance, an antibacterial substance, a crude drug, a structure promoter, an antipyretic agent, and an analgesic agent.

The dosage form of the pharmaceutical product of the present embodiment is not particularly limited, but in order to prevent leakage of the drug from the acid-resistant cells, it is preferable that the acid-resistant cells are in a form that does not cause cell damage. Examples thereof include tablets, granules, gels, capsules, liquid preparations, and syrups. For example, the drug of the present embodiment may contain a solidified product of acid-resistant cells using a thickener and/or a gelling agent.

The content of the drug delivery composition of the above embodiment in the drug of the present embodiment is not particularly limited, and can be appropriately set according to the kind of the drug contained in the drug delivery composition. For example, the content of the drug delivery composition in the pharmaceutical product may be, for example, 0.01 to 80% by mass, preferably 0.1 to 70% by mass, more preferably 0.1 to 60% by mass, and particularly preferably 0.1 to 50% by mass. Examples of the content of acid-resistant cells in the pharmaceutical product include 0.1 to 100mg (wet weight)/g, 0.5 to 80mg (wet weight)/g, and 1 to 60mg (wet weight)/g.

The route of administration of the pharmaceutical product of the present embodiment is not particularly limited, and may be oral administration or non-oral administration, and oral administration is preferred. The drug of the present embodiment is encapsulated in acid-resistant cells, and therefore, decomposition of the drug by gastric acid can be suppressed.

The drug delivery target of the drug of the present embodiment is preferably the intestine (intestinal tract), and more preferably the small intestine.

Since the drug according to the present embodiment includes the drug delivery composition according to the above-described embodiment, any drug can be protected from decomposition in the stomach, and the drug can be delivered to the intestine. Therefore, by using a drug acting in the intestine for the drug delivery composition, the drug can be effectively made to act on the intestine. In addition, in the case where the drug is a drug peptide having immunogenicity, intestinal immunity can be effectively activated for animals taking the drug delivery composition. Furthermore, by activating intestinal immunity, it is also expected to activate other mucosal immunity and systemic immunity.

Therefore, the pharmaceutical product of the present embodiment can be used for preventing and treating human diseases and improving health. It is particularly suitable for use in the following medicaments: drugs that are not absorbed in the stomach but absorbed in the intestine are desired to be drugs that are decomposed or insolubilized by gastric acid and that prevent intestinal absorption, and drugs that absorb a variety of drugs once in the intestine are used.

In another aspect, the present invention provides a method for administering a drug, comprising orally administering to a subject a drug comprising the drug delivery composition of the above embodiment.

In another aspect, the present invention provides a method of administering intestinal immunity to a subject, comprising orally administering to the subject a pharmaceutical comprising the drug delivery composition of the above embodiment.

[ food ]

In one embodiment, the present invention provides a food product comprising the drug delivery composition of the above embodiments.

The food of the present embodiment may be a normal food, a nutritional supplement food, a functional food, a health product, or the like. The drug delivery composition may be added to food as a food additive.

In the food of the present embodiment, the type of the food is not particularly limited, but in order to prevent the drug from leaking out of the acid-resistant cells, the acid-resistant cells are preferably in a form that does not cause cell damage, and preferably are not dried foods. Examples of the food include: green juice (green juice), refreshing beverage, carbonated beverage, nutritious beverage, fruit beverage, vegetable beverage, lactic acid beverage, milk beverage, sports beverage, tea, coffee, etc.; curry sauce, stew soup, instant soup, etc.; ice food such as ice cream, sherbet, water ice, etc.; cake products such as maltose, jelly, jam, and butter; processed food of aquatic products and livestock products such as fish cake, fish meat and sweet potato cake, ham, sausage, etc.; processed milk, fermented milk, butter, cheese, yogurt, etc.; sauces, salad dressings, miso, soy sauce, sauces, and the like; other processed foods such as various soft canned foods; but are not limited to these.

In the food of the present embodiment, the content of the drug delivery composition is not particularly limited, and may be appropriately set according to the type of food. For example, the content of the drug delivery composition in the food may be, for example, 0.01 to 80% by mass, preferably 0.1 to 70% by mass, more preferably 0.1 to 60% by mass, and particularly preferably 0.1 to 50% by mass, in consideration of the taste of the food. Examples of the content of acid-resistant cells in the food include 0.1 to 100mg (wet weight)/g, 0.5 to 80mg (wet weight)/g, and 1 to 60mg (wet weight)/g.

In the case where the food is a functional food, a nutritional supplement, a health product or the like, it may be in the form of a usual food as described above, or may be in the form of granules, tablets, jellies, drinks or the like. For example, the food product of the present embodiment may contain a solidified product of acid-resistant cells using a thickener and/or a gelling agent.

According to the food of the present embodiment, since the drug delivery composition of the above embodiment is included, any drug can be ingested as a food. As described above, the drug delivery composition can protect any drug from decomposition in the stomach so that it can be delivered to the intestine. The food of the present embodiment is useful when one or more specific nutrient components are to be absorbed in the intestine without being affected by gastric acid.

[ drug Carrier ]

In one embodiment, the present invention provides a drug carrier comprising acid-resistant cells.

The acid-resistant cells contained in the drug carrier of the present embodiment are the same as those described in "< acid-resistant cells >" of the above "[ drug delivery composition ]", and the same examples can be given as preferable examples. The acid-resistant cells are resistant to acids and the cells are not broken even in the acidic environment of the stomach. Therefore, by encapsulating a drug in a cell, it can be used as a carrier for an acid-resistant drug. Examples of a method for entrapping a drug in a cell include the same methods as those described in the above-mentioned "[ drug delivery composition ]". The drug carrier of the present embodiment is preferably composed of acid-resistant cells.

The drug carrier of the present embodiment can be suitably used for delivering a drug into the intestine, and can be suitably used for a pharmaceutical product to be orally administered or a feed or food to be orally ingested.

[ medicinal capsules ]

In one embodiment, the present invention provides a drug capsule having a drug encapsulated within a drug carrier of the embodiment.

The acid-resistant cells contain a drug in the cells, and as shown in examples described later, the drug is hardly released in an acidic environment such as the stomach. Therefore, the drug carrier containing the acid-resistant cells can be used as an acid-resistant drug capsule by encapsulating a drug in the acid-resistant cells. The drug capsule of the present embodiment can be used as an oral drug capsule for the purpose of delivering a drug into the intestine.

[ acid-resistant cells ]

In one embodiment, the present invention provides an acid-resistant cell encapsulating a drug in the cell. In a preferred embodiment, the drug is locally present in a pouch membrane structure possessed by acid-resistant cells.

The acid-resistant cell of the present embodiment is the same as the acid-resistant cell contained in the drug delivery composition of the above embodiment, and the same examples can be given as a preferable example. Alternatively, the drug is locally present outside the bag-like membrane structure possessed by the acid-resistant cells. In the case where the drug is locally present outside the sac-like membrane structure, the drug is present in the cytoplasm of acid-resistant cells.

The drug is not particularly limited, and for example, at least one drug selected from the group consisting of low molecular compounds, peptides, proteins, and nucleic acids is preferable. For example, in the case of a drug which is affected by a degradation enzyme derived from the cytoplasm, etc., it is preferable that the drug is locally present in the pouch membrane structure. By locally existing in the bag-like membrane structure, the drug can be protected from the influence of a decomposing enzyme or the like in the cytoplasm. Therefore, the drug can be efficiently delivered to a prescribed site in the living body. For example, in the case where the drug is a peptide, protein or nucleic acid, it is susceptible to proteases or nucleases in the cytoplasm; therefore, it is preferable that the film is locally present in the pouch film structure. On the other hand, in the case of a drug (e.g., a low-molecular compound) which is hardly affected by a lytic enzyme or the like in the cytoplasm, the drug may be present locally outside the sac-like membrane structure.

In addition, in one embodiment, the invention provides an acid-resistant cell comprising an exogenous material.

The acid-resistant cell of the present embodiment is the same as that described in "< acid-resistant cell >" of the above "[ drug delivery composition ]", and preferred examples thereof are also the same.

The exogenous substance is not particularly limited, and examples thereof include drugs, poisons, dyes, perfumes, compounds having no known action on living bodies, and the like, but are not limited thereto. The method for introducing the exogenous substance into the acid-resistant cell is not particularly limited, and examples thereof include: a method of binding to a cell-permeable substance (e.g., a cell-permeable peptide), and a method of encapsulating the cell-permeable micelle. In addition, in the case where the exogenous substance is a drug, the same method as that described in the above "[ drug delivery composition ]" can be cited.

The acid-resistant cell of this embodiment can be used, for example, to deliver exogenous substances. More specifically, the acid-resistant cells of the present embodiment may be applied to an oral composition for delivering exogenous substances into the intestine.

In another embodiment, the present invention provides a feed containing the acid-resistant cell.

In another aspect, the present invention provides a pharmaceutical containing the acid-resistant cell.

In another aspect, the present invention provides a food product containing the acid-resistant cell.

Further, in another aspect, the present invention provides a method for administering the exogenous substance, which comprises orally administering the acid-resistant cell to a subject.

In another aspect, the present invention provides a method for feeding an animal, comprising feeding the animal with the acid-resistant cells.

In addition, in another aspect, the present invention provides a method for conferring intestinal immunity, comprising orally administering the acid-resistant cell.

[ method for producing acid-resistant cells ]

In one embodiment, the present invention provides a method for producing acid-resistant cells including a drug, the method including the step of introducing a gene encoding a fusion protein into the acid-resistant cells, the fusion protein including: peptides or proteins as drugs, as well as peptides or proteins that are locally present with respect to the cell membrane or organelle.

The production method of the present embodiment can be carried out in the manner described in "[ drug delivery composition ] < acid-resistant cell > (localization of drug to pouch-like membrane structure)".

Examples

The present invention will be described below with reference to examples, but the present invention is not limited to the following examples.

[ example 1]

(preparation of GAPDH-GP-sfGFP-expressing Strain)

In order to insert a DNA fragment of GAPDH-GP-sfGFP into CMD184C (gene number) of chromosome of Cyanidioscheyzon merole 10D, first, plasmid pD184-HSp-GAPDH-GP-sfGFP was prepared as shown below.

The following sequences were designed to be arranged in order at the multiple cloning site of pQE80 plasmid (for maintenance and replication in E.coli; QIAGEN). The sequence was arranged in order from the 5' side into the latter half of CMD184C gene (773 bp-2773bp of the Open Reading Frame (ORF) of the gene and 25bp downstream including the stop codon), Heat Shock (HS) promoter (200 bp of the sequence near the upstream of the start codon of HSP20/CMJ101C gene; Sumiya et al2014, Plos one.22; 9 (10): e 111261; PMID: 25337786), GAPDH (1 bp-1209bp of the open reading frame of CMJ042C gene; GAPDH described in Moriyama et al2014, planta.240(3): 585-98; PMID:25009310), rabies glycoprotein gene GP (full length ORF 1-1572bp, ProtKB accession No. P19462), beta tubulin terminator (200 bp downstream of the CMD 263C gene including the stop codon), downstream 200bp of URA screening marker and downstream nucleotide sequence from the downstream of the URA 185 bp-185 (bp) of the downstream of the gene). In order to heat the medium to induce expression of GAPDH-GP-sfGFP, the HS promoter is necessary. URA selection marker is necessary for the selection of GAPDH-GP-sfGFP strain. In order to insert a DNA fragment downstream of CMD184C by homologous recombination, the sequence of the latter half and downstream of CMD184C and downstream of the CMD185C gene were necessary.

First, in order to prepare plasmid pD184-HSp-GAPDH-GP-sfGFP, each of the DNA fragments of the following (1), (2), (3), (4) and (5) was prepared.

(1) Plasmid pD184-APCCp-EGFP-URACm-Cm(comprising pQE80 (SEQ ID NO: 32), the second half of CMD184C (SEQ ID NO: 33), the APCC promoter (SEQ ID NO: 34), EGFP (SEQ ID NO: 35), beta tubulin terminator (SEQ ID NO: 36), URA selection marker (SEQ ID NO: 37) and the DNA sequence downstream of CMD185C gene (SEQ ID NO: 38); Fujiwara et al 2013, PLoS one.8 (9): e 73608; PMID: 24039997) as templates, using a primer set [ #1d184(+25) R/#2bT 3' (+1) F]The DNA sequence of the portion other than the APCC promoter and EGFP was amplified by PCR. (1) The nucleotide sequence of the DNA fragment of (1) is shown in SEQ ID NO. 31.

(2) A DNA sequence of the HS promoter (SEQ ID NO: 39) was amplified by PCR using C.merole 10D genomic DNA as a template and a primer set [ #3HS (-200) Fd184/#4HS (-1) R ].

(3) The GAPDH gene open reading frame (SEQ ID NO: 40) was amplified by PCR using C.merole 10D genomic DNA as a template and a primer set [ #5J042(1) Fhs/#6J042(1209) R-link3 ].

(4) A DNA sequence of GP was chemically synthesized based on the codon usage frequency of C.merolae (SEQ ID NO: 41), and amplified by PCR using a primer set [ #7GP (1) F-linker3/#8GP (1572) R-linker2] as a template.

(5) pAPCC-promoter-sfGFP-pmE2F-URA (Miyagishima et al2014, Nat Commun.5: 3807; PMID: 24806410) was used as a template, and sfGFP (SEQ ID NO: 42) was amplified by PCR using a primer set [ #9sfGFP (1) F-linker2/#10sfGFP (714) Rbt ].

The DNA fragments of (1), (2), (3), (4) and (5) above were mixed and fused using In-Fusion (registered trademark) HD Cloning Kit (product No. 639648, TAKARA) to replace pD184-APCCp-EGFP-URACm-GsThe HS promoter, GAPDH, GP and sfGFP were inserted as part of the APCC promoter and EGFP. After the Infusion reaction, E.coli competent cells were introduced, and the plasmid was amplified to obtain pD 184-HSp-GAPDH-GP-sfGFP. Next, this was used as a template with the primer set [ #11D184(1200) F/#12D184(+1400) R]A DNA fragment to which the latter half of CMD184 gene (1200 bp-2773bp of ORF of the gene and 25bp downstream including the stop codon), HS promoter, GAPDH, GP, sfGFP, beta tubulin terminator, URA selection marker and the downstream of CMD184C gene (base sequence from 28bp to 1440bp downstream of the stop codon) were ligated was amplified by PCR.

This DNA fragment was introduced into C.merole uracil auxotrophic strain M4(Minoda et al 2004, Plant Cell physiol.45 (6): 667-71.; PMID: 15215501) by the PEG method (Ohnuma et al2008, Plant Cell physiol.49 (1): 117-20; PMID: 18003671), and screened using MA2 solid medium containing no uracil to obtain GAPDH-GP-sfGFP expressing strain.

(evaluation of proteasome-induced degradation of GAPDH-GP-sfGFP protein)

The GAPDH-GP-sfGFP-expressing strain of C.merolae (hereinafter referred to as "GAPDH-GP-sfGFP-expressing strain") prepared as described above was subcultured in 60mL of MA2 medium in a conical flask at a cell concentration of OD750 ═ 0.2, and irradiated with light (50. mu. mol)-2s-1) The cells were cultured at 40 ℃ for 2 days by rotation (before expression). Then, 20mL of the culture solution was transferred to two Erlenmeyer flasks at a time. To induce GAPDH-GP-sfGFP gene expression by thermal stimulation, the two flasks were transferred to a 50 ℃ incubator and subjected to rotary culture under light irradiation for 1 hour. Just before the transfer to 50 ℃, in order to prevent the decomposition of the protein by the proteasome, the final concentration of the protein in one of the two Erlenmeyer flasks was 100. mu.MThe proteasome inhibitor MG-132(MG-132(+)) (Nishida et al 2005; Mol Biol cell.16 (5): 2493-502; PMID 15772156) was added. To the other Erlenmeyer flask, as a control, only 40. mu.L of DMSO (MG-132(-)) as a solvent for MG-132 was added. The expression of the GAPDH-GP-sfGFP protein was confirmed by immunoblotting, and the effect of proteasome inhibition was confirmed by comparing the band patterns. An anti-GFP antibody (clone JL-8, product No. 632381, Takara) was used for the detection of the GAPDH-GP-sfGFP protein.

The results of the immunoblotting are shown in figure 1. In MG-132(-), the GAPDH-GP-sfGFP protein band was thinner compared to MG-132 (+). From the results, it was revealed that a part of the GAPDH-GP-sfGFP protein was decomposed by proteasome after expression.

(analysis of intracellular localization of GAPDH-GP-sfGFP protein)

In order to analyze the intracellular localization of GAPDH-GP-sfGFP protein, GAPDH-GP-sfGFP expressing strain was cultured under light irradiation at 50 ℃ in the presence of MG-132 for 1 hour, and then the fluorescence of GAPDH-GP-sfGFP protein was observed by a fluorescence microscope.

A fluorescence microscope image of the GAPDH-GP-sfGFP expressing strain is shown in FIG. 2. Based on the fluorescence signal of sfGFP, it was shown that the GAPDH-GP-sfGFP protein is localized in the cytoplasm. The image (PC) in fig. 2(a) is a phase-difference microscope image showing the contour of the cell, the image (Chl) in fig. 2(B) is an autofluorescence image of chloroplast, and the image (sfGFP) in fig. 2(C) is a fluorescence image of sfGFP.

[ example 2]

(preparation of Chl-TP-3HA-GP-Col expression Strain)

In order to insert the DNA fragment for expressing Chl-TP-3HA-GP-Col (see FIG. 3) downstream of CMD184C (gene number) of chromosome C.merolae 10D, plasmid pD184-APCCp-Chl-TP-3HA-GP-Col was first prepared as shown below.

This plasmid was designed to arrange the following sequences in the order from the 5' side in the multiple cloning site of the pQE80 plasmid. The sequence is arranged into the rear half part (773 bp-2773bp of gene ORF and downstream 25bp including stop codon) of CMD184C gene and APCC starting sequence from the 5' sideMover (sequence 600bp upstream of start codon of gene APCC/CMO 250C), chloroplast transit signal Chl-TP (1 bp-390bp of ORF of SECA/CMQ393C gene; Sumiya et al2016, Proc Natl Acad Sci U S A.113(47): E7629-E7638; PMID:27837024), sequence encoding 3xHA tag (for confirming expression using HA antibody), rabies virus glycoprotein gene GP (1572bp, UniProtKB access No. P19462), sequence encoding Col peptide (Col peptide: SFHQLPARSPLP (SEQ ID NO: 43), peptide enhancing antigen recognition of M cells involved in intestinal immunity; Kim 2010, J munol.185 (10): 5787-95; PMID: 20952686), beta tubulin gene terminator (downstream 200bp of gene β -tubulin/N263C including stop codon), URACm-GsA selection marker and the downstream of the CMD185 gene (base sequence from 28bp to 880bp downstream of the stop codon). In order to insert a DNA fragment downstream of CMD184C by homologous recombination, the sequence of the latter half and downstream of CMD184C and downstream of the CMD185 gene were necessary. In order to express Chl-TP-HA-GP-Col constantly, the APCC promoter is necessary (Watanabe et al 2011, J Gen Appl Microbiol.57 (1): 69-72; PMID: 21478650). To select transformants having Chl-TP-3HA-GP-Col inserted, URACm-GsA selection marker is required (Imamura et al 2010, Plant Cell Physiol.51 (5): 707-17; PMID: 20375110) and, in addition, enables an increase in the expression of the protein which makes the gene multicopy (Fujiwara et al 2013, ploS One 8 (9): e 73608; PMID: 24039997).

To prepare the plasmid pD184-APCCp-Chl-TP-HA-GP-Col, the DNA fragments of the following (1), (2), (3) and (4) were prepared.

(1) Plasmid pD184-APCCp-EGFP-URACm-Gs(including pQE80 (SEQ ID NO: 32), the posterior half of CMD184C (SEQ ID NO: 33), APCC promoter (SEQ ID NO: 34), EGFP (SEQ ID NO: 35), beta-tubulin terminator (SEQ ID NO: 36), URACm-GsA selection marker (SEQ ID NO: 44) and a DNA sequence (SEQ ID NO: 38) downstream of the CMD185 gene; fujiwara et al 2013, PLoS one.8 (9): e 73608; PMID: 24039997) as a template, primers [ #13APCC (-1) R/#14bT 3' (+1)]The DNA sequence of the portion other than EGFP was amplified by PCR.

(2) Melole 10D genomic DNA was used as a template, and the DNA sequence of Chl-TP (SEQ ID NO: 45) was amplified by PCR using a primer set [ #15SecA (1) Facpc/# 16SecA (390) R-linker-ha ].

(3) Plasmid DNA comprising 3xHA (SEQ ID NO: 46): 3xHA (SEQ ID NO: 46) was amplified by PCR using pBSb-THA (Ohnuma et al2008, Plant Cell physiol.49 (1): 117-20; PMID: 18003671) as a template and a primer set [ #17HA (1) F/#18HA (90) R ].

(4) The ORF of GP was chemically synthesized based on the codon usage frequency of C.merolae (SEQ ID NO: 40), and amplified by PCR using a primer set of #19GP (1) Fha/#20Col-GP (1680) Rbt as a template.

The DNA fragments of (1), (2), (3) and (4) above were mixed and fused together using In-Fusion (registered trademark) HD Cloning Kit (product No. 639648, TAKARA) In place of pD184-APCCp-EGFP-URACm-GsThe EGFP part of (1) is inserted with Chl-TP, 3x HA and rabies virus glycoprotein ORF. After the Infusion reaction, the plasmid was introduced into E.coli competent cells and amplified to obtain pD184-APCCp-Chl-TP-3HA-GP-bt-URACm-Gs. Next, primers [ #11D184(1200) F/#12D184(+1400) R ] were used as templates]A DNA fragment to which the second half of CMD184C gene (1200 bp-2773bp of ORF of the gene and 25bp downstream including a stop codon), APCC promoter, Chl-TP, 3xHA, GP, Col peptide, beta tubulin terminator, URACm-Gs selection marker and the downstream of CMD185C gene (base sequence from No. 28 to No. 1440 bp) were ligated was amplified by PCR.

This DNA fragment was introduced into C.merole uracil auxotrophic strain M4(Minoda et al 2004, Plant Cell physiol.45 (6): 667-71; PMID: 15215501) by the PEG method (Ohnuma et al2008, Plant Cell physiol.49 (1): 117-20; PMID: 18003671), and screened using MA2 solid medium containing no uracil to obtain Chl-TP-3HA-GP-Col expression strain.

(evaluation of proteasome-induced degradation of Chl-TP-3HA-GP-Col protein)

The cell concentration of the medium 60mL in MA2 medium in a conical flask was 0.2 OD750C.merolae ChlTP-sfGFP-HA-GP-Col expression strain (hereinafter, referred to as "Chl-TP-3 HA-GP-Col expression strain") prepared as described above and a wild strain (WT) as a negative control were subcultured under light irradiation (50. mu. mol m)-2s-1) The culture was performed at 40 ℃ for 2 days by rotation. Then, 20mL of each culture solution was transferred to two Erlenmeyer flasks. In order to inhibit proteasome-induced proteolysis, proteasome inhibitor MG-132(MG-132(+) (Nishida et al 2005; Mol Biol cell.16 (5): 2493-502; PMID: 15772156) was added to one of the two Erlenmeyer flasks at a final concentration of 100. mu.M. As a control, only 40. mu.L of DMSO (MG-132(-)) as a solvent for MG-132 was added to the other Erlenmeyer flask. The expression of the ChlTP-sfGFP-HA-GP-Col protein was confirmed by immunoblotting, and the effect of proteasome inhibition was confirmed by comparing the band patterns. An anti-HA antibody (clone 16B12, product No. 901503, Biolegend) was used in the detection of the ChlTP-sfGFP-HA-GP-Col protein.

The results of the immunoblotting are shown in fig. 4. No difference in band pattern of the ChlTP-sfGFP-HA-GP-Col protein was observed between MG-132(-) and MG-132 (+). According to the results, it was revealed that the ChlTP-sfGFP-HA-GP-Col protein was not decomposed by proteasome.

(analysis of intracellular localization of ChlTP-sfGFP-HA-GP-Co1 protein)

To analyze the intracellular localization of the ChlTP-sfGFP-HA-GP-Col protein, the expressed Chl-TP-3HA-GP-Col strain was fixed after 2 days of culture at 40 ℃ in the absence of MG-132 under light irradiation, and immunofluorescent staining was performed using an anti-HA antibody.

The results of immunofluorescent staining are shown in figure 5. According to the signal of anti-HA antibody, it was shown that the ChlTP-sfGFP-HA-GP-Co1 protein is localized in chloroplasts (between the thylakoid and the envelope in the central part). The image (PC) of fig. 5(a) is a phase-difference microscope image showing the contour of the cell, the image (Chl) of fig. 5(B) is an autofluorescence image of chloroplast, and the image (anti-HA) of fig. 5(C) is an immunofluorescence staining image using an anti-HA antibody. It could be confirmed that the ChlTP-sfGFP-HA-GP-Col protein detected by the anti-HA antibody is locally present in chloroplasts.

Table 1 shows the sequences of the primers used in example 1 and example 2.

[ Table 1]

[ example 3]

(mice were given GAPDH-GP-sfGFP-expressing strain)

To be 1.3 × 108GAPDH-GP-sfGFP expressing strains were suspended in 300mM glucose solution (isotonic solution) per mL (OD750 ═ 4) and 250 μ L of the suspension was delivered directly into the stomach of mice (ICR line) using a probe. Thereafter, after 0, 0.5, and 1.0 hour, the stomach, the upper small intestine, and the lower small intestine were removed, and the removed organs were suspended in 1mL of 300mM glucose solution. After the suspension was centrifuged, the supernatant was separated and subjected to an enzyme-linked immunosorbent Assay (ELISA Assay) for sfGFP to measure absorbance at 450 nm.

Table 2 shows the results of measurement of the relative concentration of sfGFP in each organ (absorbance at 450nm in an enzyme-linked immunosorbent assay). sfGFP was barely detected in the stomach and in the small intestine immediately after administration. The results show that the cells of the algal bodies immediately moved from the stomach to the intestine after administration, without rupture in the stomach, and with rupture in the intestine.

[ Table 2]

[ example 4]

(mice were fed alginic acid-containing immobilized feed containing sfGFP-expressing strain)

Cells (sfGFP-expressing strain) of c.merolae 10D (Sumiya et al2014, PLoS one.9 (10): e 111261; PMID: 25337786) after expressing and labeling sfGFP in cytoplasm were mixed with commercial feed (CLEA Rodent Diet CE-2, CLEA Japan, Inc.) and the mixture was solidified with alginic acid as shown below to prepare feed samples.

27mL of 300mM glucose solution (isotonic solution) (OD750 ═ 4) in which the sfGFP-expressing strain was suspended was centrifuged at 3000g for 10 minutes, and the precipitated cells were collected. Cells of the sfGFP-expressing strain and 1.12g of a commercial feed (CE-2) were suspended in 10mL of a 2.5% sucrose solution containing 1% sodium alginate. Subsequently, the suspension was dropped into a 10% calcium chloride solution to obtain a feed sample containing an alginate solidified body of sfGFP-expressing strain and a commercially available feed. The content of the sfGFP-expressing strain in the feed sample was 4.6mg wet weight/g (80-110 mg per pellet).

Mice (ICR line) were allowed to freely take the feed samples for 4 hours before being normally bred. The gastrointestinal tract, upper small intestine and lower small intestine of the mice were extirpated 4, 8, 24 and 48 hours after the start of ingestion. Each picked organ was suspended in a 300mM glucose solution, centrifuged at 1000g, and the supernatant was collected as a sample for measuring the extracellular concentration of sfGFP. After the supernatant was collected, the pellet was resuspended by adding Distilled Water (DW) in an amount equivalent to that of the collected supernatant, and after centrifugation at 1000g, the supernatant was collected as a sample for measuring the intracellular concentration of sfGFP. The amount of sfGFP in the extracellular concentration-measuring sample and the intracellular concentration-measuring sample was quantified using a commercially available ELISA kit (GFP ELISA kit; cat No. abe171581, abcam), and the amounts were determined as the extracellular concentration and the intracellular concentration, respectively.

Table 3 shows the results of measurement of the relative concentration of sfGFP in each organ (absorbance at 450nm in an enzyme-linked immunosorbent assay). For sfGFP, both extracellular as well as intracellular concentrations were detected at higher concentrations in the small intestine than in the stomach. In addition, in the lower small intestine, sfGFP concentration was higher than in the upper small intestine, and the ratio of intracellular concentration to extracellular concentration also increased. This result indicates that the cells of the algal cells were disrupted in the small intestine and that sfGFP was taken in by the small intestine cells.

[ Table 3]

[ example 5]

(administration test and serum Collection)

As a "control suspension administration group", it was 1.3X 108sfGFP-expressing strains were suspended in 300mM glucose solution (isotonic solution) per mL (OD750 ═ 4) and 300 μ L of the suspension was delivered directly into the stomach of mice (three ICR individuals) using a probe. The same amount of oral administration was performed every other week for 6 times, and serum was collected 2 weeks after the final administration.

As a "suspension administration group" to be 1.3X 108Chl-TP-3HA-GP-Col expression strain (ChlTP-sfGFP-HA-GP-Col protein expression strain of c. merole) was suspended in 300mM glucose solution per mL (OD750 ═ 4) and 300 μ L of the suspension was delivered directly into the stomach of mice (four individuals of the ICR line) using a probe. The same amount of oral administration was performed every other week for 6 times, and serum was collected 2 weeks after the final administration.

In the "alginic acid-immobilized feed administration group", 27mL of 300mM glucose solution (OD750 ═ 4) in which Chl-TP-3HA-GP-Col expression strain (ChlTP-sfGFP-HA-GP-Col protein expression strain of c. merole) was suspended was centrifuged at 3000g for 10 minutes to collect precipitated cells. Cells of the Chl-TP-3HA-GP-Col expressing strain and 1.12g of commercial feed (CE-2) were suspended in 10mL of a 2.5% sucrose solution containing 1% sodium alginate. Next, this suspension was dropped into a 10% calcium chloride solution to obtain a feed sample containing an alginic acid cured product of the expressed Chl-TP-3HA-GP-Col strain and a commercially available feed. The content of the Chl-TP-3HA-GP-Col strain is 4.6mg wet weight/g (80-110 mg per grain). Mice (four individuals of the ICR line) were allowed to freely take feed samples and then normally bred. Feeding of feed samples was performed 6 times every other week, and serum was collected 2 weeks after the final feeding.

(evaluation of anti-GP protein antibody production)

The production of anti-GP protein antibodies was confirmed by immunoblotting. First, in order to fuse a 6 × histidine tag sequence to the amino terminus of the GP protein of rabies virus, the ORF of the GP gene was cloned into a pQE80 vector (containing the 6 × histidine tag sequence, product No. 32923, QIAGEN) to prepare a plasmid. This plasmid was introduced into E.coli, and 6 × histidine tag fusion GP protein (protein size about 50kDa) was expressed. The protein was concentrated using a nickel column (product No. 17531901, GE healthcare). Subsequently, the 6 × histidine-tag-fused GP protein concentrate was separated by electrophoresis using SDS-PAGE. The proteins were transferred from the gel after electrophoresis to a polyvinylidene fluoride (PVDF) film (product number IPVH00010, Merck). The transferred membrane was immersed in a dilution of serum collected from each mouse, and incubated at room temperature for 1 hour. Serum dilutions were prepared by diluting the serum to one-five percent in Tris buffer (ph7.5, containing 0.1% Tween 20). The presence or absence of an anti-GP protein antibody contained in serum was determined as the presence or absence of an antibody reaction with a GP protein located at around 50 kDa.

(results)

The results of the immunoblotting are shown in fig. 6. In the serum dilutions of individual mice 1, 2 and 3 of the "control administration group (solution)" as the negative control, no band was detected at the position of about 50kDa, which is the molecular weight of rabies GP protein (fig. 6 (C)). In contrast, in individual mice 2, 3, and 4 of the "alginic acid solidified feed administration group" (fig. 6(a)) and individual mice 3 and 4 of the "suspension administration group" (fig. 6(B)), bands were detected at a position of about 50 kDa. From the results, it was shown that anti-GP protein antibodies were produced in mice given a suspension of c.merolae expressing rabies GP protein or alginic acid solidified feed.

(examination)

In mouse individual 2 in the "control suspension administration group", a band having a size smaller than the predicted size of the 6 × histidine-tag-fused GP protein was detected, and it is considered that the antibody possessed by this mouse individual causes a nonspecific reaction with the administration of the GP protein regardless of the e.coli-derived protein contained in the 6 × histidine-tag-fused GP protein concentrate.

According to a series of examples, it was confirmed that an antigenic protein appropriately introduced by the acid-resistant cells used in the present invention can be delivered to a site subsequent to the upper small intestine. It was also confirmed that even when the acid-resistant cells into which the antigenic protein of the present invention has been introduced are incorporated into feed in a form that can be used in general livestock industry and breeding industry, the antigenic protein can be delivered to a target site in the same manner. In addition, it was confirmed that antibodies were produced in the blood by driving the intestinal immune system by the antigenic protein thus delivered.

Sequence listing

<110> Japan scientific and technological development institution (Japan Science and Technology Agency)

<120> DRUG DELIVERY COMPOSITION (DRUG DELIVERY COMPOSITION)

<130> PC-29322

<150> JP2019-069029

<151> 2019-03-29

<160> 66

<170> PatentIn version 3.5

<210> 1

<211> 2140

<212> DNA

<213> Rabies virus (Rabies virus)

<400> 1

acattttgag cctcttggat gtgaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 60

aaaaaaaaaa aaaaaactat taacatccct caaaagactt aaggaaagat ggttcctcag 120

gttcttttgt ttgcacccct cctggttttt ccattgtgtt tcgggaagtt ccccatttac 180

acgataccag acaaacttgg tccctggagc cctattgact tacaccatct cagctgtcca 240

aataacctgg ttgtggagga cgaaggatgt accaacctgt ccgggttctc ttacatggaa 300

cttaaagtgg gatacatctc agccataaaa gtgaacgggt tcacttgcac aggtgttgtg 360

acagaggcag aaacctacac caactttgtt ggttatgtca caaccacatt caagagaaag 420

catttccgcc ccaccccaga cgcatgtaga gccgcgtata actggaagat ggccggtgac 480

cccagatatg aagagtctct acacaatccg taccccgact accattggct tcgaactgta 540

aaaaccacca aagagtctct cgttatcata tccccaagtg tgacagattt ggacccatat 600

gacaaatccc ttcactcaag ggtcttccct ggcggaaatt gctcaggaat aacggtgtcc 660

tcgacctact gctcaactaa tcatgattac accatctgga tgcctgagaa tctgagacta 720

gggacatctt gtgacatttt tacccatagc agagggaaga gagcatccaa aggagacaag 780

acttgcggct ttgtggatga aagaggcctg tataagtctt taaagggagc atgcaaactc 840

aagttatgtg gagttctcgg acttagactt atggatggaa catgggtcgc gatgcaaaca 900

tcagatgaga ccaaatggtg ccctccaggt cagttggtga atttgcacga ctttcgctca 960

gacgagattg agcatctcgt tgaggaagag ttagtcaaga aaagagagga gtgtctggat 1020

gcactagagt ccatcatgac caccaagtca gtgagtttca gacgtctcag tcacctgaga 1080

aaacttgtcc ctgggtttgg aaaagcatat accatattca acaaaacctt gatggaggct 1140

gatgctcact acaagtctgt ccagacctgg aatgagatca tcccctcaaa agggtgtttg 1200

agagttgggg agaggtgtca tccccatgtg aacggggtgt ttttcaatgg tataatatta 1260

gggtctgacg gccatgttct aatcccagag atgcagtcat ccctcctcca gcaacatatg 1320

gagttgttgg aatcttcagt tatccccctg atgcacccct tggcagaccc ttctacagtt 1380

ttcaaagacg gtgatgaggt tgaggatttt gttgaagttc acctccccga tgtgcataaa 1440

caggtctcag gagttgacct gggtctcccg aaatggggga agtatgtatt gatgattgca 1500

ggggccttga ttgccctgat gttgataatt ttcctgatga catgttgcag aagagtcaat 1560

cgaccagaat ctacacaaag caatcttgga gggacaggga gaaatgtgtc agtcccttcc 1620

caaagcggaa aagtcatatc ttcatgggag tcatataaga gtggaggcga gaccagactg 1680

tgaaggccgg tcatcctttt gacacctcaa gtccagagga taacctcctc tcggggttgg 1740

ggggaatctt gggatccagt agtcctcctt gaactccatc caacagggta gatttaagag 1800

tcatgagact ttcattaatc atatcagttg atcagacatg gtcgtgtaga ttctcataac 1860

acgggagatc ttctagcagt ttcagtgacc aacggtgctt tcattctcca ggaactgata 1920

ccaaaggttg tggacaagcc aaggggtgct tcggattact ctgtgcttgg gcacagaaag 1980

aggtcatagt ttgccccttg atagcggatt caacatgaat taactaagaa aggcgatctg 2040

cctcccatga aggacataag caatagttca caatcatctt gcatctcagt gaagtgtaca 2100

taactataaa gggctgggtc atctaagcat ttcagtcgag 2140

<210> 2

<211> 524

<212> PRT

<213> Rabies virus (Rabies virus)

<400> 2

Met Val Pro Gln Val Leu Leu Phe Ala Pro Leu Leu Val Phe Pro Leu

1 5 10 15

Cys Phe Gly Lys Phe Pro Ile Tyr Thr Ile Pro Asp Lys Leu Gly Pro

20 25 30

Trp Ser Pro Ile Asp Leu His His Leu Ser Cys Pro Asn Asn Leu Val

35 40 45

Val Glu Asp Glu Gly Cys Thr Asn Leu Ser Gly Phe Ser Tyr Met Glu

50 55 60

Leu Lys Val Gly Tyr Ile Ser Ala Ile Lys Val Asn Gly Phe Thr Cys

65 70 75 80

Thr Gly Val Val Thr Glu Ala Glu Thr Tyr Thr Asn Phe Val Gly Tyr

85 90 95

Val Thr Thr Thr Phe Lys Arg Lys His Phe Arg Pro Thr Pro Asp Ala

100 105 110

Cys Arg Ala Ala Tyr Asn Trp Lys Met Ala Gly Asp Pro Arg Tyr Glu

115 120 125

Glu Ser Leu His Asn Pro Tyr Pro Asp Tyr His Trp Leu Arg Thr Val

130 135 140

Lys Thr Thr Lys Glu Ser Leu Val Ile Ile Ser Pro Ser Val Thr Asp

145 150 155 160

Leu Asp Pro Tyr Asp Lys Ser Leu His Ser Arg Val Phe Pro Gly Gly

165 170 175

Asn Cys Ser Gly Ile Thr Val Ser Ser Thr Tyr Cys Ser Thr Asn His

180 185 190

Asp Tyr Thr Ile Trp Met Pro Glu Asn Leu Arg Leu Gly Thr Ser Cys

195 200 205

Asp Ile Phe Thr His Ser Arg Gly Lys Arg Ala Ser Lys Gly Asp Lys

210 215 220

Thr Cys Gly Phe Val Asp Glu Arg Gly Leu Tyr Lys Ser Leu Lys Gly

225 230 235 240

Ala Cys Lys Leu Lys Leu Cys Gly Val Leu Gly Leu Arg Leu Met Asp

245 250 255

Gly Thr Trp Val Ala Met Gln Thr Ser Asp Glu Thr Lys Trp Cys Pro

260 265 270

Pro Gly Gln Leu Val Asn Leu His Asp Phe Arg Ser Asp Glu Ile Glu

275 280 285

His Leu Val Glu Glu Glu Leu Val Lys Lys Arg Glu Glu Cys Leu Asp

290 295 300

Ala Leu Glu Ser Ile Met Thr Thr Lys Ser Val Ser Phe Arg Arg Leu

305 310 315 320

Ser His Leu Arg Lys Leu Val Pro Gly Phe Gly Lys Ala Tyr Thr Ile

325 330 335

Phe Asn Lys Thr Leu Met Glu Ala Asp Ala His Tyr Lys Ser Val Gln

340 345 350

Thr Trp Asn Glu Ile Ile Pro Ser Lys Gly Cys Leu Arg Val Gly Glu

355 360 365

Arg Cys His Pro His Val Asn Gly Val Phe Phe Asn Gly Ile Ile Leu

370 375 380

Gly Ser Asp Gly His Val Leu Ile Pro Glu Met Gln Ser Ser Leu Leu

385 390 395 400

Gln Gln His Met Glu Leu Leu Glu Ser Ser Val Ile Pro Leu Met His

405 410 415

Pro Leu Ala Asp Pro Ser Thr Val Phe Lys Asp Gly Asp Glu Val Glu

420 425 430

Asp Phe Val Glu Val His Leu Pro Asp Val His Lys Gln Val Ser Gly

435 440 445

Val Asp Leu Gly Leu Pro Lys Trp Gly Lys Tyr Val Leu Met Ile Ala

450 455 460

Gly Ala Leu Ile Ala Leu Met Leu Ile Ile Phe Leu Met Thr Cys Cys

465 470 475 480

Arg Arg Val Asn Arg Pro Glu Ser Thr Gln Ser Asn Leu Gly Gly Thr

485 490 495

Gly Arg Asn Val Ser Val Pro Ser Gln Ser Gly Lys Val Ile Ser Ser

500 505 510

Trp Glu Ser Tyr Lys Ser Gly Gly Glu Thr Arg Leu

515 520

<210> 3

<211> 3033

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 3

atgttccatg tgacgtaccc gttcacgcag agacaatgct ttctccgttc acgagaagcg 60

tgccttgcaa cgttgccagc tggtgctttt cgaaagcacc tgtggcgccc ttcgtgctgg 120

tcgttccgca cacgtcttcg taaagaggcg tcgctacgga aatccacagt tctcgctccg 180

cttactcgcc gtctgcagct gagtctcttc ggcctcccag agcggttcgt tcgcaagtcc 240

aagtcgccgg tctcggcaga gtccagtgtc gccactgagc tcacacgtga tcgggtcaaa 300

gatccgacgc tcgcgaagta ctgggataca cttctggaaa tcaatgcact ggaggcggaa 360

ctggaacaac tcaaaagcga tgaactcaga gctcgcttgg atgccctgcg gagaaacgac 420

tcggtgcgga gcggggaccc gccactggcc gaggtattcg ccatcgttcg agaggccgca 480

cgtcggacgc tcagcatgcg acccttcgat gtgcaggttc ttggtggcct tgcactcttt 540

cacggttgcg tagcggagat cgccaccggt gaggggaaaa cgctcatcgc aacgatgccg 600

gcatgtgcca gcgcgctagc ggctcgcggt accgtcctgg tcgtgacggt gaacgattac 660

ctctgccgtc gcgactttga aaacatgggt ccactgtatc gctccctggg tttctctgtc 720

gggtgtgtga ccagcgccac agagcgggcg gcacgtcaac gagcatacgc ttgcgatatc 780

acctatgtga cgaatgcgga gcttggattc gactatctac gcgaccatct ggtgctgagc 840

gctgctgatc aagtgcttgt gaggcccaag cccttctact tttgtctact ggatgaggcc 900

gactcaatca tgatagatga agcgcgtaca ccgctgatca tttcccaggc tgcagaggcg 960

cccacagaga aatacgctac tgccgctaaa ctggctgcaa acctgcagcg ggatcggcac 1020

tacacggtct atgaaaagga gcgcaatgtc actttgacag gcgccggtta cgaagcatgt 1080

gaggaggcac tgcaagtgcc aacgctcttc gccgcagcgg atccgtgggc gccctttgtg 1140

ctgaatgcac tcaaggcgaa ggagctctat caacgtgata tagattatgt cgttcggggt 1200

gatcaagtgc taatcgtgga tgagtttacc ggtcgagtac tgcaaggtag gcgctggtca 1260

gagggtctgc accaggccat cgaggccaag gaggggctcg ctgtccgcac tgaaccgcgg 1320

actgtagctt ccatctcgta tcagtccttc tttcgcctgt ttcctcgtct ggcaggcatg 1380

acgggcaccg ctgctaccga tgcagcagag atacgcgaaa cgtacggact cgaggtggtc 1440

gttgtgccca ccgcgctacc tgtcgttcgc cgagactacc ccgatgtggt gtttcgaacg 1500

agtcgcggca aacttcttgc tgtggtcgca gaaattcgac gcctgcacct tcgaaaagtg 1560

cccgtcctgg ttggaaccac cagtattgaa gctagtgagc gaatcagcgc ccttttgagc 1620

gaaggcgaac gcgttccgca cgaggttctg aatgcacgtc cggagaacgc tgaacgtgag 1680

agcgaaatca tcgcccaagc aggtcgtcta ggagcggtta cgatcgcaac aaacatggct 1740

ggacgaggaa ccgatatcgt gctgggtgga aacgtgtcca gtctagcacg cgctcttctc 1800

cagagggagc tgcttgccac gttcgctctg gggcctgaat gctcctctgg ggccggcgac 1860

cgcgcatcca ccgagcattt cctctcgtgt ctcgaagagg cggagcatca ccagctgcac 1920

tgctttggag aaaaaatcgc tgaggcactg cgttcccagc gttcgacgga tcccggtccg 1980

cggcttatca tcgaatccat ggagcagctg catcagctga tgctgcaagc tgccgagttt 2040

caagagccaa cgttgccctt ctctgccgag gcccaggagc tggtaagaga ggcacttcag 2100

tggctggaaa agcagctgcg tccgcggctc gatgcagagc gcgcagctgt gctggacctc 2160

ggtggcctgc acatcctcgg aactgaacga cacgagtcac gccgcatcga taaccagttg 2220

cggggtcgtg caggtcgcca aggtgacccc ggatgttcgc gtttcttcct gtcgctggag 2280

gaccccatct tcagggtgtt tgggggtgat cggatggctc gcctcgccga agcgtttcgt 2340

ctggacgaaa caacccccat cgagagcgtc caggttgcgc gtacgctgga caatgtccag 2400

cgcagcatcg agcaatatta tgcggggatt cggaagcagt tattcgctta tgatgaggtg 2460

ctttcccaac aacggaaggt actgtatcga cagcgaaacc gctttttgga agccgatgaa 2520

gcgctcctat tcggttcgga tgcggcagca gcccgggcct ctggggtgct tgctggcctc 2580

gcgggcgact ggatacgaac gacgattcag gatattctcc aagcaaatcg tcgagatccg 2640

gcaaaatgcg ttcagaaatt gaaagcgttc ttcccagggg cgcttctgaa cgagagcatg 2700

tgccagtccg cagcagctat cgaccaagtt gccgccgctg ttggcgtacg gctctcgcaa 2760

caccggcgga tgttgcaaca gagcgcaccg cagcaggatg ttgctgtctt tcgctatttg 2820

gcactggttc agcatgatca gctctggagt gaacatttac gaaaattagc gcttttgcgg 2880

gacatgtcgt ctttgcagac gttgcggcag gtagatccgt tgcaacagta tcagcaagat 2940

agttttcagc tctttgaaca gatgatggca caaataaggc gcaacactgt atactcgttt 3000

ttcaagtatt cgccggggcc aacggtatcc gcg 3033

<210> 4

<211> 1011

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 4

Met Phe His Val Thr Tyr Pro Phe Thr Gln Arg Gln Cys Phe Leu Arg

1 5 10 15

Ser Arg Glu Ala Cys Leu Ala Thr Leu Pro Ala Gly Ala Phe Arg Lys

20 25 30

His Leu Trp Arg Pro Ser Cys Trp Ser Phe Arg Thr Arg Leu Arg Lys

35 40 45

Glu Ala Ser Leu Arg Lys Ser Thr Val Leu Ala Pro Leu Thr Arg Arg

50 55 60

Leu Gln Leu Ser Leu Phe Gly Leu Pro Glu Arg Phe Val Arg Lys Ser

65 70 75 80

Lys Ser Pro Val Ser Ala Glu Ser Ser Val Ala Thr Glu Leu Thr Arg

85 90 95

Asp Arg Val Lys Asp Pro Thr Leu Ala Lys Tyr Trp Asp Thr Leu Leu

100 105 110

Glu Ile Asn Ala Leu Glu Ala Glu Leu Glu Gln Leu Lys Ser Asp Glu

115 120 125

Leu Arg Ala Arg Leu Asp Ala Leu Arg Arg Asn Asp Ser Val Arg Ser

130 135 140

Gly Asp Pro Pro Leu Ala Glu Val Phe Ala Ile Val Arg Glu Ala Ala

145 150 155 160

Arg Arg Thr Leu Ser Met Arg Pro Phe Asp Val Gln Val Leu Gly Gly

165 170 175

Leu Ala Leu Phe His Gly Cys Val Ala Glu Ile Ala Thr Gly Glu Gly

180 185 190

Lys Thr Leu Ile Ala Thr Met Pro Ala Cys Ala Ser Ala Leu Ala Ala

195 200 205

Arg Gly Thr Val Leu Val Val Thr Val Asn Asp Tyr Leu Cys Arg Arg

210 215 220

Asp Phe Glu Asn Met Gly Pro Leu Tyr Arg Ser Leu Gly Phe Ser Val

225 230 235 240

Gly Cys Val Thr Ser Ala Thr Glu Arg Ala Ala Arg Gln Arg Ala Tyr

245 250 255

Ala Cys Asp Ile Thr Tyr Val Thr Asn Ala Glu Leu Gly Phe Asp Tyr

260 265 270

Leu Arg Asp His Leu Val Leu Ser Ala Ala Asp Gln Val Leu Val Arg

275 280 285

Pro Lys Pro Phe Tyr Phe Cys Leu Leu Asp Glu Ala Asp Ser Ile Met

290 295 300

Ile Asp Glu Ala Arg Thr Pro Leu Ile Ile Ser Gln Ala Ala Glu Ala

305 310 315 320

Pro Thr Glu Lys Tyr Ala Thr Ala Ala Lys Leu Ala Ala Asn Leu Gln

325 330 335

Arg Asp Arg His Tyr Thr Val Tyr Glu Lys Glu Arg Asn Val Thr Leu

340 345 350

Thr Gly Ala Gly Tyr Glu Ala Cys Glu Glu Ala Leu Gln Val Pro Thr

355 360 365

Leu Phe Ala Ala Ala Asp Pro Trp Ala Pro Phe Val Leu Asn Ala Leu

370 375 380

Lys Ala Lys Glu Leu Tyr Gln Arg Asp Ile Asp Tyr Val Val Arg Gly

385 390 395 400

Asp Gln Val Leu Ile Val Asp Glu Phe Thr Gly Arg Val Leu Gln Gly

405 410 415

Arg Arg Trp Ser Glu Gly Leu His Gln Ala Ile Glu Ala Lys Glu Gly

420 425 430

Leu Ala Val Arg Thr Glu Pro Arg Thr Val Ala Ser Ile Ser Tyr Gln

435 440 445

Ser Phe Phe Arg Leu Phe Pro Arg Leu Ala Gly Met Thr Gly Thr Ala

450 455 460

Ala Thr Asp Ala Ala Glu Ile Arg Glu Thr Tyr Gly Leu Glu Val Val

465 470 475 480

Val Val Pro Thr Ala Leu Pro Val Val Arg Arg Asp Tyr Pro Asp Val

485 490 495

Val Phe Arg Thr Ser Arg Gly Lys Leu Leu Ala Val Val Ala Glu Ile

500 505 510

Arg Arg Leu His Leu Arg Lys Val Pro Val Leu Val Gly Thr Thr Ser

515 520 525

Ile Glu Ala Ser Glu Arg Ile Ser Ala Leu Leu Ser Glu Gly Glu Arg

530 535 540

Val Pro His Glu Val Leu Asn Ala Arg Pro Glu Asn Ala Glu Arg Glu

545 550 555 560

Ser Glu Ile Ile Ala Gln Ala Gly Arg Leu Gly Ala Val Thr Ile Ala

565 570 575

Thr Asn Met Ala Gly Arg Gly Thr Asp Ile Val Leu Gly Gly Asn Val

580 585 590

Ser Ser Leu Ala Arg Ala Leu Leu Gln Arg Glu Leu Leu Ala Thr Phe

595 600 605

Ala Leu Gly Pro Glu Cys Ser Ser Gly Ala Gly Asp Arg Ala Ser Thr

610 615 620

Glu His Phe Leu Ser Cys Leu Glu Glu Ala Glu His His Gln Leu His

625 630 635 640

Cys Phe Gly Glu Lys Ile Ala Glu Ala Leu Arg Ser Gln Arg Ser Thr

645 650 655

Asp Pro Gly Pro Arg Leu Ile Ile Glu Ser Met Glu Gln Leu His Gln

660 665 670

Leu Met Leu Gln Ala Ala Glu Phe Gln Glu Pro Thr Leu Pro Phe Ser

675 680 685

Ala Glu Ala Gln Glu Leu Val Arg Glu Ala Leu Gln Trp Leu Glu Lys

690 695 700

Gln Leu Arg Pro Arg Leu Asp Ala Glu Arg Ala Ala Val Leu Asp Leu

705 710 715 720

Gly Gly Leu His Ile Leu Gly Thr Glu Arg His Glu Ser Arg Arg Ile

725 730 735

Asp Asn Gln Leu Arg Gly Arg Ala Gly Arg Gln Gly Asp Pro Gly Cys

740 745 750

Ser Arg Phe Phe Leu Ser Leu Glu Asp Pro Ile Phe Arg Val Phe Gly

755 760 765

Gly Asp Arg Met Ala Arg Leu Ala Glu Ala Phe Arg Leu Asp Glu Thr

770 775 780

Thr Pro Ile Glu Ser Val Gln Val Ala Arg Thr Leu Asp Asn Val Gln

785 790 795 800

Arg Ser Ile Glu Gln Tyr Tyr Ala Gly Ile Arg Lys Gln Leu Phe Ala

805 810 815

Tyr Asp Glu Val Leu Ser Gln Gln Arg Lys Val Leu Tyr Arg Gln Arg

820 825 830

Asn Arg Phe Leu Glu Ala Asp Glu Ala Leu Leu Phe Gly Ser Asp Ala

835 840 845

Ala Ala Ala Arg Ala Ser Gly Val Leu Ala Gly Leu Ala Gly Asp Trp

850 855 860

Ile Arg Thr Thr Ile Gln Asp Ile Leu Gln Ala Asn Arg Arg Asp Pro

865 870 875 880

Ala Lys Cys Val Gln Lys Leu Lys Ala Phe Phe Pro Gly Ala Leu Leu

885 890 895

Asn Glu Ser Met Cys Gln Ser Ala Ala Ala Ile Asp Gln Val Ala Ala

900 905 910

Ala Val Gly Val Arg Leu Ser Gln His Arg Arg Met Leu Gln Gln Ser

915 920 925

Ala Pro Gln Gln Asp Val Ala Val Phe Arg Tyr Leu Ala Leu Val Gln

930 935 940

His Asp Gln Leu Trp Ser Glu His Leu Arg Lys Leu Ala Leu Leu Arg

945 950 955 960

Asp Met Ser Ser Leu Gln Thr Leu Arg Gln Val Asp Pro Leu Gln Gln

965 970 975

Tyr Gln Gln Asp Ser Phe Gln Leu Phe Glu Gln Met Met Ala Gln Ile

980 985 990

Arg Arg Asn Thr Val Tyr Ser Phe Phe Lys Tyr Ser Pro Gly Pro Thr

995 1000 1005

Val Ser Ala

1010

<210> 5

<211> 390

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 5

atgttccatg tgacgtaccc gttcacgcag agacaatgct ttctccgttc acgagaagcg 60

tgccttgcaa cgttgccagc tggtgctttt cgaaagcacc tgtggcgccc ttcgtgctgg 120

tcgttccgca cacgtcttcg taaagaggcg tcgctacgga aatccacagt tctcgctccg 180

cttactcgcc gtctgcagct gagtctcttc ggcctcccag agcggttcgt tcgcaagtcc 240

aagtcgccgg tctcggcaga gtccagtgtc gccactgagc tcacacgtga tcgggtcaaa 300

gatccgacgc tcgcgaagta ctgggataca cttctggaaa tcaatgcact ggaggcggaa 360

ctggaacaac tcaaaagcga tgaactcaga 390

<210> 6

<211> 130

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 6

Met Phe His Val Thr Tyr Pro Phe Thr Gln Arg Gln Cys Phe Leu Arg

1 5 10 15

Ser Arg Glu Ala Cys Leu Ala Thr Leu Pro Ala Gly Ala Phe Arg Lys

20 25 30

His Leu Trp Arg Pro Ser Cys Trp Ser Phe Arg Thr Arg Leu Arg Lys

35 40 45

Glu Ala Ser Leu Arg Lys Ser Thr Val Leu Ala Pro Leu Thr Arg Arg

50 55 60

Leu Gln Leu Ser Leu Phe Gly Leu Pro Glu Arg Phe Val Arg Lys Ser

65 70 75 80

Lys Ser Pro Val Ser Ala Glu Ser Ser Val Ala Thr Glu Leu Thr Arg

85 90 95

Asp Arg Val Lys Asp Pro Thr Leu Ala Lys Tyr Trp Asp Thr Leu Leu

100 105 110

Glu Ile Asn Ala Leu Glu Ala Glu Leu Glu Gln Leu Lys Ser Asp Glu

115 120 125

Leu Arg

130

<210> 7

<211> 1389

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 7

atgagcgtat cttgtggacg gcggatcttt tcagacattg tgcgtcaagt acgcacgttt 60

gctaccgtgg gcgtagacgc tcgctcgctt gcaggtacac gcaacgccac ggtctggcga 120

tcgtaccgca cgacgacgtt gcagtacccg cggctttggg aactgcgcgc ttcgcgcttc 180

ctctcaggtg aggcggtgga ggcgagtgca agcaaacgca agccccatct gaatgttggc 240

ggtatgggtc acgttgacca tgggaaaact acgctcgcgg cagcgattac gaaagtgctc 300

gccgagactg gtggagcccg gtacactgct tacgaagaga ttgacaaggc accggaggag 360

cgcgcccgcg ggatcacgat caacgcttcg catctcaaat acgagactcc atcacgttcg 420

tatgcacatg tcgattgccc tgggcatcga gactttgtga agaactttat cacgggtgcg 480

gcacaggtcg acaccgcgat tctcgtggtc agcggcccgg acggcccgca gccgcagact 540

caagagcacg tactgctatc gaagcaggta ggtgttccga actttgttgt atacctgaac 600

aaatgtgata tggtcgacga tccggaactt ttggacttgg tcgaattaga ggtgcgtgaa 660

ctactcagca agtacgaata cgatggcgac aacgtgccga tcgtgcgggg ctctgccctg 720

aaggcattgc agggcgatca gagcgagctt ggctgtgggt ccatccacaa gctcctggag 780

attctcgaca aggtcccaat acccaaaaga gaccttgaaa aaccgttcct gatgcccatt 840

gaagatagct ttagcattac tggccgagga acggtggtta cgggacgcgt ggagaccggt 900

atcctacgcc ctggtgatga aatcgaaatc gtcggacttc gtcctcccga ggtagcacca 960

atgcgcacga tcgtcaccgg tatcgagact ttcaagcagt cgcttcctta cgcagaggct 1020

ggcgagaatg ttggctgtct gcttcgcggg gtcaagcgcg aagacgtgtt gcgcggtcag 1080

gttctagcga aaccgggaac ctccagagcg caccgcaagt tcgaggctga cgtttacatt 1140

cttactcagg aggaaggcgg ccgccataca ccattcttca gcaactatcg tcctcaattc 1200

ttcgtgcgga ctgcagatgt cacaggacgc ttccttctgc cgccagaagt ggagatgtgc 1260

atgccaggag atcgcgtacg atgcgctgtt gagctcatct atccggtcgc gctgcaggaa 1320

ggcttacgtt tcgctgttcg tgagggcggc aggaccgtcg gtgctgggtt ggttacgaaa 1380

gtcatcgag 1389

<210> 8

<211> 463

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 8

Met Ser Val Ser Cys Gly Arg Arg Ile Phe Ser Asp Ile Val Arg Gln

1 5 10 15

Val Arg Thr Phe Ala Thr Val Gly Val Asp Ala Arg Ser Leu Ala Gly

20 25 30

Thr Arg Asn Ala Thr Val Trp Arg Ser Tyr Arg Thr Thr Thr Leu Gln

35 40 45

Tyr Pro Arg Leu Trp Glu Leu Arg Ala Ser Arg Phe Leu Ser Gly Glu

50 55 60

Ala Val Glu Ala Ser Ala Ser Lys Arg Lys Pro His Leu Asn Val Gly

65 70 75 80

Gly Met Gly His Val Asp His Gly Lys Thr Thr Leu Ala Ala Ala Ile

85 90 95

Thr Lys Val Leu Ala Glu Thr Gly Gly Ala Arg Tyr Thr Ala Tyr Glu

100 105 110

Glu Ile Asp Lys Ala Pro Glu Glu Arg Ala Arg Gly Ile Thr Ile Asn

115 120 125

Ala Ser His Leu Lys Tyr Glu Thr Pro Ser Arg Ser Tyr Ala His Val

130 135 140

Asp Cys Pro Gly His Arg Asp Phe Val Lys Asn Phe Ile Thr Gly Ala

145 150 155 160

Ala Gln Val Asp Thr Ala Ile Leu Val Val Ser Gly Pro Asp Gly Pro

165 170 175

Gln Pro Gln Thr Gln Glu His Val Leu Leu Ser Lys Gln Val Gly Val

180 185 190

Pro Asn Phe Val Val Tyr Leu Asn Lys Cys Asp Met Val Asp Asp Pro

195 200 205

Glu Leu Leu Asp Leu Val Glu Leu Glu Val Arg Glu Leu Leu Ser Lys

210 215 220

Tyr Glu Tyr Asp Gly Asp Asn Val Pro Ile Val Arg Gly Ser Ala Leu

225 230 235 240

Lys Ala Leu Gln Gly Asp Gln Ser Glu Leu Gly Cys Gly Ser Ile His

245 250 255

Lys Leu Leu Glu Ile Leu Asp Lys Val Pro Ile Pro Lys Arg Asp Leu

260 265 270

Glu Lys Pro Phe Leu Met Pro Ile Glu Asp Ser Phe Ser Ile Thr Gly

275 280 285

Arg Gly Thr Val Val Thr Gly Arg Val Glu Thr Gly Ile Leu Arg Pro

290 295 300

Gly Asp Glu Ile Glu Ile Val Gly Leu Arg Pro Pro Glu Val Ala Pro

305 310 315 320

Met Arg Thr Ile Val Thr Gly Ile Glu Thr Phe Lys Gln Ser Leu Pro

325 330 335

Tyr Ala Glu Ala Gly Glu Asn Val Gly Cys Leu Leu Arg Gly Val Lys

340 345 350

Arg Glu Asp Val Leu Arg Gly Gln Val Leu Ala Lys Pro Gly Thr Ser

355 360 365

Arg Ala His Arg Lys Phe Glu Ala Asp Val Tyr Ile Leu Thr Gln Glu

370 375 380

Glu Gly Gly Arg His Thr Pro Phe Phe Ser Asn Tyr Arg Pro Gln Phe

385 390 395 400

Phe Val Arg Thr Ala Asp Val Thr Gly Arg Phe Leu Leu Pro Pro Glu

405 410 415

Val Glu Met Cys Met Pro Gly Asp Arg Val Arg Cys Ala Val Glu Leu

420 425 430

Ile Tyr Pro Val Ala Leu Gln Glu Gly Leu Arg Phe Ala Val Arg Glu

435 440 445

Gly Gly Arg Thr Val Gly Ala Gly Leu Val Thr Lys Val Ile Glu

450 455 460

<210> 9

<211> 234

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 9

atgagcgtat cttgtggacg gcggatcttt tcagacattg tgcgtcaagt acgcacgttt 60

gctaccgtgg gcgtagacgc tcgctcgctt gcaggtacac gcaacgccac ggtctggcga 120

tcgtaccgca cgacgacgtt gcagtacccg cggctttggg aactgcgcgc ttcgcgcttc 180

ctctcaggtg aggcggtgga ggcgagtgca agcaaacgca agccccatct gaat 234

<210> 10

<211> 78

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 10

Met Ser Val Ser Cys Gly Arg Arg Ile Phe Ser Asp Ile Val Arg Gln

1 5 10 15

Val Arg Thr Phe Ala Thr Val Gly Val Asp Ala Arg Ser Leu Ala Gly

20 25 30

Thr Arg Asn Ala Thr Val Trp Arg Ser Tyr Arg Thr Thr Thr Leu Gln

35 40 45

Tyr Pro Arg Leu Trp Glu Leu Arg Ala Ser Arg Phe Leu Ser Gly Glu

50 55 60

Ala Val Glu Ala Ser Ala Ser Lys Arg Lys Pro His Leu Asn

65 70 75

<210> 11

<211> 690

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 11

atgggcgttc cgcaaatcac aagcaacacc gtggcacccg tcgaacgcaa ggtggagcgc 60

ggaagctacg gtgcaactgc gccagcagcg ccggcagctg cgttcgcctc caccgagtcc 120

gagctgatca gggatagtga cccttcgctc tcaacgatgt ggttgccggt ggatctgagt 180

gtccaactca ctccgaagga gcgtctgcag ctggcgtggt cggcggcacg gccgtggcga 240

gagtgggctg cactgcacgc actagcgccg ccgccaccct cgtcgtggct ggactggagc 300

gctcgtgtgc gcacgaattt ggagctctac gcatggaact acctctttgt ggcccttgtg 360

atgttcattg tgacgggttt gttctatccg tggagtgcgc tgctcctaat atcctggctc 420

ctgctcgcgc tgtacatggg tgtacgcacg gctgatgccg tcgggggcga ggacgcttcg 480

ctgggcgcgc gcatgttcca gaactggcct ggctatatcc gatacatttt gctgggtggc 540

ttgctcacgc tcatgttgtt tctgacagac gttgtcgcct tggcgctcac cagcgcatcg 600

ttggccgcag ctgtgacgct cgcgcacgca gcgctccacg atcccgttgc cgtgctctcc 660

gcgcggaatg ccgaatatac cgtggcaccg 690

<210> 12

<211> 230

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 12

Met Gly Val Pro Gln Ile Thr Ser Asn Thr Val Ala Pro Val Glu Arg

1 5 10 15

Lys Val Glu Arg Gly Ser Tyr Gly Ala Thr Ala Pro Ala Ala Pro Ala

20 25 30

Ala Ala Phe Ala Ser Thr Glu Ser Glu Leu Ile Arg Asp Ser Asp Pro

35 40 45

Ser Leu Ser Thr Met Trp Leu Pro Val Asp Leu Ser Val Gln Leu Thr

50 55 60

Pro Lys Glu Arg Leu Gln Leu Ala Trp Ser Ala Ala Arg Pro Trp Arg

65 70 75 80

Glu Trp Ala Ala Leu His Ala Leu Ala Pro Pro Pro Pro Ser Ser Trp

85 90 95

Leu Asp Trp Ser Ala Arg Val Arg Thr Asn Leu Glu Leu Tyr Ala Trp

100 105 110

Asn Tyr Leu Phe Val Ala Leu Val Met Phe Ile Val Thr Gly Leu Phe

115 120 125

Tyr Pro Trp Ser Ala Leu Leu Leu Ile Ser Trp Leu Leu Leu Ala Leu

130 135 140

Tyr Met Gly Val Arg Thr Ala Asp Ala Val Gly Gly Glu Asp Ala Ser

145 150 155 160

Leu Gly Ala Arg Met Phe Gln Asn Trp Pro Gly Tyr Ile Arg Tyr Ile

165 170 175

Leu Leu Gly Gly Leu Leu Thr Leu Met Leu Phe Leu Thr Asp Val Val

180 185 190

Ala Leu Ala Leu Thr Ser Ala Ser Leu Ala Ala Ala Val Thr Leu Ala

195 200 205

His Ala Ala Leu His Asp Pro Val Ala Val Leu Ser Ala Arg Asn Ala

210 215 220

Glu Tyr Thr Val Ala Pro

225 230

<210> 13

<211> 1287

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 13

atggtgatca gcacccaggg aaacgtctca ggttccgttc cgaatacgct cagagcgaaa 60

cctgagcaaa cgcttcaggc gtcgaggcct gccaccaggg ccgtctctgc agcgcttgac 120

ttctgcgctg gagttgcagc agccgtttca ggtatatctg ttcccggggg atattttccc 180

cctttcatgt cattcagcaa ccgcttaagt ccaaaagctg gcacagatac tctttcggtg 240

agcaatcagg agtcaaccag ggaagacgcg gacgctgtgc tttcgacggc agccgcttcg 300

ctggcaacgg acctcgagta cggctccgcg acagagaggc accacgagaa catcgatcct 360

attcgcccag tgacgcgaga ctctggcgca ttcgtcgaga actatccaag tggaacacgg 420

gaggtggcaa tcgctttcga aggagtgacg ttgtctgcga gcacccaggc ggcggggacg 480

actcgaccca tacttcggaa tatctgtttc gaagtacgcg acggagagac ggtgtttatc 540

atcggcccgt caggagctgg caagtctcga cttcttcgac tggtgaaccg cttagaggag 600

ccttcaggtg gtcaagtccg gctgtggggc acaccagtgc cagcgtaccc tccaggcgag 660

ttacggggca aactagtagg ttttctgagc cagcaacctg cattgccgtg tttggcacat 720

aggaggcctg gtttgctgga ggcactccgc cgacttgtaa cgcgagaaac gctttgggag 780

ctggtcctcg gcaaacgccg acaacgctcg agcgcgaaca cggaggaacc agcaaagtcg 840

gcgctcgaga cgctcgttcg tttgggagtt gtatcgcgga cagagcttga acagcgacta 900

ccggaagcac tgcatatagc tggcttatcg cgaacgattc tggatcgacc gctagccgcc 960

ttgtcagggg gtgagcgagc tcgcctcggc ctcacgcgag tgcttttgca gcaaccacga 1020

atcctactgc tggacgaggt cacttcaagt ctggacgcag caacagcggc tcaagtattg 1080

caacgtttat cggactggaa agaacgcatc cgggccacaa tactcatagt gacgcacaga 1140

ctctcggagg ttaccgacgg ccagctcatt cttgtgcggg acggggagct gttcttacgc 1200

ggtgacgcgc gtgcgctgct cggtaccgcc gatacagcgg ccagcattca tcggcttttg 1260

actggggagt cattgggagc aacaaaa 1287

<210> 14

<211> 429

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 14

Met Val Ile Ser Thr Gln Gly Asn Val Ser Gly Ser Val Pro Asn Thr

1 5 10 15

Leu Arg Ala Lys Pro Glu Gln Thr Leu Gln Ala Ser Arg Pro Ala Thr

20 25 30

Arg Ala Val Ser Ala Ala Leu Asp Phe Cys Ala Gly Val Ala Ala Ala

35 40 45

Val Ser Gly Ile Ser Val Pro Gly Gly Tyr Phe Pro Pro Phe Met Ser

50 55 60

Phe Ser Asn Arg Leu Ser Pro Lys Ala Gly Thr Asp Thr Leu Ser Val

65 70 75 80

Ser Asn Gln Glu Ser Thr Arg Glu Asp Ala Asp Ala Val Leu Ser Thr

85 90 95

Ala Ala Ala Ser Leu Ala Thr Asp Leu Glu Tyr Gly Ser Ala Thr Glu

100 105 110

Arg His His Glu Asn Ile Asp Pro Ile Arg Pro Val Thr Arg Asp Ser

115 120 125

Gly Ala Phe Val Glu Asn Tyr Pro Ser Gly Thr Arg Glu Val Ala Ile

130 135 140

Ala Phe Glu Gly Val Thr Leu Ser Ala Ser Thr Gln Ala Ala Gly Thr

145 150 155 160

Thr Arg Pro Ile Leu Arg Asn Ile Cys Phe Glu Val Arg Asp Gly Glu

165 170 175

Thr Val Phe Ile Ile Gly Pro Ser Gly Ala Gly Lys Ser Arg Leu Leu

180 185 190

Arg Leu Val Asn Arg Leu Glu Glu Pro Ser Gly Gly Gln Val Arg Leu

195 200 205

Trp Gly Thr Pro Val Pro Ala Tyr Pro Pro Gly Glu Leu Arg Gly Lys

210 215 220

Leu Val Gly Phe Leu Ser Gln Gln Pro Ala Leu Pro Cys Leu Ala His

225 230 235 240

Arg Arg Pro Gly Leu Leu Glu Ala Leu Arg Arg Leu Val Thr Arg Glu

245 250 255

Thr Leu Trp Glu Leu Val Leu Gly Lys Arg Arg Gln Arg Ser Ser Ala

260 265 270

Asn Thr Glu Glu Pro Ala Lys Ser Ala Leu Glu Thr Leu Val Arg Leu

275 280 285

Gly Val Val Ser Arg Thr Glu Leu Glu Gln Arg Leu Pro Glu Ala Leu

290 295 300

His Ile Ala Gly Leu Ser Arg Thr Ile Leu Asp Arg Pro Leu Ala Ala

305 310 315 320

Leu Ser Gly Gly Glu Arg Ala Arg Leu Gly Leu Thr Arg Val Leu Leu

325 330 335

Gln Gln Pro Arg Ile Leu Leu Leu Asp Glu Val Thr Ser Ser Leu Asp

340 345 350

Ala Ala Thr Ala Ala Gln Val Leu Gln Arg Leu Ser Asp Trp Lys Glu

355 360 365

Arg Ile Arg Ala Thr Ile Leu Ile Val Thr His Arg Leu Ser Glu Val

370 375 380

Thr Asp Gly Gln Leu Ile Leu Val Arg Asp Gly Glu Leu Phe Leu Arg

385 390 395 400

Gly Asp Ala Arg Ala Leu Leu Gly Thr Ala Asp Thr Ala Ala Ser Ile

405 410 415

His Arg Leu Leu Thr Gly Glu Ser Leu Gly Ala Thr Lys

420 425

<210> 15

<211> 1362

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 15

atgtcgtcca tctgggagca aatgagatct ctgctgttcg gagcgagtgg atcgtatgct 60

gatgcggccg acacacccca agggcttgct ttcgaacagc gacttgttgc gaggcacgga 120

aacacgctgc cgccttggcc gctcttcaag ctgggtgctg tatacggcgt tggaccagtg 180

cctcccaacc tgcgggccct tctagacggt tctttctttc tgataggcgc gtgtttgggg 240

actgcggcgc tgtttgactt tgccggtatc atcggtagac gcggcggcgg cgttactgca 300

gcggaagttg cgaaagagtg caactgtgcc gatgtcaatg cggtcggtcg cgttctacgg 360

gcttgcgaaa actggggcta ctttgagtcc tacgtcccgc gggactgtga ctcaaagagc 420

gtatcgaacg agcagcgctt atggcggaac acactattat cggctttatt gcgcgaggat 480

catccacaca gcgtgcgcgc tcaaataatg cacttgtacg tcgatatttt tcccgcgagt 540

gcgcttctct ttgagacgat acgtgatacg ccgtcaaacg acgtagaaac ggacggtttg 600

cgagcgcgtg ccgatgcgtc agcgacaaaa caaccgagcg cgtttgaacg cgtccatcag 660

tgtacgtttt gggaatacct gtcgagccat ccggatcgat gggatgtctt caacagggcc 720

atgcgtagct cagatgcgct cctgggatcg acgatcatga aagatatcga ctggggacgc 780

tacctccggg ttatagacct cggcgcagcg gatggttcac tcgtttatca tctgctgagc 840

gctttcgcgc tgaaggcagt gattttcgat ctgccgccgg ttatcgagca tgccaaggcg 900

tactggaaca cgagtccgga gcgctcggca atggtcgcaa gtggccgcgt ccagttcgcg 960

gccggcgacc ttttcgacgc caccaccgtg ccagcagcgg aggaaggcga catctatgtg 1020

atgcgcaata tctggcacga ctggcgcgat cccgactgta tccgcattgg ccggagtgtt 1080

cgagcagcaa tcggcgacgt tcgcaatgtg aagctcgtca tcctggaggc aagcattgac 1140

cagtatccgc gcggctcact ccttgagcgc tttcgcgttg cactcgatca aatcatgttt 1200

accgcatttc gatccaagga acgcgacagg atagagtttg atgcgctctt acgtcgatgt 1260

ggttttcagc tgaccgaagt gaggcattta cgcgcatcac tcgtcgcagt gatcgccgag 1320

ccgctttcgc attgggtcca gtctcctgaa ccagcggacc aa 1362

<210> 16

<211> 454

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 16

Met Ser Ser Ile Trp Glu Gln Met Arg Ser Leu Leu Phe Gly Ala Ser

1 5 10 15

Gly Ser Tyr Ala Asp Ala Ala Asp Thr Pro Gln Gly Leu Ala Phe Glu

20 25 30

Gln Arg Leu Val Ala Arg His Gly Asn Thr Leu Pro Pro Trp Pro Leu

35 40 45

Phe Lys Leu Gly Ala Val Tyr Gly Val Gly Pro Val Pro Pro Asn Leu

50 55 60

Arg Ala Leu Leu Asp Gly Ser Phe Phe Leu Ile Gly Ala Cys Leu Gly

65 70 75 80

Thr Ala Ala Leu Phe Asp Phe Ala Gly Ile Ile Gly Arg Arg Gly Gly

85 90 95

Gly Val Thr Ala Ala Glu Val Ala Lys Glu Cys Asn Cys Ala Asp Val

100 105 110

Asn Ala Val Gly Arg Val Leu Arg Ala Cys Glu Asn Trp Gly Tyr Phe

115 120 125

Glu Ser Tyr Val Pro Arg Asp Cys Asp Ser Lys Ser Val Ser Asn Glu

130 135 140

Gln Arg Leu Trp Arg Asn Thr Leu Leu Ser Ala Leu Leu Arg Glu Asp

145 150 155 160

His Pro His Ser Val Arg Ala Gln Ile Met His Leu Tyr Val Asp Ile

165 170 175

Phe Pro Ala Ser Ala Leu Leu Phe Glu Thr Ile Arg Asp Thr Pro Ser

180 185 190

Asn Asp Val Glu Thr Asp Gly Leu Arg Ala Arg Ala Asp Ala Ser Ala

195 200 205

Thr Lys Gln Pro Ser Ala Phe Glu Arg Val His Gln Cys Thr Phe Trp

210 215 220

Glu Tyr Leu Ser Ser His Pro Asp Arg Trp Asp Val Phe Asn Arg Ala

225 230 235 240

Met Arg Ser Ser Asp Ala Leu Leu Gly Ser Thr Ile Met Lys Asp Ile

245 250 255

Asp Trp Gly Arg Tyr Leu Arg Val Ile Asp Leu Gly Ala Ala Asp Gly

260 265 270

Ser Leu Val Tyr His Leu Leu Ser Ala Phe Ala Leu Lys Ala Val Ile

275 280 285

Phe Asp Leu Pro Pro Val Ile Glu His Ala Lys Ala Tyr Trp Asn Thr

290 295 300

Ser Pro Glu Arg Ser Ala Met Val Ala Ser Gly Arg Val Gln Phe Ala

305 310 315 320

Ala Gly Asp Leu Phe Asp Ala Thr Thr Val Pro Ala Ala Glu Glu Gly

325 330 335

Asp Ile Tyr Val Met Arg Asn Ile Trp His Asp Trp Arg Asp Pro Asp

340 345 350

Cys Ile Arg Ile Gly Arg Ser Val Arg Ala Ala Ile Gly Asp Val Arg

355 360 365

Asn Val Lys Leu Val Ile Leu Glu Ala Ser Ile Asp Gln Tyr Pro Arg

370 375 380

Gly Ser Leu Leu Glu Arg Phe Arg Val Ala Leu Asp Gln Ile Met Phe

385 390 395 400

Thr Ala Phe Arg Ser Lys Glu Arg Asp Arg Ile Glu Phe Asp Ala Leu

405 410 415

Leu Arg Arg Cys Gly Phe Gln Leu Thr Glu Val Arg His Leu Arg Ala

420 425 430

Ser Leu Val Ala Val Ile Ala Glu Pro Leu Ser His Trp Val Gln Ser

435 440 445

Pro Glu Pro Ala Asp Gln

450

<210> 17

<211> 1488

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 17

atggacccaa ccaagtatac accacgcgat acttttgacg tgaacgtgct taccacgaac 60

gctgggcagc cggtgacgaa caaccagtct tcacgcactg tggggccgcg tggaccggtg 120

ctgcttgagg actaccacct cttggagaag ctggctaact ttgaccgtga acggcaaccg 180

gagcgtgttg tacacgcgcg cggtgtgact gccaagggtt actttgaggt gacacacgat 240

atcacagact taacctgcgc ggacatgttc gcggaggttg gtcgccgtac gccggtggct 300

gtccggtttt cgacggtcat tcactcgcgt cattcaccgg aaaccttgcg tgatccgcgc 360

ggttttgctg tcaagttcta cactcgtgaa ggaatctggg acctcgtcgg aaataatctg 420

ccggtcttct ttatcaggga tgcgatcaag ttcccggact taattcacgc gttcaaacca 480

aacccgcgga cagaggcgca ggaatcctgg aggattctcg actttttaag caaccaacac 540

gaaagcctga atatgctcac gttccttttc gacgacgagg gtatcccgaa ggactatcgg 600

catatgcgcg gcagcggagt gcactcgttc cgccttgtaa ccagggacgg acgctcgacg 660

tacgtgcgct tccactggcg ccccaagtgc ggtatggaaa acttgttgga cgaagaggct 720

gccgttgtgt gtggtcagga tttttctcac gcaacgcatg acttgatccg ggcaattgac 780

cgaggcgact atcccgaatg ggcgctctat atccaaacca tggacccggc aatggttgag 840

aaccacgtgt tcccgtgggg cgatccactc gacgcgacga gagagtggcc tgagaaggac 900

tttccgcttc gccctgtggg tcgcatggtg ctgaatcaga actgcgataa ccagttcttg 960

gagaatgagc aaattgcctt ctcgccggca ctcgttgtcc caggtatcta ctattcagac 1020

gataagctgt tgcagggtcg ccttttcagt tacgccgaca cgcagcgata ccgtattggt 1080

gccaattatc tgcagctgcc gatcaacgca ccaaagaatc cgttccataa taaccattat 1140

gatggccagc agaactggat gctgcgtcag ggcgaagtga actactaccc cagccgcgta 1200

gacccggtgc cggaggcgcc cccggccgca ttcccaacgc ctcgcgatga actacgtggt 1260

cagcgggtga agcagctggt tccaaatcag tgcgactttg tgcagcctgg agaacgttac 1320

cgctctttcg atccggcgcg caaggagcgg ttcgtgaacc ggatcgcgaa actgctaaca 1380

gatgagcgcg tgaccccaga gctgcgagcc atctggctgg agctctggag caaatgcgat 1440

gctgaactgg gcgcagcgct ggcgaccaga gtgaagcagt gtacaatg 1488

<210> 18

<211> 496

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 18

Met Asp Pro Thr Lys Tyr Thr Pro Arg Asp Thr Phe Asp Val Asn Val

1 5 10 15

Leu Thr Thr Asn Ala Gly Gln Pro Val Thr Asn Asn Gln Ser Ser Arg

20 25 30

Thr Val Gly Pro Arg Gly Pro Val Leu Leu Glu Asp Tyr His Leu Leu

35 40 45

Glu Lys Leu Ala Asn Phe Asp Arg Glu Arg Gln Pro Glu Arg Val Val

50 55 60

His Ala Arg Gly Val Thr Ala Lys Gly Tyr Phe Glu Val Thr His Asp

65 70 75 80

Ile Thr Asp Leu Thr Cys Ala Asp Met Phe Ala Glu Val Gly Arg Arg

85 90 95

Thr Pro Val Ala Val Arg Phe Ser Thr Val Ile His Ser Arg His Ser

100 105 110

Pro Glu Thr Leu Arg Asp Pro Arg Gly Phe Ala Val Lys Phe Tyr Thr

115 120 125

Arg Glu Gly Ile Trp Asp Leu Val Gly Asn Asn Leu Pro Val Phe Phe

130 135 140

Ile Arg Asp Ala Ile Lys Phe Pro Asp Leu Ile His Ala Phe Lys Pro

145 150 155 160

Asn Pro Arg Thr Glu Ala Gln Glu Ser Trp Arg Ile Leu Asp Phe Leu

165 170 175

Ser Asn Gln His Glu Ser Leu Asn Met Leu Thr Phe Leu Phe Asp Asp

180 185 190

Glu Gly Ile Pro Lys Asp Tyr Arg His Met Arg Gly Ser Gly Val His

195 200 205

Ser Phe Arg Leu Val Thr Arg Asp Gly Arg Ser Thr Tyr Val Arg Phe

210 215 220

His Trp Arg Pro Lys Cys Gly Met Glu Asn Leu Leu Asp Glu Glu Ala

225 230 235 240

Ala Val Val Cys Gly Gln Asp Phe Ser His Ala Thr His Asp Leu Ile

245 250 255

Arg Ala Ile Asp Arg Gly Asp Tyr Pro Glu Trp Ala Leu Tyr Ile Gln

260 265 270

Thr Met Asp Pro Ala Met Val Glu Asn His Val Phe Pro Trp Gly Asp

275 280 285

Pro Leu Asp Ala Thr Arg Glu Trp Pro Glu Lys Asp Phe Pro Leu Arg

290 295 300

Pro Val Gly Arg Met Val Leu Asn Gln Asn Cys Asp Asn Gln Phe Leu

305 310 315 320

Glu Asn Glu Gln Ile Ala Phe Ser Pro Ala Leu Val Val Pro Gly Ile

325 330 335

Tyr Tyr Ser Asp Asp Lys Leu Leu Gln Gly Arg Leu Phe Ser Tyr Ala

340 345 350

Asp Thr Gln Arg Tyr Arg Ile Gly Ala Asn Tyr Leu Gln Leu Pro Ile

355 360 365

Asn Ala Pro Lys Asn Pro Phe His Asn Asn His Tyr Asp Gly Gln Gln

370 375 380

Asn Trp Met Leu Arg Gln Gly Glu Val Asn Tyr Tyr Pro Ser Arg Val

385 390 395 400

Asp Pro Val Pro Glu Ala Pro Pro Ala Ala Phe Pro Thr Pro Arg Asp

405 410 415

Glu Leu Arg Gly Gln Arg Val Lys Gln Leu Val Pro Asn Gln Cys Asp

420 425 430

Phe Val Gln Pro Gly Glu Arg Tyr Arg Ser Phe Asp Pro Ala Arg Lys

435 440 445

Glu Arg Phe Val Asn Arg Ile Ala Lys Leu Leu Thr Asp Glu Arg Val

450 455 460

Thr Pro Glu Leu Arg Ala Ile Trp Leu Glu Leu Trp Ser Lys Cys Asp

465 470 475 480

Ala Glu Leu Gly Ala Ala Leu Ala Thr Arg Val Lys Gln Cys Thr Met

485 490 495

<210> 19

<211> 8157

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 19

atgggggatt tgggtgaaca tgcaccgtcg aatggctccg caaaggatag cgttcggcgc 60

tatgttgagg aacacggcgg tagtagacca attcagcgct tattggtcgc aaacaacgga 120

atcgccgccg taaagtgcat ccgatcaatg cgaagatggg catacgaagc gtttggtagt 180

gaacgagcgc tggagtttgt agcaatggcc actcccgagg atgttcaagc caacgccgac 240

tacattcgcc tcgctgacct cttcgttccc gtacctgggg gaagcaacaa ccacaactac 300

gccaacgtgg agctcatcgt cgacatcgcg gaaaggaacg agtgcgatgc agtatgggcg 360

ggttggggtc acgccagcga gaacccgctg ctgcccgctc gccttgcgga gactggtatc 420

gcattcctgg gaccggatgc cattgccatg agggcgctcg gcgacaagat ttcgtcaacg 480

ttgttagctc agtctgcgga tgttccagtc gtcagctgga acggtgatga cttgaaggta 540

acttttcacc gcgagcgcgg cggtattgat gaagaacgct atcgccgtgc ctgcgttgcg 600

aatgtcgacg aggctcgcgc tgctgccgat cgtattggct acccagtgat gattaaagcc 660

agcgaaggag gcggtggcaa aggcatccgt ctttgccgtc gaccagagga tgttcgagat 720

gcctttcgtc aggtcgccgg agagattcct ggctctccga tcttcatcat gaaaatggtg 780

gatcaagcgc gacacctgga agtgcagatt gttgccgatg aatacgggca cgcgatagcg 840

ctatatgggc gtgactgctc cgtgcagaga cggcatcaaa agatcatcga ggaagggcca 900

gttacggcaa cgccaccgca ggtttggaaa gagctcgaac atggagcggt gcgtcttgcc 960

aaaatggtcg gttacgtagg tgccggcact gttgagtatc tctacgacgg tcgacgtttc 1020

tatttcctgg aactgaatcc gcgcttgcag gtggagcacc cagtaaccga gtggatcact 1080

ggcgtgaacc tgcctgcagt ccagctgcag attgccatgg ggattccgct ccagcgtatt 1140

gcagctatcc aagcacttta cggcaataga ccggacctgg atctcgagcg acaccagccg 1200

aacccgccgc atggccacgt aatcgcctgc cgcgttaccg cagaaaatcc cgaggaagga 1260

ttccagccga caagcggctc cattcaggag ctttcgtttc gaaatacgcc caatgtgtgg 1320

ggctatttca gcgtcggcgc ctggggtggt gtccacgagt acgctgactc gcaattcggg 1380

cacctttttg cctggggtgc ggatcgcgag atggcccgga ggaatatggt gctggcactg 1440

aaagagctca gcatccgcgg agatatccgc acaacggtgg agtatcttat tacgttgctg 1500

gagctcgaat cgtatcgcga aaatcgcatc catacccgtt ggctggacaa cctcattgcc 1560

tccaaagtga aaccagagcg accgccgttc catatcgctg ttgtgcttgg tgcagtccat 1620

caggcgtaca gagcctggtc tgagcgccgg cgatcttttg ttgagtcgct gagtcgcggt 1680

caagtgccgc agcgtattga cgctaccttc atagaatttc agttcgagct gatctacgat 1740

gagctcaagt acgcgctcat cgtcaggcaa gcgggtccga atgccttcca cgtctcccta 1800

gcacagcatc ccgaggaacg ggtgcgtgtt gatttccgac ctctgctgga cggtggtctc 1860

ttgatcatgt gcgatggtcg ctccttcaat actcactcca aggaggattc cactggcatg 1920

cgtctgagca tcgacgggcg aacctgtgtg ttcccaggcg aaatggaccc tacgcgcatt 1980

gcagcacagg cctcggggcg tctcgtccga tacctggttg ccaacgagga acacgtcgat 2040

cagggacaag tcattgcgga gattgaagtc atgaaaatgt atctgagcgt gcaggctccc 2100

gaaccgggca ccattcactt gctgaaacca gcgggagcgg ccctggagcc aggcgaagtt 2160

tttgcgcagc tcgatctcga cgatccgagc aaagttcgac gggtgacgcc atttactggg 2220

cgctttccga caatgctgcc tccacagcga ctgggaaaga aaccgcacca gcgctttgag 2280

agcgcgaaac agcagatcga agcgctcctg gacggctacg atgttgatat ggagccgctc 2340

ggcatgcttc tgcagctggc gagcaccgaa ccggcagtac cggccgggaa actgcaggag 2400

gcgctctctt tcctagccgg tcgtattccg tcgacgttgc atgccgccct tctaggacaa 2460

gttttcgaac tgcgttccgt tgctcgtgat gatcacgagc gcctccaaca gcatttccgc 2520

gaaatgaagc gcatgatcag cgatttctgc gagcaacttg ccatcgcagc ggaccgcgag 2580

gcgctggagg ctaccctgcg ccccctgacc agtatcctgg aggcgtatgt tcgcgggggc 2640

cttcgcggct accacgagca tctcgttgta tgcctcatgc aacgctatgt gaatacggag 2700

aagtattttg cgcaaggtcg gcggctcgat gaggttctat tcgatctgcg cgaacaacac 2760

cgcgagcact tggaagttgt tgcggatatc atgctcgcac atgcccagtt agcgcgaaag 2820

aaccaactga tgctcgcttt gctcgatcac atagccgcgg atgcatcgct tctgcgcagt 2880

gtcattgtcc ggcagtgcct tcacgaggtg gctgcgttca tccaccccga ctatagccaa 2940

ctggcgctgc gagcaagact cttgctggcg agttctcgtc gccgagcgct ccccgaacgt 3000

cgagagcgcc tctgcaagca gatcgaacag gctgtacatg cgcccactga agaacaatgc 3060

tatcgtctgt tgcacgcggt ggtcgagtcc caggaaagca tcctggatgc gctggtgagc 3120

ctaaccatga acccgggggt gccggtggag attcgcaaag ccgctgtcca gtgccagatc 3180

ctgcgggcct acaaagcgta ccacgtgtac gatctggttg tcgaacttga cgaggaactg 3240

ggtttcctgc gggcgttgtg gcgtttccag tatcgttcaa acctgaacgc ttttggtagt 3300

cattccagta tgcggtcgtt tccggcggca ttggtggcgg cgtcggctcc gctagcccgc 3360

cggcagctgc gtagctacga ctctgcggac aatttgcaga actggggctt tcgagtcggt 3420

atgttggtcg tctttgagac gctgcgggcg atggtgcagg gtttcgatcg tgtgctccag 3480

gtttttcgag cggaaaccgg cggcgaaacg ctggcttcgc ctcgtaccaa gtcgagcgcc 3540

gcagtcgatt ccagtgcaga ctccagtgct gaaggtattg gtgtcaatgt actgaccatc 3600

gccattccct ggacggccgt ggaggtcgct gattttgcga aatacctggg cgttgcagac 3660

gcggacgacc gtctgcaggc aagtaatgcg aacgactcgg agcaggttgt cgccctgctc 3720

acgcgtttct gtcgctcttc cgcggagcgc aaagcctcga tgcgcgctgc cggtatcaaa 3780

cttgtgacct ttttggtggc acccggtgaa ctgtgccagc gcacgctggg ctccagtctg 3840

cctcgtgaaa gcacttatcc gggtttctat acgttgcggg catcgctcga gtacgccgag 3900

gatcccattt atcgccacat tgatccacca gcggctttcc agctggagct gaatcgattg 3960

gcgaactttc gtatcacccg ctttgaacat ccgaaccgct ccattcacgt cttctacgcg 4020

gaagatcgaa ccgacaaggg cgacgctcgc ttcttcgtgc gcgcttttgt acggcaagcg 4080

gaagtctacg ccagtcccag cgatacagca gcggtttcga tcccggaagc ggagaggact 4140

tttgtggcct gtctggatgc cttggaaacc gcccgttgtg ataggcgctt ccgacgaacg 4200

gactttaatc atctcttctt gcacctgatt ccccgggtct ccatcgatgt ggatgacgtc 4260

gaggcgatct gtacgcgcct cttttatcgg ttctcctcac gttgctggcg tttgcgcgtg 4320

ttcatggtag agatcaaagt ccatgtggaa aaaatgggcc cgaaaccgct tcgatttatt 4380

ctctataatc caaccggaca ctcgttacgc gttgaaggct acgtggaaca gggagatcgt 4440

ctcctgtcgt tggattccag cgatccgggt catttacatg gtacacccgt cgatgagccg 4500

taccaggtac tgaaccgtat ccagcgccgg cgagttgttg cccagacgct cgaaacaact 4560

tacgtctacg attaccagga gcttttcacc aaagcactcc atgagcagtg gcggcgttat 4620

tcgcaggaac gacttttggg aggatttcgc cgccacaaga taccgctgaa actgctgacc 4680

tgcacggagc tcgttctcca ggatgaagat catgacgaga gcgagctgat agagaccaac 4740

cgccccgccg gggagaatga aatcggtatg gtcgcctggc gctacacctt cttcacgccc 4800

gagtatcccc agggtcgcga tgcgatcgtc attgccaacg atatcacgta cctgagtgga 4860

tcgtttggac ctcgagagga tcgcctcttt gccaaggcgt ctcagcaggc gagaaagctc 4920

ggtataccgc gcatctacat cgctgcgaat agcggagccc gaatcggcct cgcggaagaa 4980

ctgcgcaacg tcttccaagt tcagtggaag gatgcacatg acccgagccg aggcttcgac 5040

tacctctacc tgggactggc ggataagctc aagtacgagg atgaactcgg catcgtgcga 5100

acgcgagacg tcggcggcgg caagtacgct ctagttgata tctatggcgc tgaggatggc 5160

attggtgtcg agaacctcat gggttcagcg tgcatcgctg ccgagacctc agccgcctac 5220

aatgactgct tcaccatcac gtacgtcgca gcgcgctgtg tggggatcgg tgcgtacctc 5280

gtgcgcctgg ggcagcgcgt tatccagcgc gagcacaatg caccgatcat cctcactggc 5340

tactcagcgc tgaataagct tctgggtcgc gaagtctaca ccagccacga acagctgggc 5400

ggtacgaaga tcatgtatcc gaacggggtt acccatctgc gcgtatccaa tgactacgag 5460

ggtgttcggg caattctgga ctggctcgct tttgtgcctc gtatgcaagg ggaacgacct 5520

ccgatgatcg actccatcga tgctgtggta cgcgaagtcg actacgatcc acgagtggct 5580

aacgaggata ttcgatacgc cattgaaggc aagtgggtcg gacccttcgc tcccgacggg 5640

gcaagtgtgg tctcgtcgtc gggctcgtcg gtgtcgtcca atggtccggt tgcgacaggt 5700

gcgagcgctg gactgccgcc cgtcaacatg gagagtgtcg gcggtgaagt tcggtatcgt 5760

tcgcggtcgc ggacggcttc gtatgcgggt gcgaatgcag caacagcagc agcaccagga 5820

gcaagtagct tggcgacctc tggagcggca cttgcgaatg ctacgggacg agcagcggct 5880

cgggtgagcc tcgtatcgac caagcctgca accggtgtcg aggaggttcc cgcggcgttg 5940

ccttcgggtt cgagttcact ggcgctccgc tcgaccgctg tcgctgcgcc tgtacctcag 6000

gagagagcaa agacggatgc ggatgcggat gcggatgcgg atgcggatgc cttggaggcc 6060

actggaactg cccgaggcgc agataccagc ggtgctgggg cgcacgcaag ccctgacacc 6120

cggcagccgc ccacactgga tgccctgaag actgcgaggt ttgcaccagc gcccgctgca 6180

ttgccgacga caggtttacg ctcgccacca cagtccccgg gttctgtttc tgcgtatagc 6240

ccgttggaga gtggctctcc attggagccg tcccttgacg aggatcacat cgctttcacg 6300

gacaccgatc ggatgcggat cgggtcgcct ttggtacacg cgaccgacgc ctgggatacg 6360

agcaacagcg ttcactcgag tagcgcgacg acgatggctg caggtgccgt ccggggctcg 6420

cgactgttca ctgcgtcatc gagtacgtcg gcggctgcgg gagcaagcgc cagcaatagc 6480

agcaacaacg gtagcagcaa tgcgaatacg aataccatga acaatgctgg gagtagcgtg 6540

gcatcggcag gcgcaccgct caacaccacc ggtagcatca ataccaatac caatgccaat 6600

gccaatacca acaatatctt gggtagcggt ggtgttggtg cgacagcttc gctgaccgct 6660

tggagcggta accttccgcc gctaccgtac gttccaaata cgctcatggt ggtggacaac 6720

caagggtacg tgttcctggc aggcctcttc gaccgtcact cgtttcgaga gacgctcgct 6780

ggctgggcga agtccgttgt ggttggtaga gcacgccttg gtggtatccc cgtgggcgtt 6840

attgcgacac agacggtgac cacagagaag gttattcccc cggatccagc ggcgccagac 6900

tcgcgtgagc ttcgtgaaca gcaggccggt caagtctggt atccagattc atccttcaag 6960

actgcccaaa gtattcgtga ttttgatcgc gagggtttgc ctctgtttgt tctggccagc 7020

tggcgcggtt tctctggcgg agcacacgac atgttccagg aaatactcaa gtttggttca 7080

gagatcgttg acgcgcttcg cgagtactcg aaacccgtct ttgtgtacat accaccggga 7140

ggcgagcttc gaggcggtgc ctgggtcgtc ctcgataccg ccatcaatcc ccgctttatc 7200

gaaatgtacg ccgatgaaag tgcacgtggt ggtgtgctcg aacctgccgg taccgtggat 7260

atcaagttcc gcaccagaga tttgctgaag accatgcagc gcctgctacc atcgtctaga 7320

cgtgatgaga gcgattcgag tgcgtccgac gttgtcggca tgctgtcttt ggacgcatca 7380

tcatcatcag cagcagcaac agcagcaaca acaacaacaa caacaacaac aacaacaaca 7440

gcaggtaccg ctgggacgtc ctcttcgact gctgggcgta gcgagtcgcg ggtcacaaag 7500

tcaacggcgg acggcttgcc agccttcgag gcagacggtc ggagttcgct cgagcaaaca 7560

ctcttgccca tctttcagca gatagccatg acgtttgcgg atctgcacga tacagcgggt 7620

cggatgcaac ataagcgcgc gattcgccga gtggtaccct ggcgaacaag tcgcacgttt 7680

ttctactggc gccttcgacg tcggctcgcc gaggaggaac tgcggagtcg cgtttcacaa 7740

gcggatccgc aactcagcga cgcggagatc gatgcgctgc tgcggaaatg ggcaagagca 7800

catgaccccg ctctggacgg tgatatttac gaacttgatg atcctcgtgt cgtgcaatgg 7860

ctcgaggacg aattggatag ccagttggag cggcgtttgc acaaactccg cgaagcacgc 7920

gccacctggc aagctgtgga actggggaat acagcacccg aggcgttgct tgcggcaatt 7980

gagcgtattc tcgtacaaat ggatgaagtt ggtcgcaacg aatggttacg aacgcttagt 8040

gcgcgactcg agaacacctc atccggtttg ggtgatctcg aggcgcccag ctcgggtgag 8100

caggccgcac cacgacgtgt cttgtcgggt cgtggtctgc tgcgacgctt cctcggg 8157

<210> 20

<211> 2719

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 20

Met Gly Asp Leu Gly Glu His Ala Pro Ser Asn Gly Ser Ala Lys Asp

1 5 10 15

Ser Val Arg Arg Tyr Val Glu Glu His Gly Gly Ser Arg Pro Ile Gln

20 25 30

Arg Leu Leu Val Ala Asn Asn Gly Ile Ala Ala Val Lys Cys Ile Arg

35 40 45

Ser Met Arg Arg Trp Ala Tyr Glu Ala Phe Gly Ser Glu Arg Ala Leu

50 55 60

Glu Phe Val Ala Met Ala Thr Pro Glu Asp Val Gln Ala Asn Ala Asp

65 70 75 80

Tyr Ile Arg Leu Ala Asp Leu Phe Val Pro Val Pro Gly Gly Ser Asn

85 90 95

Asn His Asn Tyr Ala Asn Val Glu Leu Ile Val Asp Ile Ala Glu Arg

100 105 110

Asn Glu Cys Asp Ala Val Trp Ala Gly Trp Gly His Ala Ser Glu Asn

115 120 125

Pro Leu Leu Pro Ala Arg Leu Ala Glu Thr Gly Ile Ala Phe Leu Gly

130 135 140

Pro Asp Ala Ile Ala Met Arg Ala Leu Gly Asp Lys Ile Ser Ser Thr

145 150 155 160

Leu Leu Ala Gln Ser Ala Asp Val Pro Val Val Ser Trp Asn Gly Asp

165 170 175

Asp Leu Lys Val Thr Phe His Arg Glu Arg Gly Gly Ile Asp Glu Glu

180 185 190

Arg Tyr Arg Arg Ala Cys Val Ala Asn Val Asp Glu Ala Arg Ala Ala

195 200 205

Ala Asp Arg Ile Gly Tyr Pro Val Met Ile Lys Ala Ser Glu Gly Gly

210 215 220

Gly Gly Lys Gly Ile Arg Leu Cys Arg Arg Pro Glu Asp Val Arg Asp

225 230 235 240

Ala Phe Arg Gln Val Ala Gly Glu Ile Pro Gly Ser Pro Ile Phe Ile

245 250 255

Met Lys Met Val Asp Gln Ala Arg His Leu Glu Val Gln Ile Val Ala

260 265 270

Asp Glu Tyr Gly His Ala Ile Ala Leu Tyr Gly Arg Asp Cys Ser Val

275 280 285

Gln Arg Arg His Gln Lys Ile Ile Glu Glu Gly Pro Val Thr Ala Thr

290 295 300

Pro Pro Gln Val Trp Lys Glu Leu Glu His Gly Ala Val Arg Leu Ala

305 310 315 320

Lys Met Val Gly Tyr Val Gly Ala Gly Thr Val Glu Tyr Leu Tyr Asp

325 330 335

Gly Arg Arg Phe Tyr Phe Leu Glu Leu Asn Pro Arg Leu Gln Val Glu

340 345 350

His Pro Val Thr Glu Trp Ile Thr Gly Val Asn Leu Pro Ala Val Gln

355 360 365

Leu Gln Ile Ala Met Gly Ile Pro Leu Gln Arg Ile Ala Ala Ile Gln

370 375 380

Ala Leu Tyr Gly Asn Arg Pro Asp Leu Asp Leu Glu Arg His Gln Pro

385 390 395 400

Asn Pro Pro His Gly His Val Ile Ala Cys Arg Val Thr Ala Glu Asn

405 410 415

Pro Glu Glu Gly Phe Gln Pro Thr Ser Gly Ser Ile Gln Glu Leu Ser

420 425 430

Phe Arg Asn Thr Pro Asn Val Trp Gly Tyr Phe Ser Val Gly Ala Trp

435 440 445

Gly Gly Val His Glu Tyr Ala Asp Ser Gln Phe Gly His Leu Phe Ala

450 455 460

Trp Gly Ala Asp Arg Glu Met Ala Arg Arg Asn Met Val Leu Ala Leu

465 470 475 480

Lys Glu Leu Ser Ile Arg Gly Asp Ile Arg Thr Thr Val Glu Tyr Leu

485 490 495

Ile Thr Leu Leu Glu Leu Glu Ser Tyr Arg Glu Asn Arg Ile His Thr

500 505 510

Arg Trp Leu Asp Asn Leu Ile Ala Ser Lys Val Lys Pro Glu Arg Pro

515 520 525

Pro Phe His Ile Ala Val Val Leu Gly Ala Val His Gln Ala Tyr Arg

530 535 540

Ala Trp Ser Glu Arg Arg Arg Ser Phe Val Glu Ser Leu Ser Arg Gly

545 550 555 560

Gln Val Pro Gln Arg Ile Asp Ala Thr Phe Ile Glu Phe Gln Phe Glu

565 570 575

Leu Ile Tyr Asp Glu Leu Lys Tyr Ala Leu Ile Val Arg Gln Ala Gly

580 585 590

Pro Asn Ala Phe His Val Ser Leu Ala Gln His Pro Glu Glu Arg Val

595 600 605

Arg Val Asp Phe Arg Pro Leu Leu Asp Gly Gly Leu Leu Ile Met Cys

610 615 620

Asp Gly Arg Ser Phe Asn Thr His Ser Lys Glu Asp Ser Thr Gly Met

625 630 635 640

Arg Leu Ser Ile Asp Gly Arg Thr Cys Val Phe Pro Gly Glu Met Asp

645 650 655

Pro Thr Arg Ile Ala Ala Gln Ala Ser Gly Arg Leu Val Arg Tyr Leu

660 665 670

Val Ala Asn Glu Glu His Val Asp Gln Gly Gln Val Ile Ala Glu Ile

675 680 685

Glu Val Met Lys Met Tyr Leu Ser Val Gln Ala Pro Glu Pro Gly Thr

690 695 700

Ile His Leu Leu Lys Pro Ala Gly Ala Ala Leu Glu Pro Gly Glu Val

705 710 715 720

Phe Ala Gln Leu Asp Leu Asp Asp Pro Ser Lys Val Arg Arg Val Thr

725 730 735

Pro Phe Thr Gly Arg Phe Pro Thr Met Leu Pro Pro Gln Arg Leu Gly

740 745 750

Lys Lys Pro His Gln Arg Phe Glu Ser Ala Lys Gln Gln Ile Glu Ala

755 760 765

Leu Leu Asp Gly Tyr Asp Val Asp Met Glu Pro Leu Gly Met Leu Leu

770 775 780

Gln Leu Ala Ser Thr Glu Pro Ala Val Pro Ala Gly Lys Leu Gln Glu

785 790 795 800

Ala Leu Ser Phe Leu Ala Gly Arg Ile Pro Ser Thr Leu His Ala Ala

805 810 815

Leu Leu Gly Gln Val Phe Glu Leu Arg Ser Val Ala Arg Asp Asp His

820 825 830

Glu Arg Leu Gln Gln His Phe Arg Glu Met Lys Arg Met Ile Ser Asp

835 840 845

Phe Cys Glu Gln Leu Ala Ile Ala Ala Asp Arg Glu Ala Leu Glu Ala

850 855 860

Thr Leu Arg Pro Leu Thr Ser Ile Leu Glu Ala Tyr Val Arg Gly Gly

865 870 875 880

Leu Arg Gly Tyr His Glu His Leu Val Val Cys Leu Met Gln Arg Tyr

885 890 895

Val Asn Thr Glu Lys Tyr Phe Ala Gln Gly Arg Arg Leu Asp Glu Val

900 905 910

Leu Phe Asp Leu Arg Glu Gln His Arg Glu His Leu Glu Val Val Ala

915 920 925

Asp Ile Met Leu Ala His Ala Gln Leu Ala Arg Lys Asn Gln Leu Met

930 935 940

Leu Ala Leu Leu Asp His Ile Ala Ala Asp Ala Ser Leu Leu Arg Ser

945 950 955 960

Val Ile Val Arg Gln Cys Leu His Glu Val Ala Ala Phe Ile His Pro

965 970 975

Asp Tyr Ser Gln Leu Ala Leu Arg Ala Arg Leu Leu Leu Ala Ser Ser

980 985 990

Arg Arg Arg Ala Leu Pro Glu Arg Arg Glu Arg Leu Cys Lys Gln Ile

995 1000 1005

Glu Gln Ala Val His Ala Pro Thr Glu Glu Gln Cys Tyr Arg Leu

1010 1015 1020

Leu His Ala Val Val Glu Ser Gln Glu Ser Ile Leu Asp Ala Leu

1025 1030 1035

Val Ser Leu Thr Met Asn Pro Gly Val Pro Val Glu Ile Arg Lys

1040 1045 1050

Ala Ala Val Gln Cys Gln Ile Leu Arg Ala Tyr Lys Ala Tyr His

1055 1060 1065

Val Tyr Asp Leu Val Val Glu Leu Asp Glu Glu Leu Gly Phe Leu

1070 1075 1080

Arg Ala Leu Trp Arg Phe Gln Tyr Arg Ser Asn Leu Asn Ala Phe

1085 1090 1095

Gly Ser His Ser Ser Met Arg Ser Phe Pro Ala Ala Leu Val Ala

1100 1105 1110

Ala Ser Ala Pro Leu Ala Arg Arg Gln Leu Arg Ser Tyr Asp Ser

1115 1120 1125

Ala Asp Asn Leu Gln Asn Trp Gly Phe Arg Val Gly Met Leu Val

1130 1135 1140

Val Phe Glu Thr Leu Arg Ala Met Val Gln Gly Phe Asp Arg Val

1145 1150 1155

Leu Gln Val Phe Arg Ala Glu Thr Gly Gly Glu Thr Leu Ala Ser

1160 1165 1170

Pro Arg Thr Lys Ser Ser Ala Ala Val Asp Ser Ser Ala Asp Ser

1175 1180 1185

Ser Ala Glu Gly Ile Gly Val Asn Val Leu Thr Ile Ala Ile Pro

1190 1195 1200

Trp Thr Ala Val Glu Val Ala Asp Phe Ala Lys Tyr Leu Gly Val

1205 1210 1215

Ala Asp Ala Asp Asp Arg Leu Gln Ala Ser Asn Ala Asn Asp Ser

1220 1225 1230

Glu Gln Val Val Ala Leu Leu Thr Arg Phe Cys Arg Ser Ser Ala

1235 1240 1245

Glu Arg Lys Ala Ser Met Arg Ala Ala Gly Ile Lys Leu Val Thr

1250 1255 1260

Phe Leu Val Ala Pro Gly Glu Leu Cys Gln Arg Thr Leu Gly Ser

1265 1270 1275

Ser Leu Pro Arg Glu Ser Thr Tyr Pro Gly Phe Tyr Thr Leu Arg

1280 1285 1290

Ala Ser Leu Glu Tyr Ala Glu Asp Pro Ile Tyr Arg His Ile Asp

1295 1300 1305

Pro Pro Ala Ala Phe Gln Leu Glu Leu Asn Arg Leu Ala Asn Phe

1310 1315 1320

Arg Ile Thr Arg Phe Glu His Pro Asn Arg Ser Ile His Val Phe

1325 1330 1335

Tyr Ala Glu Asp Arg Thr Asp Lys Gly Asp Ala Arg Phe Phe Val

1340 1345 1350

Arg Ala Phe Val Arg Gln Ala Glu Val Tyr Ala Ser Pro Ser Asp

1355 1360 1365

Thr Ala Ala Val Ser Ile Pro Glu Ala Glu Arg Thr Phe Val Ala

1370 1375 1380

Cys Leu Asp Ala Leu Glu Thr Ala Arg Cys Asp Arg Arg Phe Arg

1385 1390 1395

Arg Thr Asp Phe Asn His Leu Phe Leu His Leu Ile Pro Arg Val

1400 1405 1410

Ser Ile Asp Val Asp Asp Val Glu Ala Ile Cys Thr Arg Leu Phe

1415 1420 1425

Tyr Arg Phe Ser Ser Arg Cys Trp Arg Leu Arg Val Phe Met Val

1430 1435 1440

Glu Ile Lys Val His Val Glu Lys Met Gly Pro Lys Pro Leu Arg

1445 1450 1455

Phe Ile Leu Tyr Asn Pro Thr Gly His Ser Leu Arg Val Glu Gly

1460 1465 1470

Tyr Val Glu Gln Gly Asp Arg Leu Leu Ser Leu Asp Ser Ser Asp

1475 1480 1485

Pro Gly His Leu His Gly Thr Pro Val Asp Glu Pro Tyr Gln Val

1490 1495 1500

Leu Asn Arg Ile Gln Arg Arg Arg Val Val Ala Gln Thr Leu Glu

1505 1510 1515

Thr Thr Tyr Val Tyr Asp Tyr Gln Glu Leu Phe Thr Lys Ala Leu

1520 1525 1530

His Glu Gln Trp Arg Arg Tyr Ser Gln Glu Arg Leu Leu Gly Gly

1535 1540 1545

Phe Arg Arg His Lys Ile Pro Leu Lys Leu Leu Thr Cys Thr Glu

1550 1555 1560

Leu Val Leu Gln Asp Glu Asp His Asp Glu Ser Glu Leu Ile Glu

1565 1570 1575

Thr Asn Arg Pro Ala Gly Glu Asn Glu Ile Gly Met Val Ala Trp

1580 1585 1590

Arg Tyr Thr Phe Phe Thr Pro Glu Tyr Pro Gln Gly Arg Asp Ala

1595 1600 1605

Ile Val Ile Ala Asn Asp Ile Thr Tyr Leu Ser Gly Ser Phe Gly

1610 1615 1620

Pro Arg Glu Asp Arg Leu Phe Ala Lys Ala Ser Gln Gln Ala Arg

1625 1630 1635

Lys Leu Gly Ile Pro Arg Ile Tyr Ile Ala Ala Asn Ser Gly Ala

1640 1645 1650

Arg Ile Gly Leu Ala Glu Glu Leu Arg Asn Val Phe Gln Val Gln

1655 1660 1665

Trp Lys Asp Ala His Asp Pro Ser Arg Gly Phe Asp Tyr Leu Tyr

1670 1675 1680

Leu Gly Leu Ala Asp Lys Leu Lys Tyr Glu Asp Glu Leu Gly Ile

1685 1690 1695

Val Arg Thr Arg Asp Val Gly Gly Gly Lys Tyr Ala Leu Val Asp

1700 1705 1710

Ile Tyr Gly Ala Glu Asp Gly Ile Gly Val Glu Asn Leu Met Gly

1715 1720 1725

Ser Ala Cys Ile Ala Ala Glu Thr Ser Ala Ala Tyr Asn Asp Cys

1730 1735 1740

Phe Thr Ile Thr Tyr Val Ala Ala Arg Cys Val Gly Ile Gly Ala

1745 1750 1755

Tyr Leu Val Arg Leu Gly Gln Arg Val Ile Gln Arg Glu His Asn

1760 1765 1770

Ala Pro Ile Ile Leu Thr Gly Tyr Ser Ala Leu Asn Lys Leu Leu

1775 1780 1785

Gly Arg Glu Val Tyr Thr Ser His Glu Gln Leu Gly Gly Thr Lys

1790 1795 1800

Ile Met Tyr Pro Asn Gly Val Thr His Leu Arg Val Ser Asn Asp

1805 1810 1815

Tyr Glu Gly Val Arg Ala Ile Leu Asp Trp Leu Ala Phe Val Pro

1820 1825 1830

Arg Met Gln Gly Glu Arg Pro Pro Met Ile Asp Ser Ile Asp Ala

1835 1840 1845

Val Val Arg Glu Val Asp Tyr Asp Pro Arg Val Ala Asn Glu Asp

1850 1855 1860

Ile Arg Tyr Ala Ile Glu Gly Lys Trp Val Gly Pro Phe Ala Pro

1865 1870 1875

Asp Gly Ala Ser Val Val Ser Ser Ser Gly Ser Ser Val Ser Ser

1880 1885 1890

Asn Gly Pro Val Ala Thr Gly Ala Ser Ala Gly Leu Pro Pro Val

1895 1900 1905

Asn Met Glu Ser Val Gly Gly Glu Val Arg Tyr Arg Ser Arg Ser

1910 1915 1920

Arg Thr Ala Ser Tyr Ala Gly Ala Asn Ala Ala Thr Ala Ala Ala

1925 1930 1935

Pro Gly Ala Ser Ser Leu Ala Thr Ser Gly Ala Ala Leu Ala Asn

1940 1945 1950

Ala Thr Gly Arg Ala Ala Ala Arg Val Ser Leu Val Ser Thr Lys

1955 1960 1965

Pro Ala Thr Gly Val Glu Glu Val Pro Ala Ala Leu Pro Ser Gly

1970 1975 1980

Ser Ser Ser Leu Ala Leu Arg Ser Thr Ala Val Ala Ala Pro Val

1985 1990 1995

Pro Gln Glu Arg Ala Lys Thr Asp Ala Asp Ala Asp Ala Asp Ala

2000 2005 2010

Asp Ala Asp Ala Leu Glu Ala Thr Gly Thr Ala Arg Gly Ala Asp

2015 2020 2025

Thr Ser Gly Ala Gly Ala His Ala Ser Pro Asp Thr Arg Gln Pro

2030 2035 2040

Pro Thr Leu Asp Ala Leu Lys Thr Ala Arg Phe Ala Pro Ala Pro

2045 2050 2055

Ala Ala Leu Pro Thr Thr Gly Leu Arg Ser Pro Pro Gln Ser Pro

2060 2065 2070

Gly Ser Val Ser Ala Tyr Ser Pro Leu Glu Ser Gly Ser Pro Leu

2075 2080 2085

Glu Pro Ser Leu Asp Glu Asp His Ile Ala Phe Thr Asp Thr Asp

2090 2095 2100

Arg Met Arg Ile Gly Ser Pro Leu Val His Ala Thr Asp Ala Trp

2105 2110 2115

Asp Thr Ser Asn Ser Val His Ser Ser Ser Ala Thr Thr Met Ala

2120 2125 2130

Ala Gly Ala Val Arg Gly Ser Arg Leu Phe Thr Ala Ser Ser Ser

2135 2140 2145

Thr Ser Ala Ala Ala Gly Ala Ser Ala Ser Asn Ser Ser Asn Asn

2150 2155 2160

Gly Ser Ser Asn Ala Asn Thr Asn Thr Met Asn Asn Ala Gly Ser

2165 2170 2175

Ser Val Ala Ser Ala Gly Ala Pro Leu Asn Thr Thr Gly Ser Ile

2180 2185 2190

Asn Thr Asn Thr Asn Ala Asn Ala Asn Thr Asn Asn Ile Leu Gly

2195 2200 2205

Ser Gly Gly Val Gly Ala Thr Ala Ser Leu Thr Ala Trp Ser Gly

2210 2215 2220

Asn Leu Pro Pro Leu Pro Tyr Val Pro Asn Thr Leu Met Val Val

2225 2230 2235

Asp Asn Gln Gly Tyr Val Phe Leu Ala Gly Leu Phe Asp Arg His

2240 2245 2250

Ser Phe Arg Glu Thr Leu Ala Gly Trp Ala Lys Ser Val Val Val

2255 2260 2265

Gly Arg Ala Arg Leu Gly Gly Ile Pro Val Gly Val Ile Ala Thr

2270 2275 2280

Gln Thr Val Thr Thr Glu Lys Val Ile Pro Pro Asp Pro Ala Ala

2285 2290 2295

Pro Asp Ser Arg Glu Leu Arg Glu Gln Gln Ala Gly Gln Val Trp

2300 2305 2310

Tyr Pro Asp Ser Ser Phe Lys Thr Ala Gln Ser Ile Arg Asp Phe

2315 2320 2325

Asp Arg Glu Gly Leu Pro Leu Phe Val Leu Ala Ser Trp Arg Gly

2330 2335 2340

Phe Ser Gly Gly Ala His Asp Met Phe Gln Glu Ile Leu Lys Phe

2345 2350 2355

Gly Ser Glu Ile Val Asp Ala Leu Arg Glu Tyr Ser Lys Pro Val

2360 2365 2370

Phe Val Tyr Ile Pro Pro Gly Gly Glu Leu Arg Gly Gly Ala Trp

2375 2380 2385

Val Val Leu Asp Thr Ala Ile Asn Pro Arg Phe Ile Glu Met Tyr

2390 2395 2400

Ala Asp Glu Ser Ala Arg Gly Gly Val Leu Glu Pro Ala Gly Thr

2405 2410 2415

Val Asp Ile Lys Phe Arg Thr Arg Asp Leu Leu Lys Thr Met Gln

2420 2425 2430

Arg Leu Leu Pro Ser Ser Arg Arg Asp Glu Ser Asp Ser Ser Ala

2435 2440 2445

Ser Asp Val Val Gly Met Leu Ser Leu Asp Ala Ser Ser Ser Ser

2450 2455 2460

Ala Ala Ala Thr Ala Ala Thr Thr Thr Thr Thr Thr Thr Thr Thr

2465 2470 2475

Thr Thr Ala Gly Thr Ala Gly Thr Ser Ser Ser Thr Ala Gly Arg

2480 2485 2490

Ser Glu Ser Arg Val Thr Lys Ser Thr Ala Asp Gly Leu Pro Ala

2495 2500 2505

Phe Glu Ala Asp Gly Arg Ser Ser Leu Glu Gln Thr Leu Leu Pro

2510 2515 2520

Ile Phe Gln Gln Ile Ala Met Thr Phe Ala Asp Leu His Asp Thr

2525 2530 2535

Ala Gly Arg Met Gln His Lys Arg Ala Ile Arg Arg Val Val Pro

2540 2545 2550

Trp Arg Thr Ser Arg Thr Phe Phe Tyr Trp Arg Leu Arg Arg Arg

2555 2560 2565

Leu Ala Glu Glu Glu Leu Arg Ser Arg Val Ser Gln Ala Asp Pro

2570 2575 2580

Gln Leu Ser Asp Ala Glu Ile Asp Ala Leu Leu Arg Lys Trp Ala

2585 2590 2595

Arg Ala His Asp Pro Ala Leu Asp Gly Asp Ile Tyr Glu Leu Asp

2600 2605 2610

Asp Pro Arg Val Val Gln Trp Leu Glu Asp Glu Leu Asp Ser Gln

2615 2620 2625

Leu Glu Arg Arg Leu His Lys Leu Arg Glu Ala Arg Ala Thr Trp

2630 2635 2640

Gln Ala Val Glu Leu Gly Asn Thr Ala Pro Glu Ala Leu Leu Ala

2645 2650 2655

Ala Ile Glu Arg Ile Leu Val Gln Met Asp Glu Val Gly Arg Asn

2660 2665 2670

Glu Trp Leu Arg Thr Leu Ser Ala Arg Leu Glu Asn Thr Ser Ser

2675 2680 2685

Gly Leu Gly Asp Leu Glu Ala Pro Ser Ser Gly Glu Gln Ala Ala

2690 2695 2700

Pro Arg Arg Val Leu Ser Gly Arg Gly Leu Leu Arg Arg Phe Leu

2705 2710 2715

Gly

<210> 21

<211> 1596

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 21

atgcttctgg cggaaagcct cggcggcgtc gcttttgtca ccgttcacac ggtgctgtgc 60

gtcgctgtac tgtacttacc gtggaaccgc ggcgaccgag aaagcttccg tgtcacttgt 120

ctaggggctt tgagcacttg cgctgttatc gctctgagcc tggtgaatct tcagttgttc 180

ttcactggct tgacctggcg ggagctgtcg cggacgtatc actggctgag gccgtggcac 240

agctggcaag tctatctact ggactctggc ggttttttgg tgctcctttt tctgcacgag 300

tacagaaaaa gacagcagac tccagttaag agttcgtcat cgagcacgtg tgcatcgacg 360

tcgcagtccg cttgcccgaa tgcggagaac ggcatctatc gaaacgagag gcgaaatacc 420

tggtgcatgt tccgaagcct gctgatcgca ccgataacag aggagctcct cttccgttgt 480

gtattcgatg cagcaatgag atcagctcaa gtccctgaac ttgccagcat gattttcaac 540

ggagtcatgt ttgccgtagc gcacgcgcat cactatttcc ggcaccagag caggtcactt 600

ctcggaaagc agcttcttgt aacgttttgc tttggttgcg tccaagtcgt ttgcctgaga 660

cggacggatt actccctgtg ggcctgcatc gcaacgcacg ctttggcgaa cgcgctcgat 720

ctgcagaaag ttttcagtga taggggagcg tcacattttc tctacggcga cgtaggccat 780

cagttaggag ccctgctgca agctgcagca ccgctcgtat tcatcttgcg ctatgcgctt 840

gacatggcca tgcaagcgag tccgttccgg tcctcgatcc tccaggtgca tgcggacccg 900

gggctgctcg aggaacggcg tgaaatcatg cgtccaacga cgccgttttc gctcttcttg 960

atagaatatc atcagcgtga tccgatcttc gggcgcttca tggctctctg ctccatgctg 1020

cctcaacttt tattcgccgc agaggtcaca gctgtctact gttggcgaag tccgcgcgca 1080

ttgctcctag ctgttggcca gattgtgaac gagacgctga gctacgcact caagcgatcg 1140

tgtcggatac ctcgcccgcc aatcgctcga ctggaagcag atgcgttcgg ttggccctct 1200

tcacacgctc agttcatgtc ctacttgtac gttttctacg tactctatgt gtctaggcca 1260

cagcgcaagg catcatcgag ccacaacggt gaaatgatgc accagacgct tccgaagcgg 1320

aggaagacga ccgattcaat ttcggaaacc gtggcagtga tgtttttgct aggtctgtca 1380

tccgttctgg tggctgcctc aagagtatac ctggcatatc attatcccag tcaggtttgg 1440

tatggaatca tcatgggcac tattttcgcg ataacctggt tccttgcgag tgaaaacgtt 1500

ttcttatcct acgttcgatc attgcggatt ctcgtatggc tcgggttcgg cgaagatgac 1560

tctgacctgt ttcgcagtat actcagagat gtgcat 1596

<210> 22

<211> 532

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 22

Met Leu Leu Ala Glu Ser Leu Gly Gly Val Ala Phe Val Thr Val His

1 5 10 15

Thr Val Leu Cys Val Ala Val Leu Tyr Leu Pro Trp Asn Arg Gly Asp

20 25 30

Arg Glu Ser Phe Arg Val Thr Cys Leu Gly Ala Leu Ser Thr Cys Ala

35 40 45

Val Ile Ala Leu Ser Leu Val Asn Leu Gln Leu Phe Phe Thr Gly Leu

50 55 60

Thr Trp Arg Glu Leu Ser Arg Thr Tyr His Trp Leu Arg Pro Trp His

65 70 75 80

Ser Trp Gln Val Tyr Leu Leu Asp Ser Gly Gly Phe Leu Val Leu Leu

85 90 95

Phe Leu His Glu Tyr Arg Lys Arg Gln Gln Thr Pro Val Lys Ser Ser

100 105 110

Ser Ser Ser Thr Cys Ala Ser Thr Ser Gln Ser Ala Cys Pro Asn Ala

115 120 125

Glu Asn Gly Ile Tyr Arg Asn Glu Arg Arg Asn Thr Trp Cys Met Phe

130 135 140

Arg Ser Leu Leu Ile Ala Pro Ile Thr Glu Glu Leu Leu Phe Arg Cys

145 150 155 160

Val Phe Asp Ala Ala Met Arg Ser Ala Gln Val Pro Glu Leu Ala Ser

165 170 175

Met Ile Phe Asn Gly Val Met Phe Ala Val Ala His Ala His His Tyr

180 185 190

Phe Arg His Gln Ser Arg Ser Leu Leu Gly Lys Gln Leu Leu Val Thr

195 200 205

Phe Cys Phe Gly Cys Val Gln Val Val Cys Leu Arg Arg Thr Asp Tyr

210 215 220

Ser Leu Trp Ala Cys Ile Ala Thr His Ala Leu Ala Asn Ala Leu Asp

225 230 235 240

Leu Gln Lys Val Phe Ser Asp Arg Gly Ala Ser His Phe Leu Tyr Gly

245 250 255

Asp Val Gly His Gln Leu Gly Ala Leu Leu Gln Ala Ala Ala Pro Leu

260 265 270

Val Phe Ile Leu Arg Tyr Ala Leu Asp Met Ala Met Gln Ala Ser Pro

275 280 285

Phe Arg Ser Ser Ile Leu Gln Val His Ala Asp Pro Gly Leu Leu Glu

290 295 300

Glu Arg Arg Glu Ile Met Arg Pro Thr Thr Pro Phe Ser Leu Phe Leu

305 310 315 320

Ile Glu Tyr His Gln Arg Asp Pro Ile Phe Gly Arg Phe Met Ala Leu

325 330 335

Cys Ser Met Leu Pro Gln Leu Leu Phe Ala Ala Glu Val Thr Ala Val

340 345 350

Tyr Cys Trp Arg Ser Pro Arg Ala Leu Leu Leu Ala Val Gly Gln Ile

355 360 365

Val Asn Glu Thr Leu Ser Tyr Ala Leu Lys Arg Ser Cys Arg Ile Pro

370 375 380

Arg Pro Pro Ile Ala Arg Leu Glu Ala Asp Ala Phe Gly Trp Pro Ser

385 390 395 400

Ser His Ala Gln Phe Met Ser Tyr Leu Tyr Val Phe Tyr Val Leu Tyr

405 410 415

Val Ser Arg Pro Gln Arg Lys Ala Ser Ser Ser His Asn Gly Glu Met

420 425 430

Met His Gln Thr Leu Pro Lys Arg Arg Lys Thr Thr Asp Ser Ile Ser

435 440 445

Glu Thr Val Ala Val Met Phe Leu Leu Gly Leu Ser Ser Val Leu Val

450 455 460

Ala Ala Ser Arg Val Tyr Leu Ala Tyr His Tyr Pro Ser Gln Val Trp

465 470 475 480

Tyr Gly Ile Ile Met Gly Thr Ile Phe Ala Ile Thr Trp Phe Leu Ala

485 490 495

Ser Glu Asn Val Phe Leu Ser Tyr Val Arg Ser Leu Arg Ile Leu Val

500 505 510

Trp Leu Gly Phe Gly Glu Asp Asp Ser Asp Leu Phe Arg Ser Ile Leu

515 520 525

Arg Asp Val His

530

<210> 23

<211> 1251

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 23

atgagctgca aaggtgcagc ttttgggttc gccactctaa atggcttgtc agttcggagg 60

catttgtttt gtaggagttc ggcaaatcgt agagtctgcg ggaagctttt tactggttca 120

caatgtccac gtgctccaag gtattttttg gtgcacactg acaagagtgc cggcaacact 180

ccaagtacac gaggttggga ggaatatgcg ggaaacttta ttttcaggcc tccgcacgtt 240

atcgtaaggc cctcacagcg cacagcctac gggactggaa acggtacacc gctgccacca 300

aaagctctcg ttcactttct cgggggtgcc ttcgttggag ccgcgccgca ggttgcgtat 360

cgatggtttc tcgaacagtt ggctttagag ggcttcgtag tagtcgcgac gccgtaccga 420

ctttctttcg atcacctttg gacgatggat gacgtgctca ccaggttttc agcagcagcg 480

ggaatgctcg ctctcgatta cggtcccatc cctgttgttg gaatcggtca ctccctgggc 540

gctctcctgc acacccttgg aggcagcttg ttttgcaacg cggatgggta caaggctgcc 600

aacgtactga ttgcattcaa taatcgccgc gcggaagatg cgataccttt attcagagag 660

tttatcgcgc ccgtggttaa gacggttgcc caaactggtg aactctctga tgtgatcgag 720

cggctcgtga tcgatgggcc tgcgacgttt gacacgctgt tcgacactgt tacagatgtt 780

gtgtttcctg gttctcggga tagtgagctt ttcgccttgg tgcgacagtc aagggcgctt 840

gcgcagcaga ttccaccttt atttgctgag gtcgctgatg gagcttttgc attcaaccct 900

gatcctatcg aggtcatgga ggcaattcgc acactataca aagttcgtca aacgcttatt 960

gtgagcttca aaaacgacat cctcgatgac tcgcgttcgc ttcagaaagc actagaccct 1020

gagcgaggcg ctacagtaat tcgtctcggc ggaactcact taaccccctg cgcacaggat 1080

ttcctcgacc cacggctccg gcaaagcagc tttttctctt catttacgtg cgaaaaggat 1140

aacgacacat cctacatact ccgcgaagct atgcgcagaa ttgtgcttcg cgaagcaatg 1200

ctcatgaaaa acagtgttgt ggcgtattta gatcgcgctc tcggaatgga a 1251

<210> 24

<211> 417

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 24

Met Ser Cys Lys Gly Ala Ala Phe Gly Phe Ala Thr Leu Asn Gly Leu

1 5 10 15

Ser Val Arg Arg His Leu Phe Cys Arg Ser Ser Ala Asn Arg Arg Val

20 25 30

Cys Gly Lys Leu Phe Thr Gly Ser Gln Cys Pro Arg Ala Pro Arg Tyr

35 40 45

Phe Leu Val His Thr Asp Lys Ser Ala Gly Asn Thr Pro Ser Thr Arg

50 55 60

Gly Trp Glu Glu Tyr Ala Gly Asn Phe Ile Phe Arg Pro Pro His Val

65 70 75 80

Ile Val Arg Pro Ser Gln Arg Thr Ala Tyr Gly Thr Gly Asn Gly Thr

85 90 95

Pro Leu Pro Pro Lys Ala Leu Val His Phe Leu Gly Gly Ala Phe Val

100 105 110

Gly Ala Ala Pro Gln Val Ala Tyr Arg Trp Phe Leu Glu Gln Leu Ala

115 120 125

Leu Glu Gly Phe Val Val Val Ala Thr Pro Tyr Arg Leu Ser Phe Asp

130 135 140

His Leu Trp Thr Met Asp Asp Val Leu Thr Arg Phe Ser Ala Ala Ala

145 150 155 160

Gly Met Leu Ala Leu Asp Tyr Gly Pro Ile Pro Val Val Gly Ile Gly

165 170 175

His Ser Leu Gly Ala Leu Leu His Thr Leu Gly Gly Ser Leu Phe Cys

180 185 190

Asn Ala Asp Gly Tyr Lys Ala Ala Asn Val Leu Ile Ala Phe Asn Asn

195 200 205

Arg Arg Ala Glu Asp Ala Ile Pro Leu Phe Arg Glu Phe Ile Ala Pro

210 215 220

Val Val Lys Thr Val Ala Gln Thr Gly Glu Leu Ser Asp Val Ile Glu

225 230 235 240

Arg Leu Val Ile Asp Gly Pro Ala Thr Phe Asp Thr Leu Phe Asp Thr

245 250 255

Val Thr Asp Val Val Phe Pro Gly Ser Arg Asp Ser Glu Leu Phe Ala

260 265 270

Leu Val Arg Gln Ser Arg Ala Leu Ala Gln Gln Ile Pro Pro Leu Phe

275 280 285

Ala Glu Val Ala Asp Gly Ala Phe Ala Phe Asn Pro Asp Pro Ile Glu

290 295 300

Val Met Glu Ala Ile Arg Thr Leu Tyr Lys Val Arg Gln Thr Leu Ile

305 310 315 320

Val Ser Phe Lys Asn Asp Ile Leu Asp Asp Ser Arg Ser Leu Gln Lys

325 330 335

Ala Leu Asp Pro Glu Arg Gly Ala Thr Val Ile Arg Leu Gly Gly Thr

340 345 350

His Leu Thr Pro Cys Ala Gln Asp Phe Leu Asp Pro Arg Leu Arg Gln

355 360 365

Ser Ser Phe Phe Ser Ser Phe Thr Cys Glu Lys Asp Asn Asp Thr Ser

370 375 380

Tyr Ile Leu Arg Glu Ala Met Arg Arg Ile Val Leu Arg Glu Ala Met

385 390 395 400

Leu Met Lys Asn Ser Val Val Ala Tyr Leu Asp Arg Ala Leu Gly Met

405 410 415

Glu

<210> 25

<211> 447

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 25

atgaacgact accaaaagat tggggttact ttgagcgcaa tttccgtttt attttatggg 60

ctcggtgttg tactcttttt tgatacgggg ctgatctcca tagcaagtgt tcttttcacc 120

tcgtcgctct ttttcatcct gggcttcaag agagccgccc gtttcttctt ctcaagaaga 180

aaacttcgcg cgagtgcgct cttcttcggc ggctttggcc tggtgctgtt gcgctggcca 240

gtgctgggga ctctggtcca agcggtcggt gcgctctggc tattcctgag cttcattccg 300

atcgcgatga cgtttcttcg acaggttccg gttctgggtc agcttgtcga caccagcatt 360

gttcggcgga tactgcgccg cctatcggca gcctctggat acctggaaac gaacggtggc 420

atcggtttcg aaccgaaact cccggta 447

<210> 26

<211> 149

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 26

Met Asn Asp Tyr Gln Lys Ile Gly Val Thr Leu Ser Ala Ile Ser Val

1 5 10 15

Leu Phe Tyr Gly Leu Gly Val Val Leu Phe Phe Asp Thr Gly Leu Ile

20 25 30

Ser Ile Ala Ser Val Leu Phe Thr Ser Ser Leu Phe Phe Ile Leu Gly

35 40 45

Phe Lys Arg Ala Ala Arg Phe Phe Phe Ser Arg Arg Lys Leu Arg Ala

50 55 60

Ser Ala Leu Phe Phe Gly Gly Phe Gly Leu Val Leu Leu Arg Trp Pro

65 70 75 80

Val Leu Gly Thr Leu Val Gln Ala Val Gly Ala Leu Trp Leu Phe Leu

85 90 95

Ser Phe Ile Pro Ile Ala Met Thr Phe Leu Arg Gln Val Pro Val Leu

100 105 110

Gly Gln Leu Val Asp Thr Ser Ile Val Arg Arg Ile Leu Arg Arg Leu

115 120 125

Ser Ala Ala Ser Gly Tyr Leu Glu Thr Asn Gly Gly Ile Gly Phe Glu

130 135 140

Pro Lys Leu Pro Val

145

<210> 27

<211> 2118

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 27

atgcatcagc aggaccctga cgagataccg ctttctgaac tccttgggaa ctctggagaa 60

tcggtgccga gagttgtttc agttgccaga caaggtggct tcggcgcacc gtcgatacag 120

gttgccaaga agtcttcctc agctgcactt tcggaagatg aactgcctct agctctactt 180

attggtacgg aaagaaagaa ggagtgcagt agaaggaagg gcaaagtcga accatgtcga 240

aaagcgccga aaaggaaatg cgaacgaatg aaagccgaaa agcaccgcaa ggcgaaaaag 300

ttgcaggcaa caagtactaa acatagagcg gaagatgaat cgatcaagtg gtgcagcctc 360

gaacataacg gagtcttatt ccctccggaa tatgaaccgc acggacggcc tctactctac 420

gatggctgcg aaatagcgct cccgcccgag gccgaagaag tggccacctt ttatgcgtcg 480

aagttgggaa ctgtttattt agaaaaggaa acatttcgga aaaacttttt tgacgacttt 540

cggaagacct tgcccgttga tttacggaag cgcatcgtga aactcgaaca ctgtgacttc 600

agccgtatcc gggagtatct cgacgagctg aaagaacgaa aacaaagcat gcctccaagc 660

aaacggaaag agttacgaga agcggaagcg caaagagttg cgcattacac ggttgcaatc 720

gttgatggac gcaaggagaa agtggccaat tatcgtgtcg agcctccggg tctcttcctg 780

gggcgcggtg accatccgct aatgggccga gtcaaacgtc gcatattccc agaggatgtc 840

acactgaatc ttggtcgtga cgcacctatt ccgccgtgtc catttaaagg ccatgactgg 900

ggatccatcg tgcacaacca gagggtcacg tggctggctt gctggcgaga tccaatctcc 960

gatgagtaca agtacgtctg gctgagtgcg tcttcgcact tcaaggcgat gagcgatcaa 1020

gaaaaatttg agaaggcgca gcagctgagc aagcatatca cgaagatacg caatgagtac 1080

acgaaaggat tagaatctgc tgataggcac acgcagcaga gatccgtcgc actgtatctt 1140

attgataagc ttgcactccg agtcgggaac gagaaaggag aagacgaagc ggacactgtt 1200

ggttgctgct ccctgcgggt agaacacata acccttcaag agcctaatat agtgcagctc 1260

gactttctcg gcaaggattc gatgcgatac tttaacagag tgcgagtcga agcgctagtt 1320

tttcatcgcc tgcgagagtt cttgaaagga aaacgagtat cagataatgt attcgacgaa 1380

ctgaaagtcg aggaacttaa tgactatttg aaaagcctaa tgccgggcct atctgcgaaa 1440

gtttttcgca cctacaatgc atcttacacc cttgataagc ttttgcacag cgtgaaaacc 1500

cctggaccgg atattcattc gcgccttctg ttctacaatg aggcgaacaa agatgtagct 1560

gtattatgta accatcagcg atcccttcca aagactcatt cgttaatgct tgaaagactg 1620

cgcgataagc tagaggagag cagagcgtat cttgaggcgc tcaaaatggc tcgaggaaag 1680

gctcaggcat ctccagacag ccgcgcacaa gtaacacgct ggcgacggcc agtagttgag 1740

attcccgagg actgctccct cgctgagcga aagcgcattc gcgaggaggc cgaaaagcga 1800

ccgaaagaaa aacaggtagt gaacatgggg ctcgcgtcta tcacacgagg aatagcgcaa 1860

acacaggaaa aaatcaggcg cctagaggct gacctgaaaa cgagggattc gctcgctacg 1920

gtgtcgttga gcacctctaa gatcaactat ttagatccgc gaataacagt tgcttggtgc 1980

aaaagacatg aggttcctat tgagcgaata ttccctcgcg cgctacagga aaagtttatg 2040

tggagtatgg gggtaagcga ggactttcgt ttcccaatca gcaacgtcgt ttcgaacgat 2100

ccctcagggg cttccaca 2118

<210> 28

<211> 706

<212> PRT

<213> Cyanidioschyzon merole (a red alga)

<400> 28

Met His Gln Gln Asp Pro Asp Glu Ile Pro Leu Ser Glu Leu Leu Gly

1 5 10 15

Asn Ser Gly Glu Ser Val Pro Arg Val Val Ser Val Ala Arg Gln Gly

20 25 30

Gly Phe Gly Ala Pro Ser Ile Gln Val Ala Lys Lys Ser Ser Ser Ala

35 40 45

Ala Leu Ser Glu Asp Glu Leu Pro Leu Ala Leu Leu Ile Gly Thr Glu

50 55 60

Arg Lys Lys Glu Cys Ser Arg Arg Lys Gly Lys Val Glu Pro Cys Arg

65 70 75 80

Lys Ala Pro Lys Arg Lys Cys Glu Arg Met Lys Ala Glu Lys His Arg

85 90 95

Lys Ala Lys Lys Leu Gln Ala Thr Ser Thr Lys His Arg Ala Glu Asp

100 105 110

Glu Ser Ile Lys Trp Cys Ser Leu Glu His Asn Gly Val Leu Phe Pro

115 120 125

Pro Glu Tyr Glu Pro His Gly Arg Pro Leu Leu Tyr Asp Gly Cys Glu

130 135 140

Ile Ala Leu Pro Pro Glu Ala Glu Glu Val Ala Thr Phe Tyr Ala Ser

145 150 155 160

Lys Leu Gly Thr Val Tyr Leu Glu Lys Glu Thr Phe Arg Lys Asn Phe

165 170 175

Phe Asp Asp Phe Arg Lys Thr Leu Pro Val Asp Leu Arg Lys Arg Ile

180 185 190

Val Lys Leu Glu His Cys Asp Phe Ser Arg Ile Arg Glu Tyr Leu Asp

195 200 205

Glu Leu Lys Glu Arg Lys Gln Ser Met Pro Pro Ser Lys Arg Lys Glu

210 215 220

Leu Arg Glu Ala Glu Ala Gln Arg Val Ala His Tyr Thr Val Ala Ile

225 230 235 240

Val Asp Gly Arg Lys Glu Lys Val Ala Asn Tyr Arg Val Glu Pro Pro

245 250 255

Gly Leu Phe Leu Gly Arg Gly Asp His Pro Leu Met Gly Arg Val Lys

260 265 270

Arg Arg Ile Phe Pro Glu Asp Val Thr Leu Asn Leu Gly Arg Asp Ala

275 280 285

Pro Ile Pro Pro Cys Pro Phe Lys Gly His Asp Trp Gly Ser Ile Val

290 295 300

His Asn Gln Arg Val Thr Trp Leu Ala Cys Trp Arg Asp Pro Ile Ser

305 310 315 320

Asp Glu Tyr Lys Tyr Val Trp Leu Ser Ala Ser Ser His Phe Lys Ala

325 330 335

Met Ser Asp Gln Glu Lys Phe Glu Lys Ala Gln Gln Leu Ser Lys His

340 345 350

Ile Thr Lys Ile Arg Asn Glu Tyr Thr Lys Gly Leu Glu Ser Ala Asp

355 360 365

Arg His Thr Gln Gln Arg Ser Val Ala Leu Tyr Leu Ile Asp Lys Leu

370 375 380

Ala Leu Arg Val Gly Asn Glu Lys Gly Glu Asp Glu Ala Asp Thr Val

385 390 395 400

Gly Cys Cys Ser Leu Arg Val Glu His Ile Thr Leu Gln Glu Pro Asn

405 410 415

Ile Val Gln Leu Asp Phe Leu Gly Lys Asp Ser Met Arg Tyr Phe Asn

420 425 430

Arg Val Arg Val Glu Ala Leu Val Phe His Arg Leu Arg Glu Phe Leu

435 440 445

Lys Gly Lys Arg Val Ser Asp Asn Val Phe Asp Glu Leu Lys Val Glu

450 455 460

Glu Leu Asn Asp Tyr Leu Lys Ser Leu Met Pro Gly Leu Ser Ala Lys

465 470 475 480

Val Phe Arg Thr Tyr Asn Ala Ser Tyr Thr Leu Asp Lys Leu Leu His

485 490 495

Ser Val Lys Thr Pro Gly Pro Asp Ile His Ser Arg Leu Leu Phe Tyr

500 505 510

Asn Glu Ala Asn Lys Asp Val Ala Val Leu Cys Asn His Gln Arg Ser

515 520 525

Leu Pro Lys Thr His Ser Leu Met Leu Glu Arg Leu Arg Asp Lys Leu

530 535 540

Glu Glu Ser Arg Ala Tyr Leu Glu Ala Leu Lys Met Ala Arg Gly Lys

545 550 555 560

Ala Gln Ala Ser Pro Asp Ser Arg Ala Gln Val Thr Arg Trp Arg Arg

565 570 575

Pro Val Val Glu Ile Pro Glu Asp Cys Ser Leu Ala Glu Arg Lys Arg

580 585 590

Ile Arg Glu Glu Ala Glu Lys Arg Pro Lys Glu Lys Gln Val Val Asn

595 600 605

Met Gly Leu Ala Ser Ile Thr Arg Gly Ile Ala Gln Thr Gln Glu Lys

610 615 620

Ile Arg Arg Leu Glu Ala Asp Leu Lys Thr Arg Asp Ser Leu Ala Thr

625 630 635 640

Val Ser Leu Ser Thr Ser Lys Ile Asn Tyr Leu Asp Pro Arg Ile Thr

645 650 655

Val Ala Trp Cys Lys Arg His Glu Val Pro Ile Glu Arg Ile Phe Pro

660 665 670

Arg Ala Leu Gln Glu Lys Phe Met Trp Ser Met Gly Val Ser Glu Asp

675 680 685

Phe Arg Phe Pro Ile Ser Asn Val Val Ser Asn Asp Pro Ser Gly Ala

690 695 700

Ser Thr

705

<210> 29

<211> 600

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 29

cgacgagaac gtataaggag tgcgcacggc gttttgttac aataccgata gatgagtttc 60

gaacatcgca ttcacaccat gagcgggggc gcacgctcca gagagtggag atggaaaagt 120

gccagcggag ccctgaggat gcaaaaaagt acggaccgct gacggaagag caaatggaaa 180

ggagggcgaa acttcgaggg ctacttgcat tagtaagtac aaccaacgat ggtaagaaac 240

gtatggagtt tgcgaaccga gactttaacg ctgccatcaa tatcaggaga tgtgcggtgc 300

tggagacgag acctccagag tgaacaagaa ggtacttttt tggacaacct tctaaggtcg 360

aactatatga gataaaattg gaagaagtag ttggtggccg gtccaaaaag acggggaggc 420

gtctgcacat cagttggaga cgttttgtcc aaggcgcgcc gatcgccact actgtacacg 480

gccggcgaga acgtggcgag aatacgcaag cgagctcgcc ggctgcgctc gctgcaccac 540

cgtcttttga ctcacaactt cgcgatatct ttgttctctg tgtttcttcg ttcgttgacc 600

<210> 30

<211> 599

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 30

gcctcgaggg cctctctgcg tcgtcgccga agagctgaca gaaagcgttc caggtgcgcg 60

agcctcactt gggcatatac cagtgagtgg gtgcgggcac cactagtccg gtgaactcgg 120

ccggtcagcc aaatcctcca cccggaaggt tcgcaccgtc actgcagcga gcgcagtcca 180

agcttctagg cgcggtaggc gtgcaggatg cgcgtccaat tcggcaagat gcgctgccgg 240

catgcccgcc catgtgtcgg acccattccg tgcaagcgag gcacgtcgag cgatttgggg 300

cgcgtgcata ggcccgcgca cgatgaatga tctggttgca tgcgtacaca aacaggactt 360

ttctgaccat gatatcccta ttcggacatg acgcatcgcg cacttccgca acacctcgtt 420

gctgggcgca gccggccgcg gcacctcgcg agcacgccgg cctcggcagc gcgagcgcat 480

tagcgagatc tccgacgacg gacgtgggtc gagattcgat ttcggaggcc gcacgacgca 540

aaaaggtcat tcgagtttgt gtctgcggac tctgaacgtt cctcgtaaag cacttctga 599

<210> 31

<211> 500

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 31

ccctcactgg catggcaaag ccgttctgtc tccgtgccgc atctgctcca ggggtgtaag 60

cgcgattgcg aatgctcaag gaaggttacg tgcacagtgg aatgcacgaa ataaccagta 120

catccgaaag gaagtacaag taatggaacc tgaaggtagg gtccagcagc atgatgggcg 180

ctttcccgaa tgtcaatacc gatctatcgc gcaatctggc ctccatggtc catagaagcg 240

ctttggcatc ggcgggagaa ccgggcgtcg ccccgcgctg cgctccatgg aacaatgctc 300

aaatcacgaa taaattgtac tttattaaat ctgtatgtac tatgatgtac aaaatagcat 360

tccaggaatc ccgagatcac acacgcgctc gagacgtgaa ctgtctcgtc actctcggct 420

acggcttcat cttcctcgta tttcttcgcc attagatacg agtgcggtaa gattctgggc 480

tgaagttttc aatattagtg 500

<210> 32

<211> 4700

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> pQE80

<400> 32

tagctgagct tggactcctg ttgatagatc cagtaatgac ctcagaactc catctggatt 60

tgttcagaac gctcggttgc cgccgggcgt tttttattgg tgagaatcca agctagcttg 120

gcgagatttt caggagctaa ggaagctaaa atggagaaaa aaatcactgg atataccacc 180

gttgatatat cccaatggca tcgtaaagaa cattttgagg catttcagtc agttgctcaa 240

tgtacctata accagaccgt tcagctggat attacggcct ttttaaagac cgtaaagaaa 300

aataagcaca agttttatcc ggcctttatt cacattcttg cccgcctgat gaatgctcat 360

ccggaatttc gtatggcaat gaaagacggt gagctggtga tatgggatag tgttcaccct 420

tgttacaccg ttttccatga gcaaactgaa acgttttcat cgctctggag tgaataccac 480

gacgatttcc ggcagtttct acacatatat tcgcaagatg tggcgtgtta cggtgaaaac 540

ctggcctatt tccctaaagg gtttattgag aatatgtttt tcgtctcagc caatccctgg 600

gtgagtttca ccagttttga tttaaacgtg gccaatatgg acaacttctt cgcccccgtt 660

ttcaccatgg gcaaatatta tacgcaaggc gacaaggtgc tgatgccgct ggcgattcag 720

gttcatcatg ccgtttgtga tggcttccat gtcggcagaa tgcttaatga attacaacag 780

tactgcgatg agtggcaggg cggggcgtaa tttttttaag gcagttattg gtgcccttaa 840

acgcctgggg taatgactct ctagcttgag gcatcaaata aaacgaaagg ctcagtcgaa 900

agactgggcc tttcgtttta tctgttgttt gtcggtgaac gctctcctga gtaggacaaa 960

tccgccctct agattacgtg cagtcgatga taagctgtca aacatgagaa ttgtgcctaa 1020

tgagtgagct aacttacatt aattgcgttg cgctcactgc ccgctttcca gtcgggaaac 1080

ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt 1140

gggcgccagg gtggtttttc ttttcaccag tgagacgggc aacagctgat tgcccttcac 1200

cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca gcaggcgaaa 1260

atcctgtttg atggtggtta acggcgggat ataacatgag ctgtcttcgg tatcgtcgta 1320

tcccactacc gagatatccg caccaacgcg cagcccggac tcggtaatgg cgcgcattgc 1380

gcccagcgcc atctgatcgt tggcaaccag catcgcagtg ggaacgatgc cctcattcag 1440

catttgcatg gtttgttgaa aaccggacat ggcactccag tcgccttccc gttccgctat 1500

cggctgaatt tgattgcgag tgagatattt atgccagcca gccagacgca gacgcgccga 1560

gacagaactt aatgggcccg ctaacagcgc gatttgctgg tgacccaatg cgaccagatg 1620

ctccacgccc agtcgcgtac cgtcttcatg ggagaaaata atactgttga tgggtgtctg 1680

gtcagagaca tcaagaaata acgccggaac attagtgcag gcagcttcca cagcaatggc 1740

atcctggtca tccagcggat agttaatgat cagcccactg acgcgttgcg cgagaagatt 1800

gtgcaccgcc gctttacagg cttcgacgcc gcttcgttct accatcgaca ccaccacgct 1860

ggcacccagt tgatcggcgc gagatttaat cgccgcgaca atttgcgacg gcgcgtgcag 1920

ggccagactg gaggtggcaa cgccaatcag caacgactgt ttgcccgcca gttgttgtgc 1980

cacgcggttg ggaatgtaat tcagctccgc catcgccgct tccacttttt cccgcgtttt 2040

cgcagaaacg tggctggcct ggttcaccac gcgggaaacg gtctgataag agacaccggc 2100

atactctgcg acatcgtata acgttactgg tttcacattc accaccctga attgactctc 2160

ttccgggcgc tatcatgcca taccgcgaaa ggttttgcac cattcgatgg tgtcggaatt 2220

tcgggcagcg ttgggtcctg gccacgggtg cgcatgatct agagctgcct cgcgcgtttc 2280

ggtgatgacg gtgaaaacct ctgacacatg cagctcccgg agacggtcac agcttgtctg 2340

taagcggatg ccgggagcag acaagcccgt cagggcgcgt cagcgggtgt tggcgggtgt 2400

cggggcgcag ccatgaccca gtcacgtagc gatagcggag tgtatactgg cttaactatg 2460

cggcatcaga gcagattgta ctgagagtgc accatatgcg gtgtgaaata ccgcacagat 2520

gcgtaaggag aaaataccgc atcaggcgct cttccgcttc ctcgctcact gactcgctgc 2580

gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat 2640

ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca 2700

ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc 2760

atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc 2820

aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg 2880

gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta 2940

ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg 3000

ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac 3060

acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag 3120

gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat 3180

ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat 3240

ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc 3300

gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt 3360

ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct 3420

agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt 3480

ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc 3540

gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac 3600

catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat 3660

cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg 3720

cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata 3780

gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg tcgtttggta 3840

tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt 3900

gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag 3960

tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa 4020

gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag tgtatgcggc 4080

gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat agcagaactt 4140

taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc 4200

tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta 4260

ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa 4320

taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca 4380

tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac 4440

aaataggggt tccgcgcaca tttccccgaa aagtgccacc tgacgtctaa gaaaccatta 4500

ttatcatgac attaacctat aaaaataggc gtatcacgag gccctttcgt cttcacctcg 4560

agaaatcata aaaaatttat ttgctttgtg agcggataac aattataata gattcaattg 4620

tgagcggata acaatttcac acagaattca ttaaagagga gaaattaact atgagaggat 4680

cgcatcacca tcaccatcac 4700

<210> 33

<211> 1986

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 33

gcgtgagtca gttcactgac aataagggaa acttgcgctt caaatgggag ccagccgagg 60

ccgtgttggc ggtggcattc gatgttcctg tgcccgggta tgatacatac aattgcatca 120

atctgcgctt gtgggacagt aagcctgcgc gtgagttcga tcttagctct ttcaacgttg 180

gcgactatta taagattctt gaaatgcggc agacgagtga gacgctctcc gccgtcctgt 240

acccgaacga tagcactgaa gcaggcaagg agctgcgcct caagcaacag tatttcttcg 300

tttcggcgac gctgcaggac atcattcgga ggttcctgaa gaaggaccga ccactcacgc 360

aacttgccga aaaagtgtgc attcagctga acgacacgca tccgacgatc gggattgttg 420

aaatgatgcg ccttctcctg gacgagtacg cattgggctg gacggatgcg tggaaaaccg 480

tcaaagcggt gttctcgtac acgaatcaca cggtgctgcc ggaggccttg gaaaagtggc 540

ctgtgccact catggaacgg ctcttgcctc gccacatgca gctcatcttc gaaatcaact 600

ttcgtcatct ccaggagtat gctcgtctaa gcaacaacga tggccatctg ttggagcgag 660

tgagcatcat cgaggagggt tttccaaaaa tggtgcgcat ggcccagctg gccgtcgttg 720

gctcacatac tgtcaatggt gtcgcggaaa tacactcgga actcgtgcga acccgtctct 780

tccctgattt taaccgcttc gagccgaaaa agtttgtgaa catcacaaac ggtgtgaccc 840

ctcgacgctg gatactggaa gcaaatcccg ccttgagtgc ggtgttttcc cgctggacgg 900

agagcgatga atggattttg gatttgaacc agatccgcca gttggaacag tacgccgaga 960

accctgacct gcaacgggaa tttttcgaag ccaaaaagga aaataagcgg cgtctcgctg 1020

aatacattcg agaaaagaat ggcgtccacg tggatgtgaa tgccctcttc gacatccagg 1080

tcaagcgaat tcacgagtac aagcgccagt tgttgaatat tctcggtgtg attgcgcgct 1140

acaatttgat caagtcgggc aagcgtgatc tcgtgccgcg ggtcttcatc ttcggaggca 1200

aagcagcggc tggttacgca caagccaagc gcatcattcg tctcattaat ggtgtagcag 1260

acgtggtcaa caatgatcca gatgttggcg acctcctgaa agttgtattt ctcgaaaact 1320

acagcgtgtc tcttgccgag atcatcattc cggcgagcga cattagcgag catatatcga 1380

ctgccggcat ggaggcgtca ggaacgagca acatgaagtt tgtgatgaac ggcggtctca 1440

tcatcggcac tatggacgga gcgaacattg aaatccgaga agaaatcggg ccggagaaca 1500

tctttatttt cggtctgttg gctcaagaag ttgaccaggc gcgcaatgaa ctcaagtacc 1560

atggctggaa atgtaccgat gggcgcttcc agaacgcact gggtcagctc agtcgcggta 1620

tgtactgcgg tcaagacact tttcaggaaa ttgtacgagc cctcgatcca gccaacgact 1680

actacctgat cagtcgcgac tttacctcgt atatggaagc ccaggatcgc gtcgacgctg 1740

cctataggga ccagcgctcg tggctggcta agtgtattgt gagcacggct cgcatgggta 1800

agttcagttc tgatcgcagt atccatgagt acgcggagcg catttggcgc attgagccat 1860

gtgcctatac accgtccagc atcacataca aggaaccagt tgagggtgtc tcagagccca 1920

ccgcagacgg tacagcgccg aagacgacgc tccgaggaac gtaaacatca agtcccgagg 1980

gtgacg 1986

<210> 34

<211> 600

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 34

cgacgagaac gtataaggag tgcgcacggc gttttgttac aataccgata gatgagtttc 60

gaacatcgca ttcacaccat gagcgggggc gcacgctcca gagagtggag atggaaaagt 120

gccagcggag ccctgaggat gcaaaaaagt acggaccgct gacggaagag caaatggaaa 180

ggagggcgaa acttcgaggg ctacttgcat tagtaagtac aaccaacgat ggtaagaaac 240

gtatggagtt tgcgaaccga gactttaacg ctgccatcaa tatcaggaga tgtgcggtgc 300

tggagacgag acctccagag tgaacaagaa ggtacttttt tggacaacct tctaaggtcg 360

aactatatga gataaaattg gaagaagtag ttggtggccg gtccaaaaag acggggaggc 420

gtctgcacat cagttggaga cgttttgtcc aaggcgcgcc gatcgccact actgtacacg 480

gccggcgaga acgtggcgag aatacgcaag cgagctcgcc ggctgcgctc gctgcaccac 540

cgtcttttga ctcacaactt cgcgatatct ttgttctctg tgtttcttcg ttcgttgacc 600

<210> 35

<211> 717

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 35

atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac 60

ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac 120

ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc 180

ctcgtgacca ccctgaccta cggcgtgcag tgcttcagcc gctaccccga ccacatgaag 240

cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc 300

ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg 360

gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac 420

aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac 480

ggcatcaagg tgaacttcaa gatccgccac aacatcgagg acggcagcgt gcagctcgcc 540

gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac 600

tacctgagca cccagtccgc cctgagcaaa gaccccaacg agaagcgcga tcacatggtc 660

ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaag 717

<210> 36

<211> 200

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 36

taaactagct atttatctgg tacatatcat tcataagcac atgtttttgc gttgaaaccc 60

ggtaaaacac tcatgacgtt ttgtgttgaa ttgcaaccag cacgttatcg accagctctc 120

gaacaaaaca tcgctttata cgggagaagt tgctgcggta ttgaccagag cgtacgaaaa 180

cttgtgcagg caaaaagtgt 200

<210> 37

<211> 2757

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 37

gaactgaggg gcgaacgcag tcctccttgc attgttaagc aaaaaatatt ctctacaagc 60

aatttgtgta caatctatat ggtacgctgc gtagagattt gcatgtcctg ctctacagtg 120

cgaagaagca ttctatttta cgcagcggga gtaaagcaaa atcgagttga acgactatct 180

gacgcctacg caaagcgatt ccgcgtaccg tgtatctgca aagtagcact tcttaatggt 240

agagccgcaa ttgagaacgt tcggagtgtt ttcctggttc tgctggatgg atttttggta 300

aactatattt aatcttctag tgggaggatg attgcgcggg caaaaccata ttttccagga 360

tgagtcgtac tgaaagaagc acagtattca ctatgccggg cgaggtaggt gctagtttgg 420

gtttgacctc acgcgatgcg cacaacggct gcggaatacg cacaactcga tactgatcgt 480

cgagtcggca agactgcaac atgctcatgg ctcaggtcat ccgaagagaa acctcaccat 540

aaataccgtg ttcatacttc acatcgtaca taagaaaaat tacttttcag gccctctaca 600

ataaataaaa ttggattcca cttatcgcgt ttattgagag atttaccgga cgcttccgtc 660

gcaaaaaaaa gaagagacat aagaaacttg acccactgtg tagaaggaga gcaaacaccg 720

caaagccctg cgaaaacctc atgcggtgat gtcgcgccct gctgcgatcg taacgaaggc 780

cattcgtcgc cacgcgcaac gcagggcaag agacggcttg cgtacggtgc agaacaccga 840

actggctgtg aagaggatct cttctttgtc agtactaaaa ggcgtcgctt tctgggaatg 900

gagcagcttg ccaaggatct ttttgagata ggtgcggtca agtttggcac gttcaaactc 960

aagtctggga tagcgtctcc attctatgtc gatctgcgcg ttgctgtctc gtatcctcga 1020

gttctgagaa gcattgcggc tctgtaccta gagtgcttgc aggacttttc tggcgctttc 1080

gacgttattt gcggggttcc atacactgcg ctcccttttg cgacagcaat ggctgtgcag 1140

ggagacttac ctatggttat gtgtcgaaag gaggtcaaag accatggaac acggcgagtt 1200

gttgaaggtg ctttcacaca aggctctcgt tgcttaataa tcgaggacgt tgttacaagt 1260

ggttcaagta ttctggaagt tgtgggcgca ttgcgcgcag agggtctcaa agtggatcac 1320

gcgatcgtct tgcttgatag ggagcaaggc gggtgtgagg cgcttctcgg ccagggaata 1380

gggctccgat ctgttttccg catcagcgac ctcgttggga cgctgcggaa tttgaacttg 1440

cttgagccgg ggaaggtgga tatgattcta gattttattc gcacggcccg cgcagcatca 1500

gatgttaagc gccccgaaac aaacgttcca tccgcattgc cagtcgccgc atcaagttct 1560

tccccggtgc gagatcaact ttactcaatc gcagaaaaaa agagaagcat attatgcgtt 1620

gcggcagacg tacaatcgac taaggagctt cttggaattg cagatgcagt aggcccccac 1680

atctgtgttc tgaagttgca cgccgatatt atccaggact ggacgacgga cacatcaaga 1740

cgccttcgag agctcgcgga caaacacagt ttcatgcttt ttgaggatcg aaaattcgct 1800

gacataggaa acacggttgt tgcacagttc agcgcaggcg tacatcggat ctcttcctgg 1860

gcggacattg tcaatgcaca cgccatcccg ggccctggac tgattgaggg cctgcgccac 1920

gcttgcgcaa tttccggcag aatgattggc ctcctgttag tcgcacaaat gtccagcaaa 1980

gggaacttga ttgacgagag atacactgct acatgcctgc ggatggcccg cgaagcgttt 2040

ccgttctgta tcggcttcat tgcccaggag cgtctggacc cgacagggaa gctctttgtg 2100

atggcgcctg gtgtgcaact ggattcaaag ggcgatcaat taggccaaca gtacaattca 2160

ccacagtatc tgctgaagcg caagggtgtc gacttcctta tcgttggccg cggtatctac 2220

gggagtgaag agccagcaaa ggcggcccag atgtacaaag agttgtcgtg gagcgtcctt 2280

cacggctaat tcctaatggg cagaagcaag cgtcggctcc cacaaagttc agtagaacgg 2340

cgggggtcag ttgccacgaa cgctttacgt gcggcatcaa gatgatattt cattgcgaat 2400

ggagttcgtc gacgtcagtt gcatccgcag aatgcttctt tgtttcggat atgtccgcgg 2460

cctgtgcccg caacgaatgt tcctagaagt acatttcacg gcgccttgaa catgtagtta 2520

agataagcgc tcgaaatcgt ccctgattcg gcagtgactg tccctagagc tggatcctat 2580

ggtttcttga ccctgtggct acaatatgcg ctcgacaccg tcgaagatca gggaatcgct 2640

ctagtgtcgg cctgatttcc accaagtgca ctctgcactt tttgaaattt ttggaggacc 2700

cattaaaaaa ataaatgcaa ggattcctaa gaatagagat acagtcagct gctaggg 2757

<210> 38

<211> 1853

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 38

gaaaccgctc agcgaccaag cgactcgagc tcgccacctc gtcgcgtggt ctcgtcgatg 60

acctccttca tcgctaccaa gtagttcacg cgacgcaact tgcgcagatc ttttctcaga 120

agaaacctca ggtcactcac atcggggtac tttcggccac gcgtgttcgc aacttcaaca 180

cagcgctgca cgacagcggt gaggtacgtc cgcaacaagt cctcgacaag ctcagccgta 240

tcacgccgcg gctgccgcgc atccccgaaa ccgtacagca tctggcgaat ttcattctgg 300

aataagcgct tcgaagcgtc ttcttcgggg tcgttcgcga aaaaagcatc ccagtcgttt 360

ccatcgctgg actgcatcgc actcgccgac tcggttccta gcacgaggcc gctgcctaga 420

caacgcgacg tacgcagata ctgtaagcaa taatagtgct gatgctgcca ctaacgaacc 480

tcgagcgcgt gtcagggcgc atacacgtcg ttaacgtacc gcggtggcag gcacggcgag 540

cttggcgtat gcgcgatacg taaatacatg ctgcgttgga tagcgactct gctgccgcat 600

ggcgacgctt tcatcagatg cggcgctgct gggaagaagg ccgtatccgg tgcatctgga 660

tggtagtttt gtataataat aggcttaaat gtcgttgcgt tcaaaggatg tgcaatcttt 720

ctggaagaaa tggaatatat gggacaaaaa agtacggaag gaaaagcatc tttaggtcca 780

tatatgaata gattggacac tgcatttcct gggtccaata acgccatcga acaaaatgga 840

tatatcggac aaaacatgta cggaacgtga agcaccgttc cctacgtcat caacactaga 900

cctcgtgttc tctcgccgat gctccggagc tatatgtacg ggatcgggcg tgcgcctgcg 960

cgttcctctt ctggacaggg cagcatctcg gtagcaacta ctgcagtgac tcacgcgtga 1020

tggtgtttgt tgcacctgtt ctacgacacg tgagcaagca ccgggcaagg agcaagcgac 1080

tggtgcgcgc aagcgtcagc acgacggaat ggtacgtgcc ggcgacactg gcccccaggg 1140

tgtcggtgaa taacacccag aatcttcgca actggagcgc gccagtgcga accgttcgcg 1200

tccctgtggc tacggtaggc gttggttccg ttgaacagca ccaaaagcac ctcggacacc 1260

gcatagcgtt ttggcttcgg cgcgagaccc gtctgccgga ctgggctacc ttgatgttgg 1320

tgtcagcgct tcctgttgtt gagctacgcg gtggcgtgcc ggtagggctc tggctcggtt 1380

tgtcgccggc cgagactttt ctcttctgcg ttatcggcaa catgcttcca ataccgtttc 1440

tagtttttgg tcttcgacac gagcggatgc gtcgcctcgc gcgccccctg ttggactctg 1500

tggcgcggcg cctgccgagt catgcgaata ccccgtcgag tcaggcgctg gccctggcct 1560

tgtttgttgg cgtaccattg ccgggcactg gagcgtggtc tggcgctatt gccgcgttcc 1620

ttctgcaaat ggacatctgg cttgcgctgg tctccatcgc ggcgggcgtc gcgatcgctg 1680

gctgcatcat gattgcactc gttctaatgg gacgcatcgg tggactcatt gtcgcgtttg 1740

cgcttttagg agttggtgct agtgcgctgt ggcgcatgct gaaaccaccg tcgtcggcgg 1800

aaaactcgtg acatcaacgc cgccgcatct gagtgggtaa ggtcagcaag ctg 1853

<210> 39

<211> 200

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 39

cttatagctt acgtggcgga ttcgcagcga ttcgcgagat ttcccgaatc gccgatctcg 60

cgcgagatcc tcggcgaaga ttcgcggcgc atcatcgaac ggatcgcaga tgctgaaata 120

ccgcgcgcag cgaaattttt ccaacactag atttccattt gtgttctagc cgtggaaacc 180

tgtgagagaa ccagggattc 200

<210> 40

<211> 1209

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 40

atggtgttta cgtgtgctgc tttcgtagct ccagttggtg gttttcgcgg cacggcggtg 60

cgcgctacga gccgcgaggc cgtcggaccc cggctgcagg ctggtgacca gcctggtgcc 120

ttcactgcac gtacgcgaag cttaggtgtg ccgctgacgc ggagccggca acgtgctgcg 180

cactcgaatc ttgtgatgaa ggtgcgggta gcggtttccg ggttcggtcg catcggacgc 240

aactttgtgc gctgcctgca agcgaccccg aatgcgaacc tggagctggt ggggatcaac 300

gatacggctg gcatcaaaac tgccgctcat ctgctgaagt acgactctat tctgggaatt 360

gcaccgtttg acgtgaaggt tagcggggag agtaccatgc tcattgacgg gaagccggtt 420

accgttgtca gcaaccgcga tccgacccag ctgccttgga gggacctcaa cgtcgacatc 480

gtcatcgagg caacgggtgt ctttatctca agggacgggg caggcaagca catcgaagcg 540

ggtgccaaaa aggtggtcat cacagcgcca gcgaaaggcg aaggcgtccc gacctttgtc 600

atgggtctga acaacaccca gtataaccac gccaccgacc atgtggtgag caacgcgtcg 660

tgcaccacca acggcatggc accgttcgtg aaggtgctcg acgaggaatt cggcatcgtt 720

tccggcatga tgaccacgac gcattcgtat accggggacc agcgtctgct ggatgcatcg 780

catcgcgacc tgcgtcgcgc ccgtgcggcc gcgttgaaca tcgtgccgac ctcgacgggt 840

gccgcgcagg cggttgcgct ggtgtatcca ccagtgaagg gcaagctcac cggcattgcc 900

ctgcgagtgc cgaccccgaa cgtttctatc gtcgactttg tctgcacggt gaagaaacca 960

acgttcaagg aggaagtgaa tgcagctttc gtacgagcag cggaagggcc aatgaagggt 1020

atcctggcgg tgagcgatga gccgctcgtc tcatcggatt atcggatgaa cgtcaactca 1080

agcattgtgg atgcagcgct gacgacggtg atgggtgata cgctcgtcaa agtggtagcc 1140

tggtacgaca acgagtatgg ctacagtcaa cgggttgtgg acttggccaa ctacatagcg 1200

cagcatttc 1209

<210> 41

<211> 1572

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> codon optimized GP (codon optimized GP)

<400> 41

atggttccac aagcactgtt gctcgtgcct attctaggct tttccctgtg tttcgggaag 60

tttccgattt acaccattcc tgacacgttg ggcccatgga gtccgatcga catccatcac 120

ctgagctgcc caaacaactt ggtggttgag gatgaaggtt gcacgaatct ttccggtttc 180

tcgtatatgg aactgaaggt tggttacact agtgcgatca aggtgaacgg ttttacatgc 240

acaggagtcg ttaccgaagc ggagacatat accaactttg taggctacgt gaccactacg 300

ttcaaacgga agcactttcg cccgactcct gatgcctgca gagctgctta caactggaag 360

atggcaggag atccgcgtta tgaggagtcg ctccactccc cgtatccaga ttaccattgg 420

ctgcgaacag tgaagacgac taaggagagc ctggtcatca tatcgccctc agttgctgac 480

ctcgacccct atgacaatag cctgcattcg cgggtctttc ctagtggcaa atgcagcggg 540

ataacccgct cttcggtgta ctgctccact aaccacgact acacggtgtg gatgcctgag 600

atcttgcgac tcggcacgtc atgcgacatt ttcaccaaca gtcgaggcaa acgcgtgagc 660

aaagggagta cgacgtgtgg cttcatcgac gagaggggcc tctacaagtc actcaaaggg 720

gcctgcaagc tcaagctgtg tggcgttctt ggcctacgct tgatggatgg aacctgggta 780

tcaatgcaga catccaatga gacgaagtgg tgtccaccaa atcagctcgt aaatctccac 840

gacttacgct ccgatgaact ggaacatctg gtgattgaag agctcgtcaa gaaacgcgaa 900

gagtgcttag atgcgctcga gtccatcatt acgacgaaaa gcgtcagctt ccgccggctt 960

tcacacctgc gaaagctggt tcccggcttt ggtaaggcct acacgatctt caacaagacc 1020

ttgatggagg cagaagccca ctacaaatcg gtgcgtacct ggaacgagat cataccctct 1080

aaagggtgtc ttcgggtcgg aggtcgttgt catccgcatg tcaatggggt gtttttcaac 1140

ggcattatcc ttggtcccga tggtcacgtc ctgattccgg agatgcagtc gagcctcttg 1200

cagcagcata tcgagctcct ggagtcgtct gtgattccgc ttatgcatcc actcgcggat 1260

ccctttaccg tgttcaagga cggtgacgaa actgaggact tcatcgaggt ccacttaccg 1320

gacgtccacg aacaggtatc gggagtggat ctggggctgc ctaactgggg aaagtatgtg 1380

ctgcttagcg caggaacgct aatcgcgctc gttttgatca ttttcctaat gacctgttgc 1440

cgcaaggttg accggccgga aagtacgcaa cgctctctgc gtggtacagg ccgtaacgtc 1500

tcggttacgt cacaatcggg caaattcatc aacagctggg agtcgtacaa atccggtggc 1560

gaaacaggcc tt 1572

<210> 42

<211> 717

<212> DNA

<213> Victoria jellyfish (Aequorea victoria)

<400> 42

atgagcaagg gcgaggagct gttcaccggg gtggtgccca tcctggtcga gctggacggc 60

gacgtaaacg gccacaagtt cagcgtgcgt ggcgagggcg agggcgatgc caccaacggc 120

aagctgaccc tgaagttcat ctgcaccacc ggcaagctgc ccgtgccctg gcccaccctc 180

gtgaccaccc tgacctacgg cgtgcagtgc ttcagccgct accccgacca catgaagcgt 240

cacgacttct tcaagtccgc catgcccgaa ggctacgtcc aggagcgcac catctcgttc 300

aaggacgacg gcacatacaa gacccgcgcc gaggtgaagt tcgagggcga caccctggtg 360

aaccgcatcg agctgaaggg catcgacttc aaggaggacg gcaacatcct ggggcacaag 420

ctggagtaca actttaacag ccacaacgtc tatatcacag ccgacaagca gaagaacggc 480

atcaaggcaa acttcaagat ccgccacaac gttgaggacg gcagcgtgca gctcgccgac 540

cactaccagc agaacacccc catcggcgac ggccccgtgc tgctgcccga caaccactac 600

ctgagcaccc agtccgttct gagcaaagac cccaacgaga agcgcgatca catggtcctg 660

ctggagttcg tgaccgccgc cgggatcact cacggcatgg acgagctgta caagtaa 717

<210> 43

<211> 12

<212> PRT

<213> Artificial sequence (Artificial sequence)

<220>

<223> Co1 peptide (Co1 peptide)

<400> 43

Ser Phe His Gln Leu Pro Ala Arg Ser Pro Leu Pro

1 5 10

<210> 44

<211> 2781

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 44

gaactgaggg gcgaacgcag tcctccttgc attgttaagc aaaaaatatt ctctacaagc 60

aatttgtgta caatctatat ggtacgctgc gtagagattt gcatgtcctg ctctacagtg 120

cgaagaagca ttctatttta cgcagcggga gtaaagcaaa atcgagttga acgactatct 180

gacgcctacg caaagcgatt ccgcgtaccg tgtatctgca aagtagcact tcttaatggt 240

agagccgcaa ttgagaacgt tcggagtgtt ttcctggttc tgctggatgg atttttggta 300

aactatattt aatcttctag tgggaggatg attgcgcggg caaaaccata ttttccagga 360

tgagtcgtac tgaaagaagc acagtattca ctatgccggg cgaggtaggt gctagtttgg 420

gtttgacctc acgcgatgcg cacaacggct gcggaatacg cacaactcga tactgatcgt 480

cgagtcggca agactgcaac atgctcatgg ctcaggtcat ccgaagagaa acctcaccat 540

aaataccgtg ttcatacttc acatcgtaca taagaaaaat tacttttcag gccctctaca 600

ataaataaaa ttggattcca cttatcgcgt ttattgagag atttaccgga cgcttccgtc 660

gcaaaaaaaa gaagagacat aagaaacttg acccactgtg tagaaggaga gcaaacaccg 720

caaagccctg cgaaaacctc atgcggtgat gtcgcgccct gctgcgatcg taacgaaggc 780

cattcgtcgc cacgcgcaac gcagggcaag agacggcttg cgtacggtgc agaacaccga 840

actggctgtg aagaggatct cttctttgtc agtactaaaa ggcgtcgctt tctgggaatg 900

gagcagcttg ccaaggatct ttttgagata ggtgcggtca agtttggcac gttcaaactc 960

aagtctggga tagcgtctcc attctatgtc gatctgcgcg ttgctgtctc gtatcctcga 1020

gttctgagaa gcattgcggc tctgtaccta gagtgcttgc aggacttttc tggcgctttc 1080

gacgttattt gcggggttcc atacactgcg ctcccttttg cgacagcaat ggctgtgcag 1140

ggagacttac ctatggttat gtgtcgaaag gaggtcaaag accatggaac acggcgagtt 1200

gttgaaggtg ctttcacaca aggctctcgt tgcttaataa tcgaggacgt tgttacaagt 1260

ggttcaagta ttctggaagt tgtgggcgca ttgcgcgcag agggtctcaa agtggatcac 1320

gcgatcgtct tgcttgatag ggagcaaggc gggtgtgagg cgcttctcgg ccagggaata 1380

gggctccgat ctgttttccg catcagcgac cttgtcggaa cactccgcca ctcaggtcgg 1440

ttatcagttg aacaagtaac cgagttgttt gattatttcc actcaaccaa agtatcatcg 1500

agtcctctga gtaactctat tcaaagtgga tataaagtgc atcctctttc ctttgaacaa 1560

cgtctttcat tgattcgaaa caaagttggc cgtcggctat tggaaattat gttaaagaag 1620

cagtcgaacc ttgcagtggc agcggatgta acaaccagtg aagaattatt atccattgcc 1680

aatgaagtgg gtccacaaat atgcatttta aagacacata tggatattat tcaagattgg 1740

acggaaagcg tatctgaaaa actcgttcat ttagctaagt tgcatcactt tttgatattt 1800

gaagacagaa agtttgcaga tattggcaat actgtggaat tacagttgac tggaggtata 1860

tttcatattg cacagtgggc agacatagtg aatgctcata tcattgctgg tcctggaact 1920

attcaagcgc taagtcgatc tgctgaacat tgcggtattt tgttattagc acaaatgagt 1980

agcaaaggga acttggcagt acaagaatat acgcaaaagg cattagaatt tgctcaacaa 2040

tatgaagatg ctgtttttgg ttttatatca ttgggttgta ttggagatcc aaactttctt 2100

tattttactc ctggagtgaa gttagaagga ggaggtgact ctttaggtca acaatatacg 2160

gatcctaaga cagttattgc tattcaagga agcgatgtag ctattgtagg gaggggaatt 2220

attcagtctt ctaatcgacg tgaagcagca tcaacttatc gaaaagcctg ttgggatgct 2280

tatttacaac gacttgaaga atacgttgaa taattcctaa tgggcagaag caagcgtcgg 2340

ctcccacaaa gttcagtaga acggcggggg tcagttgcca cgaacgcttt acgtgcggca 2400

tcaagatgat atttcattgc gaatggagtt cgtcgacgtc agttgcatcc gcagaatgct 2460

tctttgtttc ggatatgtcc gcggcctgtg cccgcaacga atgttcctag aagtacattt 2520

cacggcgcct tgaacatgta gttaagataa gcgctcgaaa tcgtccctga ttcggcagtg 2580

actgtcccta gagctggatc ctatggtttc ttgaccctgt ggctacaata tgcgctcgac 2640

accgtcgaag atcagggaat cgctctagtg tcggcctgat ttccaccaag tgcactctgc 2700

actttttgaa atttttggag gacccattaa aaaaataaat gcaaggattc ctaagaatag 2760

agatacagtc agctgctagg g 2781

<210> 45

<211> 390

<212> DNA

<213> Cyanidioschyzon merole (a red alga)

<400> 45

atgttccatg tgacgtaccc gttcacgcag agacaatgct ttctccgttc acgagaagcg 60

tgccttgcaa cgttgccagc tggtgctttt cgaaagcacc tgtggcgccc ttcgtgctgg 120

tcgttccgca cacgtcttcg taaagaggcg tcgctacgga aatccacagt tctcgctccg 180

cttactcgcc gtctgcagct gagtctcttc ggcctcccag agcggttcgt tcgcaagtcc 240

aagtcgccgg tctcggcaga gtccagtgtc gccactgagc tcacacgtga tcgggtcaaa 300

gatccgacgc tcgcgaagta ctgggataca cttctggaaa tcaatgcact ggaggcggaa 360

ctggaacaac tcaaaagcga tgaactcaga 390

<210> 46

<211> 90

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> HA tag

<400> 46

atgtacccat acgatgttcc tgactatgcg ggctatccct atgacgtccc ggactatgca 60

ggataccctt atgacgttcc agattacgct 90

<210> 47

<211> 30

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> d184(+25)R

<400> 47

cgtcaccctc gggacttgat gtttacgttc 30

<210> 48

<211> 43

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> bT3'(+1)F

<400> 48

taaactagct atttatctgg tacatatcat tcataagcac atg 43

<210> 49

<211> 39

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> HS(-200)Fd184

<400> 49

gtcccgaggg tgacgcttat agcttacgtg gcggattcg 39

<210> 50

<211> 22

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> HS(-1)R

<400> 50

gaatccctgg ttctctcaca gg 22

<210> 51

<211> 35

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> J042(1)Fhs

<400> 51

gagaaccagg gattcatggt gtttacgtgt gctgc 35

<210> 52

<211> 41

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> J042(1209)R-link3

<400> 52

ggcgcctgca ccggatccga aatgctgcgc tatgtagttg g 41

<210> 53

<211> 35

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> GP(1)F-linker3

<400> 53

tccggtgcag gcgccatggt tccacaagca ctgtt 35

<210> 54

<211> 32

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> GP(1572)R-linker2

<400> 54

tccaccgcct ccaccaaggc ctgtttcgcc ac 32

<210> 55

<211> 35

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> sfGFP(1)F-linker2

<400> 55

ggtggaggcg gtggaggcat gagcaagggc gagga 35

<210> 56

<211> 35

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> sfGFP(714)Rbt

<400> 56

taaatagcta gtttacttgt acagctcgtc catgc 35

<210> 57

<211> 28

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> D184(1200)F

<400> 57

cgccttctcc tggacgagta cgcattgg 28

<210> 58

<211> 21

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> D184(+1400)R

<400> 58

ccagagccct accggcacgc c 21

<210> 59

<211> 23

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> APCC(-1)R

<400> 59

ggtcaacgaa cgaagaaaca cag 23

<210> 60

<211> 43

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> bT3'(+1)

<400> 60

taaactagct atttatctgg tacatatcat tcataagcac atg 43

<210> 61

<211> 35

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> SecA(1)Fapcc

<400> 61

cttcgttcgt tgaccatgtt ccatgtgacg taccc 35

<210> 62

<211> 90

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> SecA(390)R-linker-ha

<400> 62

atcgtatggg tacatcccgg tgaacagctc ctcgcccttg ctcataccac cacctccgcc 60

acctctgagt tcatcgcttt tgagttgttc 90

<210> 63

<211> 32

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> HA(1)F

<400> 63

atgtacccat acgatgttcc tgactatgcg gg 32

<210> 64

<211> 32

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> HA(90)R

<400> 64

agcgtaatct ggaacgtcat aagggtatcc tg 32

<210> 65

<211> 37

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> GP(1)Fha

<400> 65

gttccagatt acgctatggt tccacaagca ctgttgc 37

<210> 66

<211> 69

<212> DNA

<213> Artificial sequence (Artificial sequence)

<220>

<223> Co1-GP(1680)Rbt

<400> 66

taaatagcta gtttatggga gcggcgagcg cgccggcagc tggtggaagc taaggcctgt 60

ttcgccacc 69

PCT/RO/134 Table

93页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:磷酸化Tau肽疫苗的安全施用方法

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!