Application of branched-chain alpha-ketoacid dehydrogenase complex in preparation of malonyl-CoA

文档序号:900168 发布日期:2021-02-26 浏览:24次 中文

阅读说明:本技术 支链α-酮酸脱氢酶复合体在制备丙二酸单酰辅酶A中的应用 (Application of branched-chain alpha-ketoacid dehydrogenase complex in preparation of malonyl-CoA ) 是由 刘伟丰 刘波 崔倩倩 陶勇 于 2019-08-23 设计创作,主要内容包括:本发明公开了支链α-酮酸脱氢酶复合体在制备丙二酸单酰辅酶A中的应用。本发明公开了利用支链α-酮酸脱氢酶复合体制备丙二酸单酰辅酶A的方法,该方法包括向生物细胞中导入支链α-酮酸脱氢酶复合体的编码基因,并使支链α-酮酸脱氢酶复合体的编码基因得到表达,得到重组细胞;该培养重组细胞,得到丙二酸单酰辅酶A;支链α-酮酸脱氢酶复合体为由M1)或M2)组成的成套蛋白质:M1)bkdF、bkdG、bkdH和lpdA1;M2)bkdA、bkdB、bkdC和lpdA1。实验证明,利用支链α-酮酸脱氢酶复合体不仅可以制备丙二酸单酰辅酶A,还可以制备以丙二酸单酰辅酶A为中间产物的目的产物。(The invention discloses an application of a branched-chain alpha-ketoacid dehydrogenase complex in preparation of malonyl-CoA. The present invention discloses a method for producing malonyl-CoA using a branched-chain alpha-ketoacid dehydrogenase complex, which comprises introducing a gene encoding a branched-chain alpha-ketoacid dehydrogenase complex into a biological cell, and expressing the gene encoding the branched-chain alpha-ketoacid dehydrogenase complex to obtain a recombinant cell; culturing the recombinant cell to obtain malonyl coenzyme A; the branched-chain alpha-ketoacid dehydrogenase complex is a protein set consisting of M1) or M2): m1) bkdF, bkdG, bkdH and lpdA 1; m2) bkdA, bkdB, bkdC and lpdA 1. Experiments prove that the branched-chain alpha-ketoacid dehydrogenase complex can be used for preparing the malonyl coenzyme A and preparing a target product taking the malonyl coenzyme A as an intermediate product.)

1. A method of making malonyl-coa comprising 11) and 12):

11) introducing a gene encoding a branched-chain α -keto acid dehydrogenase complex into a biological cell, and expressing the gene encoding the branched-chain α -keto acid dehydrogenase complex to obtain a recombinant cell, and designating the recombinant cell as recombinant cell a;

12) culturing the recombinant cell A to prepare malonyl coenzyme A.

2. The method of claim 1, wherein: the branched-chain alpha-ketoacid dehydrogenase complex is M1) or M2) as follows:

m1) a set of proteins consisting of a bkdF protein, a bkdG protein, a bkdH protein and an lpdA1 protein;

m2) a set of proteins consisting of bkdA protein, bkdB protein, bkdC protein and the lpdA1 protein;

the encoding gene of the branched-chain alpha-ketoacid dehydrogenase complex is L1) or L2):

l1) a set of genes consisting of the gene encoding the bkdF protein, the gene encoding the bkdG protein, the gene encoding the bkdH protein, and the gene encoding the lpdA1 protein;

l2) a set of genes consisting of the gene encoding the bkdA protein, the gene encoding the bkdB protein, the gene encoding the bkdC protein, and the gene encoding the lpdA1 protein.

3. The method of claim 2, wherein: the bkdF protein, the bkdG protein, the bkdH protein, the lpdA1 protein, the bkdA protein, the bkdB protein, and the bkdC protein, and genes encoding them, are derived from Streptomyces avermitilis (Streptomyces avermitilis).

4. A method according to claim 2 or 3, characterized in that: the bkdF protein is a protein of a1) or a2) below:

a1) protein shown as a sequence 10 in a sequence table;

a2) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues to the amino acid sequence of the sequence 10 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 10 and has the same function;

the bkdG protein is a protein of a3) or a4) below:

a3) protein shown as a sequence 11 in a sequence table;

a4) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues to the amino acid sequence of the sequence 11 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 11 and has the same function;

the bkdH protein is a protein of a5) or a6) below:

a5) protein shown as a sequence 12in a sequence table;

a6) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues to the amino acid sequence of the sequence 12in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 12 and has the same function;

the lpdA1 protein is a protein of a7) or a8) as follows:

a7) protein shown as a sequence 13 in a sequence table;

a8) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues of the amino acid sequence of the sequence 13 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 13 and has the same function;

the bkdA protein is a protein of a9) or a10) as follows:

a9) protein shown as a sequence 7 in a sequence table;

a10) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues to the amino acid sequence of the sequence 7 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 7 and has the same function;

the bkdB protein is a protein of a11) or a12) as follows:

a11) protein shown as a sequence 8 in a sequence table;

a12) the protein which has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 8 and has the same function is obtained by substituting and/or deleting and/or adding one or more amino acid residues of the amino acid sequence of the sequence 8 in the sequence table;

the bkdC protein is a protein of a13) or a14) as follows:

a13) protein shown as a sequence 9 in a sequence table;

a14) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 9 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 9, and has the same function.

5. The method according to any one of claims 2-4, wherein:

the coding gene of the bkdF protein is the following b1) or b 2):

b1) DNA molecules shown in 1 st to 1221 th sites of a sequence 2in a sequence table;

b2) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b1) and has the same function;

the coding gene of the bkdG protein is the following b3) or b 4):

b3) a DNA molecule shown as 1223-2200 site of the sequence 2in the sequence table;

b4) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b3) and has the same function;

the coding gene of the bkdH protein is the following b5) or b6) or b 7):

b5) DNA molecule shown in sequence 3 in the sequence table;

b6) a DNA molecule shown in the 2220-3608 site of the sequence 2in the sequence table;

b7) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b5) or b6) and has the same function;

the coding gene of the lpdA1 protein is the following b8) or b9) or b 10):

b8) DNA molecule shown in sequence 5 in the sequence table;

b9) DNA molecule shown in sequence 4 in the sequence table;

b10) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b8) or b9) and has the same function;

the coding gene of the bkdA protein is the following b11) or b 12):

b11) a DNA molecule shown in 1 st-1146 th site of a sequence 1 in a sequence table;

b12) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b11) and has the same function;

the coding gene of the bkdB protein is the following b13) or b 14):

b13) a DNA molecule shown as 1220-2224 bit of the sequence 1 in the sequence table;

b14) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b13) and has the same function;

the coding gene of the bkdC protein is the following b15) or b 16):

b15) a DNA molecule shown in position 2224-3591 of a sequence 1 in a sequence table;

b16) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b15) and has the same function.

6. The method according to any one of claims 1-5, wherein: the biological cell contains a branched-chain alpha-keto acid synthesis pathway, and step 11) further comprises inhibiting the synthesis of a branched-chain alpha-keto acid in the biological cell.

7. The method of claim 6, wherein: the inhibition of branched-chain alpha-keto acid synthesis is achieved by knocking out at least one gene in the branched-chain alpha-keto acid synthesis pathway in the biological cell, or by reducing the content or activity of a protein encoded by at least one gene in the branched-chain alpha-keto acid synthesis pathway.

8. The method according to claim 6 or 7, characterized in that: the inhibition of branched-chain alpha-keto acid synthesis is achieved by knocking out ilvA gene or/and ilvE gene in the biological cell, or by reducing the content or activity of a protein encoded by the ilvA gene or/and the ilvE gene in the biological cell.

9. The method of claim 8, wherein: the ilvA gene encodes the following proteins a15) or a 16):

a15) protein shown as a sequence 15 in a sequence table;

a16) the protein which has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 15 and has the same function is obtained by substituting and/or deleting and/or adding one or more amino acid residues on the amino acid sequence of the sequence 15 in the sequence table;

the ilvE gene encodes the following proteins a17) or a 18):

a17) protein shown as a sequence 17 in a sequence table;

a18) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues of the amino acid sequence of the sequence 17 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 17 and has the same function;

further, in the present invention,

the ilvA gene is b17) or b18) below:

b17) DNA molecule shown in sequence 14 in the sequence table;

b18) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b17) and has the same function;

the ilvE gene is b19) or b20) described below:

b19) DNA molecule shown in sequence 16 in the sequence table;

b20) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b19) and has the same function.

10. The method according to any one of claims 1-9, wherein: step 11) further comprises introducing a gene encoding the ppc protein into the biological cell and allowing the gene to be expressed, or increasing the content of the ppc protein or enhancing the activity of the ppc protein in the biological cell;

further, the ppc protein and the gene encoding the same are derived from Corynebacterium glutamicum;

further, the ppc protein is a19) or a20) below:

a19) protein shown as a sequence 19 in a sequence table;

a20) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues to the amino acid sequence of the sequence 19 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 19 and has the same function;

the coding gene of the ppc protein is the following b21) or b 22):

b21) DNA molecule shown in sequence 18 in the sequence table;

b22) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b21) and has the same function.

11. The method according to any one of claims 1-10, wherein: the biological cell can express an outer membrane protease VII, and the step 11) further comprises knocking out a gene encoding the outer membrane protease VII in the biological cell, or reducing the content or activity of the outer membrane protease VII in the biological cell;

further, the outer membrane protease VII is an ompT protein;

still further, the ompT protein is a21) or a22) below:

a21) protein shown as a sequence 28 in a sequence table;

a22) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues to the amino acid sequence of the sequence 28 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 28 and has the same function;

the encoding gene of the ompT protein is the following b23) or b 24):

b23) DNA molecule shown in sequence 27 in the sequence table;

b24) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b23) and has the same function.

12. The method according to any one of claims 1-11, wherein: the biological cell comprises an oxaloacetate synthesis pathway capable of synthesizing oxaloacetate;

further, the biological cell is a microbial cell, an animal cell or a plant cell;

still further, the microbial cell is N1) or N2) or N3):

n1) bacteria or fungi;

n2) E.coli;

n3) escherichia coli BW 25113.

13. A method of making malonyl-coa, comprising: using oxaloacetate as a substrate, and carrying out catalytic reaction by using the branched-chain alpha-ketoacid dehydrogenase complex as defined in any one of claims 1 to 5 to obtain malonyl-CoA.

14. The method of claim 13, wherein: the catalytic reaction is carried out in an F buffer solution; the F buffer solution consists of a solvent and a solute, wherein the solvent is 50mM Tris-HCl buffer solution (pH 7.0), and the solute and the concentration of the solute in the F buffer solution are respectively 0.1mM coenzyme A, 0.2mM dithiothreitol, 0.2mM triphenyl phosphate and 1mM MgSO (MgSO)4And 2mM NAD+

And/or the catalytic reaction is carried out at 30-37 ℃,

further, the catalytic reaction is carried out at 30 ℃.

15. A method for producing a desired product with malonyl-coa as an intermediate, comprising: culturing the recombinant cell A according to any one of claims 1 to 12 to produce the desired product.

16. The method of claim 15, wherein: the target product is 3-hydroxypropionic acid, and the method comprises the following steps: introducing a coding gene of mcr protein into the recombinant cell A and expressing the coding gene, or increasing the content of the mcr protein in the recombinant cell A or enhancing the activity of the mcr protein to obtain a recombinant cell, and marking the recombinant cell as recombinant cell-mcr; culturing the recombinant cell-mcr to prepare the target product;

further, the mcr protein and the coding gene thereof are derived from phomophilus thermophilus (Chloroflexus aurantiacaus);

further, the mcr protein consists of an mcr N-terminal domain and an mcr C-terminal domain, the mcr N-terminal domain being either a23) or a24) below:

a23) protein shown as a sequence 22 in a sequence table;

a24) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues of the amino acid sequence of the sequence 22 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 22 and has the same function;

the mcr C-terminal domain is a25) or a26) below:

a25) protein shown as a sequence 23 in a sequence table;

a26) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues to the amino acid sequence of the sequence 23 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 23 and has the same function;

the encoding gene of the mcr protein consists of the encoding gene of the mcr N-terminal structural domain and the encoding gene of the mcr C-terminal structural domain, and the encoding gene of the mcr N-terminal structural domain is the following b25) or b 26):

b25) DNA molecule shown in 1-1689 bit of sequence 21 in the sequence table;

b26) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b25) and has the same function;

the encoding gene of the mcr C-terminal domain is b27) or b28) as follows:

b27) a DNA molecule shown in the 1704-3749 site of the sequence 21 in the sequence table;

b28) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b27) and has the same function;

still further, the encoding gene of the mcr protein is b29) or b30) as follows:

b29) DNA molecule shown in sequence 21 in the sequence table;

b30) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b29) and has the same function.

17. The method of claim 15, wherein: the target product is picric acid or an intermediate product between malonyl-coa to picric acid in the picric acid synthesis pathway, the method comprising: introducing a coding gene of a vps protein into the recombinant cell A and expressing the coding gene, or increasing the content of the vps protein in the recombinant cell A or enhancing the activity of the vps protein to obtain a recombinant cell, and marking the recombinant cell as recombinant cell-vps; culturing the recombinant cell-vps to prepare the target product;

further, the vps protein and the gene encoding the same are derived from hop (Humulus lupulus);

still further, the vps protein is a27) or a28) below:

a27) protein shown as a sequence 26 in a sequence table;

a28) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues to the amino acid sequence of the sequence 26 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 26 and has the same function;

the coding gene of the vps protein is b31) or b 32):

b31) DNA molecule shown in sequence 25 in the sequence table;

b32) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b31) and has the same function.

18. The kit is a kit A or a kit B or a kit C;

the kit A comprises a gene encoding the branched-chain alpha-ketoacid dehydrogenase complex or the branched-chain alpha-ketoacid dehydrogenase complex according to any one of claims 1 to 5;

the kit B consists of the kit A and the mcr protein or the gene encoding the mcr protein in claim 16;

the kit comprising the kit A and the vps protein of claim 17 or a gene encoding the vps protein.

19. The kit of claim 18, wherein: the kit A further comprising the ppc protein or a gene encoding the ppc protein of claim 10.

20. The kit of claim 18 or 19, wherein: the kit A also comprises substances for inhibiting the synthesis of the branched-chain alpha-keto acid.

21. A recombinant cell which is the recombinant cell a according to any one of claims 1 to 12, the recombinant cell-mcr according to claim 16 or the recombinant cell-vps according to claim 17.

22. The use of I, II or III:

I. the branched-chain alpha-ketoacid dehydrogenase complex according to any one of claims 1 to 5 or a gene encoding the branched-chain alpha-ketoacid dehydrogenase complex, the recombinant cell A according to any one of claims 1 to 12, or any one of the following uses of the kit A according to any one of claims 18 to 20:

x1) synthesizing malonyl-coenzyme A;

x2) preparing a synthetic malonyl-coenzyme A product;

x3) producing a target product with malonyl-coenzyme A as an intermediate product;

x4) preparing a product for producing a target product with malonyl-coenzyme A as an intermediate product;

x5) to synthesize 3-hydroxypropionic acid;

x6) preparing a synthetic 3-hydroxypropionic acid product;

x7) synthesis of picric acid or intermediates between malonyl-coenzyme A and picric acid in the synthesis pathway of picric acid;

x8) preparing synthetic picric acid or intermediate product between malonyl coenzyme A and picric acid in the synthetic route of picric acid;

x9) synthetic fatty acids;

x10) preparing a synthetic fatty acid product;

x11) synthesis of polyketides;

x12) preparing synthetic polyketide products;

x13) synthesis of flavone compounds;

x14) preparing a synthetic flavone compound product;

II. Use of a recombinant cell-mcr as claimed in claim 16 or a kit of parts as claimed in any one of claims 18 to 20 for any one of the following applications:

y1) to synthesize 3-hydroxypropionic acid;

y2) preparing and synthesizing a 3-hydroxypropionic acid product;

III, the recombinant cell-vps as claimed in claim 17 or any of the following uses of the kit of parts as claimed in any of claims 18 to 20:

z1) synthesis of picric acid or intermediates between malonyl-CoA to picric acid in the synthesis pathway of picric acid;

z2) to prepare the synthetic picric acid or an intermediate product between malonyl-CoA and picric acid in the synthetic route of picric acid.

Technical Field

The invention relates to an application of a branched-chain alpha-ketoacid dehydrogenase complex in preparation of malonyl coenzyme A in the field of biotechnology.

Background

The flavone compounds are ubiquitous in natural plants, are compounds containing 2-phenylchromone structures, and belong to plant secondary metabolites. The flavone compounds have effects of scavenging free radicals and resisting oxidation, and also have antibacterial, antitumor and immunity enhancing effects. Polyketides are also important secondary metabolites formed by bacteria, fungi, actinomycetes or plants through continuous decarboxylation condensation of lower carboxylic acids such as acetic acid, malonic acid, butyric acid and the like, the synthetic route of the polyketides is similar to that of long-chain fatty acids, and the polyketides are widely applied as antibiotics clinically due to important biological activities of the polyketides, such as erythromycin, an anticancer drug doxorubicin, an antifungal agent amphotericin, an antiparasitic agent abamectin, an insecticide spinosad, an immunosuppressant rapamycin and the like.

Polyketides are mainly produced by streptomyces which is a natural producer in industry, but streptomyces production is adopted to solve the problems that the regulation of production strains is complex, the yield is not easy to improve and the like. The method tries to synthesize the polyketide by taking escherichia coli with clear genetic background as the underpan cells, which is not only beneficial to realizing the high-level synthesis of the target compound, but also beneficial to explaining the synthesis regulation mechanism of the polyketide. At present, polyketides such as erythromycin and the like are successfully synthesized in escherichia coli, but the yield is still low mainly because the synthesis of the polyketides is limited by the content of intracellular malonyl coenzyme A, the malonyl coenzyme A is also an important precursor for the synthesis of flavonoids, and the biosynthesis of the flavonoids is limited by the content of the malonyl coenzyme A.

Malonyl-coenzyme A is used as an important precursor substance for synthesis of polyketide, flavonoid compound and fatty acid, and the improvement of the intracellular content of malonyl-coenzyme A is the key of high-level synthesis of the compound. In the central metabolic pathway, glucose is used as a carbon source, pyruvate is obtained through a series of enzyme reactions, and the pyruvate generates CO under the catalysis of pyruvate dehydrogenase2And acetyl-CoA. Most of acetyl-CoA enters into tricarboxylic acid cycle, a small amount of acetyl-CoA participates in fatty acid synthesis, malonyl-CoA is a direct precursor of fatty acid synthesis and is obtained by catalyzing acetyl-CoA by acetyl-CoA carboxylase, the reaction is energy-consuming and also involves CO2And (4) fixing. Meanwhile, in Escherichia coli, the intracellular concentration of malonyl-CoA is generally controlled to be low in order to coordinate the relationship between synthesis of phospholipids and macromolecular substances and growth of cells, and to control the fatty acid synthesis ratio. In order to improve the synthesis level of precursor substances, many studies are currently focused on the field of metabolic flux regulation, and metabolic engineering techniques of multi-target genetic manipulation are used to improve the production of malonyl-coa in cells, such as strategies of improving the expression level of key enzymes, namely acetyl-coa carboxylase, knocking out competitive branches of acetyl-coa and malonyl-coa.

Disclosure of Invention

The present invention has an object to provide a novel function of a branched-chain alpha-ketoacid dehydrogenase complex which can catalyze a reaction using oxaloacetate as a substrate to synthesize malonyl-CoA.

The invention firstly provides a method for preparing malonyl-CoA, which comprises 11) and 12):

11) introducing a gene encoding a branched-chain α -keto acid dehydrogenase complex into a biological cell, and expressing the gene encoding the branched-chain α -keto acid dehydrogenase complex to obtain a recombinant cell, and designating the recombinant cell as recombinant cell a;

12) culturing the recombinant cell A to prepare malonyl coenzyme A.

In the above method, the branched-chain α -keto acid dehydrogenase complex may be M1) or M2) described below:

m1) a set of proteins consisting of a bkdF protein (branched-chain α -ketoacid dehydrogenase E1 α subunit), a bkdG protein (branched-chain β -ketoacid dehydrogenase E1 β subunit), a bkdH protein (branched-chain α -ketoacid dehydrogenase E2 subunit) and an lpdA1 protein (branched-chain α -ketoacid dehydrogenase E3 subunit);

m2) a set of proteins consisting of a bkdA protein (branched-chain. alpha. -ketoacid dehydrogenase E1. alpha. -subunit), a bkdB protein (branched-chain. beta. -ketoacid dehydrogenase E1. beta. -subunit), a bkdC protein (branched-chain. alpha. -ketoacid dehydrogenase E2 subunit) and the lpdA1 protein;

the gene encoding the branched-chain alpha-ketoacid dehydrogenase complex may be L1) or L2) described below:

l1) a set of genes consisting of the gene encoding the bkdF protein, the gene encoding the bkdG protein, the gene encoding the bkdH protein, and the gene encoding the lpdA1 protein;

l2) a set of genes consisting of the gene encoding the bkdA protein, the gene encoding the bkdB protein, the gene encoding the bkdC protein, and the gene encoding the lpdA1 protein.

In the above method, the bkdF protein, the bkdG protein, the bkdH protein, the lpdA1 protein, the bkdA protein, the bkdB protein, the bkdC protein, and a gene encoding the bkdC protein may be derived from Streptomyces avermitilis (Streptomyces avermitilis).

In the above method, the bkdF protein may be a protein of a1) or a2) below:

a1) protein shown as a sequence 10 in a sequence table;

a2) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 10 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 10 and has the same function.

The bkdG protein may be a protein of a3) or a4) as follows:

a3) protein shown as a sequence 11 in a sequence table;

a4) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 11 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 11 and has the same function.

The bkdH protein may be a protein of a5) or a6) as follows:

a5) protein shown as a sequence 12in a sequence table;

a6) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 12in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 12 and has the same function.

The lpdA1 protein can be a7) or a8) protein as follows:

a7) protein shown as a sequence 13 in a sequence table;

a8) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 13 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 13 and has the same function.

The bkdA protein may be a protein of a9) or a10) as follows:

a9) protein shown as a sequence 7 in a sequence table;

a10) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 7 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 7 and has the same function.

The bkdB protein can be the protein of the following a11) or a 12):

a11) protein shown as a sequence 8 in a sequence table;

a12) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 8 in the sequence table to obtain the protein which has 75% or more than 75% of identity with the amino acid sequence of the sequence 8 and has the same function.

The bkdC protein can be the protein of a13) or a14) as follows:

a13) protein shown as a sequence 9 in a sequence table;

a14) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 9 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 9, and has the same function.

In the above method, the bkdF protein-encoding gene may be b1) or b 2):

b1) DNA molecules shown in 1 st to 1221 th sites of a sequence 2in a sequence table;

b2) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b1) and has the same function.

The gene encoding the bkdG protein may be b3) or b4) below:

b3) a DNA molecule shown as 1223-2200 site of the sequence 2in the sequence table;

b4) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b3) and has the same function.

The gene encoding the bkdH protein may be b5) or b6) or b7) below:

b5) DNA molecule shown in sequence 3 in the sequence table;

b6) a DNA molecule shown in the 2220-3608 site of the sequence 2in the sequence table;

b7) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b5) or b6) and has the same function.

The gene encoding lpdA1 protein can be b8) or b9) or b10) as follows:

b8) DNA molecule shown in sequence 5 in the sequence table;

b9) DNA molecule shown in sequence 4 in the sequence table;

b10) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b8) or b9) and has the same function.

The gene encoding the bkdA protein may be b11) or b12) below:

b11) a DNA molecule shown in 1 st-1146 th site of a sequence 1 in a sequence table;

b12) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b11) and has the same function.

The gene encoding the bkdB protein can be the following b13) or b 14):

b13) a DNA molecule shown as 1220-2224 bit of the sequence 1 in the sequence table;

b14) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b13) and has the same function.

The gene encoding the bkdC protein may be the following b15) or b 16):

b15) a DNA molecule shown in position 2224-3591 of a sequence 1 in a sequence table;

b16) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b15) and has the same function.

In the above method, the gene encoding the branched-chain α -keto acid dehydrogenase complex introduced into the biological cell may be specifically an expression vector comprising the gene encoding the branched-chain α -keto acid dehydrogenase complex introduced into the biological cell.

The expression vectors can be plasmid, cosmid, phage, or viral vectors. The plasmid can be pYB1k or pLB1a, the sequence of pYB1k is a sequence 6 in a sequence table, and the sequence of pLB1a is a sequence 24 in the sequence table.

Four independent genes (the above-mentioned L1 or L2)) of the gene encoding the branched-chain alpha-ketoacid dehydrogenase complex can be introduced into the biological cell by a co-expression vector containing each gene. The co-expression vector can be pYB1k-bkdABC-lpdA1, pYB1k-bkdFGH-lpdA1 or pYB1k-bkdFG-opbkdH-oplpdA 1; pYB 1-1 k-bkdABC-lpdA1 is a recombinant vector obtained by inserting the gene encoding the bkdA protein, the gene encoding the bkdB protein, the gene encoding the bkdC protein and the gene encoding the lpdA1 protein into pYB 1-1 k, and is capable of expressing the bkdA protein, the bkdB protein, the bkdC protein and the lpdA1 protein; pYB 1-1 k-bkdFGH-lpdA1 and pYB 1-1 k-bkdFG-opbkdH-oplpdA1 are recombinant vectors in which the gene encoding the bkdF protein, the gene encoding the bkdG protein, the gene encoding the bkdH protein, and the gene encoding the lpdA1 protein are inserted into pYB1k, and the bkdF protein, the bkdG protein, the bkdH protein, and the lpdA1 protein are expressed.

In the above method, the biological cell contains a branched-chain α -keto acid synthesis pathway, and step 11) may further include inhibiting the synthesis of a branched-chain α -keto acid in the biological cell.

The recombinant cell A thus obtained contains a gene encoding the branched-chain alpha-keto acid dehydrogenase complex, and the synthesis of the branched-chain alpha-keto acid is inhibited.

In the above method, the inhibition of branched-chain α -keto acid synthesis may be achieved by knocking out at least one gene in the branched-chain α -keto acid synthesis pathway in the biological cell, or by reducing the content or activity of a protein encoded by at least one gene in the branched-chain α -keto acid synthesis pathway.

In the above method, the inhibition of branched-chain α -keto acid synthesis may be achieved by knocking out ilvA gene (threonine deaminase gene) or/and ilvE gene (branched-chain amino acid transaminase gene) in the biological cell, or by reducing the content or activity of a protein encoded by the ilvA gene or/and the ilvE gene in the biological cell.

The biological cell contains the ilvA gene or/and the ilvE gene.

In the above method, the ilvA gene may encode the following proteins a15) or a 16):

a15) protein shown as a sequence 15 in a sequence table;

a16) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 15 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 15 and has the same function.

The ilvE gene may encode the following proteins a17) or a 18):

a17) protein shown as a sequence 17 in a sequence table;

a18) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 17 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 17, and has the same function.

Further, in the present invention,

the ilvA gene may be b17) or b18) as follows:

b17) DNA molecule shown in sequence 14 in the sequence table;

b18) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b17) and has the same function.

The ilvE gene may be b19) or b20) below:

b19) DNA molecule shown in sequence 16 in the sequence table;

b20) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b19) and has the same function.

In the above method, the knockout of the ilvA gene in the biological cell can be performed by homologous recombination, specifically, by using Escherichia coli strain JW3745 having the ilvA gene knockout trait.

In the above method, the knockout of the ilvE gene in the biological cell can be carried out by homologous recombination, specifically, by using Escherichia coli strain JW5606 having the ilvE gene knockout trait.

In the above method, the step 11) may further comprise introducing a gene encoding the ppc protein (phosphoenolpyruvate carboxylase) into the biological cell and allowing the encoded gene to be expressed, or increasing the content of the ppc protein or enhancing the activity of the ppc protein in the biological cell. The recombinant cell A thus obtained contains a gene encoding the branched-chain alpha-keto acid dehydrogenase complex and a gene encoding the ppc protein, and the synthesis of branched-chain alpha-keto acids is inhibited.

Further, the ppc protein and the gene encoding the same may be derived from Corynebacterium glutamicum.

Still further, the ppc protein may be a19) or a20) below:

a19) protein shown as a sequence 19 in a sequence table;

a20) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 19 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 19 and has the same function.

The coding gene of the ppc protein can be the following b21) or b 22):

b21) DNA molecule shown in sequence 18 in the sequence table;

b22) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b21) and has the same function.

In the above method, the biological cell can express outer membrane protease VII, and step 11) may further comprise knocking out a gene encoding the outer membrane protease VII in the biological cell, or reducing the content or activity of the outer membrane protease VII in the biological cell.

Further, the outer membrane protease VII may be an ompT protein.

Still further, the ompT protein is a21) or a22) below:

a21) protein shown as a sequence 28 in a sequence table;

a22) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 28 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 28 and has the same function.

The encoding gene of the ompT protein is the following b23) or b 24):

b23) DNA molecule shown in sequence 27 in the sequence table;

b24) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b23) and has the same function.

In the above method, the gene encoding the ppc protein may be introduced into the biological cell by introducing an expression vector containing the gene encoding the ppc protein into the biological cell, or by recombining the gene encoding the ppc protein into the genome of the biological cell and expressing the gene encoding the ppc protein.

In the above method, the biological cell may comprise an oxaloacetate synthesis pathway capable of synthesizing oxaloacetate.

Further, the biological cell may be a microbial cell, an animal cell, or a plant cell.

Still further, the microbial cell may be N1) or N2) or N3): n1) bacteria or fungi; n2) E.coli; n3) escherichia coli BW 25113.

The present invention also provides a method for preparing malonyl-coa, the method comprising: and (3) carrying out catalytic reaction by using oxaloacetate as a substrate and adopting the branched-chain alpha-ketoacid dehydrogenase complex to obtain malonyl coenzyme A.

In the above method, the catalytic reaction may be carried out in an F buffer; the F buffer solution consists of a solvent and a solute, wherein the solvent is 50mM Tris-HCl buffer solution (pH 7.0), and the solute and the concentration of the solute in the F buffer solution are respectively 0.1mM coenzyme A, 0.2mM dithiothreitol, 0.2mM triphenyl phosphate and 1mM MgSO (MgSO)4And 2mM NAD+(oxidized form of nicotinamide adenine dinucleotide).

The catalytic reaction may be carried out at 30-37 ℃. Further, the catalytic reaction may be performed at 30 ℃.

The present invention also provides a method for producing a desired product with malonyl-coa as an intermediate, the method comprising: and culturing the recombinant cell A to prepare a target product.

In the above method, the target product may be 3-hydroxypropionic acid, and the method comprises: introducing a coding gene of mcr protein (malonyl-CoA reductase) into the recombinant cell A and expressing the coding gene, or increasing the content of the mcr protein in the recombinant cell A or enhancing the activity of the mcr protein to obtain a recombinant cell, and marking the recombinant cell as recombinant cell-mcr; and culturing the recombinant cell-mcr to prepare the target product.

Further, the mcr protein and the coding gene thereof can be derived from phomophilus thermophilus (Chloroflexus aurantiacus).

Still further, the mcr protein may consist of an mcr N-terminal domain and an mcr C-terminal domain, and the mcr N-terminal domain may be a23) or a24) below:

a23) protein shown as a sequence 22 in a sequence table;

a24) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues of the amino acid sequence of the sequence 22 in the sequence table, has 75 percent or more than 75 percent of identity with the amino acid sequence of the sequence 22 and has the same function;

the mcr C-terminal domain may be a25) or a26) below:

a25) protein shown as a sequence 23 in a sequence table;

a26) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 23 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 23, and has the same function.

The encoding gene of the mcr protein may consist of the encoding gene of the mcr N-terminal domain and the encoding gene of the mcr C-terminal domain, and the encoding gene of the mcr N-terminal domain may be b25) or b26) as follows:

b25) DNA molecule shown in 1-1689 bit of sequence 21 in the sequence table;

b26) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b25) and has the same function;

the encoding gene of the mcr C-terminal domain may be b27) or b28) below:

b27) a DNA molecule shown in the 1704-3749 site of the sequence 21 in the sequence table;

b28) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b27) and has the same function.

Still further, the encoding gene of the mcr protein may be the following b29) or b 30):

b29) DNA molecule shown in sequence 21 in the sequence table;

b30) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b29) and has the same function.

The 1 st-1689 th site of the sequence 21 is the mcr N-terminal domain nucleotide sequence, the 1704 nd-3749 th site is the mcr C-terminal domain nucleotide sequence, and the 1691 nd-1696 th site is the RBS site sequence.

In the above method, the introducing of the gene encoding mcr protein into the biological cell may be specifically introducing an expression vector containing the gene encoding mcr protein into the biological cell.

The expression vectors can be plasmid, cosmid, phage, or viral vectors. The plasmid can be pYB1k or pLB1a, the sequence of pYB1k is a sequence 6 in a sequence table, and the sequence of pLB1a is a sequence 24 in the sequence table.

The expression vector comprising the gene encoding the mcr protein may be pLB1 a-mcr; the pLB1a-mcr is a recombinant vector obtained by inserting a coding gene of the mcr protein into the pLB1a, and can express the mcr protein.

In practical use, it is possible to further determine whether or not the branched α -keto acid needs to be inhibited depending on whether or not the branched α -keto acid needs to participate in the production process of the target product, and when the branched α -keto acid needs to participate in the production process of the target product, the synthesis of the branched α -keto acid may not be inhibited, and when the branched α -keto acid does not need to participate in the production process of the target product, the production of the target product may be further improved by inhibiting the synthesis of the branched α -keto acid.

In the above method, the target product may be picric acid or an intermediate between malonyl-coa to picric acid in the synthetic pathway of picric acid, the method comprising: introducing a coding gene of a vps protein (cyclopentanone synthetase) into the recombinant cell A and expressing the coding gene, or increasing the content of the vps protein in the recombinant cell A or enhancing the activity of the vps protein to obtain a recombinant cell, and marking the recombinant cell as recombinant cell-vps; and culturing the recombinant cell-vps to prepare the target product.

Further, the vps protein and the gene encoding the same may be derived from hop (Humulus lupulus);

still further, the vps protein may be a27) or a28) below:

a27) protein shown as a sequence 26 in a sequence table;

a28) and (b) the protein which is obtained by substituting and/or deleting and/or adding one or more amino acid residues in the amino acid sequence of the sequence 26 in the sequence table, has 75% or more than 75% of identity with the amino acid sequence of the sequence 26, and has the same function.

The encoding gene of the vps protein may be the following b31) or b 32):

b31) DNA molecule shown in sequence 25 in the sequence table;

b32) a DNA molecule which has 75 percent or more than 75 percent of identity with the nucleotide sequence defined by b31) and has the same function.

In the above method, the introduction of the gene encoding the vps protein into the biological cell may specifically be the introduction of an expression vector comprising the gene encoding the vps protein into the biological cell.

The expression vectors can be plasmid, cosmid, phage, or viral vectors. The plasmid can be pYB1k or pLB1a, the sequence of pYB1k is a sequence 6 in a sequence table, and the sequence of pLB1a is a sequence 24 in the sequence table.

The expression vector comprising the gene encoding the vps protein may be pLB1 a-vps; the pLB1a-vps is a recombinant vector obtained by inserting a gene coding the vps protein into the pLB1a, and the vps protein can be expressed.

The intermediate product does not include malonyl-coa and picric acid. In one embodiment of the invention, the intermediate product is 3-methyl-isobutyryl phloroglucinol (PIVP).

The invention also provides a reagent set, wherein the reagent set is a reagent set A or a reagent set B or a reagent set C;

the kit A comprises the branched-chain alpha-ketoacid dehydrogenase complex or a gene encoding the branched-chain alpha-ketoacid dehydrogenase complex;

the kit B consists of the kit A and the mcr protein or the coding gene of the mcr protein;

the kit C consists of the kit A and the vps protein or the coding gene of the vps protein.

The kit A may further comprise the ppc protein or a gene encoding the ppc protein.

The kit a may further comprise a substance that inhibits the synthesis of branched-chain alpha-keto acids.

The substance that inhibits branched-chain α -keto acid synthesis may be a substance required to knock out at least one gene in the branched-chain α -keto acid synthesis pathway in a biological cell, or to reduce the content or activity of a protein encoded by at least one gene in the branched-chain α -keto acid synthesis pathway.

The substance inhibiting branched-chain α -keto acid synthesis may be a substance required for knocking out ilvA gene or/and ilvE gene in a biological cell.

The biological cell contains the ilvA gene or/and the ilvE gene.

The ilvA gene in the knocked-out biological cell can be specifically a gene fragment or a strain (such as Escherichia coli strain JW3745) containing the ilvA gene knock-out character.

The ilvE gene in the knocked-out biological cell can be specifically a gene fragment or a strain (such as Escherichia coli strain JW5606) containing the ilvE gene knock-out character.

The kit A may consist of only the branched alpha-ketoacid dehydrogenase complex or the gene encoding the branched alpha-ketoacid dehydrogenase complex, may consist of the branched alpha-ketoacid dehydrogenase complex or the gene encoding the branched alpha-ketoacid dehydrogenase complex, and the ppc protein or the gene encoding the ppc protein, may consist of the branched alpha-ketoacid dehydrogenase complex or the gene encoding the branched alpha-ketoacid dehydrogenase complex, the ppc protein or the gene encoding the ppc protein, and the substance inhibiting branched alpha-ketoacid synthesis.

The kit A has the following functions of D1) or D2):

D1) synthesizing malonyl coenzyme A;

D2) producing the target product with malonyl coenzyme A as an intermediate product.

The reagent kit B can be used for producing 3-hydroxypropionic acid.

The reagent set C can be used for preparing picric acid or an intermediate product between malonyl coenzyme A and picric acid in the synthesis path of the picric acid.

The invention also provides a recombinant cell, wherein the recombinant cell is the recombinant cell A, the recombinant cell-mcr or the recombinant cell-vps.

The invention also provides the use of I, II or III:

I. the branched-chain alpha-ketoacid dehydrogenase complex or a gene encoding the branched-chain alpha-ketoacid dehydrogenase complex, the recombinant cell A, or the kit A, is used in any one of the following applications:

x1) synthesizing malonyl-coenzyme A;

x2) preparing a synthetic malonyl-coenzyme A product;

x3) producing a target product with malonyl-coenzyme A as an intermediate product;

x4) preparing a product for producing a target product with malonyl-coenzyme A as an intermediate product;

x5) to synthesize 3-hydroxypropionic acid;

x6) preparing a synthetic 3-hydroxypropionic acid product;

x7) synthesis of picric acid or intermediates between malonyl-coenzyme A and picric acid in the synthesis pathway of picric acid;

x8) preparing synthetic picric acid or intermediate product between malonyl coenzyme A and picric acid in the synthetic route of picric acid;

x9) synthetic fatty acids;

x10) preparing a synthetic fatty acid product;

x11) synthesis of polyketides;

x12) preparing synthetic polyketide products;

x13) synthesis of flavone compounds;

x14) preparing a synthetic flavone compound product;

II. Any one of the following uses of said recombinant cell-mcr or said kit b:

y1) to synthesize 3-hydroxypropionic acid;

y2) preparing and synthesizing a 3-hydroxypropionic acid product;

III, said recombinant cell-vps or said kit of parts for any one of the following uses:

z1) synthesis of picric acid or intermediates between malonyl-CoA to picric acid in the synthesis pathway of picric acid;

z2) to prepare the synthetic picric acid or an intermediate product between malonyl-CoA and picric acid in the synthetic route of picric acid.

The synthesis of malonyl-CoA takes oxaloacetate as a substrate.

The synthesis pathway of the target product requires the participation of malonyl-CoA.

The target product can be 3-hydroxypropionic acid, picric acid or an intermediate product between malonyl-coenzyme A and picric acid in the synthetic pathway of picric acid.

The intermediate product does not include malonyl-coa and picric acid. In one embodiment of the invention, the intermediate product is 3-methyl-isobutyryl phloroglucinol (PIVP).

In the present invention, the identity of 75% or more is 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identity.

The invention discloses a novel malonyl-CoA source, namely malonyl-CoA can be obtained by catalyzing oxaloacetate by a branched-chain alpha-ketoacid dehydrogenase complex, and the branched-chain alpha-ketoacid dehydrogenase complex is proved to have the activity of the oxaloacetate dehydrogenase by tests such as biochemistry and heredity. In addition, the present inventors have found that the introduction/increase of phosphoenolpyruvate carboxylase can further increase the amount of malonyl CoA synthesized, and that the deletion of a gene in the branched-chain alpha-keto acid synthesis pathway can also increase the amount of malonyl CoA synthesized. The present invention further utilizes the branched-chain alpha-keto acid dehydrogenase complex to prepare a target product having malonyl-CoA as an intermediate, such as an intermediate between malonyl-CoA and picric acid in the synthetic pathway of 3-hydroxypropionic acid, picric acid or picric acid. The branched-chain alpha-ketoacid dehydrogenase complex can be used for preparing malonyl coenzyme A and target products taking the malonyl coenzyme A as intermediate products, such as 3-hydroxypropionic acid, picric acid, fatty acid, polyketide compounds, flavonoid compounds and the like, and has wide application prospects.

Drawings

FIG. 1 shows the results of measuring the relative content of malonyl-CoA in an engineered strain expressing the branched-chain alpha-keto acid dehydrogenase complex.

FIG. 2 shows the results of the measurement of the relative malonyl-CoA content after the ppc gene has been introduced into M-FGH.

FIG. 3 shows the yield of 3-hydroxypropionic acid after expression of the branched-chain alpha-keto acid dehydrogenase complex.

FIG. 4 is the hop alpha/beta acid metabolic pathway.

FIG. 5 shows the yield of the engineered strain PIVP after expression of the branched-chain alpha-keto acid dehydrogenase complex.

FIG. 6 shows the in vitro enzyme activity assay of branched-chain alpha-ketoacid dehydrogenase complex. Oxaloacetate is OAA group, oxaloacetate-EDTA is OAA-EDTA group, 3-methyl-2 oxobutanoate is KIV group, and 3-methyl-2 oxobutanoate-EDTA is KIV-EDTA group.

Detailed Description

The present invention is described in further detail below with reference to specific embodiments, which are given for the purpose of illustration only and are not intended to limit the scope of the invention. The experimental procedures in the following examples are conventional unless otherwise specified. Materials, reagents, instruments and the like used in the following examples are commercially available unless otherwise specified. The quantitative tests in the following examples, all set up three replicates and the results averaged. In the following examples, unless otherwise specified, the 1 st position of each nucleotide sequence in the sequence listing is the 5 'terminal nucleotide of the corresponding DNA/RNA, and the last position is the 3' terminal nucleotide of the corresponding DNA/RNA.

In the following examples, E.coli BW25113(Datsenko KA, Wanner BL. one-step inactivation of chromosomal genes in Escherichia coli K-12using PCR products. Proc. Natl. Acad. Sci. U.S.A. 2000; 97(12):6640 and 6645.) is a non-pathogenic bacterium, with clear genetic background, short generation time, easy culture and low cost of culture medium raw materials, which contains oxaloacetate synthesis pathway and can synthesize oxaloacetate. Coli BW25113 is publicly available from the institute of microbiology, academy of sciences, and this biomaterial is only used for repeating the relevant experiments of the present invention, and is not used for other purposes.

Wild-type P1 bacteriophage (Thomason LC, costatino N.2007.E. coli genome manipulation by. Current Protocols in Molecular Biology:1.17.1-8) in the examples described below are publicly available from the institute of microbiology, a national academy of sciences, and the biomaterial is used only for repeating the experiments related to the present invention and is not used for other purposes.

Example 1 catalysis of the Synthesis of malonyl-CoA by the branched-chain alpha-ketoacid dehydrogenase Complex

The present inventors have found that a branched-chain α -ketoacid dehydrogenase complex can catalyze the synthesis of malonyl-coa, and this example has produced a recombinant bacterium containing branched-chain α -ketoacid dehydrogenase complex-encoding genes (bkdA, bkdB, bkdC, lpdA1, bkdF, bkdG, and bkdH genes), and has examined the synthesis of malonyl-coa catalyzed by the α -ketoacid dehydrogenase complex by further deleting two genes (threonine deaminase ilvA and branched-chain amino acid transaminase ilvE genes) in the branched-chain α -ketoacid synthesis pathway, and the primers used are shown in table 1.

(1) Construction of plasmid expressing branched-chain alpha-ketoacid dehydrogenase complex of Streptomyces avermitilis

(1-a) PCR amplification of bkdA, bkdB, bkdC, lpdA1, bkdF, bkdG, bkdH genes

The streptomyces avermitilis genome DNA is extracted by adopting a bacterial genome extraction kit (Tiangen Biochemical technology Co., Ltd., product catalog DP 302). Taking the extracted genome DNA of the streptomyces avermitilis as a template, bkdA-NcoI and bkdC-rbs-R as primers, carrying out PCR amplification by using high-fidelity TransStart FastPfu DNA polymerase (Beijing all-open gold biotechnology Co., Ltd., product catalog is AP221), and marking the obtained gene fragment as ABC which contains the DNA fragment shown in sequence 1 in the sequence table; performing PCR amplification by using bkdF-NcoI and bkdH-rbs-R as primers, and marking the obtained gene fragment as FGH, wherein the FGH contains a DNA fragment shown in a sequence 2in a sequence table; performing PCR amplification by using bkdF-NcoI and bkdG-rbs-R as primers, and marking an obtained gene fragment as FG, wherein the FG contains 1 st to 2200 th sites of a sequence 2in a sequence table; the resulting gene fragment was designated lpd containing lpdA1 gene shown in sequence 4 of the sequence listing by PCR using rbs-lpdA1-F and lpdA1-XhoI as primers.

Wherein, the 1 st to 1146 th sites of the sequence 1 are the DNA sequence of bkdA gene, and encode bkdA protein shown in the sequence 7 in the sequence table; the 1220-2224 bit is bkdB DNA sequence which encodes the bkdB protein shown in the sequence 8 in the sequence table; 2224-3591 position bkdC DNA sequence, which encodes bkdC protein shown in sequence 9 in the sequence table;

the 1 st to 1221 th sites of the sequence 2 are DNA sequences of bkdF genes and encode bkdF proteins shown as a sequence 10 in the sequence table; the 1223-2200 site of the sequence 2 is a DNA sequence of the bkdG gene and encodes the bkdG protein shown as the sequence 11 in the sequence table; the 2220-3608 th site of the sequence 2 is a DNA sequence of the bkdH gene and encodes the bkdH protein shown as a sequence 12in the sequence table;

the lpdA1 gene shown in sequence 4 encodes lpdA1 protein shown in sequence 13 in the sequence table.

The sequences of the bkdH and lpdA1 genes are optimized according to codon preference in Escherichia coli, the optimized genes are respectively designated as opbkdH and oplpdA1 genes, the sequences of the opbkdH and oplpdA1 genes are respectively sequence 3 and sequence 5 in the sequence table, and the sequences 3 and 5 respectively encode bkdH protein and lpdA1 protein shown as sequences 12 and 13 in the sequence table. Artificially synthesizing opbkdH and oplpdA1 genes, taking the opbkdH gene as a template, performing PCR amplification by using rbs-opbkdH-F and rbs-opbkdH-R, and marking the obtained gene fragment as opH, wherein opH contains the opbkdH gene shown as a sequence 3; the gene fragment obtained by PCR amplification using the oplpdA1 gene as a template and rbs-oplpdA1-F and oplpdA1-XhoI was designated as oplpd, which contains the oplpdA1 gene shown in sequence 5 of the sequence listing.

(1-b) construction of recombinant expression vectors containing the bkdA, bkdB, bkdC, lpdA1, bkdF, bkdG and bkdH genes

Carrying out agarose gel electrophoresis on each PCR amplification fragment obtained in the step (1-a), and recovering a target fragment; meanwhile, the vector pYB1k (the nucleotide sequence of the vector pYB1k is shown as a sequence 6 in a sequence table) is digested by NcoI and XhoI, and a vector large fragment YB1k-NX fragment (namely a vector framework) is recovered. The recovered ABC and lpd fragments are connected with YB1k-NX fragments by a Gibson assembly method (Gibson DG, Young L, et al. enzymatic assembly of DNA molecules up to segmented cloned nucleic acids. Nat. methods. 2009; 6(5): 343-345); carrying out Gibson assembly and ligation reaction on the recovered FGH and lpd fragments and YB1k-NX fragments; and carrying out Gibson assembly and ligation reaction on the recovered FG, opH and oplpd and YB1k-NX fragments. The ligation products were ligated with CaCl2Escherichia coli DH 5. alpha. competent cells (Beijing Quanjin Biotechnology Co., Ltd., product catalog CD201) were transformed by the method, then spread on LB plates containing kanamycin uniformly, and cultured overnight at 37 ℃. Selecting clones, identifying clones capable of amplifying target fragments by using a primer F108/R124 and sequencing, selecting positive clones to extract plasmids, connecting ABC and lpd fragments with YB1k-NX fragments to obtain a recombinant plasmid with a correct sequence, and naming the recombinant plasmid as pYB1k-bkdABC-lpdA1, connecting FGH and lpd fragments with YB1k-NX fragments to obtain a recombinant plasmid with a correct sequence, and naming the recombinant plasmid as pYB1k-bkdFGH-lpdA1, and connecting FG, opH and oplpd with YB1k-NX fragments to obtain a recombinant plasmid with a correct sequence, and naming the recombinant plasmid as pYB1 k-bkdFG-opbkH-oplpdA 1.

pYB 1-1 k-bkdABC-lpdA1 contains DNA fragments shown in sequences 1 and 4 in the sequence table and can express four proteins shown in sequences 7, 8, 9 and 13, pYB 1-1 k-bkdFGH-lpdA1 contains DNA fragments shown in sequences 2 and 4 in the sequence table and can express four proteins shown in sequences 10, 11, 12 and 13, and pYB1 k-bkdFG-opbkH-oplpdA 1 contains DNA fragments shown in positions 1-2200 of sequence 2in the sequence table, sequence 3 and sequence 4 and can express four proteins shown in sequences 10, 11, 12 and 13.

(2) Engineered Strain threonine deaminase ilvA and branched-chain amino acid transaminase ilvE Gene knock-outs

And knocking out the ilvA gene of escherichia coli BW25113 serving as a starting bacterium, marking the obtained recombinant bacterium as M01A, knocking out the ilvE gene of the escherichia coli BW25113, and marking the obtained recombinant bacterium as M01E.

(2-a) preparation of P1 phage containing E.coli Gene fragment having ilvA and ilvE knockout Properties

The E.coli gene fragment containing the ilvA gene knockout trait and the E.coli gene fragment containing the ilvE gene knockout trait are derived from E.coli strains JW3745 and JW5606, respectively, and W3110 series strains containing the ilvA and ilvE knockout traits, respectively, both from the national institute of genetics (NIG, Japan), in which ilvA gene encoding threonine deaminase and ilvE gene encoding branched-chain amino acid transaminase are replaced with kanamycin-resistant gene (about 1300bp) having FRT sites at both ends to knock out the ilvA or ilvE genes (Baba T, Ara T, et al. The P1 phage was prepared as follows: the JW3745 or JW5606 strain was cultured overnight at 37 ℃ and then transferred to a medium containing 5mmol/L CaCl2And 0.1% glucose in LB medium, cultured at 37 ℃ for 1h, and then added with wild type P1 phage to continue culturing for 1-3 h. Adding a few drops of chloroform, culturing for a few minutes, centrifuging and taking the supernatant to obtain the bacteriophage P1vir ilvA containing the escherichia coli gene fragment with the ilvA knockout character and the bacteriophage P1vir ilvE containing the escherichia coli gene fragment with the ilvE knockout character.

(2-b) construction of E.coli strains M01A-Kan and M01E-Kan by P1 phage transduction

Escherichia coli BW25113 (recipient bacterium) cultured overnight was centrifuged at 10000g (1.5 mL) of the bacterial solution for 2 minutes, and then 0.75mL of a P1 salt solution (water as a solvent and 10mM CaCl as a solute) was added2And 5mM MgSO4) Resuspending BW25113 bacterial cells, mixing 100. mu.L of phage P1vir ilvA or P1vir ilvE with 100. mu.L of BW25113 cell suspension, incubating at 37 ℃ for 30min, adding 1mL of LB medium and 200. mu.L of 1mol/L sodium citrate, continuing culturing at 37 ℃ for 1h, collecting bacterial cells by centrifugation, resuspending with 100. mu.L of LB medium, and plating LB plate containing kanamycin (kanamycin concentration is 50. mu.g-ml), culturing at 37 ℃ overnight, selecting clones, carrying out PCR amplification identification by using ilvA-F/ilvA-R or ilvE-F/ilvE-R primers (the target band with 1700bp is amplified to be positive), selecting positive clones, naming the positive clone obtained by the bacteriophage P1vir ilvA as M01A-Kan, and naming the positive clone obtained by the bacteriophage P1vir ilvE as M01E-Kan.

(2-c) Elimination of resistance

The pCP20 plasmid (Clontech) was transformed into M01A-Kan and M01E-Kan, respectively, by calcium chloride transformation, and after overnight culture at 30 ℃ on LB plates containing ampicillin, clones were selected to obtain recombinant E.coli M01A-Kan/pCP20 and M01E-Kan/pCP20, respectively, containing plasmid pCP 20. Respectively culturing the two bacteria in an ampicillin-resistant LB culture medium at 30 ℃, coating the bacteria on a non-resistant LB plate, culturing overnight at 42 ℃, selecting clones, performing PCR amplification identification (the amplified 400bp target band is positive) by using ilvA-F/ilvA-R or ilvE-F/ilvE-R primers, selecting positive clones, and naming the positive clones obtained by M01A-Kan as M01A, wherein M01A is a strain for knocking out the ilvA gene of escherichia coli BW 25113; the positive clone obtained from M01E-Kan was named M01E, M01E was a strain in which the ilvE gene of E.coli BW25113 was knocked out.

In Escherichia coli BW25113, the coding sequence of the ilvA gene is shown as sequence 14 in the sequence table, the coding sequence of the ilvA protein is shown as sequence 15 in the sequence table, the coding sequence of the ilvE gene is shown as sequence 16 in the sequence table, and the coding sequence of the ilvE protein is shown as sequence 17 in the sequence table.

TABLE 1 list of primer sequences used

(3) Detection for improving synthesis amount of malonyl coenzyme A of engineering strain by exogenously expressing branched chain alpha-ketoacid dehydrogenase complex

(3-a) preparation of recombinant bacterium

Respectively introducing pYB 1-1 k-bkdABC-lpdA1, pYB 1-1 k-bkdFGH-lpdA1, pYB1 k-bkdFG-opbkH-oplpdA 1 and a vector pYB1k obtained in the step (1) into escherichia coli BW25113, and sequentially recording the obtained recombinant bacteria as M-ABC, M-FGH, M-opFGH and BW; pYB1k-bkdFGH-lpdA1 was introduced into M01A and M01E in step (1), and the recombinant bacteria obtained were designated as MA-FGH and ME-FGH, respectively.

(3-b) preparation of culture Medium

A culture medium: the culture medium A is a sterile culture medium consisting of solute and solvent, the solvent is water, and the solute and the concentration thereof in the culture medium are respectively as follows: NaHPO4 25mM,KH2PO4 25mM,NH4Cl 50mM。

B, culture medium: the culture medium B is obtained by adding Na into the culture medium A2SO4、MgSO4A sterile culture medium obtained from glycerol, yeast powder and trace elements, and Na in a culture medium B2SO4In a concentration of 5mM, MgSO4The concentration of the yeast powder is 2mM, the volume percentage of the glycerol is 0.5 percent, the mass percentage of the yeast powder is 0.5mg/100mL, and the concentration of each trace element in the B culture medium are respectively 50 MuM FeCl3,20μM CaCl2,10μM MnCl2,10μM ZnSO4,2μM CoCl2,2μM NiCl2,2μM Na2MO4,2μM Na2SeO3And 2 μ M H3BO3

C, culture medium: the medium C is a sterile medium obtained by adding glucose to the medium A, and the concentration of glucose in the medium C is 20 g/L.

(3-c) culture of cells and Induction of enzyme

Inoculating the engineering strain M-ABC cultured overnight into a shake flask of 20ml of B culture medium according to the inoculation amount of 1%, culturing at 37 ℃ for 6h, adding arabinose into the culture system to enable the mass percent concentration of the arabinose in the culture system to be 0.2%, continuously culturing for 12h, and collecting thalli, namely M-ABC thalli.

According to the method, M-ABC is replaced by M-FGH, M-opFGH, BW, MA-FGH and ME-FGH respectively to obtain M-FGH, M-opFGH, BW, MA-FGH and ME-FGH thalli.

(3-d) Whole cell catalysis of malonyl-CoA

The same amount of the collected cells of step (3-c) (the amount of cells used was 1mL OD)60090 cells) the amount of malonyl-coa synthesized in each cell was determined as follows:

the thalli is resuspended in a shake flask containing 5ml of C culture medium, cultured for 3h at 37 ℃, centrifuged and collected, then resuspended in 400 mu L of 80% (volume percentage) methanol aqueous solution precooled at minus 80 ℃, cells are broken by ultrasonic, centrifuged for 20min at 12000rpm and 4 ℃, and the supernatant is collected to detect the content of malonyl coenzyme A in the supernatant. The content of malonyl-CoA in the supernatant was analyzed by LCMS/MS using malonyl-CoA (Sigma, 63410-10MG-F) as a standard by a standard curve method (external standard method).

The results are shown in FIG. 1, in which the ordinate represents the relative signal intensity of malonyl-CoA detected in the supernatants of BW, M-ABC, M-FGH, M-opFGH, MA-FGH and ME-FGH, and the relative signal intensity of malonyl-CoA detected in the supernatants of BW, M-ABC, M-FGH, M-opFGH, MA-FGH and ME-FGH is 0.22, 1.13, 1.89, 2.43, 2.44 and 4.59, respectively. Relative to BW, the relative contents of malonyl coenzyme A in the supernatants of M-ABC, M-FGH, M-opFGH, MA-FGH and ME-FGH are 5.09, 8.48, 10.94, 10.98 and 20.61 respectively, namely the contents of malonyl coenzyme A in the supernatants of M-ABC, M-FGH, M-opFGH, MA-FGH and ME-FGH are 5.09 times, 8.48 times, 10.94 times, 10.98 times and 20.61 times of BW respectively.

The results show that the synthesis amount of malonyl-CoA is remarkably improved after the branched-chain alpha-ketoacid dehydrogenase complex encoding gene is introduced into escherichia coli; compared with the genes of bkdA, bkdB, bkdC and lpdA1, the genes of bkdF, bkdG, bkdH and lpdA1 have higher synthesis amount of malonyl-CoA; after the bkdH and lpdA1 genes are optimized according to the preference of the escherichia coli codon, the synthesis amount of malonyl coenzyme A can be further improved; by deleting ilvA and ilvE genes in the branched-chain alpha-keto acid synthesis pathway, which is a substrate of the branched-chain alpha-keto acid dehydrogenase complex, after the genes bkdF, bkdG, bkdH, and lpdA1 are introduced, the amount of malonyl CoA synthesized can be further increased, and the amount of malonyl CoA synthesized after the ilvE gene is deleted can be greatly increased.

Example 2 phosphoenolpyruvate carboxylase ppc Gene can increase the amount of M-FGH malonyl CoA synthesized in the engineered Strain

In this example, based on the engineered strain M-FGH of example 1, phosphoenolpyruvate carboxylase ppc gene was introduced and ompT gene (ompT protein shown by sequence 27 and sequence 28 in the sequence table) was deleted, and the amount of malonyl-CoA synthesized was further increased using primers shown in Table 2.

(4) Construction of phosphoenolpyruvate carboxylase (PPC) gene engineering strain

(4-a) extraction of Corynebacterium glutamicum (Corynebacterium glutamicum) and Escherichia coli genomes, and PCR amplification of upstream and downstream homology arms of ppc gene, chloramphenicol resistant fragment and ompT gene

The ppc-F and the ppc-R are used as PCR amplification primers, and Corynebacterium glutamicum genome DNA is used as a template to obtain a fragment tac-ppc containing a ppc gene, wherein the nucleotide sequence of the ppc gene is a ppc protein shown as a sequence 18 and a coding sequence 19 in a sequence table. Cm-F and Cm-R are used as PCR amplification primers, lox71-Cm-lox66-tac is used as a template, a fragment Cm is obtained through amplification, the nucleotide sequence of the lox71-Cm-lox66-tac fragment is a sequence 20 in a sequence table, and the fragment is obtained through whole gene synthesis (Nanjing Kingsri). And taking ompT-up-F and ompT-up-R as PCR amplification primers, taking escherichia coli genome DNA as a template, amplifying to obtain a fragment ompT-up, taking ompT-down-F and ompT-down-R as PCR amplification primers, and taking the escherichia coli genome DNA as a template, and amplifying to obtain a fragment ompT-down.

(4-b) preparation of ompT-up-Cm-tac-ppc-ompT-down targeting fragment

Four fragments of tac-ppc, Cm, ompT-up and ompT-down are taken as templates, ompT-up-F and ompT-down-R are taken as primers, a target fragment ompT-up-Cm-tac-ppc-ompT-down is obtained by means of fusion PCR amplification, and a target fragment is recovered by agarose gel electrophoresis (Tiangen Biochemical technology Co., Ltd., product catalog is DP 209).

(4-c) preparation of pKD 46-containing plasmid host bacterium

The pKD46 plasmid (Clontech) was transformed into the engineered strain M-FGH by calcium chloride transformation, and after overnight culture at 30 ℃ on LB plates containing ampicillin and kanamycin, clones were selected to obtain recombinant E.coli M-FGH/pKD46 containing plasmid pKD 46. After the recombinant Escherichia coli M-FGH/pKD46 is induced by arabinose, 3 recombinant proteins of phage are expressed, and the host bacteria have the capacity of homologous recombination. M-FGH/pKD46 competent cells were then prepared by 10% glycerol washing.

(4-d) homologous recombination

The ompT-up-Cm-tac-ppc-ompT-down fragment of (4-b) was electroporated into M-FGH/pKD46 competent cells prepared in (4-c), overnight at 37 ℃ on LB plate containing kanamycin (50. mu.g/ml) and chloramphenicol (34. mu.g/ml), clones were selected, identified by PCR amplification using ompT-up1k-F and ppc-R primers (6000 bp bands amplified as positive), and the selected positive clones were named M-FGH-ppc. The M-FGH-ppc contains a ppc gene shown as a sequence 18 in a sequence table and can express a ppc protein shown as a sequence 19. M-FGH-ppc does not contain ompT gene.

(5) Detection of synthesis amount of malonyl coenzyme A of over-expressed phosphoenolpyruvate carboxylase ppc gene and streptomyces avermitilis branched chain alpha-ketoacid dehydrogenase complex gene bkdFGH-lpdA1 engineering strain

The amounts of malonyl-CoA synthesized in these two strains were determined by replacing M-ABC with M-FGH and M-FGH-ppc, respectively, and keeping the other steps unchanged, according to the methods (3-c) and (3-d) in step (3) of example 1.

The results are shown in FIG. 2, in which the ordinate represents the relative signal intensity of malonyl-CoA detected in the supernatants of M-FGH and M-FGH-ppc, and the relative signal intensity of malonyl-CoA detected in the supernatants of M-FGH and M-FGH-ppc is 1.89 and 3.66, respectively. The relative content of malonyl-CoA in the supernatant of M-FGH-ppc was 1.94 relative to M-FGH, i.e., the content of malonyl-CoA in the supernatant of M-FGH-ppc was 1.94 times that of M-FGH. It is shown that the ppc gene can increase the synthesis amount of malonyl-CoA.

TABLE 2 primer sequence List

Example 3 expression of the branched-chain alpha-ketoacid dehydrogenase complex gene bkdFGH-lpdA1 of S.avermitilis can increase the yield of 3-hydroxypropionic acid (3-HP).

The 3-hydracrylic acid is an important platform compound, is a synthetic raw material of various chemicals, can obtain the 3-hydracrylic acid by taking malonyl coenzyme A as a precursor through two-step reduction reaction. In this example, 3-hydroxypropionic acid was prepared by introducing the malonyl-CoA reductase gene mcr of Thermolucophyta thermophila (Chloroflexus aurantiacaus) into M-ABC, M-FGH, M-opFGH, MA-FGH, ME-FGH, and M-FGH-ppc obtained in example 1 and example 2, and BW of example 1 was used as a control. The primers used are shown in Table 3.

(6) Construction of plasmid expressing Malonophycus thermophilus (Chloroflexus aurantiacaus) malonyl-CoA reductase Gene mcr

(6-a) the nucleotide sequence of the engineered pyrrophytic rhodochrous (Chloroflexus aurantiacus) malonyl-CoA reductase gene mcr gene is a sequence 21 in a sequence table, wherein the nucleotide sequence of the N-terminal domain of the mcr gene is the 1 st-1689 th site of the sequence 21, and the N-terminal domain of the mcr shown as a sequence 22 in the coding sequence table; the C-terminal domain nucleotide sequence of the mcr gene is 1704 (th) -3749 (th) of the sequence 21 and encodes the C-terminal domain of the mcr shown as the sequence 23 in the sequence table; the sequence contains RBS site between the N-terminal domain and the C-terminal domain, and is 1691-1696 th site of the sequence 21. The mcr gene fragment shown in the sequence 21 is synthesized by whole gene and is connected to a pUC57 vector to obtain a recombinant vector pUC 57-mcr. PCR amplified fragments were obtained by amplification with pUC57-mcr as a template and the primers mcr-F/mcrR.

(6-b) subjecting the PCR amplified fragment obtained in (6-a) to agarose gel electrophoresis, and recovering a target fragment; meanwhile, the vector pLB1a (the nucleotide sequence of the vector pLB1a is the sequence 24 in the sequence table) is cut by NcoI and XhoI, and the large fragment LB1a-NX (namely the vector framework) of the vector is recovered. The target fragment recovered above is ligated with LB1a-NX fragment by Gibson assembly method, and the ligation product is then ligated with CaCl2Escherichia coli DH 5. alpha. competent cells (Beijing Panzhijin Biotechnology Co., Ltd., product catalog CD201) were transformed by the method, spread on an LB plate containing streptomycin, and cultured overnight at 37 ℃. PickSelecting clones, identifying clones capable of amplifying target fragments by using a primer F-105/mcr-R, sequencing, selecting positive clones, extracting plasmids, and naming the obtained positive plasmids with correct sequences as pLB1 a-mcr.

pLB1a-mcr contains mcr gene shown in sequence 21 in the sequence table, and can express the N-terminal domain and C-terminal domain of mcr shown in sequences 22 and 23.

(7) Construction of engineering strain for producing 3-hydroxypropionic acid and whole-cell catalysis of 3-hydroxypropionic acid

(7-a) respectively introducing pLB1a-mcr obtained in the step (6) into M-ABC, M-FGH, M-opFGH, MA-FGH, ME-FGH, M-FGH-ppc and BW, sequentially naming the obtained recombinant bacteria as M-ABC-HP, M-FGH-HP, M-opFGH-HP, MA-FGH-HP, ME-FGH-HP, M-FGH-ppc-HP and BW-HP, and further using each recombinant bacterium as a strain to be detected for preparing 3-hydroxypropionic acid.

(7-b) culture of engineered Strain and Induction of enzyme

Inoculating the overnight cultured strain to be detected into the shake flask of the 20ml B culture medium in the step (3-B) according to the inoculation amount of 1%, culturing for 6h at 37 ℃, adding arabinose into the culture system to ensure that the mass percent concentration of the arabinose in the culture system is 0.2%, continuously culturing for 12h, and collecting thalli.

(7-c) Whole cell catalysis of 3-Hydroxypropionic acid

And (3) suspending the collected thalli in a shake flask containing 5ml of C culture medium, culturing for 8h at 37 ℃, centrifuging, taking supernatant, filtering, and collecting filtrate. The amount of the used bacteria was 5mL OD60030 cells. And (3) quantitatively analyzing the content of the 3-hydroxypropionic acid in the filtrate by using 3-hydroxypropionic acid (TCI, H0297-10G) as a standard substance and using an HPLC (high performance liquid chromatography) method by using a standard curve method (an external standard method).

As shown in FIG. 3, the 3-hydroxypropionic acid contents of the filtrates obtained from M-ABC-HP, M-FGH-HP, M-opFGH-HP, MA-FGH-HP, ME-FGH-HP, M-FGH-ppc-HP, and BW-HP were 0.86, 1.44, 1.65, 1.80, 3.84, 1.94, and 0.55g/L, respectively, and the 3-hydroxypropionic acid yields of M-ABC-HP, M-FGH-HP, M-opFGH-HP, MA-FGH-HP, ME-FGH-HP, and M-FGH-ppc-HP were 1.56 times, 2.62 times, 3.00 times, 3.27 times, 6.98 times, and 3.53 times, respectively, as compared with BW-HP.

The results show that the yield of 3-hydroxypropionic acid is remarkably improved after the branched chain alpha-ketoacid dehydrogenase complex encoding gene is introduced into escherichia coli; compared with the introduction of bkdA, bkdB, bkdC and lpdA1 genes, the introduction of bkdF, bkdG, bkdH and lpdA1 genes can lead to higher yield of 3-hydroxypropionic acid; after the bkdH and lpdA1 genes are optimized according to the preference of the Escherichia coli codon, the yield of the 3-hydroxypropionic acid can be further improved; on the basis of introducing bkdF, bkdG, bkdH and lpdA1 genes, the ilvA and ilvE genes in the branched-chain alpha-keto acid synthesis pathway, which is a substrate of the branched-chain alpha-keto acid dehydrogenase complex, are knocked out, so that the yield of 3-hydroxypropionic acid can be further improved, and the yield of 3-hydroxypropionic acid after the ilvE gene is knocked out can be greatly improved; the yield of 3-hydroxypropionic acid can be further improved by introducing the ppc gene into the host cell after introducing the bkdF, bkdG, bkdH and lpdA1 genes. The tendency of the production amount of 3-hydroxypropionic acid was the same as that of the synthesis amount of malonyl-CoA of the corresponding strains in examples 1 and 2.

TABLE 3 primer sequence List

Example 4 expression of Streptomyces avermitilis branched-chain alpha-ketoacid dehydrogenase complex gene bkdFGH-lpdA1 to increase yield of hop beta-acid precursor PIVP

Heterologous expression of the type III polyketide picric acid from the plant hop in E.coli. The picric acid is specifically synthesized and accumulated in glandular hair of hop (Humulus lupulus of Humulus of Cannabaceae) as a flavor substance, is an essential element in the beer brewing industry, has high medicinal value and health care function, and is a precursor substance of many medicines. It has now been reported that its pathway synthesis can be achieved in yeast. The pathway is mainly that branched fatty acyl coenzyme A and malonyl coenzyme A generate 3-methyl-isobutyryl phloroglucinol (PIVP) under the action of vps (phenylpentanone synthase), and then the direct precursor Di-Prenyl PIVP is generated by the PIVP and DMAPP under the action of HIPT1HIPT2 (isopentenyl transferase). Then oxidized to picric acid (figure 4). In this example, PIVP was synthesized by introducing hop (Humulus lupulus) cyclopentanone synthase gene vps gene into M-ABC, M-FGH, M-opFGH, MA-FGH, ME-FGH, and M-FGH-ppc obtained in example 1 and example 2, and using BW of example 1 as a control. The primers used are shown in tables 4 and 3.

(8) Construction of plasmid expressing hop (Humulus lupulus) cyclopentanone synthetase Gene vps Gene

(8-a) the nucleotide sequence of the hop cyclopentanone synthetase gene vps gene is the sequence 25 in the sequence table. The vps gene was synthesized as a whole gene and ligated to pUC57 vector to obtain vector pUC 57-vps. The vps gene fragment was PCR amplified using pUC57-vps as a template and primers vps-F/vps-R.

(8-b) subjecting the PCR amplified fragment obtained in (8-a) to agarose gel electrophoresis, and recovering a target fragment; the vector pLB1a (vector pLB1a nucleotide sequence such as sequence 24) was digested simultaneously with NcoI and XhoI, and the vector large fragment LB1a-NX fragment (i.e., vector backbone) was recovered. The target fragment recovered above is ligated with LB1a-NX fragment by Gibson assembly method, and the ligation product is then ligated with CaCl2Escherichia coli DH 5. alpha. competent cells (Beijing Panzhijin Biotechnology Co., Ltd., product catalog CD201) were transformed by the method, spread on an LB plate containing streptomycin, and cultured overnight at 37 ℃. Selecting clones, identifying clones capable of amplifying target fragments by using a primer F-105/vps-R, sequencing, selecting positive clones, extracting plasmids, and naming the obtained positive plasmids with correct sequences as pLB1 a-vps.

pLB1a-vps contains vps gene shown in sequence 25 in the sequence table, and can express vps protein shown in sequence 26.

(9) PIVP-producing strain construction and 3-hydroxypropionic acid whole-cell catalysis

(9-a) introducing pLB1a-vps obtained in the step (8) into M-ABC, M-FGH, M-opFGH, MA-FGH, ME-FGH, M-FGH-ppc and BW respectively, sequentially naming the obtained recombinant bacteria as M-ABC-PIVP, M-FGH-PIVP, M-opFGH-PIVP, MA-FGH-PIVP, ME-FGH-PIVP, M-FGH-ppc-PIVP and BW-PIVP, and further using each recombinant bacterium as a strain to be tested for synthesizing PIVP.

(9-b) culture of engineered Strain and Induction of enzyme

Inoculating each strain to be detected after overnight culture into a shake flask of the (3-B) 20ml B culture medium according to the inoculation amount of 1%, culturing for 6h at 37 ℃, adding arabinose into the culture system to enable the mass percent concentration of the arabinose in the culture system to be 0.2%, continuously culturing for 12h, and collecting thalli.

(9-c) Whole cell catalysis of PIVP

And (3) suspending the collected thalli in a shake flask containing 5ml of C culture medium, culturing for 8h at 37 ℃, centrifuging, suspending in 400 mu L of 80% (volume percentage) methanol aqueous solution precooled at minus 80 ℃, ultrasonically breaking cells, centrifuging at 12000rpm at 4 ℃ for 20min, collecting supernatant, and detecting the content of PIVP in the supernatant. The amount of the used bacteria was 5mL OD60030 cells. The content of PIVP in the supernatant was analyzed by LCMS/MS using PIVP (TRC, P339590-1g) as a standard by a standard curve method (external standard method).

As shown in FIG. 5, the supernatant obtained from M-ABC-PIVP, M-FGH-PIVP, M-opFGH-PIVP, MA-FGH-PIVP, ME-FGH-PIVP, M-FGH-ppc-PIVP and BW-PIVP contains PIVP in amounts of 8.96, 16.68, 23.63, 15.14, 2.49, 82.50 and 0mg/L, respectively, PIVP is not produced from BW-PIVP, PIVP can be produced from M-ABC-PIVP, M-FGH-PIVP, M-opFGH-PIVP, MA-FGH-PIVP, ME-FGH-PIVP and M-FGH-ppc-PIVP.

It was shown that PIVP can be produced by introducing a branched-chain alpha-ketoacid dehydrogenase complex-encoding gene into E.coli: BW-PIVP does not synthesize PIVP; compared with the introduction of bkdA, bkdB, bkdC and lpdA1 genes, the PIVP produced by introducing bkdF, bkdG, bkdH and lpdA1 genes is higher; after the bkdH and lpdA1 genes are optimized according to the preference of the Escherichia coli codon, the yield of PIVP can be further improved; the yield of PIVP can be further greatly improved by introducing the ppc gene on the basis of introducing the bkdF, bkdG, bkdH and lpdA1 genes. On the other hand, in the case of introducing the bkdF, bkdG, bkdH, and lpdA1 genes, the ilvA and ilvE genes in the branched-chain α -keto acid synthesis pathway, which are substrates of the branched-chain α -keto acid dehydrogenase complex, were deleted, and the PIVP production was not improved as much as the variation in the amount of synthesis of malonyl-CoA in the corresponding strain of example 1, because the branched-chain α -keto acid was involved in the PIVP production process, and the branched-chain α -keto acid content was decreased after the ilvA and ilvE genes were deleted, which in turn affected the PIVP production, and therefore, in producing the desired product using malonyl-CoA as an intermediate, whether or not the gene in the branched-keto acid synthesis pathway was deleted could be determined depending on whether or not the branched-chain α -keto acid was required in the synthesis pathway.

TABLE 4 primer sequence List

Example 5 expression and purification of branched-chain alpha-ketoacid dehydrogenase Complex Gene of Streptomyces avermitilis and Activity detection of oxaloacetate dehydrogenase Complex thereof

(10) Construction of streptomyces avermitilis branched chain alpha-ketoacid dehydrogenase complex protein expression vector

(10-a) YK-BCDH-His DNA fragment was obtained by PCR amplification using the pYB1 k-bkddFGH-lpdA 1 plasmid of (1-b) as a template and primers BCDH-His-F and BCDH-His-R (Table 5).

(10-b) subjecting the YK-BCDH-His DNA fragment obtained by PCR amplification in (10-a) to DpnI enzyme digestion treatment, and using CaCl to digest the enzyme digestion product2Escherichia coli DH 5. alpha. competent cells (Beijing Quanjin Biotechnology Co., Ltd., product catalog CD201) were transformed, plated on LB plate containing kanamycin, and cultured overnight at 37 ℃. Selecting clones, identifying clones capable of amplifying target fragments by using primers F108/lpdA1-XhoI, sequencing, selecting positive clones, extracting plasmids, and naming the obtained positive plasmids with correct sequences as pYB1 k-His-BCDH.

(11) Expression and purification of branched chain alpha-ketoacid dehydrogenase complex protein of streptomyces avermitilis

(11-a) pYB1k-His-BCDH was introduced into E.coli BW25113 of example 1, and the resulting recombinant strain was named His-BCDH.

(11-b) inoculating the engineering strain His-BCDH cultured overnight into the shake flask of the 5L B culture medium in the step (3-a) according to the inoculation amount of 1%, culturing at 30 ℃ for 6h, adding arabinose into the culture system to ensure that the mass percent concentration of the arabinose in the culture system is 0.2%, continuing culturing for 20h, collecting thalli, washing the thalli twice by using a D buffer solution, then suspending the thalli in the D buffer solution, crushing cells, centrifuging at 20000rpm for 2h, and collecting a supernatant.

The D buffer solution consists of a solvent and a solute, wherein the solvent is water, the solute and the concentration of the solute in the D buffer solution are 50mM Tris-HCl and 200mM KCl, and the pH value is 8.0.

(11-c) balancing 10 column volumes of the nickel column by using a buffer solution D, enabling a supernatant obtained in the step (11-b) through centrifugation to flow through the nickel column after balancing, washing by using a buffer solution D with the column volume of 10, sequentially washing by using mixed buffer solutions with the volume ratios of the buffer solution D to the buffer solution E of 49/1, 45/5 and 42/8 for 5 column volumes respectively, then eluting by using a mixed buffer solution with the volume ratio of the buffer solution D to the buffer solution E of 1/1 to obtain an avilamyces branched chain alpha-ketoacid dehydrogenase complex protein, washing and concentrating the complex protein by using a 100kDa ultrafiltration tube (Amicon Ultra-15) to obtain desalted protein, and the desalted protein can be used for in-vitro enzyme activity detection.

The E buffer was a solution having an imidazole concentration of 500mM obtained by adding imidazole to the D buffer.

(12) Activity detection of branched chain alpha-ketoacid dehydrogenase complex of streptomyces avermitilis for catalyzing oxaloacetate to generate malonyl coenzyme A

In a 96-well plate, 200. mu. L F buffer was added per well, and then divided into five groups, i.e., OAA group, OAA-EDTA group, KIV group (positive control), KIV-EDTA group, and control group, each of which was 3 replicates.

The buffer solution F consists of a solvent and a solute, wherein the solvent is 50mM Tris-HCl buffer solution (pH 7.0), and the solute and the concentration of the solute in the buffer solution F are respectively 0.1mM CoA (coenzyme A), 0.2mM DTT (dithiothreitol), 0.2mM TPP (thiamine pyrophosphate) and 1mM MgSO4And 2mM NAD+(oxidized form of nicotinamide adenine dinucleotide).

Adding oxaloacetate to each well of the OAA group at a concentration of 3mM in the reaction system;

adding oxaloacetic acid and disodium Ethylenediaminetetraacetate (EDTA) to each well of the OAA-EDTA group at concentrations of 3mM and 10mM, respectively, in the reaction system;

adding alpha-ketoisovalerate (3-methyl-2-oxobutyric acid) to each well of the KIV group, wherein the concentration of the alpha-ketoisovalerate in the reaction system is 3 mM;

to each well of the KIV-EDTA group, α -ketoisovalerate and EDTA were added at concentrations of 3mM and 10mM, respectively, in the reaction system.

The control group contained only F buffer.

After the addition of each reagent, the branched alpha-ketoacid dehydrogenase complex was added thereto, and 10. mu.L of a 0.054mg/mL branched alpha-ketoacid dehydrogenase complex solution was added to 200. mu.L of the reaction system.

Each set of 96-well plates was then placed at 30 ℃ for reaction for 30min, and absorbance at 340nm was measured once per minute using a microplate reader (BioTek).

The results are shown in FIG. 6. In vitro biochemical experiments prove that the branched-chain alpha-ketoacid dehydrogenase complex derived from the streptomyces avermitilis has the activity of catalyzing oxaloacetate to generate malonyl coenzyme A, the enzyme activity of the branched-chain alpha-ketoacid dehydrogenase complex is 2.238mM/min/mg protein, and the enzyme activity of the branched-chain alpha-ketoacid dehydrogenase complex is defined as the molar quantity of NADH catalytically generated per milligram of the branched-chain alpha-ketoacid dehydrogenase complex per minute.

TABLE 5 primer sequence List

Sequence listing

<110> institute of microbiology of Chinese academy of sciences

<120> application of branched-chain alpha-ketoacid dehydrogenase complex in preparation of malonyl-CoA

<160> 28

<170> SIPOSequenceListing 1.0

<210> 1

<211> 3591

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 1

atgacggtca tggagcagcg gggcgcttac cggcccacac cgccgcccgc ctggcagccc 60

cgcaccgacc ccgcgccact gctgcccgac gcgctgcccc accgcgtcct gggcaccgag 120

gcggccgcgg aggccgaccc gctactgctg cgccgcctgt acgcggagct ggtgcgcggc 180

cgccgctaca acacgcaggc cacggctctc accaagcagg gccggctcgc cgtctacccg 240

tcgagcacgg gccaggaggc ctgcgaggtc gccgccgcgc tcgtgctgga ggagcgcgac 300

tggctcttcc ccagctaccg ggacaccctc gccgccgtcg cccgcggcct cgatcccgtc 360

caggcgctca ccctcctgcg cggcgactgg cacaccgggt acgacccccg tgagcaccgc 420

atcgcgcccc tgtgcacccc tctcgcgacc cagctcccgc acgccgtcgg cctcgcgcac 480

gccgcccgcc tcaagggcga cgacgtggtc gcgctcgccc tggtcggcga cggcggcacc 540

agcgagggcg acttccacga ggcactgaac ttcgccgccg tctggcaggc gccggtcgtc 600

ttcctcgtgc agaacaacgg cttcgccatc tccgtcccgc tcgccaagca gaccgccgcc 660

ccgtcgctgg cccacaaggc cgtcggctac gggatgccgg gccgcctggt cgacggcaac 720

gacgcggcgg ccgtgcacga ggtcctcagc gacgccgtgg cccacgcgcg cgcgggaggg 780

gggccgacgc tcgtggaggc ggtgacctac cgcatcgacg cccacaccaa cgccgacgac 840

gcgacgcgct accgggggga ctccgaggtg gaggcctggc gcgcgcacga cccgatcgcg 900

ctcctggagc acgagttgac cgaacgcggg ctgctcgacg aggacggcat ccgggccgcc 960

cgcgaggacg ccgaggcgat ggccgcggac ctgcgcgcac gcatgaacca ggatccggcc 1020

ctggacccca tggacctgtt cgcccatgtg tatgccgagc ccacccccca gctgcgggag 1080

caggaagccc agttgcgggc cgagctggca gcggaggccg acgggcccca aggagtcggc 1140

cgatgaagag agttgaccat cgggccccga gaagcgggcc gatgacctcc gttggccttt 1200

ggccggaagg agccgggcga tgaccaccgt tgccctcaag ccggccacca tggcgcaggc 1260

actcacacgc gcgttgcgtg acgccatggc cgccgacccc gccgtccacg tgatgggcga 1320

ggacgtcggc acgctcggcg gggtcttccg ggtcaccgac gggctcgcca aggagttcgg 1380

cgaggaccgc tgcacggaca cgccgctcgc cgaggcaggc atcctcggca cggccgtcgg 1440

catggcgatg tacgggctgc ggccggtcgt cgagatgcag ttcgacgcgt tcgcgtaccc 1500

ggcgttcgag cagctcatca gccatgtcgc gcggatgcgc aaccgcaccc gcggggcgat 1560

gccgctgccg atcaccatcc gtgtccccta cggcggcgga atcggcggag tcgaacacca 1620

cagcgactcc tccgaggcgt actacatggc gactccgggg ctccatgtcg tcacgcccgc 1680

cacggtcgcc gacgcgtacg ggctgctgcg cgccgccatc gcctccgacg acccggtcgt 1740

cttcctggag cccaagcggc tgtactggtc gaaggactcc tggaacccgg acgagccggg 1800

gaccgttgaa ccgataggcc gcgcggtggt gcggcgctcg ggccggagcg ccacgctcat 1860

cacgtacggg ccttccctgc ccgtctgcct ggaggcggcc gaggcggccc gggccgaggg 1920

ctgggacctc gaagtcgtcg atctgcgctc cctggtgccc ttcgacgacg agacggtgtg 1980

cgcgtcggtg cgccggaccg gacgcgccgt cgtcgtgcac gagtcgggtg gttacggcgg 2040

cccgggcggg gagatcgccg cgcggatcac cgagcgctgc ttccaccatc tggaggcgcc 2100

ggtgctgcgc gtcgccgggt tcgacatccc gtatccgccg ccgatgctgg agcgccatca 2160

tctgcccggt gtcgaccgga tcctggacgc ggtggggcgg cttcagtggg aggcggggag 2220

ctgatggccc aggtgctcga gttcaagctc cccgacctcg gggagggcct gaccgaggcc 2280

gagatcgtcc gctggctggt gcaggtcggc gacgtcgtgg cgatcgacca gccggtcgtc 2340

gaggtggaga cggccaaggc gatggtcgag gtgccgtgcc cctacggggg cgtggtcacc 2400

gcccgcttcg gcgaggaggg cacggaactg cccgtgggct caccgctgtt gacggtggct 2460

gtcggagctc cgtcctcggt gcccgcggcg tcctcgctgt ccggggcgac atcggcgtcc 2520

tccgcgtcct cggtgtcatc ggacgacggc gagtcgtccg gcaacgtcct ggtcggatac 2580

ggcacgtcgg ccgcgcccgc gcgccggcgg agggtgcggc cgggccaggc ggcacccgtg 2640

gtgacggcaa ctgccgccgc ggccgccacg cgcgtggcgg ctcccgagcg gagcgacggc 2700

cccgtgcccg tgatctcccc gctggtccgc aggctcgccc gggagaacgg cctggatctg 2760

cgggcgctgg cgggctccgg gcccgacggg ctgatcctga ggtcggacgt cgagcaggcg 2820

ctgcgcgccg cgcccactcc tgcccccacc ccgaccatgc ctccggctcc cactcctgcc 2880

cccacccccg ccgcggcacc ccgcggcacc cgcatccccc tccgaggggt ccgcggtgcc 2940

gtcgccgaca aactctcccg cagccggcgt gagatccccg acgcgacctg ctgggtggac 3000

gccgacgcca cggcactcat gcacgcgcgc gtggcgatga acgcgaccgg cggcccgaag 3060

atctccctca tcgcgctgct cgccaggatc tgcaccgccg cactggcccg cttccccgag 3120

ctcaactcca ccgtcgacat ggacgcccgc gaggtcgtac ggctcgacca ggtgcacctg 3180

ggcttcgccg cgcagaccga acgggggctc gtcgtcccgg tcgtgcggga cgcgcacgcg 3240

cgggacgccg agtcgctcag cgccgagttc gcgcggctga ccgaggccgc ccggaccggc 3300

accctcacac ccggggaact gaccggcggc accttcacgt tgaacaacta cggggtgttc 3360

ggcgtcgacg gttccacgcc gatcatcaac caccccgagg cggccatgct gggcgtcggc 3420

cgcatcatcc ccaagccgtg ggtgcacgag ggcgagctgg cggtgcggca ggtcgtccag 3480

ctctcgctca ccttcgacca ccgggtgtgc gacggcggca cggcaggcgg tttcctgcgc 3540

tacgtggcgg actgcgtgga acagccggcg gtgctgctgc gcaccctgta g 3591

<210> 2

<211> 3608

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 2

atgaccgtgg agagcactgc cgcgcgaaag ccgcgacgca gcgccggtac gaagagcgcc 60

gcagccaagc gcaccagccc cggcgccaag aagtcaccga gcacgaccgg cgccgagcac 120

gagctgattc agctgctcac gcccgacggc cggcgggtga agaaccccga gtacgacgcg 180

tacgtcgcgg acatcacccc cgaagagctg cgcggtctgt accgggacat ggtgctgagc 240

cgccgcttcg acgcagaggc cacctccctg caacgccagg gcgagctggg cctgtgggcc 300

tcgatgctcg ggcaggaggc cgcccagatc ggctcgggcc gggccacccg tgacgacgac 360

tacgtcttcc cgacctaccg cgagcacggc gtcgcctggt gccgcggggt cgaccccacc 420

aacctgctcg gcatgttccg cggcgtgaac aacggcggct gggatcccaa cagcaacaac 480

ttccacctct acacgatcgt catcggctcg cagacgctgc acgccaccgg ctacgccatg 540

ggtatcgcca aggacggcgc cgactcggcc gtgatcgcgt acttcggtga cggcgcctcc 600

agccagggtg acgtcgccga atcgttcacc ttctccgcgg tctacaacgc ccctgtcgtc 660

ttcttctgcc agaacaacca gtgggcgatc tccgagccca ccgagaagca gacccgcgtc 720

ccgctctacc agcgcgcgca gggctacggc ttcccgggcg tccgcgtcga cggcaacgac 780

gtactggcct gcctcgccgt caccaagtgg gccctcgagc gggcccgccg gggcgagggg 840

cccacgttgg tcgaggcgtt cacgtaccgc atgggcgcgc acaccacctc cgacgacccg 900

accaagtacc gggccgacga ggagcgcgag gcgtgggagg cgaaggaccc gatcctgcgt 960

ctgcgcacgt atctcgaggc ctcaaaccac gcggacgagg gattcttcgc ggaactcgag 1020

gtggagagcg aggcgttggg aaggcgagtg cgcgaagtgg tgcgtgccat gccggacccg 1080

gaccacttcg ccatcttcga gaacgtgtac gcggacgggc atgcgctcgt cgacgaggag 1140

cgggcgcagt tcgccgccta ccaggcgtcg ttcacgacgg agcctgacgg cggctccgcc 1200

gcgggacagg ggggtaactg acatggccga gaagatggcg atcgccaagg cgatcaacga 1260

gtcgctgcgc aaggccctgg agtccgaccc caaggttctg atcatgggtg aggacgtcgg 1320

caagctcggt ggcgtcttcc gcgtcaccga cggcctgcag aaggacttcg gcgaggagcg 1380

ggtcatcgac accccgctcg ccgagtcggg catcgtcggc acggcgatcg gtctcgccct 1440

gcgcggctac cgcccggtgg tggagatcca gttcgacggc ttcgtcttcc cggcgtacga 1500

ccagatcgtc acgcagctcg cgaagatgca cgcgcgggcg ctcggcaaga tcaagctccc 1560

cgttgtcgtc cacatcccgt acggcggcgg catcggcgcc gtcgagcacc actccgagtc 1620

ccccgaggcg ctcttcgcgc acgtggcggg cctcaaggtg gtctccccgt ccaacgcgtc 1680

ggacgcgtac tggatgatgc agcaggccat ccagagcgac gacccggtga tcttcttcga 1740

gtcgaagcgg cgctactggg acaagggcga ggtcaacgtc gaggcgatcc ccgacccgct 1800

gcacaaggcc cgtgtggtgc gtgagggcac cgacctgacg ctcgccgcgt acggcccgat 1860

ggtgaaggtc tgccaggagg ccgcggccgc cgccgaggag gagggcaagt ccctggaggt 1920

cgtcgacctg cgctccatgt cgccgatcga cttcgacgcc gtccaggcct ccgtcgagaa 1980

gacccgccgt ctggtcgtgg tgcacgaggc gccggtgttc ctgggcacgg gcgcggagat 2040

cgccgcccgc atcacggagc gctgcttcta ccacctggag gcacccgtgc tgagggtcgg 2100

cggctaccac gccccgtatc cgccggcgcg tctggaagag gagtaccttc cgggccttga 2160

ccgggtgctc gatgccgtcg accgctcgct ggcgtactga ggagagggtc gtgacgacga 2220

tgactgaggc gtccgtgcgt gagttcaaga tgcccgatgt gggtgaggga ctcaccgagg 2280

ccgagatcct caagtggtac gtccagcccg gcgacaccgt caccgacggc caggtcgtct 2340

gcgaggtcga gaccgcgaag gcggccgtgg aactccccat tccgtacgac ggtgtcgtac 2400

gcgaactccg tttccccgag gggacgacgg tggacgtggg acaggtgatc atcgcggtgg 2460

acgtggccgg cgacgcaccg gtggcggaga tccccgtgcc cgcgcaggag gctccggtcc 2520

aggaggagcc caagcccgag ggccgcaagc ccgtcctcgt gggctacggg gtggccgagt 2580

cctccaccaa gcgccgtccg cgcaagagcg cgccggcgag cgagcccgct gcggagggca 2640

cgtacttcgc agcgaccgtt ctccagggca tccagggcga gctgaacgga cacggcgcgg 2700

tgaagcagcg tccgctggcg aagccgccgg tgcgcaagct ggccaaggac ctgggcgtcg 2760

acctcgcgac gatcacgccg tcgggccccg acggcgtcat cacgcgcgag gacgtgcacg 2820

cggcggtggc gccaccgccg ccggcacccc agcccgtgca gacgcccgct gccccggccc 2880

cggcgccggt ggccgcgtac gacacggctc gtgagacccg tgtccccgtc aagggcgtcc 2940

gcaaggcgac ggcggcggcg atggtcggct cggcgttcac ggcgccgcac gtcacggagt 3000

tcgtgacggt ggacgtgacg cgcacgatga agctggtcga ggagctgaag caggacaagg 3060

agttcaccgg cctgcgggtg aacccgctgc tcctcatcgc caaggcgctc ctggtcgcga 3120

tcaagcggaa cccggacatc aacgcgtcct gggacgaggc gaaccaggag atcgtcctca 3180

agcactatgt gaacctgggc atcgcggcgg ccaccccgcg cggtctgatc gtcccgaaca 3240

tcaaggacgc ccacgccaag acgctgccgc aactggccga gtcactgggt gagttggtgt 3300

cgacggcccg cgagggcaag acgtccccga cggccatgca gggcggcacg gtcacgatca 3360

cgaacgtcgg cgtcttcggc gtcgacacgg gcacgccgat cctcaacccc ggcgagtccg 3420

cgatcctcgc ggtcggcgcg atcaagctcc agccgtgggt ccacaagggc aaggtcaagc 3480

cccgacaggt caccacgctg gcgctcagct tcgaccatcg cctggtcgac ggcgagctgg 3540

gctccaaggt gctggccgac gtggcggcga tcctggagca gccgaagcgg ctgatcacct 3600

gggcctag 3608

<210> 3

<211> 1389

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 3

atgactgaag cgtccgtgcg tgagttcaaa atgccggacg tgggtgaagg tctgaccgaa 60

gcggaaatcc tgaaatggta tgtgcagcct ggtgacacgg ttaccgatgg ccaggttgtt 120

tgcgaggtag aaactgcgaa agcggctgtt gagctgccga tcccgtacga cggcgtggtg 180

cgtgaactgc gtttcccgga aggtactact gtcgatgtcg gccaggtaat tatcgcagtt 240

gatgtggccg gcgacgcacc ggttgcggaa atcccggtgc cggcgcagga agccccggtc 300

caggaagagc cgaaaccgga aggtcgtaaa cctgtgctgg taggttatgg tgttgctgaa 360

agcagcacta aacgtcgccc gcgtaagtcc gcgccagcgt ccgaaccggc ggcagaaggc 420

acctatttcg ctgccaccgt tctgcaaggt attcagggtg aactgaacgg ccacggtgca 480

gtaaaacagc gcccactggc gaaaccacca gttcgcaaac tggcgaaaga cctgggtgtg 540

gatctggcga ctattacccc gtccggcccg gatggcgtta ttacccgtga agacgtacac 600

gctgctgtgg cgcctccgcc gccggcacct caaccggtgc agaccccggc ggcaccggct 660

ccggctccgg tggccgcgta cgatacggcg cgtgagacgc gcgttccagt aaaaggtgtt 720

cgtaaggcta ctgccgctgc tatggtgggt agcgcgttca ctgcacctca cgttaccgaa 780

tttgttacgg tagatgtgac tcgtactatg aaactggtgg aagaactgaa acaggataaa 840

gagttcactg gtctgcgcgt taacccgctg ctgctgattg cgaaagcact gctggtcgct 900

atcaagcgta acccggacat caatgcatcc tgggacgaag caaaccagga aatcgttctg 960

aagcactacg taaacctcgg tatcgcggct gcaaccccgc gcggcctgat cgtgccaaat 1020

atcaaagacg ctcatgccaa aaccctgccg cagctggcgg aatctctggg tgaactggtt 1080

tccaccgctc gcgagggtaa gacctccccg actgcaatgc agggcggtac ggtcaccatc 1140

accaatgtgg gtgtattcgg tgttgacacc ggcacgccga tcctgaaccc gggtgagtcc 1200

gccatcctcg ctgtaggtgc tatcaaactg caaccgtggg ttcacaaagg caaagttaaa 1260

ccacgtcagg ttaccaccct ggctctgagc ttcgaccacc gtctggttga cggtgaactg 1320

ggctccaagg tactggcgga cgtggcggcg atcctggagc agccgaagcg tctcatcact 1380

tgggcataa 1389

<210> 4

<211> 1389

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 4

atggcgaacg acgccagcac cgttttcgac ctagtgatcc tcggcggtgg tagcggtggt 60

tacgccgcgg ccctgcgcgg agcgcagctg ggcctggacg tcgccctgat cgagaaggac 120

aaggtcggcg gtacctgcct gcaccgtggg tgcatcccca ccaaggcgct gctgcacgcg 180

ggcgagatcg ccgaccaggc ccgcgagagc gagcagttcg gcgtcaaggc caccttcgag 240

ggcatcgacg taccggccgt ccacaagtac aaggacgggg tcatctcggg cctgtacaag 300

ggtctgcagg ggctgatcgc ctcccgcaag gtgacgtaca tcgagggtga gggccgtctg 360

tcctccccga cctccgtcga cgtgaacggc cagcgcgtcc agggccgcca cgtgctcctg 420

gcgaccggct ccgtgccgaa gtcgctgccg ggcctggcga tcgacggcaa ccgcatcatc 480

tcctccgacc acgcgctggt cctggaccgc gtcccggagt ccgcgatcgt gctcggcggc 540

ggcgtcatcg gcgtcgagtt cgcctccgcg tggaagtcct tcggagccga cgtgacggtg 600

atcgagggcc tcaagcacct cgtcccggtc gaggacgaga actcctccaa gcttcttgag 660

cgcgcgttcc gcaagcgcgg catcaagttc aacctgggca ccttcttctc gaaggccgag 720

tacacccaga acggtgtcaa ggtcaccctc gccgacggca aggagttcga ggccgaggtc 780

ctgctcgtcg ccgtcggccg cggcccggtc tcgcagggcc tcggctacga ggagcagggc 840

gtcgccatgg accgcggcta cgtcctggtc gacgagtaca tgcggacgaa cgtcccgacc 900

atctccgccg tcggtgacct ggtcccgacg ctccagctcg cgcacgtcgg cttcgccgag 960

ggcatcctgg tggcggagcg tctggccggt ctgaagaccg tcccgatcga ctacgacggc 1020

gtgccgcggg tgacgtactg ccaccccgag gtcgcctccg tgggcatcac cgaggccaag 1080

gccaaggaga tctacggcgc ggacaaggtc gtcgctctga agtacaacct ggcgggcaac 1140

ggcaagagca agatcctcaa caccgcgggc gagatcaagc tcgtccaggt gaaggacggt 1200

gccgtggtcg gcgtccacat ggtcggtgac cgtatgggcg agcaggtcgg cgaagcccag 1260

ctgatctaca actgggaggc gctgccggcc gaggtcgccc agctcatcca cgcccacccg 1320

acgcagaacg aagcgatggg cgaggcccac ctggccctcg cgggcaagcc gctgcactcg 1380

cacgactga 1389

<210> 5

<211> 1389

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 5

atggcaaacg acgcatctac ggtgttcgac ctggttatcc tcggtggtgg ttccggtggc 60

tatgctgccg cgctgcgtgg tgcacagctg ggtctggatg ttgcgctgat cgagaaagac 120

aaagttggtg gcacttgcct gcatcgtggt tgcatcccga ccaaagcgct gctgcacgcg 180

ggcgaaattg ctgatcaggc acgcgaatct gaacaattcg gtgtgaaagc gaccttcgag 240

ggtatcgacg ttccggctgt gcacaagtac aaagatggtg ttattagcgg tctgtacaaa 300

ggcctccaag gtctgattgc gtcccgcaag gtgacttaca tcgaaggcga gggtcgtctg 360

tccagcccta cctctgttga cgttaatggt cagcgtgttc aaggccgcca cgttctgctg 420

gctaccggct ctgttcctaa aagcctgcca ggtctggcta tcgacggtaa ccgtattatc 480

tcctctgatc atgctctggt cctggaccgc gttccggagt ccgcgattgt tctgggtggt 540

ggcgttatcg gtgttgagtt tgcctctgca tggaaatctt tcggcgcaga tgtaaccgta 600

atcgaaggtc tgaaacacct ggttccggtc gaagacgaga actcctctaa actgctggaa 660

cgtgcattcc gcaaacgcgg tattaaattc aacctgggta ctttcttcag caaagcagag 720

tatacccaga atggtgttaa agtgactctg gccgacggta aggagtttga agccgaagtt 780

ctgctggtcg cagtaggtcg tggtcctgta tctcagggtc tgggctacga agaacagggt 840

gttgctatgg accgtggcta tgttctggtc gatgagtaca tgcgcaccaa cgtaccgacc 900

atctccgcag tgggtgacct ggtgcctacc ctgcaactgg ctcatgtagg tttcgcggaa 960

ggtatcctgg tagctgaacg tctggcgggc ctgaaaacgg ttccaatcga ttacgatggc 1020

gttccgcgcg tgacctattg ccacccggaa gtggcgtctg taggcatcac cgaagccaaa 1080

gcaaaagaaa tttacggtgc tgacaaagta gtcgctctga aatacaacct ggcgggtaac 1140

ggtaaaagca aaatcctgaa cactgctggt gaaatcaaac tggttcaggt taaagacggt 1200

gcagtggtcg gtgtgcacat ggttggcgac cgcatgggtg aacaggtggg cgaagcacag 1260

ctgatctata actgggaggc tctgccggct gaagttgcgc agctgatcca cgcgcacccg 1320

acccaaaacg aagctatggg cgaagctcac ctggctctgg ctggcaagcc gctgcattct 1380

cacgactaa 1389

<210> 6

<211> 3650

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 6

aatgtgcctg tcaaatggac gaagcaggga ttctgcaaac cctatgctac tccgtcaagc 60

cgtcaattgt ctgattcgtt accaattatg acaacttgac ggctacatca ttcacttttt 120

cttcacaacc ggcacggaac tcgctcgggc tggccccggt gcatttttta aatacccgcg 180

agaaatagag ttgatcgtca aaaccaacat tgcgaccgac ggtggcgata ggcatccggg 240

tggtgctcaa aagcagcttc gcctggctga tacgttggtc ctcgcgccag cttaagacgc 300

taatccctaa ctgctggcgg aaaagatgtg acagacgcga cggcgacaag caaacatgct 360

gtgcgacgct ggcgatatca aaattgctgt ctgccaggtg atcgctgatg tactgacaag 420

cctcgcgtac ccgattatcc atcggtggat ggagcgactc gttaatcgct tccatgcgcc 480

gcagtaacaa ttgctcaagc agatttatcg ccagcagctc cgaatagcgc ccttcccctt 540

gcccggcgtt aatgatttgc ccaaacaggt cgctgaaatg cggctggtgc gcttcatccg 600

ggcgaaagaa ccccgtattg gcaaatattg acggccagtt aagccattca tgccagtagg 660

cgcgcggacg aaagtaaacc cactggtgat accattcgcg agcctccgga tgacgaccgt 720

agtgatgaat ctctcctggc gggaacagca aaatatcacc cggtcggcaa acaaattctc 780

gtccctgatt tttcaccacc ccctgaccgc gaatggtgag attgagaata taacctttca 840

ttcccagcgg tcggtcgata aaaaaatcga gataaccgtt ggcctcaatc ggcgttaaac 900

ccgccaccag atgggcatta aacgagtatc ccggcagcag gggatcattt tgcgcttcag 960

ccatactttt catactcccg ccattcagag aagaaaccaa ttgtccatat tgcatcagac 1020

attgccgtca ctgcgtcttt tactggctct tctcgctaac caaaccggta accccgctta 1080

ttaaaagcat tctgtaacaa agcgggacca aagccatgac aaaaacgcgt aacaaaagtg 1140

tctataatca cggcagaaaa gtccacattg attatttgca cggcgtcaca ctttgctatg 1200

ccatagcatt tttatccata agattagcgg atcctacctg acgcttttta tcgcaactct 1260

ctactgtttc tccatacccg ttttttgggc taacaggagg aattaaccat gggtacctct 1320

catcatcatc atcatcacag cagcggcctg gtgccgcgcg gcagcctcga gggtagatct 1380

ggtactagtg gtgaattcgg tgagctcggt ctgcagctgg tgccgcgcgg cagccaccac 1440

caccaccacc actaatacag attaaatcag aacgcagaag cggtctgata aaacagaatt 1500

tgcctggcgg cagtagcgcg gtggtcccac ctgaccccat gccgaactca gaagtgaaac 1560

gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac tgccaggcat 1620

caaataaaac gaaaggctca gtcgaaagac tgggcctttc gtcgacgcgc tagcggagtg 1680

tatactggct tactatgttg gcactgatga gggtgtcagt gaagtgcttc atgtggcagg 1740

agaaaaaagg ctgcaccggt gcgtcagcag aatatgtgat acaggatata ttccgcttcc 1800

tcgctcactg actcgctacg ctcggtcgtt cgactgcggc gagcggaaat ggcttacgaa 1860

cggggcggag atttcctgga agatgccagg aagatactta acagggaagt gagagggccg 1920

cggcaaagcc gtttttccat aggctccgcc cccctgacaa gcatcacgaa atctgacgct 1980

caaatcagtg gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggcg 2040

gctccctcgt gcgctctcct gttcctgcct ttcggtttac cggtgtcatt ccgctgttat 2100

ggccgcgttt gtctcattcc acgcctgaca ctcagttccg ggtaggcagt tcgctccaag 2160

ctggactgta tgcacgaacc ccccgttcag tccgaccgct gcgccttatc cggtaactat 2220

cgtcttgagt ccaacccgga aagacatgca aaagcaccac tggcagcagc cactggtaat 2280

tgatttagag gagttagtct tgaagtcatg cgccggttaa ggctaaactg aaaggacaag 2340

ttttggtgac tgcgctcctc caagccagtt acctcggttc aaagagttgg tagctcagag 2400

aaccttcgaa aaaccgccct gcaaggcggt tttttcgttt tcagagcaag agattacgcg 2460

cagaccaaaa cgatctcaag aagatcatct tattaatcag ataaaatatt tctagatttc 2520

agtgcaattt atctcttcaa atgtagcacc tgaagtcagc cccatacgat ataagttgtg 2580

cggccgccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca 2640

ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagcc atattcaacg 2700

ggaaacgtct tgctctaggc cgcgattaaa ttccaacatg gatgctgatt tatatgggta 2760

taaatgggct cgcgataatg tcgggcaatc aggtgcgaca atctatcgat tgtatgggaa 2820

gcccgatgcg ccagagttgt ttctgaaaca tggcaaaggt agcgttgcca atgatgttac 2880

agatgagatg gtcagactaa actggctgac ggaatttatg cctcttccga ccatcaagca 2940

ttttatccgt actcctgatg atgcatggtt actcaccact gcgatccccg ggaaaacagc 3000

attccaggta ttagaagaat atcctgattc aggtgaaaat attgttgatg cgctggcagt 3060

gttcctgcgc cggttgcatt cgattcctgt ttgtaattgt ccttttaaca gcgaccgcgt 3120

atttcgtctc gctcaggcgc aatcacgaat gaataacggt ttggttgatg cgagtgattt 3180

tgatgacgag cgtaatggct ggcctgttga acaagtctgg aaagaaatgc ataaactttt 3240

gccattctca ccggattcag tcgtcactca tggtgatttc tcacttgata accttatttt 3300

tgacgagggg aaattaatag gttgtattga tgttggacga gtcggaatcg cagaccgata 3360

ccaggatctt gccatcctat ggaactgcct cggtgagttt tctccttcat tacagaaacg 3420

gctttttcaa aaatatggta ttgataatcc tgatatgaat aaattgcagt ttcatttgat 3480

gctcgatgag tttttctaag aattaattca tgagcggata catatttgaa tgtatttaga 3540

aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccactt gcggagaccc 3600

ggtcgtcagc ttgtcgtcgg ttcagggcag ggtcgttaaa tagcgcatgc 3650

<210> 7

<211> 381

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 7

Met Thr Val Met Glu Gln Arg Gly Ala Tyr Arg Pro Thr Pro Pro Pro

1 5 10 15

Ala Trp Gln Pro Arg Thr Asp Pro Ala Pro Leu Leu Pro Asp Ala Leu

20 25 30

Pro His Arg Val Leu Gly Thr Glu Ala Ala Ala Glu Ala Asp Pro Leu

35 40 45

Leu Leu Arg Arg Leu Tyr Ala Glu Leu Val Arg Gly Arg Arg Tyr Asn

50 55 60

Thr Gln Ala Thr Ala Leu Thr Lys Gln Gly Arg Leu Ala Val Tyr Pro

65 70 75 80

Ser Ser Thr Gly Gln Glu Ala Cys Glu Val Ala Ala Ala Leu Val Leu

85 90 95

Glu Glu Arg Asp Trp Leu Phe Pro Ser Tyr Arg Asp Thr Leu Ala Ala

100 105 110

Val Ala Arg Gly Leu Asp Pro Val Gln Ala Leu Thr Leu Leu Arg Gly

115 120 125

Asp Trp His Thr Gly Tyr Asp Pro Arg Glu His Arg Ile Ala Pro Leu

130 135 140

Cys Thr Pro Leu Ala Thr Gln Leu Pro His Ala Val Gly Leu Ala His

145 150 155 160

Ala Ala Arg Leu Lys Gly Asp Asp Val Val Ala Leu Ala Leu Val Gly

165 170 175

Asp Gly Gly Thr Ser Glu Gly Asp Phe His Glu Ala Leu Asn Phe Ala

180 185 190

Ala Val Trp Gln Ala Pro Val Val Phe Leu Val Gln Asn Asn Gly Phe

195 200 205

Ala Ile Ser Val Pro Leu Ala Lys Gln Thr Ala Ala Pro Ser Leu Ala

210 215 220

His Lys Ala Val Gly Tyr Gly Met Pro Gly Arg Leu Val Asp Gly Asn

225 230 235 240

Asp Ala Ala Ala Val His Glu Val Leu Ser Asp Ala Val Ala His Ala

245 250 255

Arg Ala Gly Gly Gly Pro Thr Leu Val Glu Ala Val Thr Tyr Arg Ile

260 265 270

Asp Ala His Thr Asn Ala Asp Asp Ala Thr Arg Tyr Arg Gly Asp Ser

275 280 285

Glu Val Glu Ala Trp Arg Ala His Asp Pro Ile Ala Leu Leu Glu His

290 295 300

Glu Leu Thr Glu Arg Gly Leu Leu Asp Glu Asp Gly Ile Arg Ala Ala

305 310 315 320

Arg Glu Asp Ala Glu Ala Met Ala Ala Asp Leu Arg Ala Arg Met Asn

325 330 335

Gln Asp Pro Ala Leu Asp Pro Met Asp Leu Phe Ala His Val Tyr Ala

340 345 350

Glu Pro Thr Pro Gln Leu Arg Glu Gln Glu Ala Gln Leu Arg Ala Glu

355 360 365

Leu Ala Ala Glu Ala Asp Gly Pro Gln Gly Val Gly Arg

370 375 380

<210> 8

<211> 334

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 8

Met Thr Thr Val Ala Leu Lys Pro Ala Thr Met Ala Gln Ala Leu Thr

1 5 10 15

Arg Ala Leu Arg Asp Ala Met Ala Ala Asp Pro Ala Val His Val Met

20 25 30

Gly Glu Asp Val Gly Thr Leu Gly Gly Val Phe Arg Val Thr Asp Gly

35 40 45

Leu Ala Lys Glu Phe Gly Glu Asp Arg Cys Thr Asp Thr Pro Leu Ala

50 55 60

Glu Ala Gly Ile Leu Gly Thr Ala Val Gly Met Ala Met Tyr Gly Leu

65 70 75 80

Arg Pro Val Val Glu Met Gln Phe Asp Ala Phe Ala Tyr Pro Ala Phe

85 90 95

Glu Gln Leu Ile Ser His Val Ala Arg Met Arg Asn Arg Thr Arg Gly

100 105 110

Ala Met Pro Leu Pro Ile Thr Ile Arg Val Pro Tyr Gly Gly Gly Ile

115 120 125

Gly Gly Val Glu His His Ser Asp Ser Ser Glu Ala Tyr Tyr Met Ala

130 135 140

Thr Pro Gly Leu His Val Val Thr Pro Ala Thr Val Ala Asp Ala Tyr

145 150 155 160

Gly Leu Leu Arg Ala Ala Ile Ala Ser Asp Asp Pro Val Val Phe Leu

165 170 175

Glu Pro Lys Arg Leu Tyr Trp Ser Lys Asp Ser Trp Asn Pro Asp Glu

180 185 190

Pro Gly Thr Val Glu Pro Ile Gly Arg Ala Val Val Arg Arg Ser Gly

195 200 205

Arg Ser Ala Thr Leu Ile Thr Tyr Gly Pro Ser Leu Pro Val Cys Leu

210 215 220

Glu Ala Ala Glu Ala Ala Arg Ala Glu Gly Trp Asp Leu Glu Val Val

225 230 235 240

Asp Leu Arg Ser Leu Val Pro Phe Asp Asp Glu Thr Val Cys Ala Ser

245 250 255

Val Arg Arg Thr Gly Arg Ala Val Val Val His Glu Ser Gly Gly Tyr

260 265 270

Gly Gly Pro Gly Gly Glu Ile Ala Ala Arg Ile Thr Glu Arg Cys Phe

275 280 285

His His Leu Glu Ala Pro Val Leu Arg Val Ala Gly Phe Asp Ile Pro

290 295 300

Tyr Pro Pro Pro Met Leu Glu Arg His His Leu Pro Gly Val Asp Arg

305 310 315 320

Ile Leu Asp Ala Val Gly Arg Leu Gln Trp Glu Ala Gly Ser

325 330

<210> 9

<211> 455

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 9

Met Ala Gln Val Leu Glu Phe Lys Leu Pro Asp Leu Gly Glu Gly Leu

1 5 10 15

Thr Glu Ala Glu Ile Val Arg Trp Leu Val Gln Val Gly Asp Val Val

20 25 30

Ala Ile Asp Gln Pro Val Val Glu Val Glu Thr Ala Lys Ala Met Val

35 40 45

Glu Val Pro Cys Pro Tyr Gly Gly Val Val Thr Ala Arg Phe Gly Glu

50 55 60

Glu Gly Thr Glu Leu Pro Val Gly Ser Pro Leu Leu Thr Val Ala Val

65 70 75 80

Gly Ala Pro Ser Ser Val Pro Ala Ala Ser Ser Leu Ser Gly Ala Thr

85 90 95

Ser Ala Ser Ser Ala Ser Ser Val Ser Ser Asp Asp Gly Glu Ser Ser

100 105 110

Gly Asn Val Leu Val Gly Tyr Gly Thr Ser Ala Ala Pro Ala Arg Arg

115 120 125

Arg Arg Val Arg Pro Gly Gln Ala Ala Pro Val Val Thr Ala Thr Ala

130 135 140

Ala Ala Ala Ala Thr Arg Val Ala Ala Pro Glu Arg Ser Asp Gly Pro

145 150 155 160

Val Pro Val Ile Ser Pro Leu Val Arg Arg Leu Ala Arg Glu Asn Gly

165 170 175

Leu Asp Leu Arg Ala Leu Ala Gly Ser Gly Pro Asp Gly Leu Ile Leu

180 185 190

Arg Ser Asp Val Glu Gln Ala Leu Arg Ala Ala Pro Thr Pro Ala Pro

195 200 205

Thr Pro Thr Met Pro Pro Ala Pro Thr Pro Ala Pro Thr Pro Ala Ala

210 215 220

Ala Pro Arg Gly Thr Arg Ile Pro Leu Arg Gly Val Arg Gly Ala Val

225 230 235 240

Ala Asp Lys Leu Ser Arg Ser Arg Arg Glu Ile Pro Asp Ala Thr Cys

245 250 255

Trp Val Asp Ala Asp Ala Thr Ala Leu Met His Ala Arg Val Ala Met

260 265 270

Asn Ala Thr Gly Gly Pro Lys Ile Ser Leu Ile Ala Leu Leu Ala Arg

275 280 285

Ile Cys Thr Ala Ala Leu Ala Arg Phe Pro Glu Leu Asn Ser Thr Val

290 295 300

Asp Met Asp Ala Arg Glu Val Val Arg Leu Asp Gln Val His Leu Gly

305 310 315 320

Phe Ala Ala Gln Thr Glu Arg Gly Leu Val Val Pro Val Val Arg Asp

325 330 335

Ala His Ala Arg Asp Ala Glu Ser Leu Ser Ala Glu Phe Ala Arg Leu

340 345 350

Thr Glu Ala Ala Arg Thr Gly Thr Leu Thr Pro Gly Glu Leu Thr Gly

355 360 365

Gly Thr Phe Thr Leu Asn Asn Tyr Gly Val Phe Gly Val Asp Gly Ser

370 375 380

Thr Pro Ile Ile Asn His Pro Glu Ala Ala Met Leu Gly Val Gly Arg

385 390 395 400

Ile Ile Pro Lys Pro Trp Val His Glu Gly Glu Leu Ala Val Arg Gln

405 410 415

Val Val Gln Leu Ser Leu Thr Phe Asp His Arg Val Cys Asp Gly Gly

420 425 430

Thr Ala Gly Gly Phe Leu Arg Tyr Val Ala Asp Cys Val Glu Gln Pro

435 440 445

Ala Val Leu Leu Arg Thr Leu

450 455

<210> 10

<211> 406

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 10

Met Thr Val Glu Ser Thr Ala Ala Arg Lys Pro Arg Arg Ser Ala Gly

1 5 10 15

Thr Lys Ser Ala Ala Ala Lys Arg Thr Ser Pro Gly Ala Lys Lys Ser

20 25 30

Pro Ser Thr Thr Gly Ala Glu His Glu Leu Ile Gln Leu Leu Thr Pro

35 40 45

Asp Gly Arg Arg Val Lys Asn Pro Glu Tyr Asp Ala Tyr Val Ala Asp

50 55 60

Ile Thr Pro Glu Glu Leu Arg Gly Leu Tyr Arg Asp Met Val Leu Ser

65 70 75 80

Arg Arg Phe Asp Ala Glu Ala Thr Ser Leu Gln Arg Gln Gly Glu Leu

85 90 95

Gly Leu Trp Ala Ser Met Leu Gly Gln Glu Ala Ala Gln Ile Gly Ser

100 105 110

Gly Arg Ala Thr Arg Asp Asp Asp Tyr Val Phe Pro Thr Tyr Arg Glu

115 120 125

His Gly Val Ala Trp Cys Arg Gly Val Asp Pro Thr Asn Leu Leu Gly

130 135 140

Met Phe Arg Gly Val Asn Asn Gly Gly Trp Asp Pro Asn Ser Asn Asn

145 150 155 160

Phe His Leu Tyr Thr Ile Val Ile Gly Ser Gln Thr Leu His Ala Thr

165 170 175

Gly Tyr Ala Met Gly Ile Ala Lys Asp Gly Ala Asp Ser Ala Val Ile

180 185 190

Ala Tyr Phe Gly Asp Gly Ala Ser Ser Gln Gly Asp Val Ala Glu Ser

195 200 205

Phe Thr Phe Ser Ala Val Tyr Asn Ala Pro Val Val Phe Phe Cys Gln

210 215 220

Asn Asn Gln Trp Ala Ile Ser Glu Pro Thr Glu Lys Gln Thr Arg Val

225 230 235 240

Pro Leu Tyr Gln Arg Ala Gln Gly Tyr Gly Phe Pro Gly Val Arg Val

245 250 255

Asp Gly Asn Asp Val Leu Ala Cys Leu Ala Val Thr Lys Trp Ala Leu

260 265 270

Glu Arg Ala Arg Arg Gly Glu Gly Pro Thr Leu Val Glu Ala Phe Thr

275 280 285

Tyr Arg Met Gly Ala His Thr Thr Ser Asp Asp Pro Thr Lys Tyr Arg

290 295 300

Ala Asp Glu Glu Arg Glu Ala Trp Glu Ala Lys Asp Pro Ile Leu Arg

305 310 315 320

Leu Arg Thr Tyr Leu Glu Ala Ser Asn His Ala Asp Glu Gly Phe Phe

325 330 335

Ala Glu Leu Glu Val Glu Ser Glu Ala Leu Gly Arg Arg Val Arg Glu

340 345 350

Val Val Arg Ala Met Pro Asp Pro Asp His Phe Ala Ile Phe Glu Asn

355 360 365

Val Tyr Ala Asp Gly His Ala Leu Val Asp Glu Glu Arg Ala Gln Phe

370 375 380

Ala Ala Tyr Gln Ala Ser Phe Thr Thr Glu Pro Asp Gly Gly Ser Ala

385 390 395 400

Ala Gly Gln Gly Gly Asn

405

<210> 11

<211> 325

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 11

Met Ala Glu Lys Met Ala Ile Ala Lys Ala Ile Asn Glu Ser Leu Arg

1 5 10 15

Lys Ala Leu Glu Ser Asp Pro Lys Val Leu Ile Met Gly Glu Asp Val

20 25 30

Gly Lys Leu Gly Gly Val Phe Arg Val Thr Asp Gly Leu Gln Lys Asp

35 40 45

Phe Gly Glu Glu Arg Val Ile Asp Thr Pro Leu Ala Glu Ser Gly Ile

50 55 60

Val Gly Thr Ala Ile Gly Leu Ala Leu Arg Gly Tyr Arg Pro Val Val

65 70 75 80

Glu Ile Gln Phe Asp Gly Phe Val Phe Pro Ala Tyr Asp Gln Ile Val

85 90 95

Thr Gln Leu Ala Lys Met His Ala Arg Ala Leu Gly Lys Ile Lys Leu

100 105 110

Pro Val Val Val His Ile Pro Tyr Gly Gly Gly Ile Gly Ala Val Glu

115 120 125

His His Ser Glu Ser Pro Glu Ala Leu Phe Ala His Val Ala Gly Leu

130 135 140

Lys Val Val Ser Pro Ser Asn Ala Ser Asp Ala Tyr Trp Met Met Gln

145 150 155 160

Gln Ala Ile Gln Ser Asp Asp Pro Val Ile Phe Phe Glu Ser Lys Arg

165 170 175

Arg Tyr Trp Asp Lys Gly Glu Val Asn Val Glu Ala Ile Pro Asp Pro

180 185 190

Leu His Lys Ala Arg Val Val Arg Glu Gly Thr Asp Leu Thr Leu Ala

195 200 205

Ala Tyr Gly Pro Met Val Lys Val Cys Gln Glu Ala Ala Ala Ala Ala

210 215 220

Glu Glu Glu Gly Lys Ser Leu Glu Val Val Asp Leu Arg Ser Met Ser

225 230 235 240

Pro Ile Asp Phe Asp Ala Val Gln Ala Ser Val Glu Lys Thr Arg Arg

245 250 255

Leu Val Val Val His Glu Ala Pro Val Phe Leu Gly Thr Gly Ala Glu

260 265 270

Ile Ala Ala Arg Ile Thr Glu Arg Cys Phe Tyr His Leu Glu Ala Pro

275 280 285

Val Leu Arg Val Gly Gly Tyr His Ala Pro Tyr Pro Pro Ala Arg Leu

290 295 300

Glu Glu Glu Tyr Leu Pro Gly Leu Asp Arg Val Leu Asp Ala Val Asp

305 310 315 320

Arg Ser Leu Ala Tyr

325

<210> 12

<211> 462

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 12

Met Thr Glu Ala Ser Val Arg Glu Phe Lys Met Pro Asp Val Gly Glu

1 5 10 15

Gly Leu Thr Glu Ala Glu Ile Leu Lys Trp Tyr Val Gln Pro Gly Asp

20 25 30

Thr Val Thr Asp Gly Gln Val Val Cys Glu Val Glu Thr Ala Lys Ala

35 40 45

Ala Val Glu Leu Pro Ile Pro Tyr Asp Gly Val Val Arg Glu Leu Arg

50 55 60

Phe Pro Glu Gly Thr Thr Val Asp Val Gly Gln Val Ile Ile Ala Val

65 70 75 80

Asp Val Ala Gly Asp Ala Pro Val Ala Glu Ile Pro Val Pro Ala Gln

85 90 95

Glu Ala Pro Val Gln Glu Glu Pro Lys Pro Glu Gly Arg Lys Pro Val

100 105 110

Leu Val Gly Tyr Gly Val Ala Glu Ser Ser Thr Lys Arg Arg Pro Arg

115 120 125

Lys Ser Ala Pro Ala Ser Glu Pro Ala Ala Glu Gly Thr Tyr Phe Ala

130 135 140

Ala Thr Val Leu Gln Gly Ile Gln Gly Glu Leu Asn Gly His Gly Ala

145 150 155 160

Val Lys Gln Arg Pro Leu Ala Lys Pro Pro Val Arg Lys Leu Ala Lys

165 170 175

Asp Leu Gly Val Asp Leu Ala Thr Ile Thr Pro Ser Gly Pro Asp Gly

180 185 190

Val Ile Thr Arg Glu Asp Val His Ala Ala Val Ala Pro Pro Pro Pro

195 200 205

Ala Pro Gln Pro Val Gln Thr Pro Ala Ala Pro Ala Pro Ala Pro Val

210 215 220

Ala Ala Tyr Asp Thr Ala Arg Glu Thr Arg Val Pro Val Lys Gly Val

225 230 235 240

Arg Lys Ala Thr Ala Ala Ala Met Val Gly Ser Ala Phe Thr Ala Pro

245 250 255

His Val Thr Glu Phe Val Thr Val Asp Val Thr Arg Thr Met Lys Leu

260 265 270

Val Glu Glu Leu Lys Gln Asp Lys Glu Phe Thr Gly Leu Arg Val Asn

275 280 285

Pro Leu Leu Leu Ile Ala Lys Ala Leu Leu Val Ala Ile Lys Arg Asn

290 295 300

Pro Asp Ile Asn Ala Ser Trp Asp Glu Ala Asn Gln Glu Ile Val Leu

305 310 315 320

Lys His Tyr Val Asn Leu Gly Ile Ala Ala Ala Thr Pro Arg Gly Leu

325 330 335

Ile Val Pro Asn Ile Lys Asp Ala His Ala Lys Thr Leu Pro Gln Leu

340 345 350

Ala Glu Ser Leu Gly Glu Leu Val Ser Thr Ala Arg Glu Gly Lys Thr

355 360 365

Ser Pro Thr Ala Met Gln Gly Gly Thr Val Thr Ile Thr Asn Val Gly

370 375 380

Val Phe Gly Val Asp Thr Gly Thr Pro Ile Leu Asn Pro Gly Glu Ser

385 390 395 400

Ala Ile Leu Ala Val Gly Ala Ile Lys Leu Gln Pro Trp Val His Lys

405 410 415

Gly Lys Val Lys Pro Arg Gln Val Thr Thr Leu Ala Leu Ser Phe Asp

420 425 430

His Arg Leu Val Asp Gly Glu Leu Gly Ser Lys Val Leu Ala Asp Val

435 440 445

Ala Ala Ile Leu Glu Gln Pro Lys Arg Leu Ile Thr Trp Ala

450 455 460

<210> 13

<211> 462

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 13

Met Ala Asn Asp Ala Ser Thr Val Phe Asp Leu Val Ile Leu Gly Gly

1 5 10 15

Gly Ser Gly Gly Tyr Ala Ala Ala Leu Arg Gly Ala Gln Leu Gly Leu

20 25 30

Asp Val Ala Leu Ile Glu Lys Asp Lys Val Gly Gly Thr Cys Leu His

35 40 45

Arg Gly Cys Ile Pro Thr Lys Ala Leu Leu His Ala Gly Glu Ile Ala

50 55 60

Asp Gln Ala Arg Glu Ser Glu Gln Phe Gly Val Lys Ala Thr Phe Glu

65 70 75 80

Gly Ile Asp Val Pro Ala Val His Lys Tyr Lys Asp Gly Val Ile Ser

85 90 95

Gly Leu Tyr Lys Gly Leu Gln Gly Leu Ile Ala Ser Arg Lys Val Thr

100 105 110

Tyr Ile Glu Gly Glu Gly Arg Leu Ser Ser Pro Thr Ser Val Asp Val

115 120 125

Asn Gly Gln Arg Val Gln Gly Arg His Val Leu Leu Ala Thr Gly Ser

130 135 140

Val Pro Lys Ser Leu Pro Gly Leu Ala Ile Asp Gly Asn Arg Ile Ile

145 150 155 160

Ser Ser Asp His Ala Leu Val Leu Asp Arg Val Pro Glu Ser Ala Ile

165 170 175

Val Leu Gly Gly Gly Val Ile Gly Val Glu Phe Ala Ser Ala Trp Lys

180 185 190

Ser Phe Gly Ala Asp Val Thr Val Ile Glu Gly Leu Lys His Leu Val

195 200 205

Pro Val Glu Asp Glu Asn Ser Ser Lys Leu Leu Glu Arg Ala Phe Arg

210 215 220

Lys Arg Gly Ile Lys Phe Asn Leu Gly Thr Phe Phe Ser Lys Ala Glu

225 230 235 240

Tyr Thr Gln Asn Gly Val Lys Val Thr Leu Ala Asp Gly Lys Glu Phe

245 250 255

Glu Ala Glu Val Leu Leu Val Ala Val Gly Arg Gly Pro Val Ser Gln

260 265 270

Gly Leu Gly Tyr Glu Glu Gln Gly Val Ala Met Asp Arg Gly Tyr Val

275 280 285

Leu Val Asp Glu Tyr Met Arg Thr Asn Val Pro Thr Ile Ser Ala Val

290 295 300

Gly Asp Leu Val Pro Thr Leu Gln Leu Ala His Val Gly Phe Ala Glu

305 310 315 320

Gly Ile Leu Val Ala Glu Arg Leu Ala Gly Leu Lys Thr Val Pro Ile

325 330 335

Asp Tyr Asp Gly Val Pro Arg Val Thr Tyr Cys His Pro Glu Val Ala

340 345 350

Ser Val Gly Ile Thr Glu Ala Lys Ala Lys Glu Ile Tyr Gly Ala Asp

355 360 365

Lys Val Val Ala Leu Lys Tyr Asn Leu Ala Gly Asn Gly Lys Ser Lys

370 375 380

Ile Leu Asn Thr Ala Gly Glu Ile Lys Leu Val Gln Val Lys Asp Gly

385 390 395 400

Ala Val Val Gly Val His Met Val Gly Asp Arg Met Gly Glu Gln Val

405 410 415

Gly Glu Ala Gln Leu Ile Tyr Asn Trp Glu Ala Leu Pro Ala Glu Val

420 425 430

Ala Gln Leu Ile His Ala His Pro Thr Gln Asn Glu Ala Met Gly Glu

435 440 445

Ala His Leu Ala Leu Ala Gly Lys Pro Leu His Ser His Asp

450 455 460

<210> 14

<211> 1545

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 14

atggctgact cgcaacccct gtccggtgct ccggaaggtg ccgaatattt aagagcagtg 60

ctgcgcgcgc cggtttacga ggcggcgcag gttacgccgc tacaaaaaat ggaaaaactg 120

tcgtcgcgtc ttgataacgt cattctggtg aagcgcgaag atcgccagcc agtgcacagc 180

tttaagctgc gcggcgcata cgccatgatg gcgggcctga cggaagaaca gaaagcgcac 240

ggcgtgatca ctgcttctgc gggtaaccac gcgcagggcg tcgcgttttc ttctgcgcgg 300

ttaggcgtga aggccctgat cgttatgcca accgccaccg ccgacatcaa agtcgacgcg 360

gtgcgcggct tcggcggcga agtgctgctc cacggcgcga actttgatga agcgaaagcc 420

aaagcgatcg aactgtcaca gcagcagggg ttcacctggg tgccgccgtt cgaccatccg 480

atggtgattg ccgggcaagg cacgctggcg ctggaactgc tccagcagga cgcccatctc 540

gaccgcgtat ttgtgccagt cggcggcggc ggtctggctg ctggcgtggc ggtgctgatc 600

aaacaactga tgccgcaaat caaagtgatc gccgtagaag cggaagactc cgcctgcctg 660

aaagcagcgc tggatgcggg tcatccggtt gatctgccgc gcgtagggct atttgctgaa 720

ggcgtagcgg taaaacgcat cggtgacgaa accttccgtt tatgccagga gtatctcgac 780

gacatcatca ccgtcgatag cgatgcgatc tgtgcggcga tgaaggattt attcgaagat 840

gtgcgcgcgg tggcggaacc ctctggcgcg ctggcgctgg cgggaatgaa aaaatatatc 900

gccctgcaca acattcgcgg cgaacggctg gcgcatattc tttccggtgc caacgtgaac 960

ttccacggcc tgcgctacgt ctcagaacgc tgcgaactgg gcgaacagcg tgaagcgttg 1020

ttggcggtga ccattccgga agaaaaaggc agcttcctca aattctgcca actgcttggc 1080

gggcgttcgg tcaccgagtt caactaccgt tttgccgatg ccaaaaacgc ctgcatcttt 1140

gtcggtgtgc gcctgagccg cggcctcgaa gagcgcaaag aaattttgca gatgctcaac 1200

gacggcggct acagcgtggt tgatctctcc gacgacgaaa tggcgaagct acacgtgcgc 1260

tatatggtcg gcggacgtcc atcgcatccg ttgcaggaac gcctctacag cttcgaattc 1320

ccggaatcac cgggcgcgct gctgcgcttc ctcaacacgc tgggtacgta ctggaacatt 1380

tctttgttcc actatcgcag ccatggcacc gactacgggc gcgtactggc ggcgttcgaa 1440

cttggcgacc atgaaccgga tttcgaaacc cggctgaatg agctgggcta cgattgccac 1500

gacgaaacca ataacccggc gttcaggttc tttttggcgg gttag 1545

<210> 15

<211> 514

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 15

Met Ala Asp Ser Gln Pro Leu Ser Gly Ala Pro Glu Gly Ala Glu Tyr

1 5 10 15

Leu Arg Ala Val Leu Arg Ala Pro Val Tyr Glu Ala Ala Gln Val Thr

20 25 30

Pro Leu Gln Lys Met Glu Lys Leu Ser Ser Arg Leu Asp Asn Val Ile

35 40 45

Leu Val Lys Arg Glu Asp Arg Gln Pro Val His Ser Phe Lys Leu Arg

50 55 60

Gly Ala Tyr Ala Met Met Ala Gly Leu Thr Glu Glu Gln Lys Ala His

65 70 75 80

Gly Val Ile Thr Ala Ser Ala Gly Asn His Ala Gln Gly Val Ala Phe

85 90 95

Ser Ser Ala Arg Leu Gly Val Lys Ala Leu Ile Val Met Pro Thr Ala

100 105 110

Thr Ala Asp Ile Lys Val Asp Ala Val Arg Gly Phe Gly Gly Glu Val

115 120 125

Leu Leu His Gly Ala Asn Phe Asp Glu Ala Lys Ala Lys Ala Ile Glu

130 135 140

Leu Ser Gln Gln Gln Gly Phe Thr Trp Val Pro Pro Phe Asp His Pro

145 150 155 160

Met Val Ile Ala Gly Gln Gly Thr Leu Ala Leu Glu Leu Leu Gln Gln

165 170 175

Asp Ala His Leu Asp Arg Val Phe Val Pro Val Gly Gly Gly Gly Leu

180 185 190

Ala Ala Gly Val Ala Val Leu Ile Lys Gln Leu Met Pro Gln Ile Lys

195 200 205

Val Ile Ala Val Glu Ala Glu Asp Ser Ala Cys Leu Lys Ala Ala Leu

210 215 220

Asp Ala Gly His Pro Val Asp Leu Pro Arg Val Gly Leu Phe Ala Glu

225 230 235 240

Gly Val Ala Val Lys Arg Ile Gly Asp Glu Thr Phe Arg Leu Cys Gln

245 250 255

Glu Tyr Leu Asp Asp Ile Ile Thr Val Asp Ser Asp Ala Ile Cys Ala

260 265 270

Ala Met Lys Asp Leu Phe Glu Asp Val Arg Ala Val Ala Glu Pro Ser

275 280 285

Gly Ala Leu Ala Leu Ala Gly Met Lys Lys Tyr Ile Ala Leu His Asn

290 295 300

Ile Arg Gly Glu Arg Leu Ala His Ile Leu Ser Gly Ala Asn Val Asn

305 310 315 320

Phe His Gly Leu Arg Tyr Val Ser Glu Arg Cys Glu Leu Gly Glu Gln

325 330 335

Arg Glu Ala Leu Leu Ala Val Thr Ile Pro Glu Glu Lys Gly Ser Phe

340 345 350

Leu Lys Phe Cys Gln Leu Leu Gly Gly Arg Ser Val Thr Glu Phe Asn

355 360 365

Tyr Arg Phe Ala Asp Ala Lys Asn Ala Cys Ile Phe Val Gly Val Arg

370 375 380

Leu Ser Arg Gly Leu Glu Glu Arg Lys Glu Ile Leu Gln Met Leu Asn

385 390 395 400

Asp Gly Gly Tyr Ser Val Val Asp Leu Ser Asp Asp Glu Met Ala Lys

405 410 415

Leu His Val Arg Tyr Met Val Gly Gly Arg Pro Ser His Pro Leu Gln

420 425 430

Glu Arg Leu Tyr Ser Phe Glu Phe Pro Glu Ser Pro Gly Ala Leu Leu

435 440 445

Arg Phe Leu Asn Thr Leu Gly Thr Tyr Trp Asn Ile Ser Leu Phe His

450 455 460

Tyr Arg Ser His Gly Thr Asp Tyr Gly Arg Val Leu Ala Ala Phe Glu

465 470 475 480

Leu Gly Asp His Glu Pro Asp Phe Glu Thr Arg Leu Asn Glu Leu Gly

485 490 495

Tyr Asp Cys His Asp Glu Thr Asn Asn Pro Ala Phe Arg Phe Phe Leu

500 505 510

Ala Gly

<210> 16

<211> 930

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 16

atgaccacga agaaagctga ttacatttgg ttcaatgggg agatggttcg ctgggaagac 60

gcgaaggtgc atgtgatgtc gcacgcgctg cactatggca cttcggtttt tgaaggcatc 120

cgttgctacg actcgcacaa aggaccggtt gtattccgcc atcgtgagca tatgcagcgt 180

ctgcatgact ccgccaaaat ctatcgcttc ccggtttcgc agagcattga tgagctgatg 240

gaagcttgtc gtgacgtgat ccgcaaaaac aatctcacca gcgcctatat ccgtccgctg 300

atcttcgtcg gtgatgttgg catgggagta aacccgccag cgggatactc aaccgacgtg 360

attatcgctg ctttcccgtg gggagcgtat ctgggcgcag aagcgctgga gcaggggatc 420

gatgcgatgg tttcctcctg gaaccgcgca gcaccaaaca ccatcccgac ggcggcaaaa 480

gccggtggta actacctctc ttccctgctg gtgggtagcg aagcgcgccg ccacggttat 540

caggaaggta tcgcgctgga tgtgaacggt tatatctctg aaggcgcagg cgaaaacctg 600

tttgaagtga aagatggtgt gctgttcacc ccaccgttca cctcctccgc gctgccgggt 660

attacccgtg atgccatcat caaactggcg aaagagctgg gaattgaagt acgtgagcag 720

gtgctgtcgc gcgaatccct gtacctggcg gatgaagtgt ttatgtccgg tacggcggca 780

gaaatcacgc cagtgcgcag cgtagacggt attcaggttg gcgaaggccg ttgtggcccg 840

gttaccaaac gcattcagca agccttcttc ggcctcttca ctggcgaaac cgaagataaa 900

tggggctggt tagatcaagt taatcaataa 930

<210> 17

<211> 309

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 17

Met Thr Thr Lys Lys Ala Asp Tyr Ile Trp Phe Asn Gly Glu Met Val

1 5 10 15

Arg Trp Glu Asp Ala Lys Val His Val Met Ser His Ala Leu His Tyr

20 25 30

Gly Thr Ser Val Phe Glu Gly Ile Arg Cys Tyr Asp Ser His Lys Gly

35 40 45

Pro Val Val Phe Arg His Arg Glu His Met Gln Arg Leu His Asp Ser

50 55 60

Ala Lys Ile Tyr Arg Phe Pro Val Ser Gln Ser Ile Asp Glu Leu Met

65 70 75 80

Glu Ala Cys Arg Asp Val Ile Arg Lys Asn Asn Leu Thr Ser Ala Tyr

85 90 95

Ile Arg Pro Leu Ile Phe Val Gly Asp Val Gly Met Gly Val Asn Pro

100 105 110

Pro Ala Gly Tyr Ser Thr Asp Val Ile Ile Ala Ala Phe Pro Trp Gly

115 120 125

Ala Tyr Leu Gly Ala Glu Ala Leu Glu Gln Gly Ile Asp Ala Met Val

130 135 140

Ser Ser Trp Asn Arg Ala Ala Pro Asn Thr Ile Pro Thr Ala Ala Lys

145 150 155 160

Ala Gly Gly Asn Tyr Leu Ser Ser Leu Leu Val Gly Ser Glu Ala Arg

165 170 175

Arg His Gly Tyr Gln Glu Gly Ile Ala Leu Asp Val Asn Gly Tyr Ile

180 185 190

Ser Glu Gly Ala Gly Glu Asn Leu Phe Glu Val Lys Asp Gly Val Leu

195 200 205

Phe Thr Pro Pro Phe Thr Ser Ser Ala Leu Pro Gly Ile Thr Arg Asp

210 215 220

Ala Ile Ile Lys Leu Ala Lys Glu Leu Gly Ile Glu Val Arg Glu Gln

225 230 235 240

Val Leu Ser Arg Glu Ser Leu Tyr Leu Ala Asp Glu Val Phe Met Ser

245 250 255

Gly Thr Ala Ala Glu Ile Thr Pro Val Arg Ser Val Asp Gly Ile Gln

260 265 270

Val Gly Glu Gly Arg Cys Gly Pro Val Thr Lys Arg Ile Gln Gln Ala

275 280 285

Phe Phe Gly Leu Phe Thr Gly Glu Thr Glu Asp Lys Trp Gly Trp Leu

290 295 300

Asp Gln Val Asn Gln

305

<210> 18

<211> 2760

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 18

atgactgatt ttttacgcga tgacatcagg ttcctcggtc aaatcctcgg tgaggtaatt 60

gcggaacaag aaggccagga ggtttatgaa ctggtcgaac aagcgcgcct gacttctttt 120

gatatcgcca agggcaacgc cgaaatggat agcctggttc aggttttcga cggcattact 180

ccagccaagg caacaccgat tgctcgcgca ttttcccact tcgctctgct ggctaacctg 240

gcggaagacc tctacgatga agagcttcgt gaacaggctc tcgatgcagg cgacacccct 300

ccggacagca ctcttgatgc cacctggctg aaactcaatg agggcaatgt tggcgcagaa 360

gctgtggccg atgtgctgcg caatgctgag gtggcgccgg ttctgactgc gcacccaact 420

gagactcgcc gccgcactgt ttttgatgcg caaaagtgga tcaccaccca catgcgtgaa 480

cgccacgctt tgcagtctgc ggagcctacc gctcgtacgc aaagcaagtt ggatgagatc 540

gagaagaaca tccgccgtcg catcaccatt ttgtggcaga ccgcgttgat tcgtgtggcc 600

cgcccacgta tcgaggacga gatcgaagta gggctgcgct actacaagct gagccttttg 660

gaagagattc cacgtatcaa ccgtgatgtg gctgttgagc ttcgtgagcg tttcggcgag 720

ggtgttcctt tgaagcccgt ggtcaagcca ggttcctgga ttggtggaga ccacgacggt 780

aacccttatg tcaccgcgga aacagttgag tattccactc accgcgctgc ggaaaccgtg 840

ctcaagtact atgcacgcca gctgcattcc ctcgagcatg agctcagcct gtcggaccgc 900

atgaataagg tcaccccgca gctgcttgcg ctggcagatg cagggcacaa cgacgtgcca 960

agccgcgtgg atgagcctta tcgacgcgcc gtccatggcg ttcgcggacg tatcctcgcg 1020

acgacggccg agctgatcgg cgaggacgcc gttgagggcg tgtggttcaa ggtctttact 1080

ccatacgcat ctccggaaga attcttaaac gatgcgttga ccattgatca ttctctgcgt 1140

gaatccaagg acgttctcat tgccgatgat cgtttgtctg tgctgatttc tgccatcgag 1200

agctttggat tcaaccttta cgcactggat ctgcgccaaa actccgaaag ctacgaggac 1260

gtcctcaccg agcttttcga acgcgcccaa gtcaccgcaa actaccgcga gctgtctgaa 1320

gcagagaagc ttgaggtgct gctgaaggaa ctgcgcagcc ctcgtccgct gatcccgcac 1380

ggttcagatg aatacagcga ggtcaccgac cgcgagctcg gcatcttccg caccgcgtcg 1440

gaggctgtta agaaattcgg gccacggatg gtgcctcact gcatcatctc catggcatca 1500

tcggtcaccg atgtgctcga gccgatggtg ttgctcaagg aattcggact catcgcagcc 1560

aacggcgaca acccacgcgg caccgtcgat gtcatcccac tgttcgaaac catcgaagat 1620

ctccaggccg gcgccggaat cctcgacgaa ctgtggaaaa ttgatctcta ccgcaactac 1680

ctcctgcagc gcgacaacgt ccaggaagtc atgctcggtt actccgattc caacaaggat 1740

ggcggatatt tctccgcaaa ctgggcgctt tacgacgcgg aactgcagct cgtcgaacta 1800

tgccgatcag ccggggtcaa gcttcgcctg ttccacggcc gtggtggcac cgtcggccgc 1860

ggtggcggac cttcctacga cgcgattctt gcccagccca ggggggctgt ccaaggttcc 1920

gtgcgcatca ccgagcaggg cgagatcatc tccgctaagt acggcaaccc cgaaaccgcg 1980

cgccgaaacc tcgaagccct ggtctcagcc acgcttgagg catcgcttct cgacgtctcc 2040

gaactcaccg atcaccaacg cgcgtacgac atcatgagtg agatctctga gctcagcttg 2100

aagaagtacg cctccttggt gcacgaggat caaggcttca tcgattactt cacccagtcc 2160

acgccgctgc aggagattgg atccctcaac atcggatcca ggccttcctc acgcaagcag 2220

acctcctcgg tggaagattt gcgagccatc ccatgggtgc tcagctggtc acagtctcgt 2280

gtcatgctgc caggctggtt tggtgtcgga accgcattag agcagtggat tggcgaaggg 2340

gagcaggcca cccaacgcat tgccgagctg caaacactca atgagtcctg gccatttttc 2400

acctcagtgt tggataacat ggctcaggtg atgtccaagg cagagctgcg tttggcaaag 2460

ctctacgcag acctgatccc agatacggaa gtagccgagc gagtctattc cgtcatccgc 2520

gaggagtact tcctgaccaa gaagatgttc tgcgtaatca ccggctctga tgatctgctt 2580

gatgacaacc cacttctcgc acgctctgtc cagcgccgat acccctacct gcttccactc 2640

aacgtgatcc aggtagagat gatgcgacgc taccgaaaag gcgaccaaag cgagcaagtg 2700

tcccgcaaca ttcagctgac catgaacggt ctttccactg cgctgcgcaa ctccggctag 2760

<210> 19

<211> 919

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 19

Met Thr Asp Phe Leu Arg Asp Asp Ile Arg Phe Leu Gly Gln Ile Leu

1 5 10 15

Gly Glu Val Ile Ala Glu Gln Glu Gly Gln Glu Val Tyr Glu Leu Val

20 25 30

Glu Gln Ala Arg Leu Thr Ser Phe Asp Ile Ala Lys Gly Asn Ala Glu

35 40 45

Met Asp Ser Leu Val Gln Val Phe Asp Gly Ile Thr Pro Ala Lys Ala

50 55 60

Thr Pro Ile Ala Arg Ala Phe Ser His Phe Ala Leu Leu Ala Asn Leu

65 70 75 80

Ala Glu Asp Leu Tyr Asp Glu Glu Leu Arg Glu Gln Ala Leu Asp Ala

85 90 95

Gly Asp Thr Pro Pro Asp Ser Thr Leu Asp Ala Thr Trp Leu Lys Leu

100 105 110

Asn Glu Gly Asn Val Gly Ala Glu Ala Val Ala Asp Val Leu Arg Asn

115 120 125

Ala Glu Val Ala Pro Val Leu Thr Ala His Pro Thr Glu Thr Arg Arg

130 135 140

Arg Thr Val Phe Asp Ala Gln Lys Trp Ile Thr Thr His Met Arg Glu

145 150 155 160

Arg His Ala Leu Gln Ser Ala Glu Pro Thr Ala Arg Thr Gln Ser Lys

165 170 175

Leu Asp Glu Ile Glu Lys Asn Ile Arg Arg Arg Ile Thr Ile Leu Trp

180 185 190

Gln Thr Ala Leu Ile Arg Val Ala Arg Pro Arg Ile Glu Asp Glu Ile

195 200 205

Glu Val Gly Leu Arg Tyr Tyr Lys Leu Ser Leu Leu Glu Glu Ile Pro

210 215 220

Arg Ile Asn Arg Asp Val Ala Val Glu Leu Arg Glu Arg Phe Gly Glu

225 230 235 240

Gly Val Pro Leu Lys Pro Val Val Lys Pro Gly Ser Trp Ile Gly Gly

245 250 255

Asp His Asp Gly Asn Pro Tyr Val Thr Ala Glu Thr Val Glu Tyr Ser

260 265 270

Thr His Arg Ala Ala Glu Thr Val Leu Lys Tyr Tyr Ala Arg Gln Leu

275 280 285

His Ser Leu Glu His Glu Leu Ser Leu Ser Asp Arg Met Asn Lys Val

290 295 300

Thr Pro Gln Leu Leu Ala Leu Ala Asp Ala Gly His Asn Asp Val Pro

305 310 315 320

Ser Arg Val Asp Glu Pro Tyr Arg Arg Ala Val His Gly Val Arg Gly

325 330 335

Arg Ile Leu Ala Thr Thr Ala Glu Leu Ile Gly Glu Asp Ala Val Glu

340 345 350

Gly Val Trp Phe Lys Val Phe Thr Pro Tyr Ala Ser Pro Glu Glu Phe

355 360 365

Leu Asn Asp Ala Leu Thr Ile Asp His Ser Leu Arg Glu Ser Lys Asp

370 375 380

Val Leu Ile Ala Asp Asp Arg Leu Ser Val Leu Ile Ser Ala Ile Glu

385 390 395 400

Ser Phe Gly Phe Asn Leu Tyr Ala Leu Asp Leu Arg Gln Asn Ser Glu

405 410 415

Ser Tyr Glu Asp Val Leu Thr Glu Leu Phe Glu Arg Ala Gln Val Thr

420 425 430

Ala Asn Tyr Arg Glu Leu Ser Glu Ala Glu Lys Leu Glu Val Leu Leu

435 440 445

Lys Glu Leu Arg Ser Pro Arg Pro Leu Ile Pro His Gly Ser Asp Glu

450 455 460

Tyr Ser Glu Val Thr Asp Arg Glu Leu Gly Ile Phe Arg Thr Ala Ser

465 470 475 480

Glu Ala Val Lys Lys Phe Gly Pro Arg Met Val Pro His Cys Ile Ile

485 490 495

Ser Met Ala Ser Ser Val Thr Asp Val Leu Glu Pro Met Val Leu Leu

500 505 510

Lys Glu Phe Gly Leu Ile Ala Ala Asn Gly Asp Asn Pro Arg Gly Thr

515 520 525

Val Asp Val Ile Pro Leu Phe Glu Thr Ile Glu Asp Leu Gln Ala Gly

530 535 540

Ala Gly Ile Leu Asp Glu Leu Trp Lys Ile Asp Leu Tyr Arg Asn Tyr

545 550 555 560

Leu Leu Gln Arg Asp Asn Val Gln Glu Val Met Leu Gly Tyr Ser Asp

565 570 575

Ser Asn Lys Asp Gly Gly Tyr Phe Ser Ala Asn Trp Ala Leu Tyr Asp

580 585 590

Ala Glu Leu Gln Leu Val Glu Leu Cys Arg Ser Ala Gly Val Lys Leu

595 600 605

Arg Leu Phe His Gly Arg Gly Gly Thr Val Gly Arg Gly Gly Gly Pro

610 615 620

Ser Tyr Asp Ala Ile Leu Ala Gln Pro Arg Gly Ala Val Gln Gly Ser

625 630 635 640

Val Arg Ile Thr Glu Gln Gly Glu Ile Ile Ser Ala Lys Tyr Gly Asn

645 650 655

Pro Glu Thr Ala Arg Arg Asn Leu Glu Ala Leu Val Ser Ala Thr Leu

660 665 670

Glu Ala Ser Leu Leu Asp Val Ser Glu Leu Thr Asp His Gln Arg Ala

675 680 685

Tyr Asp Ile Met Ser Glu Ile Ser Glu Leu Ser Leu Lys Lys Tyr Ala

690 695 700

Ser Leu Val His Glu Asp Gln Gly Phe Ile Asp Tyr Phe Thr Gln Ser

705 710 715 720

Thr Pro Leu Gln Glu Ile Gly Ser Leu Asn Ile Gly Ser Arg Pro Ser

725 730 735

Ser Arg Lys Gln Thr Ser Ser Val Glu Asp Leu Arg Ala Ile Pro Trp

740 745 750

Val Leu Ser Trp Ser Gln Ser Arg Val Met Leu Pro Gly Trp Phe Gly

755 760 765

Val Gly Thr Ala Leu Glu Gln Trp Ile Gly Glu Gly Glu Gln Ala Thr

770 775 780

Gln Arg Ile Ala Glu Leu Gln Thr Leu Asn Glu Ser Trp Pro Phe Phe

785 790 795 800

Thr Ser Val Leu Asp Asn Met Ala Gln Val Met Ser Lys Ala Glu Leu

805 810 815

Arg Leu Ala Lys Leu Tyr Ala Asp Leu Ile Pro Asp Thr Glu Val Ala

820 825 830

Glu Arg Val Tyr Ser Val Ile Arg Glu Glu Tyr Phe Leu Thr Lys Lys

835 840 845

Met Phe Cys Val Ile Thr Gly Ser Asp Asp Leu Leu Asp Asp Asn Pro

850 855 860

Leu Leu Ala Arg Ser Val Gln Arg Arg Tyr Pro Tyr Leu Leu Pro Leu

865 870 875 880

Asn Val Ile Gln Val Glu Met Met Arg Arg Tyr Arg Lys Gly Asp Gln

885 890 895

Ser Glu Gln Val Ser Arg Asn Ile Gln Leu Thr Met Asn Gly Leu Ser

900 905 910

Thr Ala Leu Arg Asn Ser Gly

915

<210> 20

<211> 1257

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 20

gtctcgagaa tatcctcctt ataacttcgt ataatgtatg ctatacgaac ggtaagagcg 60

cttggccgct cacttcgcag aataaataaa tcctggtgtc cctgttgata ccgggaagcc 120

ctgggccaac ttttggcgaa aatgagacgt tgatcggcac gtaagaggtt ccaactttca 180

ccataatgaa ataagatcac taccgggcgt attttttgag ttatcgagat tttcaggagc 240

taaggaagct aaaatggaga aaaaaatcac tggatatacc accgttgata tatcccaatg 300

gcatcgtaaa gaacattttg aggcatttca gtcagttgct caatgtacct ataaccagac 360

cgttcagctg gatattacgg cctttttaaa gaccgtaaag aaaaataagc acaagtttta 420

tccggccttt attcacattc ttgcccgcct gatgaatgct catccggagt tccgtatggc 480

aatgaaagac ggtgagctgg tgatatggga tagtgttcac ccttgttaca ccgttttcca 540

tgagcaaact gaaacgtttt catcgctctg gagtgaatac cacgacgatt tccggcagtt 600

tctacacata tattcgcaag atgtggcgtg ttacggtgaa aacctggcct atttccctaa 660

agggtttatt gagaatatgt ttttcgtctc agccaatccc tgggtgagtt tcaccagttt 720

tgatttaaac gtggccaata tggacaactt cttcgccccc gttttcacta tgggcaaata 780

ttatacgcaa ggcgacaagg tgctgatgcc gctggcgatt caggttcatc atgccgtctg 840

tgatggcttc catgtcggca gaatgcttaa tgaattacaa cagtactgcg atgagtggca 900

gggcggggcg taattttttt aaggcagtta ttggtgccct taaacgcctg gtgctacgcc 960

tgaataagtg ataataagcg gatgaatggc agaaattcga aagcaaattc gacccggtcg 1020

tcggttcagg gcagggtcgt taaatagccg cttatgtcta ttgctggttt accggtttat 1080

tgactaccgg aagcagtgtg accgtgtgct tctcaaatgc ctgagggcat gctgcggcag 1140

cgtgagggga tctttaccgt tcgtataatg tatgctatac caagttatga agctagctta 1200

tcaaaaagtt gacaattaat catcggctcg tataatgtgt ggaaggagga attaacc 1257

<210> 21

<211> 3749

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 21

atgggcagca gccatcacca tcatcaccac agccaggatc cgagcggaac aggacgactg 60

gcaggaaaga ttgcgttaat taccggtggc gccggcaata tcggcagtga attgacacgt 120

cgctttctcg cagagggagc gacggtcatt attagtggac ggaatcgggc gaagttgacc 180

gcactggccg aacggatgca ggcagaggca ggagtgccgg caaagcgcat cgatctcgaa 240

gtcatggatg ggagtgatcc ggtcgcggta cgtgccggta tcgaagcgat tgtggcccgt 300

cacggccaga tcgacattct ggtcaacaat gcaggaagtg ccggtgccca gcgtcgtctg 360

gccgagattc cactcactga agctgaatta ggccctggcg ccgaagagac gcttcatgcc 420

agcatcgcca atttacttgg tatgggatgg catctgatgc gtattgcggc acctcatatg 480

ccggtaggaa gtgcggtcat caatgtctcg accatctttt cacgggctga gtactacggg 540

cggattccgt atgtcacccc taaagctgct cttaatgctc tatctcaact tgctgcgcgt 600

gagttaggtg cacgtggcat ccgcgttaat acgatctttc ccggcccgat tgaaagtgat 660

cgcatccgta cagtgttcca gcgtatggat cagctcaagg ggcggcccga aggcgacaca 720

gcgcaccatt ttttgaacac catgcgattg tgtcgtgcca acgaccaggg cgcgcttgaa 780

cgtcggttcc cctccgtcgg tgatgtggca gacgccgctg tctttctggc cagtgccgaa 840

tccgccgctc tctccggtga gacgattgag gttacgcacg gaatggagtt gccggcctgc 900

agtgagacca gcctgctggc ccgtactgat ctgcgcacga ttgatgccag tggccgcacg 960

acgctcatct gcgccggcga ccagattgaa gaggtgatgg cgctcaccgg tatgttgcgt 1020

acctgtggga gtgaagtgat catcggcttc cgttcggctg cggcgctggc ccagttcgag 1080

caggcagtca atgagagtcg gcggctggcc ggcgcagact ttacgcctcc cattgccttg 1140

ccactcgatc cacgcgatcc ggcaacaatt gacgctgtct tcgattgggc cggcgagaat 1200

accggcggga ttcatgcagc ggtgattctg cctgctacca gtcacgaacc ggcaccgtgc 1260

gtgattgagg ttgatgatga gcgggtgctg aattttctgg ccgatgaaat caccgggaca 1320

attgtgattg ccagtcgcct ggcccgttac tggcagtcgc aacggcttac ccccggcgca 1380

cgtgcgcgtg ggccgcgtgt catttttctc tcgaacggtg ccgatcaaaa tgggaatgtt 1440

tacggacgca ttcaaagtgc cgctatcggt cagctcattc gtgtgtggcg tcacgaggct 1500

gaacttgact atcagcgtgc cagcgccgcc ggtgatcatg tgctgccgcc ggtatgggcc 1560

aatcagattg tgcgcttcgc taaccgcagc cttgaagggt tagaatttgc ctgtgcctgg 1620

acagctcaat tgctccatag tcaacgccat atcaatgaga ttaccctcaa catccctgcc 1680

aacatttaac aggaggaatt aacatggcag atctccatca ccatcatcac catcacagcg 1740

ccaccaccgg cgcacgcagt gcatcggtcg gatgggcgga aagcctgatc gggttgcatt 1800

tggggaaagt tgccttgatt accggtggca gcgccggtat tggtgggcag atcgggcgcc 1860

tcctggcttt gagtggcgcg cgcgtgatgc tggcagcccg tgatcggcat aagctcgaac 1920

agatgcaggc gatgatccaa tctgagctgg ctgaggtggg gtataccgat gtcgaagatc 1980

gcgtccacat tgcaccgggc tgcgatgtga gtagcgaagc gcagcttgcg gatcttgttg 2040

aacgtaccct gtcagctttt ggcaccgtcg attatctgat caacaacgcc gggatcgccg 2100

gtgtcgaaga gatggttatc gatatgccag ttgagggatg gcgccatacc ctcttcgcca 2160

atctgatcag caactactcg ttgatgcgca aactggcgcc gttgatgaaa aaacagggta 2220

gcggttacat ccttaacgtc tcatcatact ttggcggtga aaaagatgcg gccattccct 2280

accccaaccg tgccgattac gccgtctcga aggctggtca gcgggcaatg gccgaagtct 2340

ttgcgcgctt ccttggcccg gagatacaga tcaatgccat tgcgccgggt ccggtcgaag 2400

gtgatcgctt gcgcggtacc ggtgaacgtc ccggcctctt tgcccgtcgg gcgcggctga 2460

ttttggagaa caagcggctg aatgagcttc acgctgctct tatcgcggct gcgcgcaccg 2520

atgagcgatc tatgcacgaa ctggttgaac tgctcttacc caatgatgtg gccgcactag 2580

agcagaatcc cgcagcacct accgcgttgc gtgaactggc acgacgtttt cgcagcgaag 2640

gcgatccggc ggcatcatca agcagtgcgc tgctgaaccg ttcaattgcc gctaaattgc 2700

tggctcgttt gcataatggt ggctatgtgt tgcctgccga catctttgca aacctgccaa 2760

acccgcccga tcccttcttc acccgagccc agattgatcg cgaggctcgc aaggttcgtg 2820

acggcatcat ggggatgctc tacctgcaac ggatgccgac tgagtttgat gtcgcaatgg 2880

ccaccgtcta ttaccttgcc gaccgcgtgg tcagtggtga gacattccac ccatcaggtg 2940

gtttgcgtta cgaacgcacc cctaccggtg gcgaactctt cggcttgccc tcaccggaac 3000

ggctggcgga gctggtcgga agcacggtct atctgatagg tgaacatctg actgaacacc 3060

ttaacctgct tgcccgtgcg tacctcgaac gttacggggc acgtcaggta gtgatgattg 3120

ttgagacaga aaccggggca gagacaatgc gtcgcttgct ccacgatcac gtcgaggctg 3180

gtcggctgat gactattgtg gccggtgatc agatcgaagc cgctatcgac caggctatca 3240

ctcgctacgg tcgcccaggg ccggtcgtct gtaccccctt ccggccactg ccgacggtac 3300

cactggtcgg gcgtaaagac agtgactgga gcacagtgtt gagtgaggct gaatttgccg 3360

agttgtgcga acaccagctc acccaccatt tccgggtagc gcgctggatt gccctgagtg 3420

atggtgcccg tctcgcgctg gtcactcccg aaactacggc tacctcaact accgagcaat 3480

ttgctctggc taacttcatc aaaacgaccc ttcacgcttt tacggctacg attggtgtcg 3540

agagcgaaag aactgctcag cgcattctga tcaatcaagt cgatctgacc cggcgtgcgc 3600

gtgccgaaga gccgcgtgat ccgcacgagc gtcaacaaga actggaacgt tttatcgagg 3660

cagtcttgct ggtcactgca ccactcccgc ctgaagccga tacccgttac gccgggcgga 3720

ttcatcgcgg acgggcgatt accgtgtaa 3749

<210> 22

<211> 562

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 22

Met Gly Ser Ser His His His His His His Ser Gln Asp Pro Ser Gly

1 5 10 15

Thr Gly Arg Leu Ala Gly Lys Ile Ala Leu Ile Thr Gly Gly Ala Gly

20 25 30

Asn Ile Gly Ser Glu Leu Thr Arg Arg Phe Leu Ala Glu Gly Ala Thr

35 40 45

Val Ile Ile Ser Gly Arg Asn Arg Ala Lys Leu Thr Ala Leu Ala Glu

50 55 60

Arg Met Gln Ala Glu Ala Gly Val Pro Ala Lys Arg Ile Asp Leu Glu

65 70 75 80

Val Met Asp Gly Ser Asp Pro Val Ala Val Arg Ala Gly Ile Glu Ala

85 90 95

Ile Val Ala Arg His Gly Gln Ile Asp Ile Leu Val Asn Asn Ala Gly

100 105 110

Ser Ala Gly Ala Gln Arg Arg Leu Ala Glu Ile Pro Leu Thr Glu Ala

115 120 125

Glu Leu Gly Pro Gly Ala Glu Glu Thr Leu His Ala Ser Ile Ala Asn

130 135 140

Leu Leu Gly Met Gly Trp His Leu Met Arg Ile Ala Ala Pro His Met

145 150 155 160

Pro Val Gly Ser Ala Val Ile Asn Val Ser Thr Ile Phe Ser Arg Ala

165 170 175

Glu Tyr Tyr Gly Arg Ile Pro Tyr Val Thr Pro Lys Ala Ala Leu Asn

180 185 190

Ala Leu Ser Gln Leu Ala Ala Arg Glu Leu Gly Ala Arg Gly Ile Arg

195 200 205

Val Asn Thr Ile Phe Pro Gly Pro Ile Glu Ser Asp Arg Ile Arg Thr

210 215 220

Val Phe Gln Arg Met Asp Gln Leu Lys Gly Arg Pro Glu Gly Asp Thr

225 230 235 240

Ala His His Phe Leu Asn Thr Met Arg Leu Cys Arg Ala Asn Asp Gln

245 250 255

Gly Ala Leu Glu Arg Arg Phe Pro Ser Val Gly Asp Val Ala Asp Ala

260 265 270

Ala Val Phe Leu Ala Ser Ala Glu Ser Ala Ala Leu Ser Gly Glu Thr

275 280 285

Ile Glu Val Thr His Gly Met Glu Leu Pro Ala Cys Ser Glu Thr Ser

290 295 300

Leu Leu Ala Arg Thr Asp Leu Arg Thr Ile Asp Ala Ser Gly Arg Thr

305 310 315 320

Thr Leu Ile Cys Ala Gly Asp Gln Ile Glu Glu Val Met Ala Leu Thr

325 330 335

Gly Met Leu Arg Thr Cys Gly Ser Glu Val Ile Ile Gly Phe Arg Ser

340 345 350

Ala Ala Ala Leu Ala Gln Phe Glu Gln Ala Val Asn Glu Ser Arg Arg

355 360 365

Leu Ala Gly Ala Asp Phe Thr Pro Pro Ile Ala Leu Pro Leu Asp Pro

370 375 380

Arg Asp Pro Ala Thr Ile Asp Ala Val Phe Asp Trp Ala Gly Glu Asn

385 390 395 400

Thr Gly Gly Ile His Ala Ala Val Ile Leu Pro Ala Thr Ser His Glu

405 410 415

Pro Ala Pro Cys Val Ile Glu Val Asp Asp Glu Arg Val Leu Asn Phe

420 425 430

Leu Ala Asp Glu Ile Thr Gly Thr Ile Val Ile Ala Ser Arg Leu Ala

435 440 445

Arg Tyr Trp Gln Ser Gln Arg Leu Thr Pro Gly Ala Arg Ala Arg Gly

450 455 460

Pro Arg Val Ile Phe Leu Ser Asn Gly Ala Asp Gln Asn Gly Asn Val

465 470 475 480

Tyr Gly Arg Ile Gln Ser Ala Ala Ile Gly Gln Leu Ile Arg Val Trp

485 490 495

Arg His Glu Ala Glu Leu Asp Tyr Gln Arg Ala Ser Ala Ala Gly Asp

500 505 510

His Val Leu Pro Pro Val Trp Ala Asn Gln Ile Val Arg Phe Ala Asn

515 520 525

Arg Ser Leu Glu Gly Leu Glu Phe Ala Cys Ala Trp Thr Ala Gln Leu

530 535 540

Leu His Ser Gln Arg His Ile Asn Glu Ile Thr Leu Asn Ile Pro Ala

545 550 555 560

Asn Ile

<210> 23

<211> 681

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 23

Met Ala Asp Leu His His His His His His His Ser Ala Thr Thr Gly

1 5 10 15

Ala Arg Ser Ala Ser Val Gly Trp Ala Glu Ser Leu Ile Gly Leu His

20 25 30

Leu Gly Lys Val Ala Leu Ile Thr Gly Gly Ser Ala Gly Ile Gly Gly

35 40 45

Gln Ile Gly Arg Leu Leu Ala Leu Ser Gly Ala Arg Val Met Leu Ala

50 55 60

Ala Arg Asp Arg His Lys Leu Glu Gln Met Gln Ala Met Ile Gln Ser

65 70 75 80

Glu Leu Ala Glu Val Gly Tyr Thr Asp Val Glu Asp Arg Val His Ile

85 90 95

Ala Pro Gly Cys Asp Val Ser Ser Glu Ala Gln Leu Ala Asp Leu Val

100 105 110

Glu Arg Thr Leu Ser Ala Phe Gly Thr Val Asp Tyr Leu Ile Asn Asn

115 120 125

Ala Gly Ile Ala Gly Val Glu Glu Met Val Ile Asp Met Pro Val Glu

130 135 140

Gly Trp Arg His Thr Leu Phe Ala Asn Leu Ile Ser Asn Tyr Ser Leu

145 150 155 160

Met Arg Lys Leu Ala Pro Leu Met Lys Lys Gln Gly Ser Gly Tyr Ile

165 170 175

Leu Asn Val Ser Ser Tyr Phe Gly Gly Glu Lys Asp Ala Ala Ile Pro

180 185 190

Tyr Pro Asn Arg Ala Asp Tyr Ala Val Ser Lys Ala Gly Gln Arg Ala

195 200 205

Met Ala Glu Val Phe Ala Arg Phe Leu Gly Pro Glu Ile Gln Ile Asn

210 215 220

Ala Ile Ala Pro Gly Pro Val Glu Gly Asp Arg Leu Arg Gly Thr Gly

225 230 235 240

Glu Arg Pro Gly Leu Phe Ala Arg Arg Ala Arg Leu Ile Leu Glu Asn

245 250 255

Lys Arg Leu Asn Glu Leu His Ala Ala Leu Ile Ala Ala Ala Arg Thr

260 265 270

Asp Glu Arg Ser Met His Glu Leu Val Glu Leu Leu Leu Pro Asn Asp

275 280 285

Val Ala Ala Leu Glu Gln Asn Pro Ala Ala Pro Thr Ala Leu Arg Glu

290 295 300

Leu Ala Arg Arg Phe Arg Ser Glu Gly Asp Pro Ala Ala Ser Ser Ser

305 310 315 320

Ser Ala Leu Leu Asn Arg Ser Ile Ala Ala Lys Leu Leu Ala Arg Leu

325 330 335

His Asn Gly Gly Tyr Val Leu Pro Ala Asp Ile Phe Ala Asn Leu Pro

340 345 350

Asn Pro Pro Asp Pro Phe Phe Thr Arg Ala Gln Ile Asp Arg Glu Ala

355 360 365

Arg Lys Val Arg Asp Gly Ile Met Gly Met Leu Tyr Leu Gln Arg Met

370 375 380

Pro Thr Glu Phe Asp Val Ala Met Ala Thr Val Tyr Tyr Leu Ala Asp

385 390 395 400

Arg Val Val Ser Gly Glu Thr Phe His Pro Ser Gly Gly Leu Arg Tyr

405 410 415

Glu Arg Thr Pro Thr Gly Gly Glu Leu Phe Gly Leu Pro Ser Pro Glu

420 425 430

Arg Leu Ala Glu Leu Val Gly Ser Thr Val Tyr Leu Ile Gly Glu His

435 440 445

Leu Thr Glu His Leu Asn Leu Leu Ala Arg Ala Tyr Leu Glu Arg Tyr

450 455 460

Gly Ala Arg Gln Val Val Met Ile Val Glu Thr Glu Thr Gly Ala Glu

465 470 475 480

Thr Met Arg Arg Leu Leu His Asp His Val Glu Ala Gly Arg Leu Met

485 490 495

Thr Ile Val Ala Gly Asp Gln Ile Glu Ala Ala Ile Asp Gln Ala Ile

500 505 510

Thr Arg Tyr Gly Arg Pro Gly Pro Val Val Cys Thr Pro Phe Arg Pro

515 520 525

Leu Pro Thr Val Pro Leu Val Gly Arg Lys Asp Ser Asp Trp Ser Thr

530 535 540

Val Leu Ser Glu Ala Glu Phe Ala Glu Leu Cys Glu His Gln Leu Thr

545 550 555 560

His His Phe Arg Val Ala Arg Trp Ile Ala Leu Ser Asp Gly Ala Arg

565 570 575

Leu Ala Leu Val Thr Pro Glu Thr Thr Ala Thr Ser Thr Thr Glu Gln

580 585 590

Phe Ala Leu Ala Asn Phe Ile Lys Thr Thr Leu His Ala Phe Thr Ala

595 600 605

Thr Ile Gly Val Glu Ser Glu Arg Thr Ala Gln Arg Ile Leu Ile Asn

610 615 620

Gln Val Asp Leu Thr Arg Arg Ala Arg Ala Glu Glu Pro Arg Asp Pro

625 630 635 640

His Glu Arg Gln Gln Glu Leu Glu Arg Phe Ile Glu Ala Val Leu Leu

645 650 655

Val Thr Ala Pro Leu Pro Pro Glu Ala Asp Thr Arg Tyr Ala Gly Arg

660 665 670

Ile His Arg Gly Arg Ala Ile Thr Val

675 680

<210> 24

<211> 4293

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 24

aatgtgcctg tcaaatggac gaagcaggga ttctgcaaac cctatgctac tccgtcaagc 60

cgtcaattgt ctgattcgtt accaattatg acaacttgac ggctacatca ttcacttttt 120

cttcacaacc ggcacggaac tcgctcgggc tggccccggt gcatttttta aatacccgcg 180

agaaatagag ttgatcgtca aaaccaacat tgcgaccgac ggtggcgata ggcatccggg 240

tggtgctcaa aagcagcttc gcctggctga tacgttggtc ctcgcgccag cttaagacgc 300

taatccctaa ctgctggcgg aaaagatgtg acagacgcga cggcgacaag caaacatgct 360

gtgcgacgct ggcgatatca aaattgctgt ctgccaggtg atcgctgatg tactgacaag 420

cctcgcgtac ccgattatcc atcggtggat ggagcgactc gttaatcgct tccatgcgcc 480

gcagtaacaa ttgctcaagc agatttatcg ccagcagctc cgaatagcgc ccttcccctt 540

gcccggcgtt aatgatttgc ccaaacaggt cgctgaaatg cggctggtgc gcttcatccg 600

ggcgaaagaa ccccgtattg gcaaatattg acggccagtt aagccattca tgccagtagg 660

cgcgcggacg aaagtaaacc cactggtgat accattcgcg agcctccgga tgacgaccgt 720

agtgatgaat ctctcctggc gggaacagca aaatatcacc cggtcggcaa acaaattctc 780

gtccctgatt tttcaccacc ccctgaccgc gaatggtgag attgagaata taacctttca 840

ttcccagcgg tcggtcgata aaaaaatcga gataaccgtt ggcctcaatc ggcgttaaac 900

ccgccaccag atgggcatta aacgagtatc ccggcagcag gggatcattt tgcgcttcag 960

ccatactttt catactcccg ccattcagag aagaaaccaa ttgtccatat tgcatcagac 1020

attgccgtca ctgcgtcttt tactggctct tctcgctaac caaaccggta accccgctta 1080

ttaaaagcat tctgtaacaa agcgggacca aagccatgac aaaaacgcgt aacaaaagtg 1140

tctataatca cggcagaaaa gtccacattg attatttgca cggcgtcaca ctttgctatg 1200

ccatagcatt tttatccata agattagcgg atcctacctg acgcttttta tcgcaactct 1260

ctactgtttc tccatacccg ttttttgggc taacaggagg aattaaccat gggtacctct 1320

catcatcatc atcatcacag cagcggcctg gtgccgcgcg gcagcctcga gggtagatct 1380

ggtactagtg gtgaattcgg tgagctcggt ctgcagctgg tgccgcgcgg cagccaccac 1440

caccaccacc actaatacag attaaatcag aacgcagaag cggtctgata aaacagaatt 1500

tgcctggcgg cagtagcgcg gtggtcccac ctgaccccat gccgaactca gaagtgaaac 1560

gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac tgccaggcat 1620

caaataaaac gaaaggctca gtcgaaagac tgggcctttc gtcgacctaa ttcccatgtc 1680

agccgttaag tgttcctgtg tcactgaaaa ttgctttgag aggctctaag ggcttctcag 1740

tgcgttacat ccctggcttg ttgtccacaa ccgttaaacc ttaaaagctt taaaagcctt 1800

atatattctt ttttttctta taaaacttaa aaccttagag gctatttaag ttgctgattt 1860

atattaattt tattgttcaa acatgagagc ttagtacgtg aaacatgaga gcttagtacg 1920

ttagccatga gagcttagta cgttagccat gagggtttag ttcgttaaac atgagagctt 1980

agtacgttaa acatgagagc ttagtacgtg aaacatgaga gcttagtacg tactatcaac 2040

aggttgaact gcggatcttg atgagtggat agtacgttgc taaaacatga gataaaaatt 2100

gactctcatg ttattggcgt taagatatac agaatgatga ggttttttta tgagactcaa 2160

ggtcatgatg gacgtgaaca aaaaaacgaa aattcgccac cgaaacgagc taaatcacac 2220

cctggctcaa cttcctttgc ccgcaaagcg agtgatgtat atggcgcttg ctcccattga 2280

tagcaaggaa cctcttgaac gagggcgagt tttcaaaatt agggctgaag accttgcagc 2340

gctcgccaaa atcaccccat cgcttgctta tcgacaatta aaagagggtg gtaagttact 2400

tggtgccagc aaaatttcgc taagagggga tgatatcatt gcttcagcta aagagcttaa 2460

cctgctcttt actgctaaag actcccctga agagttagat cttaacatta ttgagtggat 2520

agcttattca aatgatgaag gatacttgtc tttaaaattc accagaacca tagaaccata 2580

tatctctagc cttattggga aaaaaaataa attcacaacg caattgttaa cggcaagctt 2640

acgcttaagt agccagtatt catcttctct ttatcaactt atcaggaagc attactctaa 2700

ttttaagaag aaaaattatt ttattatttc cgttgatgag ttaaaggaag agttaatagc 2760

ttatactttt gataaagatg gaagtattga gtacaaatac cctgactttc ctatttttaa 2820

aagggatgta ttaaataaag ccattgctga aattaaaaag aaaacagaaa tatcgtttgt 2880

tggctttact gttcatgaaa aagaaggaag aaaaattagt aagctgaagt tcgaatttgt 2940

cgttgatgaa gatgaatttt ctggcgataa agatgatgaa gcttttttta tgaatttatc 3000

tgaagctaat gcagcttttc tcaaggtatt tgatgaaacc gtacctccca aaaaagctaa 3060

ggggtgatat atggctaaaa tttacgattt ccctcaagga gccgaacgcc gcaggatgca 3120

ccgcaaaatc cagtggaaca acgctgtaaa attatctaaa aatggctgga gtaagccaga 3180

ggttaaacgc tggtcttttt tagcattcat ctcaactggc tggcggccgc ggaaccccta 3240

tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa taaccctgat 3300

aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc 3360

ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa acgctggtga 3420

aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa ctggatctca 3480

acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg atgagcactt 3540

ttaaagttct gctatgtgat acactattat cccgtattga cgccgggcaa gagcaactcg 3600

gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc acagaaaagc 3660

atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc atgagtgata 3720

acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta accgcttttt 3780

tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag 3840

ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatgccaaca acgttgcgca 3900

aactattaac tggcgaacta cttactctag cttcccggca acaattaata gactgaatgg 3960

aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc tggtttattg 4020

ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca ctggggccag 4080

atggtaagcg ctcccgtatc gtagttatct acaccacggg gagtcaggca actatggatg 4140

aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg taactgtcag 4200

accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa tttaaaagga 4260

tctaggtgaa gatccttttt gataatcgca tgc 4293

<210> 25

<211> 1185

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 25

atggcgtccg taactgtaga gcaaatccga aaggctcagc gagctgaagg tccggccacc 60

atcctcgcca ttggcaccgc cgttcctgcc aactgtttca accaagctga ttttcccgac 120

tactactttc gtgtcaccaa aagtgaacac atgactgatc tcaaaaagaa gttccaacga 180

atgtgtgaaa aatccactat aaaaaagcgt tacttgcact tgaccgaaga gcatctgaag 240

cagaacccac atctgtgcga gtacaatgca ccatctctga acacacgcca agacatgttg 300

gtggttgaag ttcccaagct tgggaaggag gctgcaatca atgccatcaa agaatggggc 360

caacccaagt ccaagatcac ccatctcatc ttctgcaccg gctcctccat cgacatgcca 420

ggagccgatt accaatgcgc caagcttctc ggcctccgac cctcggtgaa gcgagtgatg 480

ctgtatcaac tcggctgtta tgccggtgga aaagttcttc gcatagccaa ggacatagca 540

gagaacaaca agggcgctag agttctcatt gtgtgctctg agatcacagc ttgtatcttt 600

cgcgggccct cggagaaaca tttggattgc ttggtggggc aatctctgtt cggagacggg 660

gcatcttcgg tcatcgttgg tgccgaccct gatgcctcgg taggcgagcg gccgatcttc 720

gagttggttt cagctgcgca gacgattttg cctaactcgg atggagccat agccgggcac 780

gtaacggaag ccgggctgac atttcacttg ctgagggacg tgccagggtt gatctcccaa 840

aacattgaga agagcttgat tgaggccttc actccgattg ggattaatga ctggaacaac 900

atattctgga ttgcacatcc cggtggacct gccattctgg acgagataga ggccaagctc 960

gagctgaaga aggagaagat gaaggcgtct cgtgaaatgc tgagcgagta tgggaacatg 1020

tcatgtgcaa gcgttttctt catagtagat gagatgagga aacagtcgtc gaaggaaggg 1080

aagtctacca ccggagatgg actggagtgg ggcgctctct tcgggtttgg accgggtctg 1140

acggtggaga cggtggtctt gcacagcgtg cccacaaacg tctaa 1185

<210> 26

<211> 394

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 26

Met Ala Ser Val Thr Val Glu Gln Ile Arg Lys Ala Gln Arg Ala Glu

1 5 10 15

Gly Pro Ala Thr Ile Leu Ala Ile Gly Thr Ala Val Pro Ala Asn Cys

20 25 30

Phe Asn Gln Ala Asp Phe Pro Asp Tyr Tyr Phe Arg Val Thr Lys Ser

35 40 45

Glu His Met Thr Asp Leu Lys Lys Lys Phe Gln Arg Met Cys Glu Lys

50 55 60

Ser Thr Ile Lys Lys Arg Tyr Leu His Leu Thr Glu Glu His Leu Lys

65 70 75 80

Gln Asn Pro His Leu Cys Glu Tyr Asn Ala Pro Ser Leu Asn Thr Arg

85 90 95

Gln Asp Met Leu Val Val Glu Val Pro Lys Leu Gly Lys Glu Ala Ala

100 105 110

Ile Asn Ala Ile Lys Glu Trp Gly Gln Pro Lys Ser Lys Ile Thr His

115 120 125

Leu Ile Phe Cys Thr Gly Ser Ser Ile Asp Met Pro Gly Ala Asp Tyr

130 135 140

Gln Cys Ala Lys Leu Leu Gly Leu Arg Pro Ser Val Lys Arg Val Met

145 150 155 160

Leu Tyr Gln Leu Gly Cys Tyr Ala Gly Gly Lys Val Leu Arg Ile Ala

165 170 175

Lys Asp Ile Ala Glu Asn Asn Lys Gly Ala Arg Val Leu Ile Val Cys

180 185 190

Ser Glu Ile Thr Ala Cys Ile Phe Arg Gly Pro Ser Glu Lys His Leu

195 200 205

Asp Cys Leu Val Gly Gln Ser Leu Phe Gly Asp Gly Ala Ser Ser Val

210 215 220

Ile Val Gly Ala Asp Pro Asp Ala Ser Val Gly Glu Arg Pro Ile Phe

225 230 235 240

Glu Leu Val Ser Ala Ala Gln Thr Ile Leu Pro Asn Ser Asp Gly Ala

245 250 255

Ile Ala Gly His Val Thr Glu Ala Gly Leu Thr Phe His Leu Leu Arg

260 265 270

Asp Val Pro Gly Leu Ile Ser Gln Asn Ile Glu Lys Ser Leu Ile Glu

275 280 285

Ala Phe Thr Pro Ile Gly Ile Asn Asp Trp Asn Asn Ile Phe Trp Ile

290 295 300

Ala His Pro Gly Gly Pro Ala Ile Leu Asp Glu Ile Glu Ala Lys Leu

305 310 315 320

Glu Leu Lys Lys Glu Lys Met Lys Ala Ser Arg Glu Met Leu Ser Glu

325 330 335

Tyr Gly Asn Met Ser Cys Ala Ser Val Phe Phe Ile Val Asp Glu Met

340 345 350

Arg Lys Gln Ser Ser Lys Glu Gly Lys Ser Thr Thr Gly Asp Gly Leu

355 360 365

Glu Trp Gly Ala Leu Phe Gly Phe Gly Pro Gly Leu Thr Val Glu Thr

370 375 380

Val Val Leu His Ser Val Pro Thr Asn Val

385 390

<210> 27

<211> 954

<212> DNA

<213> Artificial sequence (Artificial sequence)

<400> 27

atgcgggcga aacttctggg aatagtcctg acaaccccta ttgcgatcag ctcttttgct 60

tctaccgaga ctttatcgtt tactcctgac aacataaatg cggacattag tcttggaact 120

ctgagcggaa aaacaaaaga gcgtgtttat ctagccgaag aaggaggccg aaaagtcagt 180

caactcgact ggaaattcaa taacgctgca attattaaag gtgcaattaa ttgggatttg 240

atgccccaga tatctatcgg ggctgctggc tggacaactc tcggcagccg aggtggcaat 300

atggtcgatc aggactggat ggattccagt aaccccggaa cctggacgga tgaaagtaga 360

caccctgata cacaactcaa ttatgccaac gaatttgatc tgaatatcaa aggctggctc 420

ctcaacgaac ccaattaccg cctgggactc atggccggat atcaggaaag ccgttatagc 480

tttacagcca gaggtggttc ctatatctac agttctgagg agggattcag agatgatatc 540

ggctccttcc cgaatggaga aagagcaatc ggctacaaac aacgttttaa aatgccctac 600

attggcttga ctggaagtta tcgttatgaa gattttgaac tcggtggcac atttaaatac 660

agcggctggg tggaatcatc tgataacgat gaacactatg acccgggaaa aagaatcact 720

tatcgcagta aggtcaaaga ccaaaattac tattctgttg cagtcaatgc aggttattac 780

gtcacaccta acgcaaaagt ttatgttgaa ggcgcatgga atcgggttac gaataaaaaa 840

ggtaatactt cactttatga tcacaataat aacacttcag actacagcaa aaatggagca 900

ggtatagaaa actataactt catcactact gctggtctta agtacacatt ttaa 954

<210> 28

<211> 317

<212> PRT

<213> Artificial sequence (Artificial sequence)

<400> 28

Met Arg Ala Lys Leu Leu Gly Ile Val Leu Thr Thr Pro Ile Ala Ile

1 5 10 15

Ser Ser Phe Ala Ser Thr Glu Thr Leu Ser Phe Thr Pro Asp Asn Ile

20 25 30

Asn Ala Asp Ile Ser Leu Gly Thr Leu Ser Gly Lys Thr Lys Glu Arg

35 40 45

Val Tyr Leu Ala Glu Glu Gly Gly Arg Lys Val Ser Gln Leu Asp Trp

50 55 60

Lys Phe Asn Asn Ala Ala Ile Ile Lys Gly Ala Ile Asn Trp Asp Leu

65 70 75 80

Met Pro Gln Ile Ser Ile Gly Ala Ala Gly Trp Thr Thr Leu Gly Ser

85 90 95

Arg Gly Gly Asn Met Val Asp Gln Asp Trp Met Asp Ser Ser Asn Pro

100 105 110

Gly Thr Trp Thr Asp Glu Ser Arg His Pro Asp Thr Gln Leu Asn Tyr

115 120 125

Ala Asn Glu Phe Asp Leu Asn Ile Lys Gly Trp Leu Leu Asn Glu Pro

130 135 140

Asn Tyr Arg Leu Gly Leu Met Ala Gly Tyr Gln Glu Ser Arg Tyr Ser

145 150 155 160

Phe Thr Ala Arg Gly Gly Ser Tyr Ile Tyr Ser Ser Glu Glu Gly Phe

165 170 175

Arg Asp Asp Ile Gly Ser Phe Pro Asn Gly Glu Arg Ala Ile Gly Tyr

180 185 190

Lys Gln Arg Phe Lys Met Pro Tyr Ile Gly Leu Thr Gly Ser Tyr Arg

195 200 205

Tyr Glu Asp Phe Glu Leu Gly Gly Thr Phe Lys Tyr Ser Gly Trp Val

210 215 220

Glu Ser Ser Asp Asn Asp Glu His Tyr Asp Pro Gly Lys Arg Ile Thr

225 230 235 240

Tyr Arg Ser Lys Val Lys Asp Gln Asn Tyr Tyr Ser Val Ala Val Asn

245 250 255

Ala Gly Tyr Tyr Val Thr Pro Asn Ala Lys Val Tyr Val Glu Gly Ala

260 265 270

Trp Asn Arg Val Thr Asn Lys Lys Gly Asn Thr Ser Leu Tyr Asp His

275 280 285

Asn Asn Asn Thr Ser Asp Tyr Ser Lys Asn Gly Ala Gly Ile Glu Asn

290 295 300

Tyr Asn Phe Ile Thr Thr Ala Gly Leu Lys Tyr Thr Phe

305 310 315

70页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:一种硫酸新霉素的发酵工艺

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!

技术分类