Application of soybean GmSPA3a/3b protein and related biological materials thereof in regulation and control of plant flowering and plant height

文档序号:1826534 发布日期:2021-11-12 浏览:10次 中文

阅读说明:本技术 大豆GmSPA3a/3b蛋白及其相关生物材料在调控植物开花和株高中的应用 (Application of soybean GmSPA3a/3b protein and related biological materials thereof in regulation and control of plant flowering and plant height ) 是由 赵涛 秦超 李宏宇 刘斌 刘军 于 2021-09-15 设计创作,主要内容包括:本发明公开了大豆GmSPA3a/3b蛋白及其相关生物材料在调控植物开花和株高中的应用。本发明所保护的一个技术方案是蛋白质相关生物材料在调控或缩短植物花期、调控或降低植物株高以及植物育种和改良中的应用。所述蛋白质的氨基酸序列可为序列表中序列1或序列2所示的蛋白。实验证明,GmSPA3a/3b蛋白活性丧失的转基因大豆Gmspa3a、Gmspa3b和Gmspa3ab突变体表现出花期提前且株高矮化的表型。通过敲除大豆Gmspa3a、Gmspa3b或GmSPA3a/3b基因,可降低大豆的光周期敏感性,培育不同开花时间和株高的多个品系以适应不同区域的种植,从而有效扩大优良大豆的播种面积,实现增产丰收。(The invention discloses application of soybean GmSPA3a/3b protein and related biological materials thereof in regulating and controlling flowering and plant height of plants. The invention provides a technical scheme for applying protein-related biological materials in regulation or shortening of the flowering phase of plants, regulation or reduction of the plant height of plants, plant breeding and improvement. The amino acid sequence of the protein can be a protein shown as a sequence 1 or a sequence 2 in a sequence table. Experiments prove that the transgenic soybeans Gmspa3a, Gmspa3b and Gmspa3ab mutant with the loss of the activity of the Gmsa 3a/3b protein show a phenotype of advanced flowering phase and dwarfing plant height. By knocking out soybean Gmspa3a, Gmspa3b or GmSPA3a/3b genes, the photoperiod sensitivity of soybeans can be reduced, and multiple strains with different flowering time and plant height are cultivated to adapt to planting in different areas, so that the sowing area of excellent soybeans is effectively enlarged, and yield and harvest are increased.)

1. Any of the following uses of protein-related biomaterials:

q1, use of the biomaterial for regulating or shortening the flowering phase of a plant;

q2, and the application of the biological material in regulating and controlling the plant height or reducing the plant height;

q3, use of the biomaterial in plant breeding;

q4, the use of the biological material in plant variety improvement;

the protein is a protein of A1), A2), A3), A4), A5), A6), A7), A8), A9) or A10) as follows:

A1) the amino acid sequence is protein of sequence 1 in a sequence table;

A2) the amino acid sequence is protein of a sequence 2 in a sequence table;

A3) the amino acid sequence is protein of sequence 5 in the sequence table;

A4) the amino acid sequence is protein of sequence 6 in the sequence table;

A5) the amino acid sequence is protein of a sequence 7 in a sequence table;

A6) the amino acid sequence is protein of sequence 8 in the sequence table;

A7) the amino acid sequence is protein of a sequence 9 in a sequence table;

A8) the amino acid sequence is protein of a sequence 10 in a sequence table;

A9) a protein which is obtained by substituting and/or deleting and/or adding more than one amino acid residue in an amino acid sequence shown by A1), A2), A3), A4), A5), A6), A7) or A8), has the same function, is derived from A1), A2), A3), A4), A5), A6), A7) or A8), or has the same function with a protein shown by A1), A2), A3), A4), A5), A6), A7) or A8) by more than 80 percent of identity;

A10) a fusion protein obtained by connecting protein tags to the N-terminal and/or C-terminal of A1), A2), A3), A4), A5), A6), A7), A8) or A9);

the biomaterial may be any one of the following B1) to B9):

B1) a nucleic acid molecule encoding a protein as described above.

B2) An expression cassette comprising the nucleic acid molecule according to B1).

B3) A recombinant vector containing the nucleic acid molecule according to B1) or a recombinant vector containing the expression cassette according to B2).

B4) A recombinant microorganism containing B1) the nucleic acid molecule, or a recombinant microorganism containing B2) the expression cassette, or a recombinant microorganism containing B3) the recombinant vector.

B5) A transgenic plant cell line comprising B1) the nucleic acid molecule or a transgenic plant cell line comprising B2) the expression cassette.

B6) Transgenic plant tissue comprising the nucleic acid molecule according to B1) or transgenic plant tissue comprising the expression cassette according to B2).

B7) A transgenic plant organ containing B1) the nucleic acid molecule or a transgenic plant organ containing B2) the expression cassette.

B8) A nucleic acid molecule that inhibits or reduces gene expression of the protein.

B9) An expression cassette, a recombinant vector, a recombinant microorganism or a transgenic plant cell line comprising the nucleic acid molecule according to B8).

2. Use according to claim 1, characterized in that: B1) the nucleic acid molecule is a DNA molecule shown in the following b1) or b2) or b3) or b4) or b5) or b6) or b7) or b8) or b9) or b 10):

b1) the coding sequence is a DNA molecule shown in a sequence 3 in a sequence table;

b2) the coding sequence is a DNA molecule shown in a sequence 4 in a sequence table;

b3) the coding sequence is a DNA molecule obtained by inserting 1 nucleotide T between the 160-161 th nucleotides of the sequence 3 in the sequence table and keeping other sequences of the sequence 3 unchanged;

b4) the coding sequence is a DNA molecule obtained by deleting 5 nucleotides TTCGG at 159 th-164 th site of the sequence 3 in the sequence table and keeping other sequences of the sequence 3 unchanged;

b5) the coding sequence is a DNA molecule obtained by deleting 2 nucleotides CC at the 420-421 st site of the sequence 4 in the sequence table and keeping other sequences of the sequence 4 unchanged;

b6) the coding sequence is a DNA molecule obtained by inserting 1 nucleotide T between CC nucleotides of 2 nucleotides at 419-420 sites of the sequence 4 in the sequence table and keeping other sequences of the sequence 4 unchanged;

b7) the coding sequence is a DNA molecule obtained by deleting 2 nucleotides CC at the 142 rd-143 th site of the sequence 3 in the sequence table and keeping other sequences of the sequence 3 unchanged;

b8) the coding sequence is a DNA molecule obtained by deleting 2 nucleotides CC (2 nucleotides are deleted) at the 139-140 th site of the sequence 4 in the sequence table and keeping other sequences of the sequence 4 unchanged;

b9) a DNA molecule having 90% or more 90% identity to the nucleotide sequence defined by b1), b2), b3), b4), b5), b6), b7) or b8) and encoding the protein of claim 1;

b10) a DNA molecule which hybridizes under stringent conditions with a nucleotide sequence defined by b1), b2), b3), b4), b5), b6), b7), b8) or b9) and which encodes a protein as claimed in claim 1;

and/or the presence of a gas in the gas,

B8) the nucleic acid molecule is a DNA molecule that expresses a gRNA targeting the gene encoding the protein described in a1) or a2) above or a gRNA targeting the gene encoding the protein described in a1) or a2) above.

3. Use of the protein of claim 1 or a substance that modulates the activity or content of the protein of claim 1, for any one of the following applications:

use of P1, the protein or a substance that modulates the activity or content of the protein of claim 1 to modulate or shorten the flowering phase of a plant;

the use of P2, the protein or a substance that regulates the activity or content of the protein according to claim 1 for regulating plant height or reducing plant height;

use of P3, the protein or a substance modulating the activity or content of the protein of claim 1 in plant breeding;

use of P4, the protein or a substance modulating the activity or content of the protein of claim 1 for the improvement of plant varieties.

4. Use according to any one of claims 1-3, characterized in that: the protein is derived from soybean.

5. Use according to any one of claims 1-4, characterized in that: the plant is any one of the following plants:

D1) a dicotyledonous plant, a plant selected from the group consisting of dicotyledonous plants,

D2) a plant belonging to the order of the Sophora,

D3) a plant belonging to the family Leguminosae,

D4) a plant of the genus Glycine,

D5) and (4) soybeans.

6. A method for shortening the flowering time of a plant and/or reducing the plant height of a plant, comprising shortening the flowering time of a plant and/or reducing the plant height by inhibiting or reducing the expression of a gene encoding the protein of claim 1 in the genome of the plant.

7. The method of claim 6, wherein: the inhibition or reduction of the activity of the protein of claim 1 or/and the expression level of the gene encoding the protein of claim 1 in the plant of interest is achieved by knocking out the gene encoding the protein of claim 1 in the plant of interest.

8. The method of claim 6, wherein: the method comprising introducing into the plant an agent that inhibits or reduces the expression of a gene encoding the protein of claim 1; the substance for inhibiting or reducing the expression of the gene encoding the protein of claim 1 is any one of the following substances c1) -c 4):

c1) a nucleic acid molecule that inhibits or reduces the expression of a gene encoding the protein of claim 1;

c2) an expression cassette comprising the nucleic acid molecule of c 1);

c3) a recombinant vector comprising the nucleic acid molecule of c1) or a recombinant vector comprising the expression cassette of c 2);

c4) a recombinant microorganism containing c1) the nucleic acid molecule, or a recombinant microorganism containing c2) the expression cassette, or a recombinant microorganism containing c3) the recombinant vector.

9. The method of claim 8, wherein:

c1) the nucleic acid molecule is a DNA molecule for expressing a gRNA targeting the A1) or A2) protein coding gene or a gRNA targeting the A1) or A2) protein coding gene.

10. The method of claim 9, wherein: the method for inhibiting or reducing the expression of the gene coding for the protein in the claim 1 in the plant genome comprises the step of carrying out at least one mutation on the gene coding for the protein shown as a sequence 3 in a sequence table in the plant genome:

1) a nucleotide T is inserted between the 160-th and 161-th nucleotides of the sequence 3;

2) deletion of TTCGG at the 5 th nucleotide 159-164 th of the sequence 3;

3) deletion of 2 nucleotides from positions 142-143 of the sequence 3;

and/or the presence of a gas in the gas,

carrying out at least one of the following mutations on the protein coding gene shown in a sequence 4 in a sequence table in a plant genome:

1) deletion of 2 nucleotides CC from position 420-421 of the sequence 4;

2) 1T is inserted into the 419-420 th bit of the sequence 4;

3) deletion of 2 bases CC at 139-140 of the sequence 4;

and/or the presence of a gas in the gas,

the plant is soybean.

Technical Field

The invention belongs to the technical field of biology, and particularly relates to application of soybean GmSPA3a/3b protein and related biological materials thereof in regulation and control of plant flowering and main stem node number.

Background

Soybean (Glycine max) is an important source of edible vegetable oils and vegetable proteins in the world. Illumination is one of the most important environmental factors influencing the growth and development of soybeans. Soybeans are typically short-day plants, and most soybean varieties have narrow photoperiod adaptability to different latitudes.

The plant type of the crops refers to the overall embodiment of the aspects of the morphological layout, ecological characteristics, physiological characteristics and the like of the plants. The ideal plant type of the crops can improve the leaf area coefficient of the crops, increase the group photosynthetic efficiency and improve the demand relationship of the crops on fertilizers, thereby improving the yield of the crops. Soybeans are typically short-day plants, and most soybean varieties have narrow photoperiod adaptability to different latitudes. In high-latitude areas, soybean varieties with short stalks and short growth periods are generally needed, TL1 is used as a good variety in low-latitude areas, and the phenomenon of serious lodging and low fruiting rate can occur in the high-latitude areas.

In 2010, scientists have completed soybean genome sequencing, so that conditions are provided for gene function research, good genes in soybeans are more convenient to clone, and the molecular mechanism of the soybean strain type is not clear at present. With the continuous development of molecular biology, the research on plant type functional genes is carried out by utilizing high-throughput sequencing and transgenic technology, so that the yield of the soybean can be improved.

Disclosure of Invention

The technical problem to be solved by the invention is how to regulate the flowering phase of the plant and/or how to regulate the plant height of the plant.

In order to solve the above technical problem, the present invention firstly provides any one of the following uses of protein-related biomaterials:

q1, use of the biomaterial for regulating or shortening the flowering phase of a plant.

Q2 and application of the biological material in regulating and controlling plant height or reducing plant height.

Q3, use of the biomaterial in plant breeding.

Q4, and the application of the biological material in plant variety improvement.

The protein described above may be a protein of a1), a2), A3), a4), a5), a6), a7), A8), a9), or a10) as follows:

A1) the amino acid sequence is protein of sequence 1 in a sequence table;

A2) the amino acid sequence is protein of a sequence 2 in a sequence table;

A3) the amino acid sequence is protein of sequence 5 in the sequence table;

A4) the amino acid sequence is protein of sequence 6 in the sequence table;

A5) the amino acid sequence is protein of a sequence 7 in a sequence table;

A6) the amino acid sequence is protein of sequence 8 in the sequence table;

A7) the amino acid sequence is protein of a sequence 9 in a sequence table;

A8) the amino acid sequence is protein of a sequence 10 in a sequence table;

A9) a protein which is obtained by substituting and/or deleting and/or adding more than one amino acid residue in an amino acid sequence shown by A1), A2), A3), A4), A5), A6), A7) or A8), has the same function, is derived from A1), A2), A3), A4), A5), A6), A7) or A8), or has the same function with a protein shown by A1), A2), A3), A4), A5), A6), A7) or A8) by more than 80 percent of identity;

A10) a fusion protein obtained by connecting protein tags to the N-terminal and/or C-terminal of A1), A2), A3), A4), A5), A6), A7), A8) or A9).

The above-mentioned biomaterial may be any of the following B1) to B9):

B1) a nucleic acid molecule encoding a protein as described above.

B2) An expression cassette comprising the nucleic acid molecule according to B1).

B3) A recombinant vector containing the nucleic acid molecule according to B1) or a recombinant vector containing the expression cassette according to B2).

B4) A recombinant microorganism containing B1) the nucleic acid molecule, or a recombinant microorganism containing B2) the expression cassette, or a recombinant microorganism containing B3) the recombinant vector.

B5) A transgenic plant cell line comprising B1) the nucleic acid molecule or a transgenic plant cell line comprising B2) the expression cassette.

B6) Transgenic plant tissue comprising the nucleic acid molecule according to B1) or transgenic plant tissue comprising the expression cassette according to B2).

B7) A transgenic plant organ containing B1) the nucleic acid molecule or a transgenic plant organ containing B2) the expression cassette.

B8) A nucleic acid molecule which inhibits or reduces gene expression of a protein as described above.

B9) An expression cassette, a recombinant vector, a recombinant microorganism or a transgenic plant cell line comprising the nucleic acid molecule according to B8).

Among the proteins, the soybean GmSPA3a protein shown in the sequence 1 in the sequence table is composed of 907 amino acid residues. The soybean GmSPA3b protein shown in the sequence 2 in the sequence table consists of 906 amino acid residues.

The protein can be artificially synthesized, or can be obtained by synthesizing the coding gene and then carrying out biological expression.

In the above protein, the protein tag (protein-tag) refers to a polypeptide or protein that is expressed by fusion with a target protein using in vitro recombinant DNA technology, so as to facilitate expression, detection, tracking and/or purification of the target protein. The protein tag may be a Flag tag, a His tag, an MBP tag, an HA tag, a myc tag, a GST tag, and/or a SUMO tag, among others.

In the above proteins, identity refers to the identity of amino acid sequences. The identity of the amino acid sequences can be determined using homology search sites on the Internet, such as the BLAST web pages of the NCBI home website. For example, in the advanced BLAST2.1, by using blastp as a program, setting the value of Expect to 10, setting all filters to OFF, using BLOSUM62 as a Matrix, setting Gap existence cost, Per residual Gap cost, and Lambda ratio to 11, 1, and 0.85 (default values), respectively, and performing a calculation by searching for the identity of a pair of amino acid sequences, a value (%) of identity can be obtained.

In the above protein, the 80% or more identity may be at least 81%, 82%, 85%, 86%, 88%, 90%, 91%, 92%, 95%, 96%, 98%, 99% or 100% identity.

In the above biological material, the recombinant microorganism may be specifically yeast, bacteria, algae and fungi.

In the above application, the nucleic acid molecule of B1) is a DNA molecule represented by B1) or B2) or B3) or B4) or B5) or B6) or B7) or B8) or B9) or B10):

b1) the coding sequence is a DNA molecule shown in a sequence 3 in a sequence table;

b2) the coding sequence is a DNA molecule shown in a sequence 4 in a sequence table;

b3) the coding sequence is a DNA molecule obtained by inserting 1 nucleotide T between the 160-161 th nucleotides of the sequence 3 in the sequence table and keeping other sequences of the sequence 3 unchanged;

b4) the coding sequence is a DNA molecule obtained by deleting 5 nucleotides TTCGG at 159 th-164 th site of the sequence 3 in the sequence table and keeping other sequences of the sequence 3 unchanged;

b5) the coding sequence is a DNA molecule obtained by deleting 2 nucleotides CC at the 420-421 st site of the sequence 4 in the sequence table and keeping other sequences of the sequence 4 unchanged;

b6) the coding sequence is a DNA molecule obtained by inserting 1 nucleotide T between CC nucleotides of 2 nucleotides at 419-420 sites of the sequence 4 in the sequence table and keeping other sequences of the sequence 4 unchanged;

b7) the coding sequence is a DNA molecule obtained by deleting 2 nucleotides CC at the 142 rd-143 th site of the sequence 3 in the sequence table and keeping other sequences of the sequence 3 unchanged;

b8) the coding sequence is a DNA molecule obtained by deleting 2 nucleotides CC (2 nucleotides are deleted) at the 139-140 th site of the sequence 4 in the sequence table and keeping other sequences of the sequence 4 unchanged;

b9) a DNA molecule having 90% or more 90% identity to the nucleotide sequence defined by b1), b2), b3), b4), b5), b6), b7) or b8) and encoding the protein of claim 1;

b10) a DNA molecule which hybridizes under stringent conditions to a nucleotide sequence defined by b1), b2), b3), b4), b5), b6), b7), b8) or b9) and which encodes a protein as claimed in claim 1.

The nucleic acid molecule described above B8) may be a DNA molecule that expresses a gRNA targeting the gene encoding the protein described above a1) or a2) or a gRNA targeting the gene encoding the protein described above a1) or a 2).

The target sequence of the gRNA of the gene coding for the protein described in A1) above may correspond to nucleotides 157-175 of sequence 3 and/or 129-147 of sequence 3 of the sequence listing. The target sequence of the gRNA of the gene encoding the above A2) protein can be represented by the nucleotides at position 403-422 of the sequence 4 and/or the nucleotides at position 126-144 of the sequence 4 in the sequence table.

The term "identity" refers to sequence similarity to a native nucleic acid sequence. Identity can be assessed visually or by computer software. Using computer software, the identity between two or more sequences can be expressed in percent (%), which can be used to assess the identity between related sequences. The identity of 90% or greater than 90% can be at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% identity.

In the above biological materials, the expression cassette containing a nucleic acid molecule described in B2) refers to a DNA capable of expressing the protein described in the above application in a host cell, and the DNA may include not only a promoter for initiating transcription of the gene encoding the protein but also a terminator for terminating transcription of the gene encoding the protein. Further, the expression cassette may also include an enhancer sequence. Promoters useful in the present invention include, but are not limited to: constitutive promoters, tissue, organ and development specific promoters, and inducible promoters.

In order to solve the above technical problems, the present invention also provides any one of the following uses of the protein described above or a substance that modulates the activity or content of the protein described above:

use of P1, a protein as described above, or a substance modulating the activity or content of a protein as described above, for modulating the flowering time or shortening the flowering time of a plant;

the application of P2, the protein or the substance for regulating the activity or the content of the protein in the plant to regulating the plant height or reducing the plant height;

use of P3, a protein as described above, or a substance modulating the activity or content of a protein as described above, in plant breeding;

the use of P4, a protein as described above or a substance modulating the activity or content of a protein as described above for the improvement of plant varieties.

The protein described above may be derived from soy.

The plant may be any one of:

D1) a dicotyledonous plant, a plant selected from the group consisting of dicotyledonous plants,

D2) a plant belonging to the order of the Sophora,

D3) a plant belonging to the family Leguminosae,

D4) a plant of the genus Glycine,

D5) and (4) soybeans.

In the above application, the substance for regulating the activity or content of the protein may be a substance for knocking out a gene encoding the protein and/or a substance for regulating the expression of a gene encoding the protein.

In the above application, the substance for regulating gene expression may be a substance for regulating at least one of the following 6 kinds of regulation: 1) regulation at the level of transcription of said gene; 2) regulation after transcription of the gene (i.e., regulation of splicing or processing of a primary transcript of the gene); 3) regulation of RNA transport of the gene (i.e., regulation of nuclear to cytoplasmic transport of mRNA of the gene); 4) regulation of translation of the gene; 5) regulation of mRNA degradation of the gene; 6) post-translational regulation of the gene (i.e., regulation of the activity of a protein translated from the gene).

In the above application, the regulation of gene expression may be the inhibition or reduction of gene expression, and the inhibition or reduction of gene expression may be achieved by gene knockout or by gene silencing.

The gene knockout (geneknockout) refers to a phenomenon in which a specific target gene is inactivated by homologous recombination. Gene knockout is the inactivation of a specific target gene by a change in the DNA sequence.

The gene silencing refers to the phenomenon that a gene is not expressed or is under expression under the condition of not damaging the original DNA. Gene silencing is premised on no change in DNA sequence, resulting in no or low expression of the gene. Gene silencing can occur at two levels, one at the transcriptional level due to DNA methylation, differential staining, and positional effects, and the other post-transcriptional gene silencing, i.e., inactivation of a gene at the post-transcriptional level by specific inhibition of a target RNA, including antisense RNA, co-suppression (co-suppression), gene suppression (quelling), RNA interference (RNAi), and micro-RNA (mirna) -mediated translational suppression, among others.

In the above application, the substance for regulating gene expression may be an agent for inhibiting or reducing the gene expression. The agent that inhibits or reduces the expression of the gene can be an agent that knocks out the gene, such as an agent that knocks out the gene by homologous recombination, or an agent that knocks out the gene by CRISPR-Cas 9. The agent that inhibits or reduces expression of the gene may comprise a polynucleotide that targets the gene, such as an siRNA, shRNA, sgRNA, miRNA, or antisense RNA.

In order to solve the above technical problems, the present invention also provides a method for shortening the flowering time and/or reducing the plant height of a plant, comprising shortening the flowering time and/or reducing the plant height of a plant by inhibiting or reducing the expression of a gene encoding a protein as described above in the genome of the plant.

The above-mentioned suppression or reduction of the expression level of the above-mentioned protein-encoding gene in the plant can be achieved by any means known in the art, such that the gene is subjected to deletion mutation, insertion mutation or base change mutation, thereby achieving the reduction or loss of the gene function, specifically, chemical mutagenesis, physical mutagenesis, RNAi, site-directed genome editing or homologous recombination, etc.

In the above-mentioned genome site-directed editing method, Zinc Finger Nuclease (ZFN) technology, Transcription activator-like effector nuclease (TALEN) technology, Clustered regularly spaced short palindromic repeats and their related systems (Clustered regularly interspaced short palindromic repeats/CRISPR associated, CRISPR/Cas9 system) technology, and other technologies capable of realizing genome site-directed editing can be used. In any case, the entire gene encoding the protein described above may be targeted, and each element regulating the expression of the gene encoding the protein described above may be targeted, as long as the loss or reduction of the function of the gene can be achieved. For example, exon 1 of the gene encoding the protein described above can be targeted.

The above-mentioned inhibition or reduction of the activity of the above-mentioned protein or/and the expression level of the gene encoding the above-mentioned protein in the plant of interest may be achieved by knocking out the gene encoding the above-mentioned protein in the plant of interest.

The protein in the above method may be the protein in A1) described above.

The method as described above further comprising introducing into said plant an agent that inhibits or reduces the expression of a gene encoding a protein as described above. The substance for inhibiting or reducing the expression of the protein-encoding gene described above may be any one of the following substances c1) -c 4):

c1) a nucleic acid molecule that inhibits or reduces the expression of a gene encoding a protein as described above;

c2) an expression cassette comprising the nucleic acid molecule of c 1);

c3) a recombinant vector comprising the nucleic acid molecule of c1) or a recombinant vector comprising the expression cassette of c 2);

c4) a recombinant microorganism containing c1) the nucleic acid molecule, or a recombinant microorganism containing c2) the expression cassette, or a recombinant microorganism containing c3) the recombinant vector.

The protein may be the protein of a1) described above.

c1) The nucleic acid molecule can be a DNA molecule for expressing a gRNA targeting the A1) protein encoding gene or a gRNA targeting the A1) protein encoding gene.

The target sequence of the gRNA of the gene encoding the above protein A1) may correspond to the targets at positions 157-175 and/or 129-147 of the sequence 3 of the sequence listing. The target sequence of the gRNA of the gene encoding the protein A2) above can be represented by nucleotides 126-144 and/or nucleotides 403-422 of the sequence 4 in the sequence table. The above-mentioned suppression or reduction of the expression of the gene encoding a1) described above in the genome of a plant may be carried out by subjecting the protein-encoding gene represented by sequence 3 in the sequence listing in the genome of a plant to at least one of the following mutations:

1) a nucleotide T is inserted between the 160-th and 161-th nucleotides of the sequence 3;

2) deletion of TTCGG at the 5 th nucleotide 159-164 th of the sequence 3;

3) the deletion of 2 nucleotides from positions 142 and 143 of the sequence 3.

The above-mentioned suppression or reduction of the expression of the gene encoding a1) described above in the plant genome may also be carried out by subjecting the protein-encoding gene represented by sequence 4 in the sequence listing in the plant genome to at least one of the following mutations:

1) deletion of 2 nucleotides CC at positions 418-419 of the sequence 4;

2) 1 nucleotide T is inserted into the 419-420 th site of the sequence 4;

3) the deletion of the 2 nd nucleotide CC at positions 139-144 of the sequence 4.

The protein described above may be the protein in a1) described above.

The plant described above may be any of the following:

D1) a dicotyledonous plant, a plant selected from the group consisting of dicotyledonous plants,

D2) a plant belonging to the order of the Sophora,

D3) a plant belonging to the family Leguminosae,

D4) a plant of the genus Glycine,

D5) and (4) soybeans.

Soybeans are typically short-day plants, and most soybean varieties have narrow photoperiod adaptability to different latitudes. Experiments in the invention prove that soybean variety TL1 is transformed by using recombinant agrobacterium tumefaciens containing recombinant plasmids GmSPA3a-gRNA1, GmSPA3b-gRNA2 and GmSPA3a3b-gRNA3, genes GmSPA3a/3b can be edited, after the genes GmSPA3a/3b are edited by CRESR/Cas 9 endonuclease, gene mutation of GmSPA3a/3b can be caused, when gene mutation of GmSPA3a/3b of two homologous chromosomes can cause activity loss of GmSPA3a/3b protein, transgenic soybean GmSPA3a, GmSPA3b and GmSPA3ab mutants with activity loss of GmSPA3a/3b protein are obtained. The Gmspa3a, Gmspa3b and Gmspa3ab mutants show the phenotype of advanced flowering phase and dwarf plant height, are very suitable for transplanting and planting excellent varieties in low-latitude areas to high-latitude areas, and can provide high-quality resources for breeding. By changing the response characteristic of soybean GmSPAs gene, it is possible to reduce the photoperiod sensitivity of soybean, and cultivate multiple strains with different flowering time and plant height to adapt to the planting in different areas, thereby effectively enlarging the sowing area of excellent soybean and realizing the purposes of increasing yield and harvest. The research finds that the flowering phase of the soybean is advanced and the plant height is dwarfed after the GmSPA3a/3b is deleted, and the method has important application value.

Drawings

FIG. 1 is a GmSPAs protein species evolutionary tree. "lenticular" represents GmSPA3a and GmSPA3 b.

FIG. 2 shows the results of molecular identification of the mutation types of Gmspa3a, Gmspa3b and Gmspa3 ab. A is a schematic diagram of the structure of the GmSPA3a gene and the position of mutant Gmspa3a where gene editing occurs, and B is a schematic diagram of the structure of the GmSPA3B gene and the position of mutant Gmspa3B where gene editing occurs; c is a schematic diagram showing the position of Gmspa3ab where gene editing occurs.

FIG. 3 is a statistical chart of the phenotypes of soybean varieties TL1, Gmspa3a, Gmspa3b and Gmspa3ab mutant plants.

FIG. 4 shows the detection of flowering related gene expression level of soybean variety TL1, Gmspa3a, Gmspa3b and Gmspa3ab mutant plants.

Detailed Description

The present invention is described in further detail below with reference to specific embodiments, which are given for the purpose of illustration only and are not intended to limit the scope of the invention. The examples provided below serve as a guide for further modifications by a person skilled in the art and do not constitute a limitation of the invention in any way.

The experimental procedures in the following examples, unless otherwise indicated, are conventional and are carried out according to the techniques or conditions described in the literature in the field or according to the instructions of the products. Materials, reagents and the like used in the following examples are commercially available unless otherwise specified.

The co-culture medium, the liquid induction medium to which hormones are added, the solid induction medium, the solid elongation medium and the rooting medium used in the examples of the present invention are described in the following documents: paz, M.M., J.C. Martinez, A.B.Kalvig, T.M.Fonger and K.Wang (2006) "Improved coded node method used an alternative explicit derived from the information set for the expression of Plant Cell reconstruction 25(3): 206-" 213 ".

Sources of carriers and experimental materials in the examples of the invention are as follows:

pU3-gRNA vector: high charles laboratory benefit, in the literature of the reference: shann, Q, Y.Wang, J.Li, Y.Zhang, K.Chen, Z.Liang, K.Zhang, J.Liu, J.J.xi, J.L.Qiu and C.Gao (2013). Targeted genome modification of crop plants using a CRISPR-Cas system. Nat Biotechnol 31(8): 686. 688. the public is available from the inventors only for the purpose of repeating the present invention.

Soybean variety TL 1: Tian-Long1, given by Shicheng researchers of oil crop institute of Chinese academy of agricultural sciences, stored in the laboratory, transgenic with TL1 as background, hereinafter referred to as soybean. Reference documents: xiangguang Lyu et al, GmCRY1s modulated rubber in metabolism to regulated soybean shade in response to reduced blue light, Molecular Plant, Volume 14, Issue 2,2021.

In the present example, three replicates of the experiment were set up, the data were processed using SPSS11.5 statistical software, and the results were expressed as mean ± standard deviation, with One-way ANOVA test, P < 0.05 (x) indicating significant difference compared to wild-type soybean variety TL1, and P < 0.01 (x) indicating significant difference compared to wild-type soybean variety TL 1.

EXAMPLE I acquisition of GmSPA3a/3b Gene deletion mutant plants

1. Construction of CRISPR-Cas9 recombinant plasmids GmSPA3a-gRNA1 and GmSPA3a-gRNA2

The GmSPAs gene in the soybean genome comprises a pair of homologous genes GmSPA3a and GmSPA3b (hereinafter, GmSPA3a/3 b). The amino acid sequences of GmSPA3a and GmSPA3b proteins in the genome of the soybean William 82 are respectively shown as a sequence 1 and a sequence 2 in a sequence table, and CDS coding sequences corresponding to GmSPA3a and GmSPA3b are respectively nucleotide shown as a sequence 3 and a sequence 4 in the sequence table.

1.1 obtaining of U6-gRNA

A single guide RNA (single guide RNA) target point sequence is designed according to a genome sequence of soybean protein GmSPA3a/3b, the name of 1 designed sgRNA targeting GmSPA3a is sgRNA1, and the designed gRNA target point sequences are subjected to soybean genome sequence alignment to eliminate non-specific target points. The name of the designed 1 sgRNA targeting GmSPA3b is sgRNA2, and soybean genomic sequence comparison is carried out on the designed gRNA target point sequences to eliminate non-specific target points. The names of the designed 2 sgrnas simultaneously targeting GmSPA3a and GmSPA3b are sgRNA3 respectively, and soybean genomic sequence comparison is performed on the designed gRNA target sequences to eliminate non-specific target sites.

The nucleotide sequence of the sgRNA1 target point is 5'-TTTTCGGGTGAGGCATCAC-3', which corresponds to the 157 th-175 th nucleotide of the sequence 3 in the sequence table; the nucleotide sequence of the sgRNA2 target spot is 5'-GTCAGTGTAGCGCATTCTCA-3', which corresponds to the 403 th and 422 th nucleotides of the sequence 4 in the sequence table; the nucleotide sequence of the sgRNA3 target point is 5'-AAGGGTTCATTGTCCTCAA-3', which respectively corresponds to the 129-position 147-position nucleotide of the sequence 3 in the sequence table and the 126-position 144-position nucleotide of the sequence 4 in the sequence table.

1.1.1 acquisition of DNA fragments encoding gRNA

Preparing the following PCR reaction system for PCR amplification, wherein the total volume is 50 mu L: an aqueous solution (10. mu.M in concentration) of 25. mu.L of 2 XPphanta Max Buffer, 1. mu.L of dNTP Mix, 2. mu.L of primer F (3a-F1, 3b-F1 or 3ab-F1) (see Table 1 for sequence)2. mu.L of primer R (gRNA-R) (see Table 1 for sequence) (concentration: 10. mu.M), 1. mu.L of Phanta Max Super-Fidelity DNA Polymerase, 1. mu.L of template pU3-gRNA vector, 18. mu.L of ddH2And (C) O.

TABLE 1 GmSPA3a/3b-gRNA1-gRNA2 vector construction primer List

The amplification products of the 3a-F1 and gRNA-R primers in the system are coding DNA of sgRNA 1; the amplification products of the 3b-F1 and gRNA-R primers are coding DNA of sgRNA 2; the amplification products of the 3ab-F1 and gRNA-R primers were DNA encoding sgRNA 3.

The reaction procedure is as follows: 3min at 95 ℃; 35 cycles of 95 ℃ for 15s, 57 ℃ for 15s, 72 ℃ for 10 s; 5min at 72 ℃; storing at 4 ℃.

After the PCR amplification, the gRNA-encoding DNA fragments were recovered using an agarose gel recovery kit (Axygen Co.), to obtain about 141bp of the encoding DNA of sgRNA1 (position 306-443 of sequence 11 in the sequence Listing), the encoding DNA of sgRNA2 (position 306-443 of sequence 12 in the sequence Listing), and the encoding DNA of sgRNA3 (position 306-443 of sequence 13 in the sequence Listing), respectively.

1.1.2 obtaining of U6 promoter fragment

Preparing the following PCR reaction system for PCR amplification: total 50 μ L: the DNA was prepared from 25. mu.L of 2 XPPhanta Max Buffer, 1. mu.L of dNTP Mix, 2. mu.L of primer U6-F (see Table 1) in aqueous solution (concentration: 10. mu.M), 2. mu.L of primer U6-R (see Table 1) in aqueous solution (concentration: 10. mu.M), 1. mu.L of Phanta Max Super-Fidelity DNA Polymerase, 1. mu.L of genomic DNA of William 82 (soybean), and 18. mu.L of ddH2And (C) O.

After completion of the PCR amplification, the U6 promoter fragment (nucleotides 1 to 305 of SEQ ID NO: 15 in the sequence listing) of about 305bp was recovered using an agarose gel recovery kit (Axygen).

The reaction procedure is as follows: 3min at 95 ℃; 35 cycles of 95 ℃ for 15s, 57 ℃ for 15s, 72 ℃ for 10 s; 5min at 72 ℃; storing at 4 ℃.

1.1.3 obtaining of U6-gRNA fragments

Preparing the following PCR reaction system for PCR amplificationThe linkage of the U6 promoter and gRNA encoding DNA was achieved. Total 50 μ L: mu.L of 2 XPPhanta Max Buffer, 1. mu.L of dNTP Mix, 2. mu.L of aqueous primer U6-F (10. mu.M), 2. mu.L of aqueous primer gRNA-R (10. mu.M), 1. mu.L of template (containing 50ng of the gRNA-encoding DNA fragment obtained in step 1.1.1 and 50ng of the U6 promoter fragment obtained in step 1.1.2) and 18. mu.L of ddH2And (C) O.

The reaction procedure is as follows: 3min at 95 ℃; 35 cycles of 95 ℃ for 15s, 57 ℃ for 15s, 72 ℃ for 30 s; 5min at 72 ℃; storing at 4 ℃.

After the PCR amplification is finished, PCR products U6-sgRNA1 (sequence 11 in the sequence table), U6-sgRNA2 (sequence 12 in the sequence table) and U6-sgRNA3 (sequence 13 in the sequence table) are respectively obtained.

1.2 construction of CRISPR-Cas9 recombinant vector

1.2.1CRISPR-Cas9 vector linearization

The CRISPR-Cas9 vector pCas9-AtU6-sgRNA was stored for this laboratory (relevant documents: Li, C.et al.A. registration-associated gene GmPRR3b rules the cyclic addition clock and marketing time in a sober Plant 13, 745-759 (2020). the public is available from the Applicant only for the purpose of repeating the present invention).

The pCas9-AtU6-sgRNA vector was double digested with restriction enzymes StuI and XbaI, and about 14000bp linear pCas9-AtU6-sgRNA vector backbone was recovered.

U6-sgRNA1, U6-sgRNA2 and U6-sgRNA3 obtained in step 1.1 were simultaneously double-digested with restriction enzymes StuI and XbaI, respectively.

1.2.2 obtaining of ligation product

(1) Taking 1 mu L of the fragment obtained by double digestion of U6-sgRNA (U6-sgRNA1, U6-sgRNA2 or U6-sgRNA3) in the step 1.2.1 and 1 mu L of the linear pCas9-AtU6-sgRNA vector skeleton 50ng, 1 mu L T410 xbuffer,1 mu L T4 DNA Ligase and 5 mu L of ddH obtained in the step 1.2.1 respectively2O to obtain a linking system.

(2) And (3) taking the connection system in the step (1), and reacting overnight at 16 ℃ to obtain a connection product which is a CRISPR-Cas9 recombinant vector.

1.3CRISPR-Cas9 recombinant vector plasmid propagation

And (3) respectively transforming the CRISPR-Cas9 recombinant vector plasmid obtained in the step (1.2) into escherichia coli TOP10 competent cells (TRANS) to obtain a plurality of monoclonal transformants.

Each of the obtained single-clone transformed bacteria was used as a template, and PCR amplification was carried out using a primer pair consisting of primers 5'-ACATTTAATACGCGATAGAAAAC-3' and 5'-GTGCTCGGCAACGCGTTCTAGA-3'. If the DNA fragment with 528bp size of the nucleotide sequence of the PCR amplification product is a coding DNA molecule containing sgRNA1 shown in the position 306-443 of the sequence 15 in the sequence table: U6-sgRNA1, the corresponding monoclonal transformation bacteria are positive monoclonal bacteria; U6-sgRNA2 and U6-sgRNA3 were verified by the same procedure to obtain positive monoclonal bacteria.

Inoculating the obtained positive monoclonal bacteria to an LB liquid culture medium for culture, and then extracting plasmids from a culture medium liquid to obtain CRISPR-Cas9 recombinant plasmids GmSPA3a-gRNA1, GmSPA3b-gRNA2 and GmSPA3a3b-gRNA 3.

2、T0Generation of transgenic soybean plants

2.1 transforming competent cells of Agrobacterium tumefaciens EH105(ZOMANBIO ZC142) by a liquid nitrogen freezing method through recombinant plasmids GmSPA3a-gRNA1, GmSPA3b-gRNA2 and GmSPA3a3b-gRNA3 to obtain recombinant Agrobacterium tumefaciens which are respectively named as EH105/GmSPA3a-gRNA1, EH105/GmSPA3b-gRNA2 and EH105/GmSPA3a3b-gRNA 3.

2.2 transformation of recombinant Agrobacterium EH105/GmSPA3a-gRNA1, EH105/GmSPA3b-gRNA2 and EH105/GmSPA3a3b-gRNA3 into Soybean variety TL1(Tian-Long1, given by the oil researchers of the national academy of agricultural sciences, available to the present laboratory, from the Applicant only for the purposes of the invention related documents Xiangguang Lyu et al, GmCRY1s modulated rubber tissue from metabolism to regulation of soybean hull available from research in response to reduced blue, Molecular Plant, Volume 14, Issue 2,2021) as follows:

(1) the soybean is sterilized by chlorine gas generated by the reaction of concentrated hydrochloric acid and sodium hypochlorite.

(2) Cutting soybean into half, removing part of embryo tip, dividing wound in meristem, and soaking in sterile water for 7-8 hr; then, the sterile water was discarded and OD was added separately600nm0.5-0.8 of recombinant agrobacterium tumefaciens EH105/GmSPA3a-gRNA1, EH105/GmSPA3b-gRNA2 and EH105/GmSPA3a3b-gRNA3 bacterial liquid, oscillating for 30min at 28 ℃ and 200 rpm; taking out the bean cotyledon, blowing under aseptic condition for 10min, spreading on co-culture medium, and dark culturing at 28 deg.C for 3 days.

(3) Washing with sterile water and liquid induction medium containing timentin, cephalo and vancomycin for 4-5 times, respectively, and cleaning Agrobacterium.

(4) Cutting off the embryo bud with 3-4mm, placing the embryo bud downwards and the wound side upwards, obliquely inserting into solid induction culture medium, and culturing in greenhouse by irradiation.

(5) After 10 days some of the beans started to bud, the buds were cut off from the stumps and transferred to a new solid induction medium, no longer shoots were thrown away, and the cultivation was continued in the greenhouse.

(6) After 10 days, the long shoots were subcultured to a new solid induction medium, and bean petals without long shoots were discarded and cultivated in a greenhouse for 10 days. The bean cotyledon was co-cultured in the solid induction medium for 30 days.

(7) Separating the well-grown callus from bean, discarding bean, scraping off black surface of callus, transferring to solid elongation culture medium, replacing new solid elongation culture medium every 20 days, generally subculturing for 3-4 times, and standing for 60-80 days. The callus is also screened by Kan (kanamycin) antibiotics during the elongation culture, and seedlings can grow out in the screening process.

(8) When the shoots reached about 4-5cm, they were excised from the calli and transferred to rooting medium.

(9) Culturing in rooting culture medium for about 20-30 days, and hardening seedling in backlight place for 5 days.

(10) Transplanting the seedling in field, adding appropriate amount of water, green manure and slow release fertilizer, covering with a film, and exposing to light to make it suitable for strong light, removing film after 3 days, and getting T0The transgenic soybean plants are simulated.

3. Molecular identification of mutation types of transgenic soybean plants

3.1T obtained in step 2 respectively0The genome DNA of the leaf of the pseudo-transgenic soybean plant is taken as a template, and 5' -TC is adoptedCGCAGCCATTAACGACTT-3 ' and 5'-ACAGATAAAGCCACGCACATT-3' to obtain a corresponding PCR amplification product 1 (used for identifying whether contains a Basta resistance gene or not, the nucleotide shown in a sequence 14 in a sequence table is 989bp in length); carrying out PCR amplification on the 2 by using a primer pair consisting of 5'-CAGCTCGTCCAAACCTAC-3' and 5'-CTGTGCCATCCATCTTCT-3' to obtain a corresponding PCR amplification product 2 (used for identifying whether a Cas9 gene is contained or not, a nucleotide shown as a sequence 15 in a sequence table is 601bp in length); the primer pair 3 consisting of 5'-acatttaatacgcgatagaaaac-3' and 5'-GTGCTCGGCAACGCGTTCTAGA-3' is adopted for PCR amplification to obtain a corresponding PCR amplification product 3 (the nucleotide sequence of the sequence 16 in the sequence table has the size of 566bp and is used for identifying whether a U6-sgRNA fragment is contained).

If a certain T0The PCR amplification products of the pseudotransgenic soybean plants all contain DNA fragments of the three Basta resistance genes, Cas9 gene and U6-sgRNA (U6-sgRNA1, U6-sgRNA2 or U6-sgRNA3), so that the T is the mutant of the soybean0A transgenic soybean plant was identified as T0Transgenic soybean plants are generated.

3.2 by T, respectively0Genome DNA of transgenic soybean plant leaf is used as a template, and a primer pair consisting of 5'-TGATTGATATTGGAGTCTGTG-3' and 5'-GCTGATTCAATAAACGAGATA-3' is adopted to transform T of recombinant agrobacterium tumefaciens EH105/GmSPA3a-gRNA10Carrying out PCR amplification on the generation seedlings to obtain a corresponding PCR amplification product 4 with the size of 619bp and the sequence shown as sequence 17; t of recombinant agrobacterium tumefaciens EH105/GmSPA3b-gRNA2 is transformed by adopting primer pair consisting of 5'-AATGGTGATGGTGGAGGT-3' and 5'-AATCTTCACTTCCCATACTCT-3'0Carrying out PCR amplification on the generation seedlings to obtain a corresponding PCR amplification product 5 with the size of 686bp and the sequence of 18. The two pairs of primers respectively transfer T of the transformed recombinant agrobacterium tumefaciens EH105/GmSPA3a3b-gRNA30Carrying out PCR amplification on the seedlings to obtain corresponding PCR amplification products 4 and 5.

3.3, respectively subjecting the PCR amplification products 4 and 5 to Sanger sequencing. And (3) comparing the sequencing result with a target sequence of the GmSPA3a/3b gene (nucleotide molecules shown in a sequence 3 and a sequence 4 in a sequence table) respectively, and counting the mutation types.

Since soybean is diploid plant, when Cas9 protein plays a role to cut off a specific gene, the GmSPA3a/3b gene homozygous mutant refers to the same mutation of the GmSPA3a/3b gene of two homologous chromosomes of the plant. The GmSPA3a/3b gene biallelic mutant refers to that the GmSPA3a/3b gene of two homologous chromosomes of the plant is mutated but the mutation forms are different.

T of the recombinant agrobacterium tumefaciens EH105/GmSPA3a-gRNA1 is detected and transformed0Among 15 transgenic soybean plants, 5 plants subjected to gene editing are selected; t of the recombinant agrobacterium tumefaciens EH105/GmSPA3a-gRNA2 is detected and transformed04 plants subjected to gene editing in 16 transgenic soybean plants; t of recombinant agrobacterium tumefaciens EH105/GmSPA3a3b-gRNA3 by detection0Of 10 transgenic soybean plants, 3 plants are selected from GmSPA3a and GmSPA3b which have undergone gene editing simultaneously.

Will T0Plants edited by generation genes are respectively selfed for 2 generations to obtain T2And (4) generating gene edited plants. T is then detected as described above2The mutation type of the plant with gene editing. A total of 4 homozygous mutants and 1 biallelic mutant were obtained. The 4 pure synthetic mutants are named Gmspa3a-1#, Gmspa3a-2#, Gmspa3b-1#, Gmspa3b-2 #; 1 biallelic mutant was named Gmspa3ab-1 #.

The results of the 4 double homozygous mutants and the 1 double allelic mutant are shown in A and B of FIG. 2.

In the Gmspa3a-1# mutant, compared with the soybean Gmspa3a coding gene shown in the sequence 3 of the sequence table, the nucleotide sequence of the Gmspa3a-1# mutant gene is a DNA molecule obtained by inserting 1 nucleotide T (the insertion of 1 nucleotide occurs) between the 160-th and 161-th nucleotides of the sequence 3 of the sequence table and keeping other sequences of the sequence 3 unchanged, and the mutation causes a frame shift to cause the functional deletion of the Gmspa3a protein. That is, Gmspa3a-1# has a mutation in both of the GmSPA3a genes to Gmspa3a-1# gene in both homologous chromosomes as compared to the wild type for the GmSPA3a gene. The amino acid sequence of Gmspa3a-1# protein is shown as sequence 5 in the sequence table.

In the Gmspa3a-2# mutant, compared with the soybean Gmspa3a coding gene shown in the sequence 3 in the sequence table, the nucleotide sequence of the Gmspa3a-2# mutant gene is a DNA molecule obtained by deleting 5 nucleotides TTCGG (deletion of 5 nucleotides) at 159 th and 164 th positions of the sequence 3 in the sequence table and keeping other sequences of the sequence 3 unchanged, and the mutation causes frame shift to cause the functional deletion of the Gmspa3a protein. That is, Gmspa3a-2# has a mutation in both of Gmspa3a gene to Gmspa3a-2# gene in both homologous chromosomes for Gmspa3a gene compared to wild type. The amino acid sequence of Gmspa3a-2# protein is shown as sequence 6 in the sequence table.

In the Gmspa3b-1# mutant, compared with the soybean Gmspa3b gene shown in the sequence 4 of a sequence table, the nucleotide sequence of the Gmspa3b-1# mutant gene is a DNA molecule obtained by deleting 2 nucleotides CC (2 nucleotides are deleted) at the 420-position and 421-position of the sequence 4 and keeping other sequences of the sequence 4 unchanged, and the mutation causes a frame shift to cause the functional deletion of the Gmspa3b protein. That is, Gmspa3b-1# has a mutation in both of the GmSPA3b genes to Gmspa3b-1# gene in both homologous chromosomes as compared to the wild type for the GmSPA3b gene. The amino acid sequence of the mutated Gmspa3b-1# protein is shown as sequence 7.

Compared with soybean GmSPA3b gene shown in sequence 4 of a sequence table, in the GmSpa3b-2# mutant, the nucleotide sequence of the GmSpa3b-2# mutant gene is a DNA molecule obtained by inserting 1 nucleotide T between CC nucleotides at the 419-420 th site of the sequence 4 and keeping other sequences of the sequence 4 unchanged, and the mutation causes a frame shift to cause the functional deletion of GmSPA3b protein. That is, Gmspa3b-2# has a mutation in both of Gmspa3b gene to Gmspa3b-2# gene in both homologous chromosomes for Gmspa3b gene compared to wild type. The amino acid sequence of the mutated Gmspa3b-2# protein is shown as sequence 8.

In the Gmspa3ab-1# mutant, compared with a soybean Gmspa3a coding gene shown in a sequence 3 in a sequence table, the nucleotide sequence of a Gmspa3a mutant gene (Gmspa3ab-A) is a DNA molecule obtained by deleting 2-nucleotide CC (with 2-nucleotide deletion) at the 143 nd position of the 142-th-order fragment 143 of the sequence 3 and keeping other sequences of the sequence 3 unchanged, and simultaneously compared with the soybean Gmspa3B gene shown in the sequence 4 in the sequence table, the nucleotide sequence of a Gmspa3B mutant gene (Gmspa3ab-B) is a DNA molecule obtained by deleting 2-nucleotide CC (with 2-nucleotide deletion) at the 140-th position of the sequence 4 and keeping other sequences of the sequence 4 unchanged, and the mutation of the two genes causes code shift, so that the function of the Gmspa3a/3B protein is deleted.

The amino acid sequence of the Gmspa3ab-A protein is shown as a sequence 9 in a sequence table; the amino acid sequence of the Gmspa3ab-B protein is shown as a sequence 10 in a sequence table.

Seeds from 5 mutants Gmspa3a-1#, Gmspa3a-2#, Gmspa3b-1#, Gmspa3b-2# and Gmspa3ab-1# plants were harvested and subjected to phenotypic observations as described in example two below.

Example II, application of GmSPA3a/3b gene in regulating flowering and plant height of soybean

1. Phenotypic observation of flowering time and plant height in field

The experiment included three replicates. Data were processed using SPSS11.5 statistical software and experimental results were expressed as mean ± standard deviation, with One-way ANOVA test, P < 0.05 (x) indicating significant difference from wild-type soybean variety TL1 and P < 0.01 (x) indicating significant difference from wild-type soybean variety TL1 (see amendments to actual conditions).

Respectively mixing wild soybean variety TL1 and T obtained in example I2The plants of the GmSPA3a/3b gene deletion mutant Gmspa3a, Gmspa3b and Gmspa3ab are sowed in summer in fields with 100 seeds of Beijing (40 degrees 13 '28' N, 116 degrees 33 '37' E), and the flowering phase and the plant height phenotype are observed and counted. The results, by phenotypic observation in the field, indicate that TL1 blooms around 50 days under long-day sunlight, while the flowering time for 2 Gmspa3a mutants (Gmspa 3a-1# and Gmspa3a-2# in FIG. 3), 2 Gmspa3b mutants (Gmspa 3b-1# and Gmspa3b-2# in FIG. 3) and 1 Gmspa3ab mutant (Gmspa 3ab-1# in FIG. 3) is 42-47 days. After the plant is matured, the plant height and the number of main stem nodes are counted, and the result shows that the plant height of TL1 is 80-88 cm; the plant height of Gmspa3a (Gmspa 3a-1# and Gmspa3a-2# in figure 3) is 47-53cm, the plant height of Gmspa3b (Gmspa 3b-1# and Gmspa3b-2# in figure 3) is 55-68cm, and the plant height of Gmspa3ab (Gmspa 3ab-1# in figure 3) is 38-44cm (figure 3). Soybean TL1 variety and 5 GmSPA3a/3b gene deletion mutants Gmspa3a-1#, Gmspa3a-2#, Gmspa3b-1#, Gmspa3b-2#, and,The Gmspa3ab-1# has very significant difference in flowering phase and plant height. Therefore, the loss of function of the GmSPA3a/3b protein can lead to obvious early flowering phase and obvious reduction of plant height of the soybeans.

2. Detection of soybean flowering related gene expression level

Wild soybean variety TL1 of the second three-leaf fully developed (sampled every 4 hours) and deletion mutant T of 3 GmSPA3a/3b genes obtained in the first example were extracted and cultured for 21 days under long-day conditions (12h light/8 h dark)2RNA of Gmspa3a-1#, Gmspa3b-1#, and Gmspa3ab-1# plants is subjected to fluorescent quantitative PCR detection on soybean flowering related genes (including GmFT2a, GmFT5a, GmFT4, E1, J and GmCCA 1). The qPCR detection primers are respectively as follows:

GmFT2a-qF/R:

5’-GGATTGCCAGTTGCTGCTGT-3’/5’-GAGTGTGGGAGATTGCCAAT-3’;

GmFT5a-qF/R:

5’-AGCCCGAACCCTTCAGTAGGGA-3’/5’-GGTGATGACAGTGTCTCTGCCCA-3’;

GmFT4-qF/R:

5’-TTTGCATGACAAACACTAACCA-3’/5’-TCTTAGAGGAAAAGGAAAACCAGA-3’;

E1-qF/R:

5’-CACTCAAATTAAGCCCTTTCA-3’/5’-CTTATTGTTCATCTCCTCTT-3’;

J-qF/R:

5’-GTCTACCCTGTTCATTTGC-3’/5’-CAGGAACCCTAAAGTCGT-3’;

GmCCA1-qF/R:

5’-CGAAGATCCTGACAGCAAGA-3’/5’-TTCCTTGTCCAAGCCCTATG-3’;

GmActin-qF/R:

5’-CGGTGGTTCTATCTTGGCATC-3’/5’-GTCTTTCGCTTCAATAACCCTA-3’;

GmActin is used as an internal reference gene. Fluorescent quantitative detection results show that compared with a wild soybean variety TL1 (WT in figure 4), flowering promoting factors GmFT2a, GmFT5a and GmCCA1 are obviously up-regulated in Gmspa3a mutants (Gmspa 3a-1#), Gmspa3b mutants (Gmspa 3b-1#) and Gmspa3ab mutants (Gmspa 3ab-1#) in figure 4), and the expression levels of flowering inhibiting factors GmFT4, E1 and J are obviously down-regulated, which indicates that GmSPA3ab influences the soybean flowering phase, can regulate the soybean maturation phase and realize the north lead of the south variety.

The present invention has been described in detail above. It will be apparent to those skilled in the art that the invention can be practiced in a wide range of equivalent parameters, concentrations, and conditions without departing from the spirit and scope of the invention and without undue experimentation. While the invention has been described with reference to specific embodiments, it will be appreciated that the invention can be further modified. In general, this application is intended to cover any variations, uses, or adaptations of the invention following, in general, the principles of the invention and including such departures from the present disclosure as come within known or customary practice within the art to which the invention pertains. The use of some of the essential features is possible within the scope of the claims attached below.

Sequence listing

<110> institute of crop science of Chinese academy of agricultural sciences

<120> application of soybean GmSPA3a/3b protein and related biological materials thereof in regulation and control of plant flowering and plant height

<130> GNCSQ211050

<160> 18

<170> PatentIn version 3.5

<210> 1

<211> 907

<212> PRT

<213> Soybean (Glycine max)

<400> 1

Met Cys Cys Phe Thr Trp Pro Thr Cys Asn Ser Ser Trp Val Lys Met

1 5 10 15

Glu Gly Ser Ser Gly Ser Ala Phe His Asn Ser Gly Ser Ser Arg Ala

20 25 30

Leu Asn Ser Ser Gly Val Ser Asp Arg Asn Gln Arg Val His Cys Pro

35 40 45

Gln Arg Asn Pro Phe Ser Gly Glu Ala Ser Gln Asp Ser Gly Phe Arg

50 55 60

Lys Glu Arg Asp Arg Val Leu Leu Ala Gln Gly Gly Gln Pro Lys Asn

65 70 75 80

Leu Gly Gly Gly Phe Ser Gly Leu Cys Glu Asp Glu Val Glu Val Asp

85 90 95

Pro Phe Phe Cys Ala Val Glu Trp Gly Asp Ile Ser Leu Arg Gln Trp

100 105 110

Leu Asp Lys Pro Glu Arg Ser Val Asp Ala Phe Glu Cys Leu His Ile

115 120 125

Phe Arg Gln Ile Val Glu Ile Val Ser Val Ala His Ser Gln Gly Val

130 135 140

Val Val His Asn Val Arg Pro Ser Cys Phe Val Met Ser Ser Phe Asn

145 150 155 160

His Ile Ser Phe Ile Glu Ser Ala Ser Cys Ser Asp Thr Gly Ser Asp

165 170 175

Ser Leu Gly Asp Gly Met Asn Asn Gln Gly Gly Glu Val Lys Thr Pro

180 185 190

Thr Ser Leu Cys Pro His Asp Met His Gln Gln Ser Leu Gly Ser Glu

195 200 205

Asp Phe Met Pro Ile Lys Thr Ser Thr Thr Pro Ala Arg Ser Asp Ser

210 215 220

Ser Cys Met Leu Ser Ser Ala Val Tyr Ala Ala Arg Ala Ser Leu Ile

225 230 235 240

Glu Glu Thr Glu Glu Asn Lys Met Lys Asp Arg Arg Lys Asp Glu Glu

245 250 255

Val Glu Gly Lys Lys Gln Ser Phe Pro Met Lys Gln Ile Leu Leu Met

260 265 270

Glu Met Ser Trp Tyr Thr Ser Pro Glu Glu Gly Ala Gly Glu Ser Ser

275 280 285

Ser Cys Ala Ser Asp Val Tyr Arg Leu Gly Val Leu Leu Phe Glu Leu

290 295 300

Phe Cys Pro Leu Ser Ser Arg Glu Glu Lys Ser Arg Thr Met Ser Ser

305 310 315 320

Leu Arg His Arg Val Leu Pro Pro Gln Leu Leu Leu Lys Trp Pro Lys

325 330 335

Glu Ala Ser Phe Cys Leu Trp Leu Leu His Pro Asp Pro Lys Ser Arg

340 345 350

Pro Thr Leu Gly Glu Leu Leu Gln Ser Glu Phe Leu Asn Glu Gln Arg

355 360 365

Asp Asp Thr Glu Glu Arg Glu Ala Ala Ile Glu Leu Arg Gln Arg Ile

370 375 380

Glu Asp Gln Glu Leu Leu Leu Glu Phe Leu Leu Leu Leu Gln Gln Arg

385 390 395 400

Lys Gln Glu Val Ala Glu Lys Leu Gln His Thr Val Ser Phe Leu Cys

405 410 415

Ser Asp Ile Glu Glu Val Thr Lys Gln His Val Arg Phe Lys Glu Ile

420 425 430

Thr Gly Ala Glu Leu Gly Ser Asp Glu Arg Ser Ala Ser Ser Phe Pro

435 440 445

Ser Met Thr Phe Val Asp Ser Glu Asp Ser Ala Phe Leu Gly Thr Arg

450 455 460

Lys Arg Val Arg Leu Gly Met Asp Val Lys Asn Ile Glu Glu Cys Asp

465 470 475 480

Asp Asp Val Gly Asp Asp Gln Lys Ser Asn Gly Ser Phe Leu Ser Lys

485 490 495

Ser Ser Arg Leu Met Lys Asn Phe Lys Lys Leu Glu Ser Ala Tyr Phe

500 505 510

Leu Thr Arg Cys Arg Pro Ala Tyr Ser Ser Gly Lys Leu Ala Val Arg

515 520 525

His Pro Pro Val Thr Ser Asp Gly Arg Gly Ser Val Val Val Thr Glu

530 535 540

Arg Ser Cys Ile Asn Asp Leu Lys Ser Lys Glu Gln Cys Arg Glu Gly

545 550 555 560

Ala Ser Ala Trp Ile Asn Pro Phe Leu Glu Gly Leu Cys Lys Tyr Leu

565 570 575

Ser Phe Ser Lys Leu Lys Val Lys Ala Asp Leu Lys Gln Gly Asp Leu

580 585 590

Leu His Ser Ser Asn Leu Val Cys Ser Leu Ser Phe Asp Arg Asp Gly

595 600 605

Glu Phe Phe Ala Thr Ala Gly Val Asn Lys Lys Ile Lys Val Phe Glu

610 615 620

Cys Asp Ser Ile Ile Asn Glu Asp Arg Asp Ile His Tyr Pro Val Val

625 630 635 640

Glu Met Ala Ser Arg Ser Lys Leu Ser Ser Ile Cys Trp Asn Thr Tyr

645 650 655

Ile Lys Ser Gln Ile Ala Ser Ser Asn Phe Glu Gly Val Val Gln Leu

660 665 670

Trp Asp Val Thr Arg Ser Gln Val Ile Ser Glu Met Arg Glu His Glu

675 680 685

Arg Arg Val Trp Ser Ile Asp Phe Ser Ser Ala Asp Pro Thr Met Leu

690 695 700

Ala Ser Gly Ser Asp Asp Gly Ser Val Lys Leu Trp Ser Ile Asn Gln

705 710 715 720

Gly Val Ser Val Gly Thr Ile Lys Thr Lys Ala Asn Val Cys Cys Val

725 730 735

Gln Phe Pro Leu Asp Ser Ala Arg Phe Leu Ala Phe Gly Ser Ala Asp

740 745 750

His Arg Ile Tyr Tyr Tyr Asp Leu Arg Asn Leu Lys Met Pro Leu Cys

755 760 765

Thr Leu Val Gly His Asn Lys Thr Val Ser Tyr Ile Lys Phe Val Asp

770 775 780

Thr Val Asn Leu Val Ser Ala Ser Thr Asp Asn Thr Leu Lys Leu Trp

785 790 795 800

Asp Leu Ser Thr Cys Ala Ser Arg Val Ile Asp Ser Pro Ile Gln Ser

805 810 815

Phe Thr Gly His Ala Asn Val Lys Asn Phe Val Gly Leu Ser Val Ser

820 825 830

Asp Gly Tyr Ile Ala Thr Gly Ser Glu Thr Asn Glu Val Phe Ile Tyr

835 840 845

His Lys Ala Phe Pro Met Pro Ala Leu Ser Phe Lys Phe Gln Asn Thr

850 855 860

Asp Pro Leu Ser Gly Asn Glu Val Asp Asp Ala Val Gln Phe Val Ser

865 870 875 880

Ser Val Cys Trp His Gly Gln Ser Ser Ser Thr Leu Leu Ala Ala Asn

885 890 895

Ser Thr Gly Asn Val Lys Ile Leu Glu Met Val

900 905

<210> 2

<211> 906

<212> PRT

<213> Soybean (Glycine max)

<400> 2

Met Cys Cys Cys Thr Trp Pro Thr Cys Asn Ser Ser Trp Met Lys Met

1 5 10 15

Glu Pro Ser Gly Ser Ala Phe Gln Asn Ser Gly Ser Ser Arg Ala Leu

20 25 30

Asn Ser Ser Gly Val Ser Asp Arg Asn Gln Arg Val His Cys Pro Gln

35 40 45

Arg Asn Pro Phe Leu Gly Glu Ala Ser Gln Asp Ser Gly Phe Arg Lys

50 55 60

Glu Arg Asp Arg Phe Leu Leu Ala Gln Gly Gly Gln Pro Lys Asn Leu

65 70 75 80

Gly Gly Gly Phe Ser Gly Leu Cys Glu Asp Glu Val Glu Val Asp Pro

85 90 95

Phe Phe Cys Ala Val Glu Trp Gly Asp Ile Ser Leu Arg Gln Trp Leu

100 105 110

Asp Lys Pro Glu Arg Ser Val Gly Ala Phe Glu Cys Leu His Ile Phe

115 120 125

Arg Gln Ile Val Glu Ile Val Ser Val Ala His Ser Gln Gly Val Val

130 135 140

Val His Asn Val Arg Pro Ser Cys Phe Val Met Ser Ser Phe Asn His

145 150 155 160

Ile Ser Phe Ile Glu Ser Ala Ser Cys Ser Asp Thr Gly Ser Asp Ser

165 170 175

Leu Gly Glu Gly Leu Asn Asn Gln Gly Gly Glu Val Lys Thr Pro Thr

180 185 190

Ser Leu Cys Pro His Asp Met Pro Gln Gln Ser Met Gly Ser Glu Asp

195 200 205

Phe Met Pro Val Lys Thr Leu Thr Thr Pro Ala Gln Ser Asp Ser Ser

210 215 220

Cys Met Leu Ser Ser Ala Val Tyr Ala Ala Arg Ala Ser Leu Ile Glu

225 230 235 240

Glu Thr Glu Glu Asn Lys Met Lys Asp Arg Arg Lys Asp Asp Glu Val

245 250 255

Glu Gly Lys Lys Gln Ser Phe Pro Met Lys Gln Ile Leu Leu Met Glu

260 265 270

Met Ser Trp Tyr Thr Ser Pro Glu Glu Gly Ala Gly Glu Ser Ser Ser

275 280 285

Cys Ala Ser Asp Val Tyr Arg Leu Gly Val Leu Leu Phe Glu Leu Phe

290 295 300

Cys Pro Leu Ser Ser Arg Glu Glu Lys Ser Arg Thr Met Ser Ser Leu

305 310 315 320

Arg His Arg Val Leu Pro Pro Gln Leu Leu Leu Lys Trp Pro Lys Glu

325 330 335

Ala Ser Phe Cys Leu Trp Leu Leu His Pro Asp Pro Ser Gly Arg Pro

340 345 350

Thr Leu Gly Arg Glu Leu Leu Gln Ser Asp Phe Leu Asn Glu Gln Arg

355 360 365

Asp Asp Met Glu Glu Arg Glu Ala Ala Ile Glu Leu Arg Gln Arg Ile

370 375 380

Asp Asp Gln Glu Leu Leu Leu Glu Phe Leu Leu Leu Leu Gln Gln Arg

385 390 395 400

Lys Gln Glu Val Ala Glu Lys Leu Gln His Thr Val Ser Phe Leu Cys

405 410 415

Ser Asp Ile Glu Glu Val Thr Lys Gln His Val Arg Phe Lys Glu Ile

420 425 430

Thr Gly Ala Glu Leu Gly Ser Asp Glu His Ser Ala Ser Ser Phe Pro

435 440 445

Ser Met Thr Val Val Asp Ser Glu Gly Ser Ala Phe Leu Gly Thr Arg

450 455 460

Lys Arg Val Arg Leu Gly Met Asp Val Lys Asn Ile Glu Glu Cys Val

465 470 475 480

Asp Asp Val Gly Asp Asp Gln Lys Ser Asn Gly Ser Phe Leu Ser Lys

485 490 495

Ser Ser Arg Leu Met Lys Asn Phe Lys Lys Leu Glu Ser Ala Tyr Phe

500 505 510

Leu Thr Arg Cys Arg Pro Ala Tyr Ser Ser Gly Lys Leu Ala Val Arg

515 520 525

His Pro Pro Val Thr Ser Asp Gly Arg Gly Ser Val Val Met Thr Glu

530 535 540

Arg Ser Cys Ile Asn Asp Leu Lys Ser Lys Glu Gln Cys Arg Glu Gly

545 550 555 560

Ala Ser Ala Trp Ile Asn Pro Phe Leu Glu Gly Leu Cys Lys Tyr Leu

565 570 575

Ser Phe Ser Lys Leu Lys Val Lys Ala Asp Leu Lys Gln Gly Asp Leu

580 585 590

Leu His Ser Ser Asn Leu Val Cys Ser Leu Ser Phe Asp Arg Asp Gly

595 600 605

Glu Phe Phe Ala Thr Ala Gly Val Asn Lys Lys Ile Lys Val Phe Glu

610 615 620

Cys Asp Ser Ile Ile Asn Glu Asp Arg Asp Ile His Tyr Pro Val Val

625 630 635 640

Glu Met Ala Ser Arg Ser Lys Leu Ser Ser Ile Cys Trp Asn Thr Tyr

645 650 655

Ile Lys Ser Gln Ile Ala Ser Ser Asn Phe Glu Gly Val Val Gln Leu

660 665 670

Trp Asp Val Thr Arg Ser Gln Val Ile Ser Glu Met Arg Glu His Glu

675 680 685

Arg Arg Val Trp Ser Ile Asp Phe Ser Ser Ala Asp Pro Thr Met Leu

690 695 700

Ala Ser Gly Ser Asp Asp Gly Ser Val Lys Leu Trp Ser Ile Asn Gln

705 710 715 720

Gly Val Ser Val Gly Thr Ile Lys Thr Lys Ala Asn Val Cys Cys Val

725 730 735

Gln Phe Pro Leu Asp Ser Ala Arg Phe Leu Ala Phe Gly Ser Ala Asp

740 745 750

His Arg Ile Tyr Tyr Tyr Asp Leu Arg Asn Leu Lys Met Pro Leu Cys

755 760 765

Thr Leu Val Gly His Asn Lys Thr Val Ser Tyr Ile Lys Phe Val Asp

770 775 780

Thr Val Asn Leu Val Ser Ala Ser Thr Asp Asn Thr Leu Lys Leu Trp

785 790 795 800

Asp Leu Ser Thr Cys Ala Ser Arg Val Ile Asp Ser Pro Ile Gln Ser

805 810 815

Phe Thr Gly His Ala Asn Val Lys Asn Phe Val Gly Leu Ser Val Ser

820 825 830

Asp Gly Tyr Ile Ala Thr Gly Ser Glu Thr Asn Glu Val Phe Ile Tyr

835 840 845

His Lys Ala Phe Ser Met Pro Ala Leu Ser Phe Lys Phe Gln Asn Thr

850 855 860

Asp Pro Leu Ser Gly Asn Glu Val Asp Asp Ala Ala Gln Phe Val Ser

865 870 875 880

Ser Val Cys Trp Arg Gly Gln Ser Ser Thr Leu Leu Ala Ala Asn Ser

885 890 895

Thr Gly Asn Val Lys Ile Leu Glu Met Val

900 905

<210> 3

<211> 2724

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<400> 3

atgtgttgtt ttacttggcc tacatgcaat tctagctggg tgaagatgga gggttcttct 60

gggtctgctt ttcacaattc tggcagttct agggccttga atagttccgg agtctcagat 120

aggaatcaaa gggttcattg tcctcaaagg aacccctttt cgggtgaggc atcacaggat 180

tcggggttta gaaaggaaag ggatagggtt ctgttggctc aaggtggtca gcctaaaaat 240

ttgggtggcg ggttttcggg gttgtgtgag gatgaggtgg aggttgaccc ctttttctgt 300

gctgtagaat ggggtgatat tagcttgagg caatggttgg ataaacctga acgatcggtg 360

gatgcctttg aatgcttgca catatttagg caaatagtag agatcgttag tgtagcacat 420

tctcaaggag ttgtagttca caatgtgagg ccttcctgct tcgtcatgtc atctttcaac 480

catatctcgt ttattgaatc agcatcttgt tcagatactg gatcggattc tttaggagat 540

ggaatgaaca accaaggcgg tgaggttaaa actccaacat ctctctgtcc ccatgatatg 600

catcagcaga gtttgggaag tgaagatttt atgcccatca agacttcaac cacccctgct 660

cggtcagatt ctagttgcat gctgtcgagt gccgtgtatg cagctcgtgc atcattgata 720

gaagaaacag aagaaaataa aatgaaagat aggagaaagg atgaagaagt agaaggaaag 780

aagcaatcat ttccaatgaa acagatacta ctaatggaga tgagttggta cactagtcct 840

gaagagggtg ctggtgaatc tagttcttgt gcttcagacg tttatcgatt gggggttctc 900

ctttttgagc tattttgtcc gctcagctca agagaagaaa agagtagaac catgtctagc 960

cttagacaca gagttcttcc tccacagtta cttctaaagt ggcctaaaga agcttcattt 1020

tgcttatggt tactgcatcc tgaccctaag agtcgtccaa cacttgggga gttgttgcag 1080

agcgagttcc ttaatgaaca gagagatgat acggaagaac gtgaagcagc gatagagctg 1140

agacaaagga tagaggatca ggagttgttg ttagagttcc ttttgttact tcaacagaga 1200

aaacaggaag ttgctgagaa gttgcaacat actgtctctt ttctgtgttc agatattgaa 1260

gaagtgacca agcagcacgt tagatttaaa gagattactg gtgctgaact ggggagtgat 1320

gagcgttcag catcaagttt cccatccatg acatttgttg atagtgagga ttctgctttt 1380

ctagggacta gaaaacgagt cagactaggg atggatgtta aaaacattga ggaatgtgac 1440

gatgatgtag gggatgatca gaaaagtaac ggaagttttc tttcaaaaag ttcacgacta 1500

atgaagaact ttaagaaact tgagtcagct tactttttaa cacgatgtag accagcctat 1560

tcctctggga aactggcggt tagacatcca cctgtaacaa gtgatggtag agggtctgtt 1620

gtcgtgactg aaagaagttg catcaatgac ttgaaatcta aagagcagtg cagggagggt 1680

gcaagtgctt ggataaatcc ttttcttgag ggtttgtgca agtatttatc attcagtaag 1740

ctaaaggtta aggctgacct aaagcaagga gatcttttgc attcttccaa cctagtatgc 1800

tcactcagct ttgatcgtga tggagaattt ttcgctaccg cgggtgtgaa taagaaaatt 1860

aaagtgtttg aatgtgattc aataataaat gaggatcgtg atatccacta tccagttgtg 1920

gagatggcta gcaggtcaaa gttaagcagt atatgttgga atacatacat caaaagtcaa 1980

attgcttcaa gcaactttga aggtgttgta cagttatggg atgtgacaag aagtcaagta 2040

atctctgaaa tgagggagca cgagcggcgg gtgtggtcca ttgatttctc atcagcagac 2100

ccaacaatgt tggcaagtgg gagcgatgac ggttctgtca agctatggag tatcaatcag 2160

ggagttagtg ttggtaccat caaaaccaag gcaaatgttt gctgcgttca gttccccctg 2220

gattctgctc gtttccttgc ttttggttca gcagatcacc gaatatatta ctatgatctt 2280

cgcaacctta aaatgccact ttgtacttta gttggacata acaagactgt gagctacatc 2340

aagtttgtag acactgtgaa ccttgtctct gcttccacag ataacacttt gaagctttgg 2400

gatttgtcta catgtgcatc tcgagttata gactcaccga ttcaatcatt cactggtcac 2460

gcgaatgtta agaactttgt ggggttatca gtatctgatg gttacattgc cactggttca 2520

gagacaaatg aggtgttcat ataccacaag gccttcccca tgccagcatt gtcattcaag 2580

tttcagaaca cagaccctct ttctggcaac gaagtggacg atgctgtgca gttcgtgtcc 2640

tcagtctgtt ggcacggcca gtcatcgtcc accttgctcg ctgcaaattc cacagggaat 2700

gtcaaaattc tggagatggt ttaa 2724

<210> 4

<211> 2721

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<400> 4

atgtgttgtt gtacttggcc tacatgcaac tctagctgga tgaagatgga gccttctggg 60

tctgcttttc agaattctgg cagttctagg gccttgaaca gttctggagt ctcagatagg 120

aaccaaaggg ttcattgtcc tcaaaggaac cccttcttgg gtgaggcatc ccaggattcg 180

gggttcagaa aggaaaggga taggtttctg ttggctcaag gtggtcagcc taagaacttg 240

ggcggtggtt tttcggggtt gtgtgaggac gaggtggagg ttgacccctt tttctgtgct 300

gtagaatggg gtgatattag cttgaggcaa tggttggata aacctgaacg atcggtgggt 360

gccttcgaat gcttgcacat atttaggcaa atagtagaga ttgtcagtgt agcgcattct 420

caaggagttg tagttcacaa tgtgaggcct tcctgctttg tcatgtcctc tttcaaccat 480

atctcattta ttgaatcagc atcttgttca gatactggat cggattcttt gggagaagga 540

ttaaacaacc aaggcggtga ggttaaaact ccaacatctc tctgtcccca tgatatgcct 600

cagcagagta tgggaagtga agattttatg cctgtcaaga ctttaaccac ccctgctcag 660

tcagattcta gttgcatgtt gtcgagtgcc gtgtatgcgg ctcgtgcatc attgatagaa 720

gaaacagaag aaaataaaat gaaagatagg agaaaggatg acgaagtaga aggaaagaag 780

caatcatttc caatgaaaca gatactacta atggagatga gttggtacac tagtcctgaa 840

gagggtgctg gtgaatctag ttcttgtgct tcagacgttt atcgattggg ggttctcctt 900

tttgagctat tttgtccgct cagctcaaga gaagaaaaga gtagaaccat gtctagcctt 960

agacacagag ttcttcctcc acagttactt ctaaagtggc ctaaagaagc ttctttttgt 1020

ttatggttac tgcatcctga ccctagtggt cgtccaacac tcggcaggga gttgttgcag 1080

agcgacttcc ttaatgaaca gagagatgat atggaagaac gtgaagcagc gatagagctg 1140

agacaaagga tagatgatca ggagttgctg ttagaattcc ttttattact tcaacagaga 1200

aaacaggaag ttgctgagaa gttgcaacat actgtctctt ttctgtgttc agatattgaa 1260

gaagtgacca agcagcacgt tagatttaaa gagattactg gtgctgaact ggggagtgat 1320

gagcattcag catcaagttt cccatccatg acagttgttg atagtgaggg ttctgctttt 1380

ctagggacta gaaaacgagt cagactaggg atggatgtta aaaacattga ggaatgtgtc 1440

gatgatgtag gggatgatca gaaaagtaac ggaagttttc tttcaaaaag ttcacggcta 1500

atgaagaact ttaagaaact tgagtcagca tactttttaa cacgatgtag accagcctat 1560

tcctctggga aactggcggt tagacatcca cctgtaacaa gtgatggtag agggtctgtt 1620

gtcatgactg aaagaagttg catcaatgac ttgaaatcta aagagcagtg cagggagggt 1680

gcgagtgctt ggataaatcc ttttcttgag ggtttgtgca agtatttatc attcagtaag 1740

ctaaaggtta aggctgacct aaagcaagga gatcttttgc attcttccaa cctagtatgc 1800

tcactcagct ttgatcgtga tggagagttt ttcgctactg ccggtgtgaa taagaaaatt 1860

aaagtgtttg aatgtgattc aataataaat gaggatcgtg atatccacta tccagttgtg 1920

gagatggcta gcaggtcaaa gttaagcagt atatgttgga atacatatat caaaagtcaa 1980

attgcttcaa gcaactttga aggtgttgta cagttatggg atgtaacaag aagtcaagta 2040

atctctgaaa tgagggagca tgagcggcgg gtgtggtcaa ttgatttctc atcagcagac 2100

ccaacaatgt tagcaagtgg gagcgatgac ggttctgtca agctatggag tatcaatcag 2160

ggagttagtg ttggtaccat caaaaccaag gcaaatgttt gctgcgttca gttccctctg 2220

gattctgctc gttttcttgc ttttggctca gcagatcacc gaatatatta ctatgatcta 2280

cgcaacctaa aaatgccgct ttgtacttta gttggacata acaagactgt gagctacatc 2340

aagtttgtag acactgtgaa ccttgtctct gcttccacag ataacacttt gaagctttgg 2400

gatttgtcta catgtgcatc tcgagttata gactcaccga ttcaatcatt cacgggtcat 2460

gcgaatgtta agaactttgt ggggttatca gtatctgatg gttacattgc cactggttca 2520

gagacaaatg aggtgttcat ataccacaaa gccttctcca tgccagcatt gtcattcaag 2580

tttcagaaca cagaccctct ttccggtaat gaagtggacg acgctgcgca gtttgtgtcc 2640

tcggtttgct ggcgcggcca gtcgtccacc ttgctcgctg caaattccac agggaatgtc 2700

aaaattctgg agatggttta a 2721

<210> 5

<211> 55

<212> PRT

<213> Artificial Sequence (Artificial Sequence)

<400> 5

Met Cys Cys Phe Thr Trp Pro Thr Cys Asn Ser Ser Trp Val Lys Met

1 5 10 15

Glu Gly Ser Ser Gly Ser Ala Phe His Asn Ser Gly Ser Ser Arg Ala

20 25 30

Leu Asn Ser Ser Gly Val Ser Asp Arg Asn Gln Arg Val His Cys Pro

35 40 45

Gln Arg Asn Pro Phe Phe Gly

50 55

<210> 6

<211> 53

<212> PRT

<213> Artificial Sequence (Artificial Sequence)

<400> 6

Met Cys Cys Phe Thr Trp Pro Thr Cys Asn Ser Ser Trp Val Lys Met

1 5 10 15

Glu Gly Ser Ser Gly Ser Ala Phe His Asn Ser Gly Ser Ser Arg Ala

20 25 30

Leu Asn Ser Ser Gly Val Ser Asp Arg Asn Gln Arg Val His Cys Pro

35 40 45

Gln Arg Asn Pro Leu

50

<210> 7

<211> 163

<212> PRT

<213> Artificial Sequence (Artificial Sequence)

<400> 7

Met Cys Cys Cys Thr Trp Pro Thr Cys Asn Ser Ser Trp Met Lys Met

1 5 10 15

Glu Pro Ser Gly Ser Ala Phe Gln Asn Ser Gly Ser Ser Arg Ala Leu

20 25 30

Asn Ser Ser Gly Val Ser Asp Arg Asn Gln Arg Val His Cys Pro Gln

35 40 45

Arg Asn Pro Phe Leu Gly Glu Ala Ser Gln Asp Ser Gly Phe Arg Lys

50 55 60

Glu Arg Asp Arg Phe Leu Leu Ala Gln Gly Gly Gln Pro Lys Asn Leu

65 70 75 80

Gly Gly Gly Phe Ser Gly Leu Cys Glu Asp Glu Val Glu Val Asp Pro

85 90 95

Phe Phe Cys Ala Val Glu Trp Gly Asp Ile Ser Leu Arg Gln Trp Leu

100 105 110

Asp Lys Pro Glu Arg Ser Val Gly Ala Phe Glu Cys Leu His Ile Phe

115 120 125

Arg Gln Ile Val Glu Ile Val Ser Val Ala His Ser Arg Ser Cys Ser

130 135 140

Ser Gln Cys Glu Ala Phe Leu Leu Cys His Val Leu Phe Gln Pro Tyr

145 150 155 160

Leu Ile Tyr

<210> 8

<211> 164

<212> PRT

<213> Artificial Sequence (Artificial Sequence)

<400> 8

Met Cys Cys Cys Thr Trp Pro Thr Cys Asn Ser Ser Trp Met Lys Met

1 5 10 15

Glu Pro Ser Gly Ser Ala Phe Gln Asn Ser Gly Ser Ser Arg Ala Leu

20 25 30

Asn Ser Ser Gly Val Ser Asp Arg Asn Gln Arg Val His Cys Pro Gln

35 40 45

Arg Asn Pro Phe Leu Gly Glu Ala Ser Gln Asp Ser Gly Phe Arg Lys

50 55 60

Glu Arg Asp Arg Phe Leu Leu Ala Gln Gly Gly Gln Pro Lys Asn Leu

65 70 75 80

Gly Gly Gly Phe Ser Gly Leu Cys Glu Asp Glu Val Glu Val Asp Pro

85 90 95

Phe Phe Cys Ala Val Glu Trp Gly Asp Ile Ser Leu Arg Gln Trp Leu

100 105 110

Asp Lys Pro Glu Arg Ser Val Gly Ala Phe Glu Cys Leu His Ile Phe

115 120 125

Arg Gln Ile Val Glu Ile Val Ser Val Ala His Phe Ser Arg Ser Cys

130 135 140

Ser Ser Gln Cys Glu Ala Phe Leu Leu Cys His Val Leu Phe Gln Pro

145 150 155 160

Tyr Leu Ile Tyr

<210> 9

<211> 54

<212> PRT

<213> Artificial Sequence (Artificial Sequence)

<400> 9

Met Cys Cys Phe Thr Trp Pro Thr Cys Asn Ser Ser Trp Val Lys Met

1 5 10 15

Glu Gly Ser Ser Gly Ser Ala Phe His Asn Ser Gly Ser Ser Arg Ala

20 25 30

Leu Asn Ser Ser Gly Val Ser Asp Arg Asn Gln Arg Val His Cys Ser

35 40 45

Lys Glu Pro Leu Phe Gly

50

<210> 10

<211> 53

<212> PRT

<213> Artificial Sequence (Artificial Sequence)

<400> 10

Met Cys Cys Cys Thr Trp Pro Thr Cys Asn Ser Ser Trp Met Lys Met

1 5 10 15

Glu Pro Ser Gly Ser Ala Phe Gln Asn Ser Gly Ser Ser Arg Ala Leu

20 25 30

Asn Ser Ser Gly Val Ser Asp Arg Asn Gln Arg Val His Cys Ser Lys

35 40 45

Glu Pro Leu Leu Gly

50

<210> 11

<211> 443

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<400> 11

tagatcggaa gcttaggcct aaaataaatg gtaaaatgtc aaatcaaaac taggctgcag 60

tatgcagagc agagtcatga tgatactact tactacaccg attcttgtgt gcagaaaaat 120

atgttaaaat aattgaatct ttctctagcc aaatttgaca acaatgtaca ccgttcatat 180

tgagagacga tgcttcttgt ttgctttcgg tggaagctgc atatactcaa cattactcct 240

tcagcgagtt ttccaactga gtcccacatt gcccagacct aacacggtat tcttgtttat 300

aatgaaatgt gccaccacat ggattgtttc gggtgaggca tcacgtttta gagctagaaa 360

tagcaagtta aaataaggct agtccgttat caacttgaaa aagtggcacc gagtcggtgc 420

ttttttttct agaacgcgtt gcc 443

<210> 12

<211> 443

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<400> 12

tagatcggaa gcttaggcct aaaataaatg gtaaaatgtc aaatcaaaac taggctgcag 60

tatgcagagc agagtcatga tgatactact tactacaccg attcttgtgt gcagaaaaat 120

atgttaaaat aattgaatct ttctctagcc aaatttgaca acaatgtaca ccgttcatat 180

tgagagacga tgcttcttgt ttgctttcgg tggaagctgc atatactcaa cattactcct 240

tcagcgagtt ttccaactga gtcccacatt gcccagacct aacacggtat tcttgtttat 300

aatgaaatgt gccaccacat ggattgaggg ttcattgtcc tcaagtttta gagctagaaa 360

tagcaagtta aaataaggct agtccgttat caacttgaaa aagtggcacc gagtcggtgc 420

ttttttttct agaacgcgtt gcc 443

<210> 13

<211> 443

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<400> 13

tagatcggaa gcttaggcct aaaataaatg gtaaaatgtc aaatcaaaac taggctgcag 60

tatgcagagc agagtcatga tgatactact tactacaccg attcttgtgt gcagaaaaat 120

atgttaaaat aattgaatct ttctctagcc aaatttgaca acaatgtaca ccgttcatat 180

tgagagacga tgcttcttgt ttgctttcgg tggaagctgc atatactcaa cattactcct 240

tcagcgagtt ttccaactga gtcccacatt gcccagacct aacacggtat tcttgtttat 300

aatgaaatgt gccaccacat ggattgaggg ttcattgtcc tcaagtttta gagctagaaa 360

tagcaagtta aaataaggct agtccgttat caacttgaaa aagtggcacc gagtcggtgc 420

ttttttttct agaacgcgtt gcc 443

<210> 14

<211> 989

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<400> 14

acagataaag ccacgcacat ttaggatatt ggccgagatt actgaatatt gagtaagatc 60

acggaatttc tgacaggagc atgtcttcaa ttcagcccaa atggcagttg aaatactcaa 120

accgccccat atgcaggagc ggatcattca ttgtttgttt ggttgccttt gccaacatgg 180

gagtccaaga ttctgcagtc aaatctcggt gacgggcagg accggacggg gcggtaccgg 240

caggctgaag tccagctgcc agaaacccac gtcatgccag ttcccgtgct tgaagccggc 300

cgcccgcagc atgccgcggg gggcatatcc gagcgcctcg tgcatgcgca cgctcgggtc 360

gttgggcagc ccgatgacag cgaccacgct cttgaagccc tgtgcctcca gggacttcag 420

caggtgggtg tagagcgtgg agcccagtcc cgtccgctgg tggcgggggg agacgtacac 480

ggtcgactcg gccgtccagt cgtaggcgtt gcgtgccttc caggggcccg cgtaggcgat 540

gccggcgacc tcgccgtcca cctcggcgac gagccaggga tagcgctccc gcagacggac 600

gaggtcgtcc gtccactcct gcggttcctg cggctcggta cggaagttga ccgtgcttgt 660

ctcgatgtag tggttgacga tggtgcagac cgccggcatg tccgcctcgg tggcacggcg 720

gatgtcggcc gggcgtcgtt ctgggctcat cgattcgatt tggtgtatcg agattggtta 780

tgaaattcag atgctagtgt aatgtattgg taatttggga agatataata ggaagcaagg 840

ctatttatcc atttctgaaa aggcgaaatg gcgtcaccgc gagcgtcacg cgcattccgt 900

tcttgctgta aagcgttgtt tggtacactt ttgactagcg aggcttggcg tgtcagcgta 960

tctattcaaa agtcgttaat ggctgcgga 989

<210> 15

<211> 601

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<400> 15

cagctcgtcc aaacctacaa tcagctcttt gaggaaaacc caattaatgc ttcaggcgtc 60

gacgccaagg cgatcctgtc tgcacgcctt tcaaagtctc gccggcttga gaacttgatc 120

gctcaactcc cgggcgaaaa gaagaacggc ttgttcggga atctcattgc actttcgttg 180

gggctcacac caaacttcaa gagtaatttt gatctcgctg aggacgcaaa gctgcagctt 240

tccaaggaca cttatgacga tgacctggat aaccttttgg cccaaatcgg cgatcagtac 300

gcggacttgt tcctcgccgc gaagaatttg tcggacgcga tcctcctgag tgatattctc 360

cgcgtgaaca ccgagattac aaaggccccg ctctcggcga gtatgatcaa gcgctatgac 420

gagcaccatc aggatctgac ccttttgaag gctttggtcc ggcagcaact cccagagaag 480

tacaaggaaa tcttctttga tcaatccaag aacggctacg ctggttatat tgacggcggg 540

gcatcgcagg aggaattcta caagtttatc aagccaattc tggagaagat ggatggcaca 600

g 601

<210> 16

<211> 566

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<400> 16

acatttaata cgcgatagaa aacaaaatat agcgcgcaaa ctaggataaa ttatcgcgcg 60

cggtgtcatc tatgttacta gatcggaagc ttaggcctaa aataaatggt aaaatgtcaa 120

atcaaaacta ggctgcagta tgcagagcag agtcatgatg atactactta ctacaccgat 180

tcttgtgtgc agaaaaatat gttaaaataa ttgaatcttt ctctagccaa atttgacaac 240

aatgtacacc gttcatattg agagacgatg cttcttgttt gctttcggtg gaagctgcat 300

atactcaaca ttactccttc agcgagtttt ccaactgagt cccacattgc ccagacctaa 360

cacggtattc ttgtttataa tgaaatgtgc caccacatgg attgtttcgg gtgaggcatc 420

acgtcagtgt agcgcattct cagagggttc attgtcctca agttttagag ctagaaatag 480

caagttaaaa taaggctagt ccgttatcaa cttgaaaaag tggcaccgag tcggtgcttt 540

tttttctaga acgcgttgcc gagcac 566

<210> 17

<211> 619

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<400> 17

tgattgatat tggagtctgt gctttttctt gagttaatga aatgagtttg agttataacg 60

gtgatggtgg aggtgcaagg gtgtgaggca agttggagga ggaccacagt ttttggatgt 120

gttgttttac ttggcctaca tgcaattcta gctgggtgaa gatggagggt tcttctgggt 180

ctgcttttca caattctggc agttctaggg ccttgaatag ttccggagtc tcagatagga 240

atcaaagggt tcattgtcct caaaggaacc ccttttcggg tgaggcatca caggattcgg 300

ggtttagaaa ggaaagggat agggttctgt tggctcaagg tggtcagcct aaaaatttgg 360

gtggcgggtt ttcggggttg tgtgaggatg aggtggaggt tgaccccttt ttctgtgctg 420

tagaatgggg tgatattagc ttgaggcaat ggttggataa acctgaacga tcggtggatg 480

cctttgaatg cttgcacata tttaggcaaa tagtagagat cgttagtgta gcacattctc 540

aaggagttgt agttcacaat gtgaggcctt cctgcttcgt catgtcatct ttcaaccata 600

tctcgtttat tgaatcagc 619

<210> 18

<211> 686

<212> DNA

<213> Artificial Sequence (Artificial Sequence)

<400> 18

aatggtgatg gtggaggtgc tagagtgtga ggcaagttgg aggagggcca tagtttatgg 60

atgtgttgtt gtacttggcc tacatgcaac tctagctgga tgaagatgga gccttctggg 120

tctgcttttc agaattctgg cagttctagg gccttgaaca gttctggagt ctcagatagg 180

aaccaaaggg ttcattgtcc tcaaaggaac cccttcttgg gtgaggcatc ccaggattcg 240

gggttcagaa aggaaaggga taggtttctg ttggctcaag gtggtcagcc taagaacttg 300

ggcggtggtt tttcggggtt gtgtgaggac gaggtggagg ttgacccctt tttctgtgct 360

gtagaatggg gtgatattag cttgaggcaa tggttggata aacctgaacg atcggtgggt 420

gccttcgaat gcttgcacat atttaggcaa atagtagaga ttgtcagtgt agcgcattct 480

caaggagttg tagttcacaa tgtgaggcct tcctgctttg tcatgtcctc tttcaaccat 540

atctcattta ttgaatcagc atcttgttca gatactggat cggattcttt gggagaagga 600

ttaaacaacc aaggcggtga ggttaaaact ccaacatctc tctgtcccca tgatatgcct 660

cagcagagta tgggaagtga agattt 686

35页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:一种大刺鳅抗菌肽Hepcidin-like及其制备方法

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!