Method for modifying and regulating protein expression based on N-terminal coding sequence

文档序号:939875 发布日期:2020-10-30 浏览:5次 中文

阅读说明:本技术 一种基于n端编码序列改造调控蛋白质表达的方法 (Method for modifying and regulating protein expression based on N-terminal coding sequence ) 是由 刘松 徐奎栋 李江华 陈坚 周景文 于 2020-07-29 设计创作,主要内容包括:本发明公开了一种基于N端编码序列改造调控蛋白质表达的方法,属于基因工程及酶工程技术领域。本发明是以枯草芽孢杆菌为表达宿主,通过预测模型,评价N端编码区同义突变中,最有利于促进基因表达的核苷酸序列。通过结合超折叠绿色荧光蛋白(sfGFP)的NCS的前十个氨基酸同义突变文库,测定文库中蛋白的荧光强度,选择172个代表性样本并测序鉴定,使用统计学方法建立预测模型。通过该模型优化融合了BlgS信号肽的普鲁兰酶,可使普鲁兰酶胞外酶活提高至改造前的2.67倍以及降低48%,从而为N端基因的从头设计提供理性改造的方向,有利于简易地调控基因的表达。(The invention discloses a method for modifying and regulating protein expression based on an N-terminal coding sequence, belonging to the technical field of genetic engineering and enzyme engineering. The invention uses bacillus subtilis as an expression host, and evaluates the nucleotide sequence which is most beneficial to promoting gene expression in the synonymous mutation of an N-terminal coding region through a prediction model. Fluorescence intensity of proteins in the library was determined by combining the first ten amino acid-synonymous mutation libraries of NCS of superfolder green fluorescent protein (sfGFP), selecting 172 representative samples and sequencing for identification, and using statistical methods to build a predictive model. The pullulanase fused with the BlgS signal peptide is optimized through the model, so that the extracellular enzyme activity of the pullulanase is improved to 2.67 times before modification and reduced by 48%, a rational modification direction is provided for de novo design of an N-terminal gene, and the expression of the gene is easily regulated.)

1. A method for screening a nucleotide sequence encoding a protein, characterized in that values of GC3 and Δ G are determined, and the relative expression level of the protein, i.e., P, is calculated using the following equationsfGFPA value; psfGFPThe value is positively correlated with the actual expression level of the protein:

PsfGFP=274497.657-108717.401×GC3+4886.529×ΔG;

the third base of the synonymous codon of the GC3, which is 9-10 amino acids before the N-terminal coding region of the target gene is close to the ATG, is the content of GC; the delta G is the minimum free energy of the mRNA secondary structure between any promoter transcription start site of the target gene and the 90-99 bp region of the N-terminal coding region.

2. The method of claim 1, wherein the protein is any protein that can be expressed in Bacillus subtilis.

3. The method of claim 1, wherein the method is based on PsfGFPThe corresponding nucleotide sequence was screened.

4. A method for regulating and controlling protein expression quantity of genetic engineering bacteria is characterized in that 27-30 nucleotides in front of an N-terminal coding region of a target protein are taken to establish a synonymous mutation library; calculating parameters GC3 and delta G of genes in the synonymous mutation library, calculating the relative expression quantity of each nucleotide sequence according to an equation, selecting the nucleotide sequence with the required expression quantity, carrying out corresponding mutation on the N-terminal coding region of the target protein, and transforming the target protein into a host cell;

the equation is: psfGFP=274497.657-108717.401×GC3+4886.529×ΔG;

The third base of the synonymous codon of the GC3, which is 9-10 amino acids before the N-terminal coding region of the target gene is close to the ATG, is the content of GC; the delta G is the minimum free energy of the mRNA secondary structure from any promoter transcription start site of the target gene to the 90-99 bp region of the N-terminal coding region;

when the protein expression quantity needs to be up-regulated, P in the mutation library is selectedsfGFPThe value is at the top 10%;

when the protein expression quantity needs to be reduced, selecting P in the mutation librarysfGFPThe value is at the last 10%.

5. The method of claim 4, wherein the genetically engineered bacteria are Bacillus subtilis as a host.

6. The method according to claim 4, wherein the protein is any protein capable of being expressed in Bacillus subtilis.

7. A method for regulating and controlling pullulanase expression quantity is characterized in that the first 27-30 nucleotides of a pullulanase N-terminal coding region are selected for carrying out synonymous mutation, a mutant library is constructed, and P is calculatedsfGFPValue according to PsfGFPValue selection of the corresponding synonymous mutant sequence; carrying out corresponding mutation on an N-terminal coding region of the target protein, connecting the N-terminal coding region to an expression vector, and constructing a recombinant plasmid;

when the expression level of the pullulanase needs to be up-regulated, P in a mutation library is selectedsfGFPThe value is at the top 10%;

when the expression level of the pullulanase needs to be reduced, selecting P in a mutation librarysfGFPThe value is at the last 10%.

8. The method according to claim 7, wherein the recombinant plasmid is introduced into Bacillus subtilis to produce the protein using Bacillus subtilis.

9. The method according to claim 7, wherein the pullulanase NCBI accession number is AMQ 67157.

10. Use of the method of any one of claims 1 to 6 for regulating the expression level of a protein of interest.

Technical Field

The invention relates to a method for modifying and regulating protein expression based on an N-terminal coding sequence, belonging to the technical field of genetic engineering and enzyme engineering.

Background

The mutation of the gene has very important significance for changing the property of the protein, and a mutant sequence with better property can be found from the mutation, so that the application value of the protein is improved. The synonymous mutation of the gene is a commonly used mutation means, and the expression quantity of the synonymous mutation of the gene can be greatly different.

The current commonly used methods are as follows: the optimal mutant is found by constructing a synonymous mutation library and combining a high-throughput screening strategy. However, this method is time-consuming, labor-intensive, and highly specific, and cannot be used to guide the design of other genes. Although some researches find that a series of short peptides are synthesized to be beneficial to widely improving the expression of genes, the method can affect the enzyme activity, and the short peptides promoting the expression occupy the position of a signal peptide, so that the short peptides are not suitable for extracellular proteins needing the addition of the signal peptide.

The existing methods for improving the gene expression level are all optimized through an untranslated region (5 'UTR), however, when the 5' UTR module is strong enough, the continuous optimization is difficult and the expression level is obviously improved. While there has been less research on the N-terminal coding region (NCS). Therefore, it is important to establish an NCS modification strategy that is suitable for a wide range of gene designs.

Disclosure of Invention

The method of the present invention is established based on bioinformatics analysis of a representative sample, by which a nucleotide sequence of the first 30 bases of the N-terminus of an arbitrary gene can be de novo designed and subjected to synonymous mutation. In the embodiment of the present model, the modification of the NCS nucleotide sequence of any gene as the target nucleotide sequence is carried out by mutating the primer. The invention can be used for guiding the design of any gene by optimizing the nucleotide sequence of NCS, does not need to add additional amino acid sequence, and reduces the property of protein to the minimum. Can greatly improve the expression level of the target gene.

The invention provides a method for screening a nucleotide sequence for coding a protein with high expression level, which comprises the steps of measuring values of GC3 and delta G, and calculating the relative expression level of the protein by using the following equation, namely PsfGFPThe value:

PsfGFP=274497.657-108717.401×GC3+4886.529×ΔG。

in one embodiment of the invention, the GC3 is the content of GC in the third base of the synonymous codon of 9-10 amino acids before the N-terminal coding region of the target gene is the ATG; the delta G is the minimum free energy of the mRNA secondary structure between any promoter transcription start site of the target gene and the 90-99 bp region of the N-terminal coding region.

In one embodiment of the invention, the protein is any protein that can be expressed in bacillus subtilis.

In one embodiment of the invention, PsfGFPThe value is positively correlated with the actual expression level of the protein.

In one embodiment of the invention, according to PsfGFPThe corresponding nucleotide sequence was screened.

The invention provides a method for regulating and controlling the protein expression quantity of a genetic engineering bacterium, which comprises the steps of selecting 27-30 nucleotides in length of an N-terminal coding region of a target protein, and establishing a synonymous mutation library; calculating GC3 and delta G parameters of genes in the synonymous mutation library, calculating the relative expression quantity of each nucleotide sequence according to an equation, selecting the nucleotide sequence with the required expression quantity, carrying out corresponding mutation on the N-terminal coding region of the target protein, and transforming the target protein into a host cell;

the equation is: psfGFP=274497.657-108717.401×GC3+4886.529×ΔG。

In one embodiment of the invention, the third base of the GC3 is the GC content of the synonymous codon which is 9-10 amino acids before the N-terminal coding region of the target gene.

In one embodiment of the invention, Δ G is the minimum free energy of the secondary structure of mRNA from any promoter transcription initiation site of the target gene to the region of 90-99 bp of the N-terminal coding region;

in one embodiment of the invention, P in the mutation pool is selected when the protein expression level needs to be up-regulatedsfGFPThe value is at the top 10%; when the protein expression quantity needs to be reduced, selecting P in the mutation librarysfGFPThe value is at the last 10%.

In one embodiment of the present invention, the genetically engineered bacterium is a bacillus subtilis host.

In one embodiment of the invention, the protein is any protein that can be expressed in bacillus subtilis.

The invention provides a method for regulating and controlling pullulanase expression quantity, which comprises the steps of selecting the first 27-30 nucleotides of an N-terminal coding region of pullulanase, carrying out synonymous mutation, constructing a mutant library, and calculating PsfGFPValue according to PsfGFPValue selection of the corresponding synonymous mutant sequence; the N-terminal coding region of the target protein is mutated correspondingly and connected to an expression vector to construct a recombinant plasmid.

In one embodiment of the present invention, when the expression level of pullulanase needs to be up-regulated, P in the mutant library is selectedsfGFPThe value is at the top 10%.

In one embodiment of the invention, when the expression level of pullulanase needs to be down-regulated, P in the mutant library is selectedsfGFPThe value is at the last 10%.

In one embodiment of the present invention, the recombinant plasmid is introduced into Bacillus subtilis, and the Bacillus subtilis is used to produce a protein.

In one embodiment of the invention, the pullulanase NCBI accession number is AMQ 67157.

The invention also protects the application of the method for screening the nucleotide sequence for coding the protein with high expression level or the method for regulating the expression level of the protein of the genetic engineering bacteria in regulating the expression level of the target protein.

The invention also protects the application of the method for regulating the expression level of the pullulanase in regulating the pullulanase.

The invention has the beneficial effects that:

the invention researches a formula PsfGFP (274497.657-108717.401 XGC 3+4886.529 Xdelta G) for guiding protein to make directional modification so as to improve or reduce the expression of target protein by combining sfGFP and modifying an N-terminal coding region of the target gene (synonymous mutation). The calculated value of PsfGFP is positively correlated with the actual expression level of the protein, and according to the formula, the value of PsfGFP is calculated, namely, the corresponding synonymous mutant sequence is selected as required. The mutant is applied to the N end of pullulanase fused with nucleotide sequences of Bgls signal peptides, and the selected synonymous mutant sequence can enable the extracellular enzyme activity to be up-regulated by 2.67 times and down-regulated by 48%.

Drawings

FIG. 1 is a map of sfGFP expression plasmid P43-NMK-sfGFP.

FIG. 2 is a graph showing the relative fluorescence intensity of the NCS library of sfGFP.

FIG. 3 shows the nucleotide sequence index and fluorescence values of 172 samples.

FIG. 4 is a graph showing the relative fluorescence values before and after the modification.

FIG. 5 is a map of the pullulanase expression plasmid P43-NMK-Bgls fused with BglS signal peptide.

FIG. 6 is a protein gel diagram of 5 NCS variants of the BglS signal peptide.

FIG. 7 is a correlation diagram of the expression prediction value and the enzyme activity measurement value of pullulanase added with 5 Bgls signal peptide sequences.

Detailed Description

1. The culture medium comprises the following components:

seed medium (g/L): peptone 10, yeast extract 5, sodium chloride 5;

fermentation medium (g/L): the following components were dissolved in 0.9L of water: peptone 12g, yeast extract 24g, glycerin 4 mL.

Dissolving the components and then sterilizing under high pressure; cooling to 60 deg.C, adding 100mL of sterilized 0.17mol/L KH2PO40.72mol/L of K2HPO4Solution (2.31g KH)2PO4And 12.54g of K2HPO4Dissolved in sufficient water to give a final volume of 100 mL; filter sterilization with 0.22 μm filter membrane);

2. the culture method comprises the following steps:

seed culture: selecting a single colony of engineering bacteria, inoculating the single colony of the engineering bacteria into a seed culture medium, culturing at 37 ℃ and at a shaking table rotating speed of 200r/min for 24 hours;

fermentation culture: inoculating the seed culture solution into a fermentation culture medium according to the inoculum size of 4%, and fermenting at 37 deg.C for 24h

3. Green fluorescent protein expression level and biomass measurement

Fermentation broth diluted with PBS buffer (100mM and pH7.2) to an appropriate concentration was added to a 96-well plate, and the mixture was analyzed using a rotation 3 cell imaging microplate detector (beton instruments ltd., usa), green fluorescence excitation wavelength: 480nm, green fluorescence emission wavelength: 520nm, cell growth OD absorption wavelength: 600 nm.

The one-step cloning kit was purchased from biotechnology limited of nuozokenza, south kyo.

4. SDS-PAGE electrophoretic detection

With glue concentration of 10%

Figure BDA0002607281890000031

SDS-PAGE gel was used to analyze the expression level of the protein, using MES or MOPS buffer as the running buffer, and loading 10. mu.L. The electrophoretic voltage was 150V. The specific sample preparation and electrophoresis operations were performed according to the kit instructions. When the electrophoresis is carried out by using MES buffer, the molecular weights (kDa) of the standard proteins are respectively as follows: 188, 98, 62, 49, 38, 28, 17, 14, 6, and 3; when electrophoresis is performed by MOPS buffer, the molecular weights (kDa) of the standard proteins are respectively as follows: 191, 97, 64, 51, 39, 28, 19, 14

5. Pullulanase enzyme activity determination mode

1mL of 1g/100mL pullulan polysaccharide substrate and 0.9mL of 100mM acetic acid-sodium acetate buffer solution with pH of 4.5 are uniformly mixed, the mixture is placed in a water bath kettle at 60 ℃ for preheating for 10min, 0.1mL of pullulanase solution is added, after the reaction is carried out for 10min, 3mL of DNS color developing solution is added, then the mixture is boiled in a boiling water bath for 7min, the mixture is placed in ice water to stop the color developing reaction, 10mL of deionized water is added, the mixture is uniformly mixed, and the light absorption value is measured at 540 nm. The amount of enzyme that produces 1. mu. mol of reducing sugar per unit time is defined as one unit of enzyme activity.

19页详细技术资料下载
上一篇:一种医用注射器针头装配设备
下一篇:基于CRISPR技术的核酸检测的信号放大磁珠技术系统及其应用

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!